Query lcl|NC_019422.1_cdsid_YP_006990557.1 [gene=D864_gp03] [protein=phage portal protein] [protein_id=YP_006990557.1] [location=1687..2841] Match_columns 384 No_of_seqs 116 out of 1059 Neff 9.8 Searched_HMMs 1612 Date Thu Nov 7 18:01:49 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_3 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_3_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:102080 Length: 429 100.0 1.2E-92 7.3E-96 524.6 44.9 383 1-384 1-416 (429) 2 protein:vir:105002 Length: 432 100.0 1.4E-92 8.5E-96 524.2 45.2 383 1-384 1-419 (432) 3 protein:vir:107605 Length: 432 100.0 1.4E-92 8.5E-96 524.2 45.2 383 1-384 1-419 (432) 4 protein:vir:102855 Length: 432 100.0 1.4E-92 8.5E-96 524.2 45.2 383 1-384 1-419 (432) 5 protein:vir:81152 Length: 411 100.0 3.5E-92 2.2E-95 522.0 43.8 382 1-384 1-410 (411) 6 protein:vir:1380 Length: 422 # 100.0 7.5E-92 4.7E-95 520.2 43.7 380 1-384 1-410 (422) 7 protein:vir:102118 Length: 409 100.0 1.4E-91 8.4E-95 518.8 44.5 381 1-384 1-408 (409) 8 protein:vir:4509 Length: 424 # 100.0 1.9E-91 1.2E-94 517.9 44.4 382 1-384 16-412 (424) 9 protein:vir:100249 Length: 431 100.0 3.8E-90 2.4E-93 510.8 44.5 380 1-384 1-425 (431) 10 protein:vir:4454 Length: 414 # 100.0 9.3E-90 5.8E-93 508.7 44.4 380 1-384 1-399 (414) 11 protein:vir:105064 Length: 421 100.0 2.2E-89 1.4E-92 506.6 44.6 379 1-384 1-403 (421) 12 protein:vir:189 Length: 424 # 100.0 1.4E-89 8.5E-93 507.8 42.7 379 1-384 14-415 (424) 13 protein:vir:483 Length: 413 # 100.0 4.7E-89 2.9E-92 504.8 44.8 380 1-384 4-396 (413) 14 protein:vir:1884 Length: 424 # 100.0 2.1E-89 1.3E-92 506.8 42.8 379 1-384 14-415 (424) 15 protein:vir:4337 Length: 434 # 100.0 3.1E-89 1.9E-92 505.8 43.5 379 1-384 18-411 (434) 16 protein:vir:5737 Length: 419 # 100.0 4.8E-89 3E-92 504.8 43.7 379 1-384 1-398 (419) 17 protein:vir:6240 Length: 457 # 100.0 3.5E-88 2.2E-91 500.1 42.7 381 1-384 1-415 (457) 18 protein:vir:100150 Length: 437 100.0 4.4E-88 2.7E-91 499.5 43.1 380 1-384 1-409 (437) 19 protein:vir:1266 Length: 416 # 100.0 8.8E-88 5.5E-91 497.8 44.3 377 2-384 1-411 (416) 20 protein:vir:97060 Length: 432 100.0 4.8E-88 3E-91 499.3 42.2 376 1-384 7-411 (432) 21 protein:vir:10362 Length: 432 100.0 4.5E-88 2.8E-91 499.5 42.0 376 1-384 7-411 (432) 22 protein:vir:81072 Length: 432 100.0 5.9E-88 3.7E-91 498.8 42.4 376 1-384 7-411 (432) 23 protein:vir:93610 Length: 454 100.0 5.4E-88 3.3E-91 499.0 42.1 378 1-384 2-405 (454) 24 protein:vir:2683 Length: 412 # 100.0 1.9E-87 1.2E-90 496.0 42.1 374 1-384 1-406 (412) 25 protein:vir:4598 Length: 416 # 100.0 2.1E-87 1.3E-90 495.8 42.0 376 1-384 1-413 (416) 26 protein:vir:81095 Length: 416 100.0 2.1E-87 1.3E-90 495.8 42.0 376 1-384 1-413 (416) 27 protein:vir:80333 Length: 419 100.0 4E-87 2.5E-90 494.2 43.3 378 1-384 1-397 (419) 28 protein:vir:7853 Length: 518 # 100.0 5.2E-87 3.2E-90 493.6 43.8 381 1-384 1-406 (518) 29 protein:vir:9408 Length: 441 # 100.0 3.8E-87 2.3E-90 494.4 42.3 376 1-384 26-438 (441) 30 protein:vir:79984 Length: 441 100.0 3.8E-87 2.3E-90 494.4 42.3 376 1-384 26-438 (441) 31 protein:vir:1431 Length: 419 # 100.0 5.5E-87 3.4E-90 493.5 43.0 378 1-384 1-397 (419) 32 protein:vir:1326 Length: 457 # 100.0 5E-87 3.1E-90 493.7 42.8 381 1-384 1-416 (457) 33 protein:vir:98396 Length: 441 100.0 4.3E-87 2.7E-90 494.1 42.2 375 1-384 26-438 (441) 34 protein:vir:101648 Length: 518 100.0 1.1E-86 6.6E-90 491.9 43.8 381 1-384 1-414 (518) 35 protein:vir:96980 Length: 409 100.0 7.6E-87 4.7E-90 492.7 41.9 374 1-384 4-403 (409) 36 protein:vir:81218 Length: 423 100.0 1.1E-86 6.6E-90 491.9 41.9 384 1-384 1-415 (423) 37 protein:vir:8418 Length: 409 # 100.0 2.1E-86 1.3E-89 490.3 42.6 371 1-384 1-391 (409) 38 protein:vir:93943 Length: 409 100.0 7.7E-86 4.8E-89 487.2 41.8 374 1-384 4-403 (409) 39 protein:vir:9702 Length: 406 # 100.0 2.1E-85 1.3E-88 484.8 42.7 374 1-384 1-384 (406) 40 protein:vir:960 Length: 413 # 100.0 2.1E-85 1.3E-88 484.8 41.0 375 1-384 13-412 (413) 41 protein:vir:94426 Length: 409 100.0 4.1E-85 2.6E-88 483.2 41.9 374 1-384 4-403 (409) 42 protein:vir:95378 Length: 406 100.0 1.4E-84 8.8E-88 480.3 42.8 374 1-384 1-397 (406) 43 protein:vir:3868 Length: 417 # 100.0 2.7E-84 1.7E-87 478.7 41.6 373 1-384 1-401 (417) 44 protein:vir:80134 Length: 403 100.0 5E-84 3.1E-87 477.3 42.2 374 1-384 1-394 (403) 45 protein:vir:101647 Length: 460 100.0 8E-83 5E-86 470.7 41.9 380 1-384 2-448 (460) 46 protein:vir:94666 Length: 723 100.0 6E-81 3.7E-84 460.4 40.6 368 1-384 1-400 (723) 47 protein:vir:8317 Length: 409 # 100.0 1E-79 6.2E-83 453.7 39.6 356 1-384 1-408 (409) 48 protein:vir:3843 Length: 397 # 100.0 7.6E-79 4.7E-82 448.8 40.5 363 1-384 1-383 (397) 49 protein:vir:102727 Length: 945 100.0 2.2E-78 1.4E-81 446.3 41.6 379 1-384 62-500 (945) 50 protein:vir:104259 Length: 403 100.0 2E-78 1.3E-81 446.5 40.1 367 1-384 1-392 (403) 51 protein:vir:8100 Length: 466 # 100.0 3.4E-78 2.1E-81 445.3 41.0 376 1-384 1-462 (466) 52 protein:vir:9359 Length: 348 # 100.0 1.2E-77 7.2E-81 442.4 39.2 322 53-384 1-342 (348) 53 protein:vir:94002 Length: 378 100.0 4.2E-78 2.6E-81 444.8 34.2 344 1-384 1-360 (378) 54 protein:vir:93867 Length: 378 100.0 6.8E-78 4.2E-81 443.7 34.0 344 1-384 1-360 (378) 55 protein:vir:1661 Length: 378 # 100.0 9.5E-78 5.9E-81 442.8 34.6 344 1-384 1-360 (378) 56 protein:vir:4854 Length: 386 # 100.0 1.5E-76 9.5E-80 436.2 38.5 361 1-384 1-381 (386) 57 protein:vir:100882 Length: 383 100.0 4.2E-76 2.6E-79 433.8 40.4 359 1-384 1-381 (383) 58 protein:vir:100187 Length: 385 100.0 4.4E-76 2.7E-79 433.7 39.9 357 1-384 1-381 (385) 59 protein:vir:95965 Length: 385 100.0 2.6E-76 1.6E-79 434.9 37.5 363 1-384 1-381 (385) 60 protein:vir:6210 Length: 394 # 100.0 5.3E-76 3.3E-79 433.3 39.0 364 1-384 1-390 (394) 61 protein:vir:101289 Length: 395 100.0 7.2E-76 4.4E-79 432.6 38.0 361 1-384 1-374 (395) 62 protein:vir:9507 Length: 395 # 100.0 7.2E-76 4.4E-79 432.6 38.0 361 1-384 1-374 (395) 63 protein:vir:100650 Length: 395 100.0 7.2E-76 4.4E-79 432.6 38.0 361 1-384 1-374 (395) 64 protein:vir:80796 Length: 574 100.0 5.2E-75 3.2E-78 427.8 42.6 379 1-384 63-483 (574) 65 protein:vir:78310 Length: 376 100.0 8.1E-76 5E-79 432.3 34.8 362 1-384 1-374 (376) 66 protein:vir:4089 Length: 395 # 100.0 1.3E-74 8E-78 425.7 36.3 368 1-384 1-387 (395) 67 protein:vir:80644 Length: 551 100.0 1.7E-73 1.1E-76 419.5 41.9 378 1-384 31-480 (551) 68 protein:vir:4952 Length: 386 # 100.0 7.6E-74 4.7E-77 421.4 39.3 361 1-384 1-381 (386) 69 protein:vir:7407 Length: 392 # 100.0 8.4E-74 5.2E-77 421.2 38.9 360 1-384 3-386 (392) 70 protein:vir:94869 Length: 378 100.0 2.1E-74 1.3E-77 424.5 34.7 343 1-384 1-360 (378) 71 protein:vir:63755 Length: 547 100.0 3.7E-73 2.3E-76 417.7 40.8 379 1-384 1-476 (547) 72 protein:vir:858 Length: 378 # 100.0 4.7E-74 2.9E-77 422.6 35.7 343 1-384 1-360 (378) 73 protein:vir:4995 Length: 384 # 100.0 7.2E-74 4.5E-77 421.6 35.5 356 1-372 1-384 (384) 74 protein:vir:4828 Length: 382 # 100.0 2.6E-73 1.6E-76 418.5 37.9 357 1-384 1-377 (382) 75 protein:vir:100691 Length: 535 100.0 1.3E-72 7.9E-76 414.7 41.1 377 1-384 53-478 (535) 76 protein:vir:98643 Length: 395 100.0 4.2E-73 2.6E-76 417.4 36.6 370 1-384 1-388 (395) 77 protein:vir:9641 Length: 395 # 100.0 2.8E-73 1.7E-76 418.4 35.5 365 1-384 1-388 (395) 78 protein:vir:3989 Length: 392 # 100.0 1.7E-72 1E-75 414.1 39.7 358 1-384 3-386 (392) 79 protein:vir:1023 Length: 392 # 100.0 1.7E-72 1E-75 414.1 39.7 358 1-384 3-386 (392) 80 protein:vir:1082 Length: 359 # 100.0 7.9E-73 4.9E-76 415.9 37.8 344 1-366 1-359 (359) 81 protein:vir:95599 Length: 563 100.0 1.7E-71 1.1E-74 408.5 40.3 375 1-384 43-486 (563) 82 protein:vir:99312 Length: 563 100.0 1.7E-71 1.1E-74 408.5 40.3 375 1-384 43-486 (563) 83 protein:vir:96579 Length: 576 100.0 1.1E-70 6.9E-74 404.1 38.1 375 1-384 32-485 (576) 84 protein:vir:4194 Length: 540 # 100.0 1.1E-70 6.5E-74 404.2 36.8 364 3-384 1-425 (540) 85 protein:vir:3153 Length: 467 # 100.0 8.6E-70 5.3E-73 399.2 38.4 353 32-384 1-431 (467) 86 protein:vir:4156 Length: 542 # 100.0 6.7E-70 4.1E-73 399.8 37.5 366 3-384 1-427 (542) 87 protein:vir:79772 Length: 648 100.0 9.1E-64 5.6E-67 366.2 34.7 374 1-384 34-470 (648) 88 protein:vir:99452 Length: 651 100.0 4.5E-63 2.8E-66 362.4 34.3 380 1-384 1-522 (651) 89 protein:vir:78641 Length: 278 100.0 1.2E-59 7.3E-63 343.6 33.0 269 53-331 1-278 (278) 90 protein:vir:79150 Length: 368 100.0 1.2E-52 7.3E-56 305.3 28.7 321 4-349 1-368 (368) 91 protein:vir:100328 Length: 346 100.0 2.4E-51 1.5E-54 298.1 31.3 316 1-336 1-346 (346) 92 protein:vir:103971 Length: 376 100.0 7.7E-51 4.8E-54 295.3 31.7 301 1-338 21-376 (376) 93 protein:vir:79207 Length: 351 100.0 1.3E-50 8.1E-54 294.1 31.0 298 4-338 1-351 (351) 94 protein:vir:267 Length: 348 # 100.0 1.9E-50 1.2E-53 293.2 31.7 305 1-339 1-348 (348) 95 protein:vir:78191 Length: 351 100.0 1.9E-50 1.2E-53 293.2 31.0 292 4-338 1-351 (351) 96 protein:vir:98567 Length: 340 100.0 1.8E-49 1.1E-52 287.9 29.7 308 1-335 1-340 (340) 97 protein:vir:1150 Length: 350 # 100.0 3.7E-49 2.3E-52 286.1 29.5 306 4-331 1-350 (350) 98 protein:vir:4698 Length: 251 # 100.0 1.5E-49 9.2E-53 288.3 26.9 239 1-247 1-251 (251) 99 protein:vir:6058 Length: 344 # 100.0 5.8E-49 3.6E-52 285.1 30.0 306 4-336 1-344 (344) 100 protein:vir:5691 Length: 344 # 100.0 6.8E-49 4.2E-52 284.7 29.1 307 4-336 1-344 (344) 101 protein:vir:3780 Length: 345 # 100.0 1.2E-48 7.3E-52 283.3 29.7 313 1-333 1-345 (345) 102 protein:vir:2013 Length: 344 # 100.0 8.2E-49 5.1E-52 284.2 28.6 307 4-336 1-344 (344) 103 protein:vir:3743 Length: 345 # 100.0 3.5E-48 2.2E-51 280.7 31.8 313 1-333 1-345 (345) 104 protein:vir:78749 Length: 337 100.0 2.8E-48 1.7E-51 281.3 29.6 313 4-332 1-337 (337) 105 protein:vir:98853 Length: 219 100.0 1.2E-38 7.7E-42 228.4 22.6 200 129-335 1-219 (219) 106 protein:vir:5249 Length: 437 # 99.9 1.1E-24 7E-28 151.9 29.2 368 1-384 1-427 (437) 107 protein:vir:94049 Length: 532 99.8 1.8E-20 1.1E-23 128.8 28.9 368 1-384 55-498 (532) 108 protein:vir:107742 Length: 537 99.8 2.1E-20 1.3E-23 128.5 28.9 364 1-384 55-514 (537) 109 protein:vir:79647 Length: 435 99.8 1.2E-20 7.3E-24 129.8 24.8 359 1-384 1-433 (435) 110 protein:vir:103860 Length: 528 99.8 1.5E-17 9.2E-21 112.8 33.3 369 1-384 5-433 (528) 111 protein:vir:99232 Length: 526 99.8 3.4E-17 2.1E-20 110.9 33.9 370 1-384 1-430 (526) 112 protein:vir:99853 Length: 488 99.8 1.9E-17 1.2E-20 112.2 32.0 367 1-384 17-403 (488) 113 protein:vir:80040 Length: 461 99.7 1.7E-18 1.1E-21 118.0 25.2 367 1-384 20-460 (461) 114 protein:vir:99563 Length: 862 99.7 1.1E-17 7.1E-21 113.5 29.0 364 1-384 101-539 (862) 115 protein:vir:104338 Length: 422 99.7 3E-18 1.8E-21 116.7 24.8 356 1-384 1-422 (422) 116 protein:vir:79063 Length: 491 99.7 1.4E-16 8.9E-20 107.5 33.5 366 1-384 3-417 (491) 117 protein:vir:1986 Length: 512 # 99.7 2.2E-16 1.4E-19 106.4 33.4 370 1-384 1-424 (512) 118 protein:vir:107880 Length: 491 99.7 3.8E-16 2.4E-19 105.1 33.8 364 1-384 3-411 (491) 119 protein:vir:96068 Length: 765 99.7 4.7E-17 2.9E-20 110.1 28.2 362 1-384 44-511 (765) 120 protein:vir:108215 Length: 469 99.7 4E-16 2.5E-19 105.0 32.8 368 7-384 1-444 (469) 121 protein:vir:79233 Length: 526 99.7 8.3E-16 5.2E-19 103.3 33.8 370 1-384 1-430 (526) 122 protein:vir:107662 Length: 427 99.7 9.3E-18 5.7E-21 114.0 22.5 353 1-384 1-423 (427) 123 protein:vir:79538 Length: 502 99.7 5.1E-16 3.2E-19 104.4 28.1 381 1-384 1-492 (502) 124 protein:vir:96738 Length: 505 99.6 1.3E-15 8.1E-19 102.2 24.0 379 1-384 8-504 (505) 125 protein:vir:95542 Length: 548 99.6 1.2E-14 7.4E-18 96.9 26.4 380 1-384 1-497 (548) 126 protein:vir:389 Length: 530 # 99.6 1.2E-14 7.6E-18 96.9 26.0 384 1-384 1-522 (530) 127 protein:vir:3420 Length: 533 # 99.5 2.1E-14 1.3E-17 95.6 25.2 384 1-384 3-526 (533) 128 protein:vir:98816 Length: 446 99.5 3.5E-14 2.2E-17 94.4 24.6 354 1-370 1-446 (446) 129 protein:vir:95254 Length: 488 99.5 3.4E-13 2.1E-16 89.0 29.4 378 4-384 1-462 (488) 130 protein:vir:79511 Length: 448 99.5 1.2E-13 7.4E-17 91.4 26.7 372 1-384 1-434 (448) 131 protein:vir:6382 Length: 553 # 99.5 6.5E-14 4E-17 92.9 24.5 382 1-384 2-553 (553) 132 protein:vir:10321 Length: 495 99.5 1.2E-13 7.2E-17 91.5 23.4 379 1-384 1-495 (495) 133 protein:vir:77981 Length: 448 99.4 9.2E-13 5.7E-16 86.6 28.0 366 4-384 1-429 (448) 134 protein:vir:105782 Length: 449 99.3 7.7E-12 4.8E-15 81.5 24.5 358 1-384 23-445 (449) 135 protein:vir:78589 Length: 695 99.3 4.8E-12 3E-15 82.6 23.2 367 1-384 67-539 (695) 136 protein:vir:106491 Length: 646 99.3 2.9E-12 1.8E-15 83.8 21.7 377 1-384 1-476 (646) 137 protein:vir:101541 Length: 694 99.3 5.6E-12 3.5E-15 82.3 23.3 367 1-384 69-538 (694) 138 protein:vir:3648 Length: 695 # 99.3 6E-12 3.7E-15 82.1 22.7 367 1-384 67-539 (695) 139 protein:vir:78161 Length: 355 99.3 1.5E-11 9.4E-15 79.9 24.2 275 107-384 1-322 (355) 140 protein:vir:102426 Length: 631 99.2 9.8E-12 6.1E-15 80.9 20.5 380 1-384 1-503 (631) 141 protein:vir:97376 Length: 320 99.2 2.2E-13 1.3E-16 90.0 10.6 311 1-373 1-320 (320) 142 protein:vir:106716 Length: 698 99.2 3.7E-11 2.3E-14 77.8 22.5 369 1-384 67-539 (698) 143 protein:vir:8654 Length: 629 # 99.1 2.5E-11 1.5E-14 78.7 19.0 379 1-384 1-486 (629) 144 protein:vir:99088 Length: 629 99.1 3E-11 1.8E-14 78.3 18.8 379 1-384 1-486 (629) 145 protein:vir:106027 Length: 629 99.0 3.6E-10 2.2E-13 72.4 21.2 376 1-384 1-494 (629) 146 protein:vir:107517 Length: 639 99.0 2E-10 1.3E-13 73.7 17.8 380 1-384 1-486 (639) 147 protein:vir:97900 Length: 639 99.0 2E-10 1.3E-13 73.7 17.8 380 1-384 1-486 (639) 148 protein:vir:4073 Length: 279 # 98.8 2E-10 1.2E-13 73.8 11.3 272 41-369 1-279 (279) 149 protein:vir:5839 Length: 533 # 98.8 1.9E-08 1.2E-11 62.9 21.9 375 1-384 20-497 (533) 150 protein:vir:7768 Length: 484 # 98.7 8.8E-08 5.5E-11 59.3 23.4 362 1-384 1-465 (484) 151 protein:vir:98444 Length: 434 98.7 1E-07 6.3E-11 58.9 23.5 339 21-384 1-431 (434) 152 protein:vir:2427 Length: 485 # 98.5 3.7E-07 2.3E-10 55.9 25.6 362 1-384 6-483 (485) 153 protein:vir:102602 Length: 456 98.4 6.2E-07 3.9E-10 54.6 25.4 362 1-384 7-455 (456) 154 protein:vir:105819 Length: 456 98.4 6.2E-07 3.9E-10 54.6 25.4 362 1-384 7-455 (456) 155 protein:vir:7987 Length: 456 # 98.4 1.1E-06 6.8E-10 53.3 24.1 361 1-384 7-451 (456) 156 protein:vir:5961 Length: 503 # 98.3 1.2E-06 7.4E-10 53.1 29.7 359 1-384 13-485 (503) 157 protein:vir:94742 Length: 409 98.3 2E-06 1.2E-09 51.9 30.9 331 1-366 1-409 (409) 158 protein:vir:104082 Length: 485 98.2 2.4E-06 1.5E-09 51.4 28.4 362 1-384 5-483 (485) 159 protein:vir:9751 Length: 422 # 98.2 2.5E-06 1.5E-09 51.3 24.1 347 1-382 1-422 (422) 160 protein:vir:9815 Length: 500 # 98.2 3.2E-06 2E-09 50.7 23.1 364 1-384 1-500 (500) 161 protein:vir:3028 Length: 500 # 98.2 3.2E-06 2E-09 50.7 23.1 364 1-384 1-500 (500) 162 protein:vir:4223 Length: 486 # 98.1 3.6E-06 2.2E-09 50.5 24.8 362 1-384 6-485 (486) 163 protein:vir:80680 Length: 441 98.1 5.4E-06 3.4E-09 49.5 26.9 354 1-384 6-440 (441) 164 protein:vir:1634 Length: 409 # 98.0 7.1E-06 4.4E-09 48.8 29.4 332 1-366 1-409 (409) 165 protein:vir:1587 Length: 508 # 97.9 1.1E-05 6.8E-09 47.8 26.1 364 1-384 1-506 (508) 166 protein:vir:4898 Length: 502 # 97.9 1.1E-05 7E-09 47.7 25.7 371 1-384 31-498 (502) 167 protein:vir:38 Length: 496 # N 97.9 1.3E-05 8.1E-09 47.4 24.3 366 1-384 16-494 (496) 168 protein:vir:9568 Length: 410 # 97.8 1.5E-05 9.4E-09 47.0 29.0 341 1-384 1-410 (410) 169 protein:vir:2341 Length: 488 # 97.8 1.8E-05 1.1E-08 46.6 27.1 368 1-384 10-479 (488) 170 protein:vir:99072 Length: 479 97.8 1.8E-05 1.1E-08 46.6 28.4 356 1-384 20-458 (479) 171 protein:vir:9306 Length: 511 # 97.8 1.9E-05 1.2E-08 46.5 28.4 372 1-384 40-498 (511) 172 protein:vir:99916 Length: 504 97.8 1.9E-05 1.2E-08 46.5 27.1 369 1-384 23-487 (504) 173 protein:vir:78537 Length: 480 97.8 2.1E-05 1.3E-08 46.2 26.0 360 1-384 1-462 (480) 174 protein:vir:95806 Length: 440 97.8 2.2E-05 1.3E-08 46.2 28.6 367 3-384 1-436 (440) 175 protein:vir:99781 Length: 511 97.7 2.5E-05 1.5E-08 45.8 28.0 372 1-384 29-498 (511) 176 protein:vir:103219 Length: 201 97.7 2.1E-06 1.3E-09 51.8 12.0 165 205-384 1-199 (201) 177 protein:vir:105154 Length: 525 97.7 6.4E-06 4E-09 49.0 14.7 366 1-384 51-511 (525) 178 protein:vir:94101 Length: 474 97.7 2.8E-05 1.7E-08 45.6 28.7 364 1-384 1-463 (474) 179 protein:vir:105889 Length: 474 97.7 2.8E-05 1.7E-08 45.6 28.7 364 1-384 1-463 (474) 180 protein:vir:98883 Length: 517 97.7 2.8E-05 1.8E-08 45.5 26.1 367 1-384 1-517 (517) 181 protein:vir:78227 Length: 480 97.7 3.3E-05 2E-08 45.2 26.9 360 1-384 1-462 (480) 182 protein:vir:79703 Length: 505 97.6 3.4E-05 2.1E-08 45.1 23.8 364 1-384 1-505 (505) 183 protein:vir:99522 Length: 470 97.6 3.6E-05 2.2E-08 45.0 29.0 361 1-384 1-468 (470) 184 protein:vir:96240 Length: 511 97.6 3.9E-05 2.4E-08 44.8 28.6 372 1-384 27-498 (511) 185 protein:vir:94805 Length: 492 97.6 4.3E-05 2.7E-08 44.5 29.8 353 1-384 43-476 (492) 186 protein:vir:1236 Length: 483 # 97.5 4.7E-05 2.9E-08 44.3 30.4 353 1-384 34-467 (483) 187 protein:vir:106639 Length: 481 97.5 5.1E-05 3.2E-08 44.1 30.9 366 1-384 23-480 (481) 188 protein:vir:80959 Length: 499 97.5 5.3E-05 3.3E-08 44.0 26.3 366 1-384 16-497 (499) 189 protein:vir:98265 Length: 524 97.5 5.3E-05 3.3E-08 44.0 25.4 376 1-384 17-523 (524) 190 protein:vir:103951 Length: 511 97.5 5.7E-05 3.6E-08 43.8 29.5 372 1-384 39-498 (511) 191 protein:vir:104500 Length: 537 97.5 6.2E-05 3.8E-08 43.7 25.1 379 1-384 3-536 (537) 192 protein:vir:93747 Length: 472 97.4 6.8E-05 4.2E-08 43.4 29.7 353 1-384 23-461 (472) 193 protein:vir:97336 Length: 492 97.4 6.9E-05 4.3E-08 43.4 28.8 353 1-384 42-476 (492) 194 protein:vir:8184 Length: 474 # 97.4 7.9E-05 4.9E-08 43.1 27.3 369 1-384 17-474 (474) 195 protein:vir:3964 Length: 453 # 97.3 8.9E-05 5.5E-08 42.8 28.5 361 1-384 19-444 (453) 196 protein:vir:96494 Length: 501 97.2 0.00012 7.7E-08 42.0 30.4 371 1-384 38-497 (501) 197 protein:vir:4782 Length: 522 # 97.2 0.00014 9E-08 41.6 26.2 366 1-384 1-519 (522) 198 protein:vir:5665 Length: 511 # 97.0 0.00021 1.3E-07 40.7 23.6 376 1-384 5-511 (511) 199 protein:vir:95113 Length: 474 97.0 0.00023 1.4E-07 40.5 28.9 348 1-384 36-463 (474) 200 protein:vir:97447 Length: 474 97.0 0.00023 1.4E-07 40.5 29.2 352 1-384 21-458 (474) 201 protein:vir:94498 Length: 474 97.0 0.00023 1.4E-07 40.5 29.2 352 1-384 21-458 (474) 202 protein:vir:106282 Length: 521 97.0 0.00024 1.5E-07 40.4 24.3 377 1-384 21-520 (521) 203 protein:vir:97171 Length: 512 96.9 0.00027 1.7E-07 40.2 30.6 372 1-384 42-499 (512) 204 protein:vir:96366 Length: 511 96.9 0.00029 1.8E-07 40.0 29.9 372 1-384 39-498 (511) 205 protein:vir:78805 Length: 511 96.9 0.00029 1.8E-07 40.0 29.9 372 1-384 39-498 (511) 206 protein:vir:79043 Length: 479 96.8 0.00033 2E-07 39.7 30.6 357 1-384 20-475 (479) 207 protein:vir:2732 Length: 501 # 96.8 0.00033 2.1E-07 39.6 30.3 371 1-384 38-486 (501) 208 protein:vir:2500 Length: 501 # 96.8 0.00035 2.2E-07 39.5 25.2 348 1-384 33-494 (501) 209 protein:vir:733 Length: 453 # 96.6 0.00045 2.8E-07 38.9 30.6 365 1-384 11-452 (453) 210 protein:vir:3609 Length: 452 # 96.6 0.00046 2.9E-07 38.9 29.9 355 1-384 9-442 (452) 211 protein:vir:103177 Length: 533 96.6 0.0005 3.1E-07 38.7 24.1 379 1-384 1-513 (533) 212 protein:vir:78907 Length: 518 96.6 0.00051 3.2E-07 38.6 28.7 368 1-383 1-518 (518) 213 protein:vir:9871 Length: 429 # 96.6 0.00051 3.2E-07 38.6 31.3 349 1-384 7-425 (429) 214 protein:vir:96266 Length: 474 96.5 0.00054 3.3E-07 38.5 27.5 352 1-384 25-458 (474) 215 protein:vir:95899 Length: 474 96.5 0.00054 3.3E-07 38.5 27.5 352 1-384 25-458 (474) 216 protein:vir:94546 Length: 506 96.4 0.00061 3.8E-07 38.2 29.5 370 1-384 28-493 (506) 217 protein:vir:106999 Length: 564 96.3 0.00079 4.9E-07 37.6 23.4 379 1-384 1-533 (564) 218 protein:vir:96839 Length: 474 96.0 0.0012 7.5E-07 36.6 29.7 352 1-384 14-462 (474) 219 protein:vir:108049 Length: 524 95.9 0.0013 7.8E-07 36.5 24.4 377 1-384 15-522 (524) 220 protein:vir:6896 Length: 523 # 95.9 0.0013 8.2E-07 36.4 22.9 377 1-384 21-521 (523) 221 protein:vir:104892 Length: 558 95.8 0.0015 9.3E-07 36.1 24.9 379 1-384 5-539 (558) 222 protein:vir:78083 Length: 537 95.8 0.0015 9.4E-07 36.0 29.3 362 1-384 11-505 (537) 223 protein:vir:7208 Length: 524 # 95.7 0.0016 9.9E-07 35.9 23.0 376 1-384 18-522 (524) 224 protein:vir:103458 Length: 524 95.6 0.0017 1.1E-06 35.7 23.0 376 1-384 18-522 (524) 225 protein:vir:6596 Length: 521 # 95.5 0.0019 1.2E-06 35.5 25.2 377 1-384 16-520 (521) 226 protein:vir:105292 Length: 478 95.4 0.002 1.3E-06 35.3 29.3 352 1-384 32-469 (478) 227 protein:vir:81017 Length: 521 95.2 0.0025 1.6E-06 34.8 25.3 377 1-384 20-520 (521) 228 protein:vir:101806 Length: 516 95.1 0.0028 1.7E-06 34.6 25.2 376 1-384 11-515 (516) 229 protein:vir:101189 Length: 516 95.1 0.0028 1.7E-06 34.6 25.2 376 1-384 11-515 (516) 230 protein:vir:102950 Length: 471 94.9 0.0031 2E-06 34.3 28.2 352 1-384 1-465 (471) 231 protein:vir:9922 Length: 489 # 94.7 0.0038 2.3E-06 33.9 25.5 372 1-384 13-481 (489) 232 protein:vir:105461 Length: 470 94.6 0.004 2.5E-06 33.7 28.6 353 1-384 1-470 (470) 233 protein:vir:100598 Length: 516 93.8 0.0063 3.9E-06 32.7 25.6 376 1-384 11-515 (516) 234 protein:vir:96179 Length: 468 93.8 0.0063 3.9E-06 32.7 28.7 352 1-384 24-463 (468) 235 protein:vir:106571 Length: 499 93.2 0.0083 5.2E-06 32.0 31.3 358 1-384 16-474 (499) 236 protein:vir:107112 Length: 478 93.1 0.0087 5.4E-06 31.9 28.0 353 1-384 14-476 (478) 237 protein:vir:94956 Length: 452 87.9 0.035 2.2E-05 28.5 26.0 353 1-384 1-450 (452) 238 protein:vir:101494 Length: 527 83.9 0.064 4E-05 27.1 16.2 363 1-384 1-517 (527) 239 protein:vir:102239 Length: 527 83.5 0.068 4.2E-05 27.0 16.3 363 1-384 1-517 (527) 240 protein:vir:101418 Length: 569 81.4 0.086 5.3E-05 26.4 12.8 370 1-384 53-553 (569) 241 protein:vir:94709 Length: 522 76.3 0.14 8.5E-05 25.3 20.3 337 1-384 38-482 (522) 242 protein:vir:102330 Length: 451 67.2 0.26 0.00016 23.8 30.2 350 1-382 1-451 (451) 243 protein:vir:7017 Length: 515 # 60.8 0.37 0.00023 23.0 20.7 335 1-384 42-478 (515) 244 protein:vir:1538 Length: 535 # 54.4 0.5 0.00031 22.2 19.7 335 1-384 56-521 (535) 245 protein:vir:3361 Length: 535 # 53.6 0.53 0.00033 22.1 19.5 335 1-384 56-521 (535) 246 protein:vir:102668 Length: 547 53.2 0.53 0.00033 22.1 23.0 320 1-384 54-505 (547) 247 protein:vir:100039 Length: 522 50.3 0.61 0.00038 21.7 18.3 358 1-384 1-508 (522) 248 protein:vir:1785 Length: 555 # 46.6 0.73 0.00045 21.3 19.1 328 1-384 35-491 (555) 249 protein:vir:8883 Length: 543 # 44.7 0.8 0.0005 21.1 17.8 341 1-384 1-467 (543) 250 protein:vir:80165 Length: 651 42.2 0.9 0.00056 20.8 25.9 367 1-384 26-589 (651) 251 protein:vir:103330 Length: 517 36.3 1.2 0.00073 20.2 17.7 360 1-384 1-477 (517) 252 protein:vir:96988 Length: 516 34.8 1.3 0.00079 20.0 19.5 354 1-384 5-479 (516) 253 protein:vir:105641 Length: 516 34.0 1.3 0.00082 19.9 18.3 346 1-384 9-479 (516) 254 protein:vir:2198 Length: 536 # 24.0 2.2 0.0014 18.7 20.6 342 1-384 1-465 (536) 255 protein:vir:94572 Length: 535 23.9 2.2 0.0014 18.7 19.0 350 1-384 1-486 (535) 256 protein:vir:10447 Length: 536 22.1 2.5 0.0016 18.4 20.2 342 1-384 1-465 (536) No 1 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=100.00 E-value=1.2e-92 Score=524.58 Aligned_cols=383 Identities=18% Similarity=0.225 Sum_probs=340.0 Q ss_pred Ccchhhhccc---CCC-------cchhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcce- Q lcl|NC_019422. 1 MNIFKSKKKN---KEA-------PGKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFK- 69 (384) Q Consensus 1 M~~f~~~~~~---~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~- 69 (384) ||||++.+.. ..+ +......+++.+..+..++..+++++++|++||++||+.+|++||++|++++++.+ T Consensus 1 M~~~~~~f~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~l~~~~~~~~~~~~~~ 80 (429) T protein:vir:10 1 MDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYGIQR 80 (429) T ss_pred CchhhhhhcccccCcccccccCCChHHHHHHhcCCCCcceechhhhhccHHHHHHHHHHHHhhccCceEEEEecCCceee Confidence 9999865421 111 12223445555555666777889999999999999999999999999998877754 Q ss_pred eccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCE------EEEEE Q lcl|NC_019422. 70 TNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVL------FLKFL 143 (384) Q Consensus 70 ~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~------~~~~~ 143 (384) ..+++++++|+.+||++||+++||+.++.+++++||+|++++++..|++.+|||++|++|++..++.+. .+++. T Consensus 81 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~~~~~~~~~~~ 160 (429) T protein:vir:10 81 GTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMWYVV 160 (429) T ss_pred ccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccceEEEEE Confidence 456778888889999999999999999999999999999999999999999999999999999886542 23455 Q ss_pred EcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHH Q lcl|NC_019422. 144 LRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKS 223 (384) Q Consensus 144 ~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~ 223 (384) ..+|..+.++++||||++++++.++++|+||+..+..++....++++++.++|+||++|+++|++++.+++++++++++. T Consensus 161 ~~~g~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~~~ 240 (429) T protein:vir:10 161 NTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFREN 240 (429) T ss_pred ccCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHH Confidence 56678889999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhc------cccHHHHHHHHHHHH Q lcl|NC_019422. 224 FEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQ------SKYSEDEWNAYYESE 296 (384) Q Consensus 224 ~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~------~~~~e~~~~~~~~~~ 296 (384) |++.+.| ..++++++|+++|++|++++.++.++++.+. ++.+++||++|||||.+|| +++.+++..+|++.| T Consensus 241 ~~~~~~g-~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn~e~~~~~f~~~~ 319 (429) T protein:vir:10 241 FESMSSG-LQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDT 319 (429) T ss_pred HHHHhcc-ccccCceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHH Confidence 9999976 5577899999999999999999999998776 5789999999999999996 246689999999999 Q ss_pred HHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecC Q lcl|NC_019422. 297 IEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRL 375 (384) Q Consensus 297 i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~ 375 (384) |.|++.+|+++||++|+++.++..+.+++||++.+++.|.+++++.++ ++++|++|+||+|+++|+||+||||++++|+ T Consensus 320 l~P~~~~ie~~ln~kl~~~~~~~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~ 399 (429) T protein:vir:10 320 LQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRLLVNG 399 (429) T ss_pred HHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecc Confidence 999999999999999999999888999999999999999999999875 8999999999999999999999999999999 Q ss_pred ceeecC--------CCC Q lcl|NC_019422. 376 DTAVVE--------GGE 384 (384) Q Consensus 376 n~~~~~--------~ge 384 (384) |++|++ +|| T Consensus 400 n~~~~d~~~~~~~k~g~ 416 (429) T protein:vir:10 400 NMLPIDMAGQAYLKGGD 416 (429) T ss_pred cccchhhccccccCCCC Confidence 998864 344 No 2 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=100.00 E-value=1.4e-92 Score=524.21 Aligned_cols=383 Identities=19% Similarity=0.234 Sum_probs=339.9 Q ss_pred Ccchhhhccc-----CCC-c-------chhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCc Q lcl|NC_019422. 1 MNIFKSKKKN-----KEA-P-------GKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETE 67 (384) Q Consensus 1 M~~f~~~~~~-----~~~-~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 67 (384) ||||++++.. +.+ + ......+++.+..+..++.++++++++|++||++||+.||++||+++++++++ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~ 80 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYG 80 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 9999975321 111 1 12334455545556667778899999999999999999999999999998777 Q ss_pred ce-eccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC------EEE Q lcl|NC_019422. 68 FK-TNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV------LFL 140 (384) Q Consensus 68 ~~-~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~------~~~ 140 (384) .+ ..+++++++|+.+||++||+++||+.++.+++++||+|++++++..|++.+|||++|.+|++..++.+ ..+ T Consensus 81 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~~~ 160 (432) T protein:vir:10 81 IQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMW 160 (432) T ss_pred eeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccceEE Confidence 54 45677788888899999999999999999999999999999999999999999999999999887543 234 Q ss_pred EEEEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHH Q lcl|NC_019422. 141 KFLLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKE 220 (384) Q Consensus 141 ~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~ 220 (384) ++...+|..+.++++||||+|++++.++++|+||+..+.+++....++++++.++|+||++|+++|++++.+++++.+++ T Consensus 161 y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~ 240 (432) T protein:vir:10 161 YVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVF 240 (432) T ss_pred EEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHH Confidence 45556788889999999999998899999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhc------cccHHHHHHHHH Q lcl|NC_019422. 221 VKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQ------SKYSEDEWNAYY 293 (384) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~------~~~~e~~~~~~~ 293 (384) ++.|++.++| ..++++++++++|++|++++.++.|+++.+. ++++++||++|||||.+|| +++.|++..+|+ T Consensus 241 ~~~~~~~~~g-~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~ 319 (432) T protein:vir:10 241 RENFESMSSG-LQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFY 319 (432) T ss_pred HHHHHHHhcc-cccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHH Confidence 9999999976 4577899999999999999999999999876 5788999999999999996 235689999999 Q ss_pred HHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeee Q lcl|NC_019422. 294 ESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPV 372 (384) Q Consensus 294 ~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~ 372 (384) +.||.|++.+|+++||++|+++.++..+.+++||++++++.|.++++++++ ++.+|++|+||+|+++|+||+||||+++ T Consensus 320 ~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD~~~ 399 (432) T protein:vir:10 320 TDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRLL 399 (432) T ss_pred HHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEe Confidence 999999999999999999999999999999999999999999999999876 8999999999999999999999999999 Q ss_pred ecCceeecC--------CCC Q lcl|NC_019422. 373 RRLDTAVVE--------GGE 384 (384) Q Consensus 373 ~~~n~~~~~--------~ge 384 (384) +|+|++|++ +|| T Consensus 400 ~~~n~~~~~~~~~~~~k~~~ 419 (432) T protein:vir:10 400 VNGNMLPIDMAGQAYLKGGD 419 (432) T ss_pred ecccccchhhccccccCCCC Confidence 999998874 333 No 3 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=100.00 E-value=1.4e-92 Score=524.21 Aligned_cols=383 Identities=19% Similarity=0.234 Sum_probs=339.9 Q ss_pred Ccchhhhccc-----CCC-c-------chhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCc Q lcl|NC_019422. 1 MNIFKSKKKN-----KEA-P-------GKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETE 67 (384) Q Consensus 1 M~~f~~~~~~-----~~~-~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 67 (384) ||||++++.. +.+ + ......+++.+..+..++.++++++++|++||++||+.||++||+++++++++ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~ 80 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYG 80 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 9999975321 111 1 12334455545556667778899999999999999999999999999998777 Q ss_pred ce-eccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC------EEE Q lcl|NC_019422. 68 FK-TNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV------LFL 140 (384) Q Consensus 68 ~~-~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~------~~~ 140 (384) .+ ..+++++++|+.+||++||+++||+.++.+++++||+|++++++..|++.+|||++|.+|++..++.+ ..+ T Consensus 81 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~~~ 160 (432) T protein:vir:10 81 IQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMW 160 (432) T ss_pred eeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccceEE Confidence 54 45677788888899999999999999999999999999999999999999999999999999887543 234 Q ss_pred EEEEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHH Q lcl|NC_019422. 141 KFLLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKE 220 (384) Q Consensus 141 ~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~ 220 (384) ++...+|..+.++++||||+|++++.++++|+||+..+.+++....++++++.++|+||++|+++|++++.+++++.+++ T Consensus 161 y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~ 240 (432) T protein:vir:10 161 YVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVF 240 (432) T ss_pred EEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHH Confidence 45556788889999999999998899999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhc------cccHHHHHHHHH Q lcl|NC_019422. 221 VKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQ------SKYSEDEWNAYY 293 (384) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~------~~~~e~~~~~~~ 293 (384) ++.|++.++| ..++++++++++|++|++++.++.|+++.+. ++++++||++|||||.+|| +++.|++..+|+ T Consensus 241 ~~~~~~~~~g-~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~ 319 (432) T protein:vir:10 241 RENFESMSSG-LQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFY 319 (432) T ss_pred HHHHHHHhcc-cccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHH Confidence 9999999976 4577899999999999999999999999876 5788999999999999996 235689999999 Q ss_pred HHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeee Q lcl|NC_019422. 294 ESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPV 372 (384) Q Consensus 294 ~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~ 372 (384) +.||.|++.+|+++||++|+++.++..+.+++||++++++.|.++++++++ ++.+|++|+||+|+++|+||+||||+++ T Consensus 320 ~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD~~~ 399 (432) T protein:vir:10 320 TDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRLL 399 (432) T ss_pred HHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEe Confidence 999999999999999999999999999999999999999999999999876 8999999999999999999999999999 Q ss_pred ecCceeecC--------CCC Q lcl|NC_019422. 373 RRLDTAVVE--------GGE 384 (384) Q Consensus 373 ~~~n~~~~~--------~ge 384 (384) +|+|++|++ +|| T Consensus 400 ~~~n~~~~~~~~~~~~k~~~ 419 (432) T protein:vir:10 400 VNGNMLPIDMAGQAYLKGGD 419 (432) T ss_pred ecccccchhhccccccCCCC Confidence 999998874 333 No 4 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=100.00 E-value=1.4e-92 Score=524.21 Aligned_cols=383 Identities=19% Similarity=0.234 Sum_probs=339.9 Q ss_pred Ccchhhhccc-----CCC-c-------chhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCc Q lcl|NC_019422. 1 MNIFKSKKKN-----KEA-P-------GKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETE 67 (384) Q Consensus 1 M~~f~~~~~~-----~~~-~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 67 (384) ||||++++.. +.+ + ......+++.+..+..++.++++++++|++||++||+.||++||+++++++++ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~ 80 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYG 80 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 9999975321 111 1 12334455545556667778899999999999999999999999999998777 Q ss_pred ce-eccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC------EEE Q lcl|NC_019422. 68 FK-TNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV------LFL 140 (384) Q Consensus 68 ~~-~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~------~~~ 140 (384) .+ ..+++++++|+.+||++||+++||+.++.+++++||+|++++++..|++.+|||++|.+|++..++.+ ..+ T Consensus 81 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~~~ 160 (432) T protein:vir:10 81 IQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMW 160 (432) T ss_pred eeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccceEE Confidence 54 45677788888899999999999999999999999999999999999999999999999999887543 234 Q ss_pred EEEEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHH Q lcl|NC_019422. 141 KFLLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKE 220 (384) Q Consensus 141 ~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~ 220 (384) ++...+|..+.++++||||+|++++.++++|+||+..+.+++....++++++.++|+||++|+++|++++.+++++.+++ T Consensus 161 y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~ 240 (432) T protein:vir:10 161 YVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVF 240 (432) T ss_pred EEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHH Confidence 45556788889999999999998899999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhc------cccHHHHHHHHH Q lcl|NC_019422. 221 VKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQ------SKYSEDEWNAYY 293 (384) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~------~~~~e~~~~~~~ 293 (384) ++.|++.++| ..++++++++++|++|++++.++.|+++.+. ++++++||++|||||.+|| +++.|++..+|+ T Consensus 241 ~~~~~~~~~g-~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~ 319 (432) T protein:vir:10 241 RENFESMSSG-LQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFY 319 (432) T ss_pred HHHHHHHhcc-cccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHH Confidence 9999999976 4577899999999999999999999999876 5788999999999999996 235689999999 Q ss_pred HHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeee Q lcl|NC_019422. 294 ESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPV 372 (384) Q Consensus 294 ~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~ 372 (384) +.||.|++.+|+++||++|+++.++..+.+++||++++++.|.++++++++ ++.+|++|+||+|+++|+||+||||+++ T Consensus 320 ~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD~~~ 399 (432) T protein:vir:10 320 TDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRLL 399 (432) T ss_pred HHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEe Confidence 999999999999999999999999999999999999999999999999876 8999999999999999999999999999 Q ss_pred ecCceeecC--------CCC Q lcl|NC_019422. 373 RRLDTAVVE--------GGE 384 (384) Q Consensus 373 ~~~n~~~~~--------~ge 384 (384) +|+|++|++ +|| T Consensus 400 ~~~n~~~~~~~~~~~~k~~~ 419 (432) T protein:vir:10 400 VNGNMLPIDMAGQAYLKGGD 419 (432) T ss_pred ecccccchhhccccccCCCC Confidence 999998874 333 No 5 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=100.00 E-value=3.5e-92 Score=521.97 Aligned_cols=382 Identities=16% Similarity=0.196 Sum_probs=335.1 Q ss_pred CcchhhhcccCCCcc---hhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcce-eccchHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPG---KVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFK-TNPEIYI 76 (384) Q Consensus 1 M~~f~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~-~~~~~~~ 76 (384) ||||+++++...... ......+..+..+...+...++++++|++||+.||++||++||++|++++++.+ ...++++ T Consensus 1 MG~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~l~ 80 (411) T protein:vir:81 1 MGWWSRLTRFFRPRNETVDMTNPLLLQWLGVDPDTPRNQLSEATYFACLKILSESLGKLPLKMYQKTERGIVKSDREELY 80 (411) T ss_pred CchHHHHHhhccCcccccccchHHHHHHhcCcccChhhhhccHHHHHHHHHHHHhHhhCceeEEEecCCceeeecccHHH Confidence 999997654332211 111122223333344556778999999999999999999999999998877754 4567778 Q ss_pred HHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCE-------EEEEEEc-Cce Q lcl|NC_019422. 77 KFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVL-------FLKFLLR-NGK 148 (384) Q Consensus 77 ~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~-------~~~~~~~-~g~ 148 (384) ++|+.+||++||+++||+.++.+++++||||++++++ .|.+.+||+++|..|++..++++. +|.|... +|+ T Consensus 81 ~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~-~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~g~ 159 (411) T protein:vir:81 81 NLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYS-GPQLQALWILPSQYVTIVVDDRGLLGEKNAIWYRYNDPYDGK 159 (411) T ss_pred HHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCceEEEEEECCceEEEEEcCcccccccceEEEEEEecCCce Confidence 8888999999999999999999999999999999998 589999999999999999887642 3344433 678 Q ss_pred EEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHh Q lcl|NC_019422. 149 IVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNY 228 (384) Q Consensus 149 ~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~ 228 (384) .+.++++||||+|++++.++++|+||+..+..++....++++++.++|+||++|+++|++++.+++++.++++++|.+.+ T Consensus 160 ~~~~~~~eiih~k~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~ 239 (411) T protein:vir:81 160 MYVFRNDEILHFKTSVTFDGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTGDLNQEARDRLVKGFEQFA 239 (411) T ss_pred EEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHh Confidence 88999999999999888999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc------ccHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 229 LQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS------KYSEDEWNAYYESEIEPVG 301 (384) Q Consensus 229 ~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~------~~~e~~~~~~~~~~i~P~~ 301 (384) .| ..++|+++++++|++|++++.++.++++.+. ++.+++||++|||||.+||. ++.|++..+|++.||.|++ T Consensus 240 ~g-~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~l~P~~ 318 (411) T protein:vir:81 240 NG-SKNAGKIIPVPLGMKLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKSSYASAEAQNLAFYVDTLLYVL 318 (411) T ss_pred cC-ccccCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHHHHHHHHHHHHHHHH Confidence 77 4567899999999999999999999999877 56789999999999999962 3567888999999999999 Q ss_pred HHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeec Q lcl|NC_019422. 302 LQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVV 380 (384) Q Consensus 302 ~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~ 380 (384) ++|+++||++|+++.++..+.+++||++++++.|.+++++.++ ++.+|++|+||+|+++|+||+||||++++++|++|+ T Consensus 319 ~~ie~~l~~~ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R~~~gl~p~~ggD~~~~~~n~~pl 398 (411) T protein:vir:81 319 KQYEEEITYKILSNDLISQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEARDYLDMPADDYGNNLMANGNYIPL 398 (411) T ss_pred HHHHHHHHhhcCChhhcCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeeccCccch Confidence 9999999999999998888999999999999999999999875 899999999999999999999999999999999998 Q ss_pred --------CCCC Q lcl|NC_019422. 381 --------EGGE 384 (384) Q Consensus 381 --------~~ge 384 (384) ++|| T Consensus 399 ~~~~~~~~kgGd 410 (411) T protein:vir:81 399 SMLGANYGKGGD 410 (411) T ss_pred hhhhhhhccCCC Confidence 3666 No 6 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=100.00 E-value=7.5e-92 Score=520.16 Aligned_cols=380 Identities=19% Similarity=0.231 Sum_probs=335.6 Q ss_pred CcchhhhcccCCCcchh-HH------------Hhhcc--ccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKV-MM------------ELISD--SGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNE 65 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~-~~------------~~~~~--~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 65 (384) ||||++.++.+...... .. .+... ...+..++...++++++|++||+.||+.||++|+++++..+ T Consensus 1 MG~f~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~al~~~~v~~ci~~ia~~iA~lp~~~~~~~~ 80 (422) T protein:vir:13 1 MGFLRGLFNKKNNNDEKRSNYDEDIGIDISDSNFWEKFGIKLNFSVRGKRALKENTVYVCTKIRAESIGKLSLKIYKDKE 80 (422) T ss_pred CchhhhhhhccCCccchhhhhhhccccccCcchhhhhccccCCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEecCc Confidence 99999765443322111 00 11111 11233456678899999999999999999999999997543 Q ss_pred CcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC-------E Q lcl|NC_019422. 66 TEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV-------L 138 (384) Q Consensus 66 ~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~-------~ 138 (384) +..+++++++|+.+||++||+++||+.++.+++++||+|+++.|+..|++.+|+|++|.+|++..+.++ . T Consensus 81 ---~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~~~~~~~~~~~~~~~~ 157 (422) T protein:vir:13 81 ---EYKEHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDRKGKIIGLYPINSDNVTKIIDDDNFLSSLSKV 157 (422) T ss_pred ---ccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEcCCcceeccceE Confidence 345678999999999999999999999999999999999999999999999999999999999998875 3 Q ss_pred EEEEEEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHH Q lcl|NC_019422. 139 FLKFLLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIK 218 (384) Q Consensus 139 ~~~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~ 218 (384) +|.+...+|....++++||||++.+++.++++|+||+..+..++....++++++.++|+||++|+++|++++.+++++.+ T Consensus 158 ~y~~~~~~g~~~~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~ 237 (422) T protein:vir:13 158 WYVVTDKNGKEHKLLPDEMLHFIGDITLDGLIGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYVGDLDEKAKK 237 (422) T ss_pred EEEEEeCCCeEEEEcccceEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHH Confidence 45666778899999999999999988899999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc------ccHHHHHHH Q lcl|NC_019422. 219 KEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS------KYSEDEWNA 291 (384) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~------~~~e~~~~~ 291 (384) ++++.|++.++| .+++++++|+++|++|++++.++.|+|+.+. ++.+++||++|||||.+||. ++.+++..+ T Consensus 238 ~~~~~~~~~~~g-~~n~~~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~e~~~~~ 316 (422) T protein:vir:13 238 IFKKEFESMSNG-LENAHSISLLPFGYQFQPISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERATFNNLTEQQKD 316 (422) T ss_pred HHHHHHHHHhcC-ccccCCceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH Confidence 999999999876 4567899999999999999999999999877 56889999999999999973 466889999 Q ss_pred HHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCe Q lcl|NC_019422. 292 YYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDK 370 (384) Q Consensus 292 ~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~ 370 (384) |++.||.|++++||++||++|+++.++..+.+|+||++++++.|.++++++++ ++++|++|+||+|+++|+||+||||+ T Consensus 317 f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~ 396 (422) T protein:vir:13 317 FYVTTLQSSLTVYEQEIQDKLFSQYETLQDVKAEFNVDTILRSDIKTRYEAYRIGIQGGFIEANEARRRENLPPVEGGDR 396 (422) T ss_pred HHHHHHHHHHHHHHHHHHHhhCChhhhcCCceEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe Confidence 99999999999999999999999998888899999999999999999999986 89999999999999999999999999 Q ss_pred eeecCceeecCCCC Q lcl|NC_019422. 371 PVRRLDTAVVEGGE 384 (384) Q Consensus 371 ~~~~~n~~~~~~ge 384 (384) +++|+|++|++..+ T Consensus 397 ~~~~~n~~~l~~~~ 410 (422) T protein:vir:13 397 LLVNGNMIPIEMAG 410 (422) T ss_pred eeeccCccchhhcc Confidence 99999999986432 No 7 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=100.00 E-value=1.4e-91 Score=518.75 Aligned_cols=381 Identities=18% Similarity=0.216 Sum_probs=338.5 Q ss_pred CcchhhhcccCCC----cchhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHH Q lcl|NC_019422. 1 MNIFKSKKKNKEA----PGKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYI 76 (384) Q Consensus 1 M~~f~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~ 76 (384) |-| .++++++.. ....+..+++.+..+.+++.++++++++|++||+.||+.||++||++|+.++++.+...++++ T Consensus 1 m~f-~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~~~~~~~~l~ 79 (409) T protein:vir:10 1 MLF-RKGFKNQSQEISIDDKKILEWLGINPSETYVNGKSCLKQATVFGCIRILSDNISKLPIKIYQKKDGIKRVPDHYLE 79 (409) T ss_pred Ccc-cccccCcCCCCCCChHHHHHHhcCCcCcceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCeeeccCchHH Confidence 874 444443333 233345555555556666778899999999999999999999999999987666667778888 Q ss_pred HHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCE-------EEEEEEcCceE Q lcl|NC_019422. 77 KFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVL-------FLKFLLRNGKI 149 (384) Q Consensus 77 ~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~-------~~~~~~~~g~~ 149 (384) ++|+.+||++||+++||+.++.+++++||+|++++++..|.+.+|||++|.+|++..+.++. .|.+....|.. T Consensus 80 ~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~V~v~~~~~~~~~~~~~~~y~~~~~~g~~ 159 (409) T protein:vir:10 80 YLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIKGLYPLKSDGMKIFVDDTGLLNSENNVWYLYTDDLGQR 159 (409) T ss_pred HHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCCceEEEEcCCccccccceEEEEEEeCCcee Confidence 88999999999999999999999999999999999999999999999999999999876543 35666677888 Q ss_pred EEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhc Q lcl|NC_019422. 150 VSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYL 229 (384) Q Consensus 150 ~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~ 229 (384) +.++++||||+|+++ .++++|+||+..+.+++....++++++.++|+||++|+++|++++.+++++.+++++.|++.+. T Consensus 160 ~~~~~~evih~r~~~-~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~~ 238 (409) T protein:vir:10 160 HKFMSDEILHFKGLT-ADGLAGLSVIELLNHLIENGKSSETYLNNFFKNGLQVKGLVQYAGDLNPEAEEVFKENFERMSS 238 (409) T ss_pred EEeccccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCHHHHHHHHHHHHHHhc Confidence 999999999999764 6789999999999999999999999999999999999999999999999999999999999997 Q ss_pred cccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhc------cccHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 230 QIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQ------SKYSEDEWNAYYESEIEPVGL 302 (384) Q Consensus 230 ~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~------~~~~e~~~~~~~~~~i~P~~~ 302 (384) | ..++++++++++|++|++++.++.++++.+. ++.+++||++|||||.+|| +++.+++..+|+++||.|+++ T Consensus 239 g-~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~e~~~~~f~~~~l~P~~~ 317 (409) T protein:vir:10 239 G-LKNAHRIAMLPIGYKFEPISQKLVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRATHSNITEQNREFYIDTLQSILN 317 (409) T ss_pred c-ccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccccHHHHHHHHHHHHHHHHHH Confidence 6 4567899999999999999999999999876 5789999999999999996 345688999999999999999 Q ss_pred HHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHH-HHHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeecC Q lcl|NC_019422. 303 QLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLV-QMVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVE 381 (384) Q Consensus 303 ~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~-~~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~ 381 (384) +|+++||++|+++.++..+.+++||++++++.|.+++++.+ +++++|++|+||+|+++|+||+||||++++|+|++|++ T Consensus 318 ~ie~~ln~kL~~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~ggD~~~~~~n~~~~~ 397 (409) T protein:vir:10 318 MYELEINYKLFLISEIKNGFYSKFNVDTILRADIKTRYESYKEAIQNGFKTPNEIRELEEDEPLEGGDVLLINGNMIPVK 397 (409) T ss_pred HHHHHHHHhhcCchhccCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccCccchh Confidence 99999999999998888889999999999999999999987 48999999999999999999999999999999999986 Q ss_pred C--------CC Q lcl|NC_019422. 382 G--------GE 384 (384) Q Consensus 382 ~--------ge 384 (384) . || T Consensus 398 ~~~~~~~kgGe 408 (409) T protein:vir:10 398 MAGEQYSKGGE 408 (409) T ss_pred hccccccccCC Confidence 3 44 No 8 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=100.00 E-value=1.9e-91 Score=517.95 Aligned_cols=382 Identities=12% Similarity=0.125 Sum_probs=334.5 Q ss_pred CcchhhhcccCCCcch------hHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCccee-ccc Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGK------VMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKT-NPE 73 (384) Q Consensus 1 M~~f~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~-~~~ 73 (384) +.+|+.+++.+....+ ......+.+..+.+++.++++++++|++||++||+.||++|+++|++++++.++ ..+ T Consensus 16 ~~~~~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~al~~~~v~~cv~~Ia~~iA~lp~~v~~~~~~~~~~~~~~ 95 (424) T protein:vir:45 16 RVLLDALFRSKSLENPSTPITGDAVDTDGLFRADVYVSPETAMKLAAVYSCIYVLSSSLAQMPLHVMRRHKGKVEPARDH 95 (424) T ss_pred hHHHHhhccccCCCCCccccchhhhhhhccccCCceechHHhhccHHHHHHHHHHHHHHhhCceEEEEecCCceeecccc Confidence 5555554333332211 122223334445566778899999999999999999999999999887666544 456 Q ss_pred hHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEcCceEEEEe Q lcl|NC_019422. 74 IYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLRNGKIVSYP 153 (384) Q Consensus 74 ~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~ 153 (384) +++++|+.+|||+||+++||+.++.+++++||+|+++.|+..|++++|+|++|..|++..+.+...|.+...+| ...++ T Consensus 96 ~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~i~~~~~~~~y~~~~~~~-~~~~~ 174 (424) T protein:vir:45 96 PAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRRGEVISLDCCMPWETTLMNTGGRYTYGLYNEYG-AFAIS 174 (424) T ss_pred hHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEEcCCeEEEEEEecCc-eEEEC Confidence 77788889999999999999999999999999999999999999999999999999998888777777766555 46789 Q ss_pred hhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccc Q lcl|NC_019422. 154 YSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDS 233 (384) Q Consensus 154 ~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~ 233 (384) ++||||+|++ +.++++|+||+..+.+++....++++++.++|+||++|+++|++++.+++|+.+++++.|++.++|... T Consensus 175 ~~eVih~r~~-~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~~ 253 (424) T protein:vir:45 175 PDDMIHIRAL-GNNQKMGLSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKSGLNKESWGWLKDQWQKASQALRR 253 (424) T ss_pred cccEEEecCc-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhccccc Confidence 9999999976 468899999999999999999999999999999999999999999999999999999999999988777 Q ss_pred cCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc------ccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 234 EAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS------KYSEDEWNAYYESEIEPVGLQLSN 306 (384) Q Consensus 234 ~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~------~~~e~~~~~~~~~~i~P~~~~i~~ 306 (384) ++|+++|+++|++|++++.++.|+|+.+. ++++++||++|||||.+||. ++.|++..+|++.||.|+++.||+ T Consensus 254 n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL~P~~~~ie~ 333 (424) T protein:vir:45 254 QENKTMLLPADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNISAQAIQFVRYTMMPWVTNWEQ 333 (424) T ss_pred cCCceeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH Confidence 88999999999999999999999999876 57889999999999999973 456899999999999999999999 Q ss_pred HHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeecCCCC Q lcl|NC_019422. 307 QYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 307 ~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~~ge 384 (384) +||++|+++.++..+.+++||++++++.|.+++++.++ ++++|++|+||+|+++|+||+||||++++|+|+++..+.. T Consensus 334 ~ln~kLl~~~e~~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g~~T~NE~R~~~gl~pi~ggD~~~~~~n~~~~~~~~ 412 (424) T protein:vir:45 334 ELNRRLFTRAELAAGYYVRFNLTGLLRGTPQERAQFYHFAITDGWMSRNEARAFEDMNPVEGLDEMLVSVNAANPAGDF 412 (424) T ss_pred HHHHhcCChhhhcCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccccccccccc Confidence 99999999998888899999999999999999999875 8999999999999999999999999999999997643222 No 9 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=100.00 E-value=3.8e-90 Score=510.82 Aligned_cols=380 Identities=13% Similarity=0.185 Sum_probs=332.2 Q ss_pred CcchhhhcccCCCc-----------------------------chhHHHhh-ccccCcceechhhhhhcHHHHHHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAP-----------------------------GKVMMELI-SDSGNGFYSWHGNLYKSDIVRSIIRPKA 50 (384) Q Consensus 1 M~~f~~~~~~~~~~-----------------------------~~~~~~~~-~~~~~~~~~~~~~~~~~~~v~~~i~~ia 50 (384) ||+|+++++.+... .+.+..++ +++..+-..+...++++++|++||+.|| T Consensus 1 Mgl~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~ci~~Ia 80 (431) T protein:vir:10 1 MGLFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRGELNGGTGRETRALRNMAVLRCVTLIS 80 (431) T ss_pred CcchhhhhcCcccccccccccccccccccccccccccccccccchHHHHhhccCccCcceechhhhhccHHHHHHHHHHH Confidence 99999765432210 01111222 1222344456678899999999999999 Q ss_pred HhhccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEE Q lcl|NC_019422. 51 KAVGKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVE 130 (384) Q Consensus 51 ~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~ 130 (384) +.+|++|+++|++++++.+...+++.++|+.+||++||+++||+.++.+++++||+|++++|+. |.+++|+|++|.+|+ T Consensus 81 ~~iA~lp~~v~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~-g~~~~L~pl~~~~v~ 159 (431) T protein:vir:10 81 GTIGMLPMNLISSDDSKQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWSG-NRPIRLIPMDRGSAK 159 (431) T ss_pred HhhccCceEEEEecCceeeeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC-CceEEEEEEcCceeE Confidence 9999999999998777767778888999999999999999999999999999999999999985 889999999999999 Q ss_pred EEEcCCCE-EEEEEEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC Q lcl|NC_019422. 131 AIYENEVL-FLKFLLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK 209 (384) Q Consensus 131 ~~~~~~~~-~~~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~ 209 (384) +..+.++. .|.|...+|..+.++++||||+|++ +.++++|+||+..+..++....+++++..++|+||++|+++|+++ T Consensus 160 ~~~~~~~~~~y~~~~~~g~~~~~~~~dViHir~~-~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~ 238 (431) T protein:vir:10 160 GRLTSTWQIVYDYTTPTGDKIELPAREVFHLRDL-SIDGVSGVSRVKLSGNALELAEQAERAASRTFRTGVMAGGAIEVP 238 (431) T ss_pred EEEcCCCeEEEEEEeCCceEEEEchhhEEEecCc-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecC Confidence 98876654 5667777889999999999999966 578899999999999999999999999999999999999999999 Q ss_pred CCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhc------c Q lcl|NC_019422. 210 TALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQ------S 282 (384) Q Consensus 210 ~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~------~ 282 (384) +.+++++.+++++.|.+.++| ..|+|+++|+++|++|++++.++.|+|+.+. ++++++||++|||||.+|| + T Consensus 239 ~~ls~e~~~~~~~~~~~~~~g-~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~t~ 317 (431) T protein:vir:10 239 KELSDNAYGRMKASVQENHTG-SENAGSWMLLEEGATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDTSWG 317 (431) T ss_pred CCCCHHHHHHHHHHHHHHhcC-ccccCCceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCCcc Confidence 999999999999999999976 4678999999999999999999999999877 5688999999999999998 3 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHh----CCCCCHHHHH Q lcl|NC_019422. 283 KYSEDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVD----RGSLTPNEWR 357 (384) Q Consensus 283 ~~~e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~----~g~~t~NE~R 357 (384) ++.|++..+|++.||.|++++||++||++|+++.++ .+.+++||++++++.|.+++++.++ ++. +||||+||+| T Consensus 318 sn~eq~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~-~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~~g~lT~NE~R 396 (431) T protein:vir:10 318 SGIEQLAIFFIQYGLSHWFVSWEQAAARAFLPEKML-GQRQFKFNEGALLRGTLNDQAAFFSKALGAGGQSPWMKQNEVR 396 (431) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhc-CCceEEEechhhhccCHHHHHHHHHHHHhcccccCccCHHHHH Confidence 567899999999999999999999999999987665 4678999999999999999998874 564 4579999999 Q ss_pred HHhCCCCCCC--CCeeeecCceeecCCCC Q lcl|NC_019422. 358 KIMNLSPIEN--GDKPVRRLDTAVVEGGE 384 (384) Q Consensus 358 ~~lG~~p~~~--gd~~~~~~n~~~~~~ge 384 (384) +++|+||+|+ ||++++|.|+.+...++ T Consensus 397 ~~~gl~p~~~~~gD~~~~p~n~~~~~~~~ 425 (431) T protein:vir:10 397 EMLDLPRADDPVADQLRNPMTQKQKGSGD 425 (431) T ss_pred HHhCCCCCCCccccceecccccccCCCCC Confidence 9999999955 99999999999887777 No 10 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=100.00 E-value=9.3e-90 Score=508.68 Aligned_cols=380 Identities=16% Similarity=0.210 Sum_probs=331.2 Q ss_pred CcchhhhcccCCCc-ch---hHHHhhcc---ccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCccee-cc Q lcl|NC_019422. 1 MNIFKSKKKNKEAP-GK---VMMELISD---SGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKT-NP 72 (384) Q Consensus 1 M~~f~~~~~~~~~~-~~---~~~~~~~~---~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~-~~ 72 (384) ||||++.++.++.. .. .+...+.. +..+...+.++++++++|++||+.||+.||++||++|+.++++.+. .. T Consensus 1 Mg~f~~lf~r~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~~~~~~~ 80 (414) T protein:vir:44 1 MVFFSGLFQRKSDAPVTTPAELADAIGLSYDTYTGKQISSQRAMRLTAVFSCVRVLAESVGMLPCNLYHLNGSLKQRATG 80 (414) T ss_pred CchhhhhhccCccCcccchhhHhHhhccCccccCCceechhhhhccHHHHHHHHHHHHHhccCceEEEEecCCceeeccc Confidence 99999755443322 11 12233322 2233445667899999999999999999999999999988776544 45 Q ss_pred chHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC-EEEEEEEcCceEEE Q lcl|NC_019422. 73 EIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV-LFLKFLLRNGKIVS 151 (384) Q Consensus 73 ~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~-~~~~~~~~~g~~~~ 151 (384) ++++++|+.+||++||+++||+.++.+++++||+|++++++ .|++.+|+||+|..|++..+.++ ..|.+...+|.... T Consensus 81 ~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~~~~y~~~~~~g~~~~ 159 (414) T protein:vir:44 81 ERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA-FGEVAELLPVDPGCVVPKLNSSWEPVYQVTFPDGSTDV 159 (414) T ss_pred chHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCceEEEEECCCCcEEEEEEecCceEEE Confidence 66778888999999999999999999999999999999886 69999999999999999887665 45667777888899 Q ss_pred EehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccc Q lcl|NC_019422. 152 YPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQI 231 (384) Q Consensus 152 ~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~ 231 (384) ++++||||++++ +.++++|+||+..+..++....++++++.++|+||++|++++++++.+++|+.++++++|.+.++|. T Consensus 160 ~~~~evih~~~~-~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~ 238 (414) T protein:vir:44 160 LSQEDIWHVRTL-TLDGLVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQTLSDQAYERLKKDFEERHTGL 238 (414) T ss_pred EccccEEEecCC-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHhcCc Confidence 999999999966 6788999999999999999999999999999999999999999999999999999999999999764 Q ss_pred cccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc------ccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 232 DSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS------KYSEDEWNAYYESEIEPVGLQL 304 (384) Q Consensus 232 ~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~------~~~e~~~~~~~~~~i~P~~~~i 304 (384) .++++++++++|++|++++.++.++++.+. ++++++||++|||||.+||. ++.|++..+|+++||.|++++| T Consensus 239 -~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~~~~~~~~~l~P~~~~i 317 (414) T protein:vir:44 239 -GNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLVPYLTRI 317 (414) T ss_pred -cccCcceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHH Confidence 578899999999999999999999999876 56889999999999999973 4668899999999999999999 Q ss_pred HHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeecC-- Q lcl|NC_019422. 305 SNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVE-- 381 (384) Q Consensus 305 ~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~-- 381 (384) +++||++|+++.++. +.+++||++++++.|.+++++.++ ++++|++|+||+|+++|+||+||||++++|+|+++.. T Consensus 318 e~~ln~~L~~~~~~~-~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~ 396 (414) T protein:vir:44 318 EQRINTGLVRKSKQG-VFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGDVYLTPMNMTTKPSD 396 (414) T ss_pred HHHHHhhcCCccccC-ceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceecccccccccCCc Confidence 999999999988764 778999999999999999999875 8999999999999999999999999999999987542 Q ss_pred CCC Q lcl|NC_019422. 382 GGE 384 (384) Q Consensus 382 ~ge 384 (384) +.+ T Consensus 397 ~~~ 399 (414) T protein:vir:44 397 GSK 399 (414) T ss_pred ccc Confidence 111 No 11 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=100.00 E-value=2.2e-89 Score=506.64 Aligned_cols=379 Identities=15% Similarity=0.187 Sum_probs=330.5 Q ss_pred CcchhhhcccCCC-c-chhHHHhhcc-----ccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcce--ec Q lcl|NC_019422. 1 MNIFKSKKKNKEA-P-GKVMMELISD-----SGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFK--TN 71 (384) Q Consensus 1 M~~f~~~~~~~~~-~-~~~~~~~~~~-----~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~--~~ 71 (384) |.++...++.+.+ . ...+..++.. +..+-.++.++++++++||+||++||+.||++||++|++++++.. .. T Consensus 1 m~~~~~~~~~~~~~s~~~~w~~~~~~~~~~~~~~g~~vt~~~al~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~g~~~~~~ 80 (421) T protein:vir:10 1 MFIPQMFEGKKRSVSGGGFWEAMLGGVRSSHSKAGVMITPETALALSAVRACVTLLAESVAQLPVELYRRDKNGGRQRAT 80 (421) T ss_pred CCCcchhcccccccCcchhhHHHhhhhccCcccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEEcCCCceeecc Confidence 8887755433332 2 2223333322 223445677889999999999999999999999999998876653 44 Q ss_pred cchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEcCceEEE Q lcl|NC_019422. 72 PEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLRNGKIVS 151 (384) Q Consensus 72 ~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~ 151 (384) +++++++|+.+||++||+++||+.++.+++++||||++++++..|++.+||||+|.+|++..+.++..+++....|+ . T Consensus 81 ~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~g~~~y~~~~~g~--~ 158 (421) T protein:vir:10 81 DHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKGYPKELIPINPKKVIVLKGPDGMPYYEIPEIGE--T 158 (421) T ss_pred cchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEECCCceEEEEEcCCCc--E Confidence 67788999999999999999999999999999999999999999999999999999999999988877766655665 5 Q ss_pred EehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCC----ChHHHHHHHHHHHHH Q lcl|NC_019422. 152 YPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTAL----RPDDIKKEVKSFEKN 227 (384) Q Consensus 152 ~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~----~~e~~~~~~~~~~~~ 227 (384) ++++||||++++ +.++++|+||+..+.+++....++++++.++|+||++|+++|++++.+ ++++.+++++.|++. T Consensus 159 ~~~~eiih~~~~-~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~e~~~~~~~~~~~~ 237 (421) T protein:vir:10 159 LPMRMMHHVKVF-SLDGYIGSSPIQTNADVLGLNLAVEEHASAVFRRGATMSGVIERPKEAPAIKSQEKIDQLLAKWTDR 237 (421) T ss_pred EchhhEEEecCc-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCccCccCCHHHHHHHHHHHHHH Confidence 789999999965 578999999999999999999999999999999999999999998755 889999999999999 Q ss_pred hccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhc------cccHHHHHHHHHHHHHHHH Q lcl|NC_019422. 228 YLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQ------SKYSEDEWNAYYESEIEPV 300 (384) Q Consensus 228 ~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~------~~~~e~~~~~~~~~~i~P~ 300 (384) ++| .+++++++|+++|++|++++.++.++|+.+. ++++++||++|||||.+|| +++.|++..+|+++||.|+ T Consensus 238 ~~g-~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~tl~P~ 316 (421) T protein:vir:10 238 YSG-INNMFSVALLQEGMSYKQMSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKATNNNIEHQGLQFVMYTLLAW 316 (421) T ss_pred hcC-ccccCcceecCCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCccccHHHHHHHHHHHHHHHH Confidence 976 5678899999999999999999999999876 5789999999999999997 2466899999999999999 Q ss_pred HHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceee Q lcl|NC_019422. 301 GLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAV 379 (384) Q Consensus 301 ~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~ 379 (384) +.+||++||++|+++.++ .+.+++||++.+++.|.+++++.++ ++++|++|+||+|+++|+||+||||++++|+|+++ T Consensus 317 ~~~ie~~ln~kL~~~~~~-~~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~ 395 (421) T protein:vir:10 317 LKRHEGALQRDLLLPSER-RDLYIEFNVSGLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLPPIAGGDKYLTPLNMVD 395 (421) T ss_pred HHHHHHHHhhhccCcccc-CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeecccccc Confidence 999999999999998775 4778999999999999999999885 79999999999999999999999999999999876 Q ss_pred cCC---CC Q lcl|NC_019422. 380 VEG---GE 384 (384) Q Consensus 380 ~~~---ge 384 (384) ++. |+ T Consensus 396 ~~~~~~~~ 403 (421) T protein:vir:10 396 SAQIIPGD 403 (421) T ss_pred ccccccCC Confidence 532 22 No 12 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=100.00 E-value=1.4e-89 Score=507.76 Aligned_cols=379 Identities=13% Similarity=0.156 Sum_probs=327.5 Q ss_pred CcchhhhcccCCCcc---h---hHHHhh--ccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcce--- Q lcl|NC_019422. 1 MNIFKSKKKNKEAPG---K---VMMELI--SDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFK--- 69 (384) Q Consensus 1 M~~f~~~~~~~~~~~---~---~~~~~~--~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~--- 69 (384) =|||++.++...... + ....-+ ..+..+..++.++++++++|++||++||+.+|++|+++|+.++++.+ T Consensus 14 ~g~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~~iA~lp~~vy~~~~~~~~~~~ 93 (424) T protein:vir:18 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) T ss_pred CchHHHHHhhccccccccccchhhccccccccccccccccHHHhhccHHHHHHHHHHHHhhccCceEEEEeccCCceeee Confidence 455655444322211 1 111111 22333445677889999999999999999999999999998766533 Q ss_pred eccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEcCceE Q lcl|NC_019422. 70 TNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLRNGKI 149 (384) Q Consensus 70 ~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~g~~ 149 (384) ...++++++|+.+||++||+++||+.++.+++++||+|++++|+..|++++|||++|.+|++..+.+...|.|. .+|+. T Consensus 94 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~y~~~-~~g~~ 172 (424) T protein:vir:18 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQ-RDSEY 172 (424) T ss_pred ccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcceEEEEcCCeEEEEEE-eCCeE Confidence 24677888899999999999999999999999999999999999999999999999999999988777766665 45778 Q ss_pred EEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCC-CChHHHHHHHHHHHHHh Q lcl|NC_019422. 150 VSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTA-LRPDDIKKEVKSFEKNY 228 (384) Q Consensus 150 ~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~-~~~e~~~~~~~~~~~~~ 228 (384) +.++++||||+|+++ .++++|+||+..+..++....++++++.++|+||++|+++|+++.. +++++.+++++.|++.+ T Consensus 173 ~~~~~~eVihir~~~-~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~~~~~~ 251 (424) T protein:vir:18 173 ADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIA 251 (424) T ss_pred EEeccccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCcCCCHHHHHHHHHHHHHHh Confidence 899999999999764 6889999999999999999999999999999999999999999875 78999999999998776 Q ss_pred ccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc--------ccHHHHHHHHHHHHHHH Q lcl|NC_019422. 229 LQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS--------KYSEDEWNAYYESEIEP 299 (384) Q Consensus 229 ~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~--------~~~e~~~~~~~~~~i~P 299 (384) .+ .++++++|+++|++|++++.++.++++.+. ++++++||++|||||.+||. ++.|++..+|+++||.| T Consensus 252 ~~--~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~~~tl~P 329 (424) T protein:vir:18 252 GG--PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQP 329 (424) T ss_pred CC--cccCCceeccCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHHHHHHHH Confidence 44 467899999999999999999999999877 57889999999999999972 45689999999999999 Q ss_pred HHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCcee Q lcl|NC_019422. 300 VGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTA 378 (384) Q Consensus 300 ~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~ 378 (384) ++++||++||++|+++.++. +.+++||++++++.|.+++++.++ ++++|++|+||+|+++|+||+||||++++|+|++ T Consensus 330 ~~~~ie~~ln~~L~~~~~~~-~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~~~~~~n~~ 408 (424) T protein:vir:18 330 YISRWENSIQRWLIPSKDVG-RLHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNMPPLPGGDVAMRQAQYV 408 (424) T ss_pred HHHHHHHHHHhhcCCccccC-CeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccCcc Confidence 99999999999999998774 689999999999999999999875 8999999999999999999999999999999999 Q ss_pred ecCC-CC Q lcl|NC_019422. 379 VVEG-GE 384 (384) Q Consensus 379 ~~~~-ge 384 (384) |++. |+ T Consensus 409 ~l~~~~~ 415 (424) T protein:vir:18 409 PITDLGT 415 (424) T ss_pred chhhhhc Confidence 9864 32 No 13 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=100.00 E-value=4.7e-89 Score=504.84 Aligned_cols=380 Identities=16% Similarity=0.204 Sum_probs=332.6 Q ss_pred CcchhhhcccCCCcchhHHHhhccc---cCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcce-eccchHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDS---GNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFK-TNPEIYI 76 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~-~~~~~~~ 76 (384) ++||++++.........+...+... ..+-..+.+.++++++|++||++||+.+|++|+++++.++++.+ ..++++. T Consensus 4 ~~~f~r~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~~l~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~~~~~ 83 (413) T protein:vir:48 4 SGLFQRKSDAPVTTPAELAEAIGLSYDTYTGKRISSQRAMRLTAVYSCVRVLAESVGMLPCSLYKISGTLKTRVVDERLH 83 (413) T ss_pred chhhccCccCCccchHHHHHhhhcCcccccCceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCcceeecccHHH Confidence 5667765544443333444444332 22333456789999999999999999999999999998777654 4467788 Q ss_pred HHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCE-EEEEEEcCceEEEEehh Q lcl|NC_019422. 77 KFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVL-FLKFLLRNGKIVSYPYS 155 (384) Q Consensus 77 ~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~-~~~~~~~~g~~~~~~~~ 155 (384) ++|+.+||++||+++||+.++.+++++||+|++++++ .|++.+|||++|++|++..+.++. .|.+...+|....++++ T Consensus 84 ~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~~~~y~~~~~~g~~~~~~~~ 162 (413) T protein:vir:48 84 KLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA-LGEVVELLPIDPGCVEPKLNSQWQPVYQVTFPDGSVDVLTQD 162 (413) T ss_pred HHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC-CCcEEEEEEEcCceEEEEEcCCceEEEEEEecCceEEEEccc Confidence 8888999999999999999999999999999999986 689999999999999999887754 46667778888899999 Q ss_pred heEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccC Q lcl|NC_019422. 156 DIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEA 235 (384) Q Consensus 156 evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~ 235 (384) ||||++++ +.++++|+||+..+..++....++.+++.++|+||++|+++|++++.+++++.+++++.|++.++| ..++ T Consensus 163 evih~~~~-~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~~~~e~~~~~~~~~~~~~~g-~~n~ 240 (413) T protein:vir:48 163 EIWHVRTL-TLDGLVGLNPIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTEQKLTPDAYERLKKDFEERHTG-LGNA 240 (413) T ss_pred cEEEecCc-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhcC-cccc Confidence 99999876 467899999999999999999999999999999999999999999999999999999999999976 4678 Q ss_pred CcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc------ccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 236 GGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS------KYSEDEWNAYYESEIEPVGLQLSNQY 308 (384) Q Consensus 236 ~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~------~~~e~~~~~~~~~~i~P~~~~i~~~l 308 (384) ++++++++|++|++++.++.++++.+. ++++++||++|||||.+||. ++.|++..+|++.||.|++++|+++| T Consensus 241 g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~i~P~~~~ie~~l 320 (413) T protein:vir:48 241 HRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLVPYLTRIEQRI 320 (413) T ss_pred CcceecCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 899999999999999999999999876 57889999999999999973 35588999999999999999999999 Q ss_pred hhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeecCCCC Q lcl|NC_019422. 309 TEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 309 ~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~~ge 384 (384) |++|+++.++ .+.+++||++++++.|.+++++.++ ++++|++|+||+|+++|+||+||||++++|+|+++++.+. T Consensus 321 ~~~L~~~~~~-~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~ 396 (413) T protein:vir:48 321 NTGLVRESKQ-GKFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGDVYLTPMNMTTSPSAG 396 (413) T ss_pred HhhccCcccc-CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeecccccccccccc Confidence 9999998775 4789999999999999999999885 8999999999999999999999999999999998864322 No 14 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=100.00 E-value=2.1e-89 Score=506.80 Aligned_cols=379 Identities=13% Similarity=0.161 Sum_probs=327.4 Q ss_pred CcchhhhcccCC-----Ccch-hHHHhh--ccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcce--- Q lcl|NC_019422. 1 MNIFKSKKKNKE-----APGK-VMMELI--SDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFK--- 69 (384) Q Consensus 1 M~~f~~~~~~~~-----~~~~-~~~~~~--~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~--- 69 (384) =|||++.++... .+.. .....+ ..+..+...+.++++++++|++||++||+.||++||++|+.++++.+ T Consensus 14 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~ 93 (424) T protein:vir:18 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) T ss_pred CchHHHHHhhhcccccccccccccccccccccccccccccHHHhhccHHHHHHHHHHHHhhccCceEEEEeecCCceeee Confidence 456665443221 1111 111111 12333445677889999999999999999999999999998766643 Q ss_pred eccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEcCceE Q lcl|NC_019422. 70 TNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLRNGKI 149 (384) Q Consensus 70 ~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~g~~ 149 (384) ...++++++|+.+||++||+++||+.++.+++++||+|++++|+..|++++|||++|.+|++..+.+...|.|.. +|+. T Consensus 94 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~pl~~~~V~v~~~~~~~~y~~~~-~g~~ 172 (424) T protein:vir:18 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQR-DSEY 172 (424) T ss_pred ccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcceEEEEcCCeEEEEEEe-CCeE Confidence 246778888999999999999999999999999999999999999999999999999999999887777666654 5778 Q ss_pred EEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCC-CChHHHHHHHHHHHHHh Q lcl|NC_019422. 150 VSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTA-LRPDDIKKEVKSFEKNY 228 (384) Q Consensus 150 ~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~-~~~e~~~~~~~~~~~~~ 228 (384) +.++++||||+|+++ .++++|+||+..+.+++....++++++.++|+||++|+++|+++.. +++++.+++++.|++.+ T Consensus 173 ~~~~~~eIih~r~~~-~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~ 251 (424) T protein:vir:18 173 ADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIA 251 (424) T ss_pred EEeccccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCcCCCHHHHHHHHHHHHHHh Confidence 899999999999764 6889999999999999999999999999999999999999999865 78999999999999877 Q ss_pred ccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc--------ccHHHHHHHHHHHHHHH Q lcl|NC_019422. 229 LQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS--------KYSEDEWNAYYESEIEP 299 (384) Q Consensus 229 ~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~--------~~~e~~~~~~~~~~i~P 299 (384) .+ .++++++++++|++|++++.++.|+|+.+. ++++++||++|||||.+||. ++.|++..+|+++||.| T Consensus 252 ~g--~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~~~tl~P 329 (424) T protein:vir:18 252 GG--PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQP 329 (424) T ss_pred CC--cccCCceeccCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHHHHHHHH Confidence 54 467899999999999999999999999876 57899999999999999972 46689999999999999 Q ss_pred HHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCcee Q lcl|NC_019422. 300 VGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTA 378 (384) Q Consensus 300 ~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~ 378 (384) +++.||++||++|+++.++. +.+++||++++++.|.+++++.++ ++++|++|+||+|+++|+||+||||++++|+|++ T Consensus 330 ~~~~ie~~l~~~L~~~~~~~-~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGD~~~~~~n~~ 408 (424) T protein:vir:18 330 YISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYV 408 (424) T ss_pred HHHHHHHHHHhhcCCccccC-CeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccCcc Confidence 99999999999999998764 688999999999999999999885 8999999999999999999999999999999999 Q ss_pred ecCC-CC Q lcl|NC_019422. 379 VVEG-GE 384 (384) Q Consensus 379 ~~~~-ge 384 (384) |++. |+ T Consensus 409 ~l~~~~~ 415 (424) T protein:vir:18 409 PITDLGT 415 (424) T ss_pred chHhhhc Confidence 9753 22 No 15 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=100.00 E-value=3.1e-89 Score=505.80 Aligned_cols=379 Identities=14% Similarity=0.145 Sum_probs=327.7 Q ss_pred CcchhhhcccCCCcchhHHH-hhcc-ccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcce--eccchHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMME-LISD-SGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFK--TNPEIYI 76 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~-~~~~-~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~--~~~~~~~ 76 (384) ..+|++.++.........+. +.+. +..+-.++.++++++++|++||++||+.||++||++|+++++|.+ ...++++ T Consensus 18 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~v~~~~al~~~~V~~~i~~ia~~ia~lp~~~~~~~~~g~~~~~~~~~l~ 97 (434) T protein:vir:43 18 SSLFGWGGKTIRLTDGAFWSQFLGRESSSGKKVTVDKAMKLSAVWACVRLISTSVAGLPLGVYERKADGSRVDARSFPLY 97 (434) T ss_pred hhhhcccccccccCchHHHHHHhcCCccCCceechhhhhccHHHHHHHHHHHHhhhhCceEEEEEcCCCccccccccHHH Confidence 22344333333333333322 3222 234455677889999999999999999999999999998876643 3567788 Q ss_pred HHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEE-EEEEEcCceEEEEehh Q lcl|NC_019422. 77 KFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLF-LKFLLRNGKIVSYPYS 155 (384) Q Consensus 77 ~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~-~~~~~~~g~~~~~~~~ 155 (384) ++|+.+||++||+++||+.++.+++++||+|+++.++ .|++++|+||+|.+|++..+.++.. |++...+|..+.++++ T Consensus 98 ~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~-~G~~~~L~~l~p~~v~~~~~~~g~~~y~~~~~~g~~~~~~~~ 176 (434) T protein:vir:43 98 DVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRA-AGRPAALDFLLPSRVDLECDENGRLKYFYTTKKGARREIERT 176 (434) T ss_pred HHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEcCCCeEEEEEEecCceEEEEccc Confidence 8888999999999999999999999999999998876 6999999999999999999887654 4555667888999999 Q ss_pred heEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccC Q lcl|NC_019422. 156 DIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEA 235 (384) Q Consensus 156 evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~ 235 (384) ||||++++ +.++++|+||+..+..++....++++++.++|+||++|++++++++.+++++.+++++.|++ +.+ ..++ T Consensus 177 eVih~~~~-~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~r~~~~~-~~g-~~na 253 (434) T protein:vir:43 177 NMLHIPAF-TLDGRIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDRILQPAQREEFREYVKS-VSG-AMNS 253 (434) T ss_pred cEEEecCc-CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCCCCCHHHHHHHHHHHHH-hcC-cccc Confidence 99999965 78899999999999999999999999999999999999999999999999999999998865 444 3578 Q ss_pred CcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc--------ccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 236 GGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS--------KYSEDEWNAYYESEIEPVGLQLSN 306 (384) Q Consensus 236 ~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~--------~~~e~~~~~~~~~~i~P~~~~i~~ 306 (384) |+++|+++|++|++++.++.|+++.+. ++++++||++|||||.+||. ++.|++..+|+++||.|++.+||+ T Consensus 254 g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~~~~f~~~~L~P~~~~ie~ 333 (434) T protein:vir:43 254 GRSPVLEQGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQMLAFLTFSISSITNQIQQ 333 (434) T ss_pred CCccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHHHHHHHHHHHHHHHHHHHH Confidence 899999999999999999999999877 56889999999999999972 455888999999999999999999 Q ss_pred HHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeecCCCC Q lcl|NC_019422. 307 QYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 307 ~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~~ge 384 (384) +||++|+++.++ .+.+++||++.+++.|.+++++.++ ++.+|++|+||+|+++|+||+||||++++|+|++|++..+ T Consensus 334 ~ln~kL~~~~~~-~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~ 411 (434) T protein:vir:43 334 CVNKRLLTAPER-IRYYAEFSLEGFLKADSAGRAAWYSTMAQNGFMTRNEGRRKENLPELPGGDILTVQSNLVPIDQLG 411 (434) T ss_pred HHHhhcCChhhh-cCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeeccCccchhhhh Confidence 999999998775 3678999999999999999999875 8999999999999999999999999999999999987554 No 16 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=100.00 E-value=4.8e-89 Score=504.77 Aligned_cols=379 Identities=15% Similarity=0.136 Sum_probs=328.0 Q ss_pred CcchhhhcccCCCcchhHHHhhc-----cccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCccee--ccc Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELIS-----DSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKT--NPE 73 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~--~~~ 73 (384) |+||+..++........+....+ .+..+..++..+++++++|++||++||+.||++||++|+++++|.++ .++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~g~~~~~~~~ 80 (419) T protein:vir:57 1 MFIPQFWKGRPSENRVNWQVVPGGMRSSSSQAGVIITPETALALSAVRACVTLLAESVAQLPCVLYRRTENGGREIAFDH 80 (419) T ss_pred CcchhhhccCCccccccccccccccccccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEEcCCCceeccccc Confidence 99999765554432222222111 12233445667899999999999999999999999999988777543 456 Q ss_pred hHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEcCceEEEEe Q lcl|NC_019422. 74 IYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLRNGKIVSYP 153 (384) Q Consensus 74 ~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~ 153 (384) ++.++|+.+||++||+++||+.++.+++++||+|++++|+..|++++|||++|.+|++..+.++..++.....| ..++ T Consensus 81 ~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~pl~~~~v~v~~~~~g~~~y~~~~~~--~~~~ 158 (419) T protein:vir:57 81 PLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGDITELIPINPHKVIVLKGPDGMPYYDIPSIG--EILP 158 (419) T ss_pred hHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcceEEEECCCceEEEEEcCCc--eEEc Confidence 67788889999999999999999999999999999999999999999999999999999998887665544444 3588 Q ss_pred hhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCC----CCChHHHHHHHHHHHHHhc Q lcl|NC_019422. 154 YSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKT----ALRPDDIKKEVKSFEKNYL 229 (384) Q Consensus 154 ~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~----~~~~e~~~~~~~~~~~~~~ 229 (384) .+||||++++ +.++++|+||+..+..++....++++++.++|+||++|+++|++++ .+++++.+++++.|.+.++ T Consensus 159 ~~~vih~r~~-~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~ 237 (419) T protein:vir:57 159 MRMVHHIKSF-SLDGYIGTSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFEAKAIASQAAVDAILAKWTERYG 237 (419) T ss_pred hhhEEEecCc-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEecCcCCcccCHHHHHHHHHHHHHHhc Confidence 9999999965 5788999999999999999999999999999999999999999864 4578899999999999997 Q ss_pred cccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc------ccHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 230 QIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS------KYSEDEWNAYYESEIEPVGL 302 (384) Q Consensus 230 ~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~------~~~e~~~~~~~~~~i~P~~~ 302 (384) | ..++++++++++|++|++++.++.|+++.+. ++++++||++|||||.+||. ++.|++..+|+++||+|+++ T Consensus 238 g-~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~ 316 (419) T protein:vir:57 238 G-VRNAFSVGMLQEGMTYKQLSQDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKSTNNNIEHQGLQYVIYTMLAILK 316 (419) T ss_pred c-ccccccceecCCCceEEEcCCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCccccHHHHHHHHHHHHHHHHHH Confidence 6 4567899999999999999999999999887 56889999999999999973 45589999999999999999 Q ss_pred HHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeecC Q lcl|NC_019422. 303 QLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVE 381 (384) Q Consensus 303 ~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~ 381 (384) .|+++||++|+++.++ .+.+++||++++++.|.++++++++ ++++|++|+||+|+++|+||+||||++++|+|+++++ T Consensus 317 ~ie~~l~~~ll~~~~~-~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~~~ 395 (419) T protein:vir:57 317 RHESAMMRDLLLPSER-RDFYIEFNVSSLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLTPIPGGDKYLTPLNMVDSK 395 (419) T ss_pred HHHHHHHhhccCcccc-CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccccc Confidence 9999999999998766 4789999999999999999999885 7999999999999999999999999999999998765 Q ss_pred CCC Q lcl|NC_019422. 382 GGE 384 (384) Q Consensus 382 ~ge 384 (384) ..+ T Consensus 396 ~~~ 398 (419) T protein:vir:57 396 ALT 398 (419) T ss_pred ccc Confidence 322 No 17 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=100.00 E-value=3.5e-88 Score=500.05 Aligned_cols=381 Identities=12% Similarity=0.101 Sum_probs=327.2 Q ss_pred CcchhhhcccCCCcc-------------hhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCc Q lcl|NC_019422. 1 MNIFKSKKKNKEAPG-------------KVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETE 67 (384) Q Consensus 1 M~~f~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 67 (384) ||||++.++.+..+. +....+.+.+..+-.++...++++++|++||+.||+.||++|+++|++.+++ T Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~~~ 80 (457) T protein:vir:62 1 MGFWSALFGRGHSPALDAAEGRAWEPYDPSIYNLGATASSGERVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRGGT 80 (457) T ss_pred CchhhhhhccccccccccccccccccchhhhhhccccccCCceechHHhhccHHHHHHHHHHHHhHhhCceEEEEecCCc Confidence 999997554332221 1112222222334445667899999999999999999999999999988777 Q ss_pred ceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCE-----EEEE Q lcl|NC_019422. 68 FKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVL-----FLKF 142 (384) Q Consensus 68 ~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~-----~~~~ 142 (384) .+...++.+..|+.+||++||+++||+.++.+++++||+|+++.++ .|.+.+||+|+|.+|++..+.... ++.| T Consensus 81 ~~~~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~y 159 (457) T protein:vir:62 81 RKEIDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWA-GPNIAGLDVLDPTKIHVHMVMVDGLRRKVFEAY 159 (457) T ss_pred cccccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEeccCCccceeEEEE Confidence 7888889999999999999999999999999999999999999665 689999999999999987754321 2223 Q ss_pred EEc-Cc---eEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHH Q lcl|NC_019422. 143 LLR-NG---KIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIK 218 (384) Q Consensus 143 ~~~-~g---~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~ 218 (384) ... +| ....++++||||+|.+++.+.++|+||+..+++++....++++++.++|+||++|+++|++++.+++|+++ T Consensus 160 ~~~~~g~~~~~~~~~~~eiih~r~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~ 239 (457) T protein:vir:62 160 DIDADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVPGTMSEEGLA 239 (457) T ss_pred EEccCCceeEEEeeCccceEEecCCCCCCceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcCCCCCHHHHH Confidence 332 22 23468899999999998888899999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc--------ccHHHHH Q lcl|NC_019422. 219 KEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS--------KYSEDEW 289 (384) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~--------~~~e~~~ 289 (384) ++++.|++.++| ..++++++|+++|++|++++.++.|+++.+. ++++++||++|||||.+||. ++.|++. T Consensus 240 ~~~~~~~~~~~G-~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~ 318 (457) T protein:vir:62 240 RAREAWRAANSG-VDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQN 318 (457) T ss_pred HHHHHHHHHhcC-ccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHH Confidence 999999999976 4678999999999999999999999999876 56889999999999999972 4568889 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCC Q lcl|NC_019422. 290 NAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENG 368 (384) Q Consensus 290 ~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~g 368 (384) .+|+++||.|++++||++||++|+++.+. .+.+++||++.+++.|.++++++++ ++++|+||+||+|+++|+||+||| T Consensus 319 ~~f~~~~l~P~~~~ie~~ln~~L~~~~~~-~~~~i~fd~~~l~~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g 397 (457) T protein:vir:62 319 IAFTMFSLRPWLERIEAGFNRLLFAETAD-RFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDG 397 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhcCcccc-CceEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 99999999999999999999999998875 4678999999999999999999885 899999999999999999999986 Q ss_pred --CeeeecCceeecCCCC Q lcl|NC_019422. 369 --DKPVRRLDTAVVEGGE 384 (384) Q Consensus 369 --d~~~~~~n~~~~~~ge 384 (384) |++++|+|+++++... T Consensus 398 ~~D~~~~~~n~~~~~~~~ 415 (457) T protein:vir:62 398 LGEKYRVPLNLGEIGEEP 415 (457) T ss_pred Ccceeeeccccccccccc Confidence 9999999998764322 No 18 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=100.00 E-value=4.4e-88 Score=499.51 Aligned_cols=380 Identities=11% Similarity=0.120 Sum_probs=325.7 Q ss_pred Cc-----chhhhcc--------cCCCcchhHHHhhcc--ccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_019422. 1 MN-----IFKSKKK--------NKEAPGKVMMELISD--SGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNE 65 (384) Q Consensus 1 M~-----~f~~~~~--------~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 65 (384) |. +|++.+. ..+.+....+..+.+ +..+..++.++++++++|++||++||++||++||++|++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~g~~~s~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~ci~~Ia~~ia~lp~~~~~~~~ 80 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLKWLGVPISLTDGSFWSAWGGMGSSSGETVTADSALQLSAVWSCVRLIAETIATLPLNLYQTKP 80 (437) T ss_pred CCcchhhhhhhhHHhhhhhcCCcccCCchhHHHhhcccccCCCceechHhhhccHHHHHHHHHHHHHHhhCceeEEEEcC Confidence 44 2222111 112222333333322 22334456678999999999999999999999999999887 Q ss_pred Ccce--eccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCE-EEEE Q lcl|NC_019422. 66 TEFK--TNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVL-FLKF 142 (384) Q Consensus 66 ~~~~--~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~-~~~~ 142 (384) +|.+ ...++++++|+.+||++||+++||+.++.+++++||+|++++|+ .|++.+|||++|..|++..+.++. .|.| T Consensus 81 ~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~-~g~~~~L~~l~p~~v~i~~~~~g~~~y~~ 159 (437) T protein:vir:10 81 DGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRS-AGVLIGLELMLPQRTTVKRLTSGALQYTY 159 (437) T ss_pred CCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCcceEEEECCCCeEEEEE Confidence 7654 35677888899999999999999999999999999999999998 499999999999999999876654 5666 Q ss_pred EEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHH Q lcl|NC_019422. 143 LLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVK 222 (384) Q Consensus 143 ~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~ 222 (384) ...+|..+.++++||||+|++ +.++++|+||+..+..++....++++++.++|+||++|+++|++++.+++++.+++++ T Consensus 160 ~~~~g~~~~~~~~dIih~r~~-~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~ 238 (437) T protein:vir:10 160 RNVDGTVSTLAEDDVFHVRGF-SLDGLMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTDQILQKEKRAEIRT 238 (437) T ss_pred EecCceEEEEccccEEEecCc-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHH Confidence 677888899999999999976 4788999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc--------ccHHHHHHHHH Q lcl|NC_019422. 223 SFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS--------KYSEDEWNAYY 293 (384) Q Consensus 223 ~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~--------~~~e~~~~~~~ 293 (384) .|.+.++| ..++|+++|+++|++|++++.++.++++.+. ++++++||++|||||.+||. ++.|++..+|+ T Consensus 239 ~~~~~~~g-~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~ 317 (437) T protein:vir:10 239 DLAEQFGG-AMQAGKTMVLEAGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQQTLGFL 317 (437) T ss_pred HHHHHhcC-ccccCcceeccCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHH Confidence 99999977 4578999999999999999999999999876 56889999999999999962 45588999999 Q ss_pred HHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCe-e Q lcl|NC_019422. 294 ESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDK-P 371 (384) Q Consensus 294 ~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~-~ 371 (384) +.||.|++..|+++|+++|+++.++. +.+++||++++++.|.++++++++ ++.+|++|+||+|+++|+||+||||+ + T Consensus 318 ~~tl~P~~~~ie~~l~~kll~~~e~~-~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gg~~~~ 396 (437) T protein:vir:10 318 TFTLRPWLTRIEQAARRSLLRPGERD-QFYAEFSVEGLLRADSAGRAAFYSTMTQNGLMTRDECRAKENLPPMGGNAAVL 396 (437) T ss_pred HHHHHHHHHHHHHHHHhhccCccccC-ceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcceE Confidence 99999999999999999999987765 468999999999999999999886 89999999999999999999998776 4 Q ss_pred eecCceeecCCCC Q lcl|NC_019422. 372 VRRLDTAVVEGGE 384 (384) Q Consensus 372 ~~~~n~~~~~~ge 384 (384) ++++|++|++... T Consensus 397 ~~~~~~~~~~~~~ 409 (437) T protein:vir:10 397 TVQSALLPIDKLG 409 (437) T ss_pred eecCcccchhhcc Confidence 5899999876432 No 19 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=100.00 E-value=8.8e-88 Score=497.85 Aligned_cols=377 Identities=15% Similarity=0.240 Sum_probs=324.1 Q ss_pred cchhh----hcccCCCcc---hhHHHhhcc--ccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCccee-c Q lcl|NC_019422. 2 NIFKS----KKKNKEAPG---KVMMELISD--SGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKT-N 71 (384) Q Consensus 2 ~~f~~----~~~~~~~~~---~~~~~~~~~--~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~-~ 71 (384) =||++ +........ ......+++ +.++......+++++++|++||+.||+.||++||++|++++++.+. . T Consensus 1 m~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~Ia~~ia~l~~~~~~~~~~~~~~~~ 80 (416) T protein:vir:12 1 MLLERMFEKRSGSSDHEDGFNNILLNMFGGRKTASGERVSESNSLVQPDIFACVNVLSDDIAKLPIHTYKRTDGGIERKP 80 (416) T ss_pred CccchhcccccCccccCccchhHHHHhhcCcccccCceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCcccccc Confidence 24443 322221111 112233322 2223345667889999999999999999999999999988776544 4 Q ss_pred cchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC-EEEEEEEcCceEE Q lcl|NC_019422. 72 PEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV-LFLKFLLRNGKIV 150 (384) Q Consensus 72 ~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~-~~~~~~~~~g~~~ 150 (384) .++++++|+.+||++||+++||+.++.+++++||||+++.++..|.+.+||||+|.+|++..+.++ ..++....+|+.+ T Consensus 81 ~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~g~~~ 160 (416) T protein:vir:12 81 EHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYPEALFPLRPDYTNAYVHPTTGMLWYQTVLNGKAI 160 (416) T ss_pred ccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEeCCCcEEEEEEecCCeEE Confidence 577888899999999999999999999999999999999999999999999999999999886654 4444445678889 Q ss_pred EEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhcc Q lcl|NC_019422. 151 SYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQ 230 (384) Q Consensus 151 ~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~ 230 (384) .++++||||++++ +.++++|+||+.++..++....++++++.++|+||+.|+++|++++.+++++++++++.|++.. T Consensus 161 ~~~~~eiih~~~~-~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~-- 237 (416) T protein:vir:12 161 ELYDYEVLHFKGL-STDGIHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVPAFLDEKPKENVRKEWKRVN-- 237 (416) T ss_pred EecCccEEEecCc-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCCCCCHHHHHHHHHHHHHHh-- Confidence 9999999999965 5678999999999999999999999999999999999999999999999999999999887543 Q ss_pred ccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc------ccHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 231 IDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS------KYSEDEWNAYYESEIEPVGLQ 303 (384) Q Consensus 231 ~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~------~~~e~~~~~~~~~~i~P~~~~ 303 (384) ++++++++++|++|++++.++.++++.+. ++++++||++|||||.+||. ++.+++..+|++.||.|++++ T Consensus 238 ---~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ 314 (416) T protein:vir:12 238 ---KVENIAIIDYGLEYQSISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKATFSNIEHQSIEYVRNTLQPWIVN 314 (416) T ss_pred ---cCCCeeecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHHHHHHHHHHHHHHHHHH Confidence 35789999999999999999999999876 57889999999999999962 456889999999999999999 Q ss_pred HHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeecCC Q lcl|NC_019422. 304 LSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVEG 382 (384) Q Consensus 304 i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~~ 382 (384) |+++||++|+++.++..+.+|+||++++++.|.++++++++ ++++|++|+||+|+++|+||+||||++++|+|+++++. T Consensus 315 ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~n~~~~~~ 394 (416) T protein:vir:12 315 FEQELNVKLFLDHDQKSGHYVKFNIDSELRGDSKTQAEYLKTLHETGVLNKDEIRELLERNPIENGDKYISSLNYVFLDF 394 (416) T ss_pred HHHHHHHhhcCchhhcCCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccccccccc Confidence 99999999999998888999999999999999999999875 89999999999999999999999999999999987653 Q ss_pred ---------------CC Q lcl|NC_019422. 383 ---------------GE 384 (384) Q Consensus 383 ---------------ge 384 (384) || T Consensus 395 ~~~~~~~~~~~~~~gge 411 (416) T protein:vir:12 395 LEEYQRLKAGGAMKGGD 411 (416) T ss_pred cchhhccccccccCCCC Confidence 33 No 20 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=100.00 E-value=4.8e-88 Score=499.30 Aligned_cols=376 Identities=15% Similarity=0.147 Sum_probs=323.7 Q ss_pred CcchhhhcccCCCcchh-------------H--HHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKV-------------M--MELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNE 65 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~-------------~--~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 65 (384) ||+|++.+.....+++. . ......+..+..++.++++++++|++||++||++||++||++|++++ T Consensus 7 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~a~~~~aV~~~v~~Ia~~ia~lp~~~y~~~~ 86 (432) T protein:vir:97 7 LGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAVAAMPLMMYMRTP 86 (432) T ss_pred CchhhhhHhhcCCccccccccccccccCchhhhhhcccccccCcccchHhhhcchHHHHHHHHHHHhhccCceEEEEecC Confidence 99999876655443321 0 01112233345566778999999999999999999999999999887 Q ss_pred Ccce-eccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC-EEEEEE Q lcl|NC_019422. 66 TEFK-TNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV-LFLKFL 143 (384) Q Consensus 66 ~~~~-~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~-~~~~~~ 143 (384) +|.+ +..++++++|+.+||++||+++||+.++.+++++||||++++++ .|++.+||+|+|..|++..+.++ ..|.+. T Consensus 87 ~g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~p~~v~v~~~~~g~~~y~~~ 165 (432) T protein:vir:97 87 DGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDTKGNTAYRYR 165 (432) T ss_pred CCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCcceEEEEcCCCcEEEEEE Confidence 7654 45677888899999999999999999999999999999999997 48999999999999999987665 456677 Q ss_pred EcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHH Q lcl|NC_019422. 144 LRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKS 223 (384) Q Consensus 144 ~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~ 223 (384) ..+|+.+.++++||||+|++ +.++++|+||+..+.+++....+++++..++|+||++|++++++++.++++++++++++ T Consensus 166 ~~~g~~~~~~~~~iih~r~~-~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~ 244 (432) T protein:vir:97 166 RTDGQMIDIPRQQIWKIMGY-SLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDSFSKK 244 (432) T ss_pred ecCceEEEEccccEEEecCc-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecCCCCCHHHHHHHHHH Confidence 77899999999999999865 67889999999999999999999999999999999999999999999999887776555 Q ss_pred HHHHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc---------ccHHHHHHHHH Q lcl|NC_019422. 224 FEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS---------KYSEDEWNAYY 293 (384) Q Consensus 224 ~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~---------~~~e~~~~~~~ 293 (384) | .+ ..++++++|+++|++|++++.++.|+++.+. ++++++||++|||||.+||. ++.|++..+|+ T Consensus 245 ~----~~-~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e~~~~~f~ 319 (432) T protein:vir:97 245 V----SG-SVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFL 319 (432) T ss_pred H----hh-hhcCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHHHHHHHHH Confidence 4 44 3467899999999999999999999999876 57899999999999999973 34588899999 Q ss_pred HHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCe-e Q lcl|NC_019422. 294 ESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDK-P 371 (384) Q Consensus 294 ~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~-~ 371 (384) ++||.|+++.||++||++|+++.++ .+.+++||++.+++.|.++++++++ ++.+|++|+||+|+++|+||+||||. + T Consensus 320 ~~tl~P~~~~ie~~ln~kLl~~~e~-~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~glpp~~g~~~~~ 398 (432) T protein:vir:97 320 TMTLSPWLRRIEQSIALNLLTPAER-RRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNAAVL 398 (432) T ss_pred HHHHHHHHHHHHHHHhhhccCcccc-CceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCcceE Confidence 9999999999999999999998775 4678999999999999999999875 89999999999999999999998755 5 Q ss_pred eecCceeecCCCC Q lcl|NC_019422. 372 VRRLDTAVVEGGE 384 (384) Q Consensus 372 ~~~~n~~~~~~ge 384 (384) +++.|++|++... T Consensus 399 ~~~~~~~pl~~~~ 411 (432) T protein:vir:97 399 TVQSAMVPLDSIG 411 (432) T ss_pred eecccccchhhhc Confidence 5899999875322 No 21 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=100.00 E-value=4.5e-88 Score=499.47 Aligned_cols=376 Identities=15% Similarity=0.154 Sum_probs=323.7 Q ss_pred CcchhhhcccCCCcchh-------------HHH--hhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKV-------------MME--LISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNE 65 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~-------------~~~--~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 65 (384) ||+|++.+.....+++. ... ....+..+..++.++++++++||+||++||+.||++||++|++++ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~lp~~~y~~~~ 86 (432) T protein:vir:10 7 LGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAMPLTMYMRTP 86 (432) T ss_pred cchhhhhHhhcCCccccccccccccccCcchhhhhcccccccCcccchhhhhcchHHHHHHHHHHHhhhhCceeEEEecC Confidence 99999876555443321 000 111223344556678999999999999999999999999999887 Q ss_pred Ccce-eccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC-EEEEEE Q lcl|NC_019422. 66 TEFK-TNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV-LFLKFL 143 (384) Q Consensus 66 ~~~~-~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~-~~~~~~ 143 (384) ++.+ +..++++++|+.+||++||+++||+.++.+++++||||++++++ .|++.+||+|+|.+|++..+.++ ..|++. T Consensus 87 ~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~~~~v~v~~~~~g~~~y~~~ 165 (432) T protein:vir:10 87 DGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDTKGNTAYRYR 165 (432) T ss_pred CCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEEcCCCcEEEEEE Confidence 7754 46677888889999999999999999999999999999999996 58999999999999999987665 456677 Q ss_pred EcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHH Q lcl|NC_019422. 144 LRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKS 223 (384) Q Consensus 144 ~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~ 223 (384) ..+|+.+.++++||||++++ +.++++|+||+..+.+++....++++++.++|+||++|++++++++.++++++++++++ T Consensus 166 ~~~g~~~~~~~~~iih~~~~-~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~ 244 (432) T protein:vir:10 166 RTDGQMIDIPKQQIWKIMGY-SLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDSFAKK 244 (432) T ss_pred ecCceEEEEcCccEEEecCC-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHHHHHH Confidence 77899999999999999855 67899999999999999999999999999999999999999999999999987776665 Q ss_pred HHHHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc---------ccHHHHHHHHH Q lcl|NC_019422. 224 FEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS---------KYSEDEWNAYY 293 (384) Q Consensus 224 ~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~---------~~~e~~~~~~~ 293 (384) |. + ..++++++++++|++|++++.++.|+++.+. ++++++||++|||||.+||. ++.|++..+|+ T Consensus 245 ~~----~-~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~~~~~f~ 319 (432) T protein:vir:10 245 VS----G-SVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFL 319 (432) T ss_pred Hh----h-hhhCCCceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHHHHHHHH Confidence 53 4 3467899999999999999999999999876 57899999999999999972 34578899999 Q ss_pred HHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCC-ee Q lcl|NC_019422. 294 ESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGD-KP 371 (384) Q Consensus 294 ~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd-~~ 371 (384) ++||.|+++.||++||++|+++.++ .+.+++||++++++.|.+++++.++ ++++|++|+||+|+++|+||+|||| .+ T Consensus 320 ~~tl~P~~~~ie~~ln~kL~~~~~~-~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~glppi~g~~~~~ 398 (432) T protein:vir:10 320 SMTLSPWLRRIEQSIALNLLSPAER-RRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNAAVL 398 (432) T ss_pred HHHHHHHHHHHHHHHHhhhcCcccc-CceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCcceE Confidence 9999999999999999999998775 4578999999999999999999875 8999999999999999999999765 55 Q ss_pred eecCceeecCCCC Q lcl|NC_019422. 372 VRRLDTAVVEGGE 384 (384) Q Consensus 372 ~~~~n~~~~~~ge 384 (384) ++++|++|++... T Consensus 399 ~~~~~~~pl~~~~ 411 (432) T protein:vir:10 399 TVQSAMVPLDSIG 411 (432) T ss_pred eecCcccchhhhc Confidence 6899999876321 No 22 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=100.00 E-value=5.9e-88 Score=498.80 Aligned_cols=376 Identities=15% Similarity=0.153 Sum_probs=321.7 Q ss_pred CcchhhhcccCCCcch-------------hHHH--hhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGK-------------VMME--LISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNE 65 (384) Q Consensus 1 M~~f~~~~~~~~~~~~-------------~~~~--~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 65 (384) ||||++.+.....+.+ .... .+..+..+..++..+++++++|++||++||+.||++|+++|++++ T Consensus 7 mg~f~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~lp~~~y~~~~ 86 (432) T protein:vir:81 7 LGLFGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAMPLTMYMRTP 86 (432) T ss_pred cchhhhhhhhcccccccccccccccccCccchhhhcccccccCcccchHhhhccHHHHHHHHHHHHhhhhCceeeEEecC Confidence 9999986543222111 0001 111122334456678999999999999999999999999999887 Q ss_pred Cccee-ccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC-EEEEEE Q lcl|NC_019422. 66 TEFKT-NPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV-LFLKFL 143 (384) Q Consensus 66 ~~~~~-~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~-~~~~~~ 143 (384) +|.++ ..++++++|+.+||++||+++||+.++.+++++||||+++.++ .|++.+||||+|..|++..+.++ ..|.+. T Consensus 87 ~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~-~g~~~~L~~l~~~~v~v~~~~~g~~~y~~~ 165 (432) T protein:vir:81 87 DGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDPKGNTAYRYR 165 (432) T ss_pred CcceecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEECCCCcEEEEEE Confidence 77554 5677888888999999999999999999999999999999986 48999999999999999998765 456677 Q ss_pred EcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHH Q lcl|NC_019422. 144 LRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKS 223 (384) Q Consensus 144 ~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~ 223 (384) ..+|+.+.++++||||+|++ +.++++|+||+..+.++|....++++++.++|+||++|++++++++.++++++++++++ T Consensus 166 ~~~g~~~~~~~~~iih~r~~-~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~ 244 (432) T protein:vir:81 166 RTDGQMIDIPKQQIWKIMGY-SLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDSFAKK 244 (432) T ss_pred ecCceEEEEccccEEEecCC-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHHHHHH Confidence 77899999999999999854 67889999999999999999999999999999999999999999999999988877666 Q ss_pred HHHHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc---------ccHHHHHHHHH Q lcl|NC_019422. 224 FEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS---------KYSEDEWNAYY 293 (384) Q Consensus 224 ~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~---------~~~e~~~~~~~ 293 (384) |. + ..++++++++++|++|++++.++.|+++.+. ++++++||++|||||.+||. ++.|++..+|+ T Consensus 245 ~~----~-~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq~~~~f~ 319 (432) T protein:vir:81 245 VS----G-SVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFL 319 (432) T ss_pred Hh----h-hhcCCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHHHHHHHH Confidence 53 4 3467899999999999999999999999876 57899999999999999973 34578899999 Q ss_pred HHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCC-Cee Q lcl|NC_019422. 294 ESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENG-DKP 371 (384) Q Consensus 294 ~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~g-d~~ 371 (384) +.||.|+++.||++||++|+++.++ .+.+++||++++++.|.++++++++ ++++|++|+||+|+++|+||+||| |.+ T Consensus 320 ~~tl~P~~~~ie~~l~~kLl~~~~~-~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~NE~R~~~glpp~~g~~~~~ 398 (432) T protein:vir:81 320 TMTLSPWLRRIEQSIALNLLSPAER-RRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNAAVL 398 (432) T ss_pred HHHHHHHHHHHHHHHHhhccCcccc-CceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCcceE Confidence 9999999999999999999998775 4689999999999999999999885 899999999999999999999976 456 Q ss_pred eecCceeecCCCC Q lcl|NC_019422. 372 VRRLDTAVVEGGE 384 (384) Q Consensus 372 ~~~~n~~~~~~ge 384 (384) ++++|++|++... T Consensus 399 ~~~~~~~pl~~~~ 411 (432) T protein:vir:81 399 TVQSAMVPLDSIG 411 (432) T ss_pred eecCcccchhhhc Confidence 6899999875211 No 23 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=100.00 E-value=5.4e-88 Score=499.03 Aligned_cols=378 Identities=15% Similarity=0.167 Sum_probs=325.0 Q ss_pred CcchhhhcccCCC-cc---hhH---HHhh-----ccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_019422. 1 MNIFKSKKKNKEA-PG---KVM---MELI-----SDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEF 68 (384) Q Consensus 1 M~~f~~~~~~~~~-~~---~~~---~~~~-----~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~ 68 (384) .++|.+.+.+... .. ..+ +..+ +.+..+..++..+++++++|++||++||+.||++||++|++++++. T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~g~~v~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~g~ 81 (454) T protein:vir:93 2 WNLLRRTRKNQKSGRDVREAGWTSLFQAVAEPFAGAWQQGVKADPEAVLSFHAVFACISLISQDIAKMRLRLMQTDAQGI 81 (454) T ss_pred CCccccCcccccccccccchhhhhhhhhhhhhhcchhhcCcccChHHhhccHHHHHHHHHHHHhhccCceEEEEeccCCc Confidence 5566553322221 11 111 1222 1223344456678999999999999999999999999999887665 Q ss_pred -eeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEE-EEEEEcC Q lcl|NC_019422. 69 -KTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLF-LKFLLRN 146 (384) Q Consensus 69 -~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~-~~~~~~~ 146 (384) ++..++.+++|+.+||++||+++||+.++.+++++||+|++++++..|.+.+|||++|++|++..+.++.. |.+.... T Consensus 82 ~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~y~~~~~~ 161 (454) T protein:vir:93 82 RRETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKELRILDWNRVEPLVADDGEVFYRITPDR 161 (454) T ss_pred cchhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCcceEEEEcCCCcEEEEEEecc Confidence 44677888999999999999999999999999999999999999999999999999999999998877644 4444332 Q ss_pred ----ceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHH Q lcl|NC_019422. 147 ----GKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVK 222 (384) Q Consensus 147 ----g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~ 222 (384) +..+.++++||||++++.+.++++|+||+..+..++....++++++.++|+||++|+++|++++.+++++.+++++ T Consensus 162 ~~~~~~~~~~~~~eViH~k~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~ 241 (454) T protein:vir:93 162 NCGITEAVTVPAREVIHDRFNCFFHPLIGLPPVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIPGSITEENAKKLKS 241 (454) T ss_pred ccccceeEEecCcceEEeccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCCCCHHHHHHHHH Confidence 4567899999999998888999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc------ccHHHHHHHHHHH Q lcl|NC_019422. 223 SFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS------KYSEDEWNAYYES 295 (384) Q Consensus 223 ~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~------~~~e~~~~~~~~~ 295 (384) .|++.++| .++|+++|+++|++|++++.++.++++.+. ++++++||++|||||.+||. ++.|++..+|++. T Consensus 242 ~~~~~~~g--~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~ 319 (454) T protein:vir:93 242 NWDSGYTG--ENAGKTAILSNGAKYNPTTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEALEQQYYSQ 319 (454) T ss_pred HHHHHhcc--cccCCceeccCCceEEEcccChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHHHHHHHHH Confidence 99998865 468899999999999999999999999876 56889999999999999973 3568888999999 Q ss_pred HHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeec Q lcl|NC_019422. 296 EIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRR 374 (384) Q Consensus 296 ~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~ 374 (384) ||.|++..|+++||++|++.. +.+++||++++++.|.+++++.++ ++++|++|+||+|+++|+||+||||+++++ T Consensus 320 ~l~P~~~~ie~~ln~~L~~~~----~~~~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~~~~~ 395 (454) T protein:vir:93 320 CLQTLIESIELLLDEALETGE----NESTEFDVTTLLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGGDALYLQ 395 (454) T ss_pred HHHHHHHHHHHHHHHhhcCCC----CcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeec Confidence 999999999999999998754 357999999999999999999875 899999999999999999999999999999 Q ss_pred CceeecCCCC Q lcl|NC_019422. 375 LDTAVVEGGE 384 (384) Q Consensus 375 ~n~~~~~~ge 384 (384) .|+++++... T Consensus 396 ~~~~~~~~~~ 405 (454) T protein:vir:93 396 QQNYSLEALS 405 (454) T ss_pred cCccchHhhh Confidence 9998874321 No 24 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=100.00 E-value=1.9e-87 Score=495.98 Aligned_cols=374 Identities=18% Similarity=0.270 Sum_probs=320.7 Q ss_pred CcchhhhcccCCC-----------cchhHHHhhcccc-CcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_019422. 1 MNIFKSKKKNKEA-----------PGKVMMELISDSG-NGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEF 68 (384) Q Consensus 1 M~~f~~~~~~~~~-----------~~~~~~~~~~~~~-~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~ 68 (384) |+||++..-.... +......+..... +++.++..+++++|+|++||++||++||++||+++++++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~ia~~iA~lp~~~~~~~~--- 77 (412) T protein:vir:26 1 MNVIAKENIVTRIKKKLIDNWIDQSTSKLYDFSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYK--- 77 (412) T ss_pred CccchhhhhhhhhhhhHhhhhhcccccccccccccCCccccccchhhhhccHHHHHHHHHHHHhHhhCceeEeeccc--- Confidence 9999862111110 0011111111111 233456678999999999999999999999999998653 Q ss_pred eeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCE--EEEEEEcC Q lcl|NC_019422. 69 KTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVL--FLKFLLRN 146 (384) Q Consensus 69 ~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~--~~~~~~~~ 146 (384) ...+++.++|+.+||++||+++||+.++.+++++||+|++++|+..|++.+|++++|..|++..+.++. .|.+...+ T Consensus 78 -~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~ 156 (412) T protein:vir:26 78 -VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHAAT 156 (412) T ss_pred -cccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEcCC Confidence 356778889999999999999999999999999999999999999999999999999999999887654 45555667 Q ss_pred ceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHH Q lcl|NC_019422. 147 GKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEK 226 (384) Q Consensus 147 g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~ 226 (384) |..+.++++||||++++++.++++|+||+..+..++....+++++. ++.++..++++++.++.+++++.++++++|++ T Consensus 157 g~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~a~~~~~--~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~ 234 (412) T protein:vir:26 157 GNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQ 234 (412) T ss_pred ceEEEEccccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHH--HHhcCCCCceEEecCCCCCHHHHHHHHHHHHH Confidence 8888999999999999889999999999999999999999988874 55666667888899999999999999999988 Q ss_pred HhccccccCCcceecCCCceeeecccchhHHHHHHHH-HHHHHHHHHhCCCHHHhcc------ccHHHHHHHHHHHHHHH Q lcl|NC_019422. 227 NYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQMD-KAIQRLYSFFNTNEKIIQS------KYSEDEWNAYYESEIEP 299 (384) Q Consensus 227 ~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~~-~~~~~I~~~fgvp~~~l~~------~~~e~~~~~~~~~~i~P 299 (384) .+. ++++++++++|++|++++.++.++++.+.+ +++++||++|||||.+||+ ++.|++..+|++.||.| T Consensus 235 ~~~----~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P 310 (412) T protein:vir:26 235 YYE----ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLP 310 (412) T ss_pred Hhh----cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHH Confidence 664 467899999999999999999999998774 6889999999999999974 35588899999999999 Q ss_pred HHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCcee Q lcl|NC_019422. 300 VGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTA 378 (384) Q Consensus 300 ~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~ 378 (384) ++.+|+++||++|+++.++..+.+|+||++++++.|.+++++.++ ++++|++|+||+|+++|+||+||||++++++|++ T Consensus 311 ~~~~ie~~ln~kLl~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD~~~~~~n~~ 390 (412) T protein:vir:26 311 IVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISGDLY 390 (412) T ss_pred HHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccccc Confidence 999999999999999998888899999999999999999999875 7999999999999999999999999999999998 Q ss_pred ecC----------CCC Q lcl|NC_019422. 379 VVE----------GGE 384 (384) Q Consensus 379 ~~~----------~ge 384 (384) |++ ||| T Consensus 391 ~~~~~~~~~~~~~gG~ 406 (412) T protein:vir:26 391 PIDTPLELRKSLKGGD 406 (412) T ss_pred ccccchhhcccccCCC Confidence 874 343 No 25 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=100.00 E-value=2.1e-87 Score=495.83 Aligned_cols=376 Identities=16% Similarity=0.206 Sum_probs=316.0 Q ss_pred CcchhhhcccCCCcc-hhHHHh---hcccc-C-cceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccch Q lcl|NC_019422. 1 MNIFKSKKKNKEAPG-KVMMEL---ISDSG-N-GFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEI 74 (384) Q Consensus 1 M~~f~~~~~~~~~~~-~~~~~~---~~~~~-~-~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~ 74 (384) ||||++.+++..... ...... +..+. . +-..+...++++++|++||+.||+++|++||++++. +.....++ T Consensus 1 Mg~f~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~---~~~~~~~~ 77 (416) T protein:vir:45 1 MGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVN---GQINYSDR 77 (416) T ss_pred CCcccccccccccCCCcchhHHHHHhccccccCccccchhhhhcchHHHHHHHHHHHhhccCceEEecC---ccccccch Confidence 999997655433322 111112 21111 1 122345678999999999999999999999999763 34556788 Q ss_pred HHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEc-----CceE Q lcl|NC_019422. 75 YIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLR-----NGKI 149 (384) Q Consensus 75 ~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~-----~g~~ 149 (384) .+++|+.+||++||+++||+.++.+++++||||++++|+..|++.+|||++|++|++..+.++..+++... .+.. T Consensus 78 ~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~ 157 (416) T protein:vir:45 78 IVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIE 157 (416) T ss_pred HHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEEEecCCCceeE Confidence 88889999999999999999999999999999999999999999999999999999999887766544321 2345 Q ss_pred EEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCC-hHHHHHHHHHHHHHh Q lcl|NC_019422. 150 VSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALR-PDDIKKEVKSFEKNY 228 (384) Q Consensus 150 ~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~-~e~~~~~~~~~~~~~ 228 (384) +.++++||||+|++ +.++++|+||+..+.+++....++++++.++|+||++|+++|++++.++ +++++++++.|.+.+ T Consensus 158 ~~~~~~evihir~~-~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~ 236 (416) T protein:vir:45 158 RNVKFEDMLDIKFY-SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSF 236 (416) T ss_pred EEEccccEEEeccC-CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHh Confidence 77999999999965 6788999999999999999999999999999999999999999998875 567889999999999 Q ss_pred ccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhccc---cHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 229 LQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQSK---YSEDEWNAYYESEIEPVGLQL 304 (384) Q Consensus 229 ~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~~---~~e~~~~~~~~~~i~P~~~~i 304 (384) .| ..++++++|+++|++|++++.++.++++.+. ++++++||++|||||.+||.+ ++.++...++.+||.|++.+| T Consensus 237 ~g-~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~~l~P~~~~i 315 (416) T protein:vir:45 237 SG-TKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANLDYLSTLKPYITCV 315 (416) T ss_pred cC-ccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccHHHHHHHHHHHHHHHHHHH Confidence 77 5577899999999999999999999999876 578899999999999999743 233344455667999999999 Q ss_pred HHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCC--eeeecCceeecC Q lcl|NC_019422. 305 SNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGD--KPVRRLDTAVVE 381 (384) Q Consensus 305 ~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd--~~~~~~n~~~~~ 381 (384) +++||++|+++.. +.+++||++.+++.|.++++++++ ++++|++|+||+|+++|+||+|||| ++++|+|++|++ T Consensus 316 e~~ln~~l~~~~~---~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n~~~~~ 392 (416) T protein:vir:45 316 CAELNFKFNDEYV---NREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIE 392 (416) T ss_pred HHHHhhhcccccc---CceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccccc Confidence 9999999987653 568999999999999999999886 7999999999999999999999877 689999998875 Q ss_pred ------------------CCC Q lcl|NC_019422. 382 ------------------GGE 384 (384) Q Consensus 382 ------------------~ge 384 (384) ||| T Consensus 393 ~~~~~~~~~~~~~~~~~kgGe 413 (416) T protein:vir:45 393 LVDEYQMNKSRATDKKLKGGE 413 (416) T ss_pred cccccCcccccccccccCCCC Confidence 344 No 26 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=100.00 E-value=2.1e-87 Score=495.83 Aligned_cols=376 Identities=16% Similarity=0.206 Sum_probs=316.0 Q ss_pred CcchhhhcccCCCcc-hhHHHh---hcccc-C-cceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccch Q lcl|NC_019422. 1 MNIFKSKKKNKEAPG-KVMMEL---ISDSG-N-GFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEI 74 (384) Q Consensus 1 M~~f~~~~~~~~~~~-~~~~~~---~~~~~-~-~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~ 74 (384) ||||++.+++..... ...... +..+. . +-..+...++++++|++||+.||+++|++||++++. +.....++ T Consensus 1 Mg~f~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~---~~~~~~~~ 77 (416) T protein:vir:81 1 MGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVN---GQINYSDR 77 (416) T ss_pred CCcccccccccccCCCcchhHHHHHhccccccCccccchhhhhcchHHHHHHHHHHHhhccCceEEecC---ccccccch Confidence 999997655433322 111112 21111 1 122345678999999999999999999999999763 34556788 Q ss_pred HHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEc-----CceE Q lcl|NC_019422. 75 YIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLR-----NGKI 149 (384) Q Consensus 75 ~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~-----~g~~ 149 (384) .+++|+.+||++||+++||+.++.+++++||||++++|+..|++.+|||++|++|++..+.++..+++... .+.. T Consensus 78 ~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~ 157 (416) T protein:vir:81 78 IVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIE 157 (416) T ss_pred HHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEEEecCCCceeE Confidence 88889999999999999999999999999999999999999999999999999999999887766544321 2345 Q ss_pred EEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCC-hHHHHHHHHHHHHHh Q lcl|NC_019422. 150 VSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALR-PDDIKKEVKSFEKNY 228 (384) Q Consensus 150 ~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~-~e~~~~~~~~~~~~~ 228 (384) +.++++||||+|++ +.++++|+||+..+.+++....++++++.++|+||++|+++|++++.++ +++++++++.|.+.+ T Consensus 158 ~~~~~~evihir~~-~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~ 236 (416) T protein:vir:81 158 RNVKFEDMLDIKFY-SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSF 236 (416) T ss_pred EEEccccEEEeccC-CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHh Confidence 77999999999965 6788999999999999999999999999999999999999999998875 567889999999999 Q ss_pred ccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhccc---cHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 229 LQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQSK---YSEDEWNAYYESEIEPVGLQL 304 (384) Q Consensus 229 ~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~~---~~e~~~~~~~~~~i~P~~~~i 304 (384) .| ..++++++|+++|++|++++.++.++++.+. ++++++||++|||||.+||.+ ++.++...++.+||.|++.+| T Consensus 237 ~g-~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~~l~P~~~~i 315 (416) T protein:vir:81 237 SG-TKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANLDYLSTLKPYITCV 315 (416) T ss_pred cC-ccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccHHHHHHHHHHHHHHHHHHH Confidence 77 5577899999999999999999999999876 578899999999999999743 233344455667999999999 Q ss_pred HHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCC--eeeecCceeecC Q lcl|NC_019422. 305 SNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGD--KPVRRLDTAVVE 381 (384) Q Consensus 305 ~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd--~~~~~~n~~~~~ 381 (384) +++||++|+++.. +.+++||++.+++.|.++++++++ ++++|++|+||+|+++|+||+|||| ++++|+|++|++ T Consensus 316 e~~ln~~l~~~~~---~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n~~~~~ 392 (416) T protein:vir:81 316 CAELNFKFNDEYV---NREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIE 392 (416) T ss_pred HHHHhhhcccccc---CceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccccc Confidence 9999999987653 568999999999999999999886 7999999999999999999999877 689999998875 Q ss_pred ------------------CCC Q lcl|NC_019422. 382 ------------------GGE 384 (384) Q Consensus 382 ------------------~ge 384 (384) ||| T Consensus 393 ~~~~~~~~~~~~~~~~~kgGe 413 (416) T protein:vir:81 393 LVDEYQMNKSRATDKKLKGGE 413 (416) T ss_pred cccccCcccccccccccCCCC Confidence 344 No 27 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=100.00 E-value=4e-87 Score=494.23 Aligned_cols=378 Identities=13% Similarity=0.157 Sum_probs=325.5 Q ss_pred CcchhhhcccCCCc----chhHHHhhcc--ccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCccee-ccc Q lcl|NC_019422. 1 MNIFKSKKKNKEAP----GKVMMELISD--SGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKT-NPE 73 (384) Q Consensus 1 M~~f~~~~~~~~~~----~~~~~~~~~~--~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~-~~~ 73 (384) |.|...+++..... ..++..++.. +..+..++..+++++++|++||++||+.||++||++|++++++.++ .++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~g~~~s~~~~~v~~~~al~~~~v~~cv~~ia~~ia~lp~~~~~~~~~~~~~~~~~ 80 (419) T protein:vir:80 1 MFFSRQLLSNLGQTQPGSGGWVSALLGSARSEAGQVVTPASALSLTVLQNCVTLLAESIAQLPVELYERSGDDRKPATDH 80 (419) T ss_pred CCcccccccccCcCCCCcchhhHHhhcccccccCcccChHHhhccHHHHHHHHHHHHhhccCceEEEEecCCCccccccc Confidence 77665544443322 2333344332 3334556678899999999999999999999999999988777544 567 Q ss_pred hHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEcCceEEEEe Q lcl|NC_019422. 74 IYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLRNGKIVSYP 153 (384) Q Consensus 74 ~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~ 153 (384) +++++|+.+||++||+++||+.++.+++++||+|+++.|+..|++.+||||+|.+|++..+.++..+++. .+. ..++ T Consensus 81 ~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~i~~~~v~i~~~~~~~~~y~~-~~~--~~~~ 157 (419) T protein:vir:80 81 PLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDGVIQGLYPLDNEAVTVMKGPDLKPMYRV-AGA--DPLP 157 (419) T ss_pred HHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCceEEEEE-cCc--cccc Confidence 7788888999999999999999999999999999999999999999999999999999998887655433 222 2478 Q ss_pred hhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCC----ChHHHHHHHHHHHHHhc Q lcl|NC_019422. 154 YSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTAL----RPDDIKKEVKSFEKNYL 229 (384) Q Consensus 154 ~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~----~~e~~~~~~~~~~~~~~ 229 (384) .++|+|++++ +.++++|+||+..+..++....++++++.++|+||++|+++|++++.. ++++.+++++.|++.++ T Consensus 158 ~~~i~h~~~~-~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~ 236 (419) T protein:vir:80 158 QRLVHHVRWM-SINGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPTDAPALKDQASVDRITDGWNAKFG 236 (419) T ss_pred hhheEEecCC-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhc Confidence 9999999965 578899999999999999999999999999999999999999987644 67788999999999997 Q ss_pred cccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc------ccHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 230 QIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS------KYSEDEWNAYYESEIEPVGL 302 (384) Q Consensus 230 ~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~------~~~e~~~~~~~~~~i~P~~~ 302 (384) | ..++++++++++|++|++++.++.|+++.+. ++++++||++|||||.+||. ++.|++..+|+++||.|++. T Consensus 237 g-~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~~~~f~~~~l~P~~~ 315 (419) T protein:vir:80 237 G-SGNAKKVALLQEGMKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIYTLLPWVK 315 (419) T ss_pred C-ccccCCceecCCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHH Confidence 7 4678999999999999999999999999876 56889999999999999963 35688999999999999999 Q ss_pred HHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeecC Q lcl|NC_019422. 303 QLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVE 381 (384) Q Consensus 303 ~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~ 381 (384) .|+++|+++|+++.++ .+.+++||++++++.|.+++++.++ ++++|++|+||+|+++|+||+||||++++|+|+++++ T Consensus 316 ~ie~~l~~kll~~~~~-~~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~gGD~~~~~~n~~~~~ 394 (419) T protein:vir:80 316 RHEQAKTRDLLLPSER-KQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGGDIYLSPMNMVDAS 394 (419) T ss_pred HHHHHHhhhccCcccc-CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeecccccccc Confidence 9999999999998765 4678999999999999999999875 7999999999999999999999999999999998764 Q ss_pred CCC Q lcl|NC_019422. 382 GGE 384 (384) Q Consensus 382 ~ge 384 (384) ..+ T Consensus 395 ~~~ 397 (419) T protein:vir:80 395 KPQ 397 (419) T ss_pred ccc Confidence 332 No 28 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=100.00 E-value=5.2e-87 Score=493.63 Aligned_cols=381 Identities=15% Similarity=0.204 Sum_probs=322.2 Q ss_pred Ccchhhhc-ccCC-Ccc-hhHHHhhcc-ccCc------ceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_019422. 1 MNIFKSKK-KNKE-APG-KVMMELISD-SGNG------FYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKT 70 (384) Q Consensus 1 M~~f~~~~-~~~~-~~~-~~~~~~~~~-~~~~------~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~ 70 (384) |=+=+-.. +... ++. +++-..... +..+ +....+.++++++|++||++||+.||++||++|++++++..+ T Consensus 1 ~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~V~acV~~IA~~iA~lp~~l~~~~~~~~~~ 80 (518) T protein:vir:78 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) T ss_pred CcccCceeeccchhhhhhhhhhhcccccceeceecccccchhhHHhhhhHHHHHHHHHHHHhhccCceEEEEEcCCcccc Confidence 43333211 1111 111 111111110 1111 122335678999999999999999999999999998888887 Q ss_pred ccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEE--EEEEEcC-- Q lcl|NC_019422. 71 NPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLF--LKFLLRN-- 146 (384) Q Consensus 71 ~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~--~~~~~~~-- 146 (384) ..++.+++|+.+||++||+++||+.++.+++++||+|++++|+..|.+.+||||+|.+|++..+.++.. |+|.... T Consensus 81 ~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~Vtv~~~~~~~~~~y~~~~~~~~ 160 (518) T protein:vir:78 81 EHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAGV 160 (518) T ss_pred ccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEECCCceEEEEcCCCCEEEEEEEecCCc Confidence 888899999999999999999999999999999999999999999999999999999999998865433 3343332 Q ss_pred -ceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHH Q lcl|NC_019422. 147 -GKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFE 225 (384) Q Consensus 147 -g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~ 225 (384) ++.+.++++||||+|++++.+..+|+||+..+..++....++++++.++|+||++|+++|++++.+++++.+++++.|+ T Consensus 161 ~~~~~~~~~~eIiHir~~~~dg~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~~ls~e~~~~~k~~~~ 240 (518) T protein:vir:78 161 GTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSPEAQQRLREQFD 240 (518) T ss_pred cceeEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHH Confidence 4677899999999998877666789999999999999999999999999999999999999999999999999999999 Q ss_pred HHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc------ccHHHHHHHHHHHHHH Q lcl|NC_019422. 226 KNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS------KYSEDEWNAYYESEIE 298 (384) Q Consensus 226 ~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~------~~~e~~~~~~~~~~i~ 298 (384) +.++| ..++|+++|+++|++|+++++++.|+++.+. ++++++||++|||||.+||. ++.|++..+|+++||. T Consensus 241 ~~~~G-~~nag~~~vL~~G~~~~~l~~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~st~sn~e~~~~~f~~~tL~ 319 (518) T protein:vir:78 241 RAHAG-SSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMA 319 (518) T ss_pred HHhcC-cccCCceeEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHHHHHHHHH Confidence 99976 4577999999999999999999999999876 57889999999999999972 3558899999999999 Q ss_pred HHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCC--CCCeeeecC Q lcl|NC_019422. 299 PVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIE--NGDKPVRRL 375 (384) Q Consensus 299 P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~--~gd~~~~~~ 375 (384) |++.+|+++||++|++..+. +.+++||++.+++.|.++++++++ ++++|++|+||+|+++|+||+| |||++++++ T Consensus 320 P~~~~ie~eln~~L~~~~~~--~~~~~fd~~~Llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~gl~pie~~~gD~~~v~~ 397 (518) T protein:vir:78 320 IPIARIQSAMDKYVGQYWVR--KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANS 397 (518) T ss_pred HHHHHHHHHHHHhhcccccC--cceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecc Confidence 99999999999999987664 458999999999999999999875 8999999999999999999996 799999999 Q ss_pred ceeecCCCC Q lcl|NC_019422. 376 DTAVVEGGE 384 (384) Q Consensus 376 n~~~~~~ge 384 (384) |++|++... T Consensus 398 n~~pl~~~~ 406 (518) T protein:vir:78 398 ALQPLGATP 406 (518) T ss_pred cceeccccc Confidence 999875322 No 29 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=100.00 E-value=3.8e-87 Score=494.40 Aligned_cols=376 Identities=17% Similarity=0.217 Sum_probs=312.9 Q ss_pred CcchhhhcccC-CCcchh---HHHhhccc-cC-cceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccch Q lcl|NC_019422. 1 MNIFKSKKKNK-EAPGKV---MMELISDS-GN-GFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEI 74 (384) Q Consensus 1 M~~f~~~~~~~-~~~~~~---~~~~~~~~-~~-~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~ 74 (384) ||+|.+.++.. ..+... +...+... +. +...+...++++++|++||++||++||++|++++++ +.....++ T Consensus 26 ~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~~cv~~Ia~~iA~lp~~~~~~---~~~~~~~~ 102 (441) T protein:vir:94 26 VGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVN---GQINYSDR 102 (441) T ss_pred cccccccccccccCCCcchHHHHHHhcccCcccccccchhhhhccHHHHHHHHHHHHhhccCceeeecC---ccccccch Confidence 66665433221 111111 11222111 11 112445678999999999999999999999999863 34556788 Q ss_pred HHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEc-----CceE Q lcl|NC_019422. 75 YIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLR-----NGKI 149 (384) Q Consensus 75 ~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~-----~g~~ 149 (384) ++++|+.+||++||+++||+.++.+++++||||++++|+..|++.+|++++|+.|++..+.++..+++... .+.. T Consensus 103 ~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~g~~~~~~~~~~~~~~~~~ 182 (441) T protein:vir:94 103 IVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIE 182 (441) T ss_pred HHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEEEeccCCceeE Confidence 88889999999999999999999999999999999999999999999999999999999887766544432 2345 Q ss_pred EEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCC-hHHHHHHHHHHHHHh Q lcl|NC_019422. 150 VSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALR-PDDIKKEVKSFEKNY 228 (384) Q Consensus 150 ~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~-~e~~~~~~~~~~~~~ 228 (384) +.++++||||+|++ +.++++|+||+..+.+++....++++++.++|+||++|+++|++++.++ +++++++++.|.+.+ T Consensus 183 ~~~~~~dvih~k~~-~~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~~ 261 (441) T protein:vir:94 183 RNVKFEDMLDIKFY-SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSF 261 (441) T ss_pred EEEccccEEEeccC-CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHHh Confidence 78999999999964 6788999999999999999999999999999999999999999999875 677889999999999 Q ss_pred ccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhccc---cHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 229 LQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQSK---YSEDEWNAYYESEIEPVGLQL 304 (384) Q Consensus 229 ~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~~---~~e~~~~~~~~~~i~P~~~~i 304 (384) +| ..++++++|+++|++|++++.++.|+|+.+. ++++++||++|||||.+||.+ .+.++...++..||.|++.+| T Consensus 262 ~G-~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~~~~~tl~P~~~~i 340 (441) T protein:vir:94 262 SG-TKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANLDYLSTLKPYITCV 340 (441) T ss_pred cC-ccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHHHHHHHHHHHHHHH Confidence 87 5677899999999999999999999999877 578999999999999999743 222333445567999999999 Q ss_pred HHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCC--eeeecCceeecC Q lcl|NC_019422. 305 SNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGD--KPVRRLDTAVVE 381 (384) Q Consensus 305 ~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd--~~~~~~n~~~~~ 381 (384) |++||++|+++. .+.+++||++.+++.|.++++++++ ++++|++|+||+|+++|+||+|||| .+++|+|++|++ T Consensus 341 e~eln~kl~~~~---~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~~~n~~~~~ 417 (441) T protein:vir:94 341 CAELNFKFNDEY---VNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIE 417 (441) T ss_pred HHHHhhhccccc---cCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccccc Confidence 999999998764 3568999999999999999999886 7899999999999999999999988 588999999875 Q ss_pred ------------------CCC Q lcl|NC_019422. 382 ------------------GGE 384 (384) Q Consensus 382 ------------------~ge 384 (384) +|| T Consensus 418 ~~~~~~~~~~~~~~~~~kgGe 438 (441) T protein:vir:94 418 LVDEYQMNKSRATDKKLKGGE 438 (441) T ss_pred cccccccccccccccccCCCC Confidence 333 No 30 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=100.00 E-value=3.8e-87 Score=494.40 Aligned_cols=376 Identities=17% Similarity=0.217 Sum_probs=312.9 Q ss_pred CcchhhhcccC-CCcchh---HHHhhccc-cC-cceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccch Q lcl|NC_019422. 1 MNIFKSKKKNK-EAPGKV---MMELISDS-GN-GFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEI 74 (384) Q Consensus 1 M~~f~~~~~~~-~~~~~~---~~~~~~~~-~~-~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~ 74 (384) ||+|.+.++.. ..+... +...+... +. +...+...++++++|++||++||++||++|++++++ +.....++ T Consensus 26 ~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~~cv~~Ia~~iA~lp~~~~~~---~~~~~~~~ 102 (441) T protein:vir:79 26 VGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVN---GQINYSDR 102 (441) T ss_pred cccccccccccccCCCcchHHHHHHhcccCcccccccchhhhhccHHHHHHHHHHHHhhccCceeeecC---ccccccch Confidence 66665433221 111111 11222111 11 112445678999999999999999999999999863 34556788 Q ss_pred HHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEc-----CceE Q lcl|NC_019422. 75 YIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLR-----NGKI 149 (384) Q Consensus 75 ~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~-----~g~~ 149 (384) ++++|+.+||++||+++||+.++.+++++||||++++|+..|++.+|++++|+.|++..+.++..+++... .+.. T Consensus 103 ~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~g~~~~~~~~~~~~~~~~~ 182 (441) T protein:vir:79 103 IVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIE 182 (441) T ss_pred HHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEEEeccCCceeE Confidence 88889999999999999999999999999999999999999999999999999999999887766544432 2345 Q ss_pred EEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCC-hHHHHHHHHHHHHHh Q lcl|NC_019422. 150 VSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALR-PDDIKKEVKSFEKNY 228 (384) Q Consensus 150 ~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~-~e~~~~~~~~~~~~~ 228 (384) +.++++||||+|++ +.++++|+||+..+.+++....++++++.++|+||++|+++|++++.++ +++++++++.|.+.+ T Consensus 183 ~~~~~~dvih~k~~-~~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~~ 261 (441) T protein:vir:79 183 RNVKFEDMLDIKFY-SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSF 261 (441) T ss_pred EEEccccEEEeccC-CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHHh Confidence 78999999999964 6788999999999999999999999999999999999999999999875 677889999999999 Q ss_pred ccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhccc---cHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 229 LQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQSK---YSEDEWNAYYESEIEPVGLQL 304 (384) Q Consensus 229 ~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~~---~~e~~~~~~~~~~i~P~~~~i 304 (384) +| ..++++++|+++|++|++++.++.|+|+.+. ++++++||++|||||.+||.+ .+.++...++..||.|++.+| T Consensus 262 ~G-~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~~~~~tl~P~~~~i 340 (441) T protein:vir:79 262 SG-TKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANLDYLSTLKPYITCV 340 (441) T ss_pred cC-ccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHHHHHHHHHHHHHHH Confidence 87 5677899999999999999999999999877 578999999999999999743 222333445567999999999 Q ss_pred HHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCC--eeeecCceeecC Q lcl|NC_019422. 305 SNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGD--KPVRRLDTAVVE 381 (384) Q Consensus 305 ~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd--~~~~~~n~~~~~ 381 (384) |++||++|+++. .+.+++||++.+++.|.++++++++ ++++|++|+||+|+++|+||+|||| .+++|+|++|++ T Consensus 341 e~eln~kl~~~~---~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~~~n~~~~~ 417 (441) T protein:vir:79 341 CAELNFKFNDEY---VNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIE 417 (441) T ss_pred HHHHhhhccccc---cCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccccc Confidence 999999998764 3568999999999999999999886 7899999999999999999999988 588999999875 Q ss_pred ------------------CCC Q lcl|NC_019422. 382 ------------------GGE 384 (384) Q Consensus 382 ------------------~ge 384 (384) +|| T Consensus 418 ~~~~~~~~~~~~~~~~~kgGe 438 (441) T protein:vir:79 418 LVDEYQMNKSRATDKKLKGGE 438 (441) T ss_pred cccccccccccccccccCCCC Confidence 333 No 31 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=100.00 E-value=5.5e-87 Score=493.51 Aligned_cols=378 Identities=13% Similarity=0.150 Sum_probs=323.9 Q ss_pred CcchhhhcccCC----CcchhHHHhhcc--ccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcce-eccc Q lcl|NC_019422. 1 MNIFKSKKKNKE----APGKVMMELISD--SGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFK-TNPE 73 (384) Q Consensus 1 M~~f~~~~~~~~----~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~-~~~~ 73 (384) |-|..+..++.. +...++..++.. +..+-.++.++++++++|++||++||+.||++||++|++++++.+ +.++ T Consensus 1 ~~~~r~~~~~~~~~~~~~~~~~~~~~g~~~s~~~~~vt~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~~~ 80 (419) T protein:vir:14 1 MFFSRQLLSNLGQTQMSAGGWVSALLGSSRSDSGQVVTPASALALTVLQNCVTLLAESIAQLPIELYERSGEDRKPATDH 80 (419) T ss_pred CcccccccccccccccCcchhhHHhhcCCCccCCcccchHHhhccHHHHHHHHHHHHhhccCceEEEEecCCcccccccc Confidence 644332222222 222333334432 333445667889999999999999999999999999998876654 4467 Q ss_pred hHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEcCceEEEEe Q lcl|NC_019422. 74 IYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLRNGKIVSYP 153 (384) Q Consensus 74 ~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~ 153 (384) +++++|+.+||++||+++||+.++.+++++||+|+++.|+..|++.+|||++|.+|++..+.++..+++.... . .++ T Consensus 81 ~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~l~pl~~~~v~v~~~~~~~~~y~~~~~-~--~~~ 157 (419) T protein:vir:14 81 PLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGVIQGLYPLDNEAVTVMRGSDLKPVYRVRGS-D--PMP 157 (419) T ss_pred HHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCceEEEEEccC-c--ccc Confidence 7778888899999999999999999999999999999999999999999999999999998877655443322 2 368 Q ss_pred hhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCC----ChHHHHHHHHHHHHHhc Q lcl|NC_019422. 154 YSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTAL----RPDDIKKEVKSFEKNYL 229 (384) Q Consensus 154 ~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~----~~e~~~~~~~~~~~~~~ 229 (384) .++|+|++++ +.++++|+||+..+..++....++++++.++|+||++|+++|++++.+ ++++.+++++.|++.++ T Consensus 158 ~~~i~h~~~~-~~dg~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~ 236 (419) T protein:vir:14 158 QRLVHHVRWM-SINGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQASVDRITDGWNAKFG 236 (419) T ss_pred hhheeEecCc-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhc Confidence 8999999865 578899999999999999999999999999999999999999998766 47788999999999997 Q ss_pred cccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc------ccHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 230 QIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS------KYSEDEWNAYYESEIEPVGL 302 (384) Q Consensus 230 ~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~------~~~e~~~~~~~~~~i~P~~~ 302 (384) | ..++++++++++|++|++++.++.|+++.+. ++++++||++|||||.+||. ++.|++..+|+++||.|++. T Consensus 237 g-~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~~E~~~~~f~~~~L~P~~~ 315 (419) T protein:vir:14 237 G-SGNAKKVALLQEGMTFRPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIYTLLPWVK 315 (419) T ss_pred C-ccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHH Confidence 7 5678999999999999999999999999876 56889999999999999973 35589999999999999999 Q ss_pred HHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeecC Q lcl|NC_019422. 303 QLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVE 381 (384) Q Consensus 303 ~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~ 381 (384) +|+++||++|+++.++ .+.+++||++++++.|.+++++.++ ++++|++|+||+|+++|+||+||||++++|+|+++++ T Consensus 316 ~ie~~l~~kll~~~~~-~~~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gGD~~~~~~n~~~~~ 394 (419) T protein:vir:14 316 RHEQAKTRDLLLPSER-KQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGGDIYLSPMNMVDAS 394 (419) T ss_pred HHHHHHhhhccCcccc-CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccccc Confidence 9999999999998776 4778999999999999999999885 8999999999999999999999999999999998876 Q ss_pred CCC Q lcl|NC_019422. 382 GGE 384 (384) Q Consensus 382 ~ge 384 (384) ..+ T Consensus 395 ~~~ 397 (419) T protein:vir:14 395 KPQ 397 (419) T ss_pred ccc Confidence 543 No 32 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=100.00 E-value=5e-87 Score=493.74 Aligned_cols=381 Identities=12% Similarity=0.099 Sum_probs=323.4 Q ss_pred CcchhhhcccCCCc-------------chhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCc Q lcl|NC_019422. 1 MNIFKSKKKNKEAP-------------GKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETE 67 (384) Q Consensus 1 M~~f~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 67 (384) ||||+++++....+ .+....+.+...++..++..+++++++|++||++||+.||++|+++|++++++ T Consensus 1 Mg~~~~l~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~V~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~ 80 (457) T protein:vir:13 1 MGFWSALFGRGHSPALDGIEARAWEPYDPSIYNLGAVAASGETVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRGGS 80 (457) T ss_pred CchhhhhhcccccccccccccccccccchHHHhhcccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 99999765443322 12223333334445556677899999999999999999999999999988777 Q ss_pred ceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCE-----EEEE Q lcl|NC_019422. 68 FKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVL-----FLKF 142 (384) Q Consensus 68 ~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~-----~~~~ 142 (384) .+...++.+..++..||..||+++||+.++.+++++||+|+++.++ .|.+++||+|+|.+|++..+.... ++.| T Consensus 81 ~~~~~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~y 159 (457) T protein:vir:13 81 RKEIVTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQ-GPNIVGLDVLDPTKIHVHMVMVDGLRRKVFEAY 159 (457) T ss_pred ccccccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEccCceEEEEecCCCccceeEEEE Confidence 6655555555555555558999999999999999999999999776 589999999999999998765432 2233 Q ss_pred EEc-Cc---eEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHH Q lcl|NC_019422. 143 LLR-NG---KIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIK 218 (384) Q Consensus 143 ~~~-~g---~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~ 218 (384) ... +| ....++++||||++.+++.+.++|+||+..+.++|....++++++.++|+||++|+++|++++.+++++++ T Consensus 160 ~~~~~~~~~~~~~~~~~diih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~ 239 (457) T protein:vir:13 160 DIDADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVPGTMSEEGLA 239 (457) T ss_pred EEecCCceeeEEeeCccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCCCCHHHHH Confidence 332 22 23468899999999998888899999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc--------ccHHHHH Q lcl|NC_019422. 219 KEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS--------KYSEDEW 289 (384) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~--------~~~e~~~ 289 (384) ++++.|++.++| ..++++++|+++|++|++++.++.++++.+. ++++++||++|||||.+||. ++.+++. T Consensus 240 ~~~~~~~~~~~g-~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~ 318 (457) T protein:vir:13 240 RAREAWRAANSG-VDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQN 318 (457) T ss_pred HHHHHHHHHhcC-ccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHH Confidence 999999999976 5678999999999999999999999999876 57889999999999999962 4458889 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCC Q lcl|NC_019422. 290 NAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENG 368 (384) Q Consensus 290 ~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~g 368 (384) .+|+++||.|+++.||++||++|+++.+. .+.+++||++++++.|.++++++++ ++++|++|+||+|+++|+||+||| T Consensus 319 ~~f~~~tl~P~~~~ie~~ln~~L~~~~~~-~~~~i~fd~~~l~~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~Pi~~g 397 (457) T protein:vir:13 319 IAFTMFSLRPWLERIEAGFNRLLFAETAD-RFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDG 397 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCcccc-CceeEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 99999999999999999999999998775 4678999999999999999999875 899999999999999999999986 Q ss_pred --CeeeecCceeecCCC-C Q lcl|NC_019422. 369 --DKPVRRLDTAVVEGG-E 384 (384) Q Consensus 369 --d~~~~~~n~~~~~~g-e 384 (384) |++++|+|++++... + T Consensus 398 ~~d~~~~~~n~~~~~~~~~ 416 (457) T protein:vir:13 398 LGEKYRVPLNLGEVGEEPE 416 (457) T ss_pred cccceeecccccccccccc Confidence 999999999887432 1 No 33 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=100.00 E-value=4.3e-87 Score=494.09 Aligned_cols=375 Identities=16% Similarity=0.203 Sum_probs=313.9 Q ss_pred CcchhhhcccC-CCcchhHHHh---hccc-cC-cceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccch Q lcl|NC_019422. 1 MNIFKSKKKNK-EAPGKVMMEL---ISDS-GN-GFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEI 74 (384) Q Consensus 1 M~~f~~~~~~~-~~~~~~~~~~---~~~~-~~-~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~ 74 (384) ||+|.+.++.. ..+......+ +... +. +-..+...++++++|++||+.||++||++|++++++ +.....++ T Consensus 26 ~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~acv~~Ia~~iA~lpl~~~~~---~~~~~~~~ 102 (441) T protein:vir:98 26 VGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVN---GQINYSDR 102 (441) T ss_pred cccccccccccccCCCcchHHHHHHhhcccccCccccchhhhhccHHHHHHHHHHHHhhccCceEEecC---Ccccccch Confidence 88887543332 1122222222 1111 11 122455678999999999999999999999999863 34556778 Q ss_pred HHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEc-----CceE Q lcl|NC_019422. 75 YIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLR-----NGKI 149 (384) Q Consensus 75 ~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~-----~g~~ 149 (384) ++++|+.+||++||+++||+.++.+++++||||++++|+..|++.+|||++|+.|++..+.++..+++... .+.. T Consensus 103 ~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~ 182 (441) T protein:vir:98 103 IVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKLDARGRLYYFHQRIDSNGNNIE 182 (441) T ss_pred HHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCceeEEEECCCCcEEEEEEEeccCcceee Confidence 88889999999999999999999999999999999999999999999999999999999887766544332 2456 Q ss_pred EEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCC-hHHHHHHHHHHHHHh Q lcl|NC_019422. 150 VSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALR-PDDIKKEVKSFEKNY 228 (384) Q Consensus 150 ~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~-~e~~~~~~~~~~~~~ 228 (384) +.++++||||+|++ +.++++|+||+..+.+++....++++++.++|+||++|+++|++++.++ +++.+++++.|++.+ T Consensus 183 ~~~~~~dviHir~~-~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~ 261 (441) T protein:vir:98 183 RNVKFEDMLDIKFY-SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSF 261 (441) T ss_pred EEEccccEEEeccC-CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHh Confidence 78999999999965 6788999999999999999999999999999999999999999999875 677889999999999 Q ss_pred ccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhccc----cHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 229 LQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQSK----YSEDEWNAYYESEIEPVGLQ 303 (384) Q Consensus 229 ~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~~----~~e~~~~~~~~~~i~P~~~~ 303 (384) +| ..++++++|+++|++|++++.++.|+++.+. ++++++||++|||||.+||.+ +.+++...| ..||+|++.+ T Consensus 262 ~G-~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~~y-~~tl~P~~~~ 339 (441) T protein:vir:98 262 SG-TKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANLDY-LSTLKPYITC 339 (441) T ss_pred cC-ccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHHHH-HHHHHHHHHH Confidence 87 5678899999999999999999999999876 578999999999999999743 224444444 4699999999 Q ss_pred HHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCC--eeeecCceeec Q lcl|NC_019422. 304 LSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGD--KPVRRLDTAVV 380 (384) Q Consensus 304 i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd--~~~~~~n~~~~ 380 (384) ||++||++|+++.+ +.+++||++++++.|.++++++++ ++++|++|+||+|+++|+||+|||| .+++|+|++|+ T Consensus 340 ie~~ln~~L~~~~~---~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGd~~~~~~~~n~~~~ 416 (441) T protein:vir:98 340 VCAELNFKFNDEYV---NREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNI 416 (441) T ss_pred HHHHHHhhcccccc---CceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeeccccccc Confidence 99999999987653 568999999999999999999886 7999999999999999999999988 68899999887 Q ss_pred CC------------------CC Q lcl|NC_019422. 381 EG------------------GE 384 (384) Q Consensus 381 ~~------------------ge 384 (384) +. || T Consensus 417 ~~~~~~q~~~~~~~~~~~kgGe 438 (441) T protein:vir:98 417 ELVDEYQMNKSRATDKKLKGGE 438 (441) T ss_pred ccccccccccccccccccCCCC Confidence 53 33 No 34 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=100.00 E-value=1.1e-86 Score=491.92 Aligned_cols=381 Identities=14% Similarity=0.192 Sum_probs=323.5 Q ss_pred Ccchhhh-cccCCCcc-hhHH-Hhhcc-cc------CcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_019422. 1 MNIFKSK-KKNKEAPG-KVMM-ELISD-SG------NGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKT 70 (384) Q Consensus 1 M~~f~~~-~~~~~~~~-~~~~-~~~~~-~~------~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~ 70 (384) |=+=+-. .+.+...+ .+.+ ..+.. +. .......+.++++++|++||+.||++||++||++|++++++..+ T Consensus 1 ~~~~~~~~~~~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~V~acV~~IA~~iA~lpl~l~~~~~~~~~~ 80 (518) T protein:vir:10 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) T ss_pred CcccCceeecCchhhhhhhhhhcccccccccceecccccchhhHHHhhhHHHHHHHHHHHHhhccCceEEEEEcCCCcee Confidence 5443322 12221111 1111 11100 00 11222345678999999999999999999999999999888888 Q ss_pred ccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEE--EEEEEcC-- Q lcl|NC_019422. 71 NPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLF--LKFLLRN-- 146 (384) Q Consensus 71 ~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~--~~~~~~~-- 146 (384) ..++.+++|+.+||++||+++||+.++.+++++||+|++++|+..|.+++||||+|..|++..+.+... |.|.... T Consensus 81 ~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~v~v~~~~~~~~~~y~~~~~~~~ 160 (518) T protein:vir:10 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAGV 160 (518) T ss_pred ccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCCceEEEEcCCCCEEEEEEEecCCc Confidence 888999999999999999999999999999999999999999999999999999999999998865433 3344333 Q ss_pred -ceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHH Q lcl|NC_019422. 147 -GKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFE 225 (384) Q Consensus 147 -g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~ 225 (384) ++.+.++++||||+|++++.+..+|+||+..+..++....++++++.++|+||++|+++|++++.+++++.+++++.|+ T Consensus 161 ~~~~~~~~~~eViHir~~s~dg~~~G~spi~~a~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~k~~~~ 240 (518) T protein:vir:10 161 GTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFD 240 (518) T ss_pred cceEEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHH Confidence 3667899999999998877666799999999999999999999999999999999999999999999999999999999 Q ss_pred HHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhc------cccHHHHHHHHHHHHHH Q lcl|NC_019422. 226 KNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQ------SKYSEDEWNAYYESEIE 298 (384) Q Consensus 226 ~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~------~~~~e~~~~~~~~~~i~ 298 (384) +.++| ..|+|+++|+++|++|+++++++.|+++.+. ++.+++||++|||||.+|| +++.|++..+|++.||. T Consensus 241 ~~~~G-~~nag~v~vL~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL~ 319 (518) T protein:vir:10 241 RAHSG-SSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMA 319 (518) T ss_pred HHhcC-ccccCcceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHHHHHHHHH Confidence 99976 4678999999999999999999999999876 5789999999999999997 24568899999999999 Q ss_pred HHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCC--CCCeeeecC Q lcl|NC_019422. 299 PVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIE--NGDKPVRRL 375 (384) Q Consensus 299 P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~--~gd~~~~~~ 375 (384) |++.+|+++||++|++..+. +.+++||++.+++.|.++++++++ ++++|++|+||+|+++|+||++ |||++++++ T Consensus 320 P~l~~ie~~ln~~L~~~~~~--~~~~~fd~~~llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~Gl~pie~~~gD~~~~~~ 397 (518) T protein:vir:10 320 IPIARIQSAMDKYVGQYWVR--KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANS 397 (518) T ss_pred HHHHHHHHHHHHhhcccccC--CceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeecc Confidence 99999999999999988764 458999999999999999999875 8999999999999999999985 899999999 Q ss_pred ceeecCCC--------C Q lcl|NC_019422. 376 DTAVVEGG--------E 384 (384) Q Consensus 376 n~~~~~~g--------e 384 (384) |++|++.. + T Consensus 398 n~~pl~~~~~~~~~g~~ 414 (518) T protein:vir:10 398 ALQPLGATPDGAVEGEE 414 (518) T ss_pred cceecccccccccCCCC Confidence 99987422 2 No 35 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=100.00 E-value=7.6e-87 Score=492.73 Aligned_cols=374 Identities=19% Similarity=0.274 Sum_probs=322.1 Q ss_pred CcchhhhcccCC----Cc-chhHHHhhccccCc-ceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccch Q lcl|NC_019422. 1 MNIFKSKKKNKE----AP-GKVMMELISDSGNG-FYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEI 74 (384) Q Consensus 1 M~~f~~~~~~~~----~~-~~~~~~~~~~~~~~-~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~ 74 (384) |+||+|+|+..- .. ......+..+.... +..+.+.++++++|++||++||++||++||+++++++ ...++ T Consensus 4 ~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~V~~ci~~ia~~ia~lp~~~~~~~~----~~~~~ 79 (409) T protein:vir:96 4 ENIVTRIKKKLIDNWIDQSASKLYDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYK----VVNTE 79 (409) T ss_pred ccchhhhhhHHhhhhhccccccccccccccCccccccchhhHhhhHHHHHHHHHHHHhhhhCceEEeeccc----ccchh Confidence 999998776521 11 11111111111112 2345567999999999999999999999999998653 45678 Q ss_pred HHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCE--EEEEEEcCceEEEE Q lcl|NC_019422. 75 YIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVL--FLKFLLRNGKIVSY 152 (384) Q Consensus 75 ~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~--~~~~~~~~g~~~~~ 152 (384) ++++|+.+||++||+++||+.++.+++++||+|+++.|+..|++.+|||++|..|++..+.++. .|.+...+|+.+.+ T Consensus 80 l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~g~~~~~ 159 (409) T protein:vir:96 80 VSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHAATGNKLIV 159 (409) T ss_pred HHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEcCCceEEEE Confidence 8899999999999999999999999999999999999999999999999999999999877653 45555667888899 Q ss_pred ehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhcccc Q lcl|NC_019422. 153 PYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQID 232 (384) Q Consensus 153 ~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~ 232 (384) +++||||++.+++.++++|+||+..+..++....+++++. ++.++..++++++.++.++++++++++++|.+.++ T Consensus 160 ~~~evih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~--- 234 (409) T protein:vir:96 160 HNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGSNVSTEKRQQVLEDFKQYYE--- 234 (409) T ss_pred ccccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHHH--HHhcCCCceeEEecCCCCCHHHHHHHHHHHHHHhh--- Confidence 9999999998889999999999999999999998887774 45555566788889999999999999999998774 Q ss_pred ccCCcceecCCCceeeecccchhHHHHHHHH-HHHHHHHHHhCCCHHHhcc------ccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 233 SEAGGAAATDSKYDAEQVKAESYVPNAAQMD-KAIQRLYSFFNTNEKIIQS------KYSEDEWNAYYESEIEPVGLQLS 305 (384) Q Consensus 233 ~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~~-~~~~~I~~~fgvp~~~l~~------~~~e~~~~~~~~~~i~P~~~~i~ 305 (384) ++++++++++|++|++++.++.++|+.+.+ +++++||++|||||.+||. ++.|++...|+++||.|++++|+ T Consensus 235 -n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e~~~~~f~~~~l~P~~~~ie 313 (409) T protein:vir:96 235 -ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYE 313 (409) T ss_pred -cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHH Confidence 467899999999999999999999998874 6788999999999999974 35689999999999999999999 Q ss_pred HHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeecC--- Q lcl|NC_019422. 306 NQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVE--- 381 (384) Q Consensus 306 ~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~--- 381 (384) ++||++|+++.++..+.+|+||++++++.|.++++++++ ++++|++|+||+|+++|+||+||||++++|+|++|++ T Consensus 314 ~~l~~~Ll~~~~~~~g~~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~NE~R~~~g~~pi~ggD~~~~~~n~~~~~~~~ 393 (409) T protein:vir:96 314 EEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPIDTPL 393 (409) T ss_pred HHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcceeeecccccccccch Confidence 999999999999888999999999999999999999876 8999999999999999999999999999999998874 Q ss_pred -------CCC Q lcl|NC_019422. 382 -------GGE 384 (384) Q Consensus 382 -------~ge 384 (384) +|| T Consensus 394 ~~~~~~~gG~ 403 (409) T protein:vir:96 394 ELRKSLKGGD 403 (409) T ss_pred hhcccccCCC Confidence 443 No 36 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=100.00 E-value=1.1e-86 Score=491.92 Aligned_cols=384 Identities=15% Similarity=0.108 Sum_probs=321.8 Q ss_pred CcchhhhcccCCCc-ch---hHHHhhccccCcce---echhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcc-eecc Q lcl|NC_019422. 1 MNIFKSKKKNKEAP-GK---VMMELISDSGNGFY---SWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEF-KTNP 72 (384) Q Consensus 1 M~~f~~~~~~~~~~-~~---~~~~~~~~~~~~~~---~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~-~~~~ 72 (384) ||||+++...+... .+ +....+.......+ +....+.++|+|++||+.||+.+|++|+++|+++++|. +... T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~lp~~~~~~~~dg~~~~~~ 80 (423) T protein:vir:81 1 MGFLQKLGLAPSVVATPEPIELVGPIFESLKLSTKNMTVEQIWEDQPHLRTVTTFIARNVASLQLQAFERVEDGGRERVR 80 (423) T ss_pred CchhHhhccccccccCccccccccccccccccccchhhHHHHHHhhhHHHHHHHHHHHhHhhCceEEEEEecCCceeeec Confidence 99999864443321 11 22222222222111 23344578999999999999999999999998876664 3445 Q ss_pred chHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC--CceeeEEEEcCceEEEEEcCCC---EEEEEEE--- Q lcl|NC_019422. 73 EIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDY--NMPTQIYPLNALNVEAIYENEV---LFLKFLL--- 144 (384) Q Consensus 73 ~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~--g~~~~l~~l~~~~v~~~~~~~~---~~~~~~~--- 144 (384) ++..+.|+.+||++||+++||+.++.+++++||+|+++.++.. +....|+|+++..+++....++ ..|.+.. T Consensus 81 ~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~~~~~~l~p~~~~~v~~~~~~~~~~~~~Y~~~~~~~ 160 (423) T protein:vir:81 81 EGHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVDTPTLDIRPIPVSWVQRRAYKDGWGSLDYIIIESGD 160 (423) T ss_pred cchHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcCcceEEEeecccceeeeeeccCCCcceEEEEEEecC Confidence 5556667779999999999999999999999999999998753 4677888899998888766543 2343332 Q ss_pred cCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCC-----CCChHHHHH Q lcl|NC_019422. 145 RNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKT-----ALRPDDIKK 219 (384) Q Consensus 145 ~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-----~~~~e~~~~ 219 (384) .+|..+.++++||||+|.+++.+.++|+||+..+..++....++++++.++|+||++|+++|+++. .++++++++ T Consensus 161 ~~g~~~~~~~~evih~r~~~~~~~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e~~~~ 240 (423) T protein:vir:81 161 NDGRSVKVPGERVIHRHGYNPKTMKRGKSPVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAESRTR 240 (423) T ss_pred CCceEEEEcccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHHHHHH Confidence 467888999999999998888888899999999999999999999999999999999999998763 578999999 Q ss_pred HHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc------ccHHHHHHHH Q lcl|NC_019422. 220 EVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS------KYSEDEWNAY 292 (384) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~------~~~e~~~~~~ 292 (384) ++++|++.+++...++|+++++++|++|++++.++.|+|+.+. ++.+++||++|||||.+||. ++.|++..+| T Consensus 241 ~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t~sn~e~~~~~f 320 (423) T protein:vir:81 241 FMANLRASFSPKSSDVGGTLLLEDGMKAENFHTTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNANYSNVREFRKAL 320 (423) T ss_pred HHHHHHHHhccccccCCcceecCCCceEEeccCChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHH Confidence 9999999998878889999999999999999999999999876 57888999999999999972 3568899999 Q ss_pred HHHHHHHHHHHHHHHHhhcccCcccc-cCcceEEeechhhhccCHHHHHHHHH-HH-hCCCCCHHHHHHHhCCCCCCCCC Q lcl|NC_019422. 293 YESEIEPVGLQLSNQYTEKLFTRKAR-SFGNEIVFEASNLQYASMSTKLNLVQ-MV-DRGSLTPNEWRKIMNLSPIENGD 369 (384) Q Consensus 293 ~~~~i~P~~~~i~~~l~~~l~~~~~~-~~~~~i~fd~~~~~~~d~~~~~~~~~-~~-~~g~~t~NE~R~~lG~~p~~~gd 369 (384) +++||.|++..||++|+++|+++.+. ..+.+++||++++++.|.++++++++ ++ +.||+|+||+|+++|+||+|||| T Consensus 321 ~~~~L~P~~~~ie~~l~~~L~~~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~~T~NE~R~~~gl~p~~gGD 400 (423) T protein:vir:81 321 YGDNLGSWIRIIQDVMNLFLLPRVGIDNEKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAWMTINEVRAMDNLPSIDGGD 400 (423) T ss_pred HHHHHHHHHHHHHHHHhhhhcCccccccCccEEEecchhhhccCHHHHHHHHHHHHhCCCCcCHHHHHHHhCCCCCCCcc Confidence 99999999999999999999998765 46789999999999999999999875 56 46999999999999999999999 Q ss_pred eeeecCceeecCCCC Q lcl|NC_019422. 370 KPVRRLDTAVVEGGE 384 (384) Q Consensus 370 ~~~~~~n~~~~~~ge 384 (384) ++++|+|+.+.+..+ T Consensus 401 ~~~~p~n~~~~~~~~ 415 (423) T protein:vir:81 401 DLARPLNTEFGDSED 415 (423) T ss_pred eeecccccccCccCC Confidence 999999999876444 No 37 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=100.00 E-value=2.1e-86 Score=490.27 Aligned_cols=371 Identities=15% Similarity=0.156 Sum_probs=315.6 Q ss_pred CcchhhhcccCCCcc-hh-H----HHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccch Q lcl|NC_019422. 1 MNIFKSKKKNKEAPG-KV-M----MELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEI 74 (384) Q Consensus 1 M~~f~~~~~~~~~~~-~~-~----~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~ 74 (384) ||||++.++.+.... .. . ......+..+-..+...++++++|++||+.||+.+|++||++|+.++++ +...++ T Consensus 1 Mgl~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~-~~~~~~ 79 (409) T protein:vir:84 1 MSLFTRIFSGPSEERTLTKISGIPSPAEDWAMHGDRPGANSAMTLGAFYACVTLLADTVASLSIDAYRKKDNV-RIPVSP 79 (409) T ss_pred CchhhhhhcCCCcccccccccccccccchhhccCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCc-ccccch Confidence 999997655432211 11 0 0011112223345567889999999999999999999999999877554 456788 Q ss_pred HHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEe-eCCCCceeeEEEEcCceEEEEEcCCC--EEEEEEEc-CceEE Q lcl|NC_019422. 75 YIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVII-KDDYNMPTQIYPLNALNVEAIYENEV--LFLKFLLR-NGKIV 150 (384) Q Consensus 75 ~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~-~~~~g~~~~l~~l~~~~v~~~~~~~~--~~~~~~~~-~g~~~ 150 (384) ++++|+.+||++||+++||+.++.+++++||+|+++. ++..|.+.+||+++|.+|++....+. ..+++... +| . T Consensus 80 l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~p~~v~v~~~~~~~~~~~~~~~~~~g--~ 157 (409) T protein:vir:84 80 APKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMPIHPDCIHVTDAKDEDGDWIEPVYRIDG--K 157 (409) T ss_pred HHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEEEcCceeEEEEcCCCcceEEEEEecCCc--e Confidence 8999999999999999999999999999999999986 67889999999999999998765443 33333333 33 4 Q ss_pred EEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhcc Q lcl|NC_019422. 151 SYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQ 230 (384) Q Consensus 151 ~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~ 230 (384) .++++||||++++++.+.++|+||+..+..++....++.+++.++|+||++|+++|++++.+++++.++++++|.+.+. T Consensus 158 ~~~~~dvih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~- 236 (409) T protein:vir:84 158 VVPNHRIMHIKRYPVAGCALGMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSDADLTPDQVKQTQKQWIQSHH- 236 (409) T ss_pred EEchhhEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHhc- Confidence 5889999999998888888999999999999999999999999999999999999999999999999999999987653 Q ss_pred ccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc--------ccHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 231 IDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS--------KYSEDEWNAYYESEIEPVG 301 (384) Q Consensus 231 ~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~--------~~~e~~~~~~~~~~i~P~~ 301 (384) ++++++++++|++|++++.++.++++.+. ++++++||++|||||.+||. ++.|++..+|+++||.|++ T Consensus 237 ---n~g~~~vl~~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~f~~~~l~P~~ 313 (409) T protein:vir:84 237 ---NRRLPAVMSAGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQGINFVRHTLLPWL 313 (409) T ss_pred ---cCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHHHHHH Confidence 56889999999999999999999999876 57899999999999999973 3457888999999999999 Q ss_pred HHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeec Q lcl|NC_019422. 302 LQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVV 380 (384) Q Consensus 302 ~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~ 380 (384) +.||++||++|. .+.+|+||++.+++.|.++++++++ ++++|++|+||+|+++|+||+||||++++|+|++++ T Consensus 314 ~~ie~~l~~~L~------~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~ggD~~~~~~n~~~~ 387 (409) T protein:vir:84 314 RCIEQALDTFLP------RGQFVKFNVDGLMRGDVTARFTAYQMGLQNGIWSVNEVRAWEDAPPIPEGDIHLQPMNFVPL 387 (409) T ss_pred HHHHHHHHHhcc------CCCeEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccccccc Confidence 999999999873 2568999999999999999999875 899999999999999999999999999999999998 Q ss_pred CCCC Q lcl|NC_019422. 381 EGGE 384 (384) Q Consensus 381 ~~ge 384 (384) +.-+ T Consensus 388 ~~~~ 391 (409) T protein:vir:84 388 GYVP 391 (409) T ss_pred ccCC Confidence 5322 No 38 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=100.00 E-value=7.7e-86 Score=487.22 Aligned_cols=374 Identities=19% Similarity=0.272 Sum_probs=319.4 Q ss_pred CcchhhhcccC-----CCcchhHHHhhcccc-CcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccch Q lcl|NC_019422. 1 MNIFKSKKKNK-----EAPGKVMMELISDSG-NGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEI 74 (384) Q Consensus 1 M~~f~~~~~~~-----~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~ 74 (384) =+++.++++.- ..+....+....... ..+..+.++++++++|++||+.||++||++||+++++++ ...++ T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~V~~ci~~Ia~~ia~lp~~~~~~~~----~~~~~ 79 (409) T protein:vir:93 4 ENIVTRIKKKLIDNWIDQSTSKLYDFSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYK----VVNTE 79 (409) T ss_pred cchhhhhhhhhhhhhhccccccccccccccCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEeeccc----cccch Confidence 34555533321 111111111111111 223345677999999999999999999999999998664 34677 Q ss_pred HHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCE--EEEEEEcCceEEEE Q lcl|NC_019422. 75 YIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVL--FLKFLLRNGKIVSY 152 (384) Q Consensus 75 ~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~--~~~~~~~~g~~~~~ 152 (384) +.++|+.+||++||+++||+.++.+++++||+|+++.|+..|++.+||+++|+.|++..+.++. +|.+...+|..+.+ T Consensus 80 ~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~~~~~~~~~~~~y~~~~~~g~~~~~ 159 (409) T protein:vir:93 80 VSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHAATGNKLIV 159 (409) T ss_pred HHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEcCCceEEEE Confidence 8899999999999999999999999999999999999999999999999999999998876654 45566667888899 Q ss_pred ehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhcccc Q lcl|NC_019422. 153 PYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQID 232 (384) Q Consensus 153 ~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~ 232 (384) +++||||++++++.++++|+||+..+..++....+++++. ++.++..++++++.++.+++++++++++.|++.+. T Consensus 160 ~~~eVih~r~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~--- 234 (409) T protein:vir:93 160 HNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYE--- 234 (409) T ss_pred ccccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHHH--HHhcCCCCceEEecCCCCCHHHHHHHHHHHHHHhh--- Confidence 9999999999889999999999999999999999888874 56666667889999999999999999999998764 Q ss_pred ccCCcceecCCCceeeecccchhHHHHHHHH-HHHHHHHHHhCCCHHHhcc------ccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 233 SEAGGAAATDSKYDAEQVKAESYVPNAAQMD-KAIQRLYSFFNTNEKIIQS------KYSEDEWNAYYESEIEPVGLQLS 305 (384) Q Consensus 233 ~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~~-~~~~~I~~~fgvp~~~l~~------~~~e~~~~~~~~~~i~P~~~~i~ 305 (384) ++++++++++|++|++++.++.++|+.+.+ +++++||++|||||.+||. ++.|++..+|++.||.|++++|+ T Consensus 235 -~~g~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie 313 (409) T protein:vir:93 235 -ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYE 313 (409) T ss_pred -cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHH Confidence 457899999999999999999999998874 6889999999999999974 35688999999999999999999 Q ss_pred HHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeecC--- Q lcl|NC_019422. 306 NQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVE--- 381 (384) Q Consensus 306 ~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~--- 381 (384) ++||++|+++.++..+.+|+||++++++.|.+++++.++ ++++|++|+||+|+++|+||+||||++++|+|++|++ T Consensus 314 ~~l~~~Ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~ 393 (409) T protein:vir:93 314 EEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPIDTPL 393 (409) T ss_pred HHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccccccch Confidence 999999999998888899999999999999999999875 8999999999999999999999999999999998884 Q ss_pred -------CCC Q lcl|NC_019422. 382 -------GGE 384 (384) Q Consensus 382 -------~ge 384 (384) +|| T Consensus 394 ~~~~~~~gG~ 403 (409) T protein:vir:93 394 ELRKSLKGGD 403 (409) T ss_pred hhcccccCCC Confidence 343 No 39 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=100.00 E-value=2.1e-85 Score=484.84 Aligned_cols=374 Identities=15% Similarity=0.123 Sum_probs=320.8 Q ss_pred CcchhhhcccCCCcchhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLL 80 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~ 80 (384) |+||++++....+..+.+..++.+..+..++ ..+++++++|++||++||+.||++|+++++.+ +.+...+++.++|+ T Consensus 1 m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~Al~~~~V~~~i~~Ia~~iA~lp~~~~~~~--g~~~~~~~~~~lL~ 77 (406) T protein:vir:97 1 MSFFQPLGTSKVSYDDYISSVLAGDVSQKYL-GVSALKNSDILTATSIIAGDIARFPLVKKDVN--GDIIHDEDINYLLN 77 (406) T ss_pred CccccccCCCCCCcchHHHHHhcCCCCcccc-cchhhccHHHHHHHHHHHHhhhhCeeEEEecC--ccccccchHHHHhh Confidence 9999987777767677676666554443333 34689999999999999999999999876544 34556678889898 Q ss_pred hhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCC-CCceeeEEEEcCceEEEEEcCCCE-EEEEEE-cCceEEEEehhhe Q lcl|NC_019422. 81 ENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDD-YNMPTQIYPLNALNVEAIYENEVL-FLKFLL-RNGKIVSYPYSDI 157 (384) Q Consensus 81 ~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~-~g~~~~l~~l~~~~v~~~~~~~~~-~~~~~~-~~g~~~~~~~~ev 157 (384) .+||++||+++||+.++.+++++||+|+++.|+. .|.+.+|+|++|+.|++..+.++. .|.+.. .+|..+.++++|| T Consensus 78 ~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~L~~i~p~~v~v~~~~~~~~~y~~~~~~~~~~~~~~~~ev 157 (406) T protein:vir:97 78 VKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQALQFQFYRPSETTVEETDNHEIVYTFTDMLTAKQVKCFAHDV 157 (406) T ss_pred ccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEEEEEECCCeeEEEEcCCceEEEEEEecCCceEEEEccccE Confidence 9999999999999999999999999999999985 689999999999999998877654 444443 4578889999999 Q ss_pred EEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCc Q lcl|NC_019422. 158 IHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGG 237 (384) Q Consensus 158 ih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~ 237 (384) ||+|++ +.++++|+||+.++..++....+++++..++|+||+.|++++..++.+++++.+++++.|++.+++ .++|+ T Consensus 158 ih~r~~-~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~g--~n~g~ 234 (406) T protein:vir:97 158 IHWKFF-SHDTILGRSPLLSLGDEIDLQTGGINTLIKFFKDGFSSGILTMKGAQLSGDARQRARQEFEKMREG--SVGGS 234 (406) T ss_pred EEecCC-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEecCCCCCHHHHHHHHHHHHHHhcc--cccCc Confidence 999964 678899999999999999999999999999999999999999999999999999999999988865 46789 Q ss_pred ceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc----ccHHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_019422. 238 AAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS----KYSEDEWNAYYESEIEPVGLQLSNQYTEKL 312 (384) Q Consensus 238 ~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~----~~~e~~~~~~~~~~i~P~~~~i~~~l~~~l 312 (384) ++|+++|++|++++.++.|+++.+. ++++++||++|||||.+||. ++.+++..+|++.||.|+++.||++|+++| T Consensus 235 ~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~~~~e~~~~~f~~~~l~P~~~~ie~~l~~kl 314 (406) T protein:vir:97 235 PLVFDSTMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKLGVNSPNQSVAQLMEDYVTNDLPFYFDAITSELGLKT 314 (406) T ss_pred eeecCCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 9999999999999999999999876 56899999999999999974 356788899999999999999999999999 Q ss_pred cCcccccCcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCeeeecCceeecCCCC Q lcl|NC_019422. 313 FTRKARSFGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN--GDKPVRRLDTAVVEGGE 384 (384) Q Consensus 313 ~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~--gd~~~~~~n~~~~~~ge 384 (384) +++.++. +.+++||++++.+.+.+ .+.+++++|++|+||+|+++|+||+++ ||++++|+|++|++..| T Consensus 315 l~~~~~~-~~~i~fd~~~~~~~~~~---~~~~~~~~g~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~ 384 (406) T protein:vir:97 315 LNDKDRR-LYHIEFDTRSVTGRNVD---EIVKLVNNQILTPNQGLVELGKQKSTDPNMDRYQSSLNYVFLDKKE 384 (406) T ss_pred cChhhcc-ceeEEEecCccchhhHH---HHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeEeeccCccchhccc Confidence 9987653 57889998765544332 234678999999999999999999965 99999999999886443 No 40 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=100.00 E-value=2.1e-85 Score=484.79 Aligned_cols=375 Identities=16% Similarity=0.179 Sum_probs=319.0 Q ss_pred CcchhhhcccCCCcc-----hhHHH---hhccccCcce-echhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceec Q lcl|NC_019422. 1 MNIFKSKKKNKEAPG-----KVMME---LISDSGNGFY-SWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTN 71 (384) Q Consensus 1 M~~f~~~~~~~~~~~-----~~~~~---~~~~~~~~~~-~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~ 71 (384) |+||+++++...... ..... ....+..... .....+.++++|++||++||+++|++||+++++++++.++. T Consensus 13 m~~F~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cI~~ia~~ia~~~~~~~~~~~~~~~~~ 92 (413) T protein:vir:96 13 LKFFNNKRSPTEESKAKDEIPKAPQVVMTLPNFFKELISDGYTKLSDSPEVRMAVDCIADLVSNMTIQLMQNGETGDKRI 92 (413) T ss_pred CCccccCCCcchhhhhhccccccccccccchhhHhhhccchhHHHhhchHHHHHHHHHHHhhccCceEEEEecCCCcccc Confidence 999988654322110 00000 0001111111 11234678999999999999999999999999998888888 Q ss_pred cchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCC-ceeeEEEEcCceEEEEEcCCCEEEEEEEcCceEE Q lcl|NC_019422. 72 PEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYN-MPTQIYPLNALNVEAIYENEVLFLKFLLRNGKIV 150 (384) Q Consensus 72 ~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g-~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~ 150 (384) .++++++|+.+||++||+++||+.++.+++++||+|++++++..| .+.+|||++|.+|++..+.+...|.+...++ T Consensus 93 ~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~~~~~L~~l~~~~v~~~~~~~~~~y~~~~~~~--- 169 (413) T protein:vir:96 93 KNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGDKIIGLTPISPYKVTFNVSDDDLDYSITFDNK--- 169 (413) T ss_pred ccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCceEEEEEecCceeEEEEcCCeEEEEEeecCc--- Confidence 899999999999999999999999999999999999999999887 5789999999999999998887777766653 Q ss_pred EEehhheEEEecc-CCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhc Q lcl|NC_019422. 151 SYPYSDIIHLRKD-FNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYL 229 (384) Q Consensus 151 ~~~~~evih~~~~-~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~ 229 (384) .++++||||+|.+ ++.++++|+||+.++..++....++++++.++|+||++|+++|++++.+++++.++++++|++.++ T Consensus 170 ~~~~~evih~k~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~ 249 (413) T protein:vir:96 170 EYDPSTLLHFVLNPSIERPFIGTGYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVDSDSDELSDEEGRENFEEMYL 249 (413) T ss_pred EEchhhEEEEeccCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhc Confidence 5789999999964 566789999999999999999999999999999999999999999999999999999999999998 Q ss_pred cccccCCcceecCCC-ceeeecc-cchhHHHHHHH-HHHHHHHHHHhCCCHHHhc-cccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 230 QIDSEAGGAAATDSK-YDAEQVK-AESYVPNAAQM-DKAIQRLYSFFNTNEKIIQ-SKYSEDEWNAYYESEIEPVGLQLS 305 (384) Q Consensus 230 ~~~~~~~~~~v~~~g-~~~~~l~-~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~-~~~~e~~~~~~~~~~i~P~~~~i~ 305 (384) |. .++|+++++++| .++.++. .++.++++.+. ++++++||++|||||.+|| +++++++..+|++.||.|+++.|+ T Consensus 250 g~-~n~g~~~vl~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~~~~~~~~~l~P~~~~ie 328 (413) T protein:vir:96 250 KR-KEAGKPWIIPEGMVNVQQIKPLTLNDLAINDAVTLDKKTVAGIFGVPAFLLGVGTYNKDEFNNFINTKIMSIAQVIQ 328 (413) T ss_pred Cc-cccCceeeecCCcccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCcchHHHHHHHHHHHHHHHHHHHH Confidence 74 567888887655 5566664 57889999876 4688999999999999998 567788889999999999999999 Q ss_pred HHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeecCCC- Q lcl|NC_019422. 306 NQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVEGG- 383 (384) Q Consensus 306 ~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~~g- 383 (384) ++||++|+++ +.+++||++++++.|.++++++++ ++.+|++|+||+|+++|+||+||||++++|+|++|++.. T Consensus 329 ~~ln~~ll~~-----~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~~gd~~~~~~n~~~~~~~~ 403 (413) T protein:vir:96 329 QTYNKLIVEE-----DMYFSLNPRSLYNYSLTEMVSAGAQMTQLNALRRNEFRNWVGMPPDAEMDDLLVLENYLQQKDLV 403 (413) T ss_pred HHHHHhhCCC-----CcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeecccccchhhcc Confidence 9999999874 468999999999999999999875 899999999999999999999999999999999997532 Q ss_pred --------C Q lcl|NC_019422. 384 --------E 384 (384) Q Consensus 384 --------e 384 (384) | T Consensus 404 ~~~~~~~~d 412 (413) T protein:vir:96 404 NQKKLIQDE 412 (413) T ss_pred cccCCCCCC Confidence 2 No 41 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=100.00 E-value=4.1e-85 Score=483.20 Aligned_cols=374 Identities=19% Similarity=0.265 Sum_probs=319.3 Q ss_pred CcchhhhcccCC-----CcchhHHHhhccccCc-ceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccch Q lcl|NC_019422. 1 MNIFKSKKKNKE-----APGKVMMELISDSGNG-FYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEI 74 (384) Q Consensus 1 M~~f~~~~~~~~-----~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~ 74 (384) =++++++++..- .+......+....... ...+.+.++++++|++||++||++||++||+++++++ ...++ T Consensus 4 ~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~Ia~~ia~lp~~~~~~~~----~~~~~ 79 (409) T protein:vir:94 4 ENIVTRIKKKLIDNWIDQSASKLYDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYK----VVNTE 79 (409) T ss_pred cccchhhhhHHhhhhhcCCcccccccccccCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEeeccc----ccchh Confidence 356665554311 0111111111111112 2245567999999999999999999999999998654 34577 Q ss_pred HHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEE--EEEEEcCceEEEE Q lcl|NC_019422. 75 YIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLF--LKFLLRNGKIVSY 152 (384) Q Consensus 75 ~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~--~~~~~~~g~~~~~ 152 (384) ++++|+.+||++||+++||+.++.+++++||+|+++.|+..|++.+|||++|.+|++..+.++.. |.+...+|..+.+ T Consensus 80 ~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~g~~~~~ 159 (409) T protein:vir:94 80 VSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHAATGNKLIV 159 (409) T ss_pred HHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEcCCceEEEE Confidence 88899999999999999999999999999999999999999999999999999999988776543 4555667888899 Q ss_pred ehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhcccc Q lcl|NC_019422. 153 PYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQID 232 (384) Q Consensus 153 ~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~ 232 (384) +++||||+|++++.++++|+||+..+.+++....+++++. ++.++..++++++.++.+++++++++++.|++.++ T Consensus 160 ~~~dvih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~--- 234 (409) T protein:vir:94 160 HNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYE--- 234 (409) T ss_pred ccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHH--HHhcCCCCeeEEecCCCCCHHHHHHHHHHHHHHhh--- Confidence 9999999998889999999999999999999999988874 55566667789999999999999999999998764 Q ss_pred ccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc------ccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 233 SEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS------KYSEDEWNAYYESEIEPVGLQLS 305 (384) Q Consensus 233 ~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~------~~~e~~~~~~~~~~i~P~~~~i~ 305 (384) ++++++++++|++|++++.++.++|+.+. ++++++||++|||||.+||+ ++.|++..+|++.||.|+++.|+ T Consensus 235 -~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie 313 (409) T protein:vir:94 235 -ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYE 313 (409) T ss_pred -cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHH Confidence 46789999999999999999999999877 56889999999999999974 45689999999999999999999 Q ss_pred HHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeecC--- Q lcl|NC_019422. 306 NQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVE--- 381 (384) Q Consensus 306 ~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~--- 381 (384) ++||++|+++.++..+.+|+||.+++++.|.+++++.++ ++++|++|+||+|+++|+||+||||++++++|++|++ T Consensus 314 ~~ln~~Ll~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~ 393 (409) T protein:vir:94 314 EEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPIDTPL 393 (409) T ss_pred HHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeEeecccccccccch Confidence 999999999999888999999999999999999999875 8999999999999999999999999999999998874 Q ss_pred -------CCC Q lcl|NC_019422. 382 -------GGE 384 (384) Q Consensus 382 -------~ge 384 (384) ||| T Consensus 394 ~~~~~~kGG~ 403 (409) T protein:vir:94 394 ELRKSLKGGD 403 (409) T ss_pred hhcccccCCC Confidence 443 No 42 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=100.00 E-value=1.4e-84 Score=480.28 Aligned_cols=374 Identities=17% Similarity=0.199 Sum_probs=319.5 Q ss_pred CcchhhhcccCCCc----chhHHHhhc--cccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccch Q lcl|NC_019422. 1 MNIFKSKKKNKEAP----GKVMMELIS--DSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEI 74 (384) Q Consensus 1 M~~f~~~~~~~~~~----~~~~~~~~~--~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~ 74 (384) ||||++.++.+..+ .......+. ...........+++++++|++||++||+.+|++||++|+.++++.+...++ T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~ 80 (406) T protein:vir:95 1 MGLFDRWRRTKRKSKIRADTGYVGLFMSGEDVSFLVPGYVRLSDNPEVRMAVHKIADLISSMTIYLMQNTEDGDIRIRNE 80 (406) T ss_pred CcchhhhccccccccccccchhhhhhccCcccCccccCHHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeecch Confidence 99998754333222 122222221 122233345577899999999999999999999999999999888888999 Q ss_pred HHHHHHhhccccCCHHHHHHHHHHHHHHhCCe--eEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEcCceEEEE Q lcl|NC_019422. 75 YIKFLLENPNPFMSGQILQEKMVTQLELNSNA--FAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLRNGKIVSY 152 (384) Q Consensus 75 ~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~--~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~ 152 (384) ++++|+.+||++||+++||+.++.+++++|++ |+++.++..|.+.+|||++|.+|++..+.++..+.+ +| +.+ T Consensus 81 ~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~g~~~~l~~i~~~~v~~~~~~~~~~~~~---~~--~~~ 155 (406) T protein:vir:95 81 LSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTADGLIDELVPLTPSKVNFLDTPDGYQVLY---GG--QTF 155 (406) T ss_pred HHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCCCcEEEEEEEcCceeEEEEcCCeEEEEe---cc--EEE Confidence 99999999999999999999999999999765 666789999999999999999999999988744433 22 368 Q ss_pred ehhheEEEec-cCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccc Q lcl|NC_019422. 153 PYSDIIHLRK-DFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQI 231 (384) Q Consensus 153 ~~~evih~~~-~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~ 231 (384) +++||||+++ +++.++++|+||+..+..++....++.+++.++|+||++|++++++++.+++++.++++++|.+.++|. T Consensus 156 ~~~evih~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~~~~~~~~~g~ 235 (406) T protein:vir:95 156 NYDEVLHFIYNPDPERPYIGRGYRVVLKDIADNLKQATATKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAVFKKYLQA 235 (406) T ss_pred chhHEEEeeccCCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhccc Confidence 9999999996 467789999999999999999999999999999999999999999999999999999999999999875 Q ss_pred cccCCcceec-CCCceeeecc-cchhHHHHHHH-HHHHHHHHHHhCCCHHHhc-cccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 232 DSEAGGAAAT-DSKYDAEQVK-AESYVPNAAQM-DKAIQRLYSFFNTNEKIIQ-SKYSEDEWNAYYESEIEPVGLQLSNQ 307 (384) Q Consensus 232 ~~~~~~~~v~-~~g~~~~~l~-~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~-~~~~e~~~~~~~~~~i~P~~~~i~~~ 307 (384) . ++++++|+ ++|.+++++. .++.++++.+. ++.+++||++|||||.+|| +++++++..+|++.||.|++++|+++ T Consensus 236 ~-n~~~~~v~~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~l~P~~~~ie~~ 314 (406) T protein:vir:95 236 T-EAGQPWIIPAELLEVEQVKPLSLKDIAINEAVELDKRTVAGMFGVPAFLLGIGEFNRDEYNNFINSTILPIAKGIEQE 314 (406) T ss_pred c-ccCCceeecCCCccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCchHHHHHHHHHHHHHHHHHHHHHH Confidence 4 55666665 4566777765 58899998776 5788999999999999997 56778889999999999999999999 Q ss_pred HhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeecC----- Q lcl|NC_019422. 308 YTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVE----- 381 (384) Q Consensus 308 l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~----- 381 (384) ||++|+++.+ .+++||++++++.|.+++++.++ ++.+|++|+||+|+++|+||+||||++++|+|++|++ T Consensus 315 l~~~l~~~~~----~~~~fd~~~l~~~d~~~~~~~~~~l~~~G~~t~NE~R~~~gl~p~~~gd~~~~~~n~~~~~~~~~~ 390 (406) T protein:vir:95 315 LTRKLLISPD----LYFKFNPRSLYAYDLKELAEVGSNMYVRGIMEGNEVRDWLGLSPKEGLSELVILENYIPLDKIGDQ 390 (406) T ss_pred HHHhcCCCCC----cEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccCccchhhcccc Confidence 9999998643 47999999999999999999875 8999999999999999999999999999999998773 Q ss_pred ----CCC Q lcl|NC_019422. 382 ----GGE 384 (384) Q Consensus 382 ----~ge 384 (384) +|| T Consensus 391 ~~~k~g~ 397 (406) T protein:vir:95 391 SKLKGGD 397 (406) T ss_pred cccCCCC Confidence 333 No 43 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=100.00 E-value=2.7e-84 Score=478.70 Aligned_cols=373 Identities=15% Similarity=0.164 Sum_probs=313.7 Q ss_pred CcchhhhcccCCCcchhHHH----hhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMME----LISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYI 76 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~ 76 (384) |+||.+......+. +... .+..+..+-++. .+++++++||+||++||+.||++|+++++++.++. ...++++ T Consensus 1 m~~~~~~~~~~~~~--~~~~~~~~~~~~~~~g~~~~-~~Al~~~~V~~cv~~ia~~iA~lp~~~~~~~~~~~-~~~~~~~ 76 (417) T protein:vir:38 1 MKLFRGLATEVDPH--WADHLLDSGVIPSFRGGYLG-ISALRNSDVLTAVSIVSGDVSRFPLVITDSSTDEV-IDLANIE 76 (417) T ss_pred CccccccccCCCcc--chhhhcccccccccCCceec-hhhcccHHHHHHHHHHHHhhccCeeEEEEcCCcce-eccchHH Confidence 99996544443322 2111 122222223332 46889999999999999999999999998876654 3456788 Q ss_pred HHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC-CceeeEEEEcCceEEEEEcCCC-EEEEEEEcCc-eEEEEe Q lcl|NC_019422. 77 KFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDY-NMPTQIYPLNALNVEAIYENEV-LFLKFLLRNG-KIVSYP 153 (384) Q Consensus 77 ~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~-g~~~~l~~l~~~~v~~~~~~~~-~~~~~~~~~g-~~~~~~ 153 (384) ++|+.+|||+||+++||+.++.+++++||+|++++|+.. |.+..|++++|..|++....++ ..|+|...+| ....++ T Consensus 77 ~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~l~~l~p~~v~v~~~~~~~~~y~~~~~~~~~~~~~~ 156 (417) T protein:vir:38 77 YLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEPAMFEFYAPSQTQVDTSDPDNIIYRFTPYNSSMQKVCG 156 (417) T ss_pred HHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEEeCCceEEEEEcCCCeEEEEEEEcCCcEEEEec Confidence 889999999999999999999999999999999999865 6799999999999999876655 4555655554 456789 Q ss_pred hhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccc Q lcl|NC_019422. 154 YSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDS 233 (384) Q Consensus 154 ~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~ 233 (384) ++||||+|++ +.++++|+||+.++..++....++++++.++|+||++|+++++.++.+++++.+++++.|++.+++ . T Consensus 157 ~~dviH~r~~-~~d~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~l~~e~~~~~~~~~~~~~~g--~ 233 (417) T protein:vir:38 157 FEDVIHWKFF-SYDTIMGRSPLLSLGDEIGLQESGVSTLQKFFKSGLKGSIIKAKESRLSAEARQKIREDFERAQAG--A 233 (417) T ss_pred CcceEEecCC-CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCHHHHHHHHHHHHHHhcc--c Confidence 9999999965 678899999999999999999999999999999999999999999999999999999999998866 3 Q ss_pred cCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc----ccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 234 EAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS----KYSEDEWNAYYESEIEPVGLQLSNQY 308 (384) Q Consensus 234 ~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~----~~~e~~~~~~~~~~i~P~~~~i~~~l 308 (384) ++|+++++++|++|++++.++.++++.+. ++++++||++|||||.+||. ++.+++..+|++.||.|+++.|+++| T Consensus 234 n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~s~~e~~~~~~~~~tl~P~~~~ie~~l 313 (417) T protein:vir:38 234 DAGSPIIVDATMDYQPLEVDTNVLNLINSNNYSTAQIAKALRVPAYRLAQNSPNQSVKQLADDYIRNDLPFYFEPITSEF 313 (417) T ss_pred ccCCceeccCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHhCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 67999999999999999999999999876 56899999999999999974 34578889999999999999999999 Q ss_pred hhcccCcccccCcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC--CeeeecCceeecCC---- Q lcl|NC_019422. 309 TEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIENG--DKPVRRLDTAVVEG---- 382 (384) Q Consensus 309 ~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~g--d~~~~~~n~~~~~~---- 382 (384) |++|+++.++. +.+++||++.+.+.+ +.+..+++++|++|+||+|+++|+||+||| |++++|+|+++++. T Consensus 314 ~~~Ll~~~~~~-~~~~~fd~~~l~~~~---~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~d~~~~~~n~~~~d~~~~~ 389 (417) T protein:vir:38 314 ELKLLDDAQRH-QYCIGFDTKSVNGLP---IADVNTAVNGGLWTGNEGRAELGKKPLKDPNMDRIQSTLNTVFLDQKEAY 389 (417) T ss_pred HhhhcChhhcc-cceEEechhhhhHHH---HHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeeccccccccccccc Confidence 99999987764 578999988876543 334456889999999999999999999876 88999999987762 Q ss_pred ----------CC Q lcl|NC_019422. 383 ----------GE 384 (384) Q Consensus 383 ----------ge 384 (384) || T Consensus 390 ~~~~~~~~kgg~ 401 (417) T protein:vir:38 390 QAEHAAELKGGD 401 (417) T ss_pred ccccccccCCCC Confidence 22 No 44 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=100.00 E-value=5e-84 Score=477.25 Aligned_cols=374 Identities=18% Similarity=0.220 Sum_probs=314.4 Q ss_pred CcchhhhcccCCC-cchhHHHhhcccc-Ccceec-hhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEA-PGKVMMELISDSG-NGFYSW-HGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIK 77 (384) Q Consensus 1 M~~f~~~~~~~~~-~~~~~~~~~~~~~-~~~~~~-~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~ 77 (384) ||||+..++...+ +......+.+... ...... ...+.++|+|++||++||+.||++|+++|++.++|.+...+++.+ T Consensus 1 Mg~~~~f~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~V~~~I~~ia~~iA~~p~~~~~~~~~g~~~~~~~~~~ 80 (403) T protein:vir:80 1 MGLFNFFRRKTRSEPTNAISWFLTQEAYDTLAIPGYTRLSDNPEVRMAVHKIAELISSMTIHLMQNTDNGDIRIKNELSR 80 (403) T ss_pred CcccccccccccccccchhhhhcccccccccccchhhhhhhhHHHHHHHHHHHHhhhhCceEEEEecCCceeecCChHHH Confidence 9999854433222 1222222222211 111111 134557899999999999999999999999988888888888999 Q ss_pred HHHhhccccCCHHHHHHHHHHHHHHh--CCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEcCceEEEEehh Q lcl|NC_019422. 78 FLLENPNPFMSGQILQEKMVTQLELN--SNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLRNGKIVSYPYS 155 (384) Q Consensus 78 ~l~~~PN~~~s~~~f~~~~~~~~l~~--G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~~~ 155 (384) +|+.+||+.||+++||+.++.++++. ||||+++.++..|++.+||||+|..|++..+.++..+.|.. ..++++ T Consensus 81 lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~g~~~~L~~l~p~~v~~~~~~~g~~~~y~~-----~~~~~~ 155 (403) T protein:vir:80 81 KIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTSGLIDELIPLAPSKVSFVDTDTGYQIWYQG-----KAYNYD 155 (403) T ss_pred HHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCCCcEEEEEEEcCCeeEEEEcCCceEEEEee-----cccchh Confidence 99999999999999999999999984 78999999999999999999999999999998886655532 357899 Q ss_pred heEEEec-cCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhcccccc Q lcl|NC_019422. 156 DIIHLRK-DFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSE 234 (384) Q Consensus 156 evih~~~-~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~ 234 (384) ||||++. +.+.++++|+||+..+..++....++++++.++|+||++|++++++++.+++++.++.+++|.+.+.+.. + T Consensus 156 eiih~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 234 (403) T protein:vir:80 156 EVLHFIVNPDPEKPYMGRGYRVVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAVFKKYLEAS-E 234 (403) T ss_pred hEEEEeccCCCcCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCChHHHHHHHHHHHHHHhhhh-h Confidence 9999995 5678889999999999999999999999999999999999999999999999999999999999987754 5 Q ss_pred CCcceecCCC-ceeeecc-cchhHHHHHHH-HHHHHHHHHHhCCCHHHhc-cccHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019422. 235 AGGAAATDSK-YDAEQVK-AESYVPNAAQM-DKAIQRLYSFFNTNEKIIQ-SKYSEDEWNAYYESEIEPVGLQLSNQYTE 310 (384) Q Consensus 235 ~~~~~v~~~g-~~~~~l~-~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~-~~~~e~~~~~~~~~~i~P~~~~i~~~l~~ 310 (384) +|++++++++ .++.++. .++.++++.+. ++++.+||++|||||.+|| ++..+++..+|+..||.|++++||++||+ T Consensus 235 ~g~~~~~~~~~~~~~~~~~l~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~f~~~~l~P~~~~ie~~l~~ 314 (403) T protein:vir:80 235 AGQPWIIPAELLDVEQVKPLSLKDLAIHETVELDKRTVAGIFGVPAFLLGVGKYDKDEYNNFINSTILPIAKGIEQELTR 314 (403) T ss_pred cCCeeeecccccccceeccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6777776544 4555554 47788998766 5788999999999999998 55667778899999999999999999999 Q ss_pred cccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeecC-------- Q lcl|NC_019422. 311 KLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVE-------- 381 (384) Q Consensus 311 ~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~-------- 381 (384) +|+++.+ .+++||++.+++.|.+++++.++ ++++|++|+||+|+++|+||+||||++++++|++|++ T Consensus 315 kll~~~~----~~~~f~~~~ll~~d~~~~~~~~~~~~~~Gi~t~NE~R~~~gl~p~~ggd~~~~~~n~~pl~~~~~~~~~ 390 (403) T protein:vir:80 315 KLLISPD----LYFKFNPRSLYAYDLKELAEVGSNMYVRGLMEGNEVRDWLGLSPKEGLSELVILENYIPLDKIGDQNKL 390 (403) T ss_pred hccCCCC----cEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeecccccchhhccchhhc Confidence 9998654 47899999999999999999875 7999999999999999999999999999999999884 Q ss_pred -CCC Q lcl|NC_019422. 382 -GGE 384 (384) Q Consensus 382 -~ge 384 (384) +|| T Consensus 391 k~ge 394 (403) T protein:vir:80 391 KGGE 394 (403) T ss_pred cCCC Confidence 333 No 45 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=100.00 E-value=8e-83 Score=470.66 Aligned_cols=380 Identities=16% Similarity=0.186 Sum_probs=317.5 Q ss_pred CcchhhhcccCC----CcchhHHHhhccccCc-----ceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcc--- Q lcl|NC_019422. 1 MNIFKSKKKNKE----APGKVMMELISDSGNG-----FYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEF--- 68 (384) Q Consensus 1 M~~f~~~~~~~~----~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~--- 68 (384) -+++.+..+... .....+..++....++ -......++++|+|++||++||+++|++||++|+.+.++. T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~a~~~~~v~~~v~~ia~~iA~lp~~v~~~~~~g~~~~ 81 (460) T protein:vir:10 2 ANRIIRALRELTGLDNKFNDAFIKYIGQTFTKYDNNGKTYLEQGYNINPDVYSCISQMAAKTVAVPYTIKVVKDTKAYQQ 81 (460) T ss_pred chhHHHHHhhhhccCCCchHHHHHhhccccCCCccchhhhhHHHHhcchHHHHHHHHHHHhhhhCceEEEeccCCccchh Confidence 345554322211 1122233333322221 2234456899999999999999999999999999887763 Q ss_pred -------------------------eeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCC----CCcee Q lcl|NC_019422. 69 -------------------------KTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDD----YNMPT 119 (384) Q Consensus 69 -------------------------~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~----~g~~~ 119 (384) ....++..++|+.+||++||+++||+.++.+++++||||++++|+. .|++. T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~~~~~G~~~ 161 (460) T protein:vir:10 82 LNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLMSPDDGINAGVPS 161 (460) T ss_pred hhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCccCceeE Confidence 2334566788999999999999999999999999999999999964 47899 Q ss_pred eEEEEcCceEEEEEcCCCEEEEE--------EEcCceEEEEehhheEEEeccCCC-----CCccCccHHHHHHHHHHHHH Q lcl|NC_019422. 120 QIYPLNALNVEAIYENEVLFLKF--------LLRNGKIVSYPYSDIIHLRKDFNE-----NDLFGTSPAKVLEPIMEVVN 186 (384) Q Consensus 120 ~l~~l~~~~v~~~~~~~~~~~~~--------~~~~g~~~~~~~~evih~~~~~~~-----~~~~G~s~~~~~~~~i~~~~ 186 (384) +||||+|..|++..+.++..+.+ ...+|..+.++++||||+|++++. ++++|+||+..+..++.... T Consensus 162 ~L~~l~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~evih~r~~~~~~~~~~~~~~G~sp~~~~~~~i~~~~ 241 (460) T protein:vir:10 162 QMYVLPAHLIKIVLKDDINLLSTDSPIKSYMLIQGDQFIEFNEDEVIHTKYANPNFDLQGSHLYGMSPIRAILRNINSQN 241 (460) T ss_pred EEEEEcCceEEEEEcCCCceeeeeeeeeEEEEecCceeEEecccceEEEecCCCCcccccCccccccHHHHHHHHHHHHH Confidence 99999999999999887653321 234677889999999999988765 46799999999999999999 Q ss_pred HHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHH Q lcl|NC_019422. 187 TTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKA 265 (384) Q Consensus 187 ~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~ 265 (384) ++++++.++|+||+.|+++++.++.+++++.+++++.|++.++| ..++++++++++|++|+++++++.++++.+. +++ T Consensus 242 ~~~~~~~~~f~ng~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~g-~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~ 320 (460) T protein:vir:10 242 STIDNNVKTMQNGGVFGFIHGGSTGLTQPQADSLKQRLTEMDKS-PDRLSQIAGASGEIAFTKISLNTDELKPFDYLKYD 320 (460) T ss_pred HHHHHHHHHHhcCCCcceeeecCCCCCHHHHHHHHHHHHHHhcC-ccccCCceecCCCceEEEccCChhHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999976 4567899999999999999999999999776 578 Q ss_pred HHHHHHHhCCCHHHhcc--------ccHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhh--ccC Q lcl|NC_019422. 266 IQRLYSFFNTNEKIIQS--------KYSEDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQ--YAS 335 (384) Q Consensus 266 ~~~I~~~fgvp~~~l~~--------~~~e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~--~~d 335 (384) +++||++|||||.+||. ++.|++..+|++.||.|++..|+++||++|+++.++..+.+++||++.+. +.| T Consensus 321 ~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl~~~~~~~~~~~i~~d~~~l~~l~~d 400 (460) T protein:vir:10 321 QKAICNALGWSDKLLNNNEGGGLNTGNLEEERKRVVTDNIQPDLVILKQAFDKKFIKRFKGYENAVIEWDISELPEMQTD 400 (460) T ss_pred HHHHHHHhCCCHHHhCCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccCCceEEeecchhhhHHHH Confidence 99999999999999962 35689999999999999999999999999999998888999999999873 334 Q ss_pred HHHHHHHHHHHhCCCCCHHHHHHHhCCCCC--CCCCeeeecCceeecCCCC Q lcl|NC_019422. 336 MSTKLNLVQMVDRGSLTPNEWRKIMNLSPI--ENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 336 ~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~--~~gd~~~~~~n~~~~~~ge 384 (384) .+++ ++++++|++|+||+|+++|+||+ ||||++++|+|++|++..+ T Consensus 401 ~~~~---~~~~~~g~~T~NE~R~~~g~~pi~~~~gD~~~~~~n~~~~~~~~ 448 (460) T protein:vir:10 401 MVAM---ASWLNTIPVTPNEIRIAMKYETLNQDGMDIVFMPSNKVRIDDVS 448 (460) T ss_pred HHHH---HHHHhCCCCCHHHHHHHhCCCCCCCCCCCeeeecccccchhhcc Confidence 4333 45678999999999999999998 5799999999999987544 No 46 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=100.00 E-value=6e-81 Score=460.40 Aligned_cols=368 Identities=13% Similarity=0.074 Sum_probs=304.7 Q ss_pred CcchhhhcccCCCcchhHHHhhcccc-CcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDSG-NGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFL 79 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l 79 (384) |.-|..-...... .+. .....+.+.++++++||+||++||++||++||++++.+ +.....++++++| T Consensus 1 ~~~~~~~~g~~~~----------~~~~~~~~~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~--~~~~~~~~l~~lL 68 (723) T protein:vir:94 1 MTTFPSGAGGWNA----------WSADSVFGNGAKGWSNSAVAYRCISMLANNAASVDLVVRGPD--GELDELHPLSQLW 68 (723) T ss_pred CcccccCCCcccc----------ccccccccccHHHHhhhHHHHHHHHHHHHhhccceeEEEcCC--CccchhhHHHHHH Confidence 2222111111100 000 11122346789999999999999999999999998643 3345567888899 Q ss_pred HhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCC---CCceeeEEEEcCceEEEEEcCCC--------EEEEEEEcCce Q lcl|NC_019422. 80 LENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDD---YNMPTQIYPLNALNVEAIYENEV--------LFLKFLLRNGK 148 (384) Q Consensus 80 ~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~---~g~~~~l~~l~~~~v~~~~~~~~--------~~~~~~~~~g~ 148 (384) +.+||++||+++||+.++.+++++||+|+++.+++ .|.+.+|+++++..+.+....+. ..|.+...+|. T Consensus 69 ~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~~~~~~~~~~~y~~~~~~G~ 148 (723) T protein:vir:94 69 NVMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAADAVPQAQIIGYVIERTDGV 148 (723) T ss_pred hhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCCccceeeeeeEEEEEecCce Confidence 89999999999999999999999999999999754 58899999999988877655442 23455667888 Q ss_pred EEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHh Q lcl|NC_019422. 149 IVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNY 228 (384) Q Consensus 149 ~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~ 228 (384) .+.++++||||+|.+++.++++|+||+..+.++|....++++++.++|+||++|+++|+.+ .+++++.++++++|++.+ T Consensus 149 ~~~~~~~dIiHir~~~~~dg~~G~Spi~~a~~~i~~~~aa~~~~~~~f~NG~~p~giL~~~-~l~~e~~~~~~~~~~~~~ 227 (723) T protein:vir:94 149 RVPVLADEMLWLRFSDPYDPLAVMAPWKAARAAVDADFYAATWQRQSFKNGARPGGVVNLG-DMDEQTFTKTVAAFRSQV 227 (723) T ss_pred eEEecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcC-CCCHHHHHHHHHHHHHHh Confidence 8999999999999999999999999999999999999999999999999999999999986 589999999999999999 Q ss_pred ccccccCCcceecC----------CCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc----ccHHHHHHHHH Q lcl|NC_019422. 229 LQIDSEAGGAAATD----------SKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS----KYSEDEWNAYY 293 (384) Q Consensus 229 ~~~~~~~~~~~v~~----------~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~----~~~e~~~~~~~ 293 (384) +| ..|+|++++++ .|++|++++.++.|+++.+. ++.+++||++|||||.+|++ ++.+++..+|+ T Consensus 228 ~G-~~Nagk~~vL~g~~~~~~vl~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~i~~~st~sN~e~~~~~f~ 306 (723) T protein:vir:94 228 EG-VQNAGRHLLIAGQGSDGGAAGKGATFTSLSMSPAEMDYINSRMHSAEEVMLAFGIRKDALLGGSTYENQAEAKAAVW 306 (723) T ss_pred hc-hhhcCcceeecccccccccccCCceEEEccCCHHHHHHHHHHHHhHHHHHHHhCCChhHcCCCCCcccHHHHHHHHH Confidence 76 46788888875 58999999999999999876 57899999999999998864 35678889999 Q ss_pred HHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCC--e Q lcl|NC_019422. 294 ESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGD--K 370 (384) Q Consensus 294 ~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd--~ 370 (384) ++||.|+++.||++||++|++..+. ..+++||...+++.|.+++++.++ ++++|++|+||+|+++|+||+|||| . T Consensus 307 ~~tL~P~~~~ie~~ln~~Ll~~~g~--~~~~~f~~~~lLr~D~~~r~~~~~~~v~~G~~T~NE~R~~lglpPi~gGd~~~ 384 (723) T protein:vir:94 307 TETLIPQMEVMASITDLQLLPDIGW--TVEWDFNSVPALQEDLEAQAGRNQGYLVNDVLMVDEVRATIGLDPLPGGIGQM 384 (723) T ss_pred HHHHHHHHHHHHHHHhHhhcccccC--ceEEeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccc Confidence 9999999999999999999986543 356788888899999999999886 7999999999999999999999987 4 Q ss_pred eeecC--ceeecCCCC Q lcl|NC_019422. 371 PVRRL--DTAVVEGGE 384 (384) Q Consensus 371 ~~~~~--n~~~~~~ge 384 (384) ++.|. |+.|.+.+. T Consensus 385 ~~~p~~~~~a~~~~~~ 400 (723) T protein:vir:94 385 TLTPYRAQFAPAPAPA 400 (723) T ss_pred eeccccccccCCCCCC Confidence 45554 444544333 No 47 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=100.00 E-value=1e-79 Score=453.67 Aligned_cols=356 Identities=12% Similarity=0.115 Sum_probs=299.3 Q ss_pred CcchhhhcccC-------------------------CCcchhH---HHhhc--c---------ccCcceechhhhhhcHH Q lcl|NC_019422. 1 MNIFKSKKKNK-------------------------EAPGKVM---MELIS--D---------SGNGFYSWHGNLYKSDI 41 (384) Q Consensus 1 M~~f~~~~~~~-------------------------~~~~~~~---~~~~~--~---------~~~~~~~~~~~~~~~~~ 41 (384) ||||++.+.-. ..++... ..+.+ . ...+...+.++++++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~t~~~~~~~~~ 80 (409) T protein:vir:83 1 MGFWSNLFGIPSIPDLPNDNGPVDYNPGDPDMVEFRGPEEEPEARALPWIRPTAWSGYPESWATPSWGSAQDKLRTLIDV 80 (409) T ss_pred CchhhhhcccccCCCcccccccccccCCCCceeeccCCCcchhhhhcccccccccccccccccccCccccchhhHhhhHH Confidence 99999876631 1111110 01111 0 11122345677899999 Q ss_pred HHHHHHHHHHhhccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEe-eCCCCceee Q lcl|NC_019422. 42 VRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVII-KDDYNMPTQ 120 (384) Q Consensus 42 v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~-~~~~g~~~~ 120 (384) |++||+.||+.||++|++++++++ ..+...++|+.+||+.||+++||+.++.++++ ||+|++++ ++..|.+++ T Consensus 81 v~acV~~Ia~~iA~lpl~~~~~~~-----~~~~~~~ll~~~PN~~~t~~~f~~~l~~~lll-Gnay~~~i~r~~~G~~~~ 154 (409) T protein:vir:83 81 AWACIDLNASVLSSMPIYRMRNGR-----IIDSVAWMSNPDPEVYTSWQEFAKQLFWDFQL-GEAFVLPMAHGSDGYPIR 154 (409) T ss_pred HHHHHHHHHHhhccCceEEeeCCc-----cccchhhhcccCCCCCCCHHHHHHHHHHHHhh-CCcEEEEEEECCCCcEEE Confidence 999999999999999999997543 23456678999999999999999999999887 99999864 888999999 Q ss_pred EEEEcCceEEEEEcCCCEEEEEEEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccC Q lcl|NC_019422. 121 IYPLNALNVEAIYENEVLFLKFLLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSN 200 (384) Q Consensus 121 l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~ 200 (384) |+||+|..|++..+.++..+++ ..+ ...++||||+|++++.++++|+||+..++.++....++++++.++|+||+ T Consensus 155 L~pl~p~~v~v~~~~~g~~~y~-~~~----~~~~~eiiHir~~~~~~~~~G~spi~~~~~~i~~~~a~~~~~~~~f~nga 229 (409) T protein:vir:83 155 FRVVPPWLVNVELKKGARREYR-IGG----LNVTDEILHIRYQGNTADAHGHGPLESAAPRQVVIGLLQKYVQNLAETGG 229 (409) T ss_pred EEEECCcceEEEEcCCceEEEE-Ecc----ccCccceEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 9999999999999887754433 222 13458999999999999999999999999999999999999999999999 Q ss_pred CcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCcee-eecccchhHHHHHHH-HHHHHHHHHHhCCCHH Q lcl|NC_019422. 201 TIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDA-EQVKAESYVPNAAQM-DKAIQRLYSFFNTNEK 278 (384) Q Consensus 201 ~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~-~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~ 278 (384) +|+++|++++.++++++++++++|++.+.+ ++|+++++.+|+++ ++++.++.|+|+.+. ++++++||++|||||. T Consensus 230 ~p~gil~~~~~ls~e~~~~~~~~~~~~~~~---nag~~~il~~g~~~~~~~~~s~~d~q~le~r~~~~~eIa~~fgVPp~ 306 (409) T protein:vir:83 230 VPLYWLGVERRLSETEAVDLMDRWIESRSK---YAGHPALVTGGATLNQAKSMSAQDLSLMELTQFNEARIAILLGVPPF 306 (409) T ss_pred CcceEeecCCCCCHHHHHHHHHHHHHhhCC---ccCccceecCCcccccccCCCHHHHHHHHHHHhhHHHHHHHhCCCHH Confidence 999999999999999999999999987754 67888888888887 578999999999877 5789999999999999 Q ss_pred Hhc---------cccHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhC Q lcl|NC_019422. 279 IIQ---------SKYSEDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDR 348 (384) Q Consensus 279 ~l~---------~~~~e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~ 348 (384) +|| +++.|++..+|++.||.|++++||++||++|+++. .+++||++++++.|++++++.++ ++++ T Consensus 307 llg~~~~~~~~tysn~eq~~~~f~~~tL~P~~~~ie~~l~~~Ll~~~-----~~~~f~~~~llr~d~~~r~~~~~~~~~~ 381 (409) T protein:vir:83 307 LVGLPGATGSLTYSNIEQLFSFHDRSSLRPKATAVMAALDRWALPSP-----QHLELNRDDYTRPSLVERATAYKIMIEA 381 (409) T ss_pred HccCCCCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCC-----cEEEeehhhhhccCHHHHHHHHHHHHhC Confidence 997 24568999999999999999999999999999753 37899999999999999999986 7999 Q ss_pred CCCCHHHHHHHhCCCCCCCCCeeeecCceeecCCCC Q lcl|NC_019422. 349 GSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 349 g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~~ge 384 (384) |++|+||+|+++|+||++|||++- +|+ T Consensus 382 G~lT~NE~R~~~glpp~~ggd~l~---------~~g 408 (409) T protein:vir:83 382 GVMEPNEARAMERLHSEAAAVRLS---------GGG 408 (409) T ss_pred CCcCHHHHHHHhCCCCCCCCcccC---------CCC Confidence 999999999999999999999882 333 No 48 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=100.00 E-value=7.6e-79 Score=448.85 Aligned_cols=363 Identities=16% Similarity=0.164 Sum_probs=306.1 Q ss_pred CcchhhhcccCCCc---chhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAP---GKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIK 77 (384) Q Consensus 1 M~~f~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~ 77 (384) |+||++.++.+... .+.+..++..+..+...+.++++++++|++||+.||+.||++||++ .++.++ T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~V~~~v~~ia~~ia~~p~~~-----------~~~~~~ 69 (397) T protein:vir:38 1 MPLLKLNKSHSQGFSLNDPDWVNFLTGGEAQKYVSADTALKNSDIFSLIMQLSGDLAMVRYTS-----------ESDRSQ 69 (397) T ss_pred CcchhhhhcccCcccCCchhhhhhhcCCcCCceechHHhhccHHHHHHHHHHHHHHhhCcccc-----------cccHHH Confidence 99999755543332 2334445454445556777889999999999999999999999964 356788 Q ss_pred HHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEE--EEEEEc---CceEEEE Q lcl|NC_019422. 78 FLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLF--LKFLLR---NGKIVSY 152 (384) Q Consensus 78 ~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~--~~~~~~---~g~~~~~ 152 (384) +|+.+||++||+++||+.++.+++++||||++++|+..|.+++|++++|.+|++..+.++.. |.+... +|..+.+ T Consensus 70 ~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~~~~~y~~~~~~~~~~~~~~~ 149 (397) T protein:vir:38 70 SIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLRPSQVQPMLLQDGSGLIYNINFDEPAIGYMENV 149 (397) T ss_pred HHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEeccccccceeEe Confidence 99999999999999999999999999999999999999999999999999999998876643 334332 4567889 Q ss_pred ehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhcccc Q lcl|NC_019422. 153 PYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQID 232 (384) Q Consensus 153 ~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~ 232 (384) +++||||++++++.+.++|+||+.++..++....++.+++.++|+||++|++++++++.+++++.+++++.|+....+ T Consensus 150 ~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~e~~~~~~~~~~~~~~~-- 227 (397) T protein:vir:38 150 PAADVIHIRLLSKNGGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKGGLLDAETRIARSKEISKQI-- 227 (397) T ss_pred cCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcc-- Confidence 999999999998888899999999999999999999999999999999999999999999999999999999876643 Q ss_pred ccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhccc----cHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 233 SEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQSK----YSEDEWNAYYESEIEPVGLQLSNQ 307 (384) Q Consensus 233 ~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~~----~~e~~~~~~~~~~i~P~~~~i~~~ 307 (384) .++++++|+++|++|++++.++.++++.+. ++.+++||++|||||.+||++ .+.++...|+.+||+|++..|+++ T Consensus 228 ~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~ie~~ 307 (397) T protein:vir:38 228 HNSDGPVVIDALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQQSSITQISGQYAKSLNRYVQAIVGE 307 (397) T ss_pred cccCCceecCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHH Confidence 567889999999999999999999998766 578999999999999999863 223455678899999999999999 Q ss_pred HhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCcee------ec Q lcl|NC_019422. 308 YTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTA------VV 380 (384) Q Consensus 308 l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~------~~ 380 (384) ||++|+++.+ |++..+.+.|.+++++.++ ++++|++|+||+|+++|++|+++||.+....... +. T Consensus 308 ln~~l~~~~~--------~~~~~~~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~~~~~~~~~~~~ 379 (397) T protein:vir:38 308 LNDKLHANIS--------ANIRFAIDAMGDQYASTISSSVKGGTIAGNQARFILQNSGYLAKDLPDPEKEPQQAIQLIQQ 379 (397) T ss_pred HHHhccChhc--------ccccccccCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccccccccccccccccc Confidence 9999998643 3444456778999998875 8999999999999999999999998765444332 22 Q ss_pred CCCC Q lcl|NC_019422. 381 EGGE 384 (384) Q Consensus 381 ~~ge 384 (384) ++|+ T Consensus 380 ~~g~ 383 (397) T protein:vir:38 380 EGGE 383 (397) T ss_pred ccCC Confidence 3444 No 49 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=100.00 E-value=2.2e-78 Score=446.33 Aligned_cols=379 Identities=13% Similarity=0.173 Sum_probs=295.5 Q ss_pred Ccchhh----hcccCCCc-----chhHHHhhcc----------c-----cCcceechhhhhhcHHHHHHHHHHHHhhccC Q lcl|NC_019422. 1 MNIFKS----KKKNKEAP-----GKVMMELISD----------S-----GNGFYSWHGNLYKSDIVRSIIRPKAKAVGKM 56 (384) Q Consensus 1 M~~f~~----~~~~~~~~-----~~~~~~~~~~----------~-----~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~ 56 (384) .=||+. ||.....| .++...++.- . ..........++++++|++||+.||+++|++ T Consensus 62 ~~~~~~~~~~kk~~i~~pfkkk~~~~~~d~f~~s~es~s~vtsls~pdaf~~vnVs~~~AlknsaV~scI~~IA~sIAsL 141 (945) T protein:vir:10 62 IIIFRKNQVLKKEKIIVPYNHQEPPFKFNLFEYSPESLMYLPSISDPDAFFLINLFRKYRFNNDSKLIKVSEIPKKLTSK 141 (945) T ss_pred eeeehhhhHHHhhcccccccccccchhhhhhhccCccceecccccCccceeeehhhhhhhhccHHHHHHHHHHHhhhccC Confidence 334442 11111111 1111111100 0 0011133467788999999999999999999 Q ss_pred ceEEEEecCCcc------eeccchHHHHHHhhccccCCHHH----HHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcC Q lcl|NC_019422. 57 TAKHIRSNETEF------KTNPEIYIKFLLENPNPFMSGQI----LQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNA 126 (384) Q Consensus 57 ~~~~~~~~~~~~------~~~~~~~~~~l~~~PN~~~s~~~----f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~ 126 (384) |+++|++.+++. +...++.++.|+.+||++||+.+ |++.++.+++++||+|++++|+..|.+.+|+|++| T Consensus 142 PlklYrr~edG~~~~~~kk~~~~hpL~~LL~rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiRd~~G~ii~L~pLdP 221 (945) T protein:vir:10 142 ELEIYKHIEDKHVNYYLKRIRDARNILEFLERPDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIRDEQGNLVAITPVDG 221 (945) T ss_pred ceEEEEecccCcccccccccccchHHHHHHhCCCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECC Confidence 999999877663 23355666667779999999998 55668899999999999999999999999999999 Q ss_pred ceEEEEEcCCCEE-EEEE--EcCceEEEEehhheEEEeccCCCCC---ccCccHHHHHHHHHHHHHHHHHHHHHHHH-cc Q lcl|NC_019422. 127 LNVEAIYENEVLF-LKFL--LRNGKIVSYPYSDIIHLRKDFNEND---LFGTSPAKVLEPIMEVVNTTDQGVVKAIK-NS 199 (384) Q Consensus 127 ~~v~~~~~~~~~~-~~~~--~~~g~~~~~~~~evih~~~~~~~~~---~~G~s~~~~~~~~i~~~~~~~~~~~~~~~-ng 199 (384) .+|++..+.++.. +.|. ..++....++++|+||++.+...++ .+|+||+.++.+++....++++++.++|. || T Consensus 222 s~Vti~~ddDG~~~y~Yv~~idG~~~~~v~a~DvIlhirn~s~DG~~~GyGlSPIeaa~~aI~~alAaek~aar~FskNG 301 (945) T protein:vir:10 222 TTIKPILSEDTGIVVGYVQEVDGAIVAHFDKRDVVLFRQNLTPDVYMYGYSLPPIEILYKVILSDIFIDKGNLDYYRKGG 301 (945) T ss_pred cceEEEEcCCCcEEEEEEEecCCceEEEecCCceEEEeccCCCCcccccCCchHHHHHHHHHHHHHHHHHHHHHHHHhCC Confidence 9999998876643 3332 3334455788888776554444443 36999999999999999999999999995 68 Q ss_pred CCcceEEeeC----------CCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHH Q lcl|NC_019422. 200 NTIKWLLKFK----------TALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQR 268 (384) Q Consensus 200 ~~p~~il~~~----------~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~ 268 (384) ++|+++|+++ +.+++++.+++++.|++.++| .++++++++++|++|++++.++.|+++.+. ++++++ T Consensus 302 a~PsGILsvkg~~~~d~k~~~~LseEq~erlKe~wee~~sG--~NnG~piVLdeGmef~pLs~s~~DaQfLEsrkfs~ee 379 (945) T protein:vir:10 302 SIPEGILAIEPPSYKEGDIYPQLSREQLESIQRQLQAIMMG--DYTQVPILSGGKFTWIDFKGKRRDMQFKELAEFVARK 379 (945) T ss_pred CccceEEEecCccccccccccccCHHHHHHHHHHHHHHhCC--cccccceecCCCceEEEccCChhHHHHHHHHHHHHHH Confidence 8999999875 567899999999999999976 355778899999999999999999998775 678999 Q ss_pred HHHHhCCCHHHhc------cccHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHH Q lcl|NC_019422. 269 LYSFFNTNEKIIQ------SKYSEDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNL 342 (384) Q Consensus 269 I~~~fgvp~~~l~------~~~~e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~ 342 (384) ||++|||||.+|| +++.+++..+|+++||.|++.+|+++||++|+...+ +.+++|+++.....|.+++++. T Consensus 380 IArAFGVPP~lLG~~e~st~SNiEqq~~~Fv~~tL~Pil~~IEqeLNrkLl~~~e---g~~i~fdFd~ldl~D~ksraEa 456 (945) T protein:vir:10 380 ICAVYQVSPQDVGILEGSNKATAEVMASLTKAKGLEPLMATISKGFDEVVSEFRN---EKDIKLWFKEDDLEKERDWWNI 456 (945) T ss_pred HHHHhCCCHHHcccCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc---CceeEEEecchhccCHHHHHHH Confidence 9999999999997 356789999999999999999999999999876443 3345555666666788999988 Q ss_pred HH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCc-eeecCCCC Q lcl|NC_019422. 343 VQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLD-TAVVEGGE 384 (384) Q Consensus 343 ~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n-~~~~~~ge 384 (384) ++ ++++|++|+||+|+++|+||+||||+++++.+ ++|.+... T Consensus 457 l~kli~sGiLTiNEvRe~lGLpPIeGGD~lli~~nn~~P~d~~~ 500 (945) T protein:vir:10 457 IQGQLNTGFRSINEARMEKGLEPVPWGDVPFSGLRNWKPEDEQA 500 (945) T ss_pred HHHHHhCCCcCHHHHHHHhCCCCCCCcceeeecccccccccccc Confidence 75 79999999999999999999999999999874 55553211 No 50 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=100.00 E-value=2e-78 Score=446.52 Aligned_cols=367 Identities=15% Similarity=0.172 Sum_probs=308.6 Q ss_pred CcchhhhcccCCCcchhHHHh--hcc-ccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecC---Ccceeccch Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMEL--ISD-SGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNE---TEFKTNPEI 74 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~--~~~-~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~---~~~~~~~~~ 74 (384) |||++.+.............. ++. +......+.+++.++++|++||+.||+.||++|++++++.. ++.....++ T Consensus 1 mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~v~~cv~~Ia~~ia~~p~~v~~~~~~~~~~~~~~~~~ 80 (403) T protein:vir:10 1 MGFKSWITEKLNPGQRIIRDMEPVSHRTNRKPFTTGQAYSKIEILNRTANMVIDSAAECSYTVGDKYNIVTYANGVKTKT 80 (403) T ss_pred CcchhhhhhccchhhhhhhcccccccccCCcccccHHHHHHHHHHHHHHHHHHHHHhhCceeEeecccccccccccccch Confidence 999998765554332222111 111 11122345578899999999999999999999999986643 333445678 Q ss_pred HHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEcCceEEEEeh Q lcl|NC_019422. 75 YIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLRNGKIVSYPY 154 (384) Q Consensus 75 ~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~~ 154 (384) +.++|+.+||++||+++||+.++.+++++||||+++.+ ..++++++..|++..+.++.+++|...++ ..+++ T Consensus 81 l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~~------~~l~~l~~~~~~v~~~~~~~~~~~~~~~~--~~~~~ 152 (403) T protein:vir:10 81 LDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYWDG------TSLYHVPAALMQVEADANKFIKKFIFNNQ--INYRV 152 (403) T ss_pred HHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEeC------ceeEeecCcceEEEEcCCceEEEEEecCc--eeecc Confidence 88999999999999999999999999999999998753 36899999999999988888887766554 45788 Q ss_pred hheEEEeccC----CCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhcc Q lcl|NC_019422. 155 SDIIHLRKDF----NENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQ 230 (384) Q Consensus 155 ~evih~~~~~----~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~ 230 (384) +||+|++... +.++++|+||+..+..++....++++++.++|+||++|+++++.++.++++++++++++|++.++| T Consensus 153 ~eiih~~~~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~~g 232 (403) T protein:vir:10 153 DEIIFIKDNSYVCGTNSQISGQSRVATVIDSLEKRSKMLNFKEKFLDNGTVIGLILETDEILNKKLRERKQEELQLDYNP 232 (403) T ss_pred cceEEecccccccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhCC Confidence 9999998543 358899999999999999999999999999999999999999999999999999999999999976 Q ss_pred ccccCCcceecCCCceeeeccc--chhHHHHHHH-HHHHHHHHHHhCCCHHHhcc---ccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 231 IDSEAGGAAATDSKYDAEQVKA--ESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS---KYSEDEWNAYYESEIEPVGLQL 304 (384) Q Consensus 231 ~~~~~~~~~v~~~g~~~~~l~~--~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~---~~~e~~~~~~~~~~i~P~~~~i 304 (384) ..|+|+++++++|++|++++. ++.|+|+.+. ++++++||++|||||.+||. ++.+++...|+++||.|++..| T Consensus 233 -~~n~g~~~vl~~g~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~sn~e~~~~~f~~~tl~P~~~~i 311 (403) T protein:vir:10 233 -STGQSSVLILDGGMKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDGGNNANIRPNIELFYYMTIIPMLNKL 311 (403) T ss_pred -cccCcceeecCCCceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcCHHHHHHHHHHHHHHHHHHHH Confidence 567899999999999999985 5678998876 57899999999999999973 4567889999999999999999 Q ss_pred HHHHhhcccCcccccCcceEEeechhh--hccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCC--CCCeeeecCcee- Q lcl|NC_019422. 305 SNQYTEKLFTRKARSFGNEIVFEASNL--QYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIE--NGDKPVRRLDTA- 378 (384) Q Consensus 305 ~~~l~~~l~~~~~~~~~~~i~fd~~~~--~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~--~gd~~~~~~n~~- 378 (384) +++|+++|. .+++||++.+ ++.|.+++++.++ ++++|++|+||+|+++|+||+| +||++++|+|++ T Consensus 312 e~~l~~~L~--------~~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G~lT~NE~R~~~gl~pi~~~~~d~~~~p~n~~~ 383 (403) T protein:vir:10 312 TSSLTFFFG--------YKITPNTKEVAALTPDKEAEAKHLTSLVNNGIITGNEARSELNLEPLDDEQMNKIRIPANVAG 383 (403) T ss_pred HHHHHHhcC--------ceeeeccchhhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCccccccccccccccc Confidence 999999872 3577887755 7889999998875 8999999999999999999995 699999999985 Q ss_pred ---ecCCCC Q lcl|NC_019422. 379 ---VVEGGE 384 (384) Q Consensus 379 ---~~~~ge 384 (384) ++.+|| T Consensus 384 ~~~~~~~~e 392 (403) T protein:vir:10 384 SATGVSGQE 392 (403) T ss_pred ccccCCCCc Confidence 445555 No 51 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=100.00 E-value=3.4e-78 Score=445.32 Aligned_cols=376 Identities=11% Similarity=0.042 Sum_probs=309.8 Q ss_pred CcchhhhcccCCCcc---------------------------hhHHHhhcccc------CcceechhhhhhcHHHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPG---------------------------KVMMELISDSG------NGFYSWHGNLYKSDIVRSIIR 47 (384) Q Consensus 1 M~~f~~~~~~~~~~~---------------------------~~~~~~~~~~~------~~~~~~~~~~~~~~~v~~~i~ 47 (384) ||||++..+...+.. +....++.+.. .+..++.++++++++|++||+ T Consensus 1 M~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~g~~v~~~~a~~~~~v~~~i~ 80 (466) T protein:vir:81 1 MRLIDRLLSTRGAAPRMSIDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAGPSTELAPDTFVGLATQAYQANGPVFACML 80 (466) T ss_pred CchhHHHhhccCcccccchhhhhhhhhhhhccccccccccccHHHHHhhccccccccCccccccchhhhhccHHHHHHHH Confidence 999998765543221 11122222111 122345677999999999999 Q ss_pred HHHHhhccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC--------Ccee Q lcl|NC_019422. 48 PKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDY--------NMPT 119 (384) Q Consensus 48 ~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~--------g~~~ 119 (384) +||+.||++||++++.++++.++..++.++.|+.+||++||+++||+.++.+++++||||++++|++. |.+. T Consensus 81 ~Ia~~ia~lp~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~g~l~~~~~g~~~ 160 (466) T protein:vir:81 81 VRQLVFSSVRFRWQRLRDGKPSDTFGSRDLQILETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGEFVRMRPDWVDVVV 160 (466) T ss_pred HHHHhhccCceEEEEecCCceeeccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCccccccccCccee Confidence 99999999999999988877777788888999999999999999999999999999999999999765 4589 Q ss_pred eEEEEcCceEEEEEcCCCEE---EEEEEc----CceEEEEehhheEEEecc-CCCCCccCccHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 120 QIYPLNALNVEAIYENEVLF---LKFLLR----NGKIVSYPYSDIIHLRKD-FNENDLFGTSPAKVLEPIMEVVNTTDQG 191 (384) Q Consensus 120 ~l~~l~~~~v~~~~~~~~~~---~~~~~~----~g~~~~~~~~evih~~~~-~~~~~~~G~s~~~~~~~~i~~~~~~~~~ 191 (384) ++++++|..|++..+.++.. |.|... .++.+.++++||||+|.+ ++.++++|+||+..+.+++....+++++ T Consensus 161 ~l~~l~~~~v~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~dviHir~~~~~~d~~~G~s~i~~~~~~i~~~~a~~~~ 240 (466) T protein:vir:81 161 EERMVRGGRGELGGGQLGWRKVGYLYTEGGRQSGNESVGFLAEDVVHFAPIPDPLASYRGMSWLTPILREIRADQAMSKH 240 (466) T ss_pred EEEEecCcceEEEEcCCCceEEEEEEEecCcccccceeeeccccEEEEcCCCCcccccccccHHHHHHHHHHHHHHHHHH Confidence 99999999999999877643 333332 235678999999999965 5789999999999999999999999999 Q ss_pred HHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHH Q lcl|NC_019422. 192 VVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLY 270 (384) Q Consensus 192 ~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~ 270 (384) +.++|+||++|+++++.++.+++++.+++++.|++.++| ..++++++|+++|++|++++.++.|+|+.+. ++++++|| T Consensus 241 ~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g-~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia 319 (466) T protein:vir:81 241 QAKFFDNGATVNLVIKHNPMADPAAVKKWADEVNSKHAG-VDNAWKNLNLYPGADADVVGSNLQEIDFKNVRGGGETRIA 319 (466) T ss_pred HHHHHhcCCCcceEEecCCCCCHHHHHHHHHHHHHHhcC-ccccccceEcCCCceEEEccCChhHHHHHHHHHHHHHHHH Confidence 999999999999999999999999999999999999976 5678999999999999999999999999876 67899999 Q ss_pred HHhCCCHHHhcc---------ccHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHH Q lcl|NC_019422. 271 SFFNTNEKIIQS---------KYSEDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLN 341 (384) Q Consensus 271 ~~fgvp~~~l~~---------~~~e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~ 341 (384) ++|||||.+||. ++.|++..+|++.||.|++++||++||++|++..+.. ..+++||.+++++.|.+++.+ T Consensus 320 ~~fgVPp~~lG~~~~~~~st~sn~eq~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~-~~~~~f~~~~llr~d~~~r~~ 398 (466) T protein:vir:81 320 AAAGVPPVIVGLSEGLAAATYSNYGQARRRLADGTAHPLWQNLSGCIGHVMPDMGPDV-RLWYDADDVPFLREDEKDAAD 398 (466) T ss_pred HHhCCCHHHcccccCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCc-ceEEEecchhhhccCHHHHHH Confidence 999999999962 3468999999999999999999999999999876643 468999999999999999876 Q ss_pred H-------HH-HHhCCCCCHHHHHHHhCCCCCCCCCeeee-cCceeec------------------CCCC Q lcl|NC_019422. 342 L-------VQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVR-RLDTAVV------------------EGGE 384 (384) Q Consensus 342 ~-------~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~-~~n~~~~------------------~~ge 384 (384) + ++ ++++|+ |+||+|.. +++||.++. +.++.++ +||| T Consensus 399 ~~~~~~~~~~~~~~~g~-t~nE~r~~-----~~~gd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Gg~ 462 (466) T protein:vir:81 399 IQKVRAETINTLITAGY-EPESVVAA-----VNSGDLRLLKHTGLTSVQLLPPGVSASASSDTPTSGGAD 462 (466) T ss_pred HHHHHHHHHHHHHHcCC-Chhhcccc-----ccCCccccccCCCcchhhhcccccccccCCCCcccCCCC Confidence 5 22 567785 99999964 445555432 2333222 1222 No 52 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=100.00 E-value=1.2e-77 Score=442.38 Aligned_cols=322 Identities=20% Similarity=0.291 Sum_probs=290.7 Q ss_pred hccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEE Q lcl|NC_019422. 53 VGKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAI 132 (384) Q Consensus 53 ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~ 132 (384) ||++|++++++++ ...++++++|+.+||++||+++||+.++.+++++||||++++|+..|.+.+|+|++|.+|++. T Consensus 1 ia~lp~~~~~~~~----~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~l~~~~v~~~ 76 (348) T protein:vir:93 1 MASLPLKMYEDYK----VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEML 76 (348) T ss_pred CcccceEeEecCc----CcccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCCceEEE Confidence 9999999998653 446788888889999999999999999999999999999999999999999999999999998 Q ss_pred EcCCCEE--EEEEEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCC Q lcl|NC_019422. 133 YENEVLF--LKFLLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKT 210 (384) Q Consensus 133 ~~~~~~~--~~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~ 210 (384) .+.++.. |.+...+|..+.++++||||+|++++.++++|+||+..+..++....++++++ +..++..+.++++.++ T Consensus 77 ~~~~~~~~~y~~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~~~ 154 (348) T protein:vir:93 77 IENQSRELYYSIHAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGS 154 (348) T ss_pred EeCCCcEEEEEEEcCCCeEEEEccccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHH--HHhcCCCceeEEecCC Confidence 8776543 55666678889999999999999889999999999999999999999888885 4444555678889999 Q ss_pred CCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc------c Q lcl|NC_019422. 211 ALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS------K 283 (384) Q Consensus 211 ~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~------~ 283 (384) .++++++++++++|++.+. ++++++++++|++|++++.++.++++.+. ++++++||++|||||.+||+ + T Consensus 155 ~l~~e~~~~~~~~~~~~~~----n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~ 230 (348) T protein:vir:93 155 NVSTEKRQQVLEDFKQYYE----ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFA 230 (348) T ss_pred CCCHHHHHHHHHHHHHHhh----cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcc Confidence 9999999999999998763 46789999999999999999999999877 56889999999999999974 3 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCC Q lcl|NC_019422. 284 YSEDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNL 362 (384) Q Consensus 284 ~~e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~ 362 (384) +.|++..+|++.||.|+++.|+++||++|+++.++..+.+|+||.+++++.|.++++++++ ++++|++|+||+|+++|+ T Consensus 231 ~~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~R~~~g~ 310 (348) T protein:vir:93 231 KNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDL 310 (348) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCC Confidence 5688999999999999999999999999999999888999999999999999999999875 899999999999999999 Q ss_pred CCCCCCCeeeecCceeecC----------CCC Q lcl|NC_019422. 363 SPIENGDKPVRRLDTAVVE----------GGE 384 (384) Q Consensus 363 ~p~~~gd~~~~~~n~~~~~----------~ge 384 (384) ||+||||++++++|++|++ +|| T Consensus 311 ~p~~ggD~~~~~~n~~~~~~~~~~~~~~~gg~ 342 (348) T protein:vir:93 311 PPVEGGDKPLISGDLYPIDTPLELRKSLKGGD 342 (348) T ss_pred CCCCCcCeEeecccccccccchhhcccccCCC Confidence 9999999999999998865 444 No 53 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=100.00 E-value=4.2e-78 Score=444.78 Aligned_cols=344 Identities=18% Similarity=0.194 Sum_probs=284.6 Q ss_pred CcchhhhcccCCCcchhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcc------eeccch Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEF------KTNPEI 74 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~------~~~~~~ 74 (384) ||||++..+............+.. ......++++++|++||+.||+.||++|+++++..+.+. ....++ T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~v~~~v~~IA~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~ 75 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNNDTQRVTA-----WQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSD 75 (378) T ss_pred CCccccchhcccccccCCcceeee-----eccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccCcccccccccccch Confidence 999998876554433332221111 122345678899999999999999999999987765542 234577 Q ss_pred HHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCC-CCceeeEEEEcCceEEEEEcCCCEEEEEEEcCceEEEEe Q lcl|NC_019422. 75 YIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDD-YNMPTQIYPLNALNVEAIYENEVLFLKFLLRNGKIVSYP 153 (384) Q Consensus 75 ~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~-~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~ 153 (384) ++++|+.+||++||+++||+.++.+++++||+|++++++. .|.+..++|.. ..+.++ T Consensus 76 l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~g~~~~l~p~~----------------------~~~~~~ 133 (378) T protein:vir:94 76 LDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNTGELLDLLFAD----------------------DKKEYK 133 (378) T ss_pred HHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCCceEEEEEecC----------------------CeeEee Confidence 8888888999999999999999999999999999987654 46666555421 224578 Q ss_pred hhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccc-- Q lcl|NC_019422. 154 YSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQI-- 231 (384) Q Consensus 154 ~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~-- 231 (384) ++||||++.+ .++..|+||++.+.+++... +++ +.|+++|++++.+++++.++.+++|++.+++. T Consensus 134 ~~diiH~~~~--~~~~~g~s~l~~~~~~i~~~----------~~~-~~~~gil~~~~~l~~~~~~~~~~~~~~~~~~~~~ 200 (378) T protein:vir:94 134 PEELVRLTSP--FYINEDTSILDNALASIQTK----------LEQ-GKLRGLLKINAFLDIDNTQEYREKALTTIKNMQE 200 (378) T ss_pred eeeeEEecCc--CCccchhHHHHHHHHHHHHH----------Hhc-ccccceeeeCCcCCHHHHHHHHHHHHHHHHHhhc Confidence 8999999854 66778999999988876432 333 46899999999999999999998888877642 Q ss_pred cccCCcceecCCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhccccHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019422. 232 DSEAGGAAATDSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQSKYSEDEWNAYYESEIEPVGLQLSNQYTEK 311 (384) Q Consensus 232 ~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~~~~e~~~~~~~~~~i~P~~~~i~~~l~~~ 311 (384) ..++++++++++|++|++++.++.++++.++++++++||++|||||.+|++++++++..+|++.||.|++.+|+++|+++ T Consensus 201 ~~~~g~~~vl~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVP~~~l~~~~se~~~~~f~~~tL~P~~~~ie~~l~~~ 280 (378) T protein:vir:94 201 GSSYNGLTPVDNKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENILLGTASQEQQIYFYNSTIIPLLIQLEKELTYK 280 (378) T ss_pred ccccccceecCCCceEEEccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCChHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 24567899999999999999999999988888999999999999999999999999999999999999999999999999 Q ss_pred ccCcccccCcc------eEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeecCCCC Q lcl|NC_019422. 312 LFTRKARSFGN------EIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 312 l~~~~~~~~~~------~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~~ge 384 (384) ||++.++..+. .++||++.+++.|.+++++.++ ++++||+|+||+|+++|+||+||||++++|+|++|++..+ T Consensus 281 Ll~~~er~~g~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gGD~~~~~~n~~~~~~~~ 360 (378) T protein:vir:94 281 LISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDVYIANLNAVAVKNLS 360 (378) T ss_pred cCChhHhhhhhhcccccceeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeecccccccccch Confidence 99987655443 4789999999999999999875 8999999999999999999999999999999999985333 No 54 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=100.00 E-value=6.8e-78 Score=443.66 Aligned_cols=344 Identities=17% Similarity=0.194 Sum_probs=284.1 Q ss_pred CcchhhhcccCCCcchhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcc------eeccch Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEF------KTNPEI 74 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~------~~~~~~ 74 (384) ||||++.++............+.. ......++++++|++||++||+.||++||+++++++++. ....++ T Consensus 1 Mg~f~~~~~f~~~~~~~~~~~~~~-----~~~~~~~~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~ 75 (378) T protein:vir:93 1 MNLFGKVVSFSRGKLNNDTQRVTA-----WQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSD 75 (378) T ss_pred CccchhhhhhhccccCCCcceeee-----cccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccccccccccccccch Confidence 999998876544332222221111 122345688999999999999999999999998876543 234578 Q ss_pred HHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC-CceeeEEEEcCceEEEEEcCCCEEEEEEEcCceEEEEe Q lcl|NC_019422. 75 YIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDY-NMPTQIYPLNALNVEAIYENEVLFLKFLLRNGKIVSYP 153 (384) Q Consensus 75 ~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~-g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~ 153 (384) ++++|+.+||++||+++||+.++.+++++||+|++++++.. |++..++|. +..+.++ T Consensus 76 l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~~g~~~~l~~~----------------------~~~~~~~ 133 (378) T protein:vir:93 76 LDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDLLFA----------------------DDKKEYK 133 (378) T ss_pred HHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEEEec----------------------CCeeEec Confidence 88888889999999999999999999999999999887643 555544432 2234678 Q ss_pred hhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccc-- Q lcl|NC_019422. 154 YSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQI-- 231 (384) Q Consensus 154 ~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~-- 231 (384) ++||||+|. +.++..|.|++..+...+. .++.+ +.|+++|++++.+++++.++++++|++.+++. T Consensus 134 ~~diih~r~--~~~~~~~~s~l~~~~~~i~----------~~~~~-~~~~g~l~~~~~l~~~~~~~~~~~~~~~~~~~~~ 200 (378) T protein:vir:93 134 TEELVRLTS--PFYINEDTSILDNALASIQ----------TKLEQ-GKLRGLLKINAFLDIDNTQEYREKALTTIKNMQE 200 (378) T ss_pred cceeEEecC--ccccchhhHHHHHHHHHHH----------HHHhc-CcccceeeeCCcCCHHHHHHHHHHHHHHHHHhhc Confidence 999999984 4667779999988877653 24445 46899999999999999999999888877542 Q ss_pred cccCCcceecCCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhccccHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019422. 232 DSEAGGAAATDSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQSKYSEDEWNAYYESEIEPVGLQLSNQYTEK 311 (384) Q Consensus 232 ~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~~~~e~~~~~~~~~~i~P~~~~i~~~l~~~ 311 (384) ..++++++++++|++|++++.++.++++.++++++++||++|||||.+|+++++|++..+|++.||.|++.+|+++||++ T Consensus 201 ~~~~~~~~~l~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l~g~~~e~~~~~f~~~tl~P~~~~ie~~l~~k 280 (378) T protein:vir:93 201 GSSYNGLTPVDNKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENILLGTATQEQQIYFYNSTIIPLLIQLEKELTYK 280 (378) T ss_pred ccccccceEcCCCceEEEccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCcHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 24567899999999999999999999998888999999999999999999999999999999999999999999999999 Q ss_pred ccCcccccCc------ceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeecCCCC Q lcl|NC_019422. 312 LFTRKARSFG------NEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 312 l~~~~~~~~~------~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~~ge 384 (384) |+++.++..+ .+++||++.+++.|.++++++++ ++++|++|+||+|+++|+||+||||++++|+|++|++... T Consensus 281 Ll~~~er~~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~ 360 (378) T protein:vir:93 281 LISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDVYIANLNAVAVKNLS 360 (378) T ss_pred cCChhHhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeeccccccccchh Confidence 9998765544 34889999999999999999875 8999999999999999999999999999999999985432 No 55 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=100.00 E-value=9.5e-78 Score=442.85 Aligned_cols=344 Identities=18% Similarity=0.195 Sum_probs=284.3 Q ss_pred CcchhhhcccCCCcchhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcc------eeccch Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEF------KTNPEI 74 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~------~~~~~~ 74 (384) ||||++.++............+.. ......++++++|++||++||++||++||+++++.+++. ....++ T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~~~~~~~~~~~~~~ 75 (378) T protein:vir:16 1 MNLFGKVVSFSRGKLNNDTQRVTA-----WQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSD 75 (378) T ss_pred CccchhhhhhhcccccCCcceeee-----cccchhhHHHHHHHHHHHHHHhhhhhCceeEEEEcccccccccccccccch Confidence 999998776544433322221111 122345688999999999999999999999998876553 234578 Q ss_pred HHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC-CceeeEEEEcCceEEEEEcCCCEEEEEEEcCceEEEEe Q lcl|NC_019422. 75 YIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDY-NMPTQIYPLNALNVEAIYENEVLFLKFLLRNGKIVSYP 153 (384) Q Consensus 75 ~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~-g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~ 153 (384) ++++|+.+||++||+++||+.++.+++++||+|++++++.. |++..++|.+ ..+.++ T Consensus 76 l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~g~~~~l~~~~----------------------~~~~~~ 133 (378) T protein:vir:16 76 LDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDLLFAD----------------------DKKEYK 133 (378) T ss_pred HHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEEEecC----------------------CeeEec Confidence 88989889999999999999999999999999999988754 5555444321 234578 Q ss_pred hhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccc-- Q lcl|NC_019422. 154 YSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQI-- 231 (384) Q Consensus 154 ~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~-- 231 (384) ++||||+|. +.++..|.|++..+...+.. .+. ++.|+++++.++.+++++.++.+++|++.+++. T Consensus 134 ~~diih~r~--~~~~~~~~s~l~~~~~~i~~----------~~~-~~~~~g~l~~~~~l~~~~~~~~~~~~~~~~~~~~~ 200 (378) T protein:vir:16 134 PEELVRLTS--PFYINEDTSILDNALASIQT----------KLE-QGKLRGLLKINAFLDIDNTQEYREKALTTIKNMQE 200 (378) T ss_pred ccceEEecC--ccCccchhHHHHHHHHHHHH----------HHh-cCccceeeEeCCcCCHHHHHHHHHHHHHHHHHhhc Confidence 899999985 45677889999888776532 333 456899999999999999888899998887642 Q ss_pred cccCCcceecCCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhccccHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019422. 232 DSEAGGAAATDSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQSKYSEDEWNAYYESEIEPVGLQLSNQYTEK 311 (384) Q Consensus 232 ~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~~~~e~~~~~~~~~~i~P~~~~i~~~l~~~ 311 (384) ..++++++++++|++|++++.++.++++.++++++++||++|||||.+|+++++|++..+|++.||.|++++|+++|+++ T Consensus 201 ~~~~g~~~vl~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l~g~~~e~~~~~f~~~tl~P~~~~ie~~l~~k 280 (378) T protein:vir:16 201 GSSYNGLTPVDNKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENILLGTASQEQQIYFYNSTIIPLLIQLEKELTYK 280 (378) T ss_pred ccccccceEcCCCceEEEccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCchHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 24568899999999999999999999998889999999999999999999999999999999999999999999999999 Q ss_pred ccCcccccCc------ceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeecCCCC Q lcl|NC_019422. 312 LFTRKARSFG------NEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 312 l~~~~~~~~~------~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~~ge 384 (384) |+++.++..+ .+++||++.+++.|.+++++.++ ++++|++|+||+|+++|+||+||||++++|+|++|++..+ T Consensus 281 Ll~~~e~~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~ 360 (378) T protein:vir:16 281 LISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDVYIANLNAVAVKNLS 360 (378) T ss_pred cCChhhhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeeccccccccchh Confidence 9998765433 35889999999999999999875 8999999999999999999999999999999999986332 No 56 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=100.00 E-value=1.5e-76 Score=436.22 Aligned_cols=361 Identities=16% Similarity=0.163 Sum_probs=305.7 Q ss_pred CcchhhhcccCCCcchhHH--------HhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceecc Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMM--------ELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNP 72 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 72 (384) ||||++.++.+..+..... .+.+.+..+-..+..+++++|+|++||++||+.+|++|+++++. T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~i~~ia~~ia~~p~~~~~~--------- 71 (386) T protein:vir:48 1 MPIFNITNLATESPPISQGGFFDITDPDFLSTLNGSEWVSAESALRNSDLFSIINQLSNDLATVKLTASRK--------- 71 (386) T ss_pred CcccccccccccccccccccccccccchhcccccCCceechhhhhcchHHHHHHHHHHHhhccCceeeccc--------- Confidence 9999987766554433221 12222333444566778999999999999999999999999743 Q ss_pred chHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEE--EEEEEcC---c Q lcl|NC_019422. 73 EIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLF--LKFLLRN---G 147 (384) Q Consensus 73 ~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~--~~~~~~~---g 147 (384) ....|+.+||++||+.+||+.++.+++++||+|++++|+..|.+.+|++++|++|++..+.++.. |.+...+ + T Consensus 72 --~~~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~~~~~ 149 (386) T protein:vir:48 72 --QLQGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDGIYYNITFDDPRIP 149 (386) T ss_pred --hhHHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEecCceeEEEEcCCCceEEEEEEecCcccc Confidence 35679999999999999999999999999999999999999999999999999999998876644 3343333 4 Q ss_pred eEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHH Q lcl|NC_019422. 148 KIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKN 227 (384) Q Consensus 148 ~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~ 227 (384) ..+.++++||||++++++.++++|+||+..+..++....++++++.++|+||++|+++|++++.+++++.+++++.|.+. T Consensus 150 ~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~~~~e~~~~~~~~~~~~ 229 (386) T protein:vir:48 150 PKQHVPQGDVLHFKLLSVDGGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKLSRSRQAM 229 (386) T ss_pred ceeEecCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHh Confidence 56789999999999988888899999999999999999999999999999999999999999999999999988887653 Q ss_pred hccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc----ccHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 228 YLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS----KYSEDEWNAYYESEIEPVGL 302 (384) Q Consensus 228 ~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~----~~~e~~~~~~~~~~i~P~~~ 302 (384) ..++++++|+++|++|++++.++.++|+.+. ++++++||++|||||.+||. ++++++..+|++.||.|+++ T Consensus 230 ----~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~e~~~~~~~~~~l~P~~~ 305 (386) T protein:vir:48 230 ----KQMQGGPLVLDDLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSLDLYNKAVSRYLR 305 (386) T ss_pred ----hcCCCCceecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHH Confidence 3467899999999999999999999999876 56889999999999999974 35688899999999999999 Q ss_pred HHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHH-HHHhCCCCCHHHHHHHhCCCCCCCCCee-eecCceeec Q lcl|NC_019422. 303 QLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLV-QMVDRGSLTPNEWRKIMNLSPIENGDKP-VRRLDTAVV 380 (384) Q Consensus 303 ~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~-~~~~~g~~t~NE~R~~lG~~p~~~gd~~-~~~~n~~~~ 380 (384) .|+++|+++|+++. ++|+....+.|...+...+ +++++|++|+||+|+++|++|++++|.. ....|..|+ T Consensus 306 ~ie~~l~~~l~~~~--------~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~~~~~~~~~~~~~~ 377 (386) T protein:vir:48 306 PFLSELSQKLSCDV--------DADILPAVDPTGSNSVSRINSMVKSGTLAQNQGLYILQQAEILPKELPEGENPNKTTL 377 (386) T ss_pred HHHHHHHHhhcchh--------hcchhhhhccChHHHHHHHHHHHhCCCcCHHHHHHHhhcCCCCCccchhhcCCCCCcc Confidence 99999999998743 4555555666766665554 5889999999999999999999877754 455688999 Q ss_pred CCCC Q lcl|NC_019422. 381 EGGE 384 (384) Q Consensus 381 ~~ge 384 (384) ++|| T Consensus 378 ~gGd 381 (386) T protein:vir:48 378 KGGE 381 (386) T ss_pred CCCC Confidence 9999 No 57 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=100.00 E-value=4.2e-76 Score=433.83 Aligned_cols=359 Identities=16% Similarity=0.171 Sum_probs=301.5 Q ss_pred CcchhhhcccC---C-Ccchh---HHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccc Q lcl|NC_019422. 1 MNIFKSKKKNK---E-APGKV---MMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPE 73 (384) Q Consensus 1 M~~f~~~~~~~---~-~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~ 73 (384) ||||+++...+ . ..... ..........+.+.+.++++++++|++||+.||+++|++||++++.. T Consensus 1 Mg~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~--------- 71 (383) T protein:vir:10 1 MGLLTPKNFSKRNAKNMVYPSNPAFFTTTVGGMQLSYVSALSALQNTNVYSVINRIASDVSSAHFKTENTA--------- 71 (383) T ss_pred CCcccccccccccccccccccchhhhhhhccCccccccchhHhhcchHHHHHHHHHHHhhccCceeecccc--------- Confidence 99998642111 1 11111 12222222233445567899999999999999999999999886422 Q ss_pred hHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEc-CceEEEE Q lcl|NC_019422. 74 IYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLR-NGKIVSY 152 (384) Q Consensus 74 ~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~-~g~~~~~ 152 (384) ...|+.+||++||+++||+.++.+++++||+|++++++ ..+++|+++.+|++..+.++.++.+... +|..+.+ T Consensus 72 --~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~----~~~~~p~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~ 145 (383) T protein:vir:10 72 --TLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ----NLEHIPNSDVQINYLPGNMGIVYTVLESNDRPKMVL 145 (383) T ss_pred --hhhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC----ceeEeecCcceEEEEEcCCceEEEEEEcCCceEEEE Confidence 23366799999999999999999999999999999875 4678999999999999888887776655 5678889 Q ss_pred ehhheEEEeccCC--CCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCC-hHHHHHHHHHHHHHhc Q lcl|NC_019422. 153 PYSDIIHLRKDFN--ENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALR-PDDIKKEVKSFEKNYL 229 (384) Q Consensus 153 ~~~evih~~~~~~--~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~-~e~~~~~~~~~~~~~~ 229 (384) +++||||+|+.++ .++++|+||+.++..++....++++++.++|+||++|++++++++.++ +++.+++++.|++.++ T Consensus 146 ~~~evih~r~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~~e~~~~~~~~~~~~~~ 225 (383) T protein:vir:10 146 RQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANT 225 (383) T ss_pred cccceEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhC Confidence 9999999997654 567899999999999999999999999999999999999999998774 7788999999998886 Q ss_pred cccccCCcceecCCCceeeecccchhHHHH-HHH-HHHHHHHHHHhCCCHHHhccc--------cHHHHHHHHHHHHHHH Q lcl|NC_019422. 230 QIDSEAGGAAATDSKYDAEQVKAESYVPNA-AQM-DKAIQRLYSFFNTNEKIIQSK--------YSEDEWNAYYESEIEP 299 (384) Q Consensus 230 ~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~-~~~-~~~~~~I~~~fgvp~~~l~~~--------~~e~~~~~~~~~~i~P 299 (384) + .++++++++++|++|++++.++.++++ .+. ++++++||++|||||.+||+. +.|++. .++..||.| T Consensus 226 ~--~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~eq~~-~~~~~~l~P 302 (383) T protein:vir:10 226 G--DNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIK-ATYLANLNS 302 (383) T ss_pred c--cccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHHHHH-HHHHHHHHH Confidence 5 578899999999999999999999996 465 678999999999999999852 234444 455679999 Q ss_pred HHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCcee Q lcl|NC_019422. 300 VGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTA 378 (384) Q Consensus 300 ~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~ 378 (384) +++.|+++|+++|+.+ +++||++.+++.|.+++++.++ ++++|++|+||+|+++|++|+|+||.+....+.. T Consensus 303 ~~~~ie~~l~~~l~~~-------~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~~~~ 375 (383) T protein:vir:10 303 YVNPIVDELRLKMNAP-------DLELDIKDMLDVDDSILINQVSNLAKSGVLGAEQAQFILTRSGFLPDNLPEFKPLTN 375 (383) T ss_pred HHHHHHHHHHHhhCCc-------eEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCcccCCcccccCCCcc Confidence 9999999999999753 5899999999999999999764 8999999999999999999999999999999999 Q ss_pred ecCCCC Q lcl|NC_019422. 379 VVEGGE 384 (384) Q Consensus 379 ~~~~ge 384 (384) ++++|| T Consensus 376 ~~~gGd 381 (383) T protein:vir:10 376 ETKGGD 381 (383) T ss_pred cCCCCC Confidence 999999 No 58 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=100.00 E-value=4.4e-76 Score=433.72 Aligned_cols=357 Identities=16% Similarity=0.180 Sum_probs=298.4 Q ss_pred CcchhhhcccCCCc-------chhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccc Q lcl|NC_019422. 1 MNIFKSKKKNKEAP-------GKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPE 73 (384) Q Consensus 1 M~~f~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~ 73 (384) ||||+++...+... ....+...........++.++++++++|++||++||+.+|++||++++.. T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~v~~~~--------- 71 (385) T protein:vir:10 1 MGLLTPRNFNKRKAKNMVYPSNPAFFTTTVGGMQLSYVSALSALQNTNVYSVINRIASDVASAHFKTENTA--------- 71 (385) T ss_pred CccccchhcccccccccccccchhhhhhhccccCccccCHHHhhccHHHHHHHHHHHHHHhhCceeeeccc--------- Confidence 99998653222211 11122223333334456678899999999999999999999999986422 Q ss_pred hHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEcC-ceEEEE Q lcl|NC_019422. 74 IYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLRN-GKIVSY 152 (384) Q Consensus 74 ~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~-g~~~~~ 152 (384) .+.|+.+||++||+++||+.++.+++++||||++++++ ..+++|+++.+|++..+.++..|++...+ +..+.+ T Consensus 72 --~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~----~~~~~p~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~ 145 (385) T protein:vir:10 72 --TLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ----NLEHIPNSDVQINYLPGNMGIVYTVLESNDRPQMVL 145 (385) T ss_pred --hhhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC----ceeEeecCCceEEEEEcCCceEEEEEEcCCceEEEE Confidence 34467799999999999999999999999999999975 46799999999999999888888776654 577889 Q ss_pred ehhheEEEeccCC--CCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCC-hHHHHHHHHHHHHHhc Q lcl|NC_019422. 153 PYSDIIHLRKDFN--ENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALR-PDDIKKEVKSFEKNYL 229 (384) Q Consensus 153 ~~~evih~~~~~~--~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~-~e~~~~~~~~~~~~~~ 229 (384) +++||||+|+.++ .++++|+||+..+..++....++++++.++|+||++|++++++++.+. +++.+++++.|++.++ T Consensus 146 ~~~eiihik~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~ 225 (385) T protein:vir:10 146 RQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANT 225 (385) T ss_pred ccccEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhC Confidence 9999999998654 568899999999999999999999999999999999999999998764 6789999999998886 Q ss_pred cccccCCcceecCCCceeeecccchhHHHH-HHH-HHHHHHHHHHhCCCHHHhccc--------cHHHHHHHHHHHHHHH Q lcl|NC_019422. 230 QIDSEAGGAAATDSKYDAEQVKAESYVPNA-AQM-DKAIQRLYSFFNTNEKIIQSK--------YSEDEWNAYYESEIEP 299 (384) Q Consensus 230 ~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~-~~~-~~~~~~I~~~fgvp~~~l~~~--------~~e~~~~~~~~~~i~P 299 (384) + .++++++++++|++|++++.++.++++ .+. ++++++||++|||||.+||++ +.| +...++..||.| T Consensus 226 ~--~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~e-q~~~~~~~~l~P 302 (385) T protein:vir:10 226 G--DNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNID-QIKATYLANLNS 302 (385) T ss_pred c--cccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHH-HHHHHHHHHHHH Confidence 5 577899999999999999999999996 465 678999999999999999752 234 445566779999 Q ss_pred HHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCC--CCeeeecCc Q lcl|NC_019422. 300 VGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIEN--GDKPVRRLD 376 (384) Q Consensus 300 ~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~--gd~~~~~~n 376 (384) +++.|+++|+++|+++ .++||++++++.|.+++++.++ ++++|++|+||+|+++|++|+|+ ||++.+|.| T Consensus 303 ~~~~ie~~l~~~l~~~-------~~~f~~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~p~~~~~~~~~~~~ 375 (385) T protein:vir:10 303 YVNPIVDELRLKMNAP-------DLELDIKDMLDVDDSALINQVSNLAKSGVLGAEQAQFILTRSGFLPDNLPEFKPLTT 375 (385) T ss_pred HHHHHHHHHHHhhCCc-------eEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCccCCCCCccccCccc Confidence 9999999999999753 4899999999999999999876 89999999999999999999964 567777766 Q ss_pred eeecCCCC Q lcl|NC_019422. 377 TAVVEGGE 384 (384) Q Consensus 377 ~~~~~~ge 384 (384) . +++|| T Consensus 376 ~--~~~g~ 381 (385) T protein:vir:10 376 Q--VKGGD 381 (385) T ss_pred c--cCCCC Confidence 4 56666 No 59 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=100.00 E-value=2.6e-76 Score=434.94 Aligned_cols=363 Identities=12% Similarity=0.128 Sum_probs=295.1 Q ss_pred CcchhhhcccCCCcchhH-HHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVM-MELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFL 79 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l 79 (384) ||||+++++....+.... ... ......++++++++|++||++||+++|++||+++++++ ...++++++| T Consensus 1 Mg~f~~~f~~~~~~~~~~~~~~------~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~----~~~~~l~~lL 70 (385) T protein:vir:95 1 MGLFDSVFKRHSELSWMYDLEF------LQDKSKKAYLKQIALNTVVEMVARTISQSEFRVMKNNT----KEKGTLYYLL 70 (385) T ss_pred CchhhhhhccCcccccccchhh------hhccchhhhhhhHHHHHHHHHHHHHHcccceeeeecCc----cccchHHHHH Confidence 999998766544432221 111 11234567899999999999999999999999997553 3457888889 Q ss_pred HhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEcCceEEEEehhheEE Q lcl|NC_019422. 80 LENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLRNGKIVSYPYSDIIH 159 (384) Q Consensus 80 ~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~~~evih 159 (384) +.+||++||+++||+.++.+++++||||+++.++. +.+..++++.+..+....+. .........+..+.++++|||| T Consensus 71 ~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~~~~~-~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~eiih 147 (385) T protein:vir:95 71 NVRPNRNQNAVDFWQKFIFKLIMDNEVLVVKNDEG-HFFVADDFEKEDELGLYSHR--FTNVLVNDFEFKRVFTMDDVIY 147 (385) T ss_pred hcccCcCCCHHHHHHHHHHHHhhcCceEEEEecCC-Ceeecccccccccccccccc--ceeeeecccceeeeeccccEEE Confidence 89999999999999999999999999999887654 34555555556555443322 1111222234556789999999 Q ss_pred EeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCC--CCChHHHHHHHHHHHHHhccccccCCc Q lcl|NC_019422. 160 LRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKT--ALRPDDIKKEVKSFEKNYLQIDSEAGG 237 (384) Q Consensus 160 ~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~--~~~~e~~~~~~~~~~~~~~~~~~~~~~ 237 (384) ++++++.+..+|.||+..+..++....+.. .+++.|++++++++ .+++++.+++++.|++.++|..++.++ T Consensus 148 ~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~-------~~~~~~~g~l~~~~~~~~~~e~~~~~~~~~~~~~~g~~~~~~~ 220 (385) T protein:vir:95 148 LKYNNQKLDAFSLGLFEDYGEIFGRMIDLQ-------MLNNQIRGILKVDATKFYNKEKQKELQAYIDTLFDAFQNNTIA 220 (385) T ss_pred ecCCCCCcccccchHHHHHHHHHHHHHHHH-------HhcCCCceEEEeCCccCCCHHHHHHHHHHHHHHhhhhhhcCCc Confidence 999988888999999999999887655432 23345789998864 578899999999999999887777778 Q ss_pred ceecCCCceeeeccc------chhHHHHHHH-HHHHHHHHHHhCCCHHHhccc--cHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 238 AAATDSKYDAEQVKA------ESYVPNAAQM-DKAIQRLYSFFNTNEKIIQSK--YSEDEWNAYYESEIEPVGLQLSNQY 308 (384) Q Consensus 238 ~~v~~~g~~~~~l~~------~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~~--~~e~~~~~~~~~~i~P~~~~i~~~l 308 (384) ++++++|++|++++. ++.|+++.+. ++++++||++|||||.+|+++ +.+++..+|++.||.|++.+|+++| T Consensus 221 i~~l~~g~~~~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l 300 (385) T protein:vir:95 221 VVPLTEGLAYEEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLVLGEMADLEKTIESYLQFCINPLLRKIEAEL 300 (385) T ss_pred eEEcCCCceeEeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCcCHHHHHHHHHHHHHHHHHHHHHHHH Confidence 999999999999975 4568899877 578999999999999999875 4588999999999999999999999 Q ss_pred hhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCC--CCCCeeeecCceeecC---C Q lcl|NC_019422. 309 TEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPI--ENGDKPVRRLDTAVVE---G 382 (384) Q Consensus 309 ~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~--~~gd~~~~~~n~~~~~---~ 382 (384) |++|+++.++. +.+++||++++++.|.++++++++ ++++|++|+||+|+++|+||+ ||||++++|+|+++++ + T Consensus 301 ~~~L~~~~~~~-~~~~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~NE~R~~~g~~p~~~~~gd~~~~~~n~~~~~~~kg 379 (385) T protein:vir:95 301 NSKFFYQDEYL-NDDMHIKVVGIDKRDPLKLSEAIDKLVASGTFTRNQVRIMTGEEPADDPELDKFIITKNLQSADAFKG 379 (385) T ss_pred HhhcCChhhcc-cceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecccceecccccC Confidence 99999988764 448999999999999999999875 899999999999999999998 6899999999999875 6 Q ss_pred CC Q lcl|NC_019422. 383 GE 384 (384) Q Consensus 383 ge 384 (384) || T Consensus 380 ge 381 (385) T protein:vir:95 380 GE 381 (385) T ss_pred CC Confidence 66 No 60 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=100.00 E-value=5.3e-76 Score=433.29 Aligned_cols=364 Identities=13% Similarity=0.152 Sum_probs=292.0 Q ss_pred CcchhhhcccC--CCcc-hhHHHhhcc--ccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchH Q lcl|NC_019422. 1 MNIFKSKKKNK--EAPG-KVMMELISD--SGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIY 75 (384) Q Consensus 1 M~~f~~~~~~~--~~~~-~~~~~~~~~--~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~ 75 (384) ||||++.++.. .... .+....+.. +..+..++.++++++++|++||+.||+.||++||+++++++ . +..++. T Consensus 1 MGl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vt~~~al~~~~v~~~i~~Ia~~iA~lp~~v~~~~g--~-~~~~~~ 77 (394) T protein:vir:62 1 MGLRDRFSNYLFKKAEKRGYLDNVLGKSIRYSGVYVTDSNILQSSDVYELLQDISNQMVLADIVVEDEFG--N-EIKDDI 77 (394) T ss_pred CchhhhhhhhccCCCCchhhhhhhhhcccccCccccChhhhhccHHHHHHHHHHHHhhcccceEEEcCCC--c-ccchhh Confidence 99999764332 1111 122332222 23344466778999999999999999999999999987543 2 345677 Q ss_pred HHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEcCceEEEEehh Q lcl|NC_019422. 76 IKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLRNGKIVSYPYS 155 (384) Q Consensus 76 ~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~~~ 155 (384) .+.|+.+||++||+++||+.++.+++++||+|+++.++..+.+ ..+.+..++.+. +.|.. +| +.++++ T Consensus 78 ~~~Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~~~~~--------~~~~~~~~~~~~-~~~~~-~~--~~~~~~ 145 (394) T protein:vir:62 78 ALQILRNPNNYLTQSEFIKLMTNTYLLEGETFPILNGAQIHLA--------SNVFTELDDNLV-EHFNI-GG--HEIPPC 145 (394) T ss_pred HHHHhccCCCCCCHHHHHHHHHHHHHhcCCeEEEEecceeecc--------ccceEEECCceE-EEEee-CC--EEechh Confidence 7888899999999999999999999999999999986554422 244555554433 23322 22 568999 Q ss_pred heEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCC--hHHHHHHHHHHHHHhccccc Q lcl|NC_019422. 156 DIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALR--PDDIKKEVKSFEKNYLQIDS 233 (384) Q Consensus 156 evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~--~e~~~~~~~~~~~~~~~~~~ 233 (384) ||||+|++ +.++++|+||+..+..++....++++++.++|+||++|++++++++.++ +++++++++.|.+.++| .. T Consensus 146 eiih~r~~-~~d~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~~~g-~~ 223 (394) T protein:vir:62 146 MIRHVKNI-GADHLRGKGILDLGRDTLEGVMSAEKTLTDKYKKGGLLTFLLNLDAHINPQNGAQSKLINAILDQLES-ID 223 (394) T ss_pred heEEecCc-CCCCccccChHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCCCCcCHHHHHHHHHHHHHHhcc-cc Confidence 99999976 4788999999999999999999999999999999999999999998776 44578899999999987 45 Q ss_pred cCCcceecCCCc--eeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc---ccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 234 EAGGAAATDSKY--DAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS---KYSEDEWNAYYESEIEPVGLQLSNQ 307 (384) Q Consensus 234 ~~~~~~v~~~g~--~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~---~~~e~~~~~~~~~~i~P~~~~i~~~ 307 (384) ++|+++|++.|. ++++++.++.++++.+. ++++++||++|||||.+||+ ++.|++..+|++.||.|++++||++ T Consensus 224 n~g~~~vl~~g~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~sn~e~~~~~~~~~~l~P~~~~ie~~ 303 (394) T protein:vir:62 224 EARSVKMIPLGKGYSIDTLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTELIKEDIEKAMMYIHNKAVRPIMKNFEDH 303 (394) T ss_pred ccCceeEeeCCCceeEEecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcCHHHHHHHHHHHHHHHHHHHHHHH Confidence 678888887776 55688888999999877 56789999999999999976 5678899999999999999999999 Q ss_pred HhhcccCcccccCcceEEeechhhhccCHHHHHHHH-HHHhCCCCCHHHHHHHhCCCCC--CCCCeeeecCceeecC--- Q lcl|NC_019422. 308 YTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLV-QMVDRGSLTPNEWRKIMNLSPI--ENGDKPVRRLDTAVVE--- 381 (384) Q Consensus 308 l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~-~~~~~g~~t~NE~R~~lG~~p~--~~gd~~~~~~n~~~~~--- 381 (384) |+++|+++.+. ...+|+||...++.. .++++++ +++++|++|+||+|+++|+||+ |+||++++++|+++++ T Consensus 304 l~~kll~~~~~-~~~~~~fd~~~~~~~--~~~~~~~~~~~~~g~~T~NE~R~~~gl~p~~~~~gd~~~~~~n~~~~~~~~ 380 (394) T protein:vir:62 304 LSLLFYAQNSG-KRIKFKINILDFVTY--SNKTNIGYNLVRTAITSPDNVADMLGFPKQNTKESQAIYISNDVTEIGKKE 380 (394) T ss_pred HhhhhcCcccc-CceEEEechhhhcCH--HHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeecccccccccccc Confidence 99999988664 356777777776554 4566665 5899999999999999999999 6799999999998874 Q ss_pred -------CCC Q lcl|NC_019422. 382 -------GGE 384 (384) Q Consensus 382 -------~ge 384 (384) +|| T Consensus 381 ~~~~~~kgge 390 (394) T protein:vir:62 381 ATDGSLGGGE 390 (394) T ss_pred cccccCCCCC Confidence 455 No 61 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=100.00 E-value=7.2e-76 Score=432.55 Aligned_cols=361 Identities=16% Similarity=0.148 Sum_probs=295.4 Q ss_pred CcchhhhcccCCCcchhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLL 80 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~ 80 (384) ||||+++++.+....... ++..+...+..+++++++|++||++||+++|++||++++++ +...++.+++|+ T Consensus 1 Mg~f~~lf~~~~~~~~~~-----~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~----~~~~~~~~~ll~ 71 (395) T protein:vir:10 1 MSILEKIFKTRKDITYML-----DLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGN----RIQKNDVYYKLN 71 (395) T ss_pred CchhhhhhccCccccccc-----cchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCC----ccccchHHHHHH Confidence 999998766654433211 11223344567789999999999999999999999998643 456788899999 Q ss_pred hhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEc-CceEEEEehhheEE Q lcl|NC_019422. 81 ENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLR-NGKIVSYPYSDIIH 159 (384) Q Consensus 81 ~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~-~g~~~~~~~~evih 159 (384) .+||++||+++||+.++.++++.|++|+++.++. .++++++..+++....+...+.+... .+....++++|||| T Consensus 72 ~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~evih 146 (395) T protein:vir:10 72 IKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK-----ELLIADSFYREEYALYDDIFKDVTVKDYTYQRTFTMQEVIY 146 (395) T ss_pred hccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC-----CeEecCCccceeEeecCcceeEEEEcCceeeeeeccccEEE Confidence 9999999999999999999999999988776543 35667666666655554444433333 34567899999999 Q ss_pred EeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCC-CChHHHHHHHHHHHHHhccccccCCcc Q lcl|NC_019422. 160 LRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTA-LRPDDIKKEVKSFEKNYLQIDSEAGGA 238 (384) Q Consensus 160 ~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~-~~~e~~~~~~~~~~~~~~~~~~~~~~~ 238 (384) ++++++.+..+|.||+..+..++.... +.|.+|+.|+++|.+++. +++++.+++++.|++.+.+...+...+ T Consensus 147 ~~~~~~~~~~~G~spi~~~~~~~~~~~-------~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v 219 (395) T protein:vir:10 147 LKYNNNKVTHFVESLFEDYGKIFGRMI-------GAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAI 219 (395) T ss_pred EccCCCCcccccchHHHHHHHHHHHHH-------HHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcce Confidence 999998889999999999988876543 456788889999999765 688889999999988887654444557 Q ss_pred eecCCCceeeecccchhHH-----HHHHH-HHHHHHHHHHhCCCHHHhccccH--HHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019422. 239 AATDSKYDAEQVKAESYVP-----NAAQM-DKAIQRLYSFFNTNEKIIQSKYS--EDEWNAYYESEIEPVGLQLSNQYTE 310 (384) Q Consensus 239 ~v~~~g~~~~~l~~~~~~~-----~~~~~-~~~~~~I~~~fgvp~~~l~~~~~--e~~~~~~~~~~i~P~~~~i~~~l~~ 310 (384) +++++|++|++++.++.++ |+.+. ++++++||++|||||.+||++.+ +++..+|+++||.|++.+||++||+ T Consensus 220 ~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~ 299 (395) T protein:vir:10 220 APLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYGETADLEKNTLVFEKFCLTPLLKKIQNELNA 299 (395) T ss_pred EEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcCcccCHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7789999999999887664 66665 57899999999999999987654 8999999999999999999999999 Q ss_pred cccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCC--CeeeecCceeecCCCC Q lcl|NC_019422. 311 KLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENG--DKPVRRLDTAVVEGGE 384 (384) Q Consensus 311 ~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~g--d~~~~~~n~~~~~~ge 384 (384) +|+++.++..+ ++||++.+++.|.++++++++ ++++|++|+||+|+++|+||+|+| |++++|+|+++++.++ T Consensus 300 kL~~~~~~~~~--~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~~~~ 374 (395) T protein:vir:10 300 KLITQSMYLKD--TRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYLITKNYEKANSGE 374 (395) T ss_pred hhcChhhhccc--ceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeeccccccccccc Confidence 99998776554 578999999999999999875 899999999999999999999765 9999999999987555 No 62 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=100.00 E-value=7.2e-76 Score=432.55 Aligned_cols=361 Identities=16% Similarity=0.148 Sum_probs=295.4 Q ss_pred CcchhhhcccCCCcchhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLL 80 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~ 80 (384) ||||+++++.+....... ++..+...+..+++++++|++||++||+++|++||++++++ +...++.+++|+ T Consensus 1 Mg~f~~lf~~~~~~~~~~-----~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~----~~~~~~~~~ll~ 71 (395) T protein:vir:95 1 MSILEKIFKTRKDITYML-----DLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGN----RIQKNDVYYKLN 71 (395) T ss_pred CchhhhhhccCccccccc-----cchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCC----ccccchHHHHHH Confidence 999998766654433211 11223344567789999999999999999999999998643 456788899999 Q ss_pred hhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEc-CceEEEEehhheEE Q lcl|NC_019422. 81 ENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLR-NGKIVSYPYSDIIH 159 (384) Q Consensus 81 ~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~-~g~~~~~~~~evih 159 (384) .+||++||+++||+.++.++++.|++|+++.++. .++++++..+++....+...+.+... .+....++++|||| T Consensus 72 ~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~evih 146 (395) T protein:vir:95 72 IKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK-----ELLIADSFYREEYALYDDIFKDVTVKDYTYQRTFTMQEVIY 146 (395) T ss_pred hccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC-----CeEecCCccceeEeecCcceeEEEEcCceeeeeeccccEEE Confidence 9999999999999999999999999988776543 35667666666655554444433333 34567899999999 Q ss_pred EeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCC-CChHHHHHHHHHHHHHhccccccCCcc Q lcl|NC_019422. 160 LRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTA-LRPDDIKKEVKSFEKNYLQIDSEAGGA 238 (384) Q Consensus 160 ~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~-~~~e~~~~~~~~~~~~~~~~~~~~~~~ 238 (384) ++++++.+..+|.||+..+..++.... +.|.+|+.|+++|.+++. +++++.+++++.|++.+.+...+...+ T Consensus 147 ~~~~~~~~~~~G~spi~~~~~~~~~~~-------~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v 219 (395) T protein:vir:95 147 LKYNNNKVTHFVESLFEDYGKIFGRMI-------GAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAI 219 (395) T ss_pred EccCCCCcccccchHHHHHHHHHHHHH-------HHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcce Confidence 999998889999999999988876543 456788889999999765 688889999999988887654444557 Q ss_pred eecCCCceeeecccchhHH-----HHHHH-HHHHHHHHHHhCCCHHHhccccH--HHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019422. 239 AATDSKYDAEQVKAESYVP-----NAAQM-DKAIQRLYSFFNTNEKIIQSKYS--EDEWNAYYESEIEPVGLQLSNQYTE 310 (384) Q Consensus 239 ~v~~~g~~~~~l~~~~~~~-----~~~~~-~~~~~~I~~~fgvp~~~l~~~~~--e~~~~~~~~~~i~P~~~~i~~~l~~ 310 (384) +++++|++|++++.++.++ |+.+. ++++++||++|||||.+||++.+ +++..+|+++||.|++.+||++||+ T Consensus 220 ~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~ 299 (395) T protein:vir:95 220 APLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYGETADLEKNTLVFEKFCLTPLLKKIQNELNA 299 (395) T ss_pred EEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcCcccCHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7789999999999887664 66665 57899999999999999987654 8999999999999999999999999 Q ss_pred cccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCC--CeeeecCceeecCCCC Q lcl|NC_019422. 311 KLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENG--DKPVRRLDTAVVEGGE 384 (384) Q Consensus 311 ~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~g--d~~~~~~n~~~~~~ge 384 (384) +|+++.++..+ ++||++.+++.|.++++++++ ++++|++|+||+|+++|+||+|+| |++++|+|+++++.++ T Consensus 300 kL~~~~~~~~~--~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~~~~ 374 (395) T protein:vir:95 300 KLITQSMYLKD--TRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYLITKNYEKANSGE 374 (395) T ss_pred hhcChhhhccc--ceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeeccccccccccc Confidence 99998776554 578999999999999999875 899999999999999999999765 9999999999987555 No 63 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=100.00 E-value=7.2e-76 Score=432.55 Aligned_cols=361 Identities=16% Similarity=0.148 Sum_probs=295.4 Q ss_pred CcchhhhcccCCCcchhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLL 80 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~ 80 (384) ||||+++++.+....... ++..+...+..+++++++|++||++||+++|++||++++++ +...++.+++|+ T Consensus 1 Mg~f~~lf~~~~~~~~~~-----~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~----~~~~~~~~~ll~ 71 (395) T protein:vir:10 1 MSILEKIFKTRKDITYML-----DLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGN----RIQKNDVYYKLN 71 (395) T ss_pred CchhhhhhccCccccccc-----cchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCC----ccccchHHHHHH Confidence 999998766654433211 11223344567789999999999999999999999998643 456788899999 Q ss_pred hhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEc-CceEEEEehhheEE Q lcl|NC_019422. 81 ENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLR-NGKIVSYPYSDIIH 159 (384) Q Consensus 81 ~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~-~g~~~~~~~~evih 159 (384) .+||++||+++||+.++.++++.|++|+++.++. .++++++..+++....+...+.+... .+....++++|||| T Consensus 72 ~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~evih 146 (395) T protein:vir:10 72 IKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK-----ELLIADSFYREEYALYDDIFKDVTVKDYTYQRTFTMQEVIY 146 (395) T ss_pred hccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC-----CeEecCCccceeEeecCcceeEEEEcCceeeeeeccccEEE Confidence 9999999999999999999999999988776543 35667666666655554444433333 34567899999999 Q ss_pred EeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCC-CChHHHHHHHHHHHHHhccccccCCcc Q lcl|NC_019422. 160 LRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTA-LRPDDIKKEVKSFEKNYLQIDSEAGGA 238 (384) Q Consensus 160 ~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~-~~~e~~~~~~~~~~~~~~~~~~~~~~~ 238 (384) ++++++.+..+|.||+..+..++.... +.|.+|+.|+++|.+++. +++++.+++++.|++.+.+...+...+ T Consensus 147 ~~~~~~~~~~~G~spi~~~~~~~~~~~-------~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v 219 (395) T protein:vir:10 147 LKYNNNKVTHFVESLFEDYGKIFGRMI-------GAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAI 219 (395) T ss_pred EccCCCCcccccchHHHHHHHHHHHHH-------HHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcce Confidence 999998889999999999988876543 456788889999999765 688889999999988887654444557 Q ss_pred eecCCCceeeecccchhHH-----HHHHH-HHHHHHHHHHhCCCHHHhccccH--HHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019422. 239 AATDSKYDAEQVKAESYVP-----NAAQM-DKAIQRLYSFFNTNEKIIQSKYS--EDEWNAYYESEIEPVGLQLSNQYTE 310 (384) Q Consensus 239 ~v~~~g~~~~~l~~~~~~~-----~~~~~-~~~~~~I~~~fgvp~~~l~~~~~--e~~~~~~~~~~i~P~~~~i~~~l~~ 310 (384) +++++|++|++++.++.++ |+.+. ++++++||++|||||.+||++.+ +++..+|+++||.|++.+||++||+ T Consensus 220 ~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~ 299 (395) T protein:vir:10 220 APLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYGETADLEKNTLVFEKFCLTPLLKKIQNELNA 299 (395) T ss_pred EEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcCcccCHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7789999999999887664 66665 57899999999999999987654 8999999999999999999999999 Q ss_pred cccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCC--CeeeecCceeecCCCC Q lcl|NC_019422. 311 KLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENG--DKPVRRLDTAVVEGGE 384 (384) Q Consensus 311 ~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~g--d~~~~~~n~~~~~~ge 384 (384) +|+++.++..+ ++||++.+++.|.++++++++ ++++|++|+||+|+++|+||+|+| |++++|+|+++++.++ T Consensus 300 kL~~~~~~~~~--~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~~~~ 374 (395) T protein:vir:10 300 KLITQSMYLKD--TRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYLITKNYEKANSGE 374 (395) T ss_pred hhcChhhhccc--ceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeeccccccccccc Confidence 99998776554 578999999999999999875 899999999999999999999765 9999999999987555 No 64 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=100.00 E-value=5.2e-75 Score=427.83 Aligned_cols=379 Identities=11% Similarity=0.065 Sum_probs=291.3 Q ss_pred Ccchhhhccc--CCC--cchhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcc----eecc Q lcl|NC_019422. 1 MNIFKSKKKN--KEA--PGKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEF----KTNP 72 (384) Q Consensus 1 M~~f~~~~~~--~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~----~~~~ 72 (384) |++.+.-... +.. +...+...++....+-.+..-.......|++|+..|+..+|++||+|++++.++. +... T Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~ 142 (574) T protein:vir:80 63 IGEMSVNPGYKTKPSIRNSQDLHKTLKKFGNNIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIAN 142 (574) T ss_pred hhhccccccccCcCccCCcccHHHHHHhhccChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCccchhhhh Confidence 2222111000 000 0001111111111111111111223456777888888888999999987765432 2334 Q ss_pred chHHHHHHh----hccccC-CHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC------EEEE Q lcl|NC_019422. 73 EIYIKFLLE----NPNPFM-SGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV------LFLK 141 (384) Q Consensus 73 ~~~~~~l~~----~PN~~~-s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~------~~~~ 141 (384) .+.++.|+. .|||++ |+.+||+.++.+++++||+|++++++..|.+.+||||+|.+|++..+.++ ..|+ T Consensus 143 ~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv~~lll~Gnayi~i~r~~~G~~~~L~pl~p~~V~v~~d~~~~~~~~~~~y~ 222 (574) T protein:vir:80 143 IKRIESFLENTAQFRDPNRDNFTTFCKKLVRATYMYDQVNFEKVFDKDGNFIKFDTVDPTTIFLATNGEGKLIKNGERFV 222 (574) T ss_pred hhHHHHHHhccCCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCccccccCceEEE Confidence 455555554 356665 78899999999999999999999999999999999999999999987665 3455 Q ss_pred EEEcCceEEEEehhheEEEeccCCC---CCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCC--CCChHH Q lcl|NC_019422. 142 FLLRNGKIVSYPYSDIIHLRKDFNE---NDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKT--ALRPDD 216 (384) Q Consensus 142 ~~~~~g~~~~~~~~evih~~~~~~~---~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~--~~~~e~ 216 (384) +...++....++++||||++++... ++.+|+||+..+..+|....++++++.++|+||++|+++|++++ .+++++ T Consensus 223 ~~~~g~~~~~~~~~eiih~~~~~~~~~~~~~~G~spi~~a~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~~~~~ls~e~ 302 (574) T protein:vir:80 223 QVIDNRIVAKFNERELAFAVRNPRADIEVGQYGYPELEIALKQFIAHENTEVFNDRFFSHGGTTRGILHVKTGQQQSQQA 302 (574) T ss_pred EEeCCceEEEEccccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHH Confidence 5556677888999999999976543 46789999999999999999999999999999999999999864 479999 Q ss_pred HHHHHHHHHHHhccccccCCcc-eecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc------------ Q lcl|NC_019422. 217 IKKEVKSFEKNYLQIDSEAGGA-AATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS------------ 282 (384) Q Consensus 217 ~~~~~~~~~~~~~~~~~~~~~~-~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~------------ 282 (384) .+++++.|.+.++|. .++|++ +++++|++|++++.++.|+++.+. ++++++||++|||||.+||. T Consensus 303 ~~~lk~~~~~~~~G~-~n~g~~~vl~~~G~~~~~l~~s~~D~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~ 381 (574) T protein:vir:80 303 LDIFRREWRSSLAGI-NGSWQIPVVSAEDVKFVNMTPSANDMQFEKWLNYLINVISALYGIDPAEINFPNNGGATGSKGG 381 (574) T ss_pred HHHHHHHHHHHhccc-cccccceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhcccccccccccccc Confidence 999999999999874 456775 566889999999999999999876 57899999999999999962 Q ss_pred ----ccHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHH Q lcl|NC_019422. 283 ----KYSEDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRK 358 (384) Q Consensus 283 ----~~~e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~ 358 (384) ++.|++..+|++.||.|++..||++||++|++.++. ..+++|+..+++.. .+..++.+++.+||||+||+|+ T Consensus 382 ~~n~sn~E~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~--~~~~~f~~~d~~~~--~~~~~~~~~~~~G~lT~NE~R~ 457 (574) T protein:vir:80 382 SLNEGNSKEKMQASQNKGLQPLLRFIEDTVNTYIVAEFGE--KYQFQFRGGDLSAQ--LDKLKIIEQEGKVFRTVNEIRH 457 (574) T ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCC--ceEEEecccchhhH--HHHHHHHHHHhCCccCHHHHHH Confidence 456889999999999999999999999999987653 46788887776543 3334445677889999999999 Q ss_pred HhCCCCCCCCCeeeecCceeecCCCC Q lcl|NC_019422. 359 IMNLSPIENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 359 ~lG~~p~~~gd~~~~~~n~~~~~~ge 384 (384) ++|+||+||||++++|+|+++++... T Consensus 458 ~lgl~Pi~gGD~~~~~~n~~~~~~~~ 483 (574) T protein:vir:80 458 DKGLEPIKGGDVILNGVHIQAIGQAL 483 (574) T ss_pred HhCCCCCCCCCEeeeccceeeccccc Confidence 99999999999999999999886332 No 65 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=100.00 E-value=8.1e-76 Score=432.27 Aligned_cols=362 Identities=15% Similarity=0.135 Sum_probs=284.0 Q ss_pred CcchhhhcccCCCcchhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLL 80 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~ 80 (384) ||||++.+++........ +....-..+...++++++|++||++||+++|++||++++++ ++..++++++|+ T Consensus 1 Mg~f~~l~~~~~~~~~~~-----~~~~~~~~~~~~~l~~~~v~~~i~~Ia~~ia~~p~~~~~~~----~~~~~~l~~ll~ 71 (376) T protein:vir:78 1 MGFFSELFKRNKEIEWMW-----DLDFLEDKTTKVYLKKMALNTCVKHIARTIAKSDFRLKNGE----TSVRDKLYYKLN 71 (376) T ss_pred CchhhhhhccCCcccccc-----chhhccccchhhhhhhHHHHHHHHHHHHhhcccceeecccc----ccccchHHHHHh Confidence 999998766544322111 11111223456789999999999999999999999998643 456788899999 Q ss_pred hhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEcCceEEEEehhheEEE Q lcl|NC_019422. 81 ENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLRNGKIVSYPYSDIIHL 160 (384) Q Consensus 81 ~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~~~evih~ 160 (384) .+||++||+++||+.++.+++++||+|+++.++..|.+.+++++.+..+........... ..+....++++||||+ T Consensus 72 ~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~evih~ 147 (376) T protein:vir:78 72 IRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTDDFLIADSYVRKEFAFFPDVFEGVTVK----DYRYNRNFSMDDVIFL 147 (376) T ss_pred hccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCCCeeeccceeecccceeeeeeeeeeee----cceeeeeeccccEEEe Confidence 999999999999999999999999999999999999999999999987755433222111 1123457899999999 Q ss_pred eccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCccee Q lcl|NC_019422. 161 RKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAA 240 (384) Q Consensus 161 ~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v 240 (384) +++.......+.++...+...+... .....+.++.++.++++.++.+++++.+++++.|++.+++..++.+++++ T Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~~~~~~v~~ 222 (376) T protein:vir:78 148 EYGNERLSAFTDGMFEDYGELFGKM-----IRAQMRNFQIRGAVNFKMAGVADKDKQTKLQEYIDKVYASFNNNEIAIVP 222 (376) T ss_pred ccCCCCchhhhhHHHHHHHHHHHHH-----HHHHHhcCCCceeEEEccCCCCCHHHHHHHHHHHHHHhccccccCcceEE Confidence 9765432223323333332222111 11223344555555566778899999999999999999887677778899 Q ss_pred cCCCceeeecccchhHH-----HHHHH-HHHHHHHHHHhCCCHHHhccccH--HHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_019422. 241 TDSKYDAEQVKAESYVP-----NAAQM-DKAIQRLYSFFNTNEKIIQSKYS--EDEWNAYYESEIEPVGLQLSNQYTEKL 312 (384) Q Consensus 241 ~~~g~~~~~l~~~~~~~-----~~~~~-~~~~~~I~~~fgvp~~~l~~~~~--e~~~~~~~~~~i~P~~~~i~~~l~~~l 312 (384) +++|++|++++.++.++ |+.+. ++++++||++|||||.+||++.+ |++..+|+++||.|++.+||++||++| T Consensus 223 l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~~~~s~~e~~~~~f~~~~l~P~~~~ie~~l~~kl 302 (376) T protein:vir:78 223 QLEGFNYEEFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLLHGDMADLSNNMKAYMEYCIDPLTKKLEDELNAKL 302 (376) T ss_pred cCCCceEEeeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 99999999999887654 77776 56889999999999999987655 888999999999999999999999999 Q ss_pred cCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCC--CeeeecCceeecC-CCC Q lcl|NC_019422. 313 FTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENG--DKPVRRLDTAVVE-GGE 384 (384) Q Consensus 313 ~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~g--d~~~~~~n~~~~~-~ge 384 (384) +++.+ .+++|+++.+++.|.+++++.++ ++++|++|+||+|+++|+||+|+| |++++|+|++|++ +|| T Consensus 303 l~~~~----~~~~~~~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~lg~~p~~~g~~d~~~~~~n~~~~~~~~e 374 (376) T protein:vir:78 303 FTFSE----FLAGEHIKIIHKKDIIENAEAVDKLVASGSFNRNEVRELLGAERVDNPELDKYLITKNYQSADEGGE 374 (376) T ss_pred CCccc----ceecccchhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeeccCceehhcccc Confidence 99754 36778888899999999999985 899999999999999999999875 9999999999997 444 No 66 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=100.00 E-value=1.3e-74 Score=425.67 Aligned_cols=368 Identities=10% Similarity=0.091 Sum_probs=283.3 Q ss_pred CcchhhhcccCCCcchhH--HHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVM--MELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKF 78 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~ 78 (384) ||||++++.......... .... .+........++++++++|++||++||+++|++||++++++ ++..+++.++ T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~l~~~~v~~~v~~Ia~~ia~~p~~~~~~~----~~~~~~~~~l 75 (395) T protein:vir:40 1 MGFKSWVSGFFNEEQRTLNLTDTV-WCSIPSEKLKELSIKKWAIDSCANKIANTLSCAEVLTYEKG----EEVRKKNWYM 75 (395) T ss_pred CchHHHHHhhhcccccccccccch-hhccccccchhhhhhhHHHHHHHHHHHHHHhhCceeeccCC----ccccchHHHH Confidence 999998766655433221 1111 12223334557789999999999999999999999998744 3456788999 Q ss_pred HHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEcC-ceEEEEehhhe Q lcl|NC_019422. 79 LLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLRN-GKIVSYPYSDI 157 (384) Q Consensus 79 l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~-g~~~~~~~~ev 157 (384) |+.+||+.||+++||+.++.+++++||||+++.++.. ++.++..+.........+..+...+ +..+.++++|| T Consensus 76 L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~~~~~------~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~ev 149 (395) T protein:vir:40 76 FNVEANQNQNATEFWKKAIYKLVYDNEALIFMQDEYI------YVADSFTKNDKSLYENTYTEVTLKDLTLKKEFKESEV 149 (395) T ss_pred HHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEecCce------eecCCccccccccccceeeeeeecCceeeeeeccccE Confidence 9999999999999999999999999999999987653 2322222211111111111111111 23456899999 Q ss_pred EEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCc Q lcl|NC_019422. 158 IHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGG 237 (384) Q Consensus 158 ih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~ 237 (384) ||+|+++.....++.+.+..+...+.... ....+.++.++.++++.+..+++++.+++++.|++.+.+...++++ T Consensus 150 ih~r~~~~~~~~~~~~l~~~~~~~~~~~~-----~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (395) T protein:vir:40 150 LHLTLNNESIKSIIDGFYLLYGDLLTAAV-----NKYKKLNSRKIIVKLKAMFGQTPEAEEKLRLMLSERMKKFLAEGDS 224 (395) T ss_pred EEeecCCCCccccchhHHHHHHHHHHHHH-----HHHHhcCCCCceEEEecccCCCHHHHHHHHHHHHHHHHHhhccCCc Confidence 99997654433344444444444333222 2233345555666666778899999999999999999887778889 Q ss_pred ceecCCCceeeecccchhHHHHHHHH-HH---HHHHHHHhCCCHHHhccccH--HHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019422. 238 AAATDSKYDAEQVKAESYVPNAAQMD-KA---IQRLYSFFNTNEKIIQSKYS--EDEWNAYYESEIEPVGLQLSNQYTEK 311 (384) Q Consensus 238 ~~v~~~g~~~~~l~~~~~~~~~~~~~-~~---~~~I~~~fgvp~~~l~~~~~--e~~~~~~~~~~i~P~~~~i~~~l~~~ 311 (384) ++++++|++|++++.++.++++.+++ +. +++||++|||||.+||++.+ +++..+|++.||.|++++||++||++ T Consensus 225 ~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~Ia~~fgVPp~~l~~~~sn~e~~~~~f~~~~L~P~~~~ie~~l~~k 304 (395) T protein:vir:40 225 ALPVEDGMEIDELAGDSKIAESRDIKKMIDDVFEMVANSFNIPLGLAKGDTVGLSEQVNSFLMFSINPIAEMFTDEGNRK 304 (395) T ss_pred eeecCCCceEEeccCChhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 99999999999999999999998864 33 47899999999999987654 88899999999999999999999999 Q ss_pred ccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCC--CCCeeeecCceeecC------- Q lcl|NC_019422. 312 LFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIE--NGDKPVRRLDTAVVE------- 381 (384) Q Consensus 312 l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~--~gd~~~~~~n~~~~~------- 381 (384) ||++.++..+.+++||++++++.|.+++++.++ ++++|++|+||+|+++|+||++ +||++++|+|++|++ T Consensus 305 Ll~~~~~~~g~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~~~~gD~~~~~~n~~~~~~~~~~~k 384 (395) T protein:vir:40 305 FYGRDSVLERTYMKLDTTRIKVQDIQEIASSMDVLFHIGVNTIDDNLRMIGREPVMSPETQERFVTKNYAPLGENEEDLK 384 (395) T ss_pred cCChhhhcCCceEEEechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCCceeeeccccccccccccccC Confidence 999988888899999999999999999999875 8999999999999999999995 599999999999885 Q ss_pred CCC Q lcl|NC_019422. 382 GGE 384 (384) Q Consensus 382 ~ge 384 (384) +|| T Consensus 385 gge 387 (395) T protein:vir:40 385 GGD 387 (395) T ss_pred CCC Confidence 334 No 67 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=100.00 E-value=1.7e-73 Score=419.54 Aligned_cols=378 Identities=10% Similarity=0.061 Sum_probs=290.3 Q ss_pred Ccchhh--------hccc-CCCc-chhHHHhhccccCccee------c------hhhhhhcHHHHHHHHHHHHhhccC-- Q lcl|NC_019422. 1 MNIFKS--------KKKN-KEAP-GKVMMELISDSGNGFYS------W------HGNLYKSDIVRSIIRPKAKAVGKM-- 56 (384) Q Consensus 1 M~~f~~--------~~~~-~~~~-~~~~~~~~~~~~~~~~~------~------~~~~~~~~~v~~~i~~ia~~ia~~-- 56 (384) --++.+ |... +... ........ ....++.. . -+.+-.+|+|++||+.||+.||++ T Consensus 31 ~~~~~~~~~~~~~~k~~~~~~~a~~~~~~~~~-~~~~~~~~r~~~~~~~~l~~~~~~~~~npiv~~~I~~ia~~IA~~~~ 109 (551) T protein:vir:80 31 YSIAIQQREQEQISKAMNNKEVAYSQPVIGSM-SANPGFKTKPSIRNNQDLHGVLKKFGGNIILNAIINTRSNQVSMYCK 109 (551) T ss_pred eeeecccccHHHHHHhhccCcceeecccccce-ecCcccccCccccChhHHHHHHHHhhcCHHHHHHHHHHHHHHhhhhh Confidence 112211 1111 1100 00011000 00111110 0 123446899999999999999974 Q ss_pred ---------ceEEEEecCCccee----ccchHHHHHHhhcccc-----CCHHHHHHHHHHHHHHhCCeeEEEeeCCCCce Q lcl|NC_019422. 57 ---------TAKHIRSNETEFKT----NPEIYIKFLLENPNPF-----MSGQILQEKMVTQLELNSNAFAVIIKDDYNMP 118 (384) Q Consensus 57 ---------~~~~~~~~~~~~~~----~~~~~~~~l~~~PN~~-----~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~ 118 (384) +|.+.-++.+.... ...+....++.+||+. +|+.+|++.++.+++++||+|++++|+..|.+ T Consensus 110 ~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p~~~s~~~f~~~lv~dlll~Gnay~~i~rd~~G~~ 189 (551) T protein:vir:80 110 PARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSFVKKIVRDTYMYDQVNFEKVFNRNQSM 189 (551) T ss_pred hhhhhcCCCCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCCCccchHHHHHHHHHHHHHhcCCEEEEEEECCCCcE Confidence 45542222221111 1122355677899987 48889999999999999999999999999999 Q ss_pred eeEEEEcCceEEEEEcCCCE------EEEEEEcCceEEEEehhheEEEeccCC---CCCccCccHHHHHHHHHHHHHHHH Q lcl|NC_019422. 119 TQIYPLNALNVEAIYENEVL------FLKFLLRNGKIVSYPYSDIIHLRKDFN---ENDLFGTSPAKVLEPIMEVVNTTD 189 (384) Q Consensus 119 ~~l~~l~~~~v~~~~~~~~~------~~~~~~~~g~~~~~~~~evih~~~~~~---~~~~~G~s~~~~~~~~i~~~~~~~ 189 (384) .+||||+|.+|++..+.++. +|++...++..+.++++||||+++++. .++.+|+||+.++..++....+++ T Consensus 190 ~~L~~l~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~eiiH~~~n~~~~~~~~~~G~spi~~a~~~i~~~~a~~ 269 (551) T protein:vir:80 190 VRFVAKDPTTIFFATTADGKIPDNGNRFVQVIDQKIVATFNAREMAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTE 269 (551) T ss_pred EEEEEeCCceeEEEECCccccccCceEEEEEeCCcEEEEEcccceEEecccCCCCcccccccccHHHHHHHHHHHHHHHH Confidence 99999999999999887764 344444555677899999999997653 346789999999999999999999 Q ss_pred HHHHHHHHccCCcceEEeeCC--CCChHHHHHHHHHHHHHhccccccCCcceec-CCCceeeecccchhHHHHHHH-HHH Q lcl|NC_019422. 190 QGVVKAIKNSNTIKWLLKFKT--ALRPDDIKKEVKSFEKNYLQIDSEAGGAAAT-DSKYDAEQVKAESYVPNAAQM-DKA 265 (384) Q Consensus 190 ~~~~~~~~ng~~p~~il~~~~--~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~-~~g~~~~~l~~~~~~~~~~~~-~~~ 265 (384) +++.++|+||++|+++|++++ .+++++.+++++.|.+.++| .+++|+++++ ++|++|++++.++.|++|.+. +++ T Consensus 270 ~~~~~~f~Ng~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G-~~nag~~~vl~~~g~~~~~l~~~~~D~qfle~~~~~ 348 (551) T protein:vir:80 270 AFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSG-INGSWQIPVVSAEDVKFVNMTPSARDMEFEKWLNYL 348 (551) T ss_pred HHHHHHHHcCCCcceEEEEcCCCCCCHHHHHHHHHHHHHHhcC-ccccCccccccCCCceEEEccCChhHHHHHHHHHHH Confidence 999999999999999999864 48899999999999999976 4567886555 689999999999999999876 678 Q ss_pred HHHHHHHhCCCHHHhcc----------------ccHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeech Q lcl|NC_019422. 266 IQRLYSFFNTNEKIIQS----------------KYSEDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEAS 329 (384) Q Consensus 266 ~~~I~~~fgvp~~~l~~----------------~~~e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~ 329 (384) +++||++|||||.+||. ++.+++..+|++.||.|++..||++||++|++.++. .++|+++ T Consensus 349 ~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~L~~~~~~----~~~f~f~ 424 (551) T protein:vir:80 349 INVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHIVAEFGD----KYTFQFV 424 (551) T ss_pred HHHHHHHhcCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccCC----ceEEEee Confidence 99999999999999962 456788889999999999999999999999986542 3556666 Q ss_pred hhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCC-CCCCCeeeecCceeecCCCC Q lcl|NC_019422. 330 NLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSP-IENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 330 ~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p-~~~gd~~~~~~n~~~~~~ge 384 (384) .+...+..+++++++++.+|+||+||+|+++|+|| +||||+++.|.|++++..+. T Consensus 425 ~~~~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~ 480 (551) T protein:vir:80 425 GGDIKSELESVKILAEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLM 480 (551) T ss_pred ccChhhHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCceeecccccccccccc Confidence 77777788888888888889999999999999998 79999999999988764222 No 68 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=100.00 E-value=7.6e-74 Score=421.45 Aligned_cols=361 Identities=19% Similarity=0.189 Sum_probs=300.1 Q ss_pred CcchhhhcccCCCcchh---HHHh-----hccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceecc Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKV---MMEL-----ISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNP 72 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~---~~~~-----~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 72 (384) ||||++.++.+...... +..+ .+.+..+-..+..+++++++|++||++||+++|++|+++++.. T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~~~~~~-------- 72 (386) T protein:vir:49 1 MPIFNITNLATESPPINQESFFDIADSDFLASLNSSEWVSAENALKNSDLFSIISQLSNDLATAKITTSRKQ-------- 72 (386) T ss_pred CchhhhhccCCCCcccchhhhhhhhhccccccccCCceechhhhhccHHHHHHHHHHHHHhhhCceeeccch-------- Confidence 99999877655443222 2222 2223334445667789999999999999999999999998644 Q ss_pred chHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEE--EEEEEc---Cc Q lcl|NC_019422. 73 EIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLF--LKFLLR---NG 147 (384) Q Consensus 73 ~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~--~~~~~~---~g 147 (384) ...|+.+||++||+++||+.++.+++++||||++++++..|++.+||+++|.+|++..+.++.. |.|... .| T Consensus 73 ---~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i~~~~v~v~~~~~~~~~~y~~~~~~~~~~ 149 (386) T protein:vir:49 73 ---LQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNITFDDPHIA 149 (386) T ss_pred ---hhhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEecCceeEEEEcCCCceEEEEEEEcCcccc Confidence 3458899999999999999999999999999999999999999999999999999998876543 444332 36 Q ss_pred eEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHH Q lcl|NC_019422. 148 KIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKN 227 (384) Q Consensus 148 ~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~ 227 (384) ..+.++++||||++++++.++++|+||+.++..++....++.+++.++|+||+.|+++|++++.+++++.++.++.|... T Consensus 150 ~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~ 229 (386) T protein:vir:49 150 PKQHVPQNDILHFRLLSVDGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGLLDFKTKVSRSRQAM 229 (386) T ss_pred ceeEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCChHHHHHHHHHHHHh Confidence 77889999999999988888899999999999999999999999999999999999999999999999988888877642 Q ss_pred hccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcccc----HHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 228 YLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQSKY----SEDEWNAYYESEIEPVGL 302 (384) Q Consensus 228 ~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~~~----~e~~~~~~~~~~i~P~~~ 302 (384) ..++|+++|+++|++|++++.++.++++.+. ++++++||++|||||.+||++. +.++..+++..++.|+++ T Consensus 230 ----~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~~~i~~~l~ 305 (386) T protein:vir:49 230 ----KQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDGDQQSSLEMIYNIYFKSVSRYLR 305 (386) T ss_pred ----ccCCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccchHHHHHHHHHHHHHHHHH Confidence 3577899999999999999999999999876 6789999999999999998532 234567899999999999 Q ss_pred HHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeee-cCceeec Q lcl|NC_019422. 303 QLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVR-RLDTAVV 380 (384) Q Consensus 303 ~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~-~~n~~~~ 380 (384) .++++|+++|+. +++||.+.+++.|..++...++ ++.+|++|+||+|++++..++...+.+.. ..+..++ T Consensus 306 ~i~~~~~~~l~~--------~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~~~~~~~~~~~~~~~~~~~ 377 (386) T protein:vir:49 306 PFVSEMSKKLSC--------EVDVDISPAVDPTGSNYISLINSMVKSGTLAQNQGLYILQQAEILPKELPDGKNPNRTSL 377 (386) T ss_pred HHHHHHHHHhcc--------hhcccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHhhCCCCCCcCcchhccCCCCC Confidence 999999999864 4678899999999888887764 89999999999999997766532222221 2234577 Q ss_pred CCCC Q lcl|NC_019422. 381 EGGE 384 (384) Q Consensus 381 ~~ge 384 (384) ++|| T Consensus 378 ~gGd 381 (386) T protein:vir:49 378 KGGE 381 (386) T ss_pred CCCC Confidence 8999 No 69 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=100.00 E-value=8.4e-74 Score=421.20 Aligned_cols=360 Identities=16% Similarity=0.165 Sum_probs=293.6 Q ss_pred CcchhhhcccCCCcchh-------------HHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCc Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKV-------------MMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETE 67 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 67 (384) ||||+++++....+... ++....+ ..+..++..+++++++|++||+.||++||++|++++++.. T Consensus 3 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~g~~v~~~~al~~~~v~~~v~~ia~~ia~lp~~~~~~~~-- 79 (392) T protein:vir:74 3 LPILNFINQTNDPPEAGSVQSYFPDGNDAQIMESLLG-DNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN-- 79 (392) T ss_pred chhhhhhhcccCcccccccccccccCchhhhhhhccC-CCCcccchhhhhcchHHHHHHHHHHHhhccCceeeccchh-- Confidence 99998766554332111 1111111 2233456678899999999999999999999999986542 Q ss_pred ceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCE--EEEEEEc Q lcl|NC_019422. 68 FKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVL--FLKFLLR 145 (384) Q Consensus 68 ~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~--~~~~~~~ 145 (384) ..|+.+||++||+++||+.++.+++++||+|++++|+..|.+.+|+|++|.+|++..+.++. .|.+... T Consensus 80 ---------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~~~y~~~~~ 150 (392) T protein:vir:74 80 ---------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGMYYNITFD 150 (392) T ss_pred ---------hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEec Confidence 34888999999999999999999999999999999999999999999999999999876553 3444444 Q ss_pred C---ceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHH Q lcl|NC_019422. 146 N---GKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVK 222 (384) Q Consensus 146 ~---g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~ 222 (384) + +..+.++++||||++.+++.+.++|+||+.++..++....++++++.++|+||++|+++|++++....++ +.++ T Consensus 151 ~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~--~~~~ 228 (392) T protein:vir:74 151 DPKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSD--KDKA 228 (392) T ss_pred CCccceeEEEcCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchH--HHHH Confidence 3 3467899999999998877777999999999999999999999999999999999999999987655442 4456 Q ss_pred HHHHHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc----ccHHHHHHHHHHHHH Q lcl|NC_019422. 223 SFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS----KYSEDEWNAYYESEI 297 (384) Q Consensus 223 ~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~----~~~e~~~~~~~~~~i 297 (384) +|.+.+.+. .++++++|+++|++|++++.++.++++.+. ++++++||++|||||.+||. ++++++..+|+++|| T Consensus 229 ~~~~~~~~~-~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~e~~~~~~~~~l 307 (392) T protein:vir:74 229 SRSRSFMKR-SRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQISGMYASAL 307 (392) T ss_pred HHHHHHhcc-ccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHH Confidence 777778764 577899999999999999999999998776 57899999999999999975 334567889999999 Q ss_pred HHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHH-HHHhCCCCCHHHHHHHhCCCCCCCCCeeeecCc Q lcl|NC_019422. 298 EPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLV-QMVDRGSLTPNEWRKIMNLSPIENGDKPVRRLD 376 (384) Q Consensus 298 ~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~-~~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n 376 (384) .|+++.|+++|+++|++. ++||...+.+.|.+++++.+ +++++|++|+||+|+++....+. .++.....| T Consensus 308 ~p~~~~ie~~l~~~l~~~--------~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~-pne~r~~en 378 (392) T protein:vir:74 308 NRYLRPAISELEYKLSDH--------ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYI-PKDLPAPEN 378 (392) T ss_pred HHHHHHHHHHHHHhccch--------hcccchhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCC-ccccchhcC Confidence 999999999999999763 57888888888988888776 48999999999999987333222 244455567 Q ss_pred eeecCCCC Q lcl|NC_019422. 377 TAVVEGGE 384 (384) Q Consensus 377 ~~~~~~ge 384 (384) +-|+++|| T Consensus 379 l~~~~~Gd 386 (392) T protein:vir:74 379 TNKKTTGQ 386 (392) T ss_pred CCCCCCCC Confidence 77777777 No 70 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=100.00 E-value=2.1e-74 Score=424.49 Aligned_cols=343 Identities=18% Similarity=0.207 Sum_probs=276.0 Q ss_pred CcchhhhcccCCCcchhHHHhhccccCcc-eechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcc------eeccc Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDSGNGF-YSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEF------KTNPE 73 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~------~~~~~ 73 (384) ||||++..+....... ++....+ ......++++++|++||+.||+.||++|+++|++++++. ....+ T Consensus 1 M~if~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~v~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~~ 74 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLN------NDTQRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGS 74 (378) T ss_pred CchhHHhHhhhhcccc------cCcceeeeeecchhhhhhHHHHHHHHHHHHhHhhCceeeeeecccccccccccccccc Confidence 9999987654332111 1111222 223445688899999999999999999999998865442 34568 Q ss_pred hHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEee-CCCCceeeEEEEcCceEEEEEcCCCEEEEEEEcCceEEEE Q lcl|NC_019422. 74 IYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIK-DDYNMPTQIYPLNALNVEAIYENEVLFLKFLLRNGKIVSY 152 (384) Q Consensus 74 ~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~-~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~ 152 (384) +++++|+.+||++||+++||+.++.+++++||+|++++. +..|.+..+++. .++ +.+ T Consensus 75 ~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~~~--------------------~~~--~~~ 132 (378) T protein:vir:94 75 DLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSETGELLDLLFA--------------------NDK--KEY 132 (378) T ss_pred hHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCCCCcEEEEEEe--------------------cCc--EEe Confidence 889999999999999999999999999999999998654 455655443332 222 457 Q ss_pred ehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhcc-- Q lcl|NC_019422. 153 PYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQ-- 230 (384) Q Consensus 153 ~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~-- 230 (384) +++||+|++.+.+.+. +.+++..+...+. ..+++ +.++++|+.++.+++++.++.+++|++.+.+ T Consensus 133 ~~~dvih~~~~~~~~~--~~~~~~~~~~~~~----------~~~~~-~~~~g~l~~~~~l~~~~~~~~~e~~~~~~~~~~ 199 (378) T protein:vir:94 133 KPEELVRLTSPFYINE--DTSILDNALASIQ----------TKLEQ-GKLRGLLKINAFLDIDNTQEYREKALATIKNMQ 199 (378) T ss_pred chhceeeecCcCCccc--chhHHHHHHHHHH----------HHHhh-CCcccceeeCCcCCHHHHHHHHHHHHHHHHHhh Confidence 8999999996655443 4466666655432 22333 4688999999999998888888888876653 Q ss_pred ccccCCcceecCCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhccccHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019422. 231 IDSEAGGAAATDSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQSKYSEDEWNAYYESEIEPVGLQLSNQYTE 310 (384) Q Consensus 231 ~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~~~~e~~~~~~~~~~i~P~~~~i~~~l~~ 310 (384) ...++++++++++|++|++++.++.++++.+++++.++||++|||||.+|+++.+|++..+|+++||.|++..|+++||+ T Consensus 200 ~~~n~~~~~vl~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgvPp~~l~g~~~e~~~~~f~~~tl~P~~~~ie~~l~~ 279 (378) T protein:vir:94 200 EGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENILLGTATQEQQIYFYNSTIIPLLIQLEKELTY 279 (378) T ss_pred cccccccceeccCCceEEEccCChHHhhHHHHHHHHHHHHHHhCCCHHHhcCCchHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 22456789999999999999999999998888899999999999999999999999999999999999999999999999 Q ss_pred cccCcccccCc------ceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeecCCC Q lcl|NC_019422. 311 KLFTRKARSFG------NEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVEGG 383 (384) Q Consensus 311 ~l~~~~~~~~~------~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~~g 383 (384) +|+++.++..+ ..++|+++.+++.|.+++++.++ ++++|++|+||+|+++|+||+||||++++|+|++|++.. T Consensus 280 ~Ll~~~e~~~g~~~~~~~~~~f~~~~l~~~d~~~~~e~~~~~~~~G~~t~NE~R~~~g~~p~~ggd~~~~~~n~~~~~~~ 359 (378) T protein:vir:94 280 KLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDVYIANLNAVAVKNL 359 (378) T ss_pred hcCChhHhhhhhhhcccceeEeecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeecccccchhcc Confidence 99998665433 35789999999999999999986 899999999999999999999999999999999997433 Q ss_pred C Q lcl|NC_019422. 384 E 384 (384) Q Consensus 384 e 384 (384) + T Consensus 360 ~ 360 (378) T protein:vir:94 360 S 360 (378) T ss_pred h Confidence 3 No 71 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=100.00 E-value=3.7e-73 Score=417.70 Aligned_cols=379 Identities=10% Similarity=0.071 Sum_probs=292.9 Q ss_pred CcchhhhcccCCCcch---------------------hHHHh------------h--ccccCcceec------------h Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGK---------------------VMMEL------------I--SDSGNGFYSW------------H 33 (384) Q Consensus 1 M~~f~~~~~~~~~~~~---------------------~~~~~------------~--~~~~~~~~~~------------~ 33 (384) ||||.+........+. ..... + .....++... - T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~~l~ 80 (547) T protein:vir:63 1 MGLFESIRLAGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLHGVL 80 (547) T ss_pred CchhhhhhhhcCCccccccccccccccchhhhhhhHHHHHHhhcccchhhhchhhheeecccccccCCccCChhHHHHHH Confidence 9999964321110000 00000 0 0011122111 0 Q ss_pred hhhhhcHHHHHHHHHHHHhhccC-----------ceEE--EEecCCcce--eccchHHHHHHhhccccC-----CHHHHH Q lcl|NC_019422. 34 GNLYKSDIVRSIIRPKAKAVGKM-----------TAKH--IRSNETEFK--TNPEIYIKFLLENPNPFM-----SGQILQ 93 (384) Q Consensus 34 ~~~~~~~~v~~~i~~ia~~ia~~-----------~~~~--~~~~~~~~~--~~~~~~~~~l~~~PN~~~-----s~~~f~ 93 (384) +.+..+|+|++||+.||+.||++ .|++ ..++....+ ....+....++.+||+.+ |+.+|| T Consensus 81 ~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~~s~~~f~ 160 (547) T protein:vir:63 81 KKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSFV 160 (547) T ss_pred HHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCccchHHHHH Confidence 13446899999999999999964 2333 111111111 112234556777899874 888999 Q ss_pred HHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCE------EEEEEEcCceEEEEehhheEEEeccCCC- Q lcl|NC_019422. 94 EKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVL------FLKFLLRNGKIVSYPYSDIIHLRKDFNE- 166 (384) Q Consensus 94 ~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~------~~~~~~~~g~~~~~~~~evih~~~~~~~- 166 (384) +.++.+++++||+|++++|+..|.+.+||+|+|.+|++..+.++. .|++...++..+.++++||||+++++.. T Consensus 161 ~~lv~d~ll~Gn~~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~eiih~r~n~~~~ 240 (547) T protein:vir:63 161 KKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVIDQKIVATFNAREMAFAVRNPRSD 240 (547) T ss_pred HHHHHHHHhhCCEEEEEEECCCCcEEEEEEecCceeEEEECCccccccCceEEEEEcCCcEEEEeccccEEEecccCCCC Confidence 999999999999999999999999999999999999999877653 3444445556778999999999976543 Q ss_pred --CCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCC--CCChHHHHHHHHHHHHHhccccccCCccee-c Q lcl|NC_019422. 167 --NDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKT--ALRPDDIKKEVKSFEKNYLQIDSEAGGAAA-T 241 (384) Q Consensus 167 --~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~--~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v-~ 241 (384) .+.+|+||+.++..++....++++++.++|+||++|+++|.+++ .+++++.+++++.|++.++|. .++|++++ + T Consensus 241 ~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lk~~~~~~~~G~-~nagk~~vl~ 319 (547) T protein:vir:63 241 IYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGI-NGSWQIPVVS 319 (547) T ss_pred cccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCCCCHHHHHHHHHHHHHHhcCc-cccccccccc Confidence 36789999999999999999999999999999999999999865 489999999999999999874 56787755 4 Q ss_pred CCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhc----------------cccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 242 DSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQ----------------SKYSEDEWNAYYESEIEPVGLQL 304 (384) Q Consensus 242 ~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~----------------~~~~e~~~~~~~~~~i~P~~~~i 304 (384) ++|++|+++++++.|++|.+. ++++++||++|||||.+|| .++.+++..+|++.||.|++..| T Consensus 320 ~~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~~~tL~P~~~~i 399 (547) T protein:vir:63 320 AEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFI 399 (547) T ss_pred CCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHH Confidence 789999999999999999876 6789999999999999996 24568888999999999999999 Q ss_pred HHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCC-CCCCCeeeecCceeecCCC Q lcl|NC_019422. 305 SNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSP-IENGDKPVRRLDTAVVEGG 383 (384) Q Consensus 305 ~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p-~~~gd~~~~~~n~~~~~~g 383 (384) +++||++|++.++ . .++|+++.+...+..+++++++++..|+||+||+|+++|+|| +||||+++.|.|+.++... T Consensus 400 e~~ln~~L~~~~~--~--~~~~~f~~~~~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~ 475 (547) T protein:vir:63 400 EDFINKHIVAEFG--D--KYTFQFVGGDIKSELESVKILAEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQL 475 (547) T ss_pred HHHHHhhcccccC--C--ceEEEeeccccccHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCceeeccccccccccc Confidence 9999999997654 2 345555666677787888877888889999999999999998 6999999999998876422 Q ss_pred C Q lcl|NC_019422. 384 E 384 (384) Q Consensus 384 e 384 (384) . T Consensus 476 ~ 476 (547) T protein:vir:63 476 M 476 (547) T ss_pred c Confidence 2 No 72 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=100.00 E-value=4.7e-74 Score=422.58 Aligned_cols=343 Identities=17% Similarity=0.193 Sum_probs=273.7 Q ss_pred CcchhhhcccCCCcchhHHHhhccccCcc-eechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcc------eeccc Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDSGNGF-YSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEF------KTNPE 73 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~------~~~~~ 73 (384) ||||++..+...... .++..+.+ ......++++++|++||+.||++||++||++|++++++. ....+ T Consensus 1 M~~f~k~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~~ 74 (378) T protein:vir:85 1 MNLFGKVVSFSRGKL------NNDTQRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGS 74 (378) T ss_pred Cchhhhhhhhhhccc------ccCCcceeeeeccchhhhhHHHHHHHHHHHHhHhhCceeEEEEeccccccccccccccc Confidence 999997654332211 11111222 223445689999999999999999999999998876543 34568 Q ss_pred hHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEee-CCCCceeeEEEEcCceEEEEEcCCCEEEEEEEcCceEEEE Q lcl|NC_019422. 74 IYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIK-DDYNMPTQIYPLNALNVEAIYENEVLFLKFLLRNGKIVSY 152 (384) Q Consensus 74 ~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~-~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~ 152 (384) +++++|+.+||++||+++||+.++.+++++||||++++. +..|.+..+++ ..++ +.+ T Consensus 75 ~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~~--------------------~~~~--~~~ 132 (378) T protein:vir:85 75 DLDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDSETGELLDLLF--------------------ANDK--KEY 132 (378) T ss_pred hHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecCCCceEEEEEe--------------------cCCC--EEE Confidence 889999999999999999999999999999999998754 44454433222 1222 346 Q ss_pred ehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhcc-- Q lcl|NC_019422. 153 PYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQ-- 230 (384) Q Consensus 153 ~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~-- 230 (384) .++||||++.+.+.++ +.+.+..+...+ ...++ ++.|++++++++.+++++.++.+++|++.+++ T Consensus 133 ~~~dvih~~~~~~~~~--~~~~~~~a~~~~----------~~~~~-~~~~~g~l~~~~~l~~~~~~~~~~~~~~~~~~~~ 199 (378) T protein:vir:85 133 KPEELVRLVSPFYINE--DTSILDNALASI----------QTKLE-QGKLRGLLKINAFLDIDNTQEYREKALATIKNMQ 199 (378) T ss_pred cccceEEEecCcCccc--hhhHHHHHHHHH----------HHHHh-cCCcceEEEeCCcCCHHHHHHHHHHHHHHHHHhh Confidence 7899999986654444 334444444332 22334 45789999999999999999999888876653 Q ss_pred ccccCCcceecCCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhccccHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019422. 231 IDSEAGGAAATDSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQSKYSEDEWNAYYESEIEPVGLQLSNQYTE 310 (384) Q Consensus 231 ~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~~~~e~~~~~~~~~~i~P~~~~i~~~l~~ 310 (384) ...++++++++++|++|++++.++.++++..++++.++||++|||||.+|+++++|++..+|++.||.|++.+||++||+ T Consensus 200 ~~~~~g~~~vl~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l~~s~~e~~~~~f~~~tL~P~~~~ie~~l~~ 279 (378) T protein:vir:85 200 EGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIELIKSELLTGYFMNENILLGTATQEQQIYFYNSTIIPLLIQLEKELTY 279 (378) T ss_pred cccccccceecCCCceEEeccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCchHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 23456789999999999999999999998777889999999999999999999999999999999999999999999999 Q ss_pred cccCcccccCcc------eEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeecCCC Q lcl|NC_019422. 311 KLFTRKARSFGN------EIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVEGG 383 (384) Q Consensus 311 ~l~~~~~~~~~~------~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~~g 383 (384) +|+++.++..+. +++||.+.+++.|.+++++.++ ++++|++|+||+|+++|+||+||||++++|+|++|++.- T Consensus 280 kLl~~~er~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~gGD~~~~~~N~~~~~~~ 359 (378) T protein:vir:85 280 KLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDIYIANLNAVAVKNL 359 (378) T ss_pred hcCChhhhhhhhhccccceeeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeecccccccccc Confidence 999987665432 4789999999999999999875 899999999999999999999999999999999998633 Q ss_pred C Q lcl|NC_019422. 384 E 384 (384) Q Consensus 384 e 384 (384) . T Consensus 360 ~ 360 (378) T protein:vir:85 360 S 360 (378) T ss_pred h Confidence 3 No 73 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=100.00 E-value=7.2e-74 Score=421.58 Aligned_cols=356 Identities=15% Similarity=0.148 Sum_probs=293.5 Q ss_pred CcchhhhcccCCCcchh---HH-----HhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceecc Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKV---MM-----ELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNP 72 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~---~~-----~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 72 (384) ||||+++...+..+... +. .+++.+..+.+.+..+++++++|++||++||+.+|++||++++.. T Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~V~~~i~~Ia~~ia~l~~~~~~~~-------- 72 (384) T protein:vir:49 1 MPIFNITNLATESPPSNQDSFFDITDPEFLDALNGSEWVSAETALKNSDLFSIISQLSNDLATAKITTSRKQ-------- 72 (384) T ss_pred CccccccccCcccccccchhhccccchhhcccccCCceechhhhhccHHHHHHHHHHHHHHhhCceeeecch-------- Confidence 99999765444433221 11 123334445556677889999999999999999999999998644 Q ss_pred chHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCE--EEEEEEcC---c Q lcl|NC_019422. 73 EIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVL--FLKFLLRN---G 147 (384) Q Consensus 73 ~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~--~~~~~~~~---g 147 (384) ...|+.+||++||+++||+.++.+++++||+|++++++..|.+.+|+|++|.+|++..+.++. .|.+...+ | T Consensus 73 ---~~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~~~~~ 149 (384) T protein:vir:49 73 ---LQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNQNGLYYNITFDDPRIP 149 (384) T ss_pred ---hhhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEecCcccc Confidence 235889999999999999999999999999999999999999999999999999998876543 34444433 5 Q ss_pred eEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHH Q lcl|NC_019422. 148 KIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKN 227 (384) Q Consensus 148 ~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~ 227 (384) ..+.++++||||++++++.+.++|+||+.++..++....++++++.++|+||++|+++|++++..++++.++. +.+. T Consensus 150 ~~~~~~~~eVih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~---~~~~ 226 (384) T protein:vir:49 150 PKQHVPQGDILHFRLLSVDGGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLLDFKTKQ---SRSR 226 (384) T ss_pred ceeEecCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHH---HHHH Confidence 6788999999999998888889999999999999999999999999999999999999999999887765443 3344 Q ss_pred hccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhccc--------cHHHHHHHHHHHHHH Q lcl|NC_019422. 228 YLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQSK--------YSEDEWNAYYESEIE 298 (384) Q Consensus 228 ~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~~--------~~e~~~~~~~~~~i~ 298 (384) +.+ ..++++++++++|++|++++.++.++|+.+. ++++++||++|||||.+||++ +.++...+++..++. T Consensus 227 ~~~-~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~~~~i~~~l~ 305 (384) T protein:vir:49 227 QAM-KQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDKQSSLEMIYNIYFKAVSRFLR 305 (384) T ss_pred Hhc-ccCCccceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccccHHHHHHHHHHHHHHHHH Confidence 444 4678899999999999999999999998776 578999999999999999853 234566778888999 Q ss_pred HHHHHHHHHHhhcccC---cccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCCCC--eee Q lcl|NC_019422. 299 PVGLQLSNQYTEKLFT---RKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIENGD--KPV 372 (384) Q Consensus 299 P~~~~i~~~l~~~l~~---~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd--~~~ 372 (384) |++..++++|++++.. ......+.+++|+++.+++.+..++.++++ +...|+++ ||+|+.+|++|+|||| +.| T Consensus 306 pi~~~i~~~l~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~-ne~r~~~~~~p~~gGd~~~~~ 384 (384) T protein:vir:49 306 PFVSELSKKLSCEVDADILPAVDPTGSNYIGLINSMVKTGTLAQNQGLYVLQQAEILP-KDLPEGETDSTLKGGETNEQY 384 (384) T ss_pred HHHHHHHHHhchhhhhhhhhhhhccchHHHHHHHHHhhcCcccHHHHHHHHhhCCCCC-hhHHHHcCCCCCCCCCCCCCC Confidence 9999999999988743 223334567899999999999999999986 56778876 9999999999999863 444 No 74 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=100.00 E-value=2.6e-73 Score=418.50 Aligned_cols=357 Identities=15% Similarity=0.185 Sum_probs=290.8 Q ss_pred CcchhhhcccCCCcchhHHH-----hhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMME-----LISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIY 75 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~ 75 (384) ||||+++.+.+......... +...+..+...+..+++++++|++||+.||+.+|++||++++.. T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~----------- 69 (382) T protein:vir:48 1 MPIFNLATESPPDNQGGFFDVVDSDFLASLKGNEWVSAETALRNSDLFSIINQLSNDLATVKLITSRKK----------- 69 (382) T ss_pred CccccccccCCcccccccccchhhhccccccCCcccchHhhhccHHHHHHHHHHHHhhccCceeeecch----------- Confidence 99999887665544333222 22333344455667789999999999999999999999998654 Q ss_pred HHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEE--EEEEEcC---ceEE Q lcl|NC_019422. 76 IKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLF--LKFLLRN---GKIV 150 (384) Q Consensus 76 ~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~--~~~~~~~---g~~~ 150 (384) ...|+.+||++||+++||+.++.+++++||||++++++..|.+++|||++|.+|++..+.++.. |.+...+ |..+ T Consensus 70 ~~~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~ 149 (382) T protein:vir:48 70 LQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDGIYYNITFDDPRIPPKQ 149 (382) T ss_pred hhhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCCeEEEEEEecCcccccee Confidence 2458999999999999999999999999999999999999999999999999999998776543 4444433 4677 Q ss_pred EEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhcc Q lcl|NC_019422. 151 SYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQ 230 (384) Q Consensus 151 ~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~ 230 (384) .++++||||++++++.+.++|+||+.++..++....++.+++.++|+||+.|+++|++++.+++++.+++++.|.+. T Consensus 150 ~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~--- 226 (382) T protein:vir:48 150 HVPQNDVLHFRLLSVDGGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKGGGLLDFKTKLSRSRQAM--- 226 (382) T ss_pred EEcCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHHhh--- Confidence 89999999999998888899999999999999999999999999999999999999999999999988888877653 Q ss_pred ccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc----ccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 231 IDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS----KYSEDEWNAYYESEIEPVGLQLS 305 (384) Q Consensus 231 ~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~----~~~e~~~~~~~~~~i~P~~~~i~ 305 (384) ..++|+++|+++|++|++++.++.++++.+. ++++++||++|||||.+||. ++++++..+|++.||.|+++.|+ T Consensus 227 -~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~~~~~~~~~~~l~p~~~~i~ 305 (382) T protein:vir:48 227 -KQMQGGPLVLDDLEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGDQQSSLEMSSDLYSKAVSRYLRPFL 305 (382) T ss_pred -ccCCCCeeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHH Confidence 3467899999999999999999999999876 57889999999999999974 34567788999999999999999 Q ss_pred HHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhC---CCC--CCCCCeeeecCceeec Q lcl|NC_019422. 306 NQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMN---LSP--IENGDKPVRRLDTAVV 380 (384) Q Consensus 306 ~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG---~~p--~~~gd~~~~~~n~~~~ 380 (384) ++|+++|+++.+.+.... ++.+. ..-.....+++.+|++|+||+|+.++ +.| +++++.+. .++ T Consensus 306 ~~l~~~l~~~~~~~~~~~--~~~~~-----~~~~~~~~~l~~~g~~t~~e~r~~l~~~g~~~~~~~~~~~~~-----~~~ 373 (382) T protein:vir:48 306 SELSQKLSCDVDADIFPA--VDPTG-----SNYISRINSLVKTGTLAQNQGLYILQQAEILPKELPNGENPN-----STL 373 (382) T ss_pred HHHHHHhcChhhhhhhhh--hccch-----hHHHHHHHHHhhcCccCHHHHHHHHhhCCCCCcchhhhhcCC-----CCC Confidence 999999998765433222 22111 11122334578899999999999874 332 34555443 347 Q ss_pred CCCC Q lcl|NC_019422. 381 EGGE 384 (384) Q Consensus 381 ~~ge 384 (384) +||| T Consensus 374 ~GGd 377 (382) T protein:vir:48 374 KGGE 377 (382) T ss_pred CCCC Confidence 8888 No 75 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=100.00 E-value=1.3e-72 Score=414.74 Aligned_cols=377 Identities=11% Similarity=0.048 Sum_probs=292.0 Q ss_pred Cc----chhhhcccCCCcchhHHHhhccccCcc--eechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcce---ec Q lcl|NC_019422. 1 MN----IFKSKKKNKEAPGKVMMELISDSGNGF--YSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFK---TN 71 (384) Q Consensus 1 M~----~f~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~---~~ 71 (384) |. .|..+.+.++. .....+.......- .....++....++++|+..+++.++++|+++++.+.++.. .. T Consensus 53 ~~~~~~g~~~~~~~~~~--~~~~~l~~~~~~~~~~~~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~~ 130 (535) T protein:vir:10 53 ADGNVAGQYSVASISDV--LSTKKLLKAYADNDIVQAIIRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIKR 130 (535) T ss_pred ccCCcccccccCccccc--cCHHHHHHHhccChhHHHHHHHHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhhh Confidence 22 22222222222 22233332221111 1122445567788999999999999999999877654432 23 Q ss_pred cchHHHHHHhhccccCCHHH----HHHHHHHHHHHh-CCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC----EEEEE Q lcl|NC_019422. 72 PEIYIKFLLENPNPFMSGQI----LQEKMVTQLELN-SNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV----LFLKF 142 (384) Q Consensus 72 ~~~~~~~l~~~PN~~~s~~~----f~~~~~~~~l~~-G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~----~~~~~ 142 (384) .+.+..+|+.+||+.|++++ |++.++.+++++ |++|++++++..|++.+||||+|.+|++..+..+ ..+++ T Consensus 131 ~~~l~~lL~~~PN~~~~~~~~~~~~~~~lv~d~l~~~g~ay~~i~r~~~G~~~~L~~l~p~~V~v~~d~~~~~~~~~~~~ 210 (535) T protein:vir:10 131 AHEIEDFIYNTGSEYYEWRDTFPRLLTKIINDMYVQDQINIERIFKNDSNELDHFNAVDASKVVISYSPRSKDQPRKFEQ 210 (535) T ss_pred hhHHHHHHHhCCCCCCChhHHHHHHHHHHHHHHHhhCCceEEEEEECCCCcEEEEEEeCCceeEEEEcCccccCceEEEE Confidence 35566778889999998876 555566665554 5789999999999999999999999999887543 34444 Q ss_pred EEcCceEEEEehhheEEEeccCCC---CCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCC----CCChH Q lcl|NC_019422. 143 LLRNGKIVSYPYSDIIHLRKDFNE---NDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKT----ALRPD 215 (384) Q Consensus 143 ~~~~g~~~~~~~~evih~~~~~~~---~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~----~~~~e 215 (384) ...++....++++||||+++++.. ++.+|+||+.++..+|....++++++.++|+||++|+++|++++ .++++ T Consensus 211 ~~~~~~~~~~~~~eiih~~~~~~~~~~~~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~~~~ls~e 290 (535) T protein:vir:10 211 FVSETKSVKFSERNLTFINYWNLSDTDRRGYGYSPVEASIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQDGDAQANQM 290 (535) T ss_pred EecCceeEEECcccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHH Confidence 455667788999999999986543 46789999999999999999999999999999999999999975 47889 Q ss_pred HHHHHHHHHHHHhccccccCCcceec-CCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc----------- Q lcl|NC_019422. 216 DIKKEVKSFEKNYLQIDSEAGGAAAT-DSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS----------- 282 (384) Q Consensus 216 ~~~~~~~~~~~~~~~~~~~~~~~~v~-~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~----------- 282 (384) +++++++.|++.++|. +++++++++ ++|++|++++.++.|++|.+. ++++++||++|||||.+||. T Consensus 291 ~~e~lk~~~~~~~~G~-~nag~~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~eIa~afgVPp~~lG~~~~at~sn~~~ 369 (535) T protein:vir:10 291 MLAGIRRQWTSQGSGL-GGAWKIPILAAKDAKFVNMTQNSRDMEFDKFLNFMIYDTAAIFQMQPEEINFPNNGGSTGKSG 369 (535) T ss_pred HHHHHHHHHHHHhcCc-ccccccccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccccCcccccchh Confidence 9999999999999874 567776555 579999999999999999876 57899999999999999973 Q ss_pred -------ccHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHH Q lcl|NC_019422. 283 -------KYSEDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNE 355 (384) Q Consensus 283 -------~~~e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE 355 (384) ++.|++...|++.||.|++..||++||++|++..+. +++|+++.+++.|.+++.+++++...|+||+|| T Consensus 370 ~~~~~~~s~~E~~~~~~~~~~L~P~l~~ie~~ln~~Ll~~~~~----~~~f~f~~l~~~d~~~r~~~~~~~~~g~lT~NE 445 (535) T protein:vir:10 370 TKSVNEGSTAKAKLESSKDKGLTPLLSFIEQVINDKIMRYVDT----DYRFSFTLGDAQDKLQEEQVWKLKLANGYFINE 445 (535) T ss_pred hhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccCC----eEEEEeccccccCHHHHHHHHHHHHcCCCCHHH Confidence 234677788999999999999999999999986543 467788899999999999988877778899999 Q ss_pred HHHHhCCCCCCCCCeeeecC---ceeecC-CCC Q lcl|NC_019422. 356 WRKIMNLSPIENGDKPVRRL---DTAVVE-GGE 384 (384) Q Consensus 356 ~R~~lG~~p~~~gd~~~~~~---n~~~~~-~ge 384 (384) +|+++|+||+||||++++.. |+.... .++ T Consensus 446 ~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~ 478 (535) T protein:vir:10 446 YRKDHGLKTVDGLDVPGFIGSAENFINATGFGQ 478 (535) T ss_pred HHHHhCCCCCCCccccccccchhhccccccccc Confidence 99999999999999977643 222111 111 No 76 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=100.00 E-value=4.2e-73 Score=417.38 Aligned_cols=370 Identities=12% Similarity=0.127 Sum_probs=281.5 Q ss_pred CcchhhhcccCCCcchhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLL 80 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~ 80 (384) ||||++....+........ ............++++++|++||+.||+.||++|+++++.+++ .+..+++.++|+ T Consensus 1 MGlf~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~v~~~I~~ia~~iA~lp~~~~~~~~~--~~~~~~~~~lL~ 74 (395) T protein:vir:98 1 MGILDFFSFKKSGTLSDDD----SGSTTSEKLTNVVLKEDALYKCVNYLARIISKSTFRLKTPEKL--TENQKDWLYWIN 74 (395) T ss_pred CcchhhhcCCCcccccccc----cchhhhhhcchhhhhhHHHHHHHHHHHHHHhhCceeEEecCCc--ccccchHHHHHh Confidence 9999987544333221111 1111112344667899999999999999999999999976543 345678899999 Q ss_pred hhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEcC-ceEEEEehhheEE Q lcl|NC_019422. 81 ENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLRN-GKIVSYPYSDIIH 159 (384) Q Consensus 81 ~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~-g~~~~~~~~evih 159 (384) .+|||+||+++||+.++.+++++||||+++.++..+.+.. ..+.........++.....+ +..+.++++|||| T Consensus 75 ~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~evih 148 (395) T protein:vir:98 75 TKANPNQSASQFWVEVIQKLLVDGETLIFVIPGKGIYVAD------SFTQDKKISGSQFKVSRVQGQTYEKTFTFDQVIY 148 (395) T ss_pred hcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeCCceecCC------cccccccccCcccceeeecCceeeeEecCccEEE Confidence 9999999999999999999999999999999876443222 22222111121222222222 2246789999999 Q ss_pred EeccCCCCCccCccHHHHHHHHHHHHHHH--HHHHHHHHHccCCcceEEeeCCCC-ChHHHHHHHHHHHHHhccccccCC Q lcl|NC_019422. 160 LRKDFNENDLFGTSPAKVLEPIMEVVNTT--DQGVVKAIKNSNTIKWLLKFKTAL-RPDDIKKEVKSFEKNYLQIDSEAG 236 (384) Q Consensus 160 ~~~~~~~~~~~G~s~~~~~~~~i~~~~~~--~~~~~~~~~ng~~p~~il~~~~~~-~~e~~~~~~~~~~~~~~~~~~~~~ 236 (384) +|+.++....++.+++.....++...... .....+++.+++.+.+++...... ++++.++.++.+++.+.+...+.+ T Consensus 149 ~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 228 (395) T protein:vir:98 149 LKNDNSDLMSKVESLWEEYGELLGHVINNQKIANQIRFTMIPPKDKVRERAQENSDGGRQSKSDKDFFKRTVEKIRTESV 228 (395) T ss_pred ecCCCCCccccccchhhhHHHHHHHHHHHHHHHHHHHHhhccccccccccccccCCcHHHHHHHHHHHHHHHhhhhcCCc Confidence 99877655556666666666666554443 344566778888888888766554 455567777777777766555667 Q ss_pred cceecCCCceeeecccc------hhHHHHHHH-HHHHHHHHHHhCCCHHHhcccc--HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 237 GAAATDSKYDAEQVKAE------SYVPNAAQM-DKAIQRLYSFFNTNEKIIQSKY--SEDEWNAYYESEIEPVGLQLSNQ 307 (384) Q Consensus 237 ~~~v~~~g~~~~~l~~~------~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~~~--~e~~~~~~~~~~i~P~~~~i~~~ 307 (384) +++++++|++|++++.+ +.++++.+. ++++++||++|||||.+||++. .|++..+|+++||.|++.+||++ T Consensus 229 ~v~~l~~g~~~~~l~~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l~~~~sn~e~~~~~f~~~tl~P~~~~ie~~ 308 (395) T protein:vir:98 229 VGIPVTANTNYEEYGSKNTGAVKSYVDDIKKLKDQYMAEFAEMLGIPISLLHGDIADNQKNYELLLEGPIESLITNIVDG 308 (395) T ss_pred ceeecCCCceeEecccccccccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCcccHHHHHHHHHHHHHHHHHHHHHHH Confidence 78899999999999865 456688776 4678999999999999998764 47888999999999999999999 Q ss_pred HhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCC--CCeeeecCceeecC--C Q lcl|NC_019422. 308 YTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIEN--GDKPVRRLDTAVVE--G 382 (384) Q Consensus 308 l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~--gd~~~~~~n~~~~~--~ 382 (384) ||++|+++.++..+. +|+++.+++.|.++++++++ ++++|++|+||+|+++|+||+|+ ||++++|+|++|++ + T Consensus 309 l~~kll~~~~~~~g~--~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~Pi~~~~gD~~~~~~n~~~~~~~g 386 (395) T protein:vir:98 309 LEYAIFDKSETLQGS--FIKVTGLKNYDLFSISNQADKLISSGFVFIDEVREEIGLPELPDGLGKVLYMTKNYESVLERG 386 (395) T ss_pred HHHhcCChhhhcCcc--eeeehhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecccceeccccc Confidence 999999988776665 46678999999999999986 89999999999999999999976 99999999999996 7 Q ss_pred CC Q lcl|NC_019422. 383 GE 384 (384) Q Consensus 383 ge 384 (384) || T Consensus 387 ge 388 (395) T protein:vir:98 387 GE 388 (395) T ss_pred CC Confidence 88 No 77 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=100.00 E-value=2.8e-73 Score=418.36 Aligned_cols=365 Identities=13% Similarity=0.113 Sum_probs=276.4 Q ss_pred CcchhhhcccCCCcchhH--HHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVM--MELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKF 78 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~ 78 (384) ||||++.+..+....... ...++ ......++++++|++||+.||+.+|++||++++++++ ....+++.++ T Consensus 1 Mgl~d~~~~~~~~~~~~~~~~~~~~------~~~~~~~l~~~~v~~~i~~Ia~~ia~lp~~v~~~~~~--~~~~~~~~~l 72 (395) T protein:vir:96 1 MGILDFFSFKKSGTLSDDDSGSTTS------EKLTNVVLKEDALYKCVNYLARIISKSTFRIKAPEKL--TENQKDWLYW 72 (395) T ss_pred CcchhhhcCCCCccccccccccchh------hhcchhhhhhHHHHHHHHHHHHhhccceeEEEeCCcc--ccccchHHHH Confidence 999998665544332221 11111 2334668899999999999999999999999876433 3456778888 Q ss_pred HHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEcC-ceEEEEehhhe Q lcl|NC_019422. 79 LLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLRN-GKIVSYPYSDI 157 (384) Q Consensus 79 l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~-g~~~~~~~~ev 157 (384) |+.+||++||+++||+.++.+++++||+|+++.++..+.+...++.... .. +..++.....+ .....++++|| T Consensus 73 L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~~~~~~~~~~~~~~~~~~~-----~~-~~~~~~v~~~~~~~~~~~~~~dv 146 (395) T protein:vir:96 73 INTKANPNQSASQFWVEVVQKLLVDGETLIFVIPGKGIYVADAFTQDKK-----LS-GNKFKVSRVQGQTYEKIFTFDQV 146 (395) T ss_pred HhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEcCCceecCCccccccc-----cc-cceeeeeeeccceeeeEeccCce Confidence 9999999999999999999999999999999999765433333222111 11 11111122222 23457899999 Q ss_pred EEEeccCCCCCccCccHHHHHHHHHHHHH------HHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccc Q lcl|NC_019422. 158 IHLRKDFNENDLFGTSPAKVLEPIMEVVN------TTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQI 231 (384) Q Consensus 158 ih~~~~~~~~~~~G~s~~~~~~~~i~~~~------~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~ 231 (384) ||+|++++....++.+++..+...+.... ...++..+++.+++.|.+++..++....+..++..+++ +.+. T Consensus 147 ih~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~ 223 (395) T protein:vir:96 147 IYLKNDNSDLMLKVESLWEEYGELLGHVINNQKIANQIRFTMTPPKDKVRERAQENSDGGRQPKSDKDFFKRT---IEKI 223 (395) T ss_pred EEecccCCccccccccccchHHHHHHHHHHHHHHHHHHHHHhhhcccccccceeeccCchhhHHHHHHHHHHH---HHHh Confidence 99998876555555555454444443332 23467788899999999999888766665555444444 3333 Q ss_pred cccCCcceecCCCceeeecccchhHHHHHHHHH-------HHHHHHHHhCCCHHHhccc--cHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 232 DSEAGGAAATDSKYDAEQVKAESYVPNAAQMDK-------AIQRLYSFFNTNEKIIQSK--YSEDEWNAYYESEIEPVGL 302 (384) Q Consensus 232 ~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~~~-------~~~~I~~~fgvp~~~l~~~--~~e~~~~~~~~~~i~P~~~ 302 (384) ..+.++++++++|++|++++.++.++++.+.+. .+++||++|||||.+|+++ +.|++..+|+++||.|++. T Consensus 224 ~~~~~~v~~l~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp~~l~~~~sn~e~~~~~f~~~~L~P~~~ 303 (395) T protein:vir:96 224 RTESVVGIPVTANTNYEEYGSKNTGSVKSYVDDIKKLKDQYMAEFAEMLGIPISLLHGDIADNQKNYELLLEGPIESLIT 303 (395) T ss_pred hcCCcceEEccCCceeEecccChhhhhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCccHHHHHHHHHHHHHHHHHH Confidence 345667888999999999999999998866532 3578999999999999874 5678899999999999999 Q ss_pred HHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCC--CCeeeecCceee Q lcl|NC_019422. 303 QLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIEN--GDKPVRRLDTAV 379 (384) Q Consensus 303 ~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~--gd~~~~~~n~~~ 379 (384) +||++|+++|+++.++..+. +|+++.+++.|.++++++++ ++.+|++|+||+|+++|+||+|+ ||++++|+|++| T Consensus 304 ~ie~~l~~~Ll~~~e~~~~~--~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~~~~gD~~~~~~N~~~ 381 (395) T protein:vir:96 304 NIVDGLEYAIFDKSETLEGS--FIKVTGLKNYDLFSISSQADKLISSGFVFIDEVREEIGLPELPDGLGKVLYMTKNYES 381 (395) T ss_pred HHHHHHHhhcCChhhhcCce--eEeecchhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeeccccee Confidence 99999999999987776654 46778999999999999886 79999999999999999999966 999999999999 Q ss_pred c--CCCC Q lcl|NC_019422. 380 V--EGGE 384 (384) Q Consensus 380 ~--~~ge 384 (384) + ++|| T Consensus 382 ~~~~gge 388 (395) T protein:vir:96 382 VLERGGE 388 (395) T ss_pred chhccCC Confidence 9 4677 No 78 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=100.00 E-value=1.7e-72 Score=414.07 Aligned_cols=358 Identities=15% Similarity=0.169 Sum_probs=291.0 Q ss_pred CcchhhhcccCCCcchh----------HHHhhcc--ccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKV----------MMELISD--SGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEF 68 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~----------~~~~~~~--~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~ 68 (384) ||||+++++...++... ...+++. ..++...+...++++++|++||++||+.||++|+++++... T Consensus 3 m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~--- 79 (392) T protein:vir:39 3 LPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN--- 79 (392) T ss_pred chhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeeccchh--- Confidence 99999766544332211 1111111 11233446677899999999999999999999999975442 Q ss_pred eeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCE--EEEEEEcC Q lcl|NC_019422. 69 KTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVL--FLKFLLRN 146 (384) Q Consensus 69 ~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~--~~~~~~~~ 146 (384) ..|+.+||++||+++||+.++.+++++||+|++++|+..|.+.+|+|++|.+|++..+.++. .|.+...+ T Consensus 80 --------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~~~y~~~~~~ 151 (392) T protein:vir:39 80 --------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGMYYNITFDD 151 (392) T ss_pred --------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEecC Confidence 45888999999999999999999999999999999999999999999999999999876553 34444443 Q ss_pred ---ceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHH Q lcl|NC_019422. 147 ---GKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKS 223 (384) Q Consensus 147 ---g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~ 223 (384) +....++++||||++++++.+.++|+||+.++..++....++++++.++|+||++|+++|++++....+ ++.+++ T Consensus 152 ~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~--~~~~~~ 229 (392) T protein:vir:39 152 PKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLS--DKDKAS 229 (392) T ss_pred cccceeEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCch--HHHHHH Confidence 346789999999999888777799999999999999999999999999999999999999998765443 334566 Q ss_pred HHHHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhccc----cHHHHHHHHHHHHHH Q lcl|NC_019422. 224 FEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQSK----YSEDEWNAYYESEIE 298 (384) Q Consensus 224 ~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~~----~~e~~~~~~~~~~i~ 298 (384) |.+.+.+. .++++++|+++|++|++++.++.++++.+. ++++++||++|||||.+||.+ +++++..+|++.||. T Consensus 230 ~~~~~~~~-~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~~~~~f~~~~l~ 308 (392) T protein:vir:39 230 RSRSFMKR-SRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQISGMYASALN 308 (392) T ss_pred HHHHHhcc-ccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHH Confidence 77777664 567899999999999999999999998876 578899999999999999753 345678899999999 Q ss_pred HHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHH-HHHhCCCCCHHHHHHHh---CCCCCCCCCeeeec Q lcl|NC_019422. 299 PVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLV-QMVDRGSLTPNEWRKIM---NLSPIENGDKPVRR 374 (384) Q Consensus 299 P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~-~~~~~g~~t~NE~R~~l---G~~p~~~gd~~~~~ 374 (384) |+++.|+++|+++|++. ++||...+.+.|..++++.+ +++++|++|+||+|+++ |+.| ++.... T Consensus 309 P~~~~ie~~l~~~L~~~--------~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p----~e~r~~ 376 (392) T protein:vir:39 309 RYLRPAISELEYKLSDH--------ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIP----KDLPAP 376 (392) T ss_pred HHHHHHHHHHHHhcccc--------ccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCc----cccchh Confidence 99999999999999763 56778888888888888776 58899999999999987 5543 233344 Q ss_pred CceeecCCCC Q lcl|NC_019422. 375 LDTAVVEGGE 384 (384) Q Consensus 375 ~n~~~~~~ge 384 (384) .|+-|+.+|| T Consensus 377 e~l~~~~~Gd 386 (392) T protein:vir:39 377 ENTNKKTTGQ 386 (392) T ss_pred cCCCCCCCCC Confidence 5666666666 No 79 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=100.00 E-value=1.7e-72 Score=414.07 Aligned_cols=358 Identities=15% Similarity=0.169 Sum_probs=291.0 Q ss_pred CcchhhhcccCCCcchh----------HHHhhcc--ccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKV----------MMELISD--SGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEF 68 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~----------~~~~~~~--~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~ 68 (384) ||||+++++...++... ...+++. ..++...+...++++++|++||++||+.||++|+++++... T Consensus 3 m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~--- 79 (392) T protein:vir:10 3 LPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN--- 79 (392) T ss_pred chhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeeccchh--- Confidence 99999766544332211 1111111 11233446677899999999999999999999999975442 Q ss_pred eeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCE--EEEEEEcC Q lcl|NC_019422. 69 KTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVL--FLKFLLRN 146 (384) Q Consensus 69 ~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~--~~~~~~~~ 146 (384) ..|+.+||++||+++||+.++.+++++||+|++++|+..|.+.+|+|++|.+|++..+.++. .|.+...+ T Consensus 80 --------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~~~y~~~~~~ 151 (392) T protein:vir:10 80 --------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGMYYNITFDD 151 (392) T ss_pred --------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEecC Confidence 45888999999999999999999999999999999999999999999999999999876553 34444443 Q ss_pred ---ceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHH Q lcl|NC_019422. 147 ---GKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKS 223 (384) Q Consensus 147 ---g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~ 223 (384) +....++++||||++++++.+.++|+||+.++..++....++++++.++|+||++|+++|++++....+ ++.+++ T Consensus 152 ~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~--~~~~~~ 229 (392) T protein:vir:10 152 PKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLS--DKDKAS 229 (392) T ss_pred cccceeEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCch--HHHHHH Confidence 346789999999999888777799999999999999999999999999999999999999998765443 334566 Q ss_pred HHHHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhccc----cHHHHHHHHHHHHHH Q lcl|NC_019422. 224 FEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQSK----YSEDEWNAYYESEIE 298 (384) Q Consensus 224 ~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~~----~~e~~~~~~~~~~i~ 298 (384) |.+.+.+. .++++++|+++|++|++++.++.++++.+. ++++++||++|||||.+||.+ +++++..+|++.||. T Consensus 230 ~~~~~~~~-~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~~~~~f~~~~l~ 308 (392) T protein:vir:10 230 RSRSFMKR-SRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQISGMYASALN 308 (392) T ss_pred HHHHHhcc-ccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHH Confidence 77777664 567899999999999999999999998876 578899999999999999753 345678899999999 Q ss_pred HHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHH-HHHhCCCCCHHHHHHHh---CCCCCCCCCeeeec Q lcl|NC_019422. 299 PVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLV-QMVDRGSLTPNEWRKIM---NLSPIENGDKPVRR 374 (384) Q Consensus 299 P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~-~~~~~g~~t~NE~R~~l---G~~p~~~gd~~~~~ 374 (384) |+++.|+++|+++|++. ++||...+.+.|..++++.+ +++++|++|+||+|+++ |+.| ++.... T Consensus 309 P~~~~ie~~l~~~L~~~--------~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p----~e~r~~ 376 (392) T protein:vir:10 309 RYLRPAISELEYKLSDH--------ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIP----KDLPAP 376 (392) T ss_pred HHHHHHHHHHHHhcccc--------ccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCc----cccchh Confidence 99999999999999763 56778888888888888776 58899999999999987 5543 233344 Q ss_pred CceeecCCCC Q lcl|NC_019422. 375 LDTAVVEGGE 384 (384) Q Consensus 375 ~n~~~~~~ge 384 (384) .|+-|+.+|| T Consensus 377 e~l~~~~~Gd 386 (392) T protein:vir:10 377 ENTNKKTTGQ 386 (392) T ss_pred cCCCCCCCCC Confidence 5666666666 No 80 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=100.00 E-value=7.9e-73 Score=415.87 Aligned_cols=344 Identities=15% Similarity=0.165 Sum_probs=272.0 Q ss_pred CcchhhhcccCC-CcchhHHHhhc--cccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHH Q lcl|NC_019422. 1 MNIFKSKKKNKE-APGKVMMELIS--DSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIK 77 (384) Q Consensus 1 M~~f~~~~~~~~-~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~ 77 (384) |+||+.+++... .+..+...+.. .+..+..++..+++++++|++||+.||++||++|+. .++.++ T Consensus 1 M~~~~~f~~r~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~av~~cv~~ia~~ia~~p~~------------~~~~~~ 68 (359) T protein:vir:10 1 MSILNPFERRSSITPNNYYPFMVQNGSIVPNSLVDATEALKNSDLYAVTSLISSDIAGTRFI------------GNQVFT 68 (359) T ss_pred CcccchhhccccCCCCcchhhhhccccccCCcccCHHHhhcchHHHHHHHHHHHhhhcCccc------------cchHHH Confidence 999985433222 22222222222 222334456678899999999999999999999983 567899 Q ss_pred HHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEc-CceEEEEehhh Q lcl|NC_019422. 78 FLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLR-NGKIVSYPYSD 156 (384) Q Consensus 78 ~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~-~g~~~~~~~~e 156 (384) .|+.+||++||+++||+.++.+++++||+|++++|+..|.+.+||+++|..|++..++++.+|.+... ++..+.++++| T Consensus 69 ~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~~~y~~~~~~~~~~~~~~~~e 148 (359) T protein:vir:10 69 SVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGDNSLMKELRLIPSNAITIDLTDDTLTYEVNQFDDYPSAKYNASE 148 (359) T ss_pred HHhhcccccCCHHHHHHHHHHhccccCceEEEEEECCCCeEEEEEEeCCceEEEEEcCCeEEEEEEecCCceEEEEcccc Confidence 99999999999999999999999999999999999999999999999999999998888877776644 56788999999 Q ss_pred eEEEeccC----CCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCC-CCChHHHHHHHHHHHHHhccc Q lcl|NC_019422. 157 IIHLRKDF----NENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKT-ALRPDDIKKEVKSFEKNYLQI 231 (384) Q Consensus 157 vih~~~~~----~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-~~~~e~~~~~~~~~~~~~~~~ 231 (384) |||++.++ +.++++|+||+.++..++....+++++..++|+||++|+++|++++ .+++++.+++++.|++.+ ++ T Consensus 149 vih~~~~~~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~~~~~~-~~ 227 (359) T protein:vir:10 149 MIHVKIMAYGVDTLHNLVGHSPLESLTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQGTLSSEAKDSIRKEFEKAN-GG 227 (359) T ss_pred eEEeccCCCCCCccCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHh-Cc Confidence 99999765 3578899999999999999999999999999999999999999975 689999999999998766 43 Q ss_pred cccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcccc----HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 232 DSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQSKY----SEDEWNAYYESEIEPVGLQLSN 306 (384) Q Consensus 232 ~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~~~----~e~~~~~~~~~~i~P~~~~i~~ 306 (384) .++|+++|+++|++|++++.++.|+|+.+. ++++++||++|||||.+||+.. +.++.++++..++.|.+..+++ T Consensus 228 -~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~e~~~~~~l~~~l~p~~~ 306 (359) T protein:vir:10 228 -NNSGRVMVLDQSADFSTVSINADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQQSSLDQIKDLYVNALNRFIEPLIS 306 (359) T ss_pred -cccCCceecCCCcceeeecCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCcccccHHHHHHHHHHHHHHHHHHHHH Confidence 578999999999999999999999999776 5789999999999999997532 2222334444444444444455 Q ss_pred HHhhcccCcccccCcceEEeechhhhccCHHH-HHHHHHHHhCCCCCHHHHHHHhCCCCCC Q lcl|NC_019422. 307 QYTEKLFTRKARSFGNEIVFEASNLQYASMST-KLNLVQMVDRGSLTPNEWRKIMNLSPIE 366 (384) Q Consensus 307 ~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~-~~~~~~~~~~g~~t~NE~R~~lG~~p~~ 366 (384) +|+.+|....+.+.+..+++| .+. +.++.+++++|++|+||+|+++|++|+= T Consensus 307 ~l~~~l~~~~~~~~~~~~~~d--------~~~~~~~~~~~~~~G~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 307 ELRIKCDSSIGVDMSPITDYS--------NSVFKADILNWVKEGIIEPTEAKTLLESKGII 359 (359) T ss_pred HHHHHhhhhhcccchhhhhcC--------HHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 555544433223333333333 222 3344568999999999999999999985 No 81 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=100.00 E-value=1.7e-71 Score=408.52 Aligned_cols=375 Identities=9% Similarity=0.068 Sum_probs=290.7 Q ss_pred CcchhhhcccCC-CcchhHHHhhccccCccee------c-------hhhhhhcHHHHHHHHHHHHhhcc----------- Q lcl|NC_019422. 1 MNIFKSKKKNKE-APGKVMMELISDSGNGFYS------W-------HGNLYKSDIVRSIIRPKAKAVGK----------- 55 (384) Q Consensus 1 M~~f~~~~~~~~-~~~~~~~~~~~~~~~~~~~------~-------~~~~~~~~~v~~~i~~ia~~ia~----------- 55 (384) ++.+.+.+..+. +...+...+++.. .+++. . -.++-.++++++||+.+|+.+|. T Consensus 43 ~~~~~~~~~~~~~a~~~~~~~~~~~~-~~~~~~~~~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~ 121 (563) T protein:vir:95 43 YQDLTKSLYGQQQAYAEPFIEMMDTN-PEFRDKRSYMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKG 121 (563) T ss_pred HHHHHhhhccCCCcchhhhHhhhccc-ccccccccCCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhccc Confidence 666665443333 2233333333222 12211 0 12334588999999999999885 Q ss_pred --CceEEEEecCCccee--ccchHHHHHHh----hcccc-CCHHHHHHHHHHHHHHhCCeeEEEe--eCCCCceeeEEEE Q lcl|NC_019422. 56 --MTAKHIRSNETEFKT--NPEIYIKFLLE----NPNPF-MSGQILQEKMVTQLELNSNAFAVII--KDDYNMPTQIYPL 124 (384) Q Consensus 56 --~~~~~~~~~~~~~~~--~~~~~~~~l~~----~PN~~-~s~~~f~~~~~~~~l~~G~~~~~~~--~~~~g~~~~l~~l 124 (384) +++++++.+.++.+. ...+.++.++. .|||+ +|+.+||+.++.+++++||+|++++ |+..|++.+|||| T Consensus 122 ~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl 201 (563) T protein:vir:95 122 LGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAV 201 (563) T ss_pred ccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEe Confidence 567776665544322 22333333332 33443 6889999999999999999999876 7788999999999 Q ss_pred cCceEEEEEcCCCEEE------EEEEcCceEEEEehhheEEEeccCCCC---CccCccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 125 NALNVEAIYENEVLFL------KFLLRNGKIVSYPYSDIIHLRKDFNEN---DLFGTSPAKVLEPIMEVVNTTDQGVVKA 195 (384) Q Consensus 125 ~~~~v~~~~~~~~~~~------~~~~~~g~~~~~~~~evih~~~~~~~~---~~~G~s~~~~~~~~i~~~~~~~~~~~~~ 195 (384) +|++|++..+.++..+ ++...++....++++|+||++++...+ +.+|+||+.++..++....++++++.++ T Consensus 202 ~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~ 281 (563) T protein:vir:95 202 DPSTIFYATDKKGKIIKGGKRFVQVVDKRVVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRF 281 (563) T ss_pred CCceeEEEECCCCceeccceeEEEEeCCceeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999998877543 344455566789999988777665444 7889999999999999999999999999 Q ss_pred HHccCCcceEEeeCC--CCChHHHHHHHHHHHHHhccccccCCcc-eecCCCceeeecccchhHHHHHHH-HHHHHHHHH Q lcl|NC_019422. 196 IKNSNTIKWLLKFKT--ALRPDDIKKEVKSFEKNYLQIDSEAGGA-AATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYS 271 (384) Q Consensus 196 ~~ng~~p~~il~~~~--~~~~e~~~~~~~~~~~~~~~~~~~~~~~-~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~ 271 (384) |+||++|+++|++++ .+++++++++++.|++.++|. .++|++ +|+++|++|+++++++.+++|.+. ++++++||+ T Consensus 282 f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~-~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~ 360 (563) T protein:vir:95 282 FSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGI-NGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIISA 360 (563) T ss_pred HHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccc-cccccceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHHH Confidence 999999999999875 478999999999999999774 566775 789999999999999999999876 678999999 Q ss_pred HhCCCHHHhcc-----------------ccHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhcc Q lcl|NC_019422. 272 FFNTNEKIIQS-----------------KYSEDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYA 334 (384) Q Consensus 272 ~fgvp~~~l~~-----------------~~~e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~ 334 (384) +|||||.+||. ++.+++..+|++.||.|++..||++||++|+++++. .+.|+ +++. T Consensus 361 afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~~----~~~~~---f~r~ 433 (563) T protein:vir:95 361 LYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISEYGD----KYTFQ---FVGG 433 (563) T ss_pred HhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhccc----ccEEE---eccC Confidence 99999999972 345788889999999999999999999999987542 23333 2466 Q ss_pred CHHHHHHHH---HHHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeecCCCC Q lcl|NC_019422. 335 SMSTKLNLV---QMVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 335 d~~~~~~~~---~~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~~ge 384 (384) |.+++.+.+ +++.+|+||+||+|+++|+||+||||++++|.|+++++.++ T Consensus 434 D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~ 486 (563) T protein:vir:95 434 DTKSATDKLNILKLETQIFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQ 486 (563) T ss_pred CHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCCCCcceeecccccccccccc Confidence 777777654 45788999999999999999999999999999988775443 No 82 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=100.00 E-value=1.7e-71 Score=408.52 Aligned_cols=375 Identities=9% Similarity=0.068 Sum_probs=290.7 Q ss_pred CcchhhhcccCC-CcchhHHHhhccccCccee------c-------hhhhhhcHHHHHHHHHHHHhhcc----------- Q lcl|NC_019422. 1 MNIFKSKKKNKE-APGKVMMELISDSGNGFYS------W-------HGNLYKSDIVRSIIRPKAKAVGK----------- 55 (384) Q Consensus 1 M~~f~~~~~~~~-~~~~~~~~~~~~~~~~~~~------~-------~~~~~~~~~v~~~i~~ia~~ia~----------- 55 (384) ++.+.+.+..+. +...+...+++.. .+++. . -.++-.++++++||+.+|+.+|. T Consensus 43 ~~~~~~~~~~~~~a~~~~~~~~~~~~-~~~~~~~~~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~ 121 (563) T protein:vir:99 43 YQDLTKSLYGQQQAYAEPFIEMMDTN-PEFRDKRSYMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKG 121 (563) T ss_pred HHHHHhhhccCCCcchhhhHhhhccc-ccccccccCCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhccc Confidence 666665443333 2233333333222 12211 0 12334588999999999999885 Q ss_pred --CceEEEEecCCccee--ccchHHHHHHh----hcccc-CCHHHHHHHHHHHHHHhCCeeEEEe--eCCCCceeeEEEE Q lcl|NC_019422. 56 --MTAKHIRSNETEFKT--NPEIYIKFLLE----NPNPF-MSGQILQEKMVTQLELNSNAFAVII--KDDYNMPTQIYPL 124 (384) Q Consensus 56 --~~~~~~~~~~~~~~~--~~~~~~~~l~~----~PN~~-~s~~~f~~~~~~~~l~~G~~~~~~~--~~~~g~~~~l~~l 124 (384) +++++++.+.++.+. ...+.++.++. .|||+ +|+.+||+.++.+++++||+|++++ |+..|++.+|||| T Consensus 122 ~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl 201 (563) T protein:vir:99 122 LGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAV 201 (563) T ss_pred ccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEe Confidence 567776665544322 22333333332 33443 6889999999999999999999876 7788999999999 Q ss_pred cCceEEEEEcCCCEEE------EEEEcCceEEEEehhheEEEeccCCCC---CccCccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 125 NALNVEAIYENEVLFL------KFLLRNGKIVSYPYSDIIHLRKDFNEN---DLFGTSPAKVLEPIMEVVNTTDQGVVKA 195 (384) Q Consensus 125 ~~~~v~~~~~~~~~~~------~~~~~~g~~~~~~~~evih~~~~~~~~---~~~G~s~~~~~~~~i~~~~~~~~~~~~~ 195 (384) +|++|++..+.++..+ ++...++....++++|+||++++...+ +.+|+||+.++..++....++++++.++ T Consensus 202 ~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~ 281 (563) T protein:vir:99 202 DPSTIFYATDKKGKIIKGGKRFVQVVDKRVVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRF 281 (563) T ss_pred CCceeEEEECCCCceeccceeEEEEeCCceeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999998877543 344455566789999988777665444 7889999999999999999999999999 Q ss_pred HHccCCcceEEeeCC--CCChHHHHHHHHHHHHHhccccccCCcc-eecCCCceeeecccchhHHHHHHH-HHHHHHHHH Q lcl|NC_019422. 196 IKNSNTIKWLLKFKT--ALRPDDIKKEVKSFEKNYLQIDSEAGGA-AATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYS 271 (384) Q Consensus 196 ~~ng~~p~~il~~~~--~~~~e~~~~~~~~~~~~~~~~~~~~~~~-~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~ 271 (384) |+||++|+++|++++ .+++++++++++.|++.++|. .++|++ +|+++|++|+++++++.+++|.+. ++++++||+ T Consensus 282 f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~-~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~ 360 (563) T protein:vir:99 282 FSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGI-NGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIISA 360 (563) T ss_pred HHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccc-cccccceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHHH Confidence 999999999999875 478999999999999999774 566775 789999999999999999999876 678999999 Q ss_pred HhCCCHHHhcc-----------------ccHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhcc Q lcl|NC_019422. 272 FFNTNEKIIQS-----------------KYSEDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYA 334 (384) Q Consensus 272 ~fgvp~~~l~~-----------------~~~e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~ 334 (384) +|||||.+||. ++.+++..+|++.||.|++..||++||++|+++++. .+.|+ +++. T Consensus 361 afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~~----~~~~~---f~r~ 433 (563) T protein:vir:99 361 LYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISEYGD----KYTFQ---FVGG 433 (563) T ss_pred HhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhccc----ccEEE---eccC Confidence 99999999972 345788889999999999999999999999987542 23333 2466 Q ss_pred CHHHHHHHH---HHHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeecCCCC Q lcl|NC_019422. 335 SMSTKLNLV---QMVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 335 d~~~~~~~~---~~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~~ge 384 (384) |.+++.+.+ +++.+|+||+||+|+++|+||+||||++++|.|+++++.++ T Consensus 434 D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~ 486 (563) T protein:vir:99 434 DTKSATDKLNILKLETQIFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQ 486 (563) T ss_pred CHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCCCCcceeecccccccccccc Confidence 777777654 45788999999999999999999999999999988775443 No 83 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=100.00 E-value=1.1e-70 Score=404.09 Aligned_cols=375 Identities=11% Similarity=0.068 Sum_probs=279.2 Q ss_pred Ccchhhhc-------ccC-CCcchhHHHh---hccccCccee------c-------hhhhhhcHHHHHHHHHHHHhhcc- Q lcl|NC_019422. 1 MNIFKSKK-------KNK-EAPGKVMMEL---ISDSGNGFYS------W-------HGNLYKSDIVRSIIRPKAKAVGK- 55 (384) Q Consensus 1 M~~f~~~~-------~~~-~~~~~~~~~~---~~~~~~~~~~------~-------~~~~~~~~~v~~~i~~ia~~ia~- 55 (384) =+.|.... +.. ..+......+ +++ ..++.. . -.++-.+|+|++||+.||+.||+ T Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~a~~~p~~~~~~~-~~~~~~~p~~~~~~~~~~~~l~~~~~npiv~~~I~~ia~~vA~~ 110 (576) T protein:vir:96 32 QANIRNIEEKSKELNKSLYGKQQAYAEPFLEVMDT-NPEFRTKRSYMKNSDNLHDVLKQFGNNPILNAIILTRSNQVAMY 110 (576) T ss_pred hHHHHHhhhhhhhhccccCCccchhhcceeeeeec-CCCccccCcchhhhhhhHHHHHHhhcCHHHHHHHHHHHHHHHhh Confidence 11122110 111 0111111111 111 011110 0 02234589999999999999996 Q ss_pred ----------CceEEEEecCCcc--ee--cc----chHHHHHHhhcccc-CCHHHHHHHHHHHHHHhCCeeEEEee--CC Q lcl|NC_019422. 56 ----------MTAKHIRSNETEF--KT--NP----EIYIKFLLENPNPF-MSGQILQEKMVTQLELNSNAFAVIIK--DD 114 (384) Q Consensus 56 ----------~~~~~~~~~~~~~--~~--~~----~~~~~~l~~~PN~~-~s~~~f~~~~~~~~l~~G~~~~~~~~--~~ 114 (384) ++|.+..+..++. .. .. .+.+..++..|||+ +|+.+||+.++.+++++||+|+++++ +. T Consensus 111 ~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~dlll~Gna~~~i~~~rd~ 190 (576) T protein:vir:96 111 CQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKDIDRDSFQSFCRKIVRDTYTYDQVNFEKVFNKKN 190 (576) T ss_pred hhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEEecCC Confidence 3344433332221 11 11 11223334456666 58999999999999999999999885 45 Q ss_pred CCceeeEEEEcCceEEEEEcCCCEEEE------EEEcCceEEEEehhheEEEeccCCCC---CccCccHHHHHHHHHHHH Q lcl|NC_019422. 115 YNMPTQIYPLNALNVEAIYENEVLFLK------FLLRNGKIVSYPYSDIIHLRKDFNEN---DLFGTSPAKVLEPIMEVV 185 (384) Q Consensus 115 ~g~~~~l~~l~~~~v~~~~~~~~~~~~------~~~~~g~~~~~~~~evih~~~~~~~~---~~~G~s~~~~~~~~i~~~ 185 (384) .|++++||||+|.+|++..+.++..+. +...++....++++||||++++...+ +.+|+||+.++..++... T Consensus 191 ~g~~~~L~pl~p~~V~v~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~dii~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~ 270 (576) T protein:vir:96 191 ATTMDKFIAVDPSTIFYATDKNGKIIKGGKRFVQVINKKVVASFTSREMAMGIRNPRTELSSSGYGLSEVEIAMKQFIAY 270 (576) T ss_pred CCceEEEEEeCCceeEEEECCCCceeeeeeEEEEecCCceEEEecccceEEEeecCCCCcccCcccccHHHHHHHHHHHH Confidence 688999999999999999998875432 22334566789999999887665544 678999999999999999 Q ss_pred HHHHHHHHHHHHccCCcceEEeeCC--CCChHHHHHHHHHHHHHhccccccCCc-ceecCCCceeeecccchhHHHHHHH Q lcl|NC_019422. 186 NTTDQGVVKAIKNSNTIKWLLKFKT--ALRPDDIKKEVKSFEKNYLQIDSEAGG-AAATDSKYDAEQVKAESYVPNAAQM 262 (384) Q Consensus 186 ~~~~~~~~~~~~ng~~p~~il~~~~--~~~~e~~~~~~~~~~~~~~~~~~~~~~-~~v~~~g~~~~~l~~~~~~~~~~~~ 262 (384) .++++++.++|+||++|+++|++++ .+++++++++++.|++.++|. .++++ ++|+++|++|+++++++.+++|.+. T Consensus 271 ~~~~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~G~-~nag~~p~vl~~G~~~~~ls~~~~d~qfle~ 349 (576) T protein:vir:96 271 NNTETFNDRFFSHGGTTRGILQIKSEQQQSQRALENFKREWKSSFSGI-NGSWQVPVVMADDIKFVNMTPTANDMQFEKW 349 (576) T ss_pred HHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccc-cccccceeecCCCceEEeccCChhhHHHHHH Confidence 9999999999999999999999875 578999999999999999774 56677 5899999999999999999999876 Q ss_pred -HHHHHHHHHHhCCCHHHhcc-----------------ccHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceE Q lcl|NC_019422. 263 -DKAIQRLYSFFNTNEKIIQS-----------------KYSEDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEI 324 (384) Q Consensus 263 -~~~~~~I~~~fgvp~~~l~~-----------------~~~e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i 324 (384) ++++++||++|||||.+||. ++.|++..+|++.||.|++..||++||++|++.++. ++ T Consensus 350 ~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~----~~ 425 (576) T protein:vir:96 350 LTYLINIISALYGIDPAEIGFPNRGGATGGKGGNTLNEADPGKKQQQSQNKGLQPLLRFIEDLINTHIISEYSD----KY 425 (576) T ss_pred HHHhHHHHHHHhCCCHHHccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhccC----ce Confidence 57899999999999999963 467889999999999999999999999999987542 23 Q ss_pred EeechhhhccCHHHHHHHHH---HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCceeecCCCC Q lcl|NC_019422. 325 VFEASNLQYASMSTKLNLVQ---MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 325 ~fd~~~~~~~d~~~~~~~~~---~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~~ge 384 (384) .|++ ++.|.+++.+.++ .+.+|+||+||+|+++|+||+||||+++.|.|++++.... T Consensus 426 ~~~f---~r~d~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~ 485 (576) T protein:vir:96 426 VFQF---VGGDTKSELDKIKILQEEVKTYKTVNEARKEKGLKPIEGGDVLLDGSFIQSMSLNT 485 (576) T ss_pred EEEe---ccCCHHHHHHHHHHHHHHhcCccCHHHHHHHhCCCCCCCcceeccccccccccccc Confidence 3332 4556666666543 4567999999999999999999999999999987764222 No 84 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=100.00 E-value=1.1e-70 Score=404.23 Aligned_cols=364 Identities=12% Similarity=0.071 Sum_probs=282.7 Q ss_pred chhh------------hcccCCCcchhH---HHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCc Q lcl|NC_019422. 3 IFKS------------KKKNKEAPGKVM---MELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETE 67 (384) Q Consensus 3 ~f~~------------~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 67 (384) +|+. .++....+.... ..++.... .+......+..+++|++||+.||+.||+++++++..+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pp~-~~~~La~~~~~n~~v~scI~~ia~~ia~~~~~i~~~~~-- 77 (540) T protein:vir:41 1 MFNYHLSIKSLEKYRAIKGDTDSQALKEDRFEEYVEPKV-HPLVLLSLLQVNPYHASACSIKANDILRTGYLIDGDDG-- 77 (540) T ss_pred CCCcccChhhccchhhhhccccccccccCCCCccccCCC-CHHHHHHHHHhcHHHHHHHHHHHHHHhcCCceEecCcc-- Confidence 3332 222211111111 11111110 01112245678999999999999999999999854321 Q ss_pred ceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEE------- Q lcl|NC_019422. 68 FKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFL------- 140 (384) Q Consensus 68 ~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~------- 140 (384) .+.. ..||++||+.+||+.++.+++++||||++++|+..|.+.+|+||+|.+|++..+..+.+. T Consensus 78 ------~~~~---~lpN~~~t~~~f~~~~v~dlll~Gnayv~i~r~~~G~~~~L~~i~~~~V~v~~~~~~~~~~~d~~~~ 148 (540) T protein:vir:41 78 ------GVEE---LLRACRPSFEFILLQALEDLQVFNYCTLEVVRDDQGEPVRLDYIPAHTVRVHRDGSRYMQTWDGIHV 148 (540) T ss_pred ------chhh---hccCCCCCHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCcceEEeEcCceeEeeecCcee Confidence 1222 249999999999999999999999999999999999999999999999998877654321 Q ss_pred EEEE-----------cCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC Q lcl|NC_019422. 141 KFLL-----------RNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK 209 (384) Q Consensus 141 ~~~~-----------~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~ 209 (384) .|.. .+...+.++++||||+|.+++.++++|+||+.++..++....++++++.++|+||++|+++|+++ T Consensus 149 ~~~~~~~~~~~~~~~~g~~~~~~~~~eViHir~~~~~~~~~G~Spi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~ 228 (540) T protein:vir:41 149 TYFKDYRYEGEVNPDNGEDQDGVGANEIIFIHLPSPICSYYGVPRYLSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVT 228 (540) T ss_pred eeeecccccceeeccccccceeecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC Confidence 1111 11234578999999999999999999999999999999999999999999999999999999998 Q ss_pred CCCChHH----------HHHHHHHHHHHhccccccCCcceecC------CCceeeecccchhHHHHHHH-HHHHHHHHHH Q lcl|NC_019422. 210 TALRPDD----------IKKEVKSFEKNYLQIDSEAGGAAATD------SKYDAEQVKAESYVPNAAQM-DKAIQRLYSF 272 (384) Q Consensus 210 ~~~~~e~----------~~~~~~~~~~~~~~~~~~~~~~~v~~------~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~ 272 (384) +.+++++ .+++++.|.+.+++..+++|+++|++ +|++|++++.++.+++|.+. ++++++||++ T Consensus 229 g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~eIa~a 308 (540) T protein:vir:41 229 GEFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIPGGDTVEVTFTPLNTSQKELSFREYAAEKKHDIAAA 308 (540) T ss_pred cccCchhccchHHHHHHHHHHHHHHHHHhccccccccceEEEecCCCcccceeEEecccchhHHHHHHHHHHHHHHHHHH Confidence 8776543 35667777778888778899999984 79999999999999999876 5789999999 Q ss_pred hCCCHHHhc--------cccHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH Q lcl|NC_019422. 273 FNTNEKIIQ--------SKYSEDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ 344 (384) Q Consensus 273 fgvp~~~l~--------~~~~e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~ 344 (384) |||||.+|| +++.+++...|+++||.|+++.|+++||++|+++.+ .+.+++||.+++++.|.+++. .+ T Consensus 309 fgVPp~~lG~~~~~~~n~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~L~~~~~--~~~~i~f~~~~ll~~D~~~~~--~~ 384 (540) T protein:vir:41 309 HMIDPYRLGITDVGPLGGNFAEVARRTYYESVVRPQQEIVSSVLTDFIQLKLD--PGARFVFNEEILMESEFVHNY--AL 384 (540) T ss_pred hCCCHHHcCcccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccC--CceEEEecchhhcchHHHHHH--HH Confidence 999999996 236688899999999999999999999999988654 467899999999988766553 34 Q ss_pred HHhCCCCCHHHHHHHh-CCCCCCCCCeeeecCceeecC--CCC Q lcl|NC_019422. 345 MVDRGSLTPNEWRKIM-NLSPIENGDKPVRRLDTAVVE--GGE 384 (384) Q Consensus 345 ~~~~g~~t~NE~R~~l-G~~p~~~gd~~~~~~n~~~~~--~ge 384 (384) ++++|++|+||+|+.| |++| ++|.++.|.|+...+ +++ T Consensus 385 lv~~G~lT~NE~Re~L~g~e~--gdd~~l~p~n~~~~~~~~~~ 425 (540) T protein:vir:41 385 LVQCGVLTPSEVREKLFGLDG--GPDMFMVPSSIGKSAMKRQK 425 (540) T ss_pred HHhCCCCCHHHHHHHhCcCcC--CCcccccccccccccccccc Confidence 7899999999999864 5443 346778888875432 222 No 85 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=100.00 E-value=8.6e-70 Score=399.24 Aligned_cols=353 Identities=12% Similarity=0.077 Sum_probs=283.8 Q ss_pred chhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcce----ec-cchHHHHHHhhccccC--------CHHHHHHHHHH Q lcl|NC_019422. 32 WHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFK----TN-PEIYIKFLLENPNPFM--------SGQILQEKMVT 98 (384) Q Consensus 32 ~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~----~~-~~~~~~~l~~~PN~~~--------s~~~f~~~~~~ 98 (384) -..-+..+|+|++||+.||+.||++|++++.+.+.+.. .. ......++..+||+.| ++.+||+.++. T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~~ 80 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTAWT 80 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHHHH Confidence 22234568999999999999999999999765432211 11 2223345666788765 66789999999 Q ss_pred HHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEE-------EEE-----------------------EEcCce Q lcl|NC_019422. 99 QLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLF-------LKF-----------------------LLRNGK 148 (384) Q Consensus 99 ~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~-------~~~-----------------------~~~~g~ 148 (384) +++++||+|++++|+..|++++|++|+|.+|++..+..... .+| ....|. T Consensus 81 ~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (467) T protein:vir:31 81 DYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDERGFVQLLEEKEKYFGVAGDRYQTNGNGDLDPVFVDADDGSTGT 160 (467) T ss_pred HHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeecceeEeecCCceeeEEeccccceeecccceeeeeeeeccccccc Confidence 99999999999999999999999999999999877654321 110 012355 Q ss_pred EEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHHHHHHHHHHHH Q lcl|NC_019422. 149 IVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDIKKEVKSFEKN 227 (384) Q Consensus 149 ~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~~~~~~~~ 227 (384) .+.++++||||+|.+++.++++|+||+.++..++....++.+++.++|+||++|+++|+++ +.+++++.+++++.|.+. T Consensus 161 ~~~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~ 240 (467) T protein:vir:31 161 SVSNPANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFENDGVPRIAIIVKGAELTEKGREEMRNLIEDN 240 (467) T ss_pred eeEeccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHHHHhh Confidence 6789999999999999999999999999999999999999999999999999999999986 578999999999999987 Q ss_pred hcc----------ccccCCcceecCCCceeeecc-------c-chhHHHHHHH-HHHHHHHHHHhCCCHHHhcc------ Q lcl|NC_019422. 228 YLQ----------IDSEAGGAAATDSKYDAEQVK-------A-ESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS------ 282 (384) Q Consensus 228 ~~~----------~~~~~~~~~v~~~g~~~~~l~-------~-~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~------ 282 (384) +.+ +..+++++.+++.|.++++++ . ++.|++|.+. ++.+++||++|||||.+||. T Consensus 241 ~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~~ 320 (467) T protein:vir:31 241 NEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLEFRGRNEHDILKVHDVPPVIAGVVESGAF 320 (467) T ss_pred hcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhhHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCc Confidence 764 234667888888776655543 2 5678999877 56889999999999999972 Q ss_pred -ccHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHh Q lcl|NC_019422. 283 -KYSEDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIM 360 (384) Q Consensus 283 -~~~e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~l 360 (384) ++.+++..+|++.||.|+++.|+++||++|++......+.+++|+++.+++.|.++++++++ ++.+|++|+||+|+++ T Consensus 321 ~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~~~~~~~~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~ 400 (467) T protein:vir:31 321 STDAEEQRKEFAEETIQPKQHDFGELLYELVHKQGLDAPDWTIEFELAKPDTKLQDVEIASQRVQAMQGLLTVNELRDEF 400 (467) T ss_pred ccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhccCCceEEEecchhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh Confidence 45688999999999999999999999999999887778889999999999999999999876 7999999999999999 Q ss_pred CCCCCCCCCee-------eecCceeecCCCC Q lcl|NC_019422. 361 NLSPIENGDKP-------VRRLDTAVVEGGE 384 (384) Q Consensus 361 G~~p~~~gd~~-------~~~~n~~~~~~ge 384 (384) |+||+++++.+ .++++..|.++.+ T Consensus 401 Gl~pi~d~~~~~~~~~~~~~~~~~~~~~~~~ 431 (467) T protein:vir:31 401 GFEPFPEEHVYGGETLVAEVTGGSGPGGGIG 431 (467) T ss_pred CCCCCCcccccCCcccccccccccCCCCccc Confidence 99999643321 0112222222211 No 86 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=100.00 E-value=6.7e-70 Score=399.83 Aligned_cols=366 Identities=12% Similarity=0.049 Sum_probs=283.5 Q ss_pred chhhhcccCCCcch--------hHHHhhccccCcce-------echhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCc Q lcl|NC_019422. 3 IFKSKKKNKEAPGK--------VMMELISDSGNGFY-------SWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETE 67 (384) Q Consensus 3 ~f~~~~~~~~~~~~--------~~~~~~~~~~~~~~-------~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 67 (384) +|+...+=++...+ ....+.......++ .....+..+++|++||+.||++||++||++++... T Consensus 1 ~~~~~~~i~s~~~~~~i~~~~~~s~~~~~~~~~~~~~pp~~~~~la~l~~~n~~v~scI~~ia~~IA~l~~~~~~~~~-- 78 (542) T protein:vir:41 1 MFNYHLSIRSLEKYKAIKREEVESQALGETRFEEYVEPKVNPLVLLSLLQVNPYHASACSIKANDIIRTGYILEGDDE-- 78 (542) T ss_pred CccccccccccccchhhhhccccccccccccCCccccCCCCHHHHHHHHhhcHHHHHHHHHHHHHHhhCceeeecccc-- Confidence 66632111111000 00000001111111 11134557899999999999999999999853321 Q ss_pred ceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEE---- Q lcl|NC_019422. 68 FKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFL---- 143 (384) Q Consensus 68 ~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~---- 143 (384) -.++...||++||+++||+.++.+++++||+|++++|+..|++.+|+||+|.+|++..+.++....+. T Consensus 79 --------~~l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~~~~L~~l~~~~v~v~~d~~~~~~~~~~~~~ 150 (542) T protein:vir:41 79 --------GVVDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGDPIRFEYIPSHTIRVHKDGSRYRQTWDGVNI 150 (542) T ss_pred --------hhhhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCcceEEEEcCCeeEeeecCCcc Confidence 12234459999999999999999999999999999999999999999999999999887665332111 Q ss_pred -------Ec-------CceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC Q lcl|NC_019422. 144 -------LR-------NGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK 209 (384) Q Consensus 144 -------~~-------~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~ 209 (384) .. +.....++++||||+|.+++.++++|+||+.++..++....++.+++.++|+||++|+++|+++ T Consensus 151 ~~~~~y~~~~~~~~~~g~~~~~~~~~eIiHir~~~~~~~~~Glspi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~ 230 (542) T protein:vir:41 151 THFKDYRYEGEINPETGEDQDSVGANELVFIHIPSPVCSYYGVPRYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVITVT 230 (542) T ss_pred eeEEeecccccccccccccccccCcccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeC Confidence 10 1123457889999999999899999999999999999999999999999999999999999986 Q ss_pred C----------CCChHHHHHHHHHHHHHhccccccCCcceec------CCCceeeecccchhHHHHHHH-HHHHHHHHHH Q lcl|NC_019422. 210 T----------ALRPDDIKKEVKSFEKNYLQIDSEAGGAAAT------DSKYDAEQVKAESYVPNAAQM-DKAIQRLYSF 272 (384) Q Consensus 210 ~----------~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~------~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~ 272 (384) + .+++++.+++++.|++.+.|..+++|+++|+ ++|++|++++.++.+++|.+. ++++++||++ T Consensus 231 ~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL~~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~~Ia~a 310 (542) T protein:vir:41 231 GEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVFSIPGGDTVKVTFTPLNTSQKELSFREYAAEKKYDIAAA 310 (542) T ss_pred CccccccccccccCHHHHHHHHHHHHHHHhhhhcccCceeEeeccCCcccceeEEEcCCChhHHHHHHHHHHHHHHHHHH Confidence 4 4577899999999999999887888999988 479999999999999999876 5789999999 Q ss_pred hCCCHHHhcc--------ccHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH Q lcl|NC_019422. 273 FNTNEKIIQS--------KYSEDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ 344 (384) Q Consensus 273 fgvp~~~l~~--------~~~e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~ 344 (384) |||||.+||. ++.|++..+|+++||.|+++.|+++||++|++++++ +.+++|+.+.+++.|.++++ .. T Consensus 311 fgVPp~~lG~~~~~t~n~sn~Eq~~~~f~~~tL~P~~~~ie~~ln~~L~~~~~~--~~~~~f~~~~ll~~d~~~~~--~~ 386 (542) T protein:vir:41 311 HMIDPYRLGIADTGPLGGNFAEVTRRTYYESVVRPQQNIISSILTDFFQVKFNP--KTRFKFNDETLLESDSVRNC--AL 386 (542) T ss_pred hCCCHHHhCcCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCC--ceEEEecchhhcchHHHHHH--HH Confidence 9999999962 466899999999999999999999999999888764 46889999999887755443 34 Q ss_pred HHhCCCCCHHHHHHHh-CCCCCCCCCeeeecCceee--cCCCC Q lcl|NC_019422. 345 MVDRGSLTPNEWRKIM-NLSPIENGDKPVRRLDTAV--VEGGE 384 (384) Q Consensus 345 ~~~~g~~t~NE~R~~l-G~~p~~~gd~~~~~~n~~~--~~~ge 384 (384) ++++|++|+||+|+.| |++| ++|.++.|.|... +.+++ T Consensus 387 ~v~~GilT~NE~Re~L~g~~p--gdd~~l~p~~~~~~~~~~~~ 427 (542) T protein:vir:41 387 LVQSGVLTPAEARERLFGLDG--GPDIFMVPSKGAAKSVKRQE 427 (542) T ss_pred HHhCCCCCHHHHHHhhCCCCC--CCccccccccccccccccCC Confidence 7899999999999854 5543 2355566776532 33333 No 87 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=100.00 E-value=9.1e-64 Score=366.20 Aligned_cols=374 Identities=13% Similarity=0.187 Sum_probs=265.6 Q ss_pred Ccchhhhcc----------cCCCcchhHH-----HhhccccCc--c-------eechhhhhhcHHHHHHHHHHHHhhccC Q lcl|NC_019422. 1 MNIFKSKKK----------NKEAPGKVMM-----ELISDSGNG--F-------YSWHGNLYKSDIVRSIIRPKAKAVGKM 56 (384) Q Consensus 1 M~~f~~~~~----------~~~~~~~~~~-----~~~~~~~~~--~-------~~~~~~~~~~~~v~~~i~~ia~~ia~~ 56 (384) |+|=..-.. .+..+....+ ......+++ + ....+.+..+|+|++||++||++||++ T Consensus 34 ~~~~~~p~~~~~~~~~~~~~~~d~~~~~~~r~g~~~~~~~~g~~~~~epp~d~~~l~~l~~~np~V~~aI~iia~~ia~l 113 (648) T protein:vir:79 34 MQLGEAPGAMPKGGGGGGSAKRDPKMSLVKRIGLAIMDGGGGGRDFEEPEFDFNEITSAYNTEGYVRQAVDKYIEMMFKA 113 (648) T ss_pred cccCCCccccCCCCcccccccccchhHHHHHhHHHHHhhcCCccccccCCcCHHHHHHHHhcChHHHHHHHHHHHHHhhC Confidence 433221111 1111111111 111111111 1 111244557999999999999999999 Q ss_pred ceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCc---------------eeeE Q lcl|NC_019422. 57 TAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNM---------------PTQI 121 (384) Q Consensus 57 ~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~---------------~~~l 121 (384) +|.+...++. .........++.+||++||+.+||+.++.+++++||+|++++|+..|. +.++ T Consensus 114 ~~~i~~~~~~---~~~~~~~~~ll~rPn~~~t~~~f~~~l~~~lll~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l 190 (648) T protein:vir:79 114 DWDFVSKNPN---AVEYIRMRFTLMAEATQIPTNQLFIEIAEDLVKYCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGY 190 (648) T ss_pred cceEEecCCc---cchhhHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCCCccchhhhhhhhccccceeee Confidence 9998654432 223334456678999999999999999999999999999999998873 4689 Q ss_pred EEEcCceEEEEEcCCCEEEE--EEEc-CceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_019422. 122 YPLNALNVEAIYENEVLFLK--FLLR-NGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKN 198 (384) Q Consensus 122 ~~l~~~~v~~~~~~~~~~~~--~~~~-~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n 198 (384) ||++|.+|++..+..+.... |... ++..+.++++||||++++.+.++++|+||+.+++.+|....++.++..++|+| T Consensus 191 ~pl~p~~v~v~~d~~g~~~~Y~y~~~g~~~~~~~~~~dIIHik~~~~~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~N 270 (648) T protein:vir:79 191 FPLNLASMKVKRDKFGMIKGWQQEQEGQDKPQKFKPEDIVHIYYKREKGRAFGTPWLLPALDDIRALRQVEENVLRLVYR 270 (648) T ss_pred EeecCceeEEEEcCCCceeeeEEEecCCceeEEecCccEEEEccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 99999999999988877644 3333 34567889999999999989999999999999999999999999999999999 Q ss_pred cCCcceEEeeCC-CCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecc----cchhHHHHHHH-HHHHHHHHHH Q lcl|NC_019422. 199 SNTIKWLLKFKT-ALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVK----AESYVPNAAQM-DKAIQRLYSF 272 (384) Q Consensus 199 g~~p~~il~~~~-~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~----~~~~~~~~~~~-~~~~~~I~~~ 272 (384) |++|+++++++. ....++.++.++.|.+.+.+. .+.+++.++..+. .++.++|+.+. ++++++||++ T Consensus 271 Ga~P~gil~~~~~~~~~e~~k~~~e~~~~~~~~~-------~i~gg~v~~~~~~i~~~~s~~dlqfle~rk~~~~eIa~a 343 (648) T protein:vir:79 271 NLHPLWHVKVGLEQEGFGAEEGEVDLVRGEVENM-------DVEGGMVTTERVNISSIASNQIIDAKEYLKHFEQRAFTV 343 (648) T ss_pred cCCccEEEEeCCCccchHHHHHHHHHHHHhcccc-------cccccccccceeeccccCCHHHHHHHHHHHHHHHHHHHH Confidence 999999999863 344566677777777766542 2223333333332 25678898876 5788999999 Q ss_pred hCCCHHHhccc-----cHHHHHHHHHHHHHHHHHHHHHHHHhhcc----cCcccc----cCcceEEeechhhhccCHHHH Q lcl|NC_019422. 273 FNTNEKIIQSK-----YSEDEWNAYYESEIEPVGLQLSNQYTEKL----FTRKAR----SFGNEIVFEASNLQYASMSTK 339 (384) Q Consensus 273 fgvp~~~l~~~-----~~e~~~~~~~~~~i~P~~~~i~~~l~~~l----~~~~~~----~~~~~i~fd~~~~~~~d~~~~ 339 (384) |||||.+||.. ++.++...++..++.|++..++..++.++ +.+... ....+++|+++++++.|.+++ T Consensus 344 FgVPP~lLG~~~~ss~stae~~~~~~~~~i~~l~~~i~~~le~~~~~~ll~e~~l~~~l~~d~~ieF~~~~Llr~D~~~~ 423 (648) T protein:vir:79 344 LGVSELMMGRGGTASRSTGDNLSSDFKDRIKALQKVMATFINEFMVKEILMEGGFDPVLNPDDKVEFRFNEIDMDSKIKL 423 (648) T ss_pred hCCCHhHcccCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccceEEEeecccchhhHHHH Confidence 99999999732 22334445667788887766665554433 222111 123468999999999999999 Q ss_pred HHHH-HHHhCCCCCHHHHHHHhCCCCCCCCC-eeeecCceeecCCCC Q lcl|NC_019422. 340 LNLV-QMVDRGSLTPNEWRKIMNLSPIENGD-KPVRRLDTAVVEGGE 384 (384) Q Consensus 340 ~~~~-~~~~~g~~t~NE~R~~lG~~p~~~gd-~~~~~~n~~~~~~ge 384 (384) ++.+ +++.+||||+||+|+++|+||+|+|+ ..++..+..+..... T Consensus 424 a~~~~~l~~~GilT~NEaR~~lGlpPi~~g~~~~~l~~~~~~~~~~~ 470 (648) T protein:vir:79 424 ENQAVFLYEHNAISEDEMRELIGRDPVDDGEGRAKMHLQMVTIAQAT 470 (648) T ss_pred HHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCccccccccccchhcc Confidence 8876 47899999999999999999998764 445555544432111 No 88 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=100.00 E-value=4.5e-63 Score=362.41 Aligned_cols=380 Identities=11% Similarity=0.050 Sum_probs=285.7 Q ss_pred Ccchhhh-----------------cccCCCcchhHHHhhcccc---Cccee--chhhhhhcHHHHHHHHHHHHhhccCce Q lcl|NC_019422. 1 MNIFKSK-----------------KKNKEAPGKVMMELISDSG---NGFYS--WHGNLYKSDIVRSIIRPKAKAVGKMTA 58 (384) Q Consensus 1 M~~f~~~-----------------~~~~~~~~~~~~~~~~~~~---~~~~~--~~~~~~~~~~v~~~i~~ia~~ia~~~~ 58 (384) |.==.++ .+...+.+.....+.++.+ +.+.. -..-+..++++++||+.+|+.||+++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~L~~~~e~~~~~~~~i~~~~~~iag~g~ 80 (651) T protein:vir:99 1 MTDTTGETQETKVHVEGLGGEADLAKSPNSTQIPDHRIQSHNVGVNPPYNPDRLAAFLELNETLATGIRKKSRYEVGFGF 80 (651) T ss_pred CCCccceeeeeEEEeecccccccccccccccccchhhhcccCCCCCCCCCHHHHHHHHhcChHHHHHHHHHhhhhhccCc Confidence 3211100 0001111111111111110 00000 011223489999999999999999999 Q ss_pred EEEEecC-Cc---ceec----------cchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEE Q lcl|NC_019422. 59 KHIRSNE-TE---FKTN----------PEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPL 124 (384) Q Consensus 59 ~~~~~~~-~~---~~~~----------~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l 124 (384) .+....+ ++ .... .++.+..+...+|+.+|+.++++.++.|++.+|++|+.++++..|+++.++.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiIrn~~g~pv~L~~l 160 (651) T protein:vir:99 81 DLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEMLTDIEGRPVGLAYV 160 (651) T ss_pred eeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhhhcCccchhhhhhc Confidence 9854221 11 1111 12223334455789999999999999999999999999999999999999999 Q ss_pred cCceEEEEEcCCC----------------------------------EEE------------------------------ Q lcl|NC_019422. 125 NALNVEAIYENEV----------------------------------LFL------------------------------ 140 (384) Q Consensus 125 ~~~~v~~~~~~~~----------------------------------~~~------------------------------ 140 (384) ++..+++..+... +.+ T Consensus 161 p~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~~~~~~d~~~ 240 (651) T protein:vir:99 161 PARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVIDESGDEPTIRYREDEES 240 (651) T ss_pred ChhheeeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccceeeeeccCCcceeEEeccCcce Confidence 9987765332110 000 Q ss_pred ------------EEE-EcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEe Q lcl|NC_019422. 141 ------------KFL-LRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLK 207 (384) Q Consensus 141 ------------~~~-~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~ 207 (384) .|. ...+....++++||||+|.+++.++++|+||+..+..++....++++++.++|+||++|+++|+ T Consensus 241 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~eViHir~~~~~~g~~G~spl~~a~~~i~~a~~a~~~~~~~f~NG~~p~gil~ 320 (651) T protein:vir:99 241 EREPIFVDRETGDVTTGDANGLENRPANELIFIPNPSILEDDYGVPDWVSAIRTISADEAAKDYNRDFFDNDTIPRMVIK 320 (651) T ss_pred eeeeecccceeeeEEEcCCCceeEecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEE Confidence 011 1112344578899999999988999999999999999999999999999999999999999999 Q ss_pred eCC-CCChHHHHHHHHHHHHHhccccccCCcceecCC-----------Cceeeecccch-hHHHHHHH-HHHHHHHHHHh Q lcl|NC_019422. 208 FKT-ALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDS-----------KYDAEQVKAES-YVPNAAQM-DKAIQRLYSFF 273 (384) Q Consensus 208 ~~~-~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~-----------g~~~~~l~~~~-~~~~~~~~-~~~~~~I~~~f 273 (384) +++ .+++++.+++++.|++.+. +++++++++. |++|++++.++ .|++|.+. ++++++||++| T Consensus 321 ~~~~~ls~e~~~~lr~~~~~~~~----nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~qfle~r~~~~~eIa~af 396 (651) T protein:vir:99 321 VTGGELSEESKRDLRQMLNGLRE----ESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMDFRQFREKNEHEIAKVL 396 (651) T ss_pred ecCCCCCHHHHHHHHHHHHHHhc----cCCceEEeecccccccccccCCceEEEcCcCchhhHHHHHHHHHHHHHHHHHh Confidence 975 5899999999999987553 4677877754 99999999876 58999877 56899999999 Q ss_pred CCCHHHhc------cccHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcc--eEEeechhhhccCHHHHHHHHH- Q lcl|NC_019422. 274 NTNEKIIQ------SKYSEDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGN--EIVFEASNLQYASMSTKLNLVQ- 344 (384) Q Consensus 274 gvp~~~l~------~~~~e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~--~i~fd~~~~~~~d~~~~~~~~~- 344 (384) ||||.+|| +++.|++...|+++||.|++..|+++||++|+++.+...++ +++|+.+.+++.|.+++++.++ T Consensus 397 gVPp~~lG~~~~~~~sn~E~~~~~f~~~tL~P~~~~ie~eln~kLl~~~e~~~~~~i~~ef~~~~llr~D~~~~~e~~~~ 476 (651) T protein:vir:99 397 EVPPVKIGVTDSANRSNSDQQDKDFALEVIQPEQHTFAEWLYQIIHQQALGVTDWTIEYELRGADQPKQEAQLAEQRVRA 476 (651) T ss_pred CCCHHHhccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCceEEEEeccchhhhccHHHHHHHHHH Confidence 99999996 45678999999999999999999999999999987766554 5567778899999999999876 Q ss_pred HHhCCCCCHHHHHHHhCCCCCC--CCCeeeecCceee----cCCCC Q lcl|NC_019422. 345 MVDRGSLTPNEWRKIMNLSPIE--NGDKPVRRLDTAV----VEGGE 384 (384) Q Consensus 345 ~~~~g~~t~NE~R~~lG~~p~~--~gd~~~~~~n~~~----~~~ge 384 (384) ++++|++|+||+|+++|+||++ +||.++.+.++.. ..+|| T Consensus 477 ~i~~G~~T~NE~R~~lglppi~~~~gd~~l~~~~~~~~g~~~~gge 522 (651) T protein:vir:99 477 MRLAGVGLVDEAREELGLDPLGEPYGEMTLSEFEAEVAGDVAGGGE 522 (651) T ss_pred HHhCCCcCHHHHHHHhCCCCCCCccccccccccccccccccccCCC Confidence 7999999999999999999995 4899998877653 34554 No 89 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=100.00 E-value=1.2e-59 Score=343.65 Aligned_cols=269 Identities=17% Similarity=0.252 Sum_probs=243.4 Q ss_pred hccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEE Q lcl|NC_019422. 53 VGKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAI 132 (384) Q Consensus 53 ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~ 132 (384) ||++||+++++++ ...++++++|+.+||++||+.+||+.++.+++++||||++++++..|++.+|||++|++|++. T Consensus 1 ia~l~~~~~~~~~----~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~~~~l~~l~~~~v~v~ 76 (278) T protein:vir:78 1 MASLPLKMYEDYK----VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEML 76 (278) T ss_pred CccceeEEEecCc----ccccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCcEEEEEEECCceeEEE Confidence 9999999998664 346788999999999999999999999999999999999999999999999999999999999 Q ss_pred EcCCCEE--EEEEEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCC Q lcl|NC_019422. 133 YENEVLF--LKFLLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKT 210 (384) Q Consensus 133 ~~~~~~~--~~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~ 210 (384) .+.++.. |.+...+|..+.++++||||++++++.++++|+||+.++..++....++.+++...+.+ .|+++++.++ T Consensus 77 ~~~~~~~~~y~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~~--~~~~i~~~~~ 154 (278) T protein:vir:78 77 IENQSRELYYSIHAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQK--PDSFMLKYGS 154 (278) T ss_pred EcCCCceEEEEEEcCCceEEEEccccEEEECCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHhcC--CCcEEEEeCC Confidence 8877643 45556678889999999999999999999999999999999999999998887655544 5789999999 Q ss_pred CCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhcc------c Q lcl|NC_019422. 211 ALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQS------K 283 (384) Q Consensus 211 ~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~------~ 283 (384) .+++|+.++++++|++.+. ++++++++++|++|++++.++.++++.+. ++++++||++|||||.+||. + T Consensus 155 ~l~~e~~~~~~~~~~~~~~----~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~s 230 (278) T protein:vir:78 155 NVGKEKRQQVLEDFKQYYE----ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFA 230 (278) T ss_pred CCCHHHHHHHHHHHHHHhc----cCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcc Confidence 9999999999999987663 46789999999999999999999999876 57889999999999999973 4 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhh Q lcl|NC_019422. 284 YSEDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNL 331 (384) Q Consensus 284 ~~e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~ 331 (384) +.+++..+|++.||+|+++.|+++||++|+++.++..+.+++||++.| T Consensus 231 n~~~~~~~~~~~~l~P~~~~i~~~ln~~L~~~~e~~~g~~~~f~~~~l 278 (278) T protein:vir:78 231 KNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKIGILNLTLNLI 278 (278) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhcCCceEEEecccC Confidence 567889999999999999999999999999999999999999999999 No 90 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=100.00 E-value=1.2e-52 Score=305.29 Aligned_cols=321 Identities=10% Similarity=0.065 Sum_probs=241.4 Q ss_pred hhhhcccCCCcc-----h---------------hHHHh--hccc---cCc-----c--eechhhhhhcHHHHHHHHHHHH Q lcl|NC_019422. 4 FKSKKKNKEAPG-----K---------------VMMEL--ISDS---GNG-----F--YSWHGNLYKSDIVRSIIRPKAK 51 (384) Q Consensus 4 f~~~~~~~~~~~-----~---------------~~~~~--~~~~---~~~-----~--~~~~~~~~~~~~v~~~i~~ia~ 51 (384) +.++++...... . ..... +++. ... + ....+.+++.|+-+.|+..+.+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~fg~p~~~~~~~~~~~~~~~~~~~~~~~~pi~~~~la~~~~ 80 (368) T protein:vir:79 1 MSRNKTRRAARAASAHVRTANTDAPTEHHTDRAAQAEVFSFGDPVEVLDRRELLDYVECMRMGQWYEPPMPWDGLARSFR 80 (368) T ss_pred CCccccccchhccCcccccccccCcchhhccccCceEEEEcCCceeecchhhHHHHHHHHhccchhccCcCHHHHHHHHh Confidence 443332211000 0 00000 0110 000 0 0111234555555566544443 Q ss_pred hhccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEE Q lcl|NC_019422. 52 AVGKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEA 131 (384) Q Consensus 52 ~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~ 131 (384) .-+ ..+..+...+....++.+||+.||+.+|++ ++.+++++||+|++++++..|++++|+|+++..|++ T Consensus 81 ~~~----------~h~~~~~~~~n~l~l~~~Pn~~~t~~~f~~-l~~d~ll~Gnay~~~~r~~~G~~~~L~~l~~~~v~~ 149 (368) T protein:vir:79 81 AAA----------HHSSAVYVKRNILVSTFIPHPLLSRATFER-LVLDWQVFGNAYLERRENVLGGTIRLDTPLAKYVRR 149 (368) T ss_pred hcc----------ccchhhhhhcchhhhhcCCCcCCCHHHHHH-HHHHHhhcCCeEEEEEEcCCCCEEEEEEeCccccee Confidence 332 111122222334467779999999999976 678999999999999999999999999999999987 Q ss_pred EEcCCCEEEEEEEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-C Q lcl|NC_019422. 132 IYENEVLFLKFLLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-T 210 (384) Q Consensus 132 ~~~~~~~~~~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~ 210 (384) ..+.+ .+++...+|+.+.++++||||+|.+++.++++|+||+.+++.++....++..+..++|+||++|+++|.++ . T Consensus 150 ~~~~~--~~~~~~~~~~~~~~~~~dIihir~~~~~~~~yGlsp~~~a~~si~l~~aa~~~~~~~~~NGa~~~gil~~~~~ 227 (368) T protein:vir:79 150 GLDLN--TYFFVQNWQQPYTFAAGSVFHLQEPDINQEVYGLPEYLSALNATWLNESATLFRRRYYKNGSHAGFILYMTDA 227 (368) T ss_pred eccCC--EEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC Confidence 66544 34556677888999999999999999999999999999999999999999999999999999999999886 5 Q ss_pred CCChHHHHHHHHHHHHHhccccccCCcceec-----CCCceeeecccchhHHHHHHHH-HHHHHHHHHhCCCHHHhc--- Q lcl|NC_019422. 211 ALRPDDIKKEVKSFEKNYLQIDSEAGGAAAT-----DSKYDAEQVKAESYVPNAAQMD-KAIQRLYSFFNTNEKIIQ--- 281 (384) Q Consensus 211 ~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~-----~~g~~~~~l~~~~~~~~~~~~~-~~~~~I~~~fgvp~~~l~--- 281 (384) .+++++.++++++|++ +.| .+|.++++++ ++|++|++++.++.+++|.+.+ +++++||++|||||.+|| T Consensus 228 ~l~~e~~~~lk~~~~~-~~G-~~N~g~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~ 305 (368) T protein:vir:79 228 AQKQEDVDTLREAMKS-AKG-PGNFRNLFMYAPNGKKDGIQLLPVSEVAAKDEFWNIKNVTRDDQLAAHRVPPQLMGIIP 305 (368) T ss_pred CCCHHHHHHHHHHHHH-hcC-CcccCceeEecCCCCccceeEEEcCCCHHHHHHHHHHHHhHHHHHHHhCCCHHHccccC Confidence 7999999999999987 444 4678888888 6889999999999999998875 678899999999999996 Q ss_pred -----cccHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHHHHhCC Q lcl|NC_019422. 282 -----SKYSEDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQMVDRG 349 (384) Q Consensus 282 -----~~~~e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g 349 (384) +++.|++...|++++|.|+++.|+ ++|.+|.. ..++|+...+++.|.+.++.... .+. T Consensus 306 ~~t~~~sn~e~~~~~f~~~~l~Pl~~~ie-~ln~~l~~-------e~~rF~~~~l~~~D~~a~a~~~~--rsa 368 (368) T protein:vir:79 306 NNTGGFGDVEKAAMVFARNEVKPLQDRLL-AINDWIGD-------EVVRFAPYALGGHDQPAAAPGGQ--RSA 368 (368) T ss_pred CCCCccccHHHHHHHHHHHHHHHHHHHHH-HHHhccCc-------ceeeechhHhhcccccccCCccc--ccC Confidence 246789999999999999999998 68877633 25789999999999887775321 111 No 91 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=100.00 E-value=2.4e-51 Score=298.11 Aligned_cols=316 Identities=11% Similarity=0.073 Sum_probs=235.8 Q ss_pred CcchhhhcccCC--CcchhHHHhhccccC-----c--------ceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_019422. 1 MNIFKSKKKNKE--APGKVMMELISDSGN-----G--------FYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNE 65 (384) Q Consensus 1 M~~f~~~~~~~~--~~~~~~~~~~~~~~~-----~--------~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 65 (384) |.=..+...... +........++...+ + .....+.+++-|.-+..+..+.+..+ .. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~~~~~---~h------ 71 (346) T protein:vir:10 1 MKKQLRKNLTQNDRLQPQAQTEIFSFGDPIPVLDRADILNYLECSAMYEKWYNPPMSFDGLAKSLRSST---HH------ 71 (346) T ss_pred CCcccCCCCCcccccccccCeEEEecCCcceecCchhHHHHHHHhhcCCceEecCCCHHHHHHHHHhhh---hc------ Confidence 332221110000 000000001110000 0 00111223333333333322222222 11 Q ss_pred CcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEc Q lcl|NC_019422. 66 TEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLR 145 (384) Q Consensus 66 ~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~ 145 (384) ...-....+.+..++.+||++||+.+|++ ++.+++++||||++++|+..|++++|+|++|..|++..+.++..+.+... T Consensus 72 ~~~i~~k~n~l~~l~~~Pn~~~t~~~f~~-~~~d~ll~Gnay~~i~r~~~G~~~~L~pl~~~~v~~~~~~~~~~~~~~~~ 150 (346) T protein:vir:10 72 ESAIITKANILLSTCEVDSRYLSRRDLSS-FVKDYLVFGNAYFEVVRNRLGQVQRIESPLAKYVRKGLEAGQFYYVPQRF 150 (346) T ss_pred chhhhhhhhhHHHHHhCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCcEEEEEEecCCceEEEEcCCeEEEEEEcc Confidence 00011122344556778999999999987 56889999999999999999999999999999999999888888877778 Q ss_pred CceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHHHHHHHHH Q lcl|NC_019422. 146 NGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDIKKEVKSF 224 (384) Q Consensus 146 ~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~~~~~ 224 (384) +|+.+.++++||||+|.+.+.++++|+||+..+..++....+++.+..++|+||++|+++|.++ ..+++++.++++++| T Consensus 151 ~g~~~~~~~~dIih~r~~~~~~~~~G~~~~~~a~~si~l~~~a~~~~~~~~~NG~~~~~il~~~d~~l~~e~~~~i~~~~ 230 (346) T protein:vir:10 151 DHQEHEFAKGSIYHLLEPDINQDIYGLPQYLSALQSAWLNESATLFRRKYFLNGAHAGFVFYMSDASQKQEDVENIRQQL 230 (346) T ss_pred CCeEEEEecccEEEecCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHHHHHHHHHHH Confidence 8999999999999999998889999999999999999999999999999999999999999885 578999999999999 Q ss_pred HHHhccccccCCcceecC-----CCceeeecccchhHHHHHHHH-HHHHHHHHHhCCCHHHhc--------cccHHHHHH Q lcl|NC_019422. 225 EKNYLQIDSEAGGAAATD-----SKYDAEQVKAESYVPNAAQMD-KAIQRLYSFFNTNEKIIQ--------SKYSEDEWN 290 (384) Q Consensus 225 ~~~~~~~~~~~~~~~v~~-----~g~~~~~l~~~~~~~~~~~~~-~~~~~I~~~fgvp~~~l~--------~~~~e~~~~ 290 (384) ++.+ | .+|.++++++. .|+++++++.++.+++|.+.+ +++++||++|||||.+|| +++.|++.. T Consensus 231 ~~~~-g-~~n~~~~~vl~~~~~~~gi~~~pis~~~~d~qf~e~k~~~~~~I~~af~VPp~llG~~~~~~~~~s~~e~~~~ 308 (346) T protein:vir:10 231 KQSK-G-VGNFKNLFVHAPNGKKDGIQIIPIADVSAKDEFFNIKNVSRDDVLAAHRVPPQLMGIIPNNTGGFGNVADAAE 308 (346) T ss_pred HHhc-C-ccccCceeEecCCCCccceeEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHH Confidence 8865 4 36778888874 478899999999999998875 678899999999999996 246688999 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCH Q lcl|NC_019422. 291 AYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASM 336 (384) Q Consensus 291 ~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~ 336 (384) .|++++|.|+++.|++ +|.+|..+ .++|+..++++.|. T Consensus 309 ~f~~~~l~P~~~~iee-~n~~L~~e-------~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 309 VFFITEIEPLQERLKE-FNQWLGQE-------VIKFKPSKLLQRTQ 346 (346) T ss_pred HHHHHHHHHHHHHHHH-HHhhcccc-------eeeechhhhcccCC Confidence 9999999999999985 77676432 58999999998877 No 92 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=100.00 E-value=7.7e-51 Score=295.31 Aligned_cols=301 Identities=9% Similarity=0.088 Sum_probs=227.8 Q ss_pred Cc--chhhhcccCCCc-ch-----------------------------hHHHhhccccCc-cee----ch---hhhhhcH Q lcl|NC_019422. 1 MN--IFKSKKKNKEAP-GK-----------------------------VMMELISDSGNG-FYS----WH---GNLYKSD 40 (384) Q Consensus 1 M~--~f~~~~~~~~~~-~~-----------------------------~~~~~~~~~~~~-~~~----~~---~~~~~~~ 40 (384) -| -+++++...... .. ....+..-+.++ ++. .. +.+..++ T Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~fg~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~La~~~~~~~ 100 (376) T protein:vir:10 21 HGVLRMSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRAST 100 (376) T ss_pred cccccchhccCCCcccchhhhhHhhhccCcceeEEEEcCCceeccCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhH Confidence 01 112222111110 00 001111111111 000 00 1111233 Q ss_pred HHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceee Q lcl|NC_019422. 41 IVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQ 120 (384) Q Consensus 41 ~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~ 120 (384) .+.+||...++.+++ .-+|||.||..+|++. +.+++++||+|++++++..|++++ T Consensus 101 ~h~s~l~~k~n~l~~------------------------~~~Pnp~lT~~~f~~~-v~d~ll~Gnay~~~~rn~~G~~~~ 155 (376) T protein:vir:10 101 HHSSALFFKANVLAS------------------------TFRPHRWLSRHAFERW-ALDFLTFGNGYLERRRNMVGGTLR 155 (376) T ss_pred HhhhhHHHHhHHHHh------------------------ccCCCCCCCHHHHHHH-HHHHHhcCCeEEEEEECCCCCEEE Confidence 333333333333221 2369999999999855 578999999999999999999999 Q ss_pred EEEEcCceEEEEEcCCCEEEEEEEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccC Q lcl|NC_019422. 121 IYPLNALNVEAIYENEVLFLKFLLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSN 200 (384) Q Consensus 121 l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~ 200 (384) |+|++|..|++..+.++ +.+...+++.+.++++||||++.+++.++++|+|++.+++.++....++..++.++|+||+ T Consensus 156 L~pl~~~~vr~~~d~~~--~~~~~~~~~~~~~~~~eViHir~~~~~~~~yGls~~~~a~~si~l~~aa~~f~~~~f~NGa 233 (376) T protein:vir:10 156 LEPALAKYVRRKADFNG--FVYVNGWQERHEFEPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGS 233 (376) T ss_pred EEEeCCcceEEEeeCCe--EEEEEcCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 99999999998887664 3445567788899999999999999999999999999999999999999999999999999 Q ss_pred CcceEEeeC-CCCChHHHHHHHHHHHHHhccccccCCcceec-----CCCceeeecccchhHHHHHHHH-HHHHHHHHHh Q lcl|NC_019422. 201 TIKWLLKFK-TALRPDDIKKEVKSFEKNYLQIDSEAGGAAAT-----DSKYDAEQVKAESYVPNAAQMD-KAIQRLYSFF 273 (384) Q Consensus 201 ~p~~il~~~-~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~-----~~g~~~~~l~~~~~~~~~~~~~-~~~~~I~~~f 273 (384) +|++||.++ ..+++|+.++++++|++ ..| .+|.++++++ ++|++|++++.++.+++|.+.+ +++++||++| T Consensus 234 ~pggIl~~~d~~l~~e~~~~lr~~~~~-~~G-~~N~~~~~vl~~~g~~~Gi~~~pls~~~~d~qf~e~k~~~~~eIa~af 311 (376) T protein:vir:10 234 HAGFILYMTDAAQKQDDVDNMRDALKN-AKG-PGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAH 311 (376) T ss_pred CCceEEEecCCCCCHHHHHHHHHHHHH-hcC-ccccCceeEecCCCCccceEEEEccCCHHHHHHHHHHHHhHHHHHHHh Confidence 999999986 57999999999999987 444 4677888887 5789999999999999999885 6788999999 Q ss_pred CCCHHHhc--------cccHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHH Q lcl|NC_019422. 274 NTNEKIIQ--------SKYSEDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMST 338 (384) Q Consensus 274 gvp~~~l~--------~~~~e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~ 338 (384) ||||.++| +++.|++...|++++|.|+++.|+ ++|.+|..+ .++|+..++++.|.++ T Consensus 312 ~VPp~llGi~~~~t~~~sn~eq~~~~f~~~~L~Pl~~~ie-eln~~L~~~-------~~~F~~~~Llr~d~ka 376 (376) T protein:vir:10 312 RVPPQLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARFA-ELNDWLGEE-------VVRFDDYEIPPAPVAA 376 (376) T ss_pred CCCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHH-HHHhhcccc-------ccccChhHhhcccccC Confidence 99999986 246689999999999999999998 588877432 4799999999999887 No 93 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=100.00 E-value=1.3e-50 Score=294.08 Aligned_cols=298 Identities=9% Similarity=0.075 Sum_probs=226.5 Q ss_pred hhhhcccCCCcchh-H---HH--------hh--ccc---cCc-----c--eechhhhhhcH--------------HHHHH Q lcl|NC_019422. 4 FKSKKKNKEAPGKV-M---ME--------LI--SDS---GNG-----F--YSWHGNLYKSD--------------IVRSI 45 (384) Q Consensus 4 f~~~~~~~~~~~~~-~---~~--------~~--~~~---~~~-----~--~~~~~~~~~~~--------------~v~~~ 45 (384) +++++..+...... . .. .+ ++. .+. + ....+.+++-| .+.+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~h~~~ 80 (351) T protein:vir:79 1 MSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSSA 80 (351) T ss_pred CCCCCCCCCCCCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhHhhhhh Confidence 44333222111000 0 00 00 000 000 0 00111222222 22222 Q ss_pred HHHHHHhhccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEc Q lcl|NC_019422. 46 IRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLN 125 (384) Q Consensus 46 i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~ 125 (384) |...++.++ -.-+|||.||..+|++ ++.+++++||+|++++|+..|++++|+|++ T Consensus 81 l~~k~n~l~------------------------~~~~Pnp~~t~~~f~~-~v~d~ll~Gnay~~~~r~~~G~~~~L~~l~ 135 (351) T protein:vir:79 81 LFFKANVLA------------------------STFRPHRWLSRHAFER-WALDFLTFGNGYLERRRNMVGGTLRLEPAL 135 (351) T ss_pred hhhhhhHHh------------------------hcccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCCEEEEEEeC Confidence 222222211 1236999999999975 668999999999999999999999999999 Q ss_pred CceEEEEEcCCCEEEEEEEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceE Q lcl|NC_019422. 126 ALNVEAIYENEVLFLKFLLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWL 205 (384) Q Consensus 126 ~~~v~~~~~~~~~~~~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~i 205 (384) |.+|++..+.++ |++...+|+.+.++++||||+|.+++.++++|+|++.+++.++....++..+..++|+||++|+++ T Consensus 136 ~~~v~~~~~~~~--~~~~~~~g~~~~~~~~eIihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~i 213 (351) T protein:vir:79 136 AKYVRRKADFSG--FVYVNGWQERHEFEPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFI 213 (351) T ss_pred CcceeeeecCCe--EEEEecCceEEEEcCccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceE Confidence 999998777765 445567788899999999999999999999999999999999999999999999999999999999 Q ss_pred EeeC-CCCChHHHHHHHHHHHHHhccccccCCcceec-----CCCceeeecccchhHHHHHHHH-HHHHHHHHHhCCCHH Q lcl|NC_019422. 206 LKFK-TALRPDDIKKEVKSFEKNYLQIDSEAGGAAAT-----DSKYDAEQVKAESYVPNAAQMD-KAIQRLYSFFNTNEK 278 (384) Q Consensus 206 l~~~-~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~-----~~g~~~~~l~~~~~~~~~~~~~-~~~~~I~~~fgvp~~ 278 (384) |.++ ..+++++.++++++|++ ..| .+|.++++++ ++|+++++++.++.+++|.+.+ +++++||++|||||. T Consensus 214 l~~~~~~ls~e~~~~lk~~~~~-~~G-~~N~~~~~v~~~~g~~~gi~~~pl~~~~~d~ef~e~k~~s~~eI~~a~~VPp~ 291 (351) T protein:vir:79 214 LYMTDAAQKQDDVDNMRDALKN-AKG-PGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQ 291 (351) T ss_pred EEecCCCCCHHHHHHHHHHHHH-hcC-ccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHH Confidence 9886 57999999999999976 444 5577888877 5789999999999999998885 678899999999999 Q ss_pred Hhc--------cccHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHH Q lcl|NC_019422. 279 IIQ--------SKYSEDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMST 338 (384) Q Consensus 279 ~l~--------~~~~e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~ 338 (384) ++| +++.|++...|+++||.|+++.|++ +|.+|.. ..++|+..++++.|.++ T Consensus 292 llGi~~~~t~~~~n~e~~~~~f~~~~l~Pl~~~ie~-ln~~lg~-------~~~~F~~~~llr~d~~a 351 (351) T protein:vir:79 292 LLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARFAE-LNDWLGD-------EVVTFDDYEIPPAPVAA 351 (351) T ss_pred HhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhcCc-------ceeeeChhhhccccccC Confidence 986 2466899999999999999999985 7876632 24799999999998887 No 94 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=100.00 E-value=1.9e-50 Score=293.21 Aligned_cols=305 Identities=10% Similarity=0.004 Sum_probs=223.5 Q ss_pred Cc----------------chhhh-cccCCCcchhHHHhhccccCc---cee----c---hhhhhhcHHHHHHHHHHHHhh Q lcl|NC_019422. 1 MN----------------IFKSK-KKNKEAPGKVMMELISDSGNG---FYS----W---HGNLYKSDIVRSIIRPKAKAV 53 (384) Q Consensus 1 M~----------------~f~~~-~~~~~~~~~~~~~~~~~~~~~---~~~----~---~~~~~~~~~v~~~i~~ia~~i 53 (384) |. .|..- .-..-.....+..+.+-+.++ ++. . .+.+..++.+.+||...++.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~epp~~~~~La~l~~~n~~h~~~i~~k~N~l 80 (348) T protein:vir:26 1 MTEQLIHSHTTDGTESKSVYSFDPNPEPVDTNSWMTRYCELFYNDFDDYWEPPISLKGLAEIANANGYHGSLLKARANYV 80 (348) T ss_pred CCccccchhhccccCCceEEEecCCCeeecCcchHHHHHHHHhcCCCccccCCCCHHHHHHHHhhhhhhhhhHhhhhhHH Confidence 32 11110 000000000111111111111 100 0 001112333333333333332 Q ss_pred ccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEE Q lcl|NC_019422. 54 GKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIY 133 (384) Q Consensus 54 a~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~ 133 (384) ++ .-+||+.||..+|++. +.+++++||||++++|+..|++++|+|+++..|++.. T Consensus 81 ~~------------------------~~~Pn~~~t~~~f~~~-~~d~ll~Gnay~~~~rn~~G~~~~L~~l~~~~v~~~~ 135 (348) T protein:vir:26 81 AG------------------------RFMNGGGLPMYKMNSA-CWDYFGLGMSAFVKIRSYLKNVIALEPLPMVHMRKRK 135 (348) T ss_pred hh------------------------cccCCCCCCHHHHHHH-HHHHHhcCCeEEEEEEcCCCcEEEEEEecCceeEeee Confidence 21 1269999999999775 5799999999999999999999999999999999876 Q ss_pred cCCCEEEEEEEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCC Q lcl|NC_019422. 134 ENEVLFLKFLLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TAL 212 (384) Q Consensus 134 ~~~~~~~~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~ 212 (384) +.+ +++...+|+.+.|+++||||++.+++.++++|+||+.++++++....++..++.++|+||++|++||.++ ..+ T Consensus 136 d~~---~~~~~~~g~~~~f~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~~~~l 212 (348) T protein:vir:26 136 NGD---FVQLLRNNEQKVFKAKDVIFIPQYDPQQQIYGLPDYLGSIQSSLLNRDATLFRRRYYLNGAHMGFIFYATDPNL 212 (348) T ss_pred cCc---EEEEEecCeEEEEcCccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCC Confidence 543 3344557788899999999999999999999999999999999999999999999999999999999875 579 Q ss_pred ChHHHHHHHHHHHHHhccccccCCcceec-----CCCceeeecccchhHHHHHHHH-HHHHHHHHHhCCCHHHhc----- Q lcl|NC_019422. 213 RPDDIKKEVKSFEKNYLQIDSEAGGAAAT-----DSKYDAEQVKAESYVPNAAQMD-KAIQRLYSFFNTNEKIIQ----- 281 (384) Q Consensus 213 ~~e~~~~~~~~~~~~~~~~~~~~~~~~v~-----~~g~~~~~l~~~~~~~~~~~~~-~~~~~I~~~fgvp~~~l~----- 281 (384) ++|+.++++++|++.. | .+|.++++++ ++|+++++++.++.+++|.+.+ +++++||++|||||.++| T Consensus 213 s~e~~~~lk~~~~~~~-G-~~n~~~~~vl~~~g~~~Gi~~~pis~~~~d~qf~e~k~~t~~dIa~af~VPp~llGi~~~~ 290 (348) T protein:vir:26 213 SEADEKALKEKIASSK-G-IGNFRSMFVNIPNGKEKGIQLIPVGDIATKDEFERIKNITAQDIFVGHRFPAGMGGMLPQQ 290 (348) T ss_pred CHHHHHHHHHHHHHhc-C-cccccceeEEcCCCCccceeEEEccCChhHHHHHHHHHhhHHHHHHHhCCCHHHccccCCC Confidence 9999999999998864 4 3567888888 7899999999999999998885 578899999999999986 Q ss_pred ---cccHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhh-hccCHHHH Q lcl|NC_019422. 282 ---SKYSEDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNL-QYASMSTK 339 (384) Q Consensus 282 ---~~~~e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~-~~~d~~~~ 339 (384) +++.|++...|++++|.|++..|+++||++|..+. +.+++||++.. ++.+..+. T Consensus 291 ~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~~----~~~~~fdl~~~~e~~~~~a~ 348 (348) T protein:vir:26 291 GANVPDPLKVSQVYDFYEVIPVCKRFMDAVNNDPEIPD----NLKLKFNLNPGVESANGSAV 348 (348) T ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHHhhhhCCCC----ccEEEEecCcccccchhhcC Confidence 24668999999999999999999999999876433 34677877643 33222222 No 95 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=100.00 E-value=1.9e-50 Score=293.23 Aligned_cols=292 Identities=9% Similarity=0.075 Sum_probs=226.7 Q ss_pred hhhhcccCCCcchh------------------------------HHHhhccccCcceechhhhhhcH------------- Q lcl|NC_019422. 4 FKSKKKNKEAPGKV------------------------------MMELISDSGNGFYSWHGNLYKSD------------- 40 (384) Q Consensus 4 f~~~~~~~~~~~~~------------------------------~~~~~~~~~~~~~~~~~~~~~~~------------- 40 (384) +++++..+...... ...+..-+ ..+.+++-| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~------~~~~~~~pp~~~~~la~~~~~~ 74 (351) T protein:vir:78 1 MSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECW------SNGEWFEPPVSFAGLAKSFRAS 74 (351) T ss_pred CCCCCCCCCCCCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhh------ccCceecCCCCHHHHHHHHhhh Confidence 44333222111000 00011101 111222222 Q ss_pred -HHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCcee Q lcl|NC_019422. 41 -IVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPT 119 (384) Q Consensus 41 -~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~ 119 (384) .+.+||...++.++ -.-+||+.||..+|++ ++.+++++||+|++++++..|+++ T Consensus 75 ~~h~~~l~~k~n~l~------------------------~~~~Pn~~~t~~~f~~-~~~d~ll~Gnay~~~~rn~~G~~~ 129 (351) T protein:vir:78 75 THHSSALFFKANVLA------------------------STFRPHRWLSRHAFER-WALDFLTFGNGYLERRRNMVGGTL 129 (351) T ss_pred HhhhhhhhhhhhHHh------------------------hcccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCCEE Confidence 22222222222221 1236999999999975 557899999999999999999999 Q ss_pred eEEEEcCceEEEEEcCCCEEEEEEEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHcc Q lcl|NC_019422. 120 QIYPLNALNVEAIYENEVLFLKFLLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNS 199 (384) Q Consensus 120 ~l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng 199 (384) +|+|+++..|++..+.++. .|...+|+.+.++++||||++.+++.++++|+|++.+++.++....++..++.++|+|| T Consensus 130 ~L~pl~~~~v~~~~~~~~~--~~~~~~~~~~~~~~~eVihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NG 207 (351) T protein:vir:78 130 RLEPALAKYVRRKADFSGF--VYVNGWQERHEFAPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENG 207 (351) T ss_pred EEEEecCcceEEeeeCCeE--EEEecCCeEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 9999999999998877754 44556788899999999999999999999999999999999999999999999999999 Q ss_pred CCcceEEeeC-CCCChHHHHHHHHHHHHHhccccccCCcceec-----CCCceeeecccchhHHHHHHHH-HHHHHHHHH Q lcl|NC_019422. 200 NTIKWLLKFK-TALRPDDIKKEVKSFEKNYLQIDSEAGGAAAT-----DSKYDAEQVKAESYVPNAAQMD-KAIQRLYSF 272 (384) Q Consensus 200 ~~p~~il~~~-~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~-----~~g~~~~~l~~~~~~~~~~~~~-~~~~~I~~~ 272 (384) ++|++||.++ +.+++++.++++++|++ ..| .+|.++++++ ++|+++++++.++.+++|.+.+ +++++||++ T Consensus 208 a~pggIl~~~~~~ls~e~~~~lr~~~~~-~~G-~~N~~~~~v~~~~g~~~g~k~~pls~~~~d~qf~e~k~~~~~eIa~a 285 (351) T protein:vir:78 208 SHAGFILYMTDAAQKQDDVDNMRDALKN-AKG-PGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAA 285 (351) T ss_pred CCCceEEEecCCCCCHHHHHHHHHHHHH-hcC-cccccceeeecCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHH Confidence 9999999986 57999999999999986 444 5677888887 5789999999999999998885 578899999 Q ss_pred hCCCHHHhc--------cccHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHH Q lcl|NC_019422. 273 FNTNEKIIQ--------SKYSEDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMST 338 (384) Q Consensus 273 fgvp~~~l~--------~~~~e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~ 338 (384) |||||.++| +++.|++...|++++|.|+++.|++ ++.+|..+ .++|+..++++.|.++ T Consensus 286 ~~VPp~llGi~~~~t~~~sn~e~~~~~f~~~~l~P~~~~iee-~n~~l~~~-------~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 286 HRVPPQLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARFAE-LNDWLGDE-------VVRFDDYEIPPAPVAA 351 (351) T ss_pred hCCCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhcCcc-------ceecChhhhccccccC Confidence 999999986 3567899999999999999999985 67666322 5899999999999887 No 96 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=100.00 E-value=1.8e-49 Score=287.88 Aligned_cols=308 Identities=10% Similarity=0.049 Sum_probs=231.0 Q ss_pred CcchhhhcccCCCcc---hh---HHHh------hcc--ccCc-ceechhhhhhcHHHHHHHHHHHHhhcc--CceEEEEe Q lcl|NC_019422. 1 MNIFKSKKKNKEAPG---KV---MMEL------ISD--SGNG-FYSWHGNLYKSDIVRSIIRPKAKAVGK--MTAKHIRS 63 (384) Q Consensus 1 M~~f~~~~~~~~~~~---~~---~~~~------~~~--~~~~-~~~~~~~~~~~~~v~~~i~~ia~~ia~--~~~~~~~~ 63 (384) |+ ++|+....+.. .. .+.+ +.. .+.. ....++.+++.|+-+..+-.+.+.-+. .++.. T Consensus 1 m~--~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~~a~~~h~s~i~~--- 75 (340) T protein:vir:98 1 MS--KRKPRKAVAMTASAPQKMEAFTFGEPVPVLDKRDILDYVECISNGKWYEPPVSFSGLAKSLRSAVHHSSPIYV--- 75 (340) T ss_pred CC--CCCCCccccccccCccceeEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHHhccccchhhhh--- Confidence 55 33322211110 00 0000 000 0000 011234456666655555444333321 11211 Q ss_pred cCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEE Q lcl|NC_019422. 64 NETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFL 143 (384) Q Consensus 64 ~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~ 143 (384) ..+.+.. .-+|||.||..+|++ ++.+++++||+|++++|+..|++++|+|+++..|++..+.+ .+++. T Consensus 76 -------k~n~l~~--~~~Pn~~lt~~~f~~-~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~~~~--~~~~~ 143 (340) T protein:vir:98 76 -------KRNVLAS--TYIPHPLLSRQDFSR-FALDYLVFGNAFLEQRHSVTGQLIKLLTSPAKYTRRGVDDS--VFWFV 143 (340) T ss_pred -------hhhHHhh--ccCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceEEEcccCc--EEEEE Confidence 0111111 237999999999975 55799999999999999999999999999999998766544 44455 Q ss_pred EcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHHHHHHH Q lcl|NC_019422. 144 LRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDIKKEVK 222 (384) Q Consensus 144 ~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~~~ 222 (384) ..+|+.+.++++||||+|.+++.++++|+|++.++++++....++..+..++|+||++|+++|.++ ..+++++.+++++ T Consensus 144 ~~~~~~~~~~~~eViHir~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~pg~il~~~~~~ls~e~~~~lk~ 223 (340) T protein:vir:98 144 ENFTQPHEFAPDTVFHLLEPDINQEIYGLPEYLSALNSAWLNESATLFRRKYYQNGAHAGYIMYVTDPAQSATDVESLRD 223 (340) T ss_pred ecCCeEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHH Confidence 567888899999999999999999999999999999999999999999999999999999999986 4799999999999 Q ss_pred HHHHHhccccccCCcceec-----CCCceeeecccchhHHHHHHHH-HHHHHHHHHhCCCHHHhc--------cccHHHH Q lcl|NC_019422. 223 SFEKNYLQIDSEAGGAAAT-----DSKYDAEQVKAESYVPNAAQMD-KAIQRLYSFFNTNEKIIQ--------SKYSEDE 288 (384) Q Consensus 223 ~~~~~~~~~~~~~~~~~v~-----~~g~~~~~l~~~~~~~~~~~~~-~~~~~I~~~fgvp~~~l~--------~~~~e~~ 288 (384) +|++. .| .+|.++++++ ++|+++++++.++.+++|.+.+ +++++||++|||||.++| +++.|++ T Consensus 224 ~~~~~-~G-~~n~~~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~ 301 (340) T protein:vir:98 224 AMRNS-KG-LGNFKNLFFYSPNGKPDGIKIVPLSEVATKDDFFNIKKASAADLMDAHRVPFQLMGGKPENIGSLGDVEKV 301 (340) T ss_pred HHHHh-cC-ccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHH Confidence 99874 44 4677888887 5789999999999999999885 578899999999999996 2466899 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccC Q lcl|NC_019422. 289 WNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYAS 335 (384) Q Consensus 289 ~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d 335 (384) ...|++++|.|+++.|++ +|.+|..+ .++|+..++++.| T Consensus 302 ~~~f~~~~l~Pl~~~iee-~n~~L~~e-------~~rF~~~~l~~~d 340 (340) T protein:vir:98 302 AKVFVRNELSPLQDRFRE-VNDWLGME-------VIRFKEYTLDNPE 340 (340) T ss_pred HHHHHHHHHHHHHHHHHH-HHhccccc-------ccccCccccccCC Confidence 999999999999999984 88887543 3688888888888 No 97 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=100.00 E-value=3.7e-49 Score=286.08 Aligned_cols=306 Identities=10% Similarity=0.063 Sum_probs=221.5 Q ss_pred hhhhcccCCCcc-----hh----------HHHhhc--cccC---c-----c--eechhhhhhcHHHHHHHHHHHHh--hc Q lcl|NC_019422. 4 FKSKKKNKEAPG-----KV----------MMELIS--DSGN---G-----F--YSWHGNLYKSDIVRSIIRPKAKA--VG 54 (384) Q Consensus 4 f~~~~~~~~~~~-----~~----------~~~~~~--~~~~---~-----~--~~~~~~~~~~~~v~~~i~~ia~~--ia 54 (384) ++++++...... .. ....++ +..+ + + ....+.+++-|+-+..+-.+-+. -. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~y~~~~~~~~~~~pp~~~~~la~~~~~~~~h 80 (350) T protein:vir:11 1 MSKRRSHRRQQPVTVQSAQEGEFIPRQGGRAEAFTFGDPMPVLDGRGILDYLECWPNGRWYEPPLSMEGLAKSVGSSVYL 80 (350) T ss_pred CCccccCCCcCccccCCcchhhhccccccceEEEEeCCceeecCcchhhHHHHHhhcCccccCCCCHHHHHHHHhhhhhh Confidence 444332221100 00 000000 0000 0 0 00112233333333332111100 00 Q ss_pred cCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEc Q lcl|NC_019422. 55 KMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYE 134 (384) Q Consensus 55 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~ 134 (384) +.++.. ..+.+. ..-+||+.||..+|++ ++.+++++||||++++|+..|++++|+|++|..|++..+ T Consensus 81 ~~~l~~----------k~n~l~--~~~~Pn~~~t~~~f~~-~v~d~ll~Gnay~~~~rn~~G~~~~L~~l~~~~vr~~~~ 147 (350) T protein:vir:11 81 QSGLKF----------KRNMLA--KTFIPHRLLSRATFEQ-FSLDWLTFGSAYLEQPRSRLGTRMPLQAPLAKYMRRGTD 147 (350) T ss_pred ccchhh----------hhhhhh--hcccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCCEEEEEEeCCceeEeeec Confidence 111111 001111 1237999999999986 567999999999999999999999999999999998776 Q ss_pred CCCEEEEEEEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCC Q lcl|NC_019422. 135 NEVLFLKFLLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALR 213 (384) Q Consensus 135 ~~~~~~~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~ 213 (384) .+. +++...+|..+.++++||||++.+++.++++|+||+.+++.++....++..+..++|+||++|+++|+++ ..++ T Consensus 148 ~~~--~~~~~~~~~~~~~~~~eVihir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~~~gil~~~~~~ls 225 (350) T protein:vir:11 148 LET--FYQVRSWKDEHEFEKGSVIQLREADINQEIYGVPEWFCALQSALLNESATLFRRKYYNNGSHAGFILYMTDAAQN 225 (350) T ss_pred CCe--EEEEeeCCeEEEECcccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCC Confidence 553 3444567888899999999999999999999999999999999999999999999999999999999986 4799 Q ss_pred hHHHHHHHHHHHHHhccccccCCcceec-----CCCceeeecccchhHHHHHHHH-HHHHHHHHHhCCCHHHhc------ Q lcl|NC_019422. 214 PDDIKKEVKSFEKNYLQIDSEAGGAAAT-----DSKYDAEQVKAESYVPNAAQMD-KAIQRLYSFFNTNEKIIQ------ 281 (384) Q Consensus 214 ~e~~~~~~~~~~~~~~~~~~~~~~~~v~-----~~g~~~~~l~~~~~~~~~~~~~-~~~~~I~~~fgvp~~~l~------ 281 (384) +++.++++++|++.. | .+|.++++++ ++|+++++++.++.+++|.+.+ +++++||++|||||.++| T Consensus 226 ~e~~~~l~~~~~~~~-G-~~N~~~~~v~~~~g~~~g~~~~pl~~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t 303 (350) T protein:vir:11 226 EEDIDALRTALKTAK-G-PGNFRNLFVYAPNGKKEGIQLIPVSEVAAKDEFGSIKNISRDDQLAGLRVYPQLMGVVPQNA 303 (350) T ss_pred HHHHHHHHHHHHHhc-C-ccccCceeeecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCC Confidence 999999999998853 4 3677888877 4689999999999999998885 678899999999999996 Q ss_pred --cccHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhh Q lcl|NC_019422. 282 --SKYSEDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNL 331 (384) Q Consensus 282 --~~~~e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~ 331 (384) +++.|++...|++++|.|+++.|+ ++|++|..+..+ +.+|++++| T Consensus 304 ~~~sn~e~~~~~f~~~~L~P~~~~ie-~ln~~l~~~~~~----F~~~~~~~l 350 (350) T protein:vir:11 304 GGFGSISDAAAVWASLELAPMQTRLQ-QVNEMIGEEVVR----FAQFDAPGL 350 (350) T ss_pred CCcCCHHHHHHHHHHHHHHHHHHHHH-HHHhhcCccccc----cCcccccCC Confidence 245689999999999999999998 588887654322 335677777 No 98 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=100.00 E-value=1.5e-49 Score=288.28 Aligned_cols=239 Identities=14% Similarity=0.171 Sum_probs=193.4 Q ss_pred CcchhhhcccCCCcch-hHHHhhc---c-ccC-cceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccch Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGK-VMMELIS---D-SGN-GFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEI 74 (384) Q Consensus 1 M~~f~~~~~~~~~~~~-~~~~~~~---~-~~~-~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~ 74 (384) ||||+++++....+.. .....+. . .+. +...+.+.++++++|++||+.||++||++|+++++.++ +...++ T Consensus 1 MglF~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~---~~~~~~ 77 (251) T protein:vir:46 1 MGIFYKNEKRDLQYNEDDLQMMVQTLPSFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQ---INYSDR 77 (251) T ss_pred CCccccccccccCCCccchhhhhhhhccccCcCcceechhhhhccHHHHHHHHHHHHhHhhCceEEeeCcc---ccccch Confidence 9999866544332222 2211211 1 111 12245567899999999999999999999999987543 445688 Q ss_pred HHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEE-EEE----cCceE Q lcl|NC_019422. 75 YIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLK-FLL----RNGKI 149 (384) Q Consensus 75 ~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~-~~~----~~g~~ 149 (384) ++.+|+.+||+.||+++||+.++.+++++||||++++|+..|++.+|+||+|.+|++..+.++..++ +.. .+|.. T Consensus 78 ~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~~g~~ 157 (251) T protein:vir:46 78 IVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIE 157 (251) T ss_pred HHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCceEEEEECCCCcEEEEEEEeccCCccee Confidence 8899999999999999999999999999999999999999999999999999999999987765443 322 23677 Q ss_pred EEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCC-hHHHHHHHHHHHHHh Q lcl|NC_019422. 150 VSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALR-PDDIKKEVKSFEKNY 228 (384) Q Consensus 150 ~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~-~e~~~~~~~~~~~~~ 228 (384) +.++++||||+|.+ +.++++|+||+.++..++....++++++.++|+||++|+++|++++.++ +++.+++++.|++.+ T Consensus 158 ~~~~~~diiH~r~~-~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~~e~~~~~~~~~~~~~ 236 (251) T protein:vir:46 158 RNVKFEDMLDIKFY-SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFPKVL 236 (251) T ss_pred EEECCccEEEecCc-CCCCeeecCHHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHh Confidence 89999999999976 5789999999999999999999999999999999999999999999885 456788999999999 Q ss_pred ccccccCCcceecCCCcee Q lcl|NC_019422. 229 LQIDSEAGGAAATDSKYDA 247 (384) Q Consensus 229 ~~~~~~~~~~~v~~~g~~~ 247 (384) +|. +|+|++.+ |++= T Consensus 237 ~g~-~n~g~~~~---gm~~ 251 (251) T protein:vir:46 237 VEL-NKLGKLSY---SMNQ 251 (251) T ss_pred cCc-cccccccc---ccCC Confidence 764 56776554 3332 No 99 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=100.00 E-value=5.8e-49 Score=285.05 Aligned_cols=306 Identities=12% Similarity=0.068 Sum_probs=222.9 Q ss_pred hhhhcccCCCcch----------hHHHhhccc---cCc-----c--eechhhhhhcHHHHHHHHHHHHhhc--cCceEEE Q lcl|NC_019422. 4 FKSKKKNKEAPGK----------VMMELISDS---GNG-----F--YSWHGNLYKSDIVRSIIRPKAKAVG--KMTAKHI 61 (384) Q Consensus 4 f~~~~~~~~~~~~----------~~~~~~~~~---~~~-----~--~~~~~~~~~~~~v~~~i~~ia~~ia--~~~~~~~ 61 (384) +.++++....+.. ..+. +++. ... + ...++.+++-|.-+..+-.+.+.-+ +.++.. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~-f~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~~~i~~- 78 (344) T protein:vir:60 1 MSKKKGKTLQPAAKKMTASAPKMEAFT-FGEPVPVLDRRDILDYVECISNGRWYEPPISFTGLAKSLRAAVHHSSPIYV- 78 (344) T ss_pred CCcccCCCCCchHHhhcCCcCcEEEEE-cCCceeecCCcchhHHHHhhhcCccccCCCCHHHHHHHHHhhhhhccchhh- Confidence 3332222111100 0000 0000 000 0 0112233443333333322211111 112221 Q ss_pred EecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEE Q lcl|NC_019422. 62 RSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLK 141 (384) Q Consensus 62 ~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~ 141 (384) ..+.+.. .-+||+.||+.+| +.++.+++++||||++++|+..|++++|+|+++..|++..+.+. |+ T Consensus 79 ---------k~n~l~~--~~~Pn~~~t~~~f-~~~~~d~ll~Gnay~~i~rn~~G~~~~L~~l~~~~vr~~~~~~~--~~ 144 (344) T protein:vir:60 79 ---------KRNILAS--TFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGVEEDV--YW 144 (344) T ss_pred ---------hhhHHHh--hccCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcEEEEEEcCcceEEEeecCCe--EE Confidence 0111111 3379999999999 57889999999999999999999999999999999998777664 44 Q ss_pred EEEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHHHHH Q lcl|NC_019422. 142 FLLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDIKKE 220 (384) Q Consensus 142 ~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~ 220 (384) +...+|+.+.++++||||++.+++.++++|+||+..++.++....++..+..++|+||++|+++|.++ ..+++++.+++ T Consensus 145 ~v~~~~~~~~~~~~eIiHir~~~~~~~~yGlsp~~~a~~si~l~~~a~~~~~~~f~NG~~pg~il~~~~~~ls~e~~~~i 224 (344) T protein:vir:60 145 WVPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEML 224 (344) T ss_pred EEccCCeEEEEcCccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHH Confidence 45667888899999999999998899999999999999999999999999999999999999999986 57999999999 Q ss_pred HHHHHHHhccccccCCcceec------CCCceeeecccchhHHHHHHHH-HHHHHHHHHhCCCHHHhc--------cccH Q lcl|NC_019422. 221 VKSFEKNYLQIDSEAGGAAAT------DSKYDAEQVKAESYVPNAAQMD-KAIQRLYSFFNTNEKIIQ--------SKYS 285 (384) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~v~------~~g~~~~~l~~~~~~~~~~~~~-~~~~~I~~~fgvp~~~l~--------~~~~ 285 (384) +++|++.. |. ++++.+++ ++|+++++++.++.+++|.+.+ +++++||++|||||.++| .++. T Consensus 225 k~~~~~~~-g~--~~~r~~~l~~p~g~~~g~~~~pis~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~~n~ 301 (344) T protein:vir:60 225 RENMVKSK-GR--NNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDI 301 (344) T ss_pred HHHHHHhc-CC--CCCcceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccH Confidence 99998865 32 34566665 4789999999999999998875 688999999999999996 2456 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCH Q lcl|NC_019422. 286 EDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASM 336 (384) Q Consensus 286 e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~ 336 (384) |++...|++++|.|+++.++ +||.+|..+ .++|+.-++..+|- T Consensus 302 e~~~~~f~~~~L~Pl~~~~e-~ln~~lg~~-------~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 302 EKVAKVFVRNELIPLQDRIR-EINGWLGQE-------VIRFKNYSLDTDNG 344 (344) T ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHhcCCc-------ccccCccccCCCCC Confidence 89999999999999999998 589887432 34666666655555 No 100 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=100.00 E-value=6.8e-49 Score=284.65 Aligned_cols=307 Identities=12% Similarity=0.070 Sum_probs=225.6 Q ss_pred hhhhcccCCCcchhH-------HHhhc--cc---cCc-----c--eechhhhhhcHHHHHHHHHHHHhhc--cCceEEEE Q lcl|NC_019422. 4 FKSKKKNKEAPGKVM-------MELIS--DS---GNG-----F--YSWHGNLYKSDIVRSIIRPKAKAVG--KMTAKHIR 62 (384) Q Consensus 4 f~~~~~~~~~~~~~~-------~~~~~--~~---~~~-----~--~~~~~~~~~~~~v~~~i~~ia~~ia--~~~~~~~~ 62 (384) ++++++....+.... ...++ +. ..+ + ...++.+++-|+-+..+-.+.+.-+ +.++... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~s~i~~k- 79 (344) T protein:vir:56 1 MSKKKGKTPQPAAKTMTASAPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPVSFTGLAKSLRAAVHHSSPIYVK- 79 (344) T ss_pred CCCCCCCCCchhhHHhhcCCCceEEEEcCCceeecCcchhhhHHHhhhcCccccCCCCHHHHHHHHhhhhhhCccceeh- Confidence 454444332221110 00000 00 000 0 0122334444544444444322221 2223221 Q ss_pred ecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEE Q lcl|NC_019422. 63 SNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKF 142 (384) Q Consensus 63 ~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~ 142 (384) .+.+. ..-+|||+||+.+| +.++.+++++||||++++|+..|++++|+|+++..|++..+.+. +++ T Consensus 80 ---------~n~l~--~~~~Pnp~~t~~~f-~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~v~~~~~~~~--~~~ 145 (344) T protein:vir:56 80 ---------RNILA--STFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGVEEDV--YWW 145 (344) T ss_pred ---------hhhHH--hhcCCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceeEEeecCCE--EEE Confidence 11111 13489999999999 67789999999999999999999999999999999998776654 445 Q ss_pred EEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHHHHHH Q lcl|NC_019422. 143 LLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDIKKEV 221 (384) Q Consensus 143 ~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~~ 221 (384) ...+|+.+.++++||||++.+++.++++|+||+.+++.++....+++.+..++|+||++|+++|.++ ..+++++.++++ T Consensus 146 ~~~~g~~~~~~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~d~~ls~e~~~~lk 225 (344) T protein:vir:56 146 VPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEMLR 225 (344) T ss_pred EecCCeEEEEcCccEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHH Confidence 5677888899999999999999999999999999999999999999999999999999999999986 479999999999 Q ss_pred HHHHHHhccccccCCcceec------CCCceeeecccchhHHHHHHHH-HHHHHHHHHhCCCHHHhc--------cccHH Q lcl|NC_019422. 222 KSFEKNYLQIDSEAGGAAAT------DSKYDAEQVKAESYVPNAAQMD-KAIQRLYSFFNTNEKIIQ--------SKYSE 286 (384) Q Consensus 222 ~~~~~~~~~~~~~~~~~~v~------~~g~~~~~l~~~~~~~~~~~~~-~~~~~I~~~fgvp~~~l~--------~~~~e 286 (384) ++|++.. |. ++++++++ ++|+++++++.++.+++|.+.+ +++++||++|||||.++| +++.| T Consensus 226 ~~~~~~~-g~--~~~r~l~l~~p~g~~~G~~~~pis~~~~d~qf~e~k~~s~~eIa~afrVPp~llGi~~~~t~~~~n~e 302 (344) T protein:vir:56 226 ENMVKSK-GR--NNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIE 302 (344) T ss_pred HHHHHhc-CC--CCccceEEecCCCCccceeEEEcCCChHHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHH Confidence 9998865 32 45777777 4799999999999999998875 678899999999999997 23568 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCH Q lcl|NC_019422. 287 DEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASM 336 (384) Q Consensus 287 ~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~ 336 (384) ++...|+++||.|+++.++ ++|.+|..+. ++|+.-++..+|- T Consensus 303 q~~~~f~~~tL~Pl~~~ie-~~n~~l~~~~-------~~F~~y~l~~~~~ 344 (344) T protein:vir:56 303 KVAKVFVRNELIPLQDRIR-EINGWIGQEV-------IRFKNYSLDTDNG 344 (344) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHhhhcccc-------ccCCCccccccCC Confidence 9999999999999999998 4888876443 3444333333332 No 101 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=100.00 E-value=1.2e-48 Score=283.33 Aligned_cols=313 Identities=10% Similarity=0.029 Sum_probs=232.9 Q ss_pred CcchhhhcccC---C---------CcchhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhc--cCceEEEEecCC Q lcl|NC_019422. 1 MNIFKSKKKNK---E---------APGKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVG--KMTAKHIRSNET 66 (384) Q Consensus 1 M~~f~~~~~~~---~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia--~~~~~~~~~~~~ 66 (384) |.-........ . -+++.... +.+....++...+.+++.|.-+..+-.+.+.-+ +-.+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~-~~~y~~~~~~~~~~~~epp~~~~~la~l~~~~~~h~~~i~~------ 73 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRTFSLNEISASP-ALDYVGIGFDENYNCYLPPVNRHALAKLPHQNAQHGGILHS------ 73 (345) T ss_pred CCCCccccchhhcccCcceeEEeecCCccccc-chhhhhhhhcCCccccCCCCCHHHHHHHhhcccccccceee------ Confidence 43333211000 0 00111111 111222222334557777766655444422211 111211 Q ss_pred cceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEE---EE Q lcl|NC_019422. 67 EFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLK---FL 143 (384) Q Consensus 67 ~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~---~~ 143 (384) ..+-+. ..-+|||.||+.+|++ ++.+++++||||++++|+..|++.+|+|+++..|++..+.+..... .. T Consensus 74 ----k~n~l~--~~~~Pn~~lt~~~f~~-~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~d~~~~~~~~~~~~ 146 (345) T protein:vir:37 74 ----RANMVS--SLYEGGKALSRMDMRA-LCLNLIQFGDVGLLKVRNGFGQVVRLVPLSSLYLRVRKDGGYSYLMKKSLY 146 (345) T ss_pred ----echHHH--hhccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCcEEEEEEEcCceeEEEEeCCeeEEEEEeEe Confidence 111222 2347999999999986 4679999999999999999999999999999999998887664432 22 Q ss_pred EcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHHHHHHH Q lcl|NC_019422. 144 LRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDIKKEVK 222 (384) Q Consensus 144 ~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~~~ 222 (384) ..+|+.+.++++||||+|.+++.++++|+|++.+++.++....++..++.++|+||++|+++|.++ ..+++++.+++++ T Consensus 147 ~~~g~~~~~~~~dVihir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NG~~p~~Il~~~d~~l~~e~~~~lk~ 226 (345) T protein:vir:37 147 DTAQEIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTDPDLTEEMEEEIAR 226 (345) T ss_pred cCCceEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEecCCCCCHHHHHHHHH Confidence 345788899999999999999999999999999999999999999999999999999999999985 5789999999999 Q ss_pred HHHHHhccccccCCcceec-----CCCceeeecccchhHHHHHHHH-HHHHHHHHHhCCCHHHhc--------cccHHHH Q lcl|NC_019422. 223 SFEKNYLQIDSEAGGAAAT-----DSKYDAEQVKAESYVPNAAQMD-KAIQRLYSFFNTNEKIIQ--------SKYSEDE 288 (384) Q Consensus 223 ~~~~~~~~~~~~~~~~~v~-----~~g~~~~~l~~~~~~~~~~~~~-~~~~~I~~~fgvp~~~l~--------~~~~e~~ 288 (384) +|++. .| .+|.++++++ ++|+++++++.++.+++|.+.+ +++++||++|||||.++| +++.|++ T Consensus 227 ~~~~~-~g-~~n~~~~~i~~p~g~~~G~~~~pls~~~~d~qf~e~k~~~~~dIa~a~~VPp~llGi~~~~~~~~~~~e~~ 304 (345) T protein:vir:37 227 KISES-KG-VGNFRSMFVNIANGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKY 304 (345) T ss_pred HHHHh-cC-cccccceEEEcCCCcccceEEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCccCCCCCCcccHHHH Confidence 99874 34 3566777776 5899999999999999998885 688999999999999986 2456899 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhc Q lcl|NC_019422. 289 WNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQY 333 (384) Q Consensus 289 ~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~ 333 (384) ...|++++|.|+++.|++++|+.+ + ...+..++|+..++.+ T Consensus 305 ~~~f~~~~l~P~~~~ie~~ln~~~--~--~~~~~~i~F~~~~L~~ 345 (345) T protein:vir:37 305 REVYHYDEVMPLQEIIAETINQDP--E--IKNLLKIKFREQNFAK 345 (345) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhc--c--CCCcceEEecchhhcC Confidence 999999999999999999999743 2 2334578888777766 No 102 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=100.00 E-value=8.2e-49 Score=284.22 Aligned_cols=307 Identities=12% Similarity=0.062 Sum_probs=225.9 Q ss_pred hhhhcccCCCcchhHH-------Hhh--ccc---c-----Ccc--eechhhhhhcHHHHHHHHHHH--HhhccCceEEEE Q lcl|NC_019422. 4 FKSKKKNKEAPGKVMM-------ELI--SDS---G-----NGF--YSWHGNLYKSDIVRSIIRPKA--KAVGKMTAKHIR 62 (384) Q Consensus 4 f~~~~~~~~~~~~~~~-------~~~--~~~---~-----~~~--~~~~~~~~~~~~v~~~i~~ia--~~ia~~~~~~~~ 62 (384) ++++++.+..+..... ..+ ++. . ..+ ...++.+++-|+-+..+-.+- +...+.++... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~~~i~~k- 79 (344) T protein:vir:20 1 MSKKKGKTPQPAAKTMTASGPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPVSFTGLAKSLRAAVHHSSPIYVK- 79 (344) T ss_pred CCcccCCCCcchhhhhhccCCceEEEEcCCceEecCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhhhhCccceeh- Confidence 4444433322111100 000 000 0 000 012233444444444433331 11112233321 Q ss_pred ecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEE Q lcl|NC_019422. 63 SNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKF 142 (384) Q Consensus 63 ~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~ 142 (384) .+.+.. .-+||+.||+.+| +.++.+++++||||++++|+..|++++|+|+++..|++..+.+. |++ T Consensus 80 ---------~n~l~~--~~~Pn~~lt~~~f-~~~~~d~ll~Gnay~~i~rn~~G~~~~L~pl~~~~vr~~~~~~~--~~~ 145 (344) T protein:vir:20 80 ---------RNILAS--TFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGVEEDV--YWW 145 (344) T ss_pred ---------hhhHHH--hccCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcEEEEEEcCCceeEeeecCCE--EEE Confidence 111111 2379999999999 57789999999999999999999999999999999998776654 444 Q ss_pred EEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHHHHHH Q lcl|NC_019422. 143 LLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDIKKEV 221 (384) Q Consensus 143 ~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~~ 221 (384) ...+|..+.++++||||++.+.+.++++|+||+..++.++.+..+++.+..++|+||++|+++|.++ ..+++++.++++ T Consensus 146 ~~~~~~~~~~~~~eIiHir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~p~~Il~~~d~~l~~e~~~~ik 225 (344) T protein:vir:20 146 VPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEMLR 225 (344) T ss_pred EccCCeEEEEcCccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHH Confidence 5667888999999999999988899999999999999999999999999999999999999999975 579999999999 Q ss_pred HHHHHHhccccccCCcceec------CCCceeeecccchhHHHHHHHH-HHHHHHHHHhCCCHHHhc--------cccHH Q lcl|NC_019422. 222 KSFEKNYLQIDSEAGGAAAT------DSKYDAEQVKAESYVPNAAQMD-KAIQRLYSFFNTNEKIIQ--------SKYSE 286 (384) Q Consensus 222 ~~~~~~~~~~~~~~~~~~v~------~~g~~~~~l~~~~~~~~~~~~~-~~~~~I~~~fgvp~~~l~--------~~~~e 286 (384) ++|++.. |. ++++.+++ .+|+++++++.++.+++|.+.+ +++++||++|||||.++| .++.| T Consensus 226 ~~~~~~~-g~--~n~r~l~l~~p~g~~~gi~~~pis~~~~d~qf~e~k~~s~~eIa~af~VPp~llGi~~~~t~~~~n~e 302 (344) T protein:vir:20 226 ENMVKSK-GR--NNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIE 302 (344) T ss_pred HHHHHhc-CC--CCccceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHH Confidence 9998865 32 44666665 4689999999999999998885 678899999999999996 24568 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCH Q lcl|NC_019422. 287 DEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASM 336 (384) Q Consensus 287 ~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~ 336 (384) ++...|++++|.|+++.++ ++|.+|..+ .++|+..++...|. T Consensus 303 ~~~~~f~~~~l~P~~~~~e-~in~~lg~~-------~i~F~~~~l~~~d~ 344 (344) T protein:vir:20 303 KVAKVFVRNELIPLQDRIR-EINGWLGQE-------VIRFKNYSLDTDND 344 (344) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHHhcCCc-------ccccCccccccCCC Confidence 9999999999999999998 588776432 35676666665555 No 103 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=100.00 E-value=3.5e-48 Score=280.74 Aligned_cols=313 Identities=9% Similarity=0.026 Sum_probs=227.6 Q ss_pred Ccchhhhcc------------cCCCcchhHHHhhccccCcceechhhhhhcHHHHHHHHHHHH--hhccCceEEEEecCC Q lcl|NC_019422. 1 MNIFKSKKK------------NKEAPGKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAK--AVGKMTAKHIRSNET 66 (384) Q Consensus 1 M~~f~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~--~ia~~~~~~~~~~~~ 66 (384) |.=..+.-. ..+.+++...... +...-+....+.+++.|.-+..+-.+.+ .-.+-.+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~y~~~~~~~~~~~~epp~~~~~la~~~~~~~~h~~~i~~------ 73 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRTFSLSEITASPAL-DYVGIGFDENYNCYLPPVNRHALAKLPHQNAQHGGILHS------ 73 (345) T ss_pred CCccccccchhhhcCCCceEEEeecCCcccchhh-cccceeeecCCccccCCCCHHHHHHHhhcchhhcchhhh------ Confidence 322211000 0001111111111 1111111223445555544433322211 111111111 Q ss_pred cceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEE---EE Q lcl|NC_019422. 67 EFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLK---FL 143 (384) Q Consensus 67 ~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~---~~ 143 (384) ..+.+ ...-+|||.||..+|++ ++.+++++||+|++++|+..|++++|+|++|..|++..+.+..++. .. T Consensus 74 ----k~n~l--~~~~~Pn~~~t~~~f~~-~v~d~ll~Gnay~~i~rn~~G~~~~L~pl~~~~vr~~~d~~~~~~~~~~~~ 146 (345) T protein:vir:37 74 ----RANMV--SATYEGGKALSKMEMRA-LCLNLIQFGDVGLLKVRNGFGQVVRLVPLSSLYLRVHKDGGYSYLMKKSLY 146 (345) T ss_pred ----hhhHH--hhccCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCCEEEEEEecCceeEEeecCCeeEEEeeeee Confidence 01111 12337999999999975 5578999999999999999999999999999999988777654432 22 Q ss_pred EcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHHHHHHH Q lcl|NC_019422. 144 LRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDIKKEVK 222 (384) Q Consensus 144 ~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~~~ 222 (384) ...|+.+.++++||||++.+++.++++|+|++..++.++....++..++.++|+||++|++||.++ ..+++++.+++++ T Consensus 147 ~~~g~~~~~~~~eViHir~~~~~~~~~Gl~~~~~a~~si~l~~~a~~~~~~~f~NGa~~~~Il~~t~~~l~~e~~~~lk~ 226 (345) T protein:vir:37 147 DTAQEIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTDPDLTEEMEEEIAR 226 (345) T ss_pred ccCceEEEEccccEEEEcCCCCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHH Confidence 335788899999999999998899999999999999999999999999999999999999999875 5799999999999 Q ss_pred HHHHHhccccccCCcceec-----CCCceeeecccchhHHHHHHHH-HHHHHHHHHhCCCHHHhc--------cccHHHH Q lcl|NC_019422. 223 SFEKNYLQIDSEAGGAAAT-----DSKYDAEQVKAESYVPNAAQMD-KAIQRLYSFFNTNEKIIQ--------SKYSEDE 288 (384) Q Consensus 223 ~~~~~~~~~~~~~~~~~v~-----~~g~~~~~l~~~~~~~~~~~~~-~~~~~I~~~fgvp~~~l~--------~~~~e~~ 288 (384) +|++.+.+ ++.+.++++ ++|+++++++.++.+++|.+.+ .++++||++|||||.++| .++.|++ T Consensus 227 ~~~~~~g~--~n~~~~~i~~~~g~~~G~~~~pl~~~~~d~qf~e~k~~~~~dI~~a~~VPp~liGi~~~~t~~~s~~e~~ 304 (345) T protein:vir:37 227 KISESKGV--GNFRSMFVNIAGGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKY 304 (345) T ss_pred HHHHhcCc--cccCceeEecCCCCccceeEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhccccCCCCCcccHHHH Confidence 99997643 444555554 4679999999999999998885 578899999999999996 2456899 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhc Q lcl|NC_019422. 289 WNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQY 333 (384) Q Consensus 289 ~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~ 333 (384) ...|+++||.|+++.|++++|+.+ +...+..++||..++.+ T Consensus 305 ~~~f~~~~l~P~~~~ie~~ln~~~----e~~~~~~i~F~~~~l~k 345 (345) T protein:vir:37 305 REVYHYDEVMPLQEIIAETINQDP----EIKNLLKIKFREQNFAK 345 (345) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhh----ccCCcceEEECchhhcC Confidence 999999999999999999999732 23345789999999988 No 104 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=100.00 E-value=2.8e-48 Score=281.28 Aligned_cols=313 Identities=8% Similarity=-0.003 Sum_probs=227.2 Q ss_pred hhhhcccCCCcc-hhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhc---cCceEEEEecCCcceeccchHHHHH Q lcl|NC_019422. 4 FKSKKKNKEAPG-KVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVG---KMTAKHIRSNETEFKTNPEIYIKFL 79 (384) Q Consensus 4 f~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia---~~~~~~~~~~~~~~~~~~~~~~~~l 79 (384) +++++..+.... ......++- ......+...++..|+...-+..+ .-|+....- .+......+....| T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~------~~p~~~~~~~~~~~~~~~~~~~~~~~~~pP~~~~~L--a~l~~~~~~h~~~L 72 (337) T protein:vir:78 1 MTKRQQQPAQAAASSPRPSVVF------SMPEAIDPTAWMTDYTGVFYNPYGEYYQPPIDRKGL--AKVARANAHHGAIL 72 (337) T ss_pred CCCcccCcccccccCceeEEEe------cCcccccCcchhHhhhhhhhccCcceecCCCCHHHH--HHHhhcchhhhhHH Confidence 333332222211 111111110 011222333333333333322222 112211000 00001111223457 Q ss_pred HhhccccCCH----HHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEcCceEEEEehh Q lcl|NC_019422. 80 LENPNPFMSG----QILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLRNGKIVSYPYS 155 (384) Q Consensus 80 ~~~PN~~~s~----~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~~~ 155 (384) ..+||+.++. .++++.++.+++++||||++++|+..|++++|+|+++.+|++..+.+ +.|...+++.+.++++ T Consensus 73 ~~k~N~~~~~f~~~~~~~~~~~~d~ll~GNay~~~~rn~~G~~~~L~pl~~~~v~~~~d~~---~~~~~~~~~~~~~~~~ 149 (337) T protein:vir:78 73 MARRNMVAGRFTNQRATITAFVHNYLQFGDGGLLKLRNSFGQVVGLHPLSSVYLRRREDGC---FVYLQQGKPNLIYRPD 149 (337) T ss_pred HhhhccccccCcCcHHHHHHHHHHHHhhCCeEEEEEECCCCcEEEEEEeCCceeEeeeCCe---EEEEEcCCceEEECCc Confidence 7799976654 47899999999999999999999999999999999999998775533 2344556788899999 Q ss_pred heEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCC-CCChHHHHHHHHHHHHHhcccccc Q lcl|NC_019422. 156 DIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKT-ALRPDDIKKEVKSFEKNYLQIDSE 234 (384) Q Consensus 156 evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-~~~~e~~~~~~~~~~~~~~~~~~~ 234 (384) ||||+|.+++.++++|+||+..++.++.+..+++.+..++|+||++|+++|.+++ .+++++.+++++.|++ ..| .+| T Consensus 150 eIiHik~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~p~~il~~~~~~l~~e~~~~lk~~~~~-~~G-~~n 227 (337) T protein:vir:78 150 DVIWLAQYDPEQQVYGMPDYLGGLQSALLNQDATLFRRRYFLNGAHMGFIFYATDPNMDDDTEEEMKEMIAN-SKG-VGN 227 (337) T ss_pred cEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHH-hcC-ccc Confidence 9999999998999999999999999999999999999999999999999999764 7999999999999976 444 456 Q ss_pred CCcceec-----CCCceeeecccchhHHHHHHHH-HHHHHHHHHhCCCHHHhcc---------ccHHHHHHHHHHHHHHH Q lcl|NC_019422. 235 AGGAAAT-----DSKYDAEQVKAESYVPNAAQMD-KAIQRLYSFFNTNEKIIQS---------KYSEDEWNAYYESEIEP 299 (384) Q Consensus 235 ~~~~~v~-----~~g~~~~~l~~~~~~~~~~~~~-~~~~~I~~~fgvp~~~l~~---------~~~e~~~~~~~~~~i~P 299 (384) .++++++ ++|+++++++.++.+++|.+.+ +++++||++|||||.++|. ++.|++...|+++||.| T Consensus 228 ~~~~~v~~~~g~~~Gi~~~pis~~~~d~qfle~k~~s~~eIa~a~~VPp~llGi~~~~~~~~~~n~e~~~~~f~~~~L~P 307 (337) T protein:vir:78 228 FRSMFVNIPDGKPDGIKLIPVGDIATKDEFAAIKGITAQDVLTAHRYPPALAGIIPTNGGGGLGDPEKYDATYARNEVLP 307 (337) T ss_pred ccceEEEcCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcccccCCCcCccccHHHHHHHHHHHHHHH Confidence 7777777 6789999999999999998875 6888999999999999861 24688899999999999 Q ss_pred HHHHHHHHHhhcccCcccccCcceEEeechhhh Q lcl|NC_019422. 300 VGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQ 332 (384) Q Consensus 300 ~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~ 332 (384) +++.|++++|.+|++.... ..++++..+++ T Consensus 308 ~~~~ie~~~n~~ll~~~~~---~~f~~~~~~~~ 337 (337) T protein:vir:78 308 LCELVQDAINSAGLPRALW---VTFRETIGAAV 337 (337) T ss_pred HHHHHHHHHhhhcCChhhc---eeccccccccC Confidence 9999999999998875432 34566666666 No 105 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=100.00 E-value=1.2e-38 Score=228.40 Aligned_cols=200 Identities=7% Similarity=0.033 Sum_probs=163.1 Q ss_pred EEEEEcCCCEEEEEEE----cCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcce Q lcl|NC_019422. 129 VEAIYENEVLFLKFLL----RNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKW 204 (384) Q Consensus 129 v~~~~~~~~~~~~~~~----~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~ 204 (384) |++..+.+ .+|.+.. ..|+.+.++++||+|+|.+.+.++++|+||+.+++.++....++.+++.++|+||++|++ T Consensus 1 ~r~~~dg~-~~y~~~~~~~~~~g~~~~~~~~eilH~r~~~~~~~~~Glspi~~a~~~i~~~~aa~~~~~~~f~Ng~~p~g 79 (219) T protein:vir:98 1 MRVCKDGN-YKYLMKKSLYDTKSEIYEYNKNDVIFIKLYDPMQQVYGSPDYVGGITSALLNSDATIFRRRYYSNGAHMGF 79 (219) T ss_pred CceeecCe-EEEEEecceecCCceeEEeccccEEEecCCCCCCCcceecHHHHHHHHHHHHHHHHHHHHHHHhcCCCCce Confidence 33322222 2232222 237788999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeCC-CCChHHHHHHHHHHHHHhccccccCCcceec-----CCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCH Q lcl|NC_019422. 205 LLKFKT-ALRPDDIKKEVKSFEKNYLQIDSEAGGAAAT-----DSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNE 277 (384) Q Consensus 205 il~~~~-~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~-----~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~ 277 (384) +|++++ .+++++.+++++.|++. .|. +|.++++++ ++|++|++++.++.|+||.+. ++++.+||++||||| T Consensus 80 il~~~~~~l~~e~~~~~~~~~~~~-~g~-~n~~~~~l~~~gg~~~G~~~~~~~~~~~d~qfle~rk~~~~eIa~~fgVPp 157 (219) T protein:vir:98 80 ILYSTDPDMTEEMEDEIAERIRDS-KGV-GNFRSMFVNIAGGHPDGLKVIPIGDTGQKDEFANIKNISAQDVLTSHRFPP 157 (219) T ss_pred EEEeCCCCCCHHHHHHHHHHHHHh-cCc-ccccceeEecCCCCccceeEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCH Confidence 998865 69999999999999875 453 456666665 578999999999999999887 468899999999999 Q ss_pred HHhc--------cccHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccC Q lcl|NC_019422. 278 KIIQ--------SKYSEDEWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYAS 335 (384) Q Consensus 278 ~~l~--------~~~~e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d 335 (384) .+|| +++.|++...|+++||.|+++.||++||++++.+. +.+++|+-+.....+ T Consensus 158 ~~lG~~~~~~~~~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~~~~~~----~~~~~F~~~~~~d~~ 219 (219) T protein:vir:98 158 GLSGIIPVNTAGLGDPLKIREAYQADEVLPLQEIIAESINSDYEIKS----ALKVNFKQPEKRDKN 219 (219) T ss_pred HHcccccCCCCCccCHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCC----ccEEeecCcccccCC Confidence 9986 35678999999999999999999999998754432 346777655554444 No 106 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.91 E-value=1.1e-24 Score=151.88 Aligned_cols=368 Identities=10% Similarity=0.093 Sum_probs=218.2 Q ss_pred CcchhhhcccCC---Ccch-hHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHH Q lcl|NC_019422. 1 MNIFKSKKKNKE---APGK-VMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYI 76 (384) Q Consensus 1 M~~f~~~~~~~~---~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~ 76 (384) |++.+...+... +..+ .....-....-.+......|..++.++++|+.+|+.+.+..+.+.- + +.+...-..+ T Consensus 1 ~~~~D~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~vd~~a~d~~r~~~~i~~--~-d~~~~~~~~~ 77 (437) T protein:vir:52 1 MKFFDGIKSLALKLGSKQEQTYYSPSLSLTDDLVQLEALWRDNWIANKVCIKRPEDMVRNWREIYS--N-DLNSKQLDLF 77 (437) T ss_pred CchhhhhHhHHhcCCCccccceeecCccccccHHHHHHHHHhCchhhHHhhcchHHhhcCCceEec--C-CCCHHHHHHH Confidence 777765433211 1111 0100000000111122234667899999999999999999998742 1 1111111122 Q ss_pred HHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC---------CceeeEEEEcCceEEEEEcCC--------CEE Q lcl|NC_019422. 77 KFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDY---------NMPTQIYPLNALNVEAIYENE--------VLF 139 (384) Q Consensus 77 ~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~---------g~~~~l~~l~~~~v~~~~~~~--------~~~ 139 (384) ...+.+= ...+-+..++.+.-++|.+++++..++. |.+..+.++++..+++..... +.. T Consensus 78 ~~~~~~l----~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~~~~~~~dp~s~~fg~p 153 (437) T protein:vir:52 78 TKFERSL----KLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPKWKISPTGTKDDDVLSPNFGRY 153 (437) T ss_pred HHHHHhh----cHHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEechhhccccccccccccccccCcc Confidence 2222222 2345555666667789999999988753 678899999999887533221 222 Q ss_pred EEEEEc-CceEEEEehhheEEEecc---CCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCC---CC Q lcl|NC_019422. 140 LKFLLR-NGKIVSYPYSDIIHLRKD---FNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKT---AL 212 (384) Q Consensus 140 ~~~~~~-~g~~~~~~~~evih~~~~---~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~---~~ 212 (384) ..|... ++..+.+-++.||||... .+.+.+.|.|+++.+.+.|.............+.+...+ ++++++ .+ T Consensus 154 ~~y~v~~~~~~~~iH~SRii~~~~~~~~~~~~~~~G~s~le~~~~~i~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~l 231 (437) T protein:vir:52 154 SEYSILGGSQSITVHHSRLIILNANDAPLSDNDIWGVSDLEKIIDVLKRFDSASVNVGDLIFESKID--IFKIAGLSDKI 231 (437) T ss_pred eEEEEecCCcceeEccceeEEecCccCCCccccccCCchHHHHHHHHHHHHHHHHHHHHHHHHcCCC--ceecchHHHHh Confidence 233333 334556788899999632 345677899999999999999999998888888776554 445542 23 Q ss_pred ChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhccc------cHH Q lcl|NC_019422. 213 RPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQSK------YSE 286 (384) Q Consensus 213 ~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~~------~~e 286 (384) +....+.+.+.+ +.+... .+.+++++++.+.+|+.++.+..+..- .+.....+||++.+||..+|.|. +.+ T Consensus 232 ~~~~~~~~~~~~-~~~~~~-~~~~~~~~~d~~~~~e~~~~~~sgl~~-~l~~~~~~iaaa~~iP~t~L~G~s~~Glasge 308 (437) T protein:vir:52 232 AAGMENEVASVI-SAVQEI-KSATNSLLLDAENEYDRKELTFTGLKD-LLTEFRNAVAGAADMPVTILFGQSVSGLASGD 308 (437) T ss_pred cCCcHHHHHHHH-HHHHHh-cCCCceEEEcCCcceEEEecCcCCHHH-HHHHHHHHHHHHhcCchhhhcCcCcccccccH Confidence 222112222222 222222 345778999999999998877655442 23456779999999999887432 334 Q ss_pred HHHHHHHH-------HHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHH-------HH-HHhCCCC Q lcl|NC_019422. 287 DEWNAYYE-------SEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNL-------VQ-MVDRGSL 351 (384) Q Consensus 287 ~~~~~~~~-------~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~-------~~-~~~~g~~ 351 (384) ....+|+. ..+.|+++.+-..|-+..+... .. .+.|.+++|...+.+++++. ++ ++++|++ T Consensus 309 ~D~~~yyd~i~~~Qe~~l~p~le~l~~~i~~~~~g~~--~~--~~~~~f~pL~~~s~kekae~~~~~a~a~~~~~~~g~i 384 (437) T protein:vir:52 309 EDIQNYHEAIRRLQETRLRPIFEIIDPLICNELFGGL--PA--DWWFEFVPLTTVKQEQQINMLNTFATAANTLIQNGVL 384 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--CC--cceEEeCCcCCcCHHHHHHHHHHHHHHHHHHHhcCCC Confidence 45555543 4578888888777776655532 22 35555667777776666543 32 5778999 Q ss_pred CHHHHHHHh----CCCCCCCCCeeee------cCceeecCCCC Q lcl|NC_019422. 352 TPNEWRKIM----NLSPIENGDKPVR------RLDTAVVEGGE 384 (384) Q Consensus 352 t~NE~R~~l----G~~p~~~gd~~~~------~~n~~~~~~ge 384 (384) +++|+|+.| .++.++..|..-. +.+..+.++.+ T Consensus 385 ~~~e~r~~L~~~g~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 427 (437) T protein:vir:52 385 NEYQIANELRESGLFANISAEHIEELKNADEFAGNFEEPEKME 427 (437) T ss_pred CHHHHHHHHHhcCCCCCCCccccccccCCCCCCCccCCCCCCC Confidence 999999987 2333433221111 12222222111 No 107 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.83 E-value=1.8e-20 Score=128.78 Aligned_cols=368 Identities=12% Similarity=0.069 Sum_probs=207.8 Q ss_pred CcchhhhcccCCCc-chhHHHhh-ccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAP-GKVMMELI-SDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKF 78 (384) Q Consensus 1 M~~f~~~~~~~~~~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~ 78 (384) +|.|.-........ ......+. .....++ ..-..|..++.++.+|+.+|+++.+-.+.+.-.++++. ..+.... T Consensus 55 ~~~~a~~~g~~~~~~~~~~~~~~~~~~~~~~-~l~a~Y~~~~l~r~~Vd~~aed~~r~~~~i~~~~~~~~---~~~~~~~ 130 (532) T protein:vir:94 55 QNAMAMDYGLQTGRNGRNALSFVEATSWPGF-PTLALLAQLPEYRTMHETPADECVRAWGKITCSSKDEL---AADKATR 130 (532) T ss_pred ccccccccccCcccccccccccccccccchH-HHHHHHHcCchhhhhhccchHHHhhCCceEeeCCcccc---chHHHHH Confidence 11111000000000 00000000 0000111 11123457889999999999999999888743332221 2222222 Q ss_pred HHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCC-------------------CCceeeEEEEcCceEEEEEcC--C- Q lcl|NC_019422. 79 LLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDD-------------------YNMPTQIYPLNALNVEAIYEN--E- 136 (384) Q Consensus 79 l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~-------------------~g~~~~l~~l~~~~v~~~~~~--~- 136 (384) |....... ...+-+..++.+..++|.+++++.... .|.+..+.+++|.+|++.... + T Consensus 131 i~~~~~~l-~v~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p~~l~~~~I~~g~~~~l~vld~~~v~p~~~~~~dp 209 (532) T protein:vir:94 131 ITQKLEQY-NVRTLVRTVVIHDQAYGGAHVFPHLKMDGDSVPADAPLLLSPSFVQRGCLIGFATIEPMWLSPNAYNATDP 209 (532) T ss_pred HHHHHHhh-hHHHHHHHHHHhhhcccceEEEEEeccCCccccccccccccccccccceeeEEEeechheecccccccccc Confidence 22222222 234455556667778999988875432 234568889999988865322 1 Q ss_pred -----CEEEEEEEcCceEEEEehhheEEEeccC------CCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceE Q lcl|NC_019422. 137 -----VLFLKFLLRNGKIVSYPYSDIIHLRKDF------NENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWL 205 (384) Q Consensus 137 -----~~~~~~~~~~g~~~~~~~~evih~~~~~------~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~i 205 (384) +....|....|+ .+-++.|+||.... +....+|.|.++.+...+.................... + T Consensus 210 ~sp~fg~P~~y~v~~g~--~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~--v 285 (532) T protein:vir:94 210 TLPSFYKPDSWIATSGK--KIHSSRIHTVVGRPVGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQFSMT--N 285 (532) T ss_pred cccccCCceeEEEccCe--eeccceEEEecCCCchhhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCc--e Confidence 112233344444 35678899986332 22345799999999999999988888888877665443 3 Q ss_pred EeeC--CCCChHHHHHHHHHHHHHhccccccCCcceecCC-CceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhcc Q lcl|NC_019422. 206 LKFK--TALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDS-KYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQS 282 (384) Q Consensus 206 l~~~--~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~-g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~ 282 (384) ++++ ..++.+..+++.+++..... .. +..++++++. ..+|++++.+..++.- -+.....+||++.|||...|.| T Consensus 286 ~k~~~a~~ls~~~~~~~~~r~~~~~~-~~-~n~g~~~id~~~e~~e~~~~~lsgl~~-~l~~~~~~iAaa~~IP~t~LfG 362 (532) T protein:vir:94 286 LATDMAQLLAPGGAQSLDARLQLFNL-YR-DNRNIGALDKGTEEIQQTNTPLSGLDS-LQAQSQEQMAAVSHIPLVKLLG 362 (532) T ss_pred eeechHHhhcchhHHHHHHHHHHHHh-hc-CCccceEEcCCCceeEEEecccCCHHH-HHHHHHHHHHhHhCCCeeeeec Confidence 3443 33444555555555543222 22 2345667764 5788888766555432 1345677899999999997632 Q ss_pred -------ccHHHHHHHHHH-------HHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHH------ Q lcl|NC_019422. 283 -------KYSEDEWNAYYE-------SEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNL------ 342 (384) Q Consensus 283 -------~~~e~~~~~~~~-------~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~------ 342 (384) ++.+....+|+. ..+.|+++.+-+.|-+..+... .. .+.|.+.+|...+.+++++. T Consensus 363 ~sp~GlnstGe~D~~~yyd~I~s~Qe~~l~p~le~l~~~l~~s~~g~~--~~--d~~~~f~pL~~~s~kEkAei~~~~a~ 438 (532) T protein:vir:94 363 ITPNGLNASSDGEIRVWYDFIAGYQATNLTPLMEWIIDLIQLSEYGQI--DP--GLAWEWSPLMELDDKELAEVRQLNAS 438 (532) T ss_pred CCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--CC--CceEEeCCCCCCCHHHHHHHHHHHHH Confidence 223434444433 4478888888888876655432 22 35555677777777776653 Q ss_pred -H-HHHhCCCCCHHHHHHHhCCCCCCCCCeeeecCc----------------eeecCCCC Q lcl|NC_019422. 343 -V-QMVDRGSLTPNEWRKIMNLSPIENGDKPVRRLD----------------TAVVEGGE 384 (384) Q Consensus 343 -~-~~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n----------------~~~~~~ge 384 (384) + +++.+|++++||+|+.++..|..+.+.....-+ ..+.+.++ T Consensus 439 a~~~~~~~Gvi~~~Evr~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (532) T protein:vir:94 439 TDSTLMELGVIDAKMVQQRLAADPTSGYAGALGERDELDDVEEIAKQLMAAALNPPATAP 498 (532) T ss_pred HHHHHHhcCCCCHHHHHHHHhcCCccccccccccccccccccchhhhhcccccCCCCCCC Confidence 2 367889999999999999988755333221111 11111111 No 108 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.83 E-value=2.1e-20 Score=128.49 Aligned_cols=364 Identities=10% Similarity=0.052 Sum_probs=202.7 Q ss_pred CcchhhhcccCC------Ccc-hhHHH--------------hhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceE Q lcl|NC_019422. 1 MNIFKSKKKNKE------APG-KVMME--------------LISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAK 59 (384) Q Consensus 1 M~~f~~~~~~~~------~~~-~~~~~--------------~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 59 (384) +++..+.-+... .+. ...+. +.+....++ .....|..++.++.+|+.+|+.+.+-.+. T Consensus 55 ~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~l~a~Y~~~~l~r~iVd~~A~d~~r~~~~ 133 (537) T protein:vir:10 55 IAMMPKVDGSHPDMAMDGLDVEGGTFSAYANPNLSEGLVLWYAQQAFIGH-QMCALIATHWLVNKACSQMPRDAMRKGYK 133 (537) T ss_pred CcccccccccccchhccccccchhhhhhhccccccchhhhhccccCCccH-HHHHHHHhCchhhhhhhhhhHHhhcCCce Confidence 333332111000 000 00000 000000111 11123457899999999999999998888 Q ss_pred EEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeC-C---------------CCceeeEEE Q lcl|NC_019422. 60 HIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKD-D---------------YNMPTQIYP 123 (384) Q Consensus 60 ~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~-~---------------~g~~~~l~~ 123 (384) +.-.++++ .+.+....|....+.... .+-+..++.+..++|.+++++... . .|....+.+ T Consensus 134 i~~~~~~~---~~~~~~~~l~~~~~~l~~-~~~l~~a~~~~rlyG~~~i~i~v~~~D~~~~~~Pl~~~~i~kg~~k~l~v 209 (537) T protein:vir:10 134 IISDDGNE---LDPKDAKFIDRYDRAFNI-KKHAIQFVRKGRIFGIRIALFKVDSPDPYYYEKPFNIDGVMPGAYKGIVQ 209 (537) T ss_pred eecCCccc---ccHHHHHHHHHHHHHhhH-HHHHHHHHHhcccccceEEEEeecCcCCcccccccccccccccceeEEEE Confidence 74222221 222223333333333333 344444555556789988886532 1 234567888 Q ss_pred EcCceEEEEEc----CC------CEEEEEEEcCceEEEEehhheEEEeccC------CCCCccCccHHHHHHHHHHHHHH Q lcl|NC_019422. 124 LNALNVEAIYE----NE------VLFLKFLLRNGKIVSYPYSDIIHLRKDF------NENDLFGTSPAKVLEPIMEVVNT 187 (384) Q Consensus 124 l~~~~v~~~~~----~~------~~~~~~~~~~g~~~~~~~~evih~~~~~------~~~~~~G~s~~~~~~~~i~~~~~ 187 (384) ++|.++++... .+ +....|.. .|+ .+-++.|+|+.... +...+.|.|.++.+...|..... T Consensus 210 idp~~~~~~~~~~~~~dp~sp~fg~P~~y~v-~g~--~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~ 286 (537) T protein:vir:10 210 IDPYWCAPLLDAQASSNPVSMHFYEPTYWLI-NGK--KYHRSHLAIYINDEVVDFLKPSYIYGGVPLPQQIMERVYAAER 286 (537) T ss_pred echhhcccccchhhhccCCccccCCceeeee-cCe--EecceeEEEecCCCCchhhhcccCcccccHHHHHHHHHHHHHH Confidence 99988875321 11 11223333 333 35668899985332 22456799999999999999988 Q ss_pred HHHHHHHHHHccCCcceEEeeCC--CCChHHHHHHHHHHHHHhccccccCCcceecCC-CceeeecccchhHHHHHHHHH Q lcl|NC_019422. 188 TDQGVVKAIKNSNTIKWLLKFKT--ALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDS-KYDAEQVKAESYVPNAAQMDK 264 (384) Q Consensus 188 ~~~~~~~~~~ng~~p~~il~~~~--~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~-g~~~~~l~~~~~~~~~~~~~~ 264 (384) ........+.+.... ++++++ .+..++ .+.+.+.. +....++ .++++++. +.+|+.++.+...++- -+.. T Consensus 287 t~~~~~~l~~~~~~~--v~k~~~~~~l~~~~--~~~~r~~~-~~~~r~n-~g~~~id~e~e~~e~~~~~lsgl~~-~l~~ 359 (537) T protein:vir:10 287 TANEGPMLAMTKRQT--VLKVDAAQVLANKQ--QFDETMSW-WTATRDN-YQVRVVDKDNEDVVQIDTTLNDLDK-VIMN 359 (537) T ss_pred HHHHHHHHHHhcCCc--eeeechHHhhcCHH--HHHHHHHH-HHhhcCC-cceeEecCCCceeEEEeccCCCHHH-HHHH Confidence 888888888776654 444442 333222 22222222 2222333 45666665 5888888766554431 2245 Q ss_pred HHHHHHHHhCCCHHHhcc-------ccHHHHHHHHH------HHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhh Q lcl|NC_019422. 265 AIQRLYSFFNTNEKIIQS-------KYSEDEWNAYY------ESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNL 331 (384) Q Consensus 265 ~~~~I~~~fgvp~~~l~~-------~~~e~~~~~~~------~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~ 331 (384) ....||++.|||...|-| ++.+....+|+ +..+.|.++.+.+.|-+..+.+ ...++|.+.+| T Consensus 360 ~~~~iAa~~~IP~t~L~G~sp~GlnatGe~D~~~yyd~I~~~Qe~l~p~l~~l~~ll~~~~~~~-----~~~~~i~f~pL 434 (537) T protein:vir:10 360 QYQLVCAIARTPAPKMLGTVPTGFNSTGDYEEASYHEECESTQDDMRPLIDRHHQLVCRSHLRK-----RIRVKVEFPPM 434 (537) T ss_pred HHHHHHhhhCCCceeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-----CcceEEEeCCC Confidence 667899999999997632 12343444433 3358899988888887766653 23466777888 Q ss_pred hccCHHHHHHH-------HH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCce------------eec-------CCCC Q lcl|NC_019422. 332 QYASMSTKLNL-------VQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDT------------AVV-------EGGE 384 (384) Q Consensus 332 ~~~d~~~~~~~-------~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~------------~~~-------~~ge 384 (384) ...+.+++++. ++ ++.+|++++||+|+.|+.+|.-+-+.+....+. .+. +++| T Consensus 435 ~~~s~kEkAei~~~~a~a~~~~~~~G~i~~~Evr~~L~~~~~~g~~~l~~~~~~ed~e~~~~~~~~~~~~~~~~~~~~~~ 514 (537) T protein:vir:10 435 DAPKESERADTFLKKMQAAKLAFEMGAVDGVDVNEYLRMDPTLGFTSITPAMRPTDAEDIDVDDEGKPVRIIEDQPAPSE 514 (537) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhccCccccccccCCCChhhhhcccCCccCCcCCCCCCCCCccc Confidence 88888887764 33 678899999999999987664322211110000 000 0000 No 109 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.82 E-value=1.2e-20 Score=129.84 Aligned_cols=359 Identities=14% Similarity=0.104 Sum_probs=199.2 Q ss_pred CcchhhhcccCCCcchhHHHhhccc-cC---cce--------echhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDS-GN---GFY--------SWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEF 68 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~-~~---~~~--------~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~ 68 (384) ||+|.++++.+..........+... +. ..+ .....|..++.++.+|+.+|+.+.+..+.+- +++.. T Consensus 1 ~~~~m~~~~~~~~~~D~~~~~~~~~~g~~~~~~~~~~~~~~~~l~~~Y~~~~l~~~~Vd~~aed~~r~g~~i~--g~~~~ 78 (435) T protein:vir:79 1 MGVFMSDKVKAITKEDGYNEIFGSKDGTFRPNAFYMQRAAFKALSQFYEEDGMARRIVDVIPEEMVTPGFKVD--GVKNE 78 (435) T ss_pred CCcccccccccchhhcchhhhhcccccccccCcccCCcCCHHHHHHHHhcCchhhhhhccchHHhhcCCceec--CCChH Confidence 9999988765554433332222110 00 111 1112345788999999999999999888762 11111 Q ss_pred eeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEee-CC---------CCceeeEEEEcCceEEEEEcC-C- Q lcl|NC_019422. 69 KTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIK-DD---------YNMPTQIYPLNALNVEAIYEN-E- 136 (384) Q Consensus 69 ~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~-~~---------~g~~~~l~~l~~~~v~~~~~~-~- 136 (384) . .....+.+ ....+-+..++....++|.+++++.. +. .|.+..+.++++..+++.... + T Consensus 79 ~-----~~~~~~~~----l~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~~~g~i~~i~v~d~~~i~~~~~~~dp 149 (435) T protein:vir:79 79 K-----SFKSRWDE----LRLNAKIIDALSWSRLFGGSAILAVVADNKMLKSPVKPGAQLEDIRVYDRYQITIHERETNA 149 (435) T ss_pred H-----HHHHHHHH----hhHHHHHHHHHHhhhccccEEEEEEecCCCCcccccccCCceeeEEeechhhccchhhccCC Confidence 1 11111111 12335566666777789998888774 22 245668889999888753321 1 Q ss_pred -----CEEEEEEEc--Cc-eEEEEehhheEEEecc------CCCCCccCccHH-HHHHHHHHHHHHHHHHHHHHHHccCC Q lcl|NC_019422. 137 -----VLFLKFLLR--NG-KIVSYPYSDIIHLRKD------FNENDLFGTSPA-KVLEPIMEVVNTTDQGVVKAIKNSNT 201 (384) Q Consensus 137 -----~~~~~~~~~--~g-~~~~~~~~evih~~~~------~~~~~~~G~s~~-~~~~~~i~~~~~~~~~~~~~~~ng~~ 201 (384) +.+..|... ++ ..+.+-++.|+|+... .+....+|.|++ +.+.+.+.............+.+... T Consensus 150 ~sp~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~~ 229 (435) T protein:vir:79 150 RSVRYGEPKLYKISPGGDIPEFFVHYSRICIIDGERVSNEKRRQNDGWGASILNKRLIEAIVDYNYCQELATQLLRRKQQ 229 (435) T ss_pred cccccCcceEEEEecCCCCCceEEcceeEEEecCCcchhhhccccCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 122223222 21 2345667889998532 234567899998 58889999998888888887766544 Q ss_pred cceEEeeCC---CCC-hHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCH Q lcl|NC_019422. 202 IKWLLKFKT---ALR-PDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNE 277 (384) Q Consensus 202 p~~il~~~~---~~~-~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~ 277 (384) . ++++++ .++ ++......+++.... ...++.+.+++.+++.+|+.++.+..+..- -+.....+||++.|||. T Consensus 230 ~--v~~~~~l~~~~~~~~~~~~~~~r~~~~~-~~~~~~~~~~i~~~~e~~e~~~~~lsgl~~-~~~~~~~~iaaa~~IP~ 305 (435) T protein:vir:79 230 A--VWKARDLALMCDDEEGRYAARLRLAQVD-DESGVGKAIGIDATDEEYEVLNSDVSGVPE-FLQEKIDRIVALTGIHE 305 (435) T ss_pred c--cccchhHHHhhcCccchHHHHHHHHHHH-HhcCCCCceeEecCCcceEEEecccCCHHH-HHHHHHHHHHhhhCCCe Confidence 3 344432 111 122223333332222 223344556666666788888766655432 13556779999999999 Q ss_pred HHhccc-------cHHHHHHHHHH-------HHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHH- Q lcl|NC_019422. 278 KIIQSK-------YSEDEWNAYYE-------SEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNL- 342 (384) Q Consensus 278 ~~l~~~-------~~e~~~~~~~~-------~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~- 342 (384) ..|.|. +.+....+|+. ..+.|.++.+-..+ ... . .+.|.+++|...+.+++++. T Consensus 306 t~L~G~s~~glnstgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~li----~~s----~--d~~~~f~pL~~~sekEkAei~ 375 (435) T protein:vir:79 306 IIIKNKNTGGVSASQNTALETFYKLIDRKRVEDYKPILEFLLPFM----ISE----T--EWSIEFEPLSVPSDKDKAEIM 375 (435) T ss_pred eeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hcC----C--CCeEEeCCCCCCCHHHHHHHH Confidence 877431 22334444433 33555554443332 211 2 34555677877777766653 Q ss_pred ------H-HHHhCCCCCHHHHHHHh-CCC---CCCCCCeeee--cCcee---ecCCCC Q lcl|NC_019422. 343 ------V-QMVDRGSLTPNEWRKIM-NLS---PIENGDKPVR--RLDTA---VVEGGE 384 (384) Q Consensus 343 ------~-~~~~~g~~t~NE~R~~l-G~~---p~~~gd~~~~--~~n~~---~~~~ge 384 (384) + +++..|+++++|+|+.| ... .+.+.+..-+ +.+.. .-++|| T Consensus 376 ~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~~~d~~~~~~~e~g~ 433 (435) T protein:vir:79 376 AKNVESVVKLKAEQAINLKETRDTLRSICPDLKIMDNDNIELPEPEDLDPEPGQEGGL 433 (435) T ss_pred HHHHHHHHHHHhcCCCCHHHHHHHHHHhccccCCCCcccccCCccccCCCCCCCCCCC Confidence 2 25678999999999876 211 1211111111 12221 225566 No 110 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=99.77 E-value=1.5e-17 Score=112.84 Aligned_cols=369 Identities=9% Similarity=0.022 Sum_probs=217.6 Q ss_pred CcchhhhcccCCCcchh---HHH--------------------hhccccCcceechhhh-----hhcHHHHHHHHHHHHh Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKV---MME--------------------LISDSGNGFYSWHGNL-----YKSDIVRSIIRPKAKA 52 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~---~~~--------------------~~~~~~~~~~~~~~~~-----~~~~~v~~~i~~ia~~ 52 (384) ...+++.........+. ... .+..-..+....--.+ .+.+.|.+|++.+... T Consensus 5 ~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~~~m~e~D~~i~s~l~~Rk~a 84 (528) T protein:vir:10 5 VDIYGNPLRTQQLRKQQTAHLAGLAKEFANHPAKGLTPAKLAHILIEAEQGHLQAQAELFMDMEERDAHLFAEMSKRKRA 84 (528) T ss_pred ECCCCCccccccccchhhhhhhhhhhhhcccCCCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHHHHHHHHH Confidence 33344332221111111 110 1111101110000011 1578899999999999 Q ss_pred hccCceEEEEecCCcc-eeccchHH-HHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC---CceeeEEEEcCc Q lcl|NC_019422. 53 VGKMTAKHIRSNETEF-KTNPEIYI-KFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDY---NMPTQIYPLNAL 127 (384) Q Consensus 53 ia~~~~~~~~~~~~~~-~~~~~~~~-~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~---g~~~~l~~l~~~ 127 (384) |.+++|.|....++.. .....+.. ..|...| .+.+++..+ .+.+.+|-+++++++... ..+..+.+.++. T Consensus 85 v~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~----~f~~~i~~~-lda~~~G~s~~Ei~w~~~~g~~~~~~~~~r~~~ 159 (528) T protein:vir:10 85 VLGLDWTIEPPRNASAAEKADAEYLHELLLDLE----GIEDLMLDC-MDGVGHGYSAIELDWSLQGREWLPQAFDHRPQS 159 (528) T ss_pred HhcCCceEecCCCCCHHHHHHHHHHHHHHhCCc----cHHHHHHHH-HhhhhhcceeEEEEEeecCCceeEEEeeeeccc Confidence 9999999965432211 11111112 2222222 244555554 446679999999987543 247788899999 Q ss_pred eEEEEEcCCCEEEEEEEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEe Q lcl|NC_019422. 128 NVEAIYENEVLFLKFLLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLK 207 (384) Q Consensus 128 ~v~~~~~~~~~~~~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~ 207 (384) ++++..+.+ ..+........-..+++...++.++.......+|.+.+..|.-....-....++-..|.+.-|.|-.+.+ T Consensus 160 ~f~~~~~~~-~~l~~~~~~~~g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igk 238 (528) T protein:vir:10 160 WFQLNPDDQ-DELRLRDNSIAGEVLQPFGWIMHKPRSRSGYVARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLPIRLGK 238 (528) T ss_pred ceeeccCCC-cEEeccCCCCCceeecCCCeEEEeecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEe Confidence 888755443 3333333332345566666555556666667789999999999999999999999999999999999999 Q ss_pred eCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchh-HHHHHHH-HHHHHHHHHHhCCCHHH---h-- Q lcl|NC_019422. 208 FKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESY-VPNAAQM-DKAIQRLYSFFNTNEKI---I-- 280 (384) Q Consensus 208 ~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~-~~~~~~~-~~~~~~I~~~fgvp~~~---l-- 280 (384) ++...++++.+++.+.+.+. .+++ .++++.|++++-+..+.. ...+..+ ++.-++|+.+.- -..+ . T Consensus 239 y~~~a~~~ek~~L~~al~~i----~~~~--~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iL-GqtlTs~~~~ 311 (528) T protein:vir:10 239 YPPGTPDEEKVTLLRAVTGL----GHAA--AGIIPESMSIDFQEASKGSAEPFMAMMRWCDDSMSKAIL-GGTLTSQTSE 311 (528) T ss_pred cCCCCCHHHHHHHHHHHHHH----hhCc--EEEecCCceeEEeecCCCChhHHHHHHHHHHHHHHHHHh-hhhhhccccc Confidence 99888888887777666432 2233 455666666665554322 2234443 666667666541 1111 1 Q ss_pred --ccccHH-HHHHHHHHHHHHHHHHHHHHHHhhcccCccc-c--------cCcceEEeechhhhccCHHHHHHHHH-HHh Q lcl|NC_019422. 281 --QSKYSE-DEWNAYYESEIEPVGLQLSNQYTEKLFTRKA-R--------SFGNEIVFEASNLQYASMSTKLNLVQ-MVD 347 (384) Q Consensus 281 --~~~~~e-~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~-~--------~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~ 347 (384) +|+.+. +.......+.+.-.++.++..||+.|+...- . ...++++|+.. ...|++++++..+ ++. T Consensus 312 g~~gS~Alg~vh~~v~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~~~~~--e~eDl~~~a~~~~~L~~ 389 (528) T protein:vir:10 312 SGGGAYALGQVHNEVRHDLLAADARQLAATLSRDLLWPLLVLNRSGNLDARRAPRLVFDLK--DRADLAAMATSLPPLVK 389 (528) T ss_pred cccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCC--CcccHHHHHHHHHHHHh Confidence 122221 2234556677778888899999988755421 1 11234555444 4677888887765 677 Q ss_pred CCC-CCHHHHHHHhCCCCCCCCCeeeecCceeecCC-----C-C Q lcl|NC_019422. 348 RGS-LTPNEWRKIMNLSPIENGDKPVRRLDTAVVEG-----G-E 384 (384) Q Consensus 348 ~g~-~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~~-----g-e 384 (384) .|+ ++..++|+.+|+|....++.++.+....+... + . T Consensus 390 ~G~~i~~~~i~e~~gip~p~~~e~~~~~~~~~~~~~~~~~~~~~ 433 (528) T protein:vir:10 390 LGVQVPVNWVQEQLGIPLPANGEAVLGDQAGAGIAQLSRRPGPR 433 (528) T ss_pred CCCCCCHHHHHHHhCCCCCCCCcccccCCCcccccccCcccccc Confidence 786 89999999999987666666654332221100 0 0 No 111 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=99.76 E-value=3.4e-17 Score=110.90 Aligned_cols=370 Identities=9% Similarity=0.024 Sum_probs=218.9 Q ss_pred Ccc----hhhhcccCCCc---chhHH--------------------HhhccccCcceechhhh----h-hcHHHHHHHHH Q lcl|NC_019422. 1 MNI----FKSKKKNKEAP---GKVMM--------------------ELISDSGNGFYSWHGNL----Y-KSDIVRSIIRP 48 (384) Q Consensus 1 M~~----f~~~~~~~~~~---~~~~~--------------------~~~~~~~~~~~~~~~~~----~-~~~~v~~~i~~ 48 (384) |.- +++-....... ..... ..+..-..+.....-.+ + +.+.|.+|+.. T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iLr~a~~gd~~~~~~L~e~m~e~D~~i~s~l~~ 80 (526) T protein:vir:99 1 MAQIVDVYGNPIRTQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAEMSK 80 (526) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCcCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHHHHH Confidence 322 22211110000 01111 11111111110000011 2 47899999999 Q ss_pred HHHhhccCceEEEEecCCcc--eeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCC---ceeeEEE Q lcl|NC_019422. 49 KAKAVGKMTAKHIRSNETEF--KTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYN---MPTQIYP 123 (384) Q Consensus 49 ia~~ia~~~~~~~~~~~~~~--~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g---~~~~l~~ 123 (384) +...|.+++|.|....++.. +...+-....|...| .+.+++..+. +.+.+|-+++++++...| .+..+.+ T Consensus 81 Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~----~~~~~i~~~l-da~~~G~s~~Eivw~~~~g~~~~~~l~~ 155 (526) T protein:vir:99 81 RKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLE----GLEDLLLDAL-DGIGHGYSCIELEWALQGREWMPLAFHH 155 (526) T ss_pred HHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhccc----CHHHHHHHHH-HhhhhcceeEEEEEeecCCceeEEEeee Confidence 99999999999965432221 111122222232223 3566666664 577899999999876543 4778999 Q ss_pred EcCceEEEEEcCCCEEEEEEEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcc Q lcl|NC_019422. 124 LNALNVEAIYENEVLFLKFLLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIK 203 (384) Q Consensus 124 l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~ 203 (384) .++.++.+..+.+. ...+....+.-..+++...+..++.......+|.+.+..|.-....-....++-..|.+.-|.|- T Consensus 156 r~~~~f~~~~~~~~-~l~~~~~~~~g~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~ 234 (526) T protein:vir:99 156 RPQSWFQLNPEDQN-ELRLRDNSPAGEALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPI 234 (526) T ss_pred ecccceeeccCCCc-EEEecCCCCCceeecCCCeEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCce Confidence 99998887655543 33333333344556666554445565667789999999999999988889999999999999999 Q ss_pred eEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchh-HHHHHHH-HHHHHHHHHHh-CCCHHH- Q lcl|NC_019422. 204 WLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESY-VPNAAQM-DKAIQRLYSFF-NTNEKI- 279 (384) Q Consensus 204 ~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~-~~~~~~~-~~~~~~I~~~f-gvp~~~- 279 (384) .+.+++...++++.+++.+.+.+ + .+++ .++++.|++++-+..+.. ...+..+ ++.-++|+.+. |=-... T Consensus 235 ~igky~~~a~~~ek~~L~~av~~-i---~~d~--~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~ 308 (526) T protein:vir:99 235 RLGKYPPGTADEEKATLLRAVTG-L---GHAA--AGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAVLGGTLTST 308 (526) T ss_pred EEEecCCCCCHHHHHHHHHHHHH-H---hhCc--EEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 99999888888887777766543 2 2233 556666766665554322 2334444 66677777664 211111 Q ss_pred h--c--cccHH-HHHHHHHHHHHHHHHHHHHHHHhhcccCcccc---------cCcceEEeechhhhccCHHHHHHHHH- Q lcl|NC_019422. 280 I--Q--SKYSE-DEWNAYYESEIEPVGLQLSNQYTEKLFTRKAR---------SFGNEIVFEASNLQYASMSTKLNLVQ- 344 (384) Q Consensus 280 l--~--~~~~e-~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~---------~~~~~i~fd~~~~~~~d~~~~~~~~~- 344 (384) . | ++.+. +.......+.+.-.++.++..||+.|+...-. ...++++|+. ....|++++++..+ T Consensus 309 ~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~--~e~eDl~~~a~~~~~ 386 (526) T protein:vir:99 309 TSQSGGGAFALGQVHNEVRHDLLASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDL--REQADITSMAQSIPA 386 (526) T ss_pred cccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCC--CCcccHHHHHHHHHH Confidence 1 1 12221 22344556667778888888888877654211 1224555544 34677888888765 Q ss_pred HHhCCC-CCHHHHHHHhCCCCCCCCCeeeecCceee---cCCCC Q lcl|NC_019422. 345 MVDRGS-LTPNEWRKIMNLSPIENGDKPVRRLDTAV---VEGGE 384 (384) Q Consensus 345 ~~~~g~-~t~NE~R~~lG~~p~~~gd~~~~~~n~~~---~~~ge 384 (384) ++..|+ ++..++|+.+|+|....++..+.+..-.+ ...+. T Consensus 387 L~~~G~~i~~~~i~e~~Gip~~~~~e~~l~~~~~~~~~~~~~~~ 430 (526) T protein:vir:99 387 LVNVGLEIPSAWVYDKLGIPQPAKNEPVLRSAAQPAILSRQHGQ 430 (526) T ss_pred HHhCCCccCHHHHHHHhCCCCCCCcccccCCCCCCccccccccc Confidence 677785 89999999999987665655543322111 11111 No 112 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=99.75 E-value=1.9e-17 Score=112.23 Aligned_cols=367 Identities=10% Similarity=0.019 Sum_probs=211.3 Q ss_pred CcchhhhcccCCCcchhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLL 80 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~ 80 (384) .-++..-.+.-..+.+ ..+..-..+....-..++..+.|.+|++.+...|.+++|.|...+++.......+....++ T Consensus 17 ~d~~~~~~~~l~~~~~---~il~~a~~g~~~~y~~l~~D~~i~s~l~~rk~av~~~~w~i~p~~~~~~~~~~ae~v~~~l 93 (488) T protein:vir:99 17 RDITRPFISGLQVPND---SILQRRGGNDLRVYEEILSDAQVKTVWGQRQLAVVSREWKVEAGGDRPIDQAAAEHLEQQL 93 (488) T ss_pred hhhhccccCCCCCCCh---HHHHhhccCCHHHHHHHhhChHHHHHHHHHHHHHhcCCceEEcCCCChHHHHHHHHHHHHH Confidence 1111111111111111 1111000110011133466889999999999999999999965433211111112233333 Q ss_pred hhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCC---ceeeEEEEcCceEEEEEcCCCEEEEEEEcCceEEEEehhhe Q lcl|NC_019422. 81 ENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYN---MPTQIYPLNALNVEAIYENEVLFLKFLLRNGKIVSYPYSDI 157 (384) Q Consensus 81 ~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g---~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~~~ev 157 (384) .+ ..+.+++..+. +.+.+|-+++++++...| .+..+.+.|+.++.+..+. +..+......+....++...- T Consensus 94 ~~----~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~d~~~-~l~~~~~~~~~~g~~lp~~~~ 167 (488) T protein:vir:99 94 QR----VGWDRVTSKML-FGVFYGYAVSELIYGRDDRYITLEAIKVRNRRRFRYDQDG-GLRLLTPNNMFEGEPCPAPYF 167 (488) T ss_pred hC----CCHHHHHHHHH-hhhhhcceeEEEEEeecCCeeeEeeeeeecccceeecCCC-ceEEeccCCCCCccccccCce Confidence 33 35778888875 578899999999885433 4678899999987764433 233322222223334443322 Q ss_pred EEE-eccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCC-CCChHHHHHHHHHHHHHhccccccC Q lcl|NC_019422. 158 IHL-RKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKT-ALRPDDIKKEVKSFEKNYLQIDSEA 235 (384) Q Consensus 158 ih~-~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-~~~~e~~~~~~~~~~~~~~~~~~~~ 235 (384) +++ ++.......+|.|.+..|.-....-....++-..|.+.-|.|-.+.+++. ..++++.+++.+.+.. ..+++ T Consensus 168 ~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~~a~~~ek~~l~~av~~----~~~~~ 243 (488) T protein:vir:99 168 WHFSTGADNDDEPYGLGLAHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDKTATPEDKAKLLAALHA----IQTDS 243 (488) T ss_pred EEEEeecCCCCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeeeecCCCCCCHHHHHHHHHHHHH----HhcCc Confidence 233 33434556799999999999998889999999999999999988888875 5667776666655443 22333 Q ss_pred CcceecCCCceeeecccchh-HHHHHHH-HHHHHHHHHHh-CCCHH-Hh-ccccH-HHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019422. 236 GGAAATDSKYDAEQVKAESY-VPNAAQM-DKAIQRLYSFF-NTNEK-II-QSKYS-EDEWNAYYESEIEPVGLQLSNQYT 309 (384) Q Consensus 236 ~~~~v~~~g~~~~~l~~~~~-~~~~~~~-~~~~~~I~~~f-gvp~~-~l-~~~~~-e~~~~~~~~~~i~P~~~~i~~~l~ 309 (384) .++++.|++++-++.... ...+..+ ++.-++|+.+. |=-.. -- +|+.+ .+.........+...++.+++.|| T Consensus 244 --~~viP~~~~ie~~ea~~~~~~~~~~li~~~d~~Isk~iLGqtlts~~~~Gs~a~~~vh~~v~~d~~~aDa~~i~~tln 321 (488) T protein:vir:99 244 --AIIMPAGMQAELLEAGRSGTADYKTLHDTMDATIAKVGLGQVASTQGTPGRLGNDDLQADVRLDLVKADADLICESFN 321 (488) T ss_pred --EEEecCCceeEEeecCCCChHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 455666666665554222 2234443 56666776652 10000 00 11222 223344566777888888888888 Q ss_pred hcccCcc-----cccCcceEEeechhhhccCHHHHHHHHH-HHhC-C-CCCHHHHHHHhCCCCCCCCCeeeecCce-eec Q lcl|NC_019422. 310 EKLFTRK-----ARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDR-G-SLTPNEWRKIMNLSPIENGDKPVRRLDT-AVV 380 (384) Q Consensus 310 ~~l~~~~-----~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~-g-~~t~NE~R~~lG~~p~~~gd~~~~~~n~-~~~ 380 (384) ++|+... .....+++.|++. ...|++++++..+ ++.. | -++..++|+.+|+|+...++....+... .+. T Consensus 322 ~~li~~l~~~N~~~~~~p~~~~~~~--e~edl~~~a~~~~~l~~~~G~~i~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~ 399 (488) T protein:vir:99 322 LGPARWLTEWNFPGAQPPRVYRVIE--EPEDITAKAERDEKVFRMSGFRPTRGYVQETYGVEVESTQAEATAPTPSTEFA 399 (488) T ss_pred HHHHHHHHHhCcCCcCCceeEecCC--CcccHHHHHHHHHHHHhhcCCCCCHHHHHHHcCCCCcccccccccCCCcccCC Confidence 8876642 1222345566544 4567888888775 5664 6 4799999999999987655554443322 222 Q ss_pred CCCC Q lcl|NC_019422. 381 EGGE 384 (384) Q Consensus 381 ~~ge 384 (384) +... T Consensus 400 ~~~~ 403 (488) T protein:vir:99 400 EGDQ 403 (488) T ss_pred CCCC Confidence 2111 No 113 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.75 E-value=1.7e-18 Score=117.99 Aligned_cols=367 Identities=14% Similarity=0.048 Sum_probs=197.5 Q ss_pred CcchhhhcccCCCcchhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLL 80 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~ 80 (384) |+++.....-................-++......|..++.++.+|+.+|+.+-+-.+.+- .. +. +....+. T Consensus 20 ~~~~~~~g~~~~~d~~~~~~~~~~~~~~~~~l~~lY~~~~l~r~iVd~~a~d~~r~g~~i~-~~-~~------~~~~~~~ 91 (461) T protein:vir:80 20 NDFMVGHGKANSRDKLTRQTPGNGQKLDLKACENLYASNSIAMNIVDIISEDMVRAGWSLK-TD-NK------EMKKNIE 91 (461) T ss_pred hHHHhhcCCcchhhhhhccccCcccccCHHHHHHHHHhCCccchhhccchHHhhcCCeeee-cC-CH------HHHHHHH Confidence 3333211100000000000000000001111123455678889999999999998888662 21 11 1111111 Q ss_pred hhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC-------------CceeeEEEEcCc---eEEE---EEc----CCC Q lcl|NC_019422. 81 ENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDY-------------NMPTQIYPLNAL---NVEA---IYE----NEV 137 (384) Q Consensus 81 ~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~-------------g~~~~l~~l~~~---~v~~---~~~----~~~ 137 (384) ..-+. ....+-+..++.+..++|.+++++...+. +.+..+..|++. .+.. ..+ .-+ T Consensus 92 ~~~~~-l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~~~~~~~pl~~~~~~~~~~l~~~~~~~i~~~~~~~dp~sp~fg 170 (461) T protein:vir:80 92 SKWRK-LKTKDRFQKLYADKRLYGDGFLSIGVVSSNREQADLSTAIDPKTIKSIPYINTFNTQKVTQLYLNQDMFSEHFG 170 (461) T ss_pred HHHHH-hhHHHHHHHHHHhhcccccEEEEEEeecCCccccCccCCcccccccceeEEEeccccccchhhhcccCcCcccc Confidence 11111 12345566667778899999998864221 112233333332 2111 111 112 Q ss_pred EEEEEEEc--------------CceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcc Q lcl|NC_019422. 138 LFLKFLLR--------------NGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIK 203 (384) Q Consensus 138 ~~~~~~~~--------------~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~ 203 (384) ....|... ++..+.+-++.|||+......+..+|.|.++.+.+.+...............+...+ T Consensus 171 ~P~~y~i~~~~~~~~~~~~~~~~~~~~~iH~SRii~~~~~~~~~~~~G~S~le~~~~~l~~~~~~~~~~~~l~~~~~~~- 249 (461) T protein:vir:80 171 EVEFFEVNRVSQLGEEILSGTTASTSEQIHRSRIIHEQGLRFEGETKGRSIFESLYDIITVMDTSLWSVGQILYDFAFK- 249 (461) T ss_pred cceEEEEeccccccccccccccCccceEEccccEEEecCCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHHHhCCC- Confidence 22222222 122355777899999877777888999999999999999999888888888776554 Q ss_pred eEEeeCC--CCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhc Q lcl|NC_019422. 204 WLLKFKT--ALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQ 281 (384) Q Consensus 204 ~il~~~~--~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~ 281 (384) ++++++ .+..+......+.+.. + . +..++++++.+.+++.++.+..+..- -++.....||++-+||...|. T Consensus 250 -v~k~~~l~~~~~~~~~~~~~~~~~-~---~-~~~g~~~~d~~e~~e~~~~~lsgl~~-~l~~~~~~iaa~s~iP~t~L~ 322 (461) T protein:vir:80 250 -VYKTDDIDALNKDDKANLTAMLDF-M---F-RTEALAIIKGDEQLTKESTNVSGMKD-LLDYGWDYLAGAVRMPKTVLK 322 (461) T ss_pred -ceecchHHhhhchHHHHHHHHHHH-h---c-CCceEEEEcCCcceEEEecCcCCHHH-HHHHHHHHHhhhhcCCeeeee Confidence 455553 2223333333344432 2 1 23457888999899988876655431 235677799999999998774 Q ss_pred c------ccHHHHHHHHH-------HHHHHHHHHHHHHHHhhcccCccc-ccC-cceEEeechhhhccCHHHHHHH---- Q lcl|NC_019422. 282 S------KYSEDEWNAYY-------ESEIEPVGLQLSNQYTEKLFTRKA-RSF-GNEIVFEASNLQYASMSTKLNL---- 342 (384) Q Consensus 282 ~------~~~e~~~~~~~-------~~~i~P~~~~i~~~l~~~l~~~~~-~~~-~~~i~fd~~~~~~~d~~~~~~~---- 342 (384) | +..+....+|+ ...+.|+++.+...|-+..+.... .+. ...+.+.+++|...+.+++++. T Consensus 323 G~s~g~~asge~D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~~~~~i~f~~L~~~s~kekAe~~~~~ 402 (461) T protein:vir:80 323 GQEAGTLTGAQYDVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPDSFEWAIEFNPLWNLDSKTDAEVRKLT 402 (461) T ss_pred cccCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCccccceEEEeCCCCCCCHHHHHHHHHHH Confidence 3 12233444333 245778877777776654443211 111 1356677788888888887663 Q ss_pred ---HH-HHhCCCCCHHHHHHHh----CCCCCC--CCCee----eecCceee--cCCCC Q lcl|NC_019422. 343 ---VQ-MVDRGSLTPNEWRKIM----NLSPIE--NGDKP----VRRLDTAV--VEGGE 384 (384) Q Consensus 343 ---~~-~~~~g~~t~NE~R~~l----G~~p~~--~gd~~----~~~~n~~~--~~~ge 384 (384) ++ ++.+|++|++|+|+.+ |++|.. .|+.. +......+ =+.|+ T Consensus 403 a~a~~~~~~~g~is~~e~r~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 460 (461) T protein:vir:80 403 AEADQIYIVNGVLDPDEVKETRFGRFGLENSSKFSGDSAEIDKLAKLVYDAYAKKNAD 460 (461) T ss_pred HHHHHHHHhcCCCCHHHHHHHHHHhcCCCCCccCCCCCchhhhhhhhccccccccCCC Confidence 23 5788999999999854 343321 11111 00001100 12333 No 114 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.74 E-value=1.1e-17 Score=113.46 Aligned_cols=364 Identities=12% Similarity=0.031 Sum_probs=197.7 Q ss_pred Ccchhh---hcccCCCcchhHHHhh-----ccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceecc Q lcl|NC_019422. 1 MNIFKS---KKKNKEAPGKVMMELI-----SDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNP 72 (384) Q Consensus 1 M~~f~~---~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 72 (384) +|+-+. +............... .....++ .....|..++.++.+|+.+|+++.+-.+.+.-.++ +.+. + T Consensus 101 Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~~~~~f~gy-ql~alY~~~~larkiVd~pAeDatR~g~~I~~~~d-~~e~-~ 177 (862) T protein:vir:99 101 DDGGGAPVPIGAEGKQSSYAVPEALQDWYLSQGFIGH-QACALIAQHWLVDKACSLAGEDAIRNGWHLKSLGE-GEEI-D 177 (862) T ss_pred hcchhhhhhccccccccccccchhccccccccCcccH-HHHHHHHhCchhhhhhhhhhHHHhhCCceEeecCc-cccc-C Confidence 111110 1011010000000000 0111111 11234567899999999999999999888743221 1111 1 Q ss_pred chHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeC-C---------------CCceeeEEEEcCceEEEEE--- Q lcl|NC_019422. 73 EIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKD-D---------------YNMPTQIYPLNALNVEAIY--- 133 (384) Q Consensus 73 ~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~-~---------------~g~~~~l~~l~~~~v~~~~--- 133 (384) ......+...-... ...+-+..++.+.-++|.+++++..+ . .|.+..+.+++|.++.+.. T Consensus 178 ~e~~~~ie~~~~rL-~v~~~l~eair~~RLyGga~ililv~~~D~~~LsqPLn~e~I~kG~lkgl~vlDp~w~~p~~v~~ 256 (862) T protein:vir:99 178 EESLEKFKAIDVEF-KVKENLIEFNRFKNVFGIRVAIFVVDSEDPDYYEKPFNPDGITPGSYRGISQIDPYWMMPMLTAE 256 (862) T ss_pred HHHHHHHHHHHHHh-hHHHHHHHHHHhcccccceEEEEEecCcCchhhhcCcCcccccccceeEEEEechhhhccccccc Confidence 11122222111111 12344445556566788877765421 1 2345778889988877522 Q ss_pred -cCC------CEEEEEEEcCceEEEEehhheEEEeccC------CCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccC Q lcl|NC_019422. 134 -ENE------VLFLKFLLRNGKIVSYPYSDIIHLRKDF------NENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSN 200 (384) Q Consensus 134 -~~~------~~~~~~~~~~g~~~~~~~~evih~~~~~------~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~ 200 (384) ..+ +....|.. .|+ .+-++-|||+.... +...+.|.|.++.+...|.............+.+.. T Consensus 257 ~~~Dp~sp~yGkP~~y~I-~g~--~IH~SRliif~g~~vpd~lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka~ 333 (862) T protein:vir:99 257 STADPSSQFFYEPEFWII-SGQ--KYHRSHLIIARGPQPADILKPTYIFGGIPLVQRIYERVYAAERTANEAPLLAMNKR 333 (862) T ss_pred ccccccccccCCceeeee-cCe--eeccceeEEecCCCchhhhhccCCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 111 11122222 233 23456777774332 233457999999999999999999988888888865 Q ss_pred CcceEEeeCC--CCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHH Q lcl|NC_019422. 201 TIKWLLKFKT--ALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEK 278 (384) Q Consensus 201 ~p~~il~~~~--~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~ 278 (384) .. +++++. .+..+ +.+.+++.. +.... +..++++++.+.+|+.++.+.....-. +.....+||++.+||.. T Consensus 334 l~--v~ktd~l~~l~~e--d~l~~r~~~-~~~~r-dN~Gi~liD~eEe~e~ls~slSGL~dl-l~~~~q~IAaas~IP~t 406 (862) T protein:vir:99 334 TT--AIHTDTAKAIANE--DKFIQRLMF-WVRYR-DNHAVKVLGTDETMEQFDTSLADFDAV-IMGQYQLVASIAKTPAT 406 (862) T ss_pred cc--eeechhHhhhccH--HHHHHHHHH-HHhcc-CcceeEEecCCCceeEEecccCChHHH-HHHHHHHHHhhhCCCce Confidence 43 445543 22222 233333322 22222 335688999999999888776554321 23456689999999999 Q ss_pred Hhcc-------ccHHHHHHHHHH-------HHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHH-- Q lcl|NC_019422. 279 IIQS-------KYSEDEWNAYYE-------SEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNL-- 342 (384) Q Consensus 279 ~l~~-------~~~e~~~~~~~~-------~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~-- 342 (384) .|.| ++.+....+|+. .-+.|+++.+...+...+. ... .+.|.+..|...+.++++++ T Consensus 407 iLfGqspaGlnATGE~D~~nYyD~I~s~QE~~L~P~LerL~~li~~~lg----~~~--d~~ieFnpL~~~sekEkAEi~k 480 (862) T protein:vir:99 407 KLLGTAPKGFNSTGEFETISYHEELESIQEHVYMPFLQRHYLISRLSLG----IQH--EIDVVMEPVASMTAQQQADLNK 480 (862) T ss_pred eecccCcccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC----CCC--cceEEeCCCCCCCHHHHHHHHH Confidence 6632 123444454443 4578888888877655442 222 35555678888887777754 Q ss_pred -----HH-HHhCCCCCHHHHHHHh------CCCCCCCCCe----eeecCceeecC-CCC Q lcl|NC_019422. 343 -----VQ-MVDRGSLTPNEWRKIM------NLSPIENGDK----PVRRLDTAVVE-GGE 384 (384) Q Consensus 343 -----~~-~~~~g~~t~NE~R~~l------G~~p~~~gd~----~~~~~n~~~~~-~ge 384 (384) ++ ++.+|++|++|+|+.| |++.++..|. ...+.|+..++ .|+ T Consensus 481 k~Aea~~~lv~sGvispdEvR~~L~~~~~~g~~~l~ded~E~d~~~~~e~~~~~e~~g~ 539 (862) T protein:vir:99 481 TKAEGGKVLIDGGVISPDEERNRIRDDKRSGYNRLTKEDAEETPGASPENLAAYQKAGA 539 (862) T ss_pred HHHHHHHHHHhcCCCCHHHHHHHHHhcCCcCCCCCCcccccccCCCCcccccccccCCc Confidence 23 5778999999999976 5555433221 11222222211 111 No 115 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.73 E-value=3e-18 Score=116.67 Aligned_cols=356 Identities=13% Similarity=0.062 Sum_probs=189.4 Q ss_pred Ccchhhhccc---CCCcchhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHH Q lcl|NC_019422. 1 MNIFKSKKKN---KEAPGKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIK 77 (384) Q Consensus 1 M~~f~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~ 77 (384) |.--...... ....... .... ..-.+......|..++.++++|+.+|+.+.+-.|++- ++++..... .... T Consensus 1 ~~~~D~~~n~~~gg~~~~~~-~~~~--~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~--~~~~~~~~~-~~~~ 74 (422) T protein:vir:10 1 MVKTDSYANIFLGGSDGSEI-YGSL--QNQAPTILASLYADNALVRRIIDTIPETALAAGFHID--GIDDEPAFW-SRWD 74 (422) T ss_pred CccchhhHHHHcCCCCCccc-cCcc--cccCHHHHHHHHHhChhhHHHHhhhhHHHhcCCcccc--CCCHHHHHH-HHHH Confidence 2111100000 0000000 0000 0000111123456789999999999999999888872 222111111 1122 Q ss_pred HHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEee-C---------CCCceeeEEEEcCceEEEEEcCC-------CEEE Q lcl|NC_019422. 78 FLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIK-D---------DYNMPTQIYPLNALNVEAIYENE-------VLFL 140 (384) Q Consensus 78 ~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~-~---------~~g~~~~l~~l~~~~v~~~~~~~-------~~~~ 140 (384) .| ...+-+..++....++|.+++++.. + ..|....+.++++..+++..... +... T Consensus 75 ~l--------~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~~~~~dp~s~~fg~P~ 146 (422) T protein:vir:10 75 DL--------EMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQTREENPRNARFGEPL 146 (422) T ss_pred Hh--------hHHHHHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEeeccccccchhcccCccccccCcce Confidence 22 2345566666777889999988875 2 23557789999999887643211 2222 Q ss_pred EEEEcC---ceEEEEehhheEEEeccC------CCCCccCccHHHH-HHHHHHHHHHHHHHHHHHHHccCCcceEEeeCC Q lcl|NC_019422. 141 KFLLRN---GKIVSYPYSDIIHLRKDF------NENDLFGTSPAKV-LEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKT 210 (384) Q Consensus 141 ~~~~~~---g~~~~~~~~evih~~~~~------~~~~~~G~s~~~~-~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~ 210 (384) .|...+ +..+.+-++.|||+.... +....+|.|++.. +.+.+.............+.+.... ++++++ T Consensus 147 ~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~~~~~~--v~~~~~ 224 (422) T protein:vir:10 147 TYRITTNESDMFYDVHYSRIHIIDGERIPNVMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLKRKQQA--VWKAKG 224 (422) T ss_pred EEEEecCCCCcceeeccceeEEeCCCCchhhhcccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhccc--cccchh Confidence 232222 223455667788884322 3456689999986 6788999888888888877665543 444443 Q ss_pred ---CCC-hHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhccc--- Q lcl|NC_019422. 211 ---ALR-PDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQSK--- 283 (384) Q Consensus 211 ---~~~-~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~~--- 283 (384) .++ .+.....++++..... ..++.+.+++.+++.+|++++.+.....-. +.....+||++.|||...|.|. T Consensus 225 l~~~~~~~~~~~~~~~r~~~~~~-~~~~~~~~~l~~~~e~~e~~~~~lsgl~~~-~~~~~~~iaaa~~IP~t~L~G~s~~ 302 (422) T protein:vir:10 225 LAELCDDSEGFGAARLRLAQVDN-NSGVGQAIGIDAESEEYSVLNSDIGGIDAF-LDKKFDRIVALSGIHEIILKNKNVG 302 (422) T ss_pred HHHhcCCccchHHHHHHHHHHHH-hcCCccceeEecCCcceEEEecccCChHHH-HHHHHHHHHhhhCCCeeeeccCCcc Confidence 111 2223333343333222 223445566666778899888776654321 3456779999999999877432 Q ss_pred ----cHHHHHHHHHH-------HHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHH-------HH- Q lcl|NC_019422. 284 ----YSEDEWNAYYE-------SEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNL-------VQ- 344 (384) Q Consensus 284 ----~~e~~~~~~~~-------~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~-------~~- 344 (384) +.+....+|+. .-+.|.++.+-..|- . ..++.++ +.+|...+.++++++ ++ T Consensus 303 Glnatgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~~i~----~----s~~~~~~--f~pL~~~sekekaei~~~~a~a~~~ 372 (422) T protein:vir:10 303 GVSSSQNTALETFHKLVDRKRNAELLPILEFLIPFIV----N----AEEWSVE--FNPLAQESSKDKAEILEKNVNSIAA 372 (422) T ss_pred cccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc----c----cCCcEEE--eCCCCCCCHHHHHHHHHHHHHHHHH Confidence 23334444433 345665555444332 1 1234444 567777777766654 22 Q ss_pred HHhCCCCCHHHHHHHhCCCCC----CCC--CeeeecCcee--ecCC--CC Q lcl|NC_019422. 345 MVDRGSLTPNEWRKIMNLSPI----ENG--DKPVRRLDTA--VVEG--GE 384 (384) Q Consensus 345 ~~~~g~~t~NE~R~~lG~~p~----~~g--d~~~~~~n~~--~~~~--ge 384 (384) ++.+|+++++|+|+.|--... .++ ++-.-..-.. |.++ +| T Consensus 373 ~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 422 (422) T protein:vir:10 373 LIAAGAMDIDEARDTLRTIAPEVKINDGSVETEVTISETSNDPLEVPTDD 422 (422) T ss_pred HHhcCCCCHHHHHHHhhhhcccccCCCCCCccccchhhcCCCCCCCCCCC Confidence 577899999999988733221 111 1111000000 1111 11 No 116 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=99.73 E-value=1.4e-16 Score=107.45 Aligned_cols=366 Identities=11% Similarity=0.021 Sum_probs=210.4 Q ss_pred CcchhhhcccCC--CcchhH-----------------------HHhhccccCcceechhhhhhcHHHHHHHHHHHHhhcc Q lcl|NC_019422. 1 MNIFKSKKKNKE--APGKVM-----------------------MELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGK 55 (384) Q Consensus 1 M~~f~~~~~~~~--~~~~~~-----------------------~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~ 55 (384) =+|+..-.+... .+.... ..++...+...... ..+++.+.|.+|++.+...|.+ T Consensus 3 ~~i~~~~g~~~~~~~~~~~~~~~ia~~~~~~~~~~~~~~~p~~~~il~~~~~~~~~y-~~m~~D~~i~s~l~~Rk~av~~ 81 (491) T protein:vir:79 3 KGLWVSPTEFVKFGEPDKSLSSQIATRARSIDFFALGMYLPNPDPVLKALGKDIRVY-RELRADAHVGGCVRRRKAAVKA 81 (491) T ss_pred CeeeCCCCCcccccccchhHHHHHhhhccccccccccccCcchhHHHhhccCCHHHH-HHHhhChHHHHHHHHHHHHHhC Confidence 122221000000 000000 00111111111111 2345688999999999999999 Q ss_pred CceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCC---ceeeEEEEcCceEEEE Q lcl|NC_019422. 56 MTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYN---MPTQIYPLNALNVEAI 132 (384) Q Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g---~~~~l~~l~~~~v~~~ 132 (384) ++|.|...+++. ...+....++.++ .+.+++..+ .+.+.+|-+++++++...| .+..+.+.|+.++.+. T Consensus 82 ~~w~i~~~~~~~---~~a~~i~e~l~~~----~~~~~i~~~-lda~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~d 153 (491) T protein:vir:79 82 LEWGLDRGKAKS---RVAKSIADVFADL----DLSRIATEM-LDAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFVYD 153 (491) T ss_pred CCcEEecCCCCH---HHHHHHHHHHhcC----CHHHHHHHH-HHhhhhcceeEEEEEeecCCeeeEEeeeeecccceeec Confidence 999996544332 1123334444443 466777776 4577899999999876443 4678999999988865 Q ss_pred EcCCCEEEEEEEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCC Q lcl|NC_019422. 133 YENEVLFLKFLLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTAL 212 (384) Q Consensus 133 ~~~~~~~~~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~ 212 (384) .+ ++..+...........+++...|+.++.......+|.|.+..|.-....-....++-..|.+.-|.|-.+.+++... T Consensus 154 ~~-~~l~l~~~~~~~~g~~lp~~k~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~~~a 232 (491) T protein:vir:79 154 PE-NQLRFRSKEHWVQGEELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSA 232 (491) T ss_pred cC-CceEEeecCCCCCceeecCCCeEEEEecCCCCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCCC Confidence 43 33333322222344567777777777776677789999999999999999999999999999999998899998888 Q ss_pred ChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccc---hhHHHHHHH-HHHHHHHHHHhC-CCHH-HhccccHH Q lcl|NC_019422. 213 RPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAE---SYVPNAAQM-DKAIQRLYSFFN-TNEK-IIQSKYSE 286 (384) Q Consensus 213 ~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~---~~~~~~~~~-~~~~~~I~~~fg-vp~~-~l~~~~~e 286 (384) ++++.+++.+.+.+. .+++ .++++.|++++-+..+ .....+.++ ++.-++|+.+.- =... --|++.+. T Consensus 233 ~~~ek~~l~~al~~~----~~~a--~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt~~~gs~a~ 306 (491) T protein:vir:79 233 SDAETNLLLDRLEDM----VQDA--VAVIPDDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIALLGQNQTTEATSTRAS 306 (491) T ss_pred CHHHHHHHHHHHHHH----hcCe--EEEecCCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHHhhhhhccCcccchhh Confidence 888877777665542 2233 4556666666555432 222234443 555556555431 0000 00222221 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHhhcccCccc---ccCcceEEeechhhhccCHHHHHHHHH-HHhCCC-CCHHHHHHHh Q lcl|NC_019422. 287 -DEWNAYYESEIEPVGLQLSNQYTEKLFTRKA---RSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGS-LTPNEWRKIM 360 (384) Q Consensus 287 -~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~---~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~-~t~NE~R~~l 360 (384) +.........+.-.+..+++.||+ |+...- ......++|.+.+... +.+.+++..+ ++..|+ ++..++|+.+ T Consensus 307 ~~vh~~v~~~i~~~D~~~i~~tln~-li~~l~~~N~~~~~~p~f~~~e~ee-~~~~~a~~~~~L~~~G~~i~~~~~~e~~ 384 (491) T protein:vir:79 307 AQAGLEVTDDIRDGDKAIVVEAMNM-LIRWICDLNFDGAARPVFDMWEQEQ-VDEIQAGRDEKLTRAGARFTPAYFKRAY 384 (491) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhcCCCCCcceEeecCcCc-hhHHHHHHHHHHHhCCCccCHHHHHHHh Confidence 222334455556667777777764 444311 1112234555554332 2344566654 677774 7999999999 Q ss_pred CCCCCCCCCeeeecCceeec--------CC-CC Q lcl|NC_019422. 361 NLSPIENGDKPVRRLDTAVV--------EG-GE 384 (384) Q Consensus 361 G~~p~~~gd~~~~~~n~~~~--------~~-ge 384 (384) |+|+.+.++.+..+..-.+. .. ++ T Consensus 385 Gip~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~ 417 (491) T protein:vir:79 385 NLQDGDLDERPLPVSAVDAVGAASFAEFEAPDQ 417 (491) T ss_pred CCCCCCCCccccCcCcccccccccccccCCCCC Confidence 99877655554422211110 00 00 No 117 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=99.72 E-value=2.2e-16 Score=106.39 Aligned_cols=370 Identities=9% Similarity=-0.016 Sum_probs=218.2 Q ss_pred Cc----chhhhcccCCC---cchhH--------------------HHhhccccCccee-----chhhhhhcHHHHHHHHH Q lcl|NC_019422. 1 MN----IFKSKKKNKEA---PGKVM--------------------MELISDSGNGFYS-----WHGNLYKSDIVRSIIRP 48 (384) Q Consensus 1 M~----~f~~~~~~~~~---~~~~~--------------------~~~~~~~~~~~~~-----~~~~~~~~~~v~~~i~~ 48 (384) |. .+++-.+.... ..... ...+..-..+... ..+-..+.+.|.+|++. T Consensus 1 m~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iL~~a~~gd~~~~~~L~~dm~~~D~hi~s~l~~ 80 (512) T protein:vir:19 1 MGRILDISGQPFDFDDEMQSRSDELAMVMKRTQEHPSSGVTPNRAAQMLRDAERGDLTAQADLAFDMEEKDTHLFSELSK 80 (512) T ss_pred CcceeCCCCCccccccccccccchhcccchhhccccccCCCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHHHHHH Confidence 32 22221110000 00000 0011110011000 11111357799999999 Q ss_pred HHHhhccCceEEEEecCCcc--eeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeC---CCCceeeEEE Q lcl|NC_019422. 49 KAKAVGKMTAKHIRSNETEF--KTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKD---DYNMPTQIYP 123 (384) Q Consensus 49 ia~~ia~~~~~~~~~~~~~~--~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~---~~g~~~~l~~ 123 (384) +...|.+++|.|....++.. +...+-....|...| .+.+++..+. +.+.+|-+++++++. +...+..+.+ T Consensus 81 Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~----~f~~~~~~ll-dA~~~G~s~~Ei~w~~~~g~~~~~~~~~ 155 (512) T protein:vir:19 81 RRLAIQALEWRIAPARDASAQEKKDADMLNEYLHDAA----WFEDALFDAG-DAILKGYSMQEIEWGWLGKMRVPVALHH 155 (512) T ss_pred HHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcCC----CHHHHHHHHH-hhhhhcceeeeeEeeeeCCceeeeeeee Confidence 99999999999964432211 111112223333333 3666777764 577899999999874 3345778999 Q ss_pred EcCceEEEEEcCCCEEEEEEEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcc Q lcl|NC_019422. 124 LNALNVEAIYENEVLFLKFLLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIK 203 (384) Q Consensus 124 l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~ 203 (384) .++.++.+..+..... .+...+..-..+++...+..++.......+|.+.+..|.-....-....++-..|.+.-|.|- T Consensus 156 r~~~~f~~~~~~~~~l-r~~~~~~~G~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~ 234 (512) T protein:vir:19 156 RDPALFCANPDNLNEL-RLRDASYHGLELQPFGWFMHRAKSRTGYVGTNGLVRTLIWPFIFKNYSVRDFAEFLEIYGLPM 234 (512) T ss_pred eccccceeccCCCcEE-EecCCCCCceeecCCceEEEeccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHcCCCe Confidence 9999888765544333 333333344456777666666666677789999999999999999999999999999999998 Q ss_pred eEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccch-hHHHHHHH-HHHHHHHHHHh-CCCHHH- Q lcl|NC_019422. 204 WLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAES-YVPNAAQM-DKAIQRLYSFF-NTNEKI- 279 (384) Q Consensus 204 ~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~-~~~~~~~~-~~~~~~I~~~f-gvp~~~- 279 (384) .+.+++...++++.+++.+.+.+. .++ ..++++.|++++-+..+. ....+..+ ++.-++|+.+. |=-... T Consensus 235 ~igky~~~a~~~ek~~L~~al~~~----~~~--a~~iiP~~~~ie~~ea~~~~~~~y~~li~~~d~~Isk~iLGqtlTs~ 308 (512) T protein:vir:19 235 RVGKYPTGSTNREKATLMQAVMDI----GRR--AGGIIPMGMTLDFQSAADGQSDPFMAMIGWAEKAISKAILGGTLTTE 308 (512) T ss_pred eEEecCCCCCHHHHHHHHHHHHHH----hhC--cEEEecCCceEEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 888998888888877777666542 223 355667777766555432 22335543 66677777662 211111 Q ss_pred hccc--cH-HHHHHHHHHHHHHHHHHHHHHHHhhcccCccc-cc--------CcceEEeechhhhccCHHHHHHHHHHHh Q lcl|NC_019422. 280 IQSK--YS-EDEWNAYYESEIEPVGLQLSNQYTEKLFTRKA-RS--------FGNEIVFEASNLQYASMSTKLNLVQMVD 347 (384) Q Consensus 280 l~~~--~~-e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~-~~--------~~~~i~fd~~~~~~~d~~~~~~~~~~~~ 347 (384) .|.+ .+ .+.......+.+...++.++..||++|+...- .. ..++++|+.. ...|++..++..+.+. T Consensus 309 ~g~~Gs~a~~~vh~ev~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~f~~~--e~eDl~~~a~~~~~l~ 386 (512) T protein:vir:19 309 AGDKGARSLGEVHDEVRREIRNADVGQLARSINRDLIYPLLALNSDSTIDINRLPGIVFDTS--EAGDITALSDAIPKLA 386 (512) T ss_pred ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCC--ChhhHHHHHHHHHHHh Confidence 1111 22 12234456677778889999999988877531 11 1245666544 4567777777665433 Q ss_pred CC-CCCHHHHHHHhCCCCCCCCCeeeecCceeecCCCC Q lcl|NC_019422. 348 RG-SLTPNEWRKIMNLSPIENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 348 ~g-~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~~ge 384 (384) .| -++..++|+.+|+|....++....+....+-.++. T Consensus 387 ~G~~i~~~~i~e~~Gip~~~~~e~~~~~~~~~~~~~~~ 424 (512) T protein:vir:19 387 AGMRIPVSWIQEKLHIPQPVGDEAVFTIQPVVPDNGSQ 424 (512) T ss_pred cCCCCCHHHHHHHhCCCCCCCccccccCCCcccccccc Confidence 45 67999999999997655454444322221111111 No 118 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=99.72 E-value=3.8e-16 Score=105.13 Aligned_cols=364 Identities=11% Similarity=-0.013 Sum_probs=209.5 Q ss_pred Ccchhhhc---ccCCC----------------------cchhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhcc Q lcl|NC_019422. 1 MNIFKSKK---KNKEA----------------------PGKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGK 55 (384) Q Consensus 1 M~~f~~~~---~~~~~----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~ 55 (384) =+|+..-- ..... .-+....++...+...... ..+++.+.|.+|++.+...|.+ T Consensus 3 ~~i~~~~g~p~~~~~~~~~~~~~ia~~~~~~~~~~~~~~~~~~~~iLr~~~~~~~~y-~~m~~D~~i~s~l~~Rk~av~~ 81 (491) T protein:vir:10 3 KGLWVSPTEFVTFGEPDKSLSSQIATRARSIDFFALGMYLPNPDPVLKALGKDIRVY-RELRADAHVGGCVRRRKAAVKA 81 (491) T ss_pred CceeCCCCCccCcccCChHHHHHHHhhhcccccccccCCccchHHHHHhcCCCHHHH-HHHhhChHHHHHHHHHHHHHhC Confidence 01111000 00000 0000111111111111111 2345688999999999999999 Q ss_pred CceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCC---ceeeEEEEcCceEEEE Q lcl|NC_019422. 56 MTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYN---MPTQIYPLNALNVEAI 132 (384) Q Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g---~~~~l~~l~~~~v~~~ 132 (384) ++|.|...+++. ...+....++.++ .+.+++..+. +.+.+|.+++++++...| .+..+.+.|+.++.+. T Consensus 82 ~~w~i~~~~~~~---~~~e~v~e~l~~~----~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~d 153 (491) T protein:vir:10 82 LEWGLDRGKAKS---RVAKSIADVFADL----DLSRIVTEML-DAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFVYD 153 (491) T ss_pred CCcEEecCCCCH---HHHHHHHHHHhcC----CHHHHHHHHH-HhhhhcceeEEEEEeecCCeeEEEEeeeecccceeec Confidence 999996543322 1223344444443 4777888875 678899999999986543 4678999999988764 Q ss_pred EcCCCEEEEEEEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCC Q lcl|NC_019422. 133 YENEVLFLKFLLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTAL 212 (384) Q Consensus 133 ~~~~~~~~~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~ 212 (384) .+ ++..+.-....+....+++...|+.++.......+|.|.+..|.-....-....++-..|.+.-|.|-.+.+++... T Consensus 154 ~~-~~l~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a 232 (491) T protein:vir:10 154 PE-NQLRFRSKDHWMQGEELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSA 232 (491) T ss_pred cC-CceEEecCCCCCCcceecCCCEEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEecCCCC Confidence 43 23333222222344567777777777776667789999999999999999999999999999999999999999888 Q ss_pred ChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccc--h-hHHHHHHH-HHHHHHHHHHh-CCCHH-HhccccHH Q lcl|NC_019422. 213 RPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAE--S-YVPNAAQM-DKAIQRLYSFF-NTNEK-IIQSKYSE 286 (384) Q Consensus 213 ~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~--~-~~~~~~~~-~~~~~~I~~~f-gvp~~-~l~~~~~e 286 (384) ++++.+++.+.+.+. .++ ..++++.|++++-+..+ . ....+..+ ++.-++|+.+. |=... --|++.+. T Consensus 233 ~~~ek~~l~~al~~~----~~~--a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt~~~gs~a~ 306 (491) T protein:vir:10 233 SDGEKNLLLDCLEDM----VQD--AVAVVPDDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIALLGQNQTTEATSTRAS 306 (491) T ss_pred CHHHHHHHHHHHHHH----hcC--cEEEecCCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHHhhhhcccCcccchhH Confidence 888877777665542 223 34566677666655443 2 22234443 55555555442 10000 00222221 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHhhcccCc-----ccccCcceEEeechhhhccCHHHHHHHHH-HHhCCC-CCHHHHHH Q lcl|NC_019422. 287 -DEWNAYYESEIEPVGLQLSNQYTEKLFTR-----KARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGS-LTPNEWRK 358 (384) Q Consensus 287 -~~~~~~~~~~i~P~~~~i~~~l~~~l~~~-----~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~-~t~NE~R~ 358 (384) +.........+.-.+..++..+|+ |+.. +.....+.++| .... .+.+++++..+ ++..|+ ++..++|+ T Consensus 307 ~~vh~~v~~di~~~D~~~i~~tln~-li~~l~~~N~~~~~~p~f~~--~~~~-e~~~~~a~~~~~L~~~G~~i~~~~i~e 382 (491) T protein:vir:10 307 AQAGLEVTDDIRDGDKAVVSEAMNM-LIRWICDLNFDGADRPVFDM--WEQE-QVDEIQAGRDQKLTQAGARFTPAYFKR 382 (491) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhcCCCCCcceEEe--cCcC-chhHHHHHHHHHHHhCCCcCCHHHHHH Confidence 222334445555566667766764 4432 11122344444 4332 33456666654 677774 79999999 Q ss_pred HhCCCCCCCCCeeeecCc--eee-cCCCC Q lcl|NC_019422. 359 IMNLSPIENGDKPVRRLD--TAV-VEGGE 384 (384) Q Consensus 359 ~lG~~p~~~gd~~~~~~n--~~~-~~~ge 384 (384) .+|+|+.+.++.+..... ..+ ...++ T Consensus 383 ~~Gip~~~~~~~~~~~~~~~~~~~~~~~~ 411 (491) T protein:vir:10 383 AYNLQDGDLDERPLPVSAVDTVGAASFAE 411 (491) T ss_pred HhCCCCCCcCccccccCCCCCcccccccc Confidence 999987654444331111 110 01111 No 119 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.71 E-value=4.7e-17 Score=110.10 Aligned_cols=362 Identities=10% Similarity=0.037 Sum_probs=196.0 Q ss_pred Ccchhhhccc------------------CCCc-c---hh----------------HHHhh-ccccCcceechhhhhhcHH Q lcl|NC_019422. 1 MNIFKSKKKN------------------KEAP-G---KV----------------MMELI-SDSGNGFYSWHGNLYKSDI 41 (384) Q Consensus 1 M~~f~~~~~~------------------~~~~-~---~~----------------~~~~~-~~~~~~~~~~~~~~~~~~~ 41 (384) -+++....+. .+.. . .. ...+. .....++ ..-..|..++. T Consensus 44 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~gy-ql~alY~~~~l 122 (765) T protein:vir:96 44 RGWNVEPEKAPVIRSVKDFLEPGLSVAMDSAYGDGPTPAAKAAAGGQNPYVVPTMLQDWYNSQGFIGY-QACAIISQHWL 122 (765) T ss_pred hhcccccccCCCCCCCCcccCcccceeccccccccccchHHHhhhccCccchhhHHHhhhcccCCccH-HHHHHHHhCch Confidence 1111110000 0000 0 00 00000 0001111 11123557889 Q ss_pred HHHHHHHHHHhhccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeC-------- Q lcl|NC_019422. 42 VRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKD-------- 113 (384) Q Consensus 42 v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~-------- 113 (384) ++.+|+.+|+++.+-.+.+.- .++ +. .+.....|...-.. ....+-+..++.+.-++|.+|+++.-. T Consensus 123 ~rkiVd~pAeDa~R~g~~I~~-~~~--e~-~~~~~~~l~~~~~r-l~v~~~l~ea~~~~RlyGga~i~i~i~~~D~~~l~ 197 (765) T protein:vir:96 123 VDKACSMSGEDAARNGWELKS-DGR--KL-SDEQSALIARRDME-FRVKDNLVELNRFKNVFGVRIALFVVESDDPDYYE 197 (765) T ss_pred hhhhhhcchHHhhcCCceeec-Ccc--cc-CHHHHHHHHHHHHH-hhHHHHHHHHHHHhhhceeeEEEEEecccCcchhh Confidence 999999999999988887732 111 11 11222222211111 123455666677778899998876532 Q ss_pred --------CCCceeeEEEEcCceEEEEE----cCC------CEEEEEEEcCceEEEEehhheEEEeccC------CCCCc Q lcl|NC_019422. 114 --------DYNMPTQIYPLNALNVEAIY----ENE------VLFLKFLLRNGKIVSYPYSDIIHLRKDF------NENDL 169 (384) Q Consensus 114 --------~~g~~~~l~~l~~~~v~~~~----~~~------~~~~~~~~~~g~~~~~~~~evih~~~~~------~~~~~ 169 (384) ..|....|.+++|.++.+.. ..+ +..-.|.. .|+. +-++.|||+.... +.... T Consensus 198 ~PL~~~~I~kg~~kgl~vldp~~~~~~~v~e~~~Dp~sp~fg~P~~y~i-~g~~--IH~SRli~~~g~~lpd~lk~~~~~ 274 (765) T protein:vir:96 198 KPFNPDGIAPGSYKGISQIDPYWAMPQLTAESTADPSAEHFYEPDFWII-SGKK--YHRSHLVVVRGPQPPDILKPTYIF 274 (765) T ss_pred ccccccccccceeeEEEEechhhcccccchhccccccccccCcceeeee-cCce--eccceEEEecCCCchhhhccccCc Confidence 11345677888887766522 111 11112222 2332 3456788885332 23456 Q ss_pred cCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCC--CCChHHHHHHHHHHHHHhccccccCCcceecCCCcee Q lcl|NC_019422. 170 FGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKT--ALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDA 247 (384) Q Consensus 170 ~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~--~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~ 247 (384) +|.|.++.+...|.............+.+.... +++++. .+..+ +.+.+.+..... .. +..++++++.+.+| T Consensus 275 ~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~~--v~k~~~~~~l~~~--~~l~~r~~~~~~-~r-~n~g~~~id~ee~~ 348 (765) T protein:vir:96 275 GGIPLTQRIYERVYAAERTANEAPLLAMSKRTS--TIHVDVEKAIANE--DAFNARLAFWIA-NR-DNHGVKVIGIDETM 348 (765) T ss_pred cCccHHHHHHHHHHHHHHHHHHHHHHHHHhccc--eeeechHhhhccH--HHHHHHHHHHHH-hc-CCceeEEecCCcce Confidence 799999999999999999888888888776654 444443 22222 233343333222 22 33567889999999 Q ss_pred eecccchhHHHHHHHHHHHHHHHHHhCCCHHHhccc-------cHHHHHHHHHH-------HHHHHHHHHHHHHHhhccc Q lcl|NC_019422. 248 EQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQSK-------YSEDEWNAYYE-------SEIEPVGLQLSNQYTEKLF 313 (384) Q Consensus 248 ~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~~-------~~e~~~~~~~~-------~~i~P~~~~i~~~l~~~l~ 313 (384) +.++.+..++.- -+.....+||++.+||...|-|. ..+....+|+. ..+.|.++.+-..|-+. T Consensus 349 e~~s~~lsgl~d-~l~~~~~~iAaas~IP~t~LfGqsp~GlnATGe~D~~nYyD~I~s~Qe~~l~p~le~L~~li~~s-- 425 (765) T protein:vir:96 349 EQFDTNLSDFDS-VIMNQYQLVAAIAKTPATKLLGTSPKGFNATGEHETISYHEELESIQEHIFDPLLERHYLLLAKS-- 425 (765) T ss_pred eEEecccCCHHH-HHHHHHHHHHhhhCCCeeeeccCCcccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-- Confidence 988877665432 12456779999999999776331 23444444443 44667666666555432 Q ss_pred CcccccCcceEEeechhhhccCHHHHHHH-------HH-HHhCCCCCHHHHHHHhCCCCC------CCCC----eeeecC Q lcl|NC_019422. 314 TRKARSFGNEIVFEASNLQYASMSTKLNL-------VQ-MVDRGSLTPNEWRKIMNLSPI------ENGD----KPVRRL 375 (384) Q Consensus 314 ~~~~~~~~~~i~fd~~~~~~~d~~~~~~~-------~~-~~~~g~~t~NE~R~~lG~~p~------~~gd----~~~~~~ 375 (384) ..... .+.+.+.+|...+.++++++ ++ ++.+|+++++|+|+.|+.++. ++.+ ....|. T Consensus 426 --~~i~~--d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~Gvis~dEvR~~L~~~~~~g~~~l~d~~~e~~~~~~pe 501 (765) T protein:vir:96 426 --ESIDV--QLEIVWNPVDSTTSQQQAELNNKKAATDEIYINSGVVSPDEVRERLRDDPRSGYNRLTDDQAETEPGMSPE 501 (765) T ss_pred --cCCCC--cceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhccccCCCCCCCccccccccCCCcc Confidence 22333 45666788888888777764 23 577899999999999865542 2211 111122 Q ss_pred ceeecCC-CC Q lcl|NC_019422. 376 DTAVVEG-GE 384 (384) Q Consensus 376 n~~~~~~-ge 384 (384) +...++. ++ T Consensus 502 ~~~~~~~~~~ 511 (765) T protein:vir:96 502 NLAELEKAGA 511 (765) T ss_pred ccccccCCCc Confidence 2211110 00 No 120 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=99.71 E-value=4e-16 Score=104.99 Aligned_cols=368 Identities=7% Similarity=-0.002 Sum_probs=219.1 Q ss_pred hcccCCCcch--hHHHhhcc-ccCc------------------ceechhhhh-hcHHHHHHHHHHHHhhccCceEEEEec Q lcl|NC_019422. 7 KKKNKEAPGK--VMMELISD-SGNG------------------FYSWHGNLY-KSDIVRSIIRPKAKAVGKMTAKHIRSN 64 (384) Q Consensus 7 ~~~~~~~~~~--~~~~~~~~-~~~~------------------~~~~~~~~~-~~~~v~~~i~~ia~~ia~~~~~~~~~~ 64 (384) .......+.+ ..-...+. ...+ ...+ ..+. +.+.|.+|++.+...|.+++|.|...+ T Consensus 1 ~~~~~~~~~p~~~~g~~~~~~~~~~~~~~~~~e~~~~lr~~~~~~ly-~~m~e~D~~i~s~l~~rk~av~~~~w~v~p~~ 79 (469) T protein:vir:10 1 MTERVKTAAPVSEAGYVFGSGVVDGWTVWDPFEQTPELQWPQSVAVY-SRMDNEDSRVTSLLEAISLPIRSTPWRIRANG 79 (469) T ss_pred CCCcccCCCCccchhhhhhcccccchhhccccccccccccccchHHH-HHHHhhChHHHHHHHHHHHHHhcCCceEecCC Confidence 2111111111 11111110 0000 0011 1222 478999999999999999999996544 Q ss_pred CCcceeccchHHHHHHhhc------------cccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC-----C--ceeeEEEEc Q lcl|NC_019422. 65 ETEFKTNPEIYIKFLLENP------------NPFMSGQILQEKMVTQLELNSNAFAVIIKDDY-----N--MPTQIYPLN 125 (384) Q Consensus 65 ~~~~~~~~~~~~~~l~~~P------------N~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~-----g--~~~~l~~l~ 125 (384) ++. +..+.....|.... +-..++.+++..++.+.+.+|-++.++++... | .+..+-+.| T Consensus 80 ~~~--e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~dG~~~~~~l~~rp 157 (469) T protein:vir:10 80 ASD--EVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSPDGRFWLRKLAPRP 157 (469) T ss_pred CCH--HHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccCCCceeeeeeeecC Confidence 332 11122222221111 11346778888888888889999999998633 3 356677777 Q ss_pred CceEE-EEEcCCCEEEEEEE-------------cCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 126 ALNVE-AIYENEVLFLKFLL-------------RNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQG 191 (384) Q Consensus 126 ~~~v~-~~~~~~~~~~~~~~-------------~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~ 191 (384) +.++. ...+.++....+.. .++....+|+...|+.++.......+|.|.+..|.-....-....++ T Consensus 158 ~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lp~~k~i~~~~~~~~g~p~g~gLlr~~~~~~~fK~~~~~~ 237 (469) T protein:vir:10 158 QWTISKFNVAPDGGLESIEQIAPPARTRGSLYVANIAPPEIPVNRLVVYTRNKRPGQWQGKSILRSAYKHWLLKDKLLRI 237 (469) T ss_pred cccceeeeeccCCceeeeeecCcccccccccccCCCCccccccCcEEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHH Confidence 76553 22333322222211 12234557777777777777677789999999999999999999999 Q ss_pred HHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHH Q lcl|NC_019422. 192 VVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLY 270 (384) Q Consensus 192 ~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~ 270 (384) ...+.+.-|.|--+.+++...++++.+++.+.......| ....++++.|++++-++.+.+...+.++ ++.-++|+ T Consensus 238 w~~f~EryG~P~~vgky~~~a~~~ek~~l~~a~~~~~~g----~~a~~iip~~~~ie~~ea~g~~~~~~~li~~~d~~Is 313 (469) T protein:vir:10 238 EAATAERNGMGIPVGTASSATDEDEVRKMAALARSVRGG----INAGVGLAQGQILELLGVSGNLPDIRRAIEGHDRSIA 313 (469) T ss_pred HHHHHHHcCCcceEEecCCCCCHHHHHHHHHHHHHHhcC----CceEEEccCCceEEEeecCCCchHHHHHHHHHHHHHH Confidence 999999999998888998888888877776655432212 2335678889998888776555556554 56666666 Q ss_pred HHhCCCH-HH--hccccH-HHHHHHHHHHHHHHHHHHHHHHHhhcccCcc-----c-ccCcceEEeechhhhccCHHHHH Q lcl|NC_019422. 271 SFFNTNE-KI--IQSKYS-EDEWNAYYESEIEPVGLQLSNQYTEKLFTRK-----A-RSFGNEIVFEASNLQYASMSTKL 340 (384) Q Consensus 271 ~~fgvp~-~~--l~~~~~-e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~-----~-~~~~~~i~fd~~~~~~~d~~~~~ 340 (384) .+.--.- .. =|++.+ .+.........+...++.++..||++|+... . ....++++|+ ... .+.+..+ T Consensus 314 k~iLG~tlTs~~~gGS~a~~~vh~ev~~d~~~sDa~~i~~tln~~li~~l~~lN~g~~~~~P~~~~~--~~e-~~~~~~a 390 (469) T protein:vir:10 314 LSGLAHFLNLDGKGGSYALASVLEDPFTQAVHAYATSICRIANQHIIEDLVDINFGVDTPAPVLTFD--PIG-SRQDLTA 390 (469) T ss_pred HHHhcccccccCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccEEEec--CCC-CcHHHHH Confidence 5542111 10 022322 2334556677778888899999998877642 1 1122455554 433 3445556 Q ss_pred HHHH-HHhCCC-----CCHHHHHHHhCCCCCCCCCeeeecC--ceee--cCCCC Q lcl|NC_019422. 341 NLVQ-MVDRGS-----LTPNEWRKIMNLSPIENGDKPVRRL--DTAV--VEGGE 384 (384) Q Consensus 341 ~~~~-~~~~g~-----~t~NE~R~~lG~~p~~~gd~~~~~~--n~~~--~~~ge 384 (384) +..+ ++..|+ ++.+.+|+.+|+|+.+.++....+. +..| ...++ T Consensus 391 ~~i~~l~~~G~~~~~~~~~~~~~e~~gip~~~~~~~~~~~~~~~~~~~~~~~~~ 444 (469) T protein:vir:10 391 AAVKLLYDAGVFDDDPAVKRAIRQRFNLPSELNDTPSAEPEEPAAVPNQSAAPA 444 (469) T ss_pred HHHHHHHhcCCccCccccHHHHHHHhCCCCCCCCcccccchhcccCCCCCcccc Confidence 6665 678887 4567899999999775554433221 1111 01111 No 121 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=99.70 E-value=8.3e-16 Score=103.26 Aligned_cols=370 Identities=8% Similarity=0.015 Sum_probs=219.1 Q ss_pred Cc----chhhhcccCCCc---chhHH--------------------HhhccccCcceec-h---hhhh-hcHHHHHHHHH Q lcl|NC_019422. 1 MN----IFKSKKKNKEAP---GKVMM--------------------ELISDSGNGFYSW-H---GNLY-KSDIVRSIIRP 48 (384) Q Consensus 1 M~----~f~~~~~~~~~~---~~~~~--------------------~~~~~~~~~~~~~-~---~~~~-~~~~v~~~i~~ 48 (384) |. .+++-.+..... ...+. ..+..-..+.... - .... +.+.|.+|+.. T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~edm~e~D~~i~s~l~~ 80 (526) T protein:vir:79 1 MAQIVDVYGNPIRPQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAEMSK 80 (526) T ss_pred CCeeeCCCCCccCccccchhhhhhhhhhhhhcccCCCCCcCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHHHHHH Confidence 32 122211111000 01111 1111100010000 0 0112 47899999999 Q ss_pred HHHhhccCceEEEEecCCcce--eccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCC---ceeeEEE Q lcl|NC_019422. 49 KAKAVGKMTAKHIRSNETEFK--TNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYN---MPTQIYP 123 (384) Q Consensus 49 ia~~ia~~~~~~~~~~~~~~~--~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g---~~~~l~~ 123 (384) +...|.+++|.|....++... ...+-....|...| .+.+++..+. +.+.+|-++.++++...| .+..+.+ T Consensus 81 Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~----~~~~~i~~~l-dA~~~G~s~~Ei~w~~~~g~~~~~~l~~ 155 (526) T protein:vir:79 81 RKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLE----GLEDLLLDAL-DGIGHGYSCIELEWALQGREWMPLAFHH 155 (526) T ss_pred HHHHHhCCCceEecCCCCChHHHHHHHHHHHHHhccc----CHHHHHHHHH-hhhhhcceeEEEEEeecCCceeEEEeee Confidence 999999999999654332211 11111222232223 3556666654 477899999999876543 4778999 Q ss_pred EcCceEEEEEcCCCEEEEEEEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcc Q lcl|NC_019422. 124 LNALNVEAIYENEVLFLKFLLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIK 203 (384) Q Consensus 124 l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~ 203 (384) .++.++++..+.+. ...+......-..+++...+..++.......+|.+.+..|.-....-....++-..|.+.-|.|- T Consensus 156 r~~~~F~~~~~~~~-~l~~~~~~~~g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~yG~P~ 234 (526) T protein:vir:79 156 RPQSWFQLNPEDQN-ELRLRDNSPAGEALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPI 234 (526) T ss_pred ecccceEeccCCCc-EEEecCCCCCceeecCCceEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCce Confidence 99998887555543 33333333344566776655556666667789999999999999888889999999999999999 Q ss_pred eEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccch-hHHHHHHH-HHHHHHHHHHh-CCCHHH- Q lcl|NC_019422. 204 WLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAES-YVPNAAQM-DKAIQRLYSFF-NTNEKI- 279 (384) Q Consensus 204 ~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~-~~~~~~~~-~~~~~~I~~~f-gvp~~~- 279 (384) .+.+++...++++.+++.+.+.+. .++ ..++++.|++++-+..+. ....+..+ ++.-++|+.+. |=-... T Consensus 235 ~igky~~~a~~~ek~~L~~av~~i----~~d--a~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~ 308 (526) T protein:vir:79 235 RLGKYPPGTADEEKATLLRAVTGL----GHA--AAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAVLGGTLTST 308 (526) T ss_pred EEEecCCCCCHHHHHHHHHHHHHH----hcC--cEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 999998888888877777665442 223 356667777666665432 22335544 66677776663 221111 Q ss_pred h--c--cccHH-HHHHHHHHHHHHHHHHHHHHHHhhcccCcccc---------cCcceEEeechhhhccCHHHHHHHHH- Q lcl|NC_019422. 280 I--Q--SKYSE-DEWNAYYESEIEPVGLQLSNQYTEKLFTRKAR---------SFGNEIVFEASNLQYASMSTKLNLVQ- 344 (384) Q Consensus 280 l--~--~~~~e-~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~---------~~~~~i~fd~~~~~~~d~~~~~~~~~- 344 (384) . | |+.+. +.......+.+...++.++..||+.|+...-. ...++++|+. ....|++++++..+ T Consensus 309 ~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~--~e~eDl~~~a~~~~~ 386 (526) T protein:vir:79 309 TSQSGGGAFALGQVHNEVRHDILASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDL--REQADITSMAQSIPA 386 (526) T ss_pred cccCcchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCC--CCcccHHHHHHHHHH Confidence 1 1 12221 22345566667788899999999888654211 1123555544 35677888888765 Q ss_pred HHhCCC-CCHHHHHHHhCCCCCCCCCeeeecCcee---ecCCCC Q lcl|NC_019422. 345 MVDRGS-LTPNEWRKIMNLSPIENGDKPVRRLDTA---VVEGGE 384 (384) Q Consensus 345 ~~~~g~-~t~NE~R~~lG~~p~~~gd~~~~~~n~~---~~~~ge 384 (384) ++..|+ ++..++|+.+|+|....++.++.|..-. +...+. T Consensus 387 L~~~G~~i~~~~i~e~~gip~~~~~e~~l~~~~~~~~~~~~~~~ 430 (526) T protein:vir:79 387 LVNVGLEIPSAWVYDKLGIPQPAKNEPVLRPAAQPAILSRQHGQ 430 (526) T ss_pred HHhCCCcCCHHHHHHHhCCCCCCCchhhccccCCcccccccccc Confidence 677785 7999999999997665555544332211 001111 No 122 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.70 E-value=9.3e-18 Score=113.97 Aligned_cols=353 Identities=10% Similarity=0.032 Sum_probs=192.0 Q ss_pred Ccchhhh------c-ccCCCcchhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccc Q lcl|NC_019422. 1 MNIFKSK------K-KNKEAPGKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPE 73 (384) Q Consensus 1 M~~f~~~------~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~ 73 (384) |..|+.= . .......+ . ..+..++ .....|..++.++++|+.+|+.+.+..+.+- .+ +..... . T Consensus 1 ~~~~~~d~~~~~~~~~~~~~~~~----~-~~~~~~~-~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~-g~-~~~~~~-~ 71 (427) T protein:vir:10 1 MKIVKHDGYNDIFNGGADGSPKP----F-FMSDASY-HVGSFYNDNATAKRIVDVIPEEMVTAGFKMS-GV-KDEKEF-K 71 (427) T ss_pred CCccccchHHHHhhcCCCCcccC----c-cccCchH-HHHHHHHcCchhhhhhccchHHhhcCCcccc-Cc-cHHHHH-H Confidence 6555520 0 00000000 0 0011111 1123456788999999999999999888872 11 111111 1 Q ss_pred hHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeC----------CCCceeeEEEEcCceEEEEEcCC------- Q lcl|NC_019422. 74 IYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKD----------DYNMPTQIYPLNALNVEAIYENE------- 136 (384) Q Consensus 74 ~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~----------~~g~~~~l~~l~~~~v~~~~~~~------- 136 (384) .....| ...+-+..++....++|.+++++..+ ..|.+..+.++++..+++..... T Consensus 72 ~~~~~l--------~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~d~~~~~~~~~~~dp~s~~f 143 (427) T protein:vir:10 72 SLWDSY--------KLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVYDRFAITVEKRVTNARSPRY 143 (427) T ss_pred HHHHHh--------hHHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEechhcccccccccCcccccc Confidence 111222 23345666677778899999987532 34678899999998887643221 Q ss_pred CEEEEEEEcC--c-eEEEEehhheEEEeccC------CCCCccCccHHH-HHHHHHHHHHHHHHHHHHHHHccCCcceEE Q lcl|NC_019422. 137 VLFLKFLLRN--G-KIVSYPYSDIIHLRKDF------NENDLFGTSPAK-VLEPIMEVVNTTDQGVVKAIKNSNTIKWLL 206 (384) Q Consensus 137 ~~~~~~~~~~--g-~~~~~~~~evih~~~~~------~~~~~~G~s~~~-~~~~~i~~~~~~~~~~~~~~~ng~~p~~il 206 (384) +.+..|...+ + ..+.+-++.|+|+.... +.+..+|.|++. .+.+.+.............+.+.... ++ T Consensus 144 g~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~k~~~~--v~ 221 (427) T protein:vir:10 144 GEPEIYKVSPGDNMQPYLIHHSRVFIADGERVAQQARKQNQGWGASVLNKSLIDAICDYDYCESLATQILRRKQQA--VW 221 (427) T ss_pred CcceEEEEecCCCCcceEEccccEEEecCCCchhhhcccCCcccchhhhHHHHHHHHHHHHHHHHHHHHHHHhccc--cc Confidence 2222233322 2 22456677899885332 345678999986 56788888888888878777665443 44 Q ss_pred eeCCC---CC-hHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhcc Q lcl|NC_019422. 207 KFKTA---LR-PDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQS 282 (384) Q Consensus 207 ~~~~~---~~-~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~ 282 (384) ++++- ++ .+.....++++.... ...++.+.+++.+.+.+|+.++.+.....- -+.....+||++.+||...|.| T Consensus 222 k~~~l~~~~~~~~~~~~~~~r~~~~~-~~~~~~~~~~l~~~~e~~e~~~~~lsgl~~-~~~~~~~~iaaa~~IP~t~L~G 299 (427) T protein:vir:10 222 KVKGLAEMCDDDDAQYAARLRLAQVD-DNSGVGRAIGIDAETEEYDVLNSDISGVPE-FLSSKMDRIVSLSGIHEIIIKN 299 (427) T ss_pred cchhHHHHhcCccchHHHHHHHHHHH-HhcCcccceeeecCCCceeEEecccCChHH-HHHHHHHHHHhhhCCCeeeecc Confidence 44321 11 111222333333322 122344556666677888888766655432 1345677899999999987732 Q ss_pred c-------cHHHHHHHHHH-------HHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHH------ Q lcl|NC_019422. 283 K-------YSEDEWNAYYE-------SEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNL------ 342 (384) Q Consensus 283 ~-------~~e~~~~~~~~-------~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~------ 342 (384) . +.+....+|+. ..+.|.++.+-..+- . ..++ .+.+.++...+.++++++ T Consensus 300 ~sp~Glnstgd~D~~nyyd~i~~~Qe~~l~p~l~~l~~~i~----~----s~~~--~~~f~pL~~~s~kEkaei~~~~a~ 369 (427) T protein:vir:10 300 KNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV----D----EEEW--SIEFEPLSVPSKKEESEITKNNVE 369 (427) T ss_pred CCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----c----CCCc--EEEeCCCCCCCHHHHHHHHHHHHH Confidence 1 22344444443 346666555444332 1 1234 444567777777766643 Q ss_pred -HH-HHhCCCCCHHHHHHHh----CCCCCCCCCeee--ecCce---e-ecCCCC Q lcl|NC_019422. 343 -VQ-MVDRGSLTPNEWRKIM----NLSPIENGDKPV--RRLDT---A-VVEGGE 384 (384) Q Consensus 343 -~~-~~~~g~~t~NE~R~~l----G~~p~~~gd~~~--~~~n~---~-~~~~ge 384 (384) ++ ++.+|+++++|+|+.| +.+.+++++..- .+... . +.++++ T Consensus 370 a~~~~~~~gvi~~~e~r~~L~~~~~~~~~~~~~~~~~e~~~~~~e~~p~~~e~~ 423 (427) T protein:vir:10 370 SVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPEPGLGEKL 423 (427) T ss_pred HHHHHHhcCCCCHHHHHHHHHhhhccccCCCCccccccccchhcCCCCCCCCCC Confidence 22 5778999999999876 444443322111 00111 1 112222 No 123 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.66 E-value=5.1e-16 Score=104.41 Aligned_cols=381 Identities=9% Similarity=0.041 Sum_probs=213.4 Q ss_pred CcchhhhcccCCCc-------chhHHHhh---------ccccCccee--------------chhhhhhcHHHHHHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAP-------GKVMMELI---------SDSGNGFYS--------------WHGNLYKSDIVRSIIRPKA 50 (384) Q Consensus 1 M~~f~~~~~~~~~~-------~~~~~~~~---------~~~~~~~~~--------------~~~~~~~~~~v~~~i~~ia 50 (384) |||+++.....+.. ........ ..+....+. ..+-+.++|.+..+|+.+. T Consensus 1 mn~~dr~i~~~sP~~~~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 80 (502) T protein:vir:79 1 MAILDDVIGVFSPGWKAARLRSRAVIQAYEAVKTTRTHKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKLE 80 (502) T ss_pred CchHhhHHhhcChHHHHHHHhhHHHHhhccccCcccccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 99999643322210 00000000 000000000 0123457899999999888 Q ss_pred HhhccC-ceEEEEe--cCCcc--eecc---chHHHHHHhhc--cccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC----- Q lcl|NC_019422. 51 KAVGKM-TAKHIRS--NETEF--KTNP---EIYIKFLLENP--NPFMSGQILQEKMVTQLELNSNAFAVIIKDDY----- 115 (384) Q Consensus 51 ~~ia~~-~~~~~~~--~~~~~--~~~~---~~~~~~l~~~P--N~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~----- 115 (384) +.+-.- -+.+... ..+.. ++.. ......+..++ +..+++..+.+.++..++..|++|+.+.+... T Consensus 81 ~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~~~~ 160 (502) T protein:vir:79 81 ERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSLTP 160 (502) T ss_pred HhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCccCC Confidence 777543 3433211 11111 1111 11222233333 24578999999999999999999999876543 Q ss_pred --CceeeEEEEcCceEE------------EEEcCCCEEEEEEEc--------CceEEEEehhheEEEeccCCCCCccCcc Q lcl|NC_019422. 116 --NMPTQIYPLNALNVE------------AIYENEVLFLKFLLR--------NGKIVSYPYSDIIHLRKDFNENDLFGTS 173 (384) Q Consensus 116 --g~~~~l~~l~~~~v~------------~~~~~~~~~~~~~~~--------~g~~~~~~~~evih~~~~~~~~~~~G~s 173 (384) +.+..|-.|+|..+. |..+..|..+.|... ....+.+|+++|+|+..+...+...|+| T Consensus 161 g~~~~l~lq~iepd~l~~~~~~~~~i~~GVe~d~~Gr~~aY~i~~~hPgd~~~~~~~rvpA~~vlH~f~~~r~gQ~RGis 240 (502) T protein:vir:79 161 SAGVHFWLEALEPDFIPMTSDESNRLNQGVFVDDWGRPEKYLVYKSRPVSGRQMETKEVDAERMLHLKFVRRLHQMRGTS 240 (502) T ss_pred CcccceEEEEecchhcCCCCCCCCeeEeeeEECCCCceEEEEEeecCCCCCcccceeEechhheEEeecccCCccccCCc Confidence 246788889998774 233444444333322 1345678999999999887788899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcce-ecCCCceeeeccc Q lcl|NC_019422. 174 PAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAA-ATDSKYDAEQVKA 252 (384) Q Consensus 174 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~-v~~~g~~~~~l~~ 252 (384) .+.+++..+........+.....+-.+...++|+.+....... +..-..-.+. ...-..|.++ .+..|.+++..+. T Consensus 241 ~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~-~~~~~~~~~~--~~~l~pG~i~~~L~pGe~i~~~~p 317 (502) T protein:vir:79 241 LLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEP-DGNGSKENER--ELTIQPGIIYDDLKPGEEIGMVKS 317 (502) T ss_pred hHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccc-ccCCCCCccc--cccccCCccccccCCCceeeeeCC Confidence 9999999999998888887777777777778887643211110 0000000000 0011235444 5789999999887 Q ss_pred chhHHHHHH-HHHHHHHHHHHhCCCHHHhccccHH-----------------HHHHHHHHHHHHHHHHH-HHHHHhhccc Q lcl|NC_019422. 253 ESYVPNAAQ-MDKAIQRLYSFFNTNEKIIQSKYSE-----------------DEWNAYYESEIEPVGLQ-LSNQYTEKLF 313 (384) Q Consensus 253 ~~~~~~~~~-~~~~~~~I~~~fgvp~~~l~~~~~e-----------------~~~~~~~~~~i~P~~~~-i~~~l~~~l~ 313 (384) +.....+.. .+...+.||+++|||...|.++.+. ..+..+....++|+.+. +++++-...+ T Consensus 318 ~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i 397 (502) T protein:vir:79 318 DRPNPNLETFRNGQLRAVAAGSRLSFSSTARNYNGTYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVI 397 (502) T ss_pred CCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCC Confidence 766666654 4678899999999999998754221 01112444556665554 4444444433 Q ss_pred Ccc---cccCcceEEeechhhhccCHHHHHHH-HHHHhCCCCCHHHHHHHhCCCCCCC----------CCee-------- Q lcl|NC_019422. 314 TRK---ARSFGNEIVFEASNLQYASMSTKLNL-VQMVDRGSLTPNEWRKIMNLSPIEN----------GDKP-------- 371 (384) Q Consensus 314 ~~~---~~~~~~~i~fd~~~~~~~d~~~~~~~-~~~~~~g~~t~NE~R~~lG~~p~~~----------gd~~-------- 371 (384) +-. .......+.|-.......|+.+-++. ..++.+|+.|.-|+-...|.++-+. .+++ T Consensus 398 ~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~Gl~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~ 477 (502) T protein:vir:79 398 RLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGAATESDWVRAGGRNPDDVKRRRKAEIDENRKLDLVFDTDP 477 (502) T ss_pred CCCCCCCchhhcceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCC Confidence 311 11122233333344445565544444 4579999999998877777766321 0111 Q ss_pred -eecCcee-ecCCCC Q lcl|NC_019422. 372 -VRRLDTA-VVEGGE 384 (384) Q Consensus 372 -~~~~n~~-~~~~ge 384 (384) ..+.+.. +.+..| T Consensus 478 ~~~~~~~~~~~~~~e 492 (502) T protein:vir:79 478 ASDKGGSSAATKRQE 492 (502) T ss_pred CCCCCCCCCCCCCCC Confidence 1111111 111111 No 124 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.60 E-value=1.3e-15 Score=102.19 Aligned_cols=379 Identities=7% Similarity=-0.071 Sum_probs=215.6 Q ss_pred CcchhhhcccCCCcc----hhHHH---------hhcccc--Cccee---------------chhhhhhcHHHHHHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPG----KVMME---------LISDSG--NGFYS---------------WHGNLYKSDIVRSIIRPKA 50 (384) Q Consensus 1 M~~f~~~~~~~~~~~----~~~~~---------~~~~~~--~~~~~---------------~~~~~~~~~~v~~~i~~ia 50 (384) |++..+..+...... ..... ....+. ....+ ..+-+.+++.+..+|+.+. T Consensus 8 ~~~~dr~i~~~~~~~~~~~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~ 87 (505) T protein:vir:96 8 PSLAQRMVNWAWYRYVEPQKNAARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSINNPYAKRFYQLLK 87 (505) T ss_pred cchhhcccchhhhhhHHHHHHhhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 777776543211100 00000 000010 00000 1123457899999999888 Q ss_pred Hhhcc-CceEEEEecCC--c--cee---ccchHHHHHHhhcc----ccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCC-c Q lcl|NC_019422. 51 KAVGK-MTAKHIRSNET--E--FKT---NPEIYIKFLLENPN----PFMSGQILQEKMVTQLELNSNAFAVIIKDDYN-M 117 (384) Q Consensus 51 ~~ia~-~~~~~~~~~~~--~--~~~---~~~~~~~~l~~~PN----~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g-~ 117 (384) +.+-. ..+++...-.. + .++ .-......+...+| ..+++.++...++..++..|++|+.+.+...+ . T Consensus 88 ~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~ 167 (505) T protein:vir:96 88 NNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDGEVLVREHRGYPNKW 167 (505) T ss_pred HHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCCceEEEEeecCCCCc Confidence 77764 45555322111 1 111 11222333444455 34678899999999999999999998765433 3 Q ss_pred eeeEEEEcCceEEEE----------------EcCCCEEEEEEEcC--------------ceEEEEehhheEEEeccCCCC Q lcl|NC_019422. 118 PTQIYPLNALNVEAI----------------YENEVLFLKFLLRN--------------GKIVSYPYSDIIHLRKDFNEN 167 (384) Q Consensus 118 ~~~l~~l~~~~v~~~----------------~~~~~~~~~~~~~~--------------g~~~~~~~~evih~~~~~~~~ 167 (384) +..|-.|+|..+..- .+..|..+.|.... ...+.+|.++|+|+..+.... T Consensus 168 ~~~lqliepd~l~~~~n~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~hPgd~~~~~~~~~~~~~rvpa~~vlH~f~~~r~g 247 (505) T protein:vir:96 168 GYALQILECDRLDLNYNADLQNGNRIRMSIELDAWERPVAYHLLVNHPGDNSYCYHYAGQTYERVPADEIIHTFVPWRPH 247 (505) T ss_pred ceEEEEechhhcCCCCCcccCCcCeEEeceEECCCCceEEEEEeecCCCccccccccccccccccCHhHhhhhhcccCCc Confidence 567888888877422 22233333333210 124457889999999887788 Q ss_pred CccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCcee Q lcl|NC_019422. 168 DLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDA 247 (384) Q Consensus 168 ~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~ 247 (384) ...|+|.+.+++..+.......+......+=.+...++|+.+.....+..... ..+... .-..|.+..+..|.++ T Consensus 248 Q~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~---~~~~~~--~l~pG~i~~L~pGe~i 322 (505) T protein:vir:96 248 QNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEAYDQPPEDD---QGEIVE--EVEAGTYQLLPYGIRF 322 (505) T ss_pred cccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCccCCCccccc---cCcccc--ccCCceeeecCCCCee Confidence 89999999999999999988888877777777777788876533211110000 001111 1124678889999999 Q ss_pred eecccchhHHHHHH-HHHHHHHHHHHhCCCHHHhccccHH------------------HHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_019422. 248 EQVKAESYVPNAAQ-MDKAIQRLYSFFNTNEKIIQSKYSE------------------DEWNAYYESEIEPVGLQ-LSNQ 307 (384) Q Consensus 248 ~~l~~~~~~~~~~~-~~~~~~~I~~~fgvp~~~l~~~~~e------------------~~~~~~~~~~i~P~~~~-i~~~ 307 (384) +.++.+.....+.. .+.+.+.||+++|||...|.++.++ ..+..+....++|+.+. ++++ T Consensus 323 ~~~~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a 402 (505) T protein:vir:96 323 KEHKIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEGVNFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLISMS 402 (505) T ss_pred eeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99988766666654 4678899999999999998654221 11123445566776554 5555 Q ss_pred HhhcccCccc--ccCcceEEeechhhhccCHHHHHHH-HHHHhCCCCCHHHHHHHhCCCCCCC----------CCeeeec Q lcl|NC_019422. 308 YTEKLFTRKA--RSFGNEIVFEASNLQYASMSTKLNL-VQMVDRGSLTPNEWRKIMNLSPIEN----------GDKPVRR 374 (384) Q Consensus 308 l~~~l~~~~~--~~~~~~i~fd~~~~~~~d~~~~~~~-~~~~~~g~~t~NE~R~~lG~~p~~~----------gd~~~~~ 374 (384) +-...++-.. ......+.|-.......|+.+-++. ..++.+|+.|.-|+-...|.++-+- .+++=++ T Consensus 403 ~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~ 482 (505) T protein:vir:96 403 LLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAHSESIKNRTRSRSSIIRAAGDDPEDVFDEIAWEEQLMRDKGVN 482 (505) T ss_pred HHcCCcCCCCccchhhceeeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCC Confidence 5444443221 1122234444444455565555554 4579999999988876666665310 0111010 Q ss_pred -------CceeecC-----CCC Q lcl|NC_019422. 375 -------LDTAVVE-----GGE 384 (384) Q Consensus 375 -------~n~~~~~-----~ge 384 (384) ....+.. .+| T Consensus 483 ~~~~~~~~~~~~~~~~~~~~~d 504 (505) T protein:vir:96 483 PTPPEQESKDATTDEEDDSASD 504 (505) T ss_pred CCCCCCCCCCCCCCCCCCCCCC Confidence 0000111 111 No 125 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.56 E-value=1.2e-14 Score=96.93 Aligned_cols=380 Identities=9% Similarity=-0.009 Sum_probs=209.6 Q ss_pred CcchhhhcccCCCcc-------hhHHH---------hhccccCccee--------------chhhhhhcHHHHHHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPG-------KVMME---------LISDSGNGFYS--------------WHGNLYKSDIVRSIIRPKA 50 (384) Q Consensus 1 M~~f~~~~~~~~~~~-------~~~~~---------~~~~~~~~~~~--------------~~~~~~~~~~v~~~i~~ia 50 (384) ||||++.....+... ..... ....+....+. ..+-+.+++.+..+|+.+. T Consensus 1 Mn~iDr~i~~~sP~~a~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~ 80 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARRLAAREAIQAYEAARPGRTHKAKRQPLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRLE 80 (548) T ss_pred CchHHhHhhhcchHHHHHHHHhHHHhccccccCccccccccCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 999997544332110 00000 00001000000 1122456889999999987 Q ss_pred Hhhcc---CceEEEEecCCcce--ec---cchHHHHHHhhcc--ccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC----- Q lcl|NC_019422. 51 KAVGK---MTAKHIRSNETEFK--TN---PEIYIKFLLENPN--PFMSGQILQEKMVTQLELNSNAFAVIIKDDY----- 115 (384) Q Consensus 51 ~~ia~---~~~~~~~~~~~~~~--~~---~~~~~~~l~~~PN--~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~----- 115 (384) +.+-. +.++....+.++.. +. -......+..++. ..+++.++...++..++..|++++.+.+... T Consensus 81 ~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~~~ 160 (548) T protein:vir:95 81 ERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPNYTF 160 (548) T ss_pred HhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeecccccccC Confidence 77654 23332222222211 11 1112233333333 3578999999999999999999999886532 Q ss_pred --CceeeEEEEcCceEEE-------------EEcCCCEEEEEEEc------------CceEEEEehhheEEEeccCCCCC Q lcl|NC_019422. 116 --NMPTQIYPLNALNVEA-------------IYENEVLFLKFLLR------------NGKIVSYPYSDIIHLRKDFNEND 168 (384) Q Consensus 116 --g~~~~l~~l~~~~v~~-------------~~~~~~~~~~~~~~------------~g~~~~~~~~evih~~~~~~~~~ 168 (384) ..+..|..|+|..+.. ..+..+..+.|... ....+.+++++|+|+..+...+. T Consensus 161 g~~~~~~lqliepd~l~~~~~~~~~~i~~GIE~D~~Grp~aY~i~~~hPgd~~~~~~~~~~~rvpA~~VlHif~~~r~gQ 240 (548) T protein:vir:95 161 ATSVPFALELLEPDYLPFSYNNLSKGIVQGIERDTWRRKRAYHLLKDHPGNLQTLGGSLAVKRVEAERIIHIAYRKRIGQ 240 (548) T ss_pred CcccceEEEEechhhcCCCCCCCCCceeeeeEECCCCceEEEEEeecCCCcccccccccceeeechhHheecccccCCcc Confidence 2356788898887742 22222332222221 12356789999999998877788 Q ss_pred ccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcce-ecCCCcee Q lcl|NC_019422. 169 LFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAA-ATDSKYDA 247 (384) Q Consensus 169 ~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~-v~~~g~~~ 247 (384) ..|+|.+.+++..+..............+=.+...++|+.+....... +..... ......-..|.++ .|..|.++ T Consensus 241 ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~~~~~-~~~~~~---~~~~~~~~pG~iv~~L~pGe~i 316 (548) T protein:vir:95 241 NRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPDSYTV-EPGKDR---KNRTIPIAPGMVFDDLEPGEDV 316 (548) T ss_pred ccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccC-CCCccc---ccccccccCCccccccCCCcee Confidence 999999999999999998888887777777777777877643211100 000000 0001111234444 57889999 Q ss_pred eecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhccccHH-----------------HHHHHHHHHHHHHHHHH-HHHHH Q lcl|NC_019422. 248 EQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQSKYSE-----------------DEWNAYYESEIEPVGLQ-LSNQY 308 (384) Q Consensus 248 ~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~~~~e-----------------~~~~~~~~~~i~P~~~~-i~~~l 308 (384) +..+.+.....+... +.+.+.||+++|||.+.|.++.+. ..+..+....++|+.+. +++++ T Consensus 317 ~~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~s~nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~ 396 (548) T protein:vir:95 317 GMIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAYDGTYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYL 396 (548) T ss_pred eecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 988877656666544 678899999999999998764321 11122444555664443 44444 Q ss_pred hhcccCcc---cccCcceEEeechhhhccCHHHHHHH-HHHHhCCCCCHHHHHHHhCCCCCCC----------------- Q lcl|NC_019422. 309 TEKLFTRK---ARSFGNEIVFEASNLQYASMSTKLNL-VQMVDRGSLTPNEWRKIMNLSPIEN----------------- 367 (384) Q Consensus 309 ~~~l~~~~---~~~~~~~i~fd~~~~~~~d~~~~~~~-~~~~~~g~~t~NE~R~~lG~~p~~~----------------- 367 (384) -...++-. .......++|-.-.....|+.+-++. ..++.+|+.|.-|+-...|.++-+- T Consensus 397 l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl~T~~~~~a~~G~D~~ev~~q~a~E~~~~~~~GL~ 476 (548) T protein:vir:95 397 LARKERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAGFADEAEVARARGRDPRELKKSRETEIKANRAAGLV 476 (548) T ss_pred HcCCcCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCC Confidence 43333211 11112233333334445565555554 4579999999988877777665320 Q ss_pred --CCeeeecCc--eeecCCCC Q lcl|NC_019422. 368 --GDKPVRRLD--TAVVEGGE 384 (384) Q Consensus 368 --gd~~~~~~n--~~~~~~ge 384 (384) +|....+.. ..+.+..+ T Consensus 477 ~~~~~~~~~~~~~~~~~~~~~ 497 (548) T protein:vir:95 477 FSSDAYHQLVKSGMDPVEAVQ 497 (548) T ss_pred CCCcccccccccccCCCCchh Confidence 111111100 01111000 No 126 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.56 E-value=1.2e-14 Score=96.86 Aligned_cols=384 Identities=9% Similarity=-0.027 Sum_probs=207.5 Q ss_pred CcchhhhcccCCCcchh----------HHHhhccccCcce---------------echhhhhhcHHHHHHHHHHHHhhcc Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKV----------MMELISDSGNGFY---------------SWHGNLYKSDIVRSIIRPKAKAVGK 55 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~----------~~~~~~~~~~~~~---------------~~~~~~~~~~~v~~~i~~ia~~ia~ 55 (384) |..=....-........ ....+.++..... ....-+.++|.+.++|+.+.+.+-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG 80 (530) T protein:vir:38 1 MKIPSLVGPDGKTSLREYAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQLHQDHIVG 80 (530) T ss_pred CccceeecCccccchHHHhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhhC Confidence 22110000000000000 0000111111000 1112345799999999999888877 Q ss_pred CceEEEEec------CCc--ceec---cchHHHHHHhhcc------ccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC-C- Q lcl|NC_019422. 56 MTAKHIRSN------ETE--FKTN---PEIYIKFLLENPN------PFMSGQILQEKMVTQLELNSNAFAVIIKDDY-N- 116 (384) Q Consensus 56 ~~~~~~~~~------~~~--~~~~---~~~~~~~l~~~PN------~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~-g- 116 (384) -.+++.-.- -++ .++. -......+...|+ ..+|+.++.+.++..++..|++|+.+.+... | T Consensus 81 ~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~~~~g~ 160 (530) T protein:vir:38 81 SFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQATWDSDSTR 160 (530) T ss_pred CCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEeeeccCCCC Confidence 677764321 111 1111 1122333334444 3578999999999999999999999987643 3 Q ss_pred -ceeeEEEEcCceEEE--------------EEcCCCEEEEEEE--c--Cc----------eEEEEehhheEEEeccCCCC Q lcl|NC_019422. 117 -MPTQIYPLNALNVEA--------------IYENEVLFLKFLL--R--NG----------KIVSYPYSDIIHLRKDFNEN 167 (384) Q Consensus 117 -~~~~l~~l~~~~v~~--------------~~~~~~~~~~~~~--~--~g----------~~~~~~~~evih~~~~~~~~ 167 (384) .+..|-.|+|..+.- ..+..|....|.. . .| ..+..+.++|||+..+...+ T Consensus 161 ~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~vlH~f~~~r~g 240 (530) T protein:vir:38 161 LFRTQFKMVSPKRVSNPNNIGDTRNCRAGVKINDSGAALGYYVSDDGYPGWMAQNWTYIPRELPGGRPSFIHVFEPMEDG 240 (530) T ss_pred ccceEEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEEeeccCCCccccccceeeeeeccChhHeEeeccccCCC Confidence 256788888877642 2233333322222 1 11 12335667999999888788 Q ss_pred CccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCC-----------hHHHHHHHHHHH---HHhc--cc Q lcl|NC_019422. 168 DLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALR-----------PDDIKKEVKSFE---KNYL--QI 231 (384) Q Consensus 168 ~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~-----------~e~~~~~~~~~~---~~~~--~~ 231 (384) ...|+|.+.+++..+..............+-.+.-.++|+.+.... .++......... .... .. T Consensus 241 Q~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (530) T protein:vir:38 241 QTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGADNKEQQSKLTGWLGEMAAYYSAAPV 320 (530) T ss_pred cccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCccccccccccCCcccccccccccchhhhhcccccce Confidence 8999999999999999998888877777776777777776532210 111111110000 0000 11 Q ss_pred cccCCcceecCCCceeeecccchhHHHHHH-HHHHHHHHHHHhCCCHHHhccccHH------------------HHHHHH Q lcl|NC_019422. 232 DSEAGGAAATDSKYDAEQVKAESYVPNAAQ-MDKAIQRLYSFFNTNEKIIQSKYSE------------------DEWNAY 292 (384) Q Consensus 232 ~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~-~~~~~~~I~~~fgvp~~~l~~~~~e------------------~~~~~~ 292 (384) .-..|.+..+..|.+++..+.+....++.+ .+.+.+.||+++|||.++|.++.++ ..+..+ T Consensus 321 ~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~ 400 (530) T protein:vir:38 321 RLGGARVPHLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRNYSQMSYSTARASANESWAYFMGRRKFV 400 (530) T ss_pred eccCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHH Confidence 123566888999999999888766566654 4678999999999999998654221 111123 Q ss_pred HHHHHHHHHHH-HHHHHhhcccCccc---------ccCcceEEeechhhhccCHHHHHHH-HHHHhCCCCCHHHHHHHhC Q lcl|NC_019422. 293 YESEIEPVGLQ-LSNQYTEKLFTRKA---------RSFGNEIVFEASNLQYASMSTKLNL-VQMVDRGSLTPNEWRKIMN 361 (384) Q Consensus 293 ~~~~i~P~~~~-i~~~l~~~l~~~~~---------~~~~~~i~fd~~~~~~~d~~~~~~~-~~~~~~g~~t~NE~R~~lG 361 (384) ....+.|+.+. +++++....++-.. +......+|-.-.....|+.+-++. ..++.+|+.|.-++-...| T Consensus 401 ~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G 480 (530) T protein:vir:38 401 ASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRG 480 (530) T ss_pred HHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcC Confidence 33344555444 45555544443111 0111223444444455575555555 4579999999988876666 Q ss_pred CCCCCC-------------------CCeeeecCceeecCCCC Q lcl|NC_019422. 362 LSPIEN-------------------GDKPVRRLDTAVVEGGE 384 (384) Q Consensus 362 ~~p~~~-------------------gd~~~~~~n~~~~~~ge 384 (384) .++-+- ++-...+..-.+-.+.| T Consensus 481 ~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~ 522 (530) T protein:vir:38 481 DDYQEIFAQQVRESMERRAAGLNPPAWAAAAFEAGVKKSNEE 522 (530) T ss_pred CCHHHHHHHHHHHHHHHHHcCCCCCCCcccccCCCCCCCCCC Confidence 665310 11111111111111111 No 127 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.53 E-value=2.1e-14 Score=95.57 Aligned_cols=384 Identities=9% Similarity=-0.008 Sum_probs=208.9 Q ss_pred CcchhhhcccCCCcchh-HHHh----------hccccCcce---------------echhhhhhcHHHHHHHHHHHHhhc Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKV-MMEL----------ISDSGNGFY---------------SWHGNLYKSDIVRSIIRPKAKAVG 54 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~-~~~~----------~~~~~~~~~---------------~~~~~~~~~~~v~~~i~~ia~~ia 54 (384) |.-+-....-....... ...+ +.++..... ...+-+.++|.+..+|+.+.+.+- T Consensus 3 ~p~~~~~~~~~~~~~~~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvV 82 (533) T protein:vir:34 3 TPTIPTLLGPDGMTSLREYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQDHIV 82 (533) T ss_pred CchhhhhhcccccchHHHHHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhh Confidence 33222211100000000 0000 000100000 011234579999999999988886 Q ss_pred cCceEEEEec--------CCcceec---cchHHHHHHhhcc------ccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC-C Q lcl|NC_019422. 55 KMTAKHIRSN--------ETEFKTN---PEIYIKFLLENPN------PFMSGQILQEKMVTQLELNSNAFAVIIKDDY-N 116 (384) Q Consensus 55 ~~~~~~~~~~--------~~~~~~~---~~~~~~~l~~~PN------~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~-g 116 (384) .-.|++.-.- ++..++. -......+..+|+ ..+++.++...++..++..|++|+.+.+... | T Consensus 83 G~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~g 162 (533) T protein:vir:34 83 GSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATWDTSSS 162 (533) T ss_pred CCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEeeeccCCC Confidence 5566653221 1111111 1122333434443 3468899999999999999999999886543 2 Q ss_pred --ceeeEEEEcCceEEE--------------EEcCCCEEEEEEE--c--Cc----------eEEEEehhheEEEeccCCC Q lcl|NC_019422. 117 --MPTQIYPLNALNVEA--------------IYENEVLFLKFLL--R--NG----------KIVSYPYSDIIHLRKDFNE 166 (384) Q Consensus 117 --~~~~l~~l~~~~v~~--------------~~~~~~~~~~~~~--~--~g----------~~~~~~~~evih~~~~~~~ 166 (384) .+..|..|+|..+.- ..+..|..+-|.. . .| ..+..+.++|||+..+... T Consensus 163 ~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~VlH~f~~~r~ 242 (533) T protein:vir:34 163 RLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELPGGRASFIHVFEPVED 242 (533) T ss_pred CccceEEEEechhhcCCCCCCCCCCceEeeeEECCCCCeEEEEEeecCCCCccccccceeeeeeccChhHeeeeccccCC Confidence 256778888877642 2223333322222 1 11 1234567899999988878 Q ss_pred CCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCC-----------ChHHHHHHH---HHHHHHhcc-- Q lcl|NC_019422. 167 NDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTAL-----------RPDDIKKEV---KSFEKNYLQ-- 230 (384) Q Consensus 167 ~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~-----------~~e~~~~~~---~~~~~~~~~-- 230 (384) +...|+|.+.+++..+..............+-.+.-.++|+.+... ..+..+.+. ........+ T Consensus 243 gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 322 (533) T protein:vir:34 243 GQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGANSQEQRERLTGWIGEIAAYYAAAP 322 (533) T ss_pred CcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeecCCCcccccccccCCCcccccccccccchhhhhccCcce Confidence 8899999999999999999888888777777777777777754211 011111111 111111111 Q ss_pred ccccCCcceecCCCceeeecccchhHHHHHH-HHHHHHHHHHHhCCCHHHhccccHH------------------HHHHH Q lcl|NC_019422. 231 IDSEAGGAAATDSKYDAEQVKAESYVPNAAQ-MDKAIQRLYSFFNTNEKIIQSKYSE------------------DEWNA 291 (384) Q Consensus 231 ~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~-~~~~~~~I~~~fgvp~~~l~~~~~e------------------~~~~~ 291 (384) ..-..|.+..+..|.+++..+.+....++.. .+.+.+.||+++|||.++|.++.++ ..+.. T Consensus 323 ~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~ 402 (533) T protein:vir:34 323 VRLGGAKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGRRKF 402 (533) T ss_pred eeccCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHH Confidence 1123467888999999999888766666654 4678999999999999998654221 11112 Q ss_pred HHHHHHHHHHHH-HHHHHhhcccCccc---------ccCcceEEeechhhhccCHHHHHHH-HHHHhCCCCCHHHHHHHh Q lcl|NC_019422. 292 YYESEIEPVGLQ-LSNQYTEKLFTRKA---------RSFGNEIVFEASNLQYASMSTKLNL-VQMVDRGSLTPNEWRKIM 360 (384) Q Consensus 292 ~~~~~i~P~~~~-i~~~l~~~l~~~~~---------~~~~~~i~fd~~~~~~~d~~~~~~~-~~~~~~g~~t~NE~R~~l 360 (384) +....++|+.+. +++++-...++-.. +.....+.|-.-.....|+.+-++. ..++.+|+.|.-|+-... T Consensus 403 ~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~ 482 (533) T protein:vir:34 403 VASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKR 482 (533) T ss_pred HHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHc Confidence 344445665554 44444433332110 0111223444444455565555544 457999999999887777 Q ss_pred CCCCCCC----------CCeeeecCce----------eecCCCC Q lcl|NC_019422. 361 NLSPIEN----------GDKPVRRLDT----------AVVEGGE 384 (384) Q Consensus 361 G~~p~~~----------gd~~~~~~n~----------~~~~~ge 384 (384) |.++-+. .+++=++... .+-+..+ T Consensus 483 G~D~~ev~~q~a~e~~~~~~~gl~~~~~~~~~~~s~~~~~~~~~ 526 (533) T protein:vir:34 483 GDDYQEIFAQQVRETMERRAAGLKPPAWAAAAFESGLRQSTEEE 526 (533) T ss_pred CCCHHHHHHHHHHHHHHHHhcCCCCCCCCCcCccCCCCCCCCCC Confidence 7666321 1111111111 1111111 No 128 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=99.51 E-value=3.5e-14 Score=94.37 Aligned_cols=354 Identities=12% Similarity=0.094 Sum_probs=204.5 Q ss_pred CcchhhhcccCCCcchh--------HHHhhcccc----------Ccc----eechhhhhhcHHHHHHHHHHHHhhccCce Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKV--------MMELISDSG----------NGF----YSWHGNLYKSDIVRSIIRPKAKAVGKMTA 58 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~--------~~~~~~~~~----------~~~----~~~~~~~~~~~~v~~~i~~ia~~ia~~~~ 58 (384) ||. ..++...+... -..+.+++. ... ..+.+-..+.+.|++|+..+...|.+++| T Consensus 1 ~~~---~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~D~~lr~~gg~~~~~~~l~~~m~e~D~~v~s~l~~Rk~av~~~~w 77 (446) T protein:vir:98 1 MNM---EVRNAPTPAIRRRTIYAMEHLGLATSYLSEDGGYKRAGKPTYQQLSAWDEAAQTEPIIAQGLDSIALSVLNKVG 77 (446) T ss_pred Ccc---cccCCCchhhhhhhhhccccchhhcccCCcchHhhhcCCChHHHHHHHHHHHhcchHHHHHHHHHHHHhhcCCc Confidence 332 11111111000 000111110 000 01111123478999999999999999999 Q ss_pred EEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC-C--ceee----EEEEcCceEEE Q lcl|NC_019422. 59 KHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDY-N--MPTQ----IYPLNALNVEA 131 (384) Q Consensus 59 ~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~-g--~~~~----l~~l~~~~v~~ 131 (384) .|...+ ++..+-....|...+ .++....+.+.+.+|-++.++++... | .+.. +....|..+.. T Consensus 78 ~V~p~~----~~~a~~v~~~l~~~~------~~~~~~~~ldai~~G~s~~Eivw~~~~g~~~p~~~~d~~~~~~~~~~r~ 147 (446) T protein:vir:98 78 PYQHGD----KRIKKFIDDQLRNRA------KTWISHCVKSIMTYGFSLSEQIYAHGARDNMPATVLDDIVNYHPLQVML 147 (446) T ss_pred eecCcc----HHHHHHHHHHHhhcC------chhHHHHHHHHHhhCceeeeEEEeecccccccchhhcccccccccccee Confidence 995321 112222222232221 23444446789999999999998532 2 1111 11122222222 Q ss_pred EEcCCCEE---------------------E------EEEEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHH Q lcl|NC_019422. 132 IYENEVLF---------------------L------KFLLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEV 184 (384) Q Consensus 132 ~~~~~~~~---------------------~------~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~ 184 (384) ..+.+... + ......|..+.+|....+++++....+..+|.|.+..|.-.... T Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~iP~~kfi~~~~~~~~~~p~G~gLlr~~~w~~~f 227 (446) T protein:vir:98 148 IANDNGRIVDGDTVTASQYKSGYWVPLPPYRIGDPPKKVDVVGSHVRLPSHKRLFINYNTKGNNPWGTSCLTSVLDYSIF 227 (446) T ss_pred eeccCCccccccccchhhcccccccCcccchhhhhhhhcccCcccccccccceEEEEecCCCCCccccchHHHHHHHHHH Confidence 22211110 0 01112234466788899999888777788999999999999999 Q ss_pred HHHHHHHHHHHHHccCCcceEEeeCCCCChHHH---------HHHHHHHHHHhccccccCCcce---ecCCCceeeeccc Q lcl|NC_019422. 185 VNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDI---------KKEVKSFEKNYLQIDSEAGGAA---ATDSKYDAEQVKA 252 (384) Q Consensus 185 ~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~---------~~~~~~~~~~~~~~~~~~~~~~---v~~~g~~~~~l~~ 252 (384) -....+.-..|.+.-|.|--+.+++...++++. +...+++.+.+.....+++.++ ++++|++++-++. T Consensus 228 K~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~av~~~~~da~~ii~~~~~P~g~eie~~ea 307 (446) T protein:vir:98 228 KRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDALRRLSTDSGLVLTQLSKEQPVQVGALTT 307 (446) T ss_pred HHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHHHHHhccccceeeeecccCCCCceEEeecc Confidence 999999999999999999888888765443221 2233445566655444444433 2388998887776 Q ss_pred chhH-HHHHHH-HHHHHHHHHHhCCCHHHhc------cccH-HHHHHHHHHHHHHHHHHHHHHHHhhcccCcccc-c--- Q lcl|NC_019422. 253 ESYV-PNAAQM-DKAIQRLYSFFNTNEKIIQ------SKYS-EDEWNAYYESEIEPVGLQLSNQYTEKLFTRKAR-S--- 319 (384) Q Consensus 253 ~~~~-~~~~~~-~~~~~~I~~~fgvp~~~l~------~~~~-e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~-~--- 319 (384) .... .++..+ ++.-++|+.+.....-.+| ||++ .+.......+.+...++.+++.||++|+...-. . T Consensus 308 ~~~~~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~ala~vh~~V~~d~~~aDa~~i~~tln~~Li~~l~~lNf~~ 387 (446) T protein:vir:98 308 GNNFSDSFERAISLCDNNMLMGMGIPNLLVQNRETTFGTGRASEIQLELFDGKINSIFDTVIHAFTEQVIGNLIRLNFDP 387 (446) T ss_pred ccCChhhHHHHHHHHHHHHHHHHhcccccccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc Confidence 5433 245554 7777888888766644443 2222 223344566677888999999999988664211 1 Q ss_pred -------CcceEEeechhhhccCHHHHHHHHH-HHhCCCCCH---HHHHHHhCCCCCCCCCe Q lcl|NC_019422. 320 -------FGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTP---NEWRKIMNLSPIENGDK 370 (384) Q Consensus 320 -------~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~---NE~R~~lG~~p~~~gd~ 370 (384) ...+++|++. ...|++..++..+ ++..|..++ +.+|+.+|+|+-+. |+ T Consensus 388 ~~~~~~~~~~~~~~~~~--e~eDl~~~a~~~~~L~~~G~~~p~~~~~ire~~giP~~~~-~~ 446 (446) T protein:vir:98 388 ALYPLASNTGYITRLPG--RATDLAALVEAIKQMHDMGFLVDGDKDHIRSITGLPDAIS-ST 446 (446) T ss_pred cccccccccccceeccC--ChhhHHHHHHHHHHHHhCCccccccHHHHHHHhCcCCCCC-CC Confidence 1123344443 3457777777765 678898764 45999999986532 33 No 129 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=99.50 E-value=3.4e-13 Score=88.95 Aligned_cols=378 Identities=10% Similarity=0.027 Sum_probs=196.2 Q ss_pred hhhhcccCCCcchh-HHHhhccccCc----cee-------------chhhhhhcHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_019422. 4 FKSKKKNKEAPGKV-MMELISDSGNG----FYS-------------WHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNE 65 (384) Q Consensus 4 f~~~~~~~~~~~~~-~~~~~~~~~~~----~~~-------------~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 65 (384) +-.+....+.-.+. +..+.+....+ ++. .-+.++..+.|.+|++.+...|.+++|.|...++ T Consensus 1 ~~~~~~~~~gl~p~rl~~i~~~~~~~~~~~~~~~~~~~Lr~~~~~~ly~~m~~D~hi~s~l~~Rk~av~~~~w~v~p~~~ 80 (488) T protein:vir:95 1 MADITETQESLPPFRMGEVGSLGLKVKNGRIYEEPRQALRFPESIKTFQLMMRDPAVAASVNIIKMFVRKVNWRFVPPKG 80 (488) T ss_pred CCCccccCCCCCHHHHHHHHHHhhccccchhhccchhhhcccchHHHHHHHhhChHHHHHHHHHHHHHhcCCceEecCCC Confidence 11111111111111 11221111110 000 0122345789999999999999999999965432 Q ss_pred CcceeccchHHHHHHhh-ccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC-------------C--ceeeEEEEcCc-- Q lcl|NC_019422. 66 TEFKTNPEIYIKFLLEN-PNPFMSGQILQEKMVTQLELNSNAFAVIIKDDY-------------N--MPTQIYPLNAL-- 127 (384) Q Consensus 66 ~~~~~~~~~~~~~l~~~-PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~-------------g--~~~~l~~l~~~-- 127 (384) .............+... -|-..++.+++..+. +.+.+|-+++++++... | .+..+.+.++. T Consensus 81 ~~~d~~~~~~a~~v~~~l~~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~~~~~~~~~~~~dg~~~~~~i~~Rpq~~~ 159 (488) T protein:vir:95 81 KEQDPKMLERADFFNSLMDDMEHDWADFINSVM-SFCTYGFCVNEKVYKKRQGKKGKYQSKFDDGLIGWAKLPIRNQSTL 159 (488) T ss_pred CchhHHHHHHHHHHHHHHhccCccHHHHHHHHH-HhhcccceeeeeeeeccccccccccccccCCeeeeeeeeecCcccc Confidence 22111111111112111 122235667777764 67889999999988532 2 14445555553 Q ss_pred -eEEEEEcCCCEEEEEE-E-------------cCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 128 -NVEAIYENEVLFLKFL-L-------------RNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGV 192 (384) Q Consensus 128 -~v~~~~~~~~~~~~~~-~-------------~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~ 192 (384) ++.+..+.+-...... . .....+.+|+...|+.++....+..+|.+.+..|.-..-.-....++- T Consensus 160 ~~f~~d~d~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~kfi~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w 239 (488) T protein:vir:95 160 DKWYFDEDFRRVTGVRQNLRNVSHIAGAINLGERPLTRKLPRAKFMLFKYDDEYGNPEGRSPLLNAYVPWKYKVQIEEYE 239 (488) T ss_pred cceeeccCCCceeecccccccccccccccccccccccccccccceEEEeecCCCCccchhhHHHHHHHHHHHHHHHHHHH Confidence 3333222221111100 0 011234567777666666666677899999999999988888888888 Q ss_pred HHHHHccCCcceEEeeCC----CCChHHHHHHHHHHHHHhccccccCCcceecCCCcee---------eecccch-hHHH Q lcl|NC_019422. 193 VKAIKNSNTIKWLLKFKT----ALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDA---------EQVKAES-YVPN 258 (384) Q Consensus 193 ~~~~~ng~~p~~il~~~~----~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~---------~~l~~~~-~~~~ 258 (384) ..|.++.+.|--+...+. ..++++.+++.+...+.......+....++++.|++. +.++... .... T Consensus 240 ~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~~~i~~~~~~~~~ag~iiP~g~~~~~k~~~~e~~l~~~~~~~~~~ 319 (488) T protein:vir:95 240 AVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYCKTVVNDMIANDRAGLIWPRYIDPDTKEDIFEFSLVSRQGAKAYD 319 (488) T ss_pred HHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHHHHHHHHhhccchhheeeccccccccchhhhhhhccccccCCchh Confidence 999987555544444432 2334444555555544333222222223455555432 2233222 2223 Q ss_pred HHHH-HHHHHHHHHHhCCCH-HH-h--ccccH-HHHHHHHHHHHHHHHHHHHHHHHhhcccCcc------cccCcceEEe Q lcl|NC_019422. 259 AAQM-DKAIQRLYSFFNTNE-KI-I--QSKYS-EDEWNAYYESEIEPVGLQLSNQYTEKLFTRK------ARSFGNEIVF 326 (384) Q Consensus 259 ~~~~-~~~~~~I~~~fgvp~-~~-l--~~~~~-e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~------~~~~~~~i~f 326 (384) +..+ ++.-++|+.+.--.- .+ - +|+.+ .+.........+...++.+++.||++|+... .....++++| T Consensus 320 ~~~li~~~d~~Isk~iLGqtLT~~~~~~Gs~Al~~vh~ev~~~i~~aDa~~i~~tln~~li~~l~~~Nfg~~~~~P~~~~ 399 (488) T protein:vir:95 320 TGSIIDRYSKQIMMAFMSDVLAMGQSKYGSFSLADSKTSLLAMSVDILLKQIKNVINRDLVAQTYALNMWDDEEHVQITY 399 (488) T ss_pred HHHHHHHHHHHHHHHHhccccccccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccEEEe Confidence 4433 555566665542211 11 1 12222 1233456667778888999999999887753 1122245555 Q ss_pred echhhhccCHHHHHHHHH-HHhCCCCCH-----HHHHHHhCCCCCCCCCeeeecCceeec-CCCC Q lcl|NC_019422. 327 EASNLQYASMSTKLNLVQ-MVDRGSLTP-----NEWRKIMNLSPIENGDKPVRRLDTAVV-EGGE 384 (384) Q Consensus 327 d~~~~~~~d~~~~~~~~~-~~~~g~~t~-----NE~R~~lG~~p~~~gd~~~~~~n~~~~-~~ge 384 (384) +.....|+++.++.++ ++..|+.-+ +.+|+.+|+|+.+.+.....+.-..+- ..|+ T Consensus 400 --~~~e~~Dl~~~ae~~~~L~~~G~~i~~~~~~~~i~e~~gip~~~~~e~~~~~~~~~~~~~~~~ 462 (488) T protein:vir:95 400 --DDIETPDLEAIGSYIQKTVAVGALEVDKELSNKLREHIGLPPADESQPVSEKLSPNSQSRSGD 462 (488) T ss_pred --cCcChhhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCccccccCCCCCCCCCCc Confidence 4445667788887775 778887654 679999999976655544433321110 1111 No 130 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=99.50 E-value=1.2e-13 Score=91.44 Aligned_cols=372 Identities=11% Similarity=0.035 Sum_probs=202.2 Q ss_pred CcchhhhcccCCC--cch-----h----H----HHhhccccCccee--------------chhhhhhcHHHHHHHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEA--PGK-----V----M----MELISDSGNGFYS--------------WHGNLYKSDIVRSIIRPKAK 51 (384) Q Consensus 1 M~~f~~~~~~~~~--~~~-----~----~----~~~~~~~~~~~~~--------------~~~~~~~~~~v~~~i~~ia~ 51 (384) |- ++.++... +.+ + + ..+++....+... .-..++..+.|.+|++.+.. T Consensus 1 m~---k~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~~~~ly~~m~~D~hi~s~l~~Rk~ 77 (448) T protein:vir:79 1 MA---KRGRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKDGLLVYHKMLSDGTVKNALNYIFG 77 (448) T ss_pred CC---CCCCCCccccCcccccccccchhhhhhhhhhcccccccccccchhHhhccccchHHHHHHhhChHHHHHHHHHHH Confidence 32 21111100 000 0 0 0111111111110 01224557899999999999 Q ss_pred hhccCceEEEEecCCcceeccchHHHHHHhhccc---cCCHHHHHHHHHHHHHHhCCeeEEEeeC--CCCc--eeeEEEE Q lcl|NC_019422. 52 AVGKMTAKHIRSNETEFKTNPEIYIKFLLENPNP---FMSGQILQEKMVTQLELNSNAFAVIIKD--DYNM--PTQIYPL 124 (384) Q Consensus 52 ~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~---~~s~~~f~~~~~~~~l~~G~~~~~~~~~--~~g~--~~~l~~l 124 (384) .|.+++|.|...+++.......+.....+..++. ..++.+++..+ .+.+.+|-+++++++. ..|. +..+.+. T Consensus 78 av~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~~~~f~~~~~~~-lda~~~G~s~~Eivw~~~~~g~~~~~~l~~r 156 (448) T protein:vir:79 78 RIRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIY-ENAYIYGMAAGEIVLTLGADGKLILDKIVPI 156 (448) T ss_pred HHhcCCceEecCCCCHHHHHHHHHHHHHhhhhhhhhccCCHHHHHHHH-HHhhhhcceeEEEEeeecCCCceeccccccc Confidence 9999999996544332222222222333334432 23555666665 5578999999999974 3453 4456666 Q ss_pred cCce---EEEEEcCCCEEEEEEEc-------CceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 125 NALN---VEAIYENEVLFLKFLLR-------NGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVK 194 (384) Q Consensus 125 ~~~~---v~~~~~~~~~~~~~~~~-------~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~ 194 (384) ++.+ +.+..+. +..+..... +...+.+|..-++|..+ ......+|.+.+..|.-..-.-....+.-.. T Consensus 157 ~~~~~~~f~~~~d~-~l~~~~~~~~~~~~~~~~~~~~lP~~~~i~~~~-~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~ 234 (448) T protein:vir:79 157 HPFNIDEVLYDEEG-GPKALKLSGEVKGGSQFVSGLEIPIWKTVVFLH-NDDGSFTGQSALRAAVPHWLAKRALILLINH 234 (448) T ss_pred CCccccceeeecCC-ceEEeecCCcccccccCCCccccccceEEEEec-CccCCcccchhHHHHHHHHHHHHHHHHHHHH Confidence 6653 3333332 232222111 11223457778888864 3445679999999999999999999999999 Q ss_pred HHHccCCcceEEeeCCCCC--hHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHH Q lcl|NC_019422. 195 AIKNSNTIKWLLKFKTALR--PDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYS 271 (384) Q Consensus 195 ~~~ng~~p~~il~~~~~~~--~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~ 271 (384) |.+.-|.|--+.+++...+ +++.+++.+.. ..+.+ .....++++.|++++-++......++.++ ++.-++|+. T Consensus 235 f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av-~~i~~---g~~a~~iiP~~~~ie~~ea~~~~~~~~~~i~~~d~~Isk 310 (448) T protein:vir:79 235 GLERFMIGVPTLTIPKSVRQGTKQWEAAKEIV-KNFVQ---KPRHGIILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIAR 310 (448) T ss_pred HHHHcCCceEEEecCCCCCcCHHHHHHHHHHH-HHHhc---CCceEEEecCCceEEEEecCCCcccHHHHHHHHHHHHHH Confidence 9999999988888875544 34444444433 22322 12234677888887777655443334333 555556655 Q ss_pred HhCCCH-HH-h-ccccHH--HHHHHHHHHHHHHHHHHHHHHHhhcccCcc------cccCcceEEeechhhhccCHHHHH Q lcl|NC_019422. 272 FFNTNE-KI-I-QSKYSE--DEWNAYYESEIEPVGLQLSNQYTEKLFTRK------ARSFGNEIVFEASNLQYASMSTKL 340 (384) Q Consensus 272 ~fgvp~-~~-l-~~~~~e--~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~------~~~~~~~i~fd~~~~~~~d~~~~~ 340 (384) +.-=.- .. - +++.+. ........+.+...++.+++.||+.|+... .....+++.|+.. ...|+++.+ T Consensus 311 ~iLGqtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~li~~l~~lNfg~~~~~P~~~f~~~--e~~Dl~~~a 388 (448) T protein:vir:79 311 ALGIDFNTVQLNMGVQAINIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPSATRFPRLTFEME--ERNDFSAAA 388 (448) T ss_pred HHhhhhhccccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcCCCcEEEecCC--ChHHHHHHH Confidence 442110 00 1 111111 122344556677888999999999887753 1122356666544 445777777 Q ss_pred HHHH-HHhCCCCCHHHHHHHhCCCCCCCCCeeeecCce-eecCCCC Q lcl|NC_019422. 341 NLVQ-MVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDT-AVVEGGE 384 (384) Q Consensus 341 ~~~~-~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~-~~~~~ge 384 (384) +..+ ++..+-...+-+|+.+|+|....++....+.-. .+-++++ T Consensus 389 ~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~~~~a~~~~~~~~~~~~ 434 (448) T protein:vir:79 389 NLMGMLINAVKDSEDIPTELKALIDALPSKMRRALGVVDEVREAVR 434 (448) T ss_pred HHhhhhhccchhhHHHHHHhhcCCCCCCCccccccCCCCccccccc Confidence 7665 444444344457888999854333333322211 1112222 No 131 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.49 E-value=6.5e-14 Score=92.87 Aligned_cols=382 Identities=9% Similarity=-0.024 Sum_probs=204.7 Q ss_pred Ccchhhhccc---CCCcchhH------------HHhhccccCcc---------------eechhhhhhcHHHHHHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKN---KEAPGKVM------------MELISDSGNGF---------------YSWHGNLYKSDIVRSIIRPKA 50 (384) Q Consensus 1 M~~f~~~~~~---~~~~~~~~------------~~~~~~~~~~~---------------~~~~~~~~~~~~v~~~i~~ia 50 (384) |.--.+.-.. ........ .....++.... ....+-+.+++.+..+|+.+. T Consensus 2 ~~~~~r~~~~~a~~~~~~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~~~~ 81 (553) T protein:vir:63 2 TKVTVRKLSEVTSGRPEQSASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVGYQR 81 (553) T ss_pred cchhhhhhcccccccchhhhhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 0000000000 00000000 00000000000 011123467899999999998 Q ss_pred HhhccCceEEEEec-------CCcc--e---eccchHHHHHHhhcc------ccCCHHHHHHHHHHHHHHhCCeeEEEee Q lcl|NC_019422. 51 KAVGKMTAKHIRSN-------ETEF--K---TNPEIYIKFLLENPN------PFMSGQILQEKMVTQLELNSNAFAVIIK 112 (384) Q Consensus 51 ~~ia~~~~~~~~~~-------~~~~--~---~~~~~~~~~l~~~PN------~~~s~~~f~~~~~~~~l~~G~~~~~~~~ 112 (384) +.+-.-.+++.-.- .++. + .........+...+| ..+++..+...++..++..|++|+.+.+ T Consensus 82 ~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~ 161 (553) T protein:vir:63 82 DSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLATAEW 161 (553) T ss_pred HhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEEEeee Confidence 88866667663221 1111 1 111222334444443 4568889999999999999999999876 Q ss_pred CCC-C--ceeeEEEEcCceEEEE--------------EcCCCEEEEEEEcC---c----------------eEEEEehhh Q lcl|NC_019422. 113 DDY-N--MPTQIYPLNALNVEAI--------------YENEVLFLKFLLRN---G----------------KIVSYPYSD 156 (384) Q Consensus 113 ~~~-g--~~~~l~~l~~~~v~~~--------------~~~~~~~~~~~~~~---g----------------~~~~~~~~e 156 (384) ... | .+..|-.|+|..+... .+..+..+.|.... | ....++.++ T Consensus 162 ~~~~~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GVE~d~~Gr~vaY~i~~~hPgd~~~~~~~~~~~~r~~~~~~v~a~~ 241 (553) T protein:vir:63 162 DRAANRPYATCFQMVSTDRLSNPYQQLDTPTLRRGVQYDKRGRPQGYWIQVAHPGDLYQMAPDMYKWKFVQQSKPWGRRQ 241 (553) T ss_pred ccCCCCcccceEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEeeccCCCccccccccccceeeeccccccChhH Confidence 533 2 2457788888877432 23333333333211 1 123467899 Q ss_pred eEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHH--------------- Q lcl|NC_019422. 157 IIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEV--------------- 221 (384) Q Consensus 157 vih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~--------------- 221 (384) |||+..+-..+...|+|.+.+++..+..............+=.+...++|+.+.. +++..... T Consensus 242 vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~a~fi~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 319 (553) T protein:vir:63 242 VIHILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASYAAAIESELP--PEFIHSQMSGGSPNADMVGIFGK 319 (553) T ss_pred heecccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCC--hhhhhhhcccccccccccccccc Confidence 9999988778889999999999999999988888777777777777777775421 11111110 Q ss_pred --HHHHHHhcc---ccccCCcceecCCCceeeecccchhHHHHHHH-HHHHHHHHHHhCCCHHHhccccHH--------- Q lcl|NC_019422. 222 --KSFEKNYLQ---IDSEAGGAAATDSKYDAEQVKAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQSKYSE--------- 286 (384) Q Consensus 222 --~~~~~~~~~---~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~~~~~e--------- 286 (384) +.......+ ..-..|.+..|..|.+++.++.+....++... +.+.+.||+++|||.+.|.++.++ T Consensus 320 ~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~nYSS~R~~ 399 (553) T protein:vir:63 320 YMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKPMGTPGGVGSEFEASLNRHLASAFGMSYEEFTRDFSKANYSSIQAG 399 (553) T ss_pred cccccccccccccceeecCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHH Confidence 000000000 11134678889999999998887656666544 678999999999999998654221 Q ss_pred ---------HHHHHHHHHHHHHHHHH-HHHHHhhcccCccc------------ccCcceEEeechhhhccCHHHHHHH-H Q lcl|NC_019422. 287 ---------DEWNAYYESEIEPVGLQ-LSNQYTEKLFTRKA------------RSFGNEIVFEASNLQYASMSTKLNL-V 343 (384) Q Consensus 287 ---------~~~~~~~~~~i~P~~~~-i~~~l~~~l~~~~~------------~~~~~~i~fd~~~~~~~d~~~~~~~-~ 343 (384) ..+..|....++|+.+. +++++-...++-.. +.....+++-.-.....|+.+-++. . T Consensus 400 ~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~ 479 (553) T protein:vir:63 400 IAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQAAV 479 (553) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecCCccccChHHHHHHHH Confidence 11112444455665444 33333332222111 0111123333334445565555444 4 Q ss_pred HHHhCCCCCHHHHHHHhCCCCCCC----------CCeeeecCc-----------eee------------cCCCC Q lcl|NC_019422. 344 QMVDRGSLTPNEWRKIMNLSPIEN----------GDKPVRRLD-----------TAV------------VEGGE 384 (384) Q Consensus 344 ~~~~~g~~t~NE~R~~lG~~p~~~----------gd~~~~~~n-----------~~~------------~~~ge 384 (384) .++.+|+.|.-|+-...|.++-+- .+++=++.+ ... -+.|| T Consensus 480 ~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (553) T protein:vir:63 480 MRIDAGLSTYEREIARLGGDFRKSFAQRAREDALLKKYGLTFNLSAKRSLGDGRDAATGIAEDPAAAQTSQQGE 553 (553) T ss_pred HHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCCCccccCCCcccCCCCCCCCCCCCcccccC Confidence 579999999988876666665310 011100000 000 01111 No 132 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.45 E-value=1.2e-13 Score=91.51 Aligned_cols=379 Identities=11% Similarity=0.016 Sum_probs=199.7 Q ss_pred CcchhhhcccCCCcc--hhH---HHhh------ccccCccee--------------chhhhhhcHHHHHHHHHHHHhhcc Q lcl|NC_019422. 1 MNIFKSKKKNKEAPG--KVM---MELI------SDSGNGFYS--------------WHGNLYKSDIVRSIIRPKAKAVGK 55 (384) Q Consensus 1 M~~f~~~~~~~~~~~--~~~---~~~~------~~~~~~~~~--------------~~~~~~~~~~v~~~i~~ia~~ia~ 55 (384) |||+.+--....... ... .+.. ..+ ...+. ..+-+.+++.+..+|+.+.+.+-. T Consensus 1 m~~~~~~~~a~~~~~~~~~~~~~y~aa~~~~~~~~~-~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~~vVG 79 (495) T protein:vir:10 1 MNMTPSGYQSLASGLLVPVGASAYEGASGGHRWQDI-GDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWVAAAVG 79 (495) T ss_pred CCcccccccccchhhhhHHHhhhhhccccCcccCCC-CCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcC Confidence 999886321111100 000 0000 000 00110 112345689999999999888855 Q ss_pred CceEEEEecCC-cceeccchHHHHHHhhcc--ccCCHHHHHHHHHHHHHHhCCeeEEEeeCC--CC--ceeeEEEEcCce Q lcl|NC_019422. 56 MTAKHIRSNET-EFKTNPEIYIKFLLENPN--PFMSGQILQEKMVTQLELNSNAFAVIIKDD--YN--MPTQIYPLNALN 128 (384) Q Consensus 56 ~~~~~~~~~~~-~~~~~~~~~~~~l~~~PN--~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~--~g--~~~~l~~l~~~~ 128 (384) -.|+..-..++ .....-......+..++. ..+++..+.+.++..++..|++|+.+.+.. .| .+..|-.|+|.. T Consensus 80 ~Gi~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~~~~~~lqliepd~ 159 (495) T protein:vir:10 80 NGLTPRWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEGLSVPLQLQIIEPDM 159 (495) T ss_pred CCcccccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccCCCCccceEEEEechhh Confidence 55554222211 111112223333344432 457899999999999999999999887543 22 467888888888 Q ss_pred EEEE-----------------EcCCCEEEEEEE---cCc---------eEEEEehhheEEEeccCCCCCccCccHHHHHH Q lcl|NC_019422. 129 VEAI-----------------YENEVLFLKFLL---RNG---------KIVSYPYSDIIHLRKDFNENDLFGTSPAKVLE 179 (384) Q Consensus 129 v~~~-----------------~~~~~~~~~~~~---~~g---------~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~ 179 (384) +.-- .+..+....|.. ..| ..+.+|+++|+|+. ........|+|.+..+. T Consensus 160 l~~~~~~~~~~~g~~i~~GIe~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~~rvpA~~vlH~f-~~r~gQ~RGis~la~i~ 238 (495) T protein:vir:10 160 LASDIPDETLPSGGYVKGGIRFSNGGKRKAYCFYRNHPAESSLIGDPVDTVWIKAEHVLHVT-VLTVRSDAGAPWFQLLL 238 (495) T ss_pred cCCCCCCCCCCCCCEEEeceEECCCCceEEEEEeecCCCcccccccccceeeechhheEecc-ccCCCcccCcchhHHHH Confidence 7521 111122222221 111 35668999999996 44456789999876554 Q ss_pred HHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCC--hHHHH-HHHHHHHHHhccccccCCcceecCCCceeeecccchhH Q lcl|NC_019422. 180 PIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALR--PDDIK-KEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYV 256 (384) Q Consensus 180 ~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~--~e~~~-~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~ 256 (384) .+..............+-.+...++|+.+..-. .+... ...+.-.....+ -..|.+..+..|.+++.++.+... T Consensus 239 -~l~~l~~y~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~--l~pG~i~~L~pGe~i~~~~p~~p~ 315 (495) T protein:vir:10 239 -RLNELDQYEDAELVRKKTAALFAAFIQEATADSTGGPTIGQPKRSKGGKRITG--LNPGTLQYLQPGQEVKFSNPADVG 315 (495) T ss_pred -HHHHhhHHHHHHHHHHHHhhhheeeeecCCCccccccccCccccccCccccee--cCCceeeecCCCCeeeeeCCCCCC Confidence 466666666665555555666667776532110 00000 000000001111 234678889999999999887555 Q ss_pred HHHHH-HHHHHHHHHHHhCCCHHHhccccHH-----------HHHH--------HHHHHHHHHHHHH-HHHHHhhcccCc Q lcl|NC_019422. 257 PNAAQ-MDKAIQRLYSFFNTNEKIIQSKYSE-----------DEWN--------AYYESEIEPVGLQ-LSNQYTEKLFTR 315 (384) Q Consensus 257 ~~~~~-~~~~~~~I~~~fgvp~~~l~~~~~e-----------~~~~--------~~~~~~i~P~~~~-i~~~l~~~l~~~ 315 (384) ..+.. .+.+.+.||+++|||.+.|.++.++ +... -++...++|+.+. ++.++-...++. T Consensus 316 ~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~ 395 (495) T protein:vir:10 316 TTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVNYSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVVI 395 (495) T ss_pred CCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCC Confidence 56654 4678899999999999998654221 1111 1233445565444 444444332221 Q ss_pred --cc--ccCcceEEeechhhhccCHHHHHHH-HHHHhCCCCCHHHHHHHhCCCCCCC-------------------CCee Q lcl|NC_019422. 316 --KA--RSFGNEIVFEASNLQYASMSTKLNL-VQMVDRGSLTPNEWRKIMNLSPIEN-------------------GDKP 371 (384) Q Consensus 316 --~~--~~~~~~i~fd~~~~~~~d~~~~~~~-~~~~~~g~~t~NE~R~~lG~~p~~~-------------------gd~~ 371 (384) +. +.....++|-.-.....|+.+-++. ..++.+|+.|.-|+-...|.++-+- .|-. T Consensus 396 p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~p~ 475 (495) T protein:vir:10 396 PDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAGFAPISDKQAERGYDMEELFDMISDANQLIDEYDLRLDSDPR 475 (495) T ss_pred CCchhhhHhhhccccccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCCC Confidence 10 1111223343344445565555544 4579999999988876666665310 1111 Q ss_pred eecCcee---ecCC----CC Q lcl|NC_019422. 372 VRRLDTA---VVEG----GE 384 (384) Q Consensus 372 ~~~~n~~---~~~~----ge 384 (384) ..+..-. +.++ .| T Consensus 476 ~~~~~~~~~~~~~~~~~~~e 495 (495) T protein:vir:10 476 YVNGSGAEQKSVMEAALNNE 495 (495) T ss_pred cCCCccCCCCCCCCCCCCCC Confidence 1111111 1111 11 No 133 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=99.45 E-value=9.2e-13 Score=86.58 Aligned_cols=366 Identities=10% Similarity=0.018 Sum_probs=200.8 Q ss_pred hhhhcccCCCcch-------h--------HHHhhccccCccee--------------chhhhhhcHHHHHHHHHHHHhhc Q lcl|NC_019422. 4 FKSKKKNKEAPGK-------V--------MMELISDSGNGFYS--------------WHGNLYKSDIVRSIIRPKAKAVG 54 (384) Q Consensus 4 f~~~~~~~~~~~~-------~--------~~~~~~~~~~~~~~--------------~~~~~~~~~~v~~~i~~ia~~ia 54 (384) +-+|++......+ . ...+.+..+.+... .-..++..+.|.+|++.+...|. T Consensus 1 m~kk~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~~~~ly~~m~~D~hi~s~l~~Rk~av~ 80 (448) T protein:vir:77 1 MAKRGRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKDGLLVYHKMLSDGTVKNALNYIFGRIR 80 (448) T ss_pred CCCCCCCCcccCCcccccchhhhhhhccchhhhcccccccccccchhHhhccccchHHHHHHhhChHHHHHHHHHHHHHh Confidence 3333222211000 0 01111111111110 01224557899999999999999 Q ss_pred cCceEEEEecCCcceeccchHHHHHHhhcc---ccCCHHHHHHHHHHHHHHhCCeeEEEeeC--CCCc--eeeEEEEcCc Q lcl|NC_019422. 55 KMTAKHIRSNETEFKTNPEIYIKFLLENPN---PFMSGQILQEKMVTQLELNSNAFAVIIKD--DYNM--PTQIYPLNAL 127 (384) Q Consensus 55 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN---~~~s~~~f~~~~~~~~l~~G~~~~~~~~~--~~g~--~~~l~~l~~~ 127 (384) +++|.|...+++.............+..+. ...++.+++..+ .+.+.+|-+++++++. ..|. +..+.+.++. T Consensus 81 ~~~w~v~p~~~~~~d~~~ae~v~~~l~~~~~~~~~~~f~~~i~~~-lda~~~G~s~~Eivw~~~~dg~~~~~~l~~r~~~ 159 (448) T protein:vir:77 81 SAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIY-ENAYIYGMAAGEIVLTLGADGKLILDKIVPIHPF 159 (448) T ss_pred cCCceEecCCCCHHHHHHHHHHHHHhhchhhhhccCCHHHHHHHH-HHhhhhcceeEEEEEeecCCCceeeccccccCCC Confidence 999998654433222222222233333322 234677788887 5788999999999875 3453 4456666664 Q ss_pred eE---EEEEcCCCEEEEEEEc-------CceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 128 NV---EAIYENEVLFLKFLLR-------NGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIK 197 (384) Q Consensus 128 ~v---~~~~~~~~~~~~~~~~-------~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ 197 (384) ++ .+..+ ++..+..... ......+|...++|..+. .....+|.+.+..|.-..-.-....++-..|.+ T Consensus 160 ~~~~f~~~~~-~~l~~~~~~~~~~~~~~~~~~~~lP~~~~i~~~~~-~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E 237 (448) T protein:vir:77 160 NIDEVLYDEE-GGPKALKLSGEVKGGSQFVNGLEIPIWKTVVFLHN-DDGSFTGQSALRAAVPHWLAKRALILLINHGLE 237 (448) T ss_pred ccceeeeecC-CceEEEecCCcccccccCCCccccccceEEEEecC-CcCCcccchHHHHHHHHHHHHHhhHHHHHHHHH Confidence 33 22222 2222222111 112344577788888653 445679999999999988888888999999999 Q ss_pred ccCCcceEEeeCCCCC--hHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHH-HHHHHHHHHHHhC Q lcl|NC_019422. 198 NSNTIKWLLKFKTALR--PDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQ-MDKAIQRLYSFFN 274 (384) Q Consensus 198 ng~~p~~il~~~~~~~--~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~-~~~~~~~I~~~fg 274 (384) .-|.|--+.+++...+ +++.+++.+.. ..+.++ .+ ..++++.|++++-++......++.+ +++.-++|+.+.. T Consensus 238 ~yG~P~~vgky~~ga~~~~~~~~~l~~av-~~i~~g-~~--a~~iiP~g~~ie~~ea~~~~~~~~~~i~~~d~~Isk~iL 313 (448) T protein:vir:77 238 RFMIGVPTLTIPKSVRQGTKQWEAAKEIV-KNFVQK-PR--HGIILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIARALG 313 (448) T ss_pred HcCCceeEEecCCCCCCCHHHHHHHHHHH-HHHhcC-Cc--eEEEecCCceEEEEecCCCccCHHHHHHHHHHHHHHHHh Confidence 9999988888875543 34545544443 333221 22 3466788888776665443333333 3565666766553 Q ss_pred CCHHH--hc-cccH-H-HHHHHHHHHHHHHHHHHHHHHHhhcccCcc-----c-ccCcceEEeechhhhccCHHHHHHHH Q lcl|NC_019422. 275 TNEKI--IQ-SKYS-E-DEWNAYYESEIEPVGLQLSNQYTEKLFTRK-----A-RSFGNEIVFEASNLQYASMSTKLNLV 343 (384) Q Consensus 275 vp~~~--l~-~~~~-e-~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~-----~-~~~~~~i~fd~~~~~~~d~~~~~~~~ 343 (384) -.--- -+ ++.. . ..........+...++.+++.||++|+... . ....+++.|+..+ ..|+++.++.. T Consensus 314 GqtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~Li~~l~~lNfg~~~~~P~~~f~~~e--~eDl~~~a~~~ 391 (448) T protein:vir:77 314 IDFNTVQLNMGVQAVNIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPGATRFPRLTFEMEE--RNDFSAAANLM 391 (448) T ss_pred ccccccccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCEEEecCCC--hhhHHHHHHHh Confidence 21100 11 1111 1 222345566677888899999999887653 1 1223566766543 45777777666 Q ss_pred HHHhCCCCCHHHHHHHhCCCCCCCCCeeeecCce---eecCCCC Q lcl|NC_019422. 344 QMVDRGSLTPNEWRKIMNLSPIENGDKPVRRLDT---AVVEGGE 384 (384) Q Consensus 344 ~~~~~g~~t~NE~R~~lG~~p~~~gd~~~~~~n~---~~~~~ge 384 (384) +.+. +-+|+.+|+|.-.+....-.|... ..+.+.+ T Consensus 392 ~~l~------~~~~~~~~ip~~~~~~~~~~~~~~~~~~~~~~~~ 429 (448) T protein:vir:77 392 GMLI------NAVKDSEDIPTELKALIDALPSKMRRALGVVDEV 429 (448) T ss_pred HHHH------HHHHHHhcCCccCCcCCCCCchhcccccCCCCCC Confidence 5332 458999999753221111111111 1122222 No 134 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=99.30 E-value=7.7e-12 Score=81.51 Aligned_cols=358 Identities=9% Similarity=-0.028 Sum_probs=164.3 Q ss_pred CcchhhhcccCCCcchhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceecc----chHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNP----EIYI 76 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~----~~~~ 76 (384) |++.+-..+-.......+..+-....-.+......|..+...+.+|+.+++.+-.--..+.... +..+... +... T Consensus 23 d~l~~~~~glg~~r~~~~~~~g~~~~~~~~~l~~~Yr~~~ia~~iVd~~~d~~~~~~~~i~~g~-~~~~~~~~~~~e~~~ 101 (449) T protein:vir:10 23 MGLMVPTMGLDNKRHSAWCEYGFPELVTYENLYSLYRRGGIAHGAVEKLVGKCWQTNPEIIEGD-DADDSEDETSWEKKS 101 (449) T ss_pred HHHHHHHhcCCcccchhhhhcCCcccCCHHHHHHHHhcCchhHHHHHhhhhhhhhcCcccccCc-cccchhhhHHHHHHH Confidence 3333321111111111111100000000001112344577888999999986532211222111 1111111 1111 Q ss_pred HHHHhhccccCCHHHHHHH---HHHHHHHhCCeeEEEee-CCC---------CceeeEEEEcCceEEEEEc-CC------ Q lcl|NC_019422. 77 KFLLENPNPFMSGQILQEK---MVTQLELNSNAFAVIIK-DDY---------NMPTQIYPLNALNVEAIYE-NE------ 136 (384) Q Consensus 77 ~~l~~~PN~~~s~~~f~~~---~~~~~l~~G~~~~~~~~-~~~---------g~~~~l~~l~~~~v~~~~~-~~------ 136 (384) ..|+. ..+|.. ....-.++|-+++++.. +.. +.+..+.|+....+++... .+ T Consensus 102 ~~l~~--------~~~~~~l~ea~~~~rl~Gga~i~i~v~d~~~l~~Pl~~~~~i~~i~v~~~~~i~~~~~~~dp~sp~y 173 (449) T protein:vir:10 102 KQVFT--------NRLWRSFAEADRRRLVGRYAGILLHIRDEKDWNLPATKGRGLQKVSVSWAGSLKVAEWDTGINSKTY 173 (449) T ss_pred HHHHH--------HHHHHHHHHHHHhhhccCcEEEEEEecCCCCCCcccccCcceeeEEeeccccCChhhhhcCCCCCCC Confidence 22221 134443 33444567877777643 322 2355566665544443211 11 Q ss_pred CEEEEEEEc----C--ceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHH-HHHHHHHHccCC-------- Q lcl|NC_019422. 137 VLFLKFLLR----N--GKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTD-QGVVKAIKNSNT-------- 201 (384) Q Consensus 137 ~~~~~~~~~----~--g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~-~~~~~~~~ng~~-------- 201 (384) +....|... + +..+.+-++-|+||.. .+.-|.|.++++...+....... .+...++++..+ T Consensus 174 g~P~~y~v~~~~~g~~~~~~~iH~SRl~~~~~----~~~~g~~~L~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~ 249 (449) T protein:vir:10 174 GQPKLWKYTERLPNGSSRRVDIHPDRVFILGD----YSEDAIGFLEPAYNAFVSLEKVEGGSGESFLKNAARQLNVNFEK 249 (449) T ss_pred CCceEEEEeeeccCCCccceeeccceeEeecC----CCCCChhHHHHHHHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhh Confidence 222222221 1 1223344566777732 23448888888877654333322 233333333211 Q ss_pred ---cceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHH Q lcl|NC_019422. 202 ---IKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEK 278 (384) Q Consensus 202 ---p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~ 278 (384) ..++....+....+..+++.+......+ +.+ .+.++.+.+|+.++.+..+..-. +.....+||++-|||.. T Consensus 250 ~~~~~~l~~~~~~~~e~~~~~~~~~~~~~~~----~~~-~~~i~~~~d~~~~~~~~sgl~d~-l~~~~q~iaaa~~IP~t 323 (449) T protein:vir:10 250 EIDFTNLASLYGVSIDELQDKFNEVAGEINR----GND-VLMTTQGATVTPLVTSVADPTAT-YNVNLQTAAAGVDIPTR 323 (449) T ss_pred hhhhhhhhHHhhCCchHHHHHHHHHHHHHhc----cch-heeecCCcceEEEecccCChhHH-HHHHHHHHHHHhCCCee Confidence 1111111111122222333333222111 122 34567777899888877765432 34466779999999998 Q ss_pred Hhccc-----cHHHHHHHHH------HHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHH---- Q lcl|NC_019422. 279 IIQSK-----YSEDEWNAYY------ESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLV---- 343 (384) Q Consensus 279 ~l~~~-----~~e~~~~~~~------~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~---- 343 (384) .|-|. ++.+...+|+ +.-+.|.++.+-+.|-+.-+... ...+.|.+.+|...+.++++++. T Consensus 324 ~L~Gqsp~glnst~D~~nyyd~i~~~Q~~l~p~le~l~~~l~~s~~g~~----~~d~~i~f~pL~~~t~kEkAei~k~~A 399 (449) T protein:vir:10 324 ILIGNQQAERSSTEDQKYFNARCQSRRVDLSFEIEDFCDKLIELKIIDA----VAKKAVIWDDLNEQTGTEKLTNAKTMG 399 (449) T ss_pred eeeccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCC----CCceeEEeCCCCCCCHHHHHHHHHHHH Confidence 76321 1112223332 23477888777777655433321 23577778899888888887643 Q ss_pred ---H-HHhCC---CCCHHHHHHHhCCCCCCCCCeeeecCceee-cCCCC Q lcl|NC_019422. 344 ---Q-MVDRG---SLTPNEWRKIMNLSPIENGDKPVRRLDTAV-VEGGE 384 (384) Q Consensus 344 ---~-~~~~g---~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~-~~~ge 384 (384) + ++..| +++++|+|+.+|++|..+ +.+. ....+ .+++. T Consensus 400 ~a~~~~~~ag~~~~~~~~EiR~~~~~~~~~~-~~~~--~e~~de~~~~~ 445 (449) T protein:vir:10 400 EINQTMLGSGDNPAFSREEIRTAAGYDNDDE-EPLG--EEDGDEEDKAT 445 (449) T ss_pred HHHHHHHHccccCCcCHHHHHHHhcccCCCC-CCCC--CCCCccccccC Confidence 2 34444 999999999999998642 2110 00010 01111 No 135 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=99.30 E-value=4.8e-12 Score=82.62 Aligned_cols=367 Identities=14% Similarity=0.073 Sum_probs=191.4 Q ss_pred Ccc---hhhhcccCCCcchhHH----H-----------hhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_019422. 1 MNI---FKSKKKNKEAPGKVMM----E-----------LISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIR 62 (384) Q Consensus 1 M~~---f~~~~~~~~~~~~~~~----~-----------~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~ 62 (384) .++ |.-.-+.....+.... + +.+..+.||.. =..+-+.|.+++|+..||+.+.+- |.-.. T Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~-la~laQ~~eyr~~~~~ia~e~~R~-w~~~~ 144 (695) T protein:vir:78 67 LRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPT-LVLLAQLPEYRAMHEVLADECIRT-WGEAI 144 (695) T ss_pred cccceeceeccccCCccccchhhhhhcccccccccchhhhccCcchHHH-HHHHhhccchhhHHHHHHHHhhcc-cceec Confidence 111 1111111111111000 0 00001111100 012345788899999999888765 42211 Q ss_pred ec-C------------CcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEee-C--------------- Q lcl|NC_019422. 63 SN-E------------TEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIK-D--------------- 113 (384) Q Consensus 63 ~~-~------------~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~-~--------------- 113 (384) .+ + ++.+..+.+....|..+-....-...|.+ .+.+--++|-+.+++.- . T Consensus 145 ~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erL~V~~~l~e-aik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~ 223 (695) T protein:vir:78 145 GGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRT-TVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYT 223 (695) T ss_pred cccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHH-HHHhhccccceEEEEEeccCcccccccccccccc Confidence 11 0 11111111333334333322222223444 44555566766555432 1 Q ss_pred -CCCceeeEEEEcCceEEEEEcCC--------CEEEEEEEcCceEEEEehhheEEEeccC------CCCCccCccHHHHH Q lcl|NC_019422. 114 -DYNMPTQIYPLNALNVEAIYENE--------VLFLKFLLRNGKIVSYPYSDIIHLRKDF------NENDLFGTSPAKVL 178 (384) Q Consensus 114 -~~g~~~~l~~l~~~~v~~~~~~~--------~~~~~~~~~~g~~~~~~~~evih~~~~~------~~~~~~G~s~~~~~ 178 (384) ..|....|.+++|.++++....- +..-+|.. .|+.+ =.+-++.|.... +.....|+|..+.+ T Consensus 224 I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V-~G~kI--H~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~ 300 (695) T protein:vir:78 224 VPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWM-IGTEV--HATRLHTIVSRPVGDMLKPTYSFAGISMTQLA 300 (695) T ss_pred ccCcceeeeEeecccccccchhhhccchhhccCCCceEEE-eceEE--eeeeEEEecCCCchhhhhcccccCcccHHHHH Confidence 13567779999999998854321 11112222 23322 222232232111 11235799999999 Q ss_pred HHHHHHHHHHHHHHHHHHHccCCcceEEeeC--CCCChHHHHHHH--HHHHHHhccccccCCcceecC-CCceeeecccc Q lcl|NC_019422. 179 EPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK--TALRPDDIKKEV--KSFEKNYLQIDSEAGGAAATD-SKYDAEQVKAE 253 (384) Q Consensus 179 ~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~--~~~~~e~~~~~~--~~~~~~~~~~~~~~~~~~v~~-~g~~~~~l~~~ 253 (384) ...+................-.. .++ +++ ..+.+.....+. -++.++++ +..++++++ +..+|.+.+.+ T Consensus 301 ~e~V~~~~rT~~~v~~Li~~~~v-~~l-k~dla~~L~~g~~~~l~~R~eli~~~R----sn~G~~llDk~~Eefeq~sts 374 (695) T protein:vir:78 301 MPYIDNWLRTRQSVSDIVKQFSV-SGI-LMDLAQALMPGANVDLSMRAELINRYR----DNRNILFLDKATEEFFQFNTP 374 (695) T ss_pred HHHHHHHHHHHhHHHHHHHhhhh-HHH-HHHHHHhhcChhHHHHHHHHHHHHHhc----CccceEEEecCCcceEEEecc Confidence 99999988877777777655332 222 222 112222222222 23333342 334577888 57899988866 Q ss_pred hhHHHHHHHHHHHHHHHHHhCCCHHHhcc-------ccHHHHHHH-------HHHHHHHHHHHHHHHHHhhcccCccccc Q lcl|NC_019422. 254 SYVPNAAQMDKAIQRLYSFFNTNEKIIQS-------KYSEDEWNA-------YYESEIEPVGLQLSNQYTEKLFTRKARS 319 (384) Q Consensus 254 ~~~~~~~~~~~~~~~I~~~fgvp~~~l~~-------~~~e~~~~~-------~~~~~i~P~~~~i~~~l~~~l~~~~~~~ 319 (384) ...++-.. ......||.+-+||...|-| ++.|....+ .....+.|.++.+-+.|-+..|... T Consensus 375 lSGLddVi-~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~i--- 450 (695) T protein:vir:78 375 LSGLDALQ-AQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAV--- 450 (695) T ss_pred cCCHHHHH-HHHHHHHHhhhcCchhhhhccCCccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--- Confidence 65554321 34567899999999876532 233433333 4456789999998888887777653 Q ss_pred CcceEEeechhhhccCHHHHHHHH-------H-HHhCCCCCHHHHHHHhCCCCC-------CCCCeeeecCce------- Q lcl|NC_019422. 320 FGNEIVFEASNLQYASMSTKLNLV-------Q-MVDRGSLTPNEWRKIMNLSPI-------ENGDKPVRRLDT------- 377 (384) Q Consensus 320 ~~~~i~fd~~~~~~~d~~~~~~~~-------~-~~~~g~~t~NE~R~~lG~~p~-------~~gd~~~~~~n~------- 377 (384) ...|.|.+..|.+.+.++++++. + .+..|+++++|+|..|.-+|- +-.|++-+|... T Consensus 451 -dpdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~ 529 (695) T protein:vir:78 451 -DPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLT 529 (695) T ss_pred -CCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHh Confidence 23577888999888888888753 2 357899999999999877652 224555444443 Q ss_pred --eec-CCCC Q lcl|NC_019422. 378 --AVV-EGGE 384 (384) Q Consensus 378 --~~~-~~ge 384 (384) +++ +++| T Consensus 530 ~~~~~~~~~~ 539 (695) T protein:vir:78 530 YVQRLAEGGD 539 (695) T ss_pred hhcCcccccc Confidence 122 1222 No 136 >protein:vir:106491 Length: 646 # NCBI annotation: Pas4 # Family: family:all:2798 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024790;genbank:gi:48697405;genbank:GeneID:2846148 Probab=99.29 E-value=2.9e-12 Score=83.83 Aligned_cols=377 Identities=11% Similarity=-0.007 Sum_probs=212.6 Q ss_pred CcchhhhcccCCCc------chhHH----Hhhccc-------cCcceech----hhhhhcHHHHHHHHHHHHhhccCceE Q lcl|NC_019422. 1 MNIFKSKKKNKEAP------GKVMM----ELISDS-------GNGFYSWH----GNLYKSDIVRSIIRPKAKAVGKMTAK 59 (384) Q Consensus 1 M~~f~~~~~~~~~~------~~~~~----~~~~~~-------~~~~~~~~----~~~~~~~~v~~~i~~ia~~ia~~~~~ 59 (384) |.++.-|.+..... ...+. .+.... .+...... ..+-..|.++..++.|++.++++.+. T Consensus 1 ~~~~rPk~~p~~p~~~~~arrr~LtaAsa~l~~~~~~~~kt~~~~~~~WQ~eAW~~~d~vpELry~vgW~~~a~SR~rL~ 80 (646) T protein:vir:10 1 MALLKPKSAPPEPFGAEVARRIALAGATAQVDLGASSSWKTWKFGNKDWQTEGWRLYDIIPEHHFLAGRIGDSVAQARLY 80 (646) T ss_pred CcccCCCCCCCCcccccccchhhhhhccccccCCCcceeecCCCcchhhhHHHHHHHhhhhhHhhHhhhhhhhhceeeee Confidence 88887664332111 00000 111110 00000000 12334588999999999999999998 Q ss_pred EEEecCCcce--eccchHHHHHHhhcccc-CCHHHHHHHHHHHHHHhCCeeEEEee---CCCCceeeEEEEcCceEEEEE Q lcl|NC_019422. 60 HIRSNETEFK--TNPEIYIKFLLENPNPF-MSGQILQEKMVTQLELNSNAFAVIIK---DDYNMPTQIYPLNALNVEAIY 133 (384) Q Consensus 60 ~~~~~~~~~~--~~~~~~~~~l~~~PN~~-~s~~~f~~~~~~~~l~~G~~~~~~~~---~~~g~~~~l~~l~~~~v~~~~ 133 (384) .-+.++.|.. ...++....+-..+-.. .-..++++.+..++-+-|++|++... ...+--..++++....|.. T Consensus 81 aseiddtG~~tg~v~~~~v~~iv~~~~Gg~~gQ~qlLkr~~~~ltV~GE~wiv~~~~~~~~~~~~~~W~vvt~~Ev~~-- 158 (646) T protein:vir:10 81 VTEVDDTGEETGEVQDERIKRLAAVPLGTGSQRDDNLRLAGLDLAVGGECWIVGEGAATSPEAAEGSWFVVTGSAISR-- 158 (646) T ss_pred eeeecCCCCCcCccchHHHHHHhhhhccchhhHHHHHHHHHhheecccceEEeeccccCCCCCCccceeeecHHHhcc-- Confidence 8777766643 23344455554555433 33458899999999999999997421 1112233477777777632 Q ss_pred cCCCEEEEEEEc----CceEEEEehhheEEEeccCC--CCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEe Q lcl|NC_019422. 134 ENEVLFLKFLLR----NGKIVSYPYSDIIHLRKDFN--ENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLK 207 (384) Q Consensus 134 ~~~~~~~~~~~~----~g~~~~~~~~evih~~~~~~--~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~ 207 (384) .+. .+..... +++.+..+..+++. |.++| .....--||+.+++..+.......+...+..+....-.|++- T Consensus 159 -tg~-~~~i~~p~~~~g~~~v~~~~~d~lv-RiW~P~Prr~~epDSpvra~l~~l~Ei~~lt~~I~aaakSRL~GnGvLf 235 (646) T protein:vir:10 159 -TGD-EIAVRRPQQRGGSKLVLVDGQDILI-RCWRPHPNDTDQADSFTRSAIVPLREIELLTKREFAELDSRLTGAGIMF 235 (646) T ss_pred -CCC-eeeeecCccCCCCCcceecCCceEE-EEecCCcccccCCcchhHHHHHHHHHHHHhhhHhHHHHHHHHhcCceee Confidence 122 2222222 33455566677744 55554 455678899999999999888888877777777666667887 Q ss_pred eCCCCC-------hHHHHHHHHHHHHHhccccccC-----CcceecCC-Cc------eeeecccchhHH--HHHHHHHHH Q lcl|NC_019422. 208 FKTALR-------PDDIKKEVKSFEKNYLQIDSEA-----GGAAATDS-KY------DAEQVKAESYVP--NAAQMDKAI 266 (384) Q Consensus 208 ~~~~~~-------~e~~~~~~~~~~~~~~~~~~~~-----~~~~v~~~-g~------~~~~l~~~~~~~--~~~~~~~~~ 266 (384) ++..++ +-....+...+.+...-...+. .-++++.. |. +++.+....... -.+.....+ T Consensus 236 vP~e~s~p~~~~~~a~~~~l~~~l~qaa~tAi~De~S~aA~vPiia~~P~E~i~~~~~ik~l~f~~eite~aiktR~daI 315 (646) T protein:vir:10 236 LPEGVDFPRGEEDPAGLAGFMAYLQRAAAASMADQSRASAMVPIMATIPNEMMEHLDKIKPLTFWSELSAEITPMKDKAI 315 (646) T ss_pred eccccccCCCCCCCcchhHHHHHHHHHHHhhhcCCCCccceeeeEEeeChHHHhhhhcceeeccCchhhHHHhhhHHHHH Confidence 765432 1223344444444322212121 12233322 22 233333332221 223334689 Q ss_pred HHHHHHhCCCHHHh-ccccHH-----HHHHHHHHHHHHHHHHHHHHHHhhcccCc----cc--ccCcceEEeechhhhcc Q lcl|NC_019422. 267 QRLYSFFNTNEKII-QSKYSE-----DEWNAYYESEIEPVGLQLSNQYTEKLFTR----KA--RSFGNEIVFEASNLQYA 334 (384) Q Consensus 267 ~~I~~~fgvp~~~l-~~~~~e-----~~~~~~~~~~i~P~~~~i~~~l~~~l~~~----~~--~~~~~~i~fd~~~~~~~ 334 (384) +++|....|||+.| |-.+++ +-...-++ -|.|.+..|+++|++++|.+ .+ ....+.+-||.+.|.. T Consensus 316 ~RlA~glDIppE~LLGlgd~NHWtAWqI~de~vr-HI~P~l~~ic~AlT~~~Lrp~Le~eGi~dp~kyvvW~DaS~Lt~- 393 (646) T protein:vir:10 316 ARLASSAEIPGEVLTGIGDANHWTAWLISDEGIR-WIRGYLGLIADALTRGFLRRALESMGVTNPERYAFAFDTSTLAS- 393 (646) T ss_pred HHHHhccCCchhheeeccccceeeeeeeccccch-hhhhHHHHHHHHHHhhHHHHHHHHcCCCChhHeEEeecCccccc- Confidence 99999999999875 433322 11222334 59999999999999998765 22 1234567788888753 Q ss_pred CHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCe--e-------------------ee----------cCceee--cC Q lcl|NC_019422. 335 SMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIENGDK--P-------------------VR----------RLDTAV--VE 381 (384) Q Consensus 335 d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~gd~--~-------------------~~----------~~n~~~--~~ 381 (384) +.....++.++...|.+|-...|+.+|+.-.++-+. . .+ ++.+.| ++ T Consensus 394 ~pd~~deA~qa~drGAIt~eAlrk~~Gf~~dd~pt~~E~~~~~~~~~v~~~P~Lil~P~~qa~~~~P~~~~~~lpp~~~~ 473 (646) T protein:vir:10 394 KPNRLDEAIQLHERNLIKDEEVVKAGAFSVDQMPTVQERAVQILLGLVKTQPDLILDPAIQAALGLPAVQSVGLPPTAAQ 473 (646) T ss_pred CCCCcHHHHHHHHcCCccHHHHHHHhcccccccCChHHHHHHHHHHHhcCCccccccchhhccccCCCcCccccCCcccc Confidence 233334566788999999999999999975432221 0 00 001111 11 Q ss_pred CCC Q lcl|NC_019422. 382 GGE 384 (384) Q Consensus 382 ~ge 384 (384) .++ T Consensus 474 ~~d 476 (646) T protein:vir:10 474 RTD 476 (646) T ss_pred ccc Confidence 111 No 137 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=99.29 E-value=5.6e-12 Score=82.28 Aligned_cols=367 Identities=14% Similarity=0.055 Sum_probs=191.1 Q ss_pred CcchhhhcccCCCcchhHHH---------------hhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEec- Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMME---------------LISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSN- 64 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~- 64 (384) --.|.-..+.....+..... +.+..+.||.. =..+-+.|.+++|+..||+.+.+- |.-...+ T Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~-la~laQ~~eyr~~~~~ia~e~~R~-w~~~~~~~ 146 (694) T protein:vir:10 69 ARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPT-LVLLAQLPEYRAMHEVLADECIRT-WGEAIGGT 146 (694) T ss_pred hhhccccccCCCccccchhhhhhccCcccccchhhhhccCcchHHH-HHHHhhccchhhHHHHHHHHhhcc-cceecccc Confidence 11111011111100100000 00001111110 012345788899999999888765 4221111 Q ss_pred C------------CcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEee-C----------------CC Q lcl|NC_019422. 65 E------------TEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIK-D----------------DY 115 (384) Q Consensus 65 ~------------~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~-~----------------~~ 115 (384) + ++.+..+.+....|..+-....-. +-+...+.+--++|-+.+++.- . .. T Consensus 147 ~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~-~~l~eaik~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~k 225 (694) T protein:vir:10 147 KEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIR-DAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPK 225 (694) T ss_pred chhhhhhcccccccccccccHHHHHHHHHHHHHHHHH-HHHHHHHHhhccccceEEEEEeecCccccccccccccccccC Confidence 0 111111113333343333222222 3334444555566766555432 1 13 Q ss_pred CceeeEEEEcCceEEEEEcCC--------CEEEEEEEcCceEEEEehhheEEEeccC------CCCCccCccHHHHHHHH Q lcl|NC_019422. 116 NMPTQIYPLNALNVEAIYENE--------VLFLKFLLRNGKIVSYPYSDIIHLRKDF------NENDLFGTSPAKVLEPI 181 (384) Q Consensus 116 g~~~~l~~l~~~~v~~~~~~~--------~~~~~~~~~~g~~~~~~~~evih~~~~~------~~~~~~G~s~~~~~~~~ 181 (384) |....|.+++|.++++....- +..-+|.. .|+.+ =.+-++.+.... +.....|+|..+.+... T Consensus 226 GslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V-~G~~I--H~SRL~~f~g~plPd~LKp~y~~~G~Sv~q~~~e~ 302 (694) T protein:vir:10 226 GSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWM-IGTEV--HATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPY 302 (694) T ss_pred cceeeeEeecccccccchhhhccchhhccCCCceEEE-eceEE--eeeeEEEecCCCchhhhhcccccCcccHHHHHHHH Confidence 567779999999998854321 11112222 23322 222233232111 11235799999999999 Q ss_pred HHHHHHHHHHHHHHHHccCCcceEEeeC--CCCChHHHHHHH--HHHHHHhccccccCCcceecC-CCceeeecccchhH Q lcl|NC_019422. 182 MEVVNTTDQGVVKAIKNSNTIKWLLKFK--TALRPDDIKKEV--KSFEKNYLQIDSEAGGAAATD-SKYDAEQVKAESYV 256 (384) Q Consensus 182 i~~~~~~~~~~~~~~~ng~~p~~il~~~--~~~~~e~~~~~~--~~~~~~~~~~~~~~~~~~v~~-~g~~~~~l~~~~~~ 256 (384) +................-.. .++ +++ ..+.+.....+. -++.++++ +..++++++ +..+|.+.+.+... T Consensus 303 V~~~~rT~~~v~~Li~~~~v-~~l-k~dla~~L~~g~~~~l~~R~eli~~~R----sn~G~~llDk~~Eefeq~stslSG 376 (694) T protein:vir:10 303 IDNWLRTRQSVSDIVKQFSV-SGI-LMDLAQALMPGANVDLSMRAELINRYR----DNRNILFLDKATEEFFQFNTPLSG 376 (694) T ss_pred HHHHHHHHhHHHHHHHhhhh-HHH-HHHHHHhhcChhHHHHHHHHHHHHHhc----CccceEEEecCCcceEEEecccCC Confidence 99988877777777655332 222 122 112222222222 23333342 334577888 57899988866655 Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHhcc-------ccHHHHHHH-------HHHHHHHHHHHHHHHHHhhcccCcccccCcc Q lcl|NC_019422. 257 PNAAQMDKAIQRLYSFFNTNEKIIQS-------KYSEDEWNA-------YYESEIEPVGLQLSNQYTEKLFTRKARSFGN 322 (384) Q Consensus 257 ~~~~~~~~~~~~I~~~fgvp~~~l~~-------~~~e~~~~~-------~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~ 322 (384) ++-.. ......||.+-+||...|-| ++.|....+ .....+.|.++.+-+.|-+..|... .. T Consensus 377 LddVi-~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~i----dp 451 (694) T protein:vir:10 377 LDALQ-AQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAV----DP 451 (694) T ss_pred HHHHH-HHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC----CC Confidence 54321 34567899999999876532 233433333 4456789999998888887777653 23 Q ss_pred eEEeechhhhccCHHHHHHHH-------H-HHhCCCCCHHHHHHHhCCCCC-------CCCCeeeecCce---------e Q lcl|NC_019422. 323 EIVFEASNLQYASMSTKLNLV-------Q-MVDRGSLTPNEWRKIMNLSPI-------ENGDKPVRRLDT---------A 378 (384) Q Consensus 323 ~i~fd~~~~~~~d~~~~~~~~-------~-~~~~g~~t~NE~R~~lG~~p~-------~~gd~~~~~~n~---------~ 378 (384) .|.|.+..|.+.+.++++++. + .+..|+++++|+|..|.-+|- +-.|++-+|... + T Consensus 452 ~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~ 531 (694) T protein:vir:10 452 SIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQ 531 (694) T ss_pred cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhc Confidence 577888999888888888753 2 357899999999999877652 224555444443 1 Q ss_pred ec-CCCC Q lcl|NC_019422. 379 VV-EGGE 384 (384) Q Consensus 379 ~~-~~ge 384 (384) ++ +++| T Consensus 532 ~~~~~~~ 538 (694) T protein:vir:10 532 RLAEGGD 538 (694) T ss_pred Ccccccc Confidence 22 1222 No 138 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=99.28 E-value=6e-12 Score=82.12 Aligned_cols=367 Identities=14% Similarity=0.067 Sum_probs=190.8 Q ss_pred Ccc---hhhhcccCCCcchhH----HH-----------hhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_019422. 1 MNI---FKSKKKNKEAPGKVM----ME-----------LISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIR 62 (384) Q Consensus 1 M~~---f~~~~~~~~~~~~~~----~~-----------~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~ 62 (384) .++ |.-.-+.....+... .+ +.+..+.||.. =..+-+.|.+++|+..||+.+.+- |.-.. T Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~-la~laQ~~eyr~~~~~ia~e~~R~-w~~~~ 144 (695) T protein:vir:36 67 LRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPT-LVLLAQLPEYRAMHEVLADECIRT-WGEAI 144 (695) T ss_pred cccceeceecccccCccccchhhhhhcccccccccchhhhccCcchHHH-HHHHhhccchhhHHHHHHHHhhcc-cceec Confidence 111 110001111101000 00 00001111110 012345788899999999888765 42211 Q ss_pred ec-------------CCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeC---------------- Q lcl|NC_019422. 63 SN-------------ETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKD---------------- 113 (384) Q Consensus 63 ~~-------------~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~---------------- 113 (384) .+ .++.+..+.+....|..+-.. ..-.+-+...+.+--++|-+.+++.-. T Consensus 145 ~~~~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~er-L~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~ 223 (695) T protein:vir:36 145 GGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIER-LRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYT 223 (695) T ss_pred ccchhhhhhccccccccccccCchHHHHHHHHHHHH-HHHHHHHHHHHHhhccccceEEEEEeccCcccccccccccccc Confidence 11 011111111233333333222 222233444455555677776554321 Q ss_pred -CCCceeeEEEEcCceEEEEEcCC--------CEEEEEEEcCceEEEEehhheEEEeccC------CCCCccCccHHHHH Q lcl|NC_019422. 114 -DYNMPTQIYPLNALNVEAIYENE--------VLFLKFLLRNGKIVSYPYSDIIHLRKDF------NENDLFGTSPAKVL 178 (384) Q Consensus 114 -~~g~~~~l~~l~~~~v~~~~~~~--------~~~~~~~~~~g~~~~~~~~evih~~~~~------~~~~~~G~s~~~~~ 178 (384) ..|....|.+++|.++++....- +..-+|.. .|+.+ =.+-++.+.... +.....|+|..+.+ T Consensus 224 I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V-~G~kI--H~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~ 300 (695) T protein:vir:36 224 VPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWM-IGTEV--HATRLHTIVSRPVGDMLKPTYSFAGISMTQLA 300 (695) T ss_pred ccCcceeeeEeecccccccchhhhccchhhccCCCceEEE-eceEE--eeeeEEEecCCCchhhhhcccccCcccHHHHH Confidence 13567779999999998854321 11112222 23322 222233232111 11235799999999 Q ss_pred HHHHHHHHHHHHHHHHHHHccCCcceEEeeC--CCCChHHHHHH--HHHHHHHhccccccCCcceecC-CCceeeecccc Q lcl|NC_019422. 179 EPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK--TALRPDDIKKE--VKSFEKNYLQIDSEAGGAAATD-SKYDAEQVKAE 253 (384) Q Consensus 179 ~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~--~~~~~e~~~~~--~~~~~~~~~~~~~~~~~~~v~~-~g~~~~~l~~~ 253 (384) ...+................-.. .++ +.+ ..+.+.....+ +-++.++++ +..++++++ +..+|.+.+.+ T Consensus 301 ~e~V~~~~rT~~~v~~Li~~~~v-~~l-k~dla~aL~~g~~~~l~~R~eli~~~R----sn~G~~llDk~~Eefeq~sts 374 (695) T protein:vir:36 301 MPYIDNWLRTRQSVSDIVKQFSV-SGI-LMDLAQALMPGANVDLSMRAELINRYR----DNRNILFLDKATEEFFQFNTP 374 (695) T ss_pred HHHHHHHHHHHhHHHHHHHhhhH-HHH-HHHHHHhhcChhHHHHHHHHHHHHHhc----CccceEEEecCCcceEEEecc Confidence 99999988877777777655322 221 122 11222221222 223333343 234577888 57899988866 Q ss_pred hhHHHHHHHHHHHHHHHHHhCCCHHHhcc-------ccHHHHHHH-------HHHHHHHHHHHHHHHHHhhcccCccccc Q lcl|NC_019422. 254 SYVPNAAQMDKAIQRLYSFFNTNEKIIQS-------KYSEDEWNA-------YYESEIEPVGLQLSNQYTEKLFTRKARS 319 (384) Q Consensus 254 ~~~~~~~~~~~~~~~I~~~fgvp~~~l~~-------~~~e~~~~~-------~~~~~i~P~~~~i~~~l~~~l~~~~~~~ 319 (384) ...++-.. ......||.+-+||...|-| ++.|....+ .....+.|.++.+-+.|-+..|... T Consensus 375 lSGLddVi-~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~i--- 450 (695) T protein:vir:36 375 LSGLDALQ-AQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAV--- 450 (695) T ss_pred cCCHHHHH-HHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--- Confidence 65554321 34567899999999876532 233433333 4456789999998888887777653 Q ss_pred CcceEEeechhhhccCHHHHHHHH-------H-HHhCCCCCHHHHHHHhCCCCC-------CCCCeeeecCce------- Q lcl|NC_019422. 320 FGNEIVFEASNLQYASMSTKLNLV-------Q-MVDRGSLTPNEWRKIMNLSPI-------ENGDKPVRRLDT------- 377 (384) Q Consensus 320 ~~~~i~fd~~~~~~~d~~~~~~~~-------~-~~~~g~~t~NE~R~~lG~~p~-------~~gd~~~~~~n~------- 377 (384) ...|.|.+..|.+.+.++++++. + .+..|+++++|+|..|.-+|- +-.|++-+|... T Consensus 451 -dpdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~ 529 (695) T protein:vir:36 451 -DPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLT 529 (695) T ss_pred -CCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHh Confidence 23577888999888888888753 2 357899999999999877652 224555444443 Q ss_pred --eec-CCCC Q lcl|NC_019422. 378 --AVV-EGGE 384 (384) Q Consensus 378 --~~~-~~ge 384 (384) +++ +++| T Consensus 530 ~~~~~~~~~~ 539 (695) T protein:vir:36 530 YVQRLAEGGD 539 (695) T ss_pred hhcCcccccc Confidence 122 1222 No 139 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=99.26 E-value=1.5e-11 Score=79.91 Aligned_cols=275 Identities=7% Similarity=0.003 Sum_probs=159.7 Q ss_pred eEEEeeCCC-C--ceeeEEEEcCceEE-EEEcCCCEE--EEEEE-cCceEEEEehhheEEEeccCCCCCccCccHHHHHH Q lcl|NC_019422. 107 FAVIIKDDY-N--MPTQIYPLNALNVE-AIYENEVLF--LKFLL-RNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLE 179 (384) Q Consensus 107 ~~~~~~~~~-g--~~~~l~~l~~~~v~-~~~~~~~~~--~~~~~-~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~ 179 (384) +.++++... | .+..|.+.++.++. ...+.++.. ..... .++..+.+++...|+.++.......+|.+.+..|. T Consensus 1 v~Eivw~~~~g~~~~~~l~~r~~~~~~~f~~~~~~~l~~~~~~~~~g~~~~~lp~~kfi~~~~~~~~g~p~G~gLlr~~~ 80 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLAWRPPRTISRFDVAPDGGLVAIEQWGVFGKATVRIPVDRLVVFVNEREGANWLGQSLLRQAY 80 (355) T ss_pred CeEEEEEeeCCeEEEeeeeecCccceeeeeeccCCceeEEEecCCCCCCcceeccCCEEEEEeCCCCCCccchhhHHHHH Confidence 777777533 3 46778888887554 233333332 22222 23345677888888777776677789999999999 Q ss_pred HHHHHHHHHHHHHHHHHHccCCcceEEeeC--CCCChHH-------HHHHHHHHHHHhccccccCCcceecCCCceeeec Q lcl|NC_019422. 180 PIMEVVNTTDQGVVKAIKNSNTIKWLLKFK--TALRPDD-------IKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQV 250 (384) Q Consensus 180 ~~i~~~~~~~~~~~~~~~ng~~p~~il~~~--~~~~~e~-------~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l 250 (384) -....-....++-..|.+.-+.|--+.+.+ .....++ .+..++................++++.|++++-+ T Consensus 81 w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~~~~~i~~g~~a~~iip~g~~ie~~ 160 (355) T protein:vir:78 81 KNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGLQLAKEFRAGEAAGGYIPHGANFTLT 160 (355) T ss_pred HHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHHHHHHHhhCCcceeEeecCCceEEEe Confidence 999999999999999999874443344443 2222111 1222333333332222222346788889888877 Q ss_pred ccchhHHHHHHH-HHHHHHHHHHhCCCHHHhc-----cccH-HHHHHHHHHHHHHHHHHHHHHHHhhcccCcc------c Q lcl|NC_019422. 251 KAESYVPNAAQM-DKAIQRLYSFFNTNEKIIQ-----SKYS-EDEWNAYYESEIEPVGLQLSNQYTEKLFTRK------A 317 (384) Q Consensus 251 ~~~~~~~~~~~~-~~~~~~I~~~fgvp~~~l~-----~~~~-e~~~~~~~~~~i~P~~~~i~~~l~~~l~~~~------~ 317 (384) +.+....++.++ ++.-++|+.+..-.---.+ |+.+ .+.......+.+...+..+++.||++|+... . T Consensus 161 ea~g~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~~gGS~Alg~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~ 240 (355) T protein:vir:78 161 GVQGKLPEMDGPIRYHDEQIARAVLAHFLTLGGDKSTGSYALGDTFASFFTGSLNAVMKHIADVTQQHVVEDLVDQNWGP 240 (355) T ss_pred ecCCCcccHHHHHHHHHHHHHHHHhhhhhccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 765544455443 5666677776643321111 2222 2334456667777788888888888776642 1 Q ss_pred ccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHH-----HHHHHhCCCCCCCCCeeeecCce-e------ecCCC- Q lcl|NC_019422. 318 RSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPN-----EWRKIMNLSPIENGDKPVRRLDT-A------VVEGG- 383 (384) Q Consensus 318 ~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~N-----E~R~~lG~~p~~~gd~~~~~~n~-~------~~~~g- 383 (384) ....++++|+. .. .+.++.++..+ ++..|+..++ .+|+.+|+|+.+.+++...+..- . +...| T Consensus 241 ~~~~P~~~~~~--~~-~~~~~~a~~~~~l~~~G~~~~~~~~~~~~~e~~gip~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (355) T protein:vir:78 241 EEPAPRLVPAQ--LG-KEQPVTAEAIRALVECGAFTADPELEKDLRARYGLPAPAERDDGADAAAAKAAGRRRAKRLPGQ 317 (355) T ss_pred CCCCCEEEecC--cC-hhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCCcccCCccccccccccccccCCc Confidence 12234555543 33 34445666654 6788877654 57999999876555544433221 0 11111 Q ss_pred ----C Q lcl|NC_019422. 384 ----E 384 (384) Q Consensus 384 ----e 384 (384) + T Consensus 318 ~~~~~ 322 (355) T protein:vir:78 318 RQGAA 322 (355) T ss_pred ccccc Confidence 1 No 140 >protein:vir:102426 Length: 631 # NCBI annotation: gp11 # Family: family:all:2798 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655288;genbank:gi:109521851;genbank:GeneID:4157741 Probab=99.21 E-value=9.8e-12 Score=80.93 Aligned_cols=380 Identities=12% Similarity=0.086 Sum_probs=211.4 Q ss_pred Ccchh------hhcccCCCcchhHHH----------hhcc--ccCcceech----hhhhhcHHHHHHHHHHHHhhccCce Q lcl|NC_019422. 1 MNIFK------SKKKNKEAPGKVMME----------LISD--SGNGFYSWH----GNLYKSDIVRSIIRPKAKAVGKMTA 58 (384) Q Consensus 1 M~~f~------~~~~~~~~~~~~~~~----------~~~~--~~~~~~~~~----~~~~~~~~v~~~i~~ia~~ia~~~~ 58 (384) |.-=. +-|.........+.. ..+. +.+....+. ..+-..+.++..++.|++.++++.+ T Consensus 1 ~~a~~~lr~~rrpkg~~~a~~r~L~aAs~~~~dpg~~~~~~~g~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~sr~rL 80 (631) T protein:vir:10 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (631) T ss_pred CCcccceeeeecCCCCCccchhhhhhhhccccchhhhhhhhcCCcccchhhHHHHHHHHhhhhHHHHhhhhhhhhceeee Confidence 33222 112111111111000 0000 000010100 1233458899999999999999999 Q ss_pred EEEEecCCc-----ceeccc---hHHHH-HHhhccccCCHHHHHHHHHHHHHHhCCeeEEEee-CCCC---------c-e Q lcl|NC_019422. 59 KHIRSNETE-----FKTNPE---IYIKF-LLENPNPFMSGQILQEKMVTQLELNSNAFAVIIK-DDYN---------M-P 118 (384) Q Consensus 59 ~~~~~~~~~-----~~~~~~---~~~~~-l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~-~~~g---------~-~ 118 (384) ..-+.+.+. ..+.++ ..... ...-+...+...++++.+..++-+-|++|+.+.. .++| . . T Consensus 81 ~as~idpDtg~ptg~iee~~~~~~~v~~~~~~i~gG~lgQ~~llkrl~~~ltV~GE~wiv~l~~p~~~~~~~pd~~~r~~ 160 (631) T protein:vir:10 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (631) T ss_pred EeeeeccCCCCCccccccCCchhHHHHHHHHhcCCCcchHHHHHHHHHhheecccceEEEEEeccCcCCCCCcccccccc Confidence 887766442 222211 22222 2224667788899999999999999999999753 2221 1 3 Q ss_pred eeEEEEcCceEEEEEcCCCEEEEEEEcCceEE-EEehhheEEEecc--CCCCCccCccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 119 TQIYPLNALNVEAIYENEVLFLKFLLRNGKIV-SYPYSDIIHLRKD--FNENDLFGTSPAKVLEPIMEVVNTTDQGVVKA 195 (384) Q Consensus 119 ~~l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~-~~~~~evih~~~~--~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~ 195 (384) .+++++....|+..-..++..+... .|..- .....|+ .||.. .|.....--||+.+++..+.......+...+. T Consensus 161 ~~W~~vt~~ei~~~~~g~g~~v~lp--~g~~h~~~~~~D~-l~RiW~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i~aa 237 (631) T protein:vir:10 161 QEWYAVSKEEIKKSNKGSGTNIVLP--TGEEHEFVKGTDI-IFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANA 237 (631) T ss_pred cceeeccHHHHhcccCcccceeecC--CCCccceecCCce-EEEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHH Confidence 3677777777665444444444332 33322 2233343 33433 34555678899999999999888888777777 Q ss_pred HHccCCcceEEeeCCCCChH---------------------HHHHHHHHHHHHhccccccC-----CcceecC-C---Cc Q lcl|NC_019422. 196 IKNSNTIKWLLKFKTALRPD---------------------DIKKEVKSFEKNYLQIDSEA-----GGAAATD-S---KY 245 (384) Q Consensus 196 ~~ng~~p~~il~~~~~~~~e---------------------~~~~~~~~~~~~~~~~~~~~-----~~~~v~~-~---g~ 245 (384) .+....-.|++-++..++-. ..+.+.+.+.+.-.....+. .-++++. . .. T Consensus 238 akSRl~gnGvlflP~els~P~~~~~~~~~~g~~v~~~~g~pa~~~l~~~l~q~a~tai~De~S~aA~vPii~~~p~E~i~ 317 (631) T protein:vir:10 238 SKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIK 317 (631) T ss_pred HHHHHhhCceeEeccccccCCCCCCCCCcCCccCCccccchhHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechHHhc Confidence 77666666777776554321 33333333332221111111 1122222 1 12 Q ss_pred eeeecccc--hhHHHHHHHHHHHHHHHHHhCCCHHHhccc--cHH-----HHHHHHHHHHHHHHHHHHHHHHhhcccCc- Q lcl|NC_019422. 246 DAEQVKAE--SYVPNAAQMDKAIQRLYSFFNTNEKIIQSK--YSE-----DEWNAYYESEIEPVGLQLSNQYTEKLFTR- 315 (384) Q Consensus 246 ~~~~l~~~--~~~~~~~~~~~~~~~I~~~fgvp~~~l~~~--~~e-----~~~~~~~~~~i~P~~~~i~~~l~~~l~~~- 315 (384) +++-+... ....-.+.....++++|..+.|||+.|-|- +++ +-...-++--|.|.++.|+++|++++|.+ T Consensus 318 ~i~hlkf~~ei~e~aiktR~daI~RlA~glDi~pE~LLGlGsd~NHWsAWqI~dedVrlHI~P~l~lic~AlT~q~Lrp~ 397 (631) T protein:vir:10 318 DVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVT 397 (631) T ss_pred CeeEEeecCchhHHHHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchHHHHHHHHHHhhHHHHH Confidence 33333332 222233334568999999999999876432 332 22334566779999999999999997764 Q ss_pred ---cccc-CcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC------------------eeee Q lcl|NC_019422. 316 ---KARS-FGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIENGD------------------KPVR 373 (384) Q Consensus 316 ---~~~~-~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~gd------------------~~~~ 373 (384) .+.+ ..+.+-||.+.|.. ++....++.++...|.+|-...|+.+|+...++-| .-++ T Consensus 398 Le~eGvDp~kYvvW~DaS~Lt~-dPdr~deA~qa~drGAIt~eAlrk~lGf~eDd~yd~~t~e~~~~~a~~av~~dpaLi 476 (631) T protein:vir:10 398 LAREGIDPSKYVVWYDPSQLTI-DPDKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLI 476 (631) T ss_pred HHHhCCCHHHhEeeecCccccc-CCCCcHHHHHHHHcCCcCHHHHHHHhcCchhcccCcCchHHHHHHHHHHhhcccCcc Confidence 2222 33456788888743 33333456678899999999999999998754422 1111 Q ss_pred c----------------CceeecCCCC Q lcl|NC_019422. 374 R----------------LDTAVVEGGE 384 (384) Q Consensus 374 ~----------------~n~~~~~~ge 384 (384) | ....+.-+|| T Consensus 477 p~lApl~~~~~~~v~~P~~~a~~~~g~ 503 (631) T protein:vir:10 477 PMLAPLIAGVLKQIEFPQQQAIDSGGN 503 (631) T ss_pred hhhHHHHHHHhhhccCCCCCCCCCCCC Confidence 1 1111122222 No 141 >protein:vir:97376 Length: 320 # NCBI annotation: putative portal protein # Family: family:all:11744 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762589;genbank:gi:115304290;genbank:GeneID:5130579 Probab=99.19 E-value=2.2e-13 Score=90.02 Aligned_cols=311 Identities=14% Similarity=0.144 Sum_probs=173.6 Q ss_pred CcchhhhcccCCCcchhHHHhhccc-cCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDS-GNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFL 79 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l 79 (384) ||+|+.|++..-+|+..-. +|+.. ....+. +.-..-|.+.+.+|+.||.. +-.|+.. .+.+.+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~-~~~~~~~--------~~~~~~~ 65 (320) T protein:vir:97 1 MGIFNFKKRETLTPELKES-IIRQVTIEDESP-----FTGTTDFNVRNEVAESIATY-LGAYKTS--------AKRLSLL 65 (320) T ss_pred CCccccccccccChhHHhh-hhheeeeccCCC-----cccccccchhhHHHHHHHHH-hhhhccc--------cceeeee Confidence 9999999888777654321 12111 111111 11112344556666666643 2223222 2223445 Q ss_pred HhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEEEEEcCceEEEEehhheEE Q lcl|NC_019422. 80 LENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLKFLLRNGKIVSYPYSDIIH 159 (384) Q Consensus 80 ~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~~~evih 159 (384) ..+| .|++.++.+.+..-..|+++... .| +..-++.++.-.+..-.+..--.++.-...+.|..|+= T Consensus 66 ~~~~-------~~~~~~~~~~~~~~~~~~~~~~~-~~----~~~~~~~~~~~~~~~~~~~~~D~FN~~V~mtvpfyD~~- 132 (320) T protein:vir:97 66 TNNP-------SFLRRLVKHALHNKTTYVYKSPT-YG----WLITDSMTIEGLRARLTFTLPDPFNSAVTMTVPFYDVG- 132 (320) T ss_pred eCCH-------HHHHHHHHHhhcccceEEeeCCc-cc----eeeecceeeeeeeeeEEEecCcccceeEEEEeeeechh- Confidence 5555 59999999999999999998752 23 33334444332222111111101111112233322221 Q ss_pred EeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChH-HHHHHHHHHHHHhccccccCCcc Q lcl|NC_019422. 160 LRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPD-DIKKEVKSFEKNYLQIDSEAGGA 238 (384) Q Consensus 160 ~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e-~~~~~~~~~~~~~~~~~~~~~~~ 238 (384) --.++++|..+-+.- .-. ..+.....+.+.|.+.....++.+-+..-+ -.++.++.+++.. ...+.-.++ T Consensus 133 ----ILdnpl~gv~tqe~g-kM~---g~a~~~v~kkL~~~~~IKafi~Tdid~GLee~kD~~~~kIk~mq-~~A~~~nG~ 203 (320) T protein:vir:97 133 ----IIDSPLVEVDTEEAN-KML---EAAYSAVMKKLHNTGAIKAFISSDIDVGLEKMKEESDSKIKAML-ATAELLSGY 203 (320) T ss_pred ----hhhhhhcccChHHhh-HHH---HHHhhhhhhhccccceeEEEEecccchhHHHHHHHHHHHHHHHH-HHHHHhcCc Confidence 124567777776332 222 334456677788888888888887665533 3455555554433 333333568 Q ss_pred eecCCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhccccHHHHHHHHHHHHHHHHHHHH---HHHHhhcccCc Q lcl|NC_019422. 239 AATDSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQSKYSEDEWNAYYESEIEPVGLQL---SNQYTEKLFTR 315 (384) Q Consensus 239 ~v~~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~~~~e~~~~~~~~~~i~P~~~~i---~~~l~~~l~~~ 315 (384) -+++.|-+++++....+.....+....+...+..|+||..+|.|+.++.+..+|+...+.|+++++ +..|..++-.+ T Consensus 204 T~i~~~dDI~Qi~pDYS~sn~~D~~l~~t~alS~y~m~~~IL~GsAte~~~Iaf~~~~V~PLL~Q~~~~Ek~Lvy~m~~E 283 (320) T protein:vir:97 204 TYIQRGDDVTQMMPDYTTSNVTDFAAMRTFAASQLSVSDKILDGSATDGEKVAVMFRFVEPILEQFREYEPSLIYAMRDE 283 (320) T ss_pred ccccCCcceeeecccccccchhHHHHHHHHHHhhcCCchhhccccCCcceeeehhhHhHHHHHHHhhhcCcceeeeeccc Confidence 889999999999987766666666667788899999999999999999999999999999999997 44555444222 Q ss_pred ccccCcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC----CCCeeee Q lcl|NC_019422. 316 KARSFGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIE----NGDKPVR 373 (384) Q Consensus 316 ~~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~----~gd~~~~ 373 (384) .++-|-.. +|-+.-|.+- --|.+..| |||+--+ T Consensus 284 ------~FVs~mtT------------------GG~l~S~~~~-~~~~~~~~~~~~~~~~~~~ 320 (320) T protein:vir:97 284 ------FFVSFMTT------------------GGMLNSNRVD-GWGKEKAPNESKGGDVGDV 320 (320) T ss_pred ------eeeeeeec------------------Cceeeccccc-ccccccCCccccCCcccCC Confidence 12222111 1222222111 11333222 3433222 No 142 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=99.18 E-value=3.7e-11 Score=77.81 Aligned_cols=369 Identities=14% Similarity=0.078 Sum_probs=189.4 Q ss_pred Ccc---hhhhcccCCCcchhHH----H-----------hhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_019422. 1 MNI---FKSKKKNKEAPGKVMM----E-----------LISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIR 62 (384) Q Consensus 1 M~~---f~~~~~~~~~~~~~~~----~-----------~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~ 62 (384) .++ |.-.-+.....+.... + +.+..+.||.. =..+-+.|.+++|+..||+.+.+- |.-.. T Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~-la~laQ~~eyr~~~~~ia~e~~R~-w~~~~ 144 (698) T protein:vir:10 67 LRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPT-LVLLAQLPEYRAMHEVLADECIRT-WGEAI 144 (698) T ss_pred ccccccceeccccCCccccchhhhhhcccccccccchhhhccCcchHHH-HHHHhhccchhhHHHHHHHHhhcc-cceec Confidence 111 1111111111111000 0 00001111100 012345788899999999888765 42211 Q ss_pred ec-C------------CcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEee-C--------------- Q lcl|NC_019422. 63 SN-E------------TEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIK-D--------------- 113 (384) Q Consensus 63 ~~-~------------~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~-~--------------- 113 (384) .+ + ++.+..+.+....|..+-....-...+.+. +.+--++|-+.+++.- . T Consensus 145 ~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~ea-i~~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~ 223 (698) T protein:vir:10 145 GGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTT-VIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYT 223 (698) T ss_pred cccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHH-HHhcccccceEEEEEeecCcccccccccccccc Confidence 11 0 111111113333343333222222334444 4445566666544422 1 Q ss_pred -CCCceeeEEEEcCceEEEEEcCC--------CEEEEEEEcCceEEEEehhheEEEeccC------CCCCccCccHHHHH Q lcl|NC_019422. 114 -DYNMPTQIYPLNALNVEAIYENE--------VLFLKFLLRNGKIVSYPYSDIIHLRKDF------NENDLFGTSPAKVL 178 (384) Q Consensus 114 -~~g~~~~l~~l~~~~v~~~~~~~--------~~~~~~~~~~g~~~~~~~~evih~~~~~------~~~~~~G~s~~~~~ 178 (384) ..|....|.+++|.++++..... +..-+|... |+. +-.+-++.+.... +.....|.|..+.+ T Consensus 224 I~kGslKGL~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~-G~~--IH~SRL~~~vg~pvpd~LKp~y~f~G~Sv~q~~ 300 (698) T protein:vir:10 224 VPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMI-GSE--VHATRLHTIVSRPVGDMLKPTYSFAGISMTQLA 300 (698) T ss_pred ccCccceeeeeecccccccchhhhccchhhccCCCceEEEe-cce--ecceeEEEecCCCchhhhcchhccCCccHHHHH Confidence 13557779999999998754321 111122222 332 2222333232111 12234699999999 Q ss_pred HHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHH--HHHHHhccccccCCcceecC-CCceeeecccchh Q lcl|NC_019422. 179 EPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVK--SFEKNYLQIDSEAGGAAATD-SKYDAEQVKAESY 255 (384) Q Consensus 179 ~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~--~~~~~~~~~~~~~~~~~v~~-~g~~~~~l~~~~~ 255 (384) .+.+................-.........-..++++....+.. ++.++++ +..+.++++ ++.+|++.+.+.. T Consensus 301 ~e~V~~~~rT~~~v~~Li~~~~~~~l~~dla~aL~~g~~~~l~~R~eli~~~R----sn~G~~llDk~~Eefeq~st~lS 376 (698) T protein:vir:10 301 MPYIDNWLRTRQSVSDIVKQFSVSGILMDLAQALTPGANVDLSMRAELINRYR----DNRNILFLDKATEEFFQFNTPLS 376 (698) T ss_pred HHHHHHHHHHhhhHHHHHHHhhHHHHHHHHHHhcCChhhHHHHHHHHHHHHhc----CccceEEEecCCcceEEEecCcC Confidence 99999988877777777655322211111111122222222222 3333343 234577788 5789998886665 Q ss_pred HHHHHHHHHHHHHHHHHhCCCHHHhcc-------ccHHHHHHH-------HHHHHHHHHHHHHHHHHhhcccCcccccCc Q lcl|NC_019422. 256 VPNAAQMDKAIQRLYSFFNTNEKIIQS-------KYSEDEWNA-------YYESEIEPVGLQLSNQYTEKLFTRKARSFG 321 (384) Q Consensus 256 ~~~~~~~~~~~~~I~~~fgvp~~~l~~-------~~~e~~~~~-------~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~ 321 (384) .++-.. ......||.+-+||...|-| ++.|....+ .....++|.++.+-+.|-+..|... . T Consensus 377 GLddVi-~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~i----d 451 (698) T protein:vir:10 377 GLDALQ-AQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAV----D 451 (698) T ss_pred CHHHHH-HHHHHHHHhhhcCchhhhhccCCcccCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC----C Confidence 554322 34567899999999876532 233433333 4456789999999888887776653 2 Q ss_pred ceEEeechhhhccCHHHHHHHH-------H-HHhCCCCCHHHHHHHhCCCCC-------CCCCeeeecCc-e-------- Q lcl|NC_019422. 322 NEIVFEASNLQYASMSTKLNLV-------Q-MVDRGSLTPNEWRKIMNLSPI-------ENGDKPVRRLD-T-------- 377 (384) Q Consensus 322 ~~i~fd~~~~~~~d~~~~~~~~-------~-~~~~g~~t~NE~R~~lG~~p~-------~~gd~~~~~~n-~-------- 377 (384) ..|.|.+..|.+.+.++++++. + .+..|+++++|+|+.|--+|- +--|++..|.. . T Consensus 452 p~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~d~~d~p~~~~~~~~~~~~~~~ 531 (698) T protein:vir:10 452 PSIKWQWNALRELDDLEVAEARYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGAPADDDIDGVLTYV 531 (698) T ss_pred CcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhccCCCccccccCCcccCCCCCCCcchHHHhhh Confidence 3578888999888888888763 2 357899999999998865542 11234333322 1 Q ss_pred -eecCCCC Q lcl|NC_019422. 378 -AVVEGGE 384 (384) Q Consensus 378 -~~~~~ge 384 (384) .+.++|+ T Consensus 532 ~~~~~~~~ 539 (698) T protein:vir:10 532 QRMAEGGD 539 (698) T ss_pred cCCcCCCC Confidence 1123333 No 143 >protein:vir:8654 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817773;genbank:gi:29566205;genbank:GeneID:1259465 Probab=99.12 E-value=2.5e-11 Score=78.71 Aligned_cols=379 Identities=12% Similarity=0.067 Sum_probs=208.5 Q ss_pred CcchhhhcccCCCcchh------H---HHhhccccCcc------ee---ch----hhhhhcHHHHHHHHHHHHhhccCce Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKV------M---MELISDSGNGF------YS---WH----GNLYKSDIVRSIIRPKAKAVGKMTA 58 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~------~---~~~~~~~~~~~------~~---~~----~~~~~~~~v~~~i~~ia~~ia~~~~ 58 (384) |---..+-..+.+..+. . ...+......+ +. .. ..+-..+.++..++.|++.++++.+ T Consensus 1 ma~~~lr~~rrpk~~p~~~r~~al~aas~~i~~p~~~~~ks~~~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~Sr~rL 80 (629) T protein:vir:86 1 MAPTSLRIVRRPKSEPVSTRQRALVAASQPVENPGKAFRKAMGSSTRTDWQEDAWKAYDAVGELRYYVGWRSSSASRVRL 80 (629) T ss_pred CCccceeeeecCCCCChhhhhhhhhhhhhccccccchhhhhcCCCchhhhhHHHHHHHHhhhhHHHHhhhhhhhhceeee Confidence 54433221111111111 0 11111110000 00 00 1123378899999999999999999 Q ss_pred EEEEecCCcce----ec-cch---HHHHHHhhcc-ccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCc------e-eeEE Q lcl|NC_019422. 59 KHIRSNETEFK----TN-PEI---YIKFLLENPN-PFMSGQILQEKMVTQLELNSNAFAVIIKDDYNM------P-TQIY 122 (384) Q Consensus 59 ~~~~~~~~~~~----~~-~~~---~~~~l~~~PN-~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~------~-~~l~ 122 (384) ..-+.+.++.. +. ++. ....+-.++- ..+-..++++.+..++-+-|++|+.+.....|. + .+++ T Consensus 81 ~as~idpDtg~ptg~i~e~~~~~~~v~~~v~~i~gG~lgqa~lLkr~~~~ltV~GE~wiv~~~~~~~~~d~~~~~~~eW~ 160 (629) T protein:vir:86 81 IASAIDPDTGLPTGSIDEDDRVGARVQQIVNQIAGGALGQAQLIKRVVEQLTVAGETWVAILFTDKSRLDSNGNPVPEWL 160 (629) T ss_pred EeeeecCCCCCCccccCCCchhHHHHHHHHHhhcCChhhHHHHHHHHHhheecccceEEEEeecCCCccCCCCcchhhhe Confidence 88776644421 11 111 2333333443 446677999999999999999999998544332 2 3566 Q ss_pred EEcCceEEEEEcCCCEEEEEEEcCceEEEEehhheEEEeccCC--CCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccC Q lcl|NC_019422. 123 PLNALNVEAIYENEVLFLKFLLRNGKIVSYPYSDIIHLRKDFN--ENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSN 200 (384) Q Consensus 123 ~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~~~evih~~~~~~--~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~ 200 (384) .+-+..|+- ..++ .......+++.+..+..+++ +|..+| .....--||+.+++..+.......+...+..+... T Consensus 161 ~vt~~ei~~--~~~~-~~i~lP~g~~~e~~~~~d~l-~RiW~P~Prr~~e~DSpvra~l~~l~Ei~~lt~~i~aaakSRL 236 (629) T protein:vir:86 161 ALTPEEVRA--SEKK-TIIELPTGDKHEFRDGLDGM-FRVWNPRARRAREPDSPVRANLDSLKEIVRTTKTIANASKSRL 236 (629) T ss_pred eechHHhhh--ccCc-eeeEcCCCCcceeeCCCceE-EEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHHHH Confidence 677766552 1222 22222223344444555655 555544 45567889999999999888777777766666655 Q ss_pred CcceEEeeCCCCCh----------------------HHHHHHHHHHHHHhccccccC-----CcceecC-C---Cceeee Q lcl|NC_019422. 201 TIKWLLKFKTALRP----------------------DDIKKEVKSFEKNYLQIDSEA-----GGAAATD-S---KYDAEQ 249 (384) Q Consensus 201 ~p~~il~~~~~~~~----------------------e~~~~~~~~~~~~~~~~~~~~-----~~~~v~~-~---g~~~~~ 249 (384) .-.|++-++..++- ...+.+.+.+.+.-.-...+. .-++++. . ..+++- T Consensus 237 ~gnGvlflP~e~slP~~~~p~~~n~pg~~~p~~~~~pa~~~l~~~l~q~a~tAi~De~S~aA~vPiia~~P~E~i~~i~h 316 (629) T protein:vir:86 237 IGNGVVFVPHEMSLPSMNAPVASNKPGAPAPPILGTPAVQQLQELLFQVAQTAYDDEDSMAALIPMFAAAPGELIKNVTH 316 (629) T ss_pred hhCceeeeccCcccCccCCCCCCCCCCcccccccccchHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechHHhcCeeE Confidence 55566555432211 133444444443222111111 1122222 1 123333 Q ss_pred cccc--hhHHHHHHHHHHHHHHHHHhCCCHHHh-cc-ccHH-----HHHHHHHHHHHHHHHHHHHHHHhhcccCc----c Q lcl|NC_019422. 250 VKAE--SYVPNAAQMDKAIQRLYSFFNTNEKII-QS-KYSE-----DEWNAYYESEIEPVGLQLSNQYTEKLFTR----K 316 (384) Q Consensus 250 l~~~--~~~~~~~~~~~~~~~I~~~fgvp~~~l-~~-~~~e-----~~~~~~~~~~i~P~~~~i~~~l~~~l~~~----~ 316 (384) +... ....-.+.....++++|..+.|||+.| |- ++++ +-...-++--|.|.++.|+++|++++|.+ . T Consensus 317 lkf~~ei~e~aiktR~daI~RlA~glDippE~LLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~AlT~~~Lrp~Le~e 396 (629) T protein:vir:86 317 LKFDNQVTEVAIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVRLHILPPVEMLCEAITNQVLRTVLMRE 396 (629) T ss_pred EeecCchhHHHHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchHHHHHHHHHHhhHHHHHHHHh Confidence 3332 222233334568999999999999876 43 2332 22334566779999999999999997764 2 Q ss_pred ccc-CcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC-------------eeeecCcee---- Q lcl|NC_019422. 317 ARS-FGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIENGD-------------KPVRRLDTA---- 378 (384) Q Consensus 317 ~~~-~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~gd-------------~~~~~~n~~---- 378 (384) +.+ ..+.+-||.+.|.. +.....++.++...|.+|-...|+.+|+...++-| ..-..-++. T Consensus 397 GiDp~kYvvW~DaS~Lt~-dPd~~deA~~a~drGAIt~eAlrk~lGf~eD~~yd~tt~E~~~~~a~d~V~~~P~Li~~~a 475 (629) T protein:vir:86 397 GIDPNAYVVWHDASQLTV-DPDKTDEARDAFDRGAITAEAMVKMLGLADDTVYDFTTPEGWAQWARDRVGQDPNLLPTLA 475 (629) T ss_pred CCCHHHhEeeecCccccc-CCCCcHHHHHHHHcCCcCHHHHHHHhcCccccccCCCchHHHHHHHHHhhhhCcchhhhhh Confidence 222 33456788888743 33333456678899999999999999998754433 111111111 Q ss_pred ec-----CCCC Q lcl|NC_019422. 379 VV-----EGGE 384 (384) Q Consensus 379 ~~-----~~ge 384 (384) ++ +.+- T Consensus 476 ~l~~~~a~~~~ 486 (629) T protein:vir:86 476 VLIPELADVEF 486 (629) T ss_pred hhhhhhccccc Confidence 11 1110 No 144 >protein:vir:99088 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655692;genbank:gi:109521770;genbank:GeneID:4157810 Probab=99.11 E-value=3e-11 Score=78.31 Aligned_cols=379 Identities=12% Similarity=0.067 Sum_probs=208.6 Q ss_pred CcchhhhcccCCCcchh------H---HHhhccccCcc------ee---ch----hhhhhcHHHHHHHHHHHHhhccCce Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKV------M---MELISDSGNGF------YS---WH----GNLYKSDIVRSIIRPKAKAVGKMTA 58 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~------~---~~~~~~~~~~~------~~---~~----~~~~~~~~v~~~i~~ia~~ia~~~~ 58 (384) |---..+-..+.+..+. . ...+......+ +. .. ..+-..+.++..++.|++.++++.+ T Consensus 1 ma~~~lr~~rrpk~~p~~~r~~al~aas~~i~~p~~~~~ks~~~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~Sr~rL 80 (629) T protein:vir:99 1 MAPTSLRIVRRPKSEPVSTRQRALVAASQPVENPGKAFRKAMGSSTRTDWQDDAWKAYDAVGELRYYVGWRSSSASRVRL 80 (629) T ss_pred CCccceeeeecCCCCChhhhhhhhhhhhhcccccchhhhhhcCCCchhhhhHHHHHHHHhhhhHHHHhhhhhhhhceeee Confidence 54433221111111111 0 01111100000 00 00 1123378899999999999999999 Q ss_pred EEEEecCCcce----ec-cch---HHHHHHhhcc-ccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCc------e-eeEE Q lcl|NC_019422. 59 KHIRSNETEFK----TN-PEI---YIKFLLENPN-PFMSGQILQEKMVTQLELNSNAFAVIIKDDYNM------P-TQIY 122 (384) Q Consensus 59 ~~~~~~~~~~~----~~-~~~---~~~~l~~~PN-~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~------~-~~l~ 122 (384) ..-+.+.++.. +. ++. ....+-.++- ..+-..++++.+..++-+-|++|+.+.....|. + .+++ T Consensus 81 ~as~idpDtg~ptg~i~e~~~~~~~v~~~v~~i~gG~lgqa~lLkr~~~~ltV~GE~wiv~~~~~~~~~d~~~~~~~eW~ 160 (629) T protein:vir:99 81 IASAIDPDTGLPTGSIDEDDRVGARVQQIVNQIAGGALGQAQLIKRVVEQLTVAGETWVAILFTDKSRLDSNGNPVPEWL 160 (629) T ss_pred EeeeecCCCCCCccccCCCchhHHHHHHHHHhhcCChhhHHHHHHHHHhheecccceEEEEeecCCCccCCCCcchhhhe Confidence 88776644421 11 111 2333333443 446677999999999999999999998544332 2 3566 Q ss_pred EEcCceEEEEEcCCCEEEEEEEcCceEEEEehhheEEEeccCC--CCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccC Q lcl|NC_019422. 123 PLNALNVEAIYENEVLFLKFLLRNGKIVSYPYSDIIHLRKDFN--ENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSN 200 (384) Q Consensus 123 ~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~~~evih~~~~~~--~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~ 200 (384) .+-+..|+- ..++ .......++..+..+..|++ +|..+| .....--||+.+++..+.......+...+..+... T Consensus 161 ~vt~~ei~~--~~~~-~~i~lP~g~~~e~~~~~d~l-~RiW~P~Prr~~e~DSpvra~l~~l~Ei~~lt~~i~aaakSRL 236 (629) T protein:vir:99 161 ALTPEEVRA--SEKK-TIIELPTGDKHEFRDGLDGM-FRVWNPRARRAREPDSPVRANLDSLKEIVRTTKTIANASKSRL 236 (629) T ss_pred eechHHhhh--ccCc-eeEEcCCCCccceeCCCceE-EEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHHHH Confidence 677766552 1222 22222223344444555655 555544 45567889999999999888777777766666655 Q ss_pred CcceEEeeCCCCCh----------------------HHHHHHHHHHHHHhccccccC-----CcceecC-C---Cceeee Q lcl|NC_019422. 201 TIKWLLKFKTALRP----------------------DDIKKEVKSFEKNYLQIDSEA-----GGAAATD-S---KYDAEQ 249 (384) Q Consensus 201 ~p~~il~~~~~~~~----------------------e~~~~~~~~~~~~~~~~~~~~-----~~~~v~~-~---g~~~~~ 249 (384) .-.|++-++..++- ...+.+.+.+.+.-.-...+. .-++++. . ..+++- T Consensus 237 ~gnGvlflP~e~slP~~~~p~~~n~pg~~~p~~~~~pa~~~l~~~l~q~a~tAi~De~S~aA~vPiia~~P~E~i~~i~h 316 (629) T protein:vir:99 237 IGNGVVFVPHEMSLPSMNAPVASNKPGAPAPPILGTPAVQQLQELLFQVAQTAYDDEDSMAALIPMFAAAPGELIKNVTH 316 (629) T ss_pred hhCceeEeccCcccCccCCCCCCCCCCcccccccccchHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechHHhcCeeE Confidence 55566555432211 133444444443222111111 1122222 1 123333 Q ss_pred cccc--hhHHHHHHHHHHHHHHHHHhCCCHHHh-cc-ccHH-----HHHHHHHHHHHHHHHHHHHHHHhhcccCc----c Q lcl|NC_019422. 250 VKAE--SYVPNAAQMDKAIQRLYSFFNTNEKII-QS-KYSE-----DEWNAYYESEIEPVGLQLSNQYTEKLFTR----K 316 (384) Q Consensus 250 l~~~--~~~~~~~~~~~~~~~I~~~fgvp~~~l-~~-~~~e-----~~~~~~~~~~i~P~~~~i~~~l~~~l~~~----~ 316 (384) +... ....-.+.....++++|..+.|||+.| |- ++++ +-...-++--|.|.++.|+++|++++|.+ . T Consensus 317 lkf~~ei~e~aiktR~daI~RlA~glDippE~LLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~AlT~~~Lrp~Le~e 396 (629) T protein:vir:99 317 LKFDNQVTEVAIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVRLHILPPVEMLCEAITNQVLRTVLMRE 396 (629) T ss_pred EeecCchhHHHHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchhHHHHHHHHHhhHHHHHHHHh Confidence 3332 222233334568999999999999876 43 2332 22334566779999999999999997764 2 Q ss_pred ccc-CcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC-------------eeeecCcee---- Q lcl|NC_019422. 317 ARS-FGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIENGD-------------KPVRRLDTA---- 378 (384) Q Consensus 317 ~~~-~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~gd-------------~~~~~~n~~---- 378 (384) +.+ ..+.+-||.+.|.. +.....++.++...|.+|-...|+.+|+...++-| .+-..-++. T Consensus 397 GiDp~kYvvW~DaS~Lt~-dPd~~deA~~a~drGAIt~eAlrk~lGf~eD~~yd~tt~E~~~~~a~d~V~~~P~Li~~~a 475 (629) T protein:vir:99 397 GIDPNAYVVWHDASQLTV-DPDKTDEARDAFDRGAITAEAMVKMLGLADDTVYDFTTPEGWAQWARDRVGQDPNLLPTLA 475 (629) T ss_pred CCCHHHhEeeecCccccc-CCCCcHHHHHHHHcCCccHHHHHHHhcCccccccCCCchHHHHHHHHHhhhhCcchhhhhh Confidence 222 33456788888743 33333456678899999999999999998754333 111111111 Q ss_pred ec-----CCCC Q lcl|NC_019422. 379 VV-----EGGE 384 (384) Q Consensus 379 ~~-----~~ge 384 (384) ++ +.+- T Consensus 476 ~l~~~~a~~~~ 486 (629) T protein:vir:99 476 VLIPELADVEF 486 (629) T ss_pred hhhhhhccccc Confidence 11 0000 No 145 >protein:vir:106027 Length: 629 # NCBI annotation: gp9 # Family: family:all:2798 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654906;genbank:gi:109392362;genbank:GeneID:4157055 Probab=99.02 E-value=3.6e-10 Score=72.38 Aligned_cols=376 Identities=11% Similarity=0.042 Sum_probs=206.9 Q ss_pred CcchhhhcccCCCcch----------------hHHHh------hccccCcceechhhhhhcHHHHHHHHHHHHhhccCce Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGK----------------VMMEL------ISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTA 58 (384) Q Consensus 1 M~~f~~~~~~~~~~~~----------------~~~~~------~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~ 58 (384) |---..+-..+.+.++ ..... .++|++... ..+-..+.++..++.+++.++++.+ T Consensus 1 ma~~~lrv~rrpk~~p~~r~l~aasqp~~P~~~~~~~~~g~~~~~~WQ~eAW---~~~d~VgElryyvgW~~ss~Sr~rL 77 (629) T protein:vir:10 1 MAASTLRVSRRPKGSPARRSLTAASQPMEPGRTPSRQVAGTVVRTSWQNEAW---ECMDLVGELRYYVGWRASSCSRVEL 77 (629) T ss_pred CCccceeEEecCCCccceeeeccccCCCCcchhhchhhhhhhhhhhhhHHHH---HHHHhhhhHHHHhhhhhhhheeeeE Confidence 4432211111111000 00000 111111110 1223457889999999999999999 Q ss_pred EEEEecCCcce----e-ccchHHHH----HHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCC----cee-eEEEE Q lcl|NC_019422. 59 KHIRSNETEFK----T-NPEIYIKF----LLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYN----MPT-QIYPL 124 (384) Q Consensus 59 ~~~~~~~~~~~----~-~~~~~~~~----l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g----~~~-~l~~l 124 (384) ..-..+.+... + .+++-... ...-...-+...++++.+..++-+-|+.|+++.....+ .+. .+|.+ T Consensus 78 ~as~idpDtg~ptg~i~ed~p~~~~v~~~v~~iagG~lGqaqLlkr~~~~ltV~GE~~i~il~~~~~~pd~~~r~~W~vV 157 (629) T protein:vir:10 78 IASELDPDTGKPTGGIRDDDPDGLRFLEIVKTMAGGPLGQAQLQKRAAECLTVPGEHRICLLDQGDKNPDGSVRHNWYVV 157 (629) T ss_pred EEeeecCCCCCCccccccCchhHHHHHHHHHHhcCccchHHHHHHHHHhheeccCceEEEEeecCCCCCCcccccceeee Confidence 88666644321 1 22222222 22223455777899999999999999999998754333 233 45566 Q ss_pred cCceEEEEEcCCCEEEEEEEcCceEEEEeh-hheEEEeccC--CCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCC Q lcl|NC_019422. 125 NALNVEAIYENEVLFLKFLLRNGKIVSYPY-SDIIHLRKDF--NENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNT 201 (384) Q Consensus 125 ~~~~v~~~~~~~~~~~~~~~~~g~~~~~~~-~evih~~~~~--~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~ 201 (384) ....|. .++...-.....+|....|.- .|++ +|-++ |.....--||+.+++..+.......+...+..+.... T Consensus 158 t~~Ei~---~kg~g~~~i~lpdg~~he~~~~~D~l-~RvW~P~Prr~~e~DSpvra~l~~lrEi~r~tk~i~~aakSRL~ 233 (629) T protein:vir:10 158 TNDEVK---NKGAGKTDIELPDGTIHEYSKGRDVM-FRVWNPRPRRAKEPDSPVRACLDSLREIIRTTKKIRNASKSRLI 233 (629) T ss_pred cHHHhc---cccCceeEEEcCCCceeeeeCCCeeE-EEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHhHHHHHhHHh Confidence 666544 333222245556666544433 3433 34343 4455678899999999998887777776666665555 Q ss_pred cceEEeeCCCCCh----------------------HHHHHHHHHHHHHhccccccCC-----cceecC----CCceeeec Q lcl|NC_019422. 202 IKWLLKFKTALRP----------------------DDIKKEVKSFEKNYLQIDSEAG-----GAAATD----SKYDAEQV 250 (384) Q Consensus 202 p~~il~~~~~~~~----------------------e~~~~~~~~~~~~~~~~~~~~~-----~~~v~~----~g~~~~~l 250 (384) -.|++-++..++- ...+.+...+.+.-.....+.+ -++++. ..-+++.| T Consensus 234 gnGvlflP~e~slp~~~ap~~~~~Pg~~~p~~~g~aa~d~l~~~l~q~a~aAi~De~S~aA~vPiia~vP~E~l~~ikhL 313 (629) T protein:vir:10 234 GNGVVFLPQELSLPRATAPVADNQPGAPVPIVDGVAAADELSNLLFQTAAAAVDDEDSQAALIPLLATVPGEHLQKIFHL 313 (629) T ss_pred hCceeEeccCcccccccCCCCCCCCcccccccCCCcchHHHHHHHHHHHHhhhcCCCCccceeeeEEeechHHhcCeeee Confidence 5566555432211 1334444444443322222211 122221 12344555 Q ss_pred ccchh--HHHHHHHHHHHHHHHHHhCCCHHHhccc--cHH-----HHHHHHHHHHHHHHHHHHHHHHhhcccCc----cc Q lcl|NC_019422. 251 KAESY--VPNAAQMDKAIQRLYSFFNTNEKIIQSK--YSE-----DEWNAYYESEIEPVGLQLSNQYTEKLFTR----KA 317 (384) Q Consensus 251 ~~~~~--~~~~~~~~~~~~~I~~~fgvp~~~l~~~--~~e-----~~~~~~~~~~i~P~~~~i~~~l~~~l~~~----~~ 317 (384) ..... ..-.+.....++++|..+.|||+.|-|- +++ +-...-++--|.|.++.++++|++.+|.+ .+ T Consensus 314 kf~~eite~~iktR~daI~RlAmglDispErLLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~Ait~~~Lrp~L~~eG 393 (629) T protein:vir:10 314 KIGNEITEVEIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVQLHIKPVMEVLCAAIYREVLVATLRAEG 393 (629) T ss_pred eecCchhHHHHhhHHHHHHHHHhccCCChhheeeccCCccceeeEEecccceeeecchHHHHHHHHHHhHHHHHHHHHhC Confidence 44332 2233344568999999999999876432 332 22233456679999999999999987764 22 Q ss_pred cc-CcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC--e-----------eeecCcee----- Q lcl|NC_019422. 318 RS-FGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIENGD--K-----------PVRRLDTA----- 378 (384) Q Consensus 318 ~~-~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~gd--~-----------~~~~~n~~----- 378 (384) .+ ..+.+-||.+.|.. |+....++.++...|.+|-...|+.+|+...++-| + ...+-+++ T Consensus 394 iDp~~Yvvw~DaS~Lt~-dPd~~deA~~a~drGaIt~eAlRr~lG~~~dd~y~~~t~~~~q~~A~~~v~~~P~Li~~~ap 472 (629) T protein:vir:10 394 IDPDRYVLWYDASGLTV-DPDKTDEATAAKEQGAITHEAYRRYLGLADEDGYDLETLEGAQAWARDAIVADPSLIKVLAP 472 (629) T ss_pred CCHHHhEeeecCccccc-CCCCcHHHHHHHHcCCccHHHHHHHhccccccCCCcCCcHHHHHHHHHHhcCCCchhhhhhh Confidence 22 33456778888643 34334456677899999999999999997654311 1 11111110 Q ss_pred ----------------ecCCCC Q lcl|NC_019422. 379 ----------------VVEGGE 384 (384) Q Consensus 379 ----------------~~~~ge 384 (384) .+.+|| T Consensus 473 ll~~~l~~i~~P~p~~a~~~~~ 494 (629) T protein:vir:10 473 LLTDELAEIDWPEPPAALPPGE 494 (629) T ss_pred hcCCccccccccCCCCcCCCCC Confidence 011111 No 146 >protein:vir:107517 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943786;genbank:gi:38638411;genbank:GeneID:2657197 Probab=98.96 E-value=2e-10 Score=73.71 Aligned_cols=380 Identities=11% Similarity=0.081 Sum_probs=207.7 Q ss_pred CcchhhhcccCCCcch------hHH---HhhccccCcc---------eec----hhhhhhcHHHHHHHHHHHHhhccCce Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGK------VMM---ELISDSGNGF---------YSW----HGNLYKSDIVRSIIRPKAKAVGKMTA 58 (384) Q Consensus 1 M~~f~~~~~~~~~~~~------~~~---~~~~~~~~~~---------~~~----~~~~~~~~~v~~~i~~ia~~ia~~~~ 58 (384) |---..+-..+.+..+ .+. ..+.+-+..+ ..+ -+.+-..+.++..++.|++.++++.+ T Consensus 1 ma~~~lr~~rrpk~~p~~~rr~~ltaAsq~~~~p~~~~kt~~~~~ar~~WQ~eAW~~~d~v~Elry~vgW~~~s~sr~rL 80 (639) T protein:vir:10 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYVSWRANSCSRTTL 80 (639) T ss_pred CCccceeeeecCCCCCcchhhHHHhhhhhccCCcccchhhhccccchhhhhhhhhhhhhhhhhHHHHhhhhhhhhceeee Confidence 5443321111111111 111 0110000000 000 01233458899999999999999999 Q ss_pred EEEEecCCcc-e-----eccchHHHHH----HhhccccCCHHHHHHHHHHHHHHhCCeeEEEe-eCCCCc-------eee Q lcl|NC_019422. 59 KHIRSNETEF-K-----TNPEIYIKFL----LENPNPFMSGQILQEKMVTQLELNSNAFAVII-KDDYNM-------PTQ 120 (384) Q Consensus 59 ~~~~~~~~~~-~-----~~~~~~~~~l----~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~-~~~~g~-------~~~ 120 (384) ..-+.+.+.. . +++++-...+ ..-...-+...++++.+..++-+-|++|+.+. +.+++- -.+ T Consensus 81 ~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~iagG~lGqa~llkr~~~~ltV~GE~wi~~l~r~~k~~~~~~~~~~~~ 160 (639) T protein:vir:10 81 IPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (639) T ss_pred EeeeeccccCCCCCccccccccCcchHHHHHHhhcCccchHHHHHHHHHhheecccceEEEEEEecCccccCcccccccc Confidence 8866663322 1 1222211112 12234457778999999999999999998865 333331 234 Q ss_pred EEEEcCceEEEEEcCCCEEEEEEEcCceEEEEehhheEEEeccC--CCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_019422. 121 IYPLNALNVEAIYENEVLFLKFLLRNGKIVSYPYSDIIHLRKDF--NENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKN 198 (384) Q Consensus 121 l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~~~evih~~~~~--~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n 198 (384) ++.+....|. .+++........+|....|.-+.=+.+|.++ |.....--||+.+++..+.......+...+..+. T Consensus 161 W~vvs~~Ei~---~~~~~~~~i~lPdG~~he~~~~~d~l~RvW~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i~aaakS 237 (639) T protein:vir:10 161 WYAVTREEIK---SKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKS 237 (639) T ss_pred eeeeeHHHhc---ccCCCeeEeecCCCCCccccCCCceEEEEeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHH Confidence 5556655554 2222233344445544333322212244343 4555678899999999998888777777776666 Q ss_pred cCCcceEEeeCCCCChH-------------------------HHHHHHHHHHHHhccccccCC-----cceecC----CC Q lcl|NC_019422. 199 SNTIKWLLKFKTALRPD-------------------------DIKKEVKSFEKNYLQIDSEAG-----GAAATD----SK 244 (384) Q Consensus 199 g~~p~~il~~~~~~~~e-------------------------~~~~~~~~~~~~~~~~~~~~~-----~~~v~~----~g 244 (384) ...-.|++-++..++.. ..+.+...+.+...-...+.+ -++++. .. T Consensus 238 Rl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA~vPiia~~p~E~l 317 (639) T protein:vir:10 238 RVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHL 317 (639) T ss_pred HHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhhcCCCCccceeeeeEeechHHh Confidence 55556676665433211 133344444443322222221 123322 22 Q ss_pred ceeeecccchhH--HHHHHHHHHHHHHHHHhCCCHHHh-ccccHH-----HHHHHHHHHHHHHHHHHHHHHHhhcccCc- Q lcl|NC_019422. 245 YDAEQVKAESYV--PNAAQMDKAIQRLYSFFNTNEKII-QSKYSE-----DEWNAYYESEIEPVGLQLSNQYTEKLFTR- 315 (384) Q Consensus 245 ~~~~~l~~~~~~--~~~~~~~~~~~~I~~~fgvp~~~l-~~~~~e-----~~~~~~~~~~i~P~~~~i~~~l~~~l~~~- 315 (384) -+++.|...... .-.+.....++++|..+.|||+.| |-++++ +-...-++--|.|.+..|+++|++++|.+ T Consensus 318 ~~ikhl~f~~ei~e~aiktR~daI~RlA~glDi~pE~LLGl~d~NHWsAWqI~dedvrlHI~P~l~~icdAlT~~~Lrp~ 397 (639) T protein:vir:10 318 EKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPL 397 (639) T ss_pred cCeeeeeecCchhHHHHhhHHHHHHHHHhccCCchhheeecccccceEEEEecccceeeecchhHHHHHHHHHhhHHHHH Confidence 345555544332 223334568999999999999865 443332 22234566679999999999999997764 Q ss_pred ---cccc-CcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCee-------------eecCc-- Q lcl|NC_019422. 316 ---KARS-FGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIENGDKP-------------VRRLD-- 376 (384) Q Consensus 316 ---~~~~-~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~gd~~-------------~~~~n-- 376 (384) .+.+ ..+.+-||.+.|.. +.....++.++...|.+|-.-.|+.+|+...++=|+. -.+-+ T Consensus 398 Le~eGvDp~kYvvW~DaS~Lt~-dPd~~deA~qa~drGAIt~eAlR~~lG~~edd~yd~~t~e~~~~~A~~~V~~~P~li 476 (639) T protein:vir:10 398 LAREGIDPTKYILWYDASGLTS-DPDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELI 476 (639) T ss_pred HHHhCCCHHHhEeeecCccccc-CCCCcHHHHHHHHcCCccHHHHHHHhccccccCCCCCCcHHHHHHHHHHhcCCcchh Confidence 2222 33456788888753 3333345667889999999999999999765432311 11111 Q ss_pred --eeecCCCC Q lcl|NC_019422. 377 --TAVVEGGE 384 (384) Q Consensus 377 --~~~~~~ge 384 (384) +.|+-.++ T Consensus 477 ~~~apl~~P~ 486 (639) T protein:vir:10 477 AMYAPLLSSQ 486 (639) T ss_pred hhhhhccCcc Confidence 11222222 No 147 >protein:vir:97900 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655104;genbank:gi:109391854;genbank:GeneID:4157263 Probab=98.96 E-value=2e-10 Score=73.71 Aligned_cols=380 Identities=11% Similarity=0.081 Sum_probs=207.7 Q ss_pred CcchhhhcccCCCcch------hHH---HhhccccCcc---------eec----hhhhhhcHHHHHHHHHHHHhhccCce Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGK------VMM---ELISDSGNGF---------YSW----HGNLYKSDIVRSIIRPKAKAVGKMTA 58 (384) Q Consensus 1 M~~f~~~~~~~~~~~~------~~~---~~~~~~~~~~---------~~~----~~~~~~~~~v~~~i~~ia~~ia~~~~ 58 (384) |---..+-..+.+..+ .+. ..+.+-+..+ ..+ -+.+-..+.++..++.|++.++++.+ T Consensus 1 ma~~~lr~~rrpk~~p~~~rr~~ltaAsq~~~~p~~~~kt~~~~~ar~~WQ~eAW~~~d~v~Elry~vgW~~~s~sr~rL 80 (639) T protein:vir:97 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYVSWRANSCSRTTL 80 (639) T ss_pred CCccceeeeecCCCCCcchhhHHHhhhhhccCCcccchhhhccccchhhhhhhhhhhhhhhhhHHHHhhhhhhhhceeee Confidence 5443321111111111 111 0110000000 000 01233458899999999999999999 Q ss_pred EEEEecCCcc-e-----eccchHHHHH----HhhccccCCHHHHHHHHHHHHHHhCCeeEEEe-eCCCCc-------eee Q lcl|NC_019422. 59 KHIRSNETEF-K-----TNPEIYIKFL----LENPNPFMSGQILQEKMVTQLELNSNAFAVII-KDDYNM-------PTQ 120 (384) Q Consensus 59 ~~~~~~~~~~-~-----~~~~~~~~~l----~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~-~~~~g~-------~~~ 120 (384) ..-+.+.+.. . +++++-...+ ..-...-+...++++.+..++-+-|++|+.+. +.+++- -.+ T Consensus 81 ~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~iagG~lGqa~llkr~~~~ltV~GE~wi~~l~r~~k~~~~~~~~~~~~ 160 (639) T protein:vir:97 81 IPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (639) T ss_pred EeeeeccccCCCCCccccccccCcchHHHHHHhhcCccchHHHHHHHHHhheecccceEEEEEEecCccccCcccccccc Confidence 8866663322 1 1222211112 12234457778999999999999999998865 333331 234 Q ss_pred EEEEcCceEEEEEcCCCEEEEEEEcCceEEEEehhheEEEeccC--CCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_019422. 121 IYPLNALNVEAIYENEVLFLKFLLRNGKIVSYPYSDIIHLRKDF--NENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKN 198 (384) Q Consensus 121 l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~~~evih~~~~~--~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n 198 (384) ++.+....|. .+++........+|....|.-+.=+.+|.++ |.....--||+.+++..+.......+...+..+. T Consensus 161 W~vvs~~Ei~---~~~~~~~~i~lPdG~~he~~~~~d~l~RvW~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i~aaakS 237 (639) T protein:vir:97 161 WYAVTREEIK---SKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKS 237 (639) T ss_pred eeeeeHHHhc---ccCCCeeEeecCCCCCccccCCCceEEEEeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHH Confidence 5556655554 2222233344445544333322212244343 4555678899999999998888777777776666 Q ss_pred cCCcceEEeeCCCCChH-------------------------HHHHHHHHHHHHhccccccCC-----cceecC----CC Q lcl|NC_019422. 199 SNTIKWLLKFKTALRPD-------------------------DIKKEVKSFEKNYLQIDSEAG-----GAAATD----SK 244 (384) Q Consensus 199 g~~p~~il~~~~~~~~e-------------------------~~~~~~~~~~~~~~~~~~~~~-----~~~v~~----~g 244 (384) ...-.|++-++..++.. ..+.+...+.+...-...+.+ -++++. .. T Consensus 238 Rl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA~vPiia~~p~E~l 317 (639) T protein:vir:97 238 RVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHL 317 (639) T ss_pred HHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhhcCCCCccceeeeeEeechHHh Confidence 55556676665433211 133344444443322222221 123322 22 Q ss_pred ceeeecccchhH--HHHHHHHHHHHHHHHHhCCCHHHh-ccccHH-----HHHHHHHHHHHHHHHHHHHHHHhhcccCc- Q lcl|NC_019422. 245 YDAEQVKAESYV--PNAAQMDKAIQRLYSFFNTNEKII-QSKYSE-----DEWNAYYESEIEPVGLQLSNQYTEKLFTR- 315 (384) Q Consensus 245 ~~~~~l~~~~~~--~~~~~~~~~~~~I~~~fgvp~~~l-~~~~~e-----~~~~~~~~~~i~P~~~~i~~~l~~~l~~~- 315 (384) -+++.|...... .-.+.....++++|..+.|||+.| |-++++ +-...-++--|.|.+..|+++|++++|.+ T Consensus 318 ~~ikhl~f~~ei~e~aiktR~daI~RlA~glDi~pE~LLGl~d~NHWsAWqI~dedvrlHI~P~l~~icdAlT~~~Lrp~ 397 (639) T protein:vir:97 318 EKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPL 397 (639) T ss_pred cCeeeeeecCchhHHHHhhHHHHHHHHHhccCCchhheeecccccceEEEEecccceeeecchhHHHHHHHHHhhHHHHH Confidence 345555544332 223334568999999999999865 443332 22234566679999999999999997764 Q ss_pred ---cccc-CcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCee-------------eecCc-- Q lcl|NC_019422. 316 ---KARS-FGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIENGDKP-------------VRRLD-- 376 (384) Q Consensus 316 ---~~~~-~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~gd~~-------------~~~~n-- 376 (384) .+.+ ..+.+-||.+.|.. +.....++.++...|.+|-.-.|+.+|+...++=|+. -.+-+ T Consensus 398 Le~eGvDp~kYvvW~DaS~Lt~-dPd~~deA~qa~drGAIt~eAlR~~lG~~edd~yd~~t~e~~~~~A~~~V~~~P~li 476 (639) T protein:vir:97 398 LAREGIDPTKYILWYDASGLTS-DPDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELI 476 (639) T ss_pred HHHhCCCHHHhEeeecCccccc-CCCCcHHHHHHHHcCCccHHHHHHHhccccccCCCCCCcHHHHHHHHHHhcCCcchh Confidence 2222 33456788888753 3333345667889999999999999999765432311 11111 Q ss_pred --eeecCCCC Q lcl|NC_019422. 377 --TAVVEGGE 384 (384) Q Consensus 377 --~~~~~~ge 384 (384) +.|+-.++ T Consensus 477 ~~~apl~~P~ 486 (639) T protein:vir:97 477 AMYAPLLSSQ 486 (639) T ss_pred hhhhhccCcc Confidence 11222222 No 148 >protein:vir:4073 Length: 279 # NCBI annotation: minor structural protein # Family: family:all:11744 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043552;genbank:gi:9628686;genbank:GeneID:1261159 Probab=98.76 E-value=2e-10 Score=73.78 Aligned_cols=272 Identities=14% Similarity=0.112 Sum_probs=141.8 Q ss_pred HHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceee Q lcl|NC_019422. 41 IVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQ 120 (384) Q Consensus 41 ~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~ 120 (384) --....++.|++++-..|.+..-+ ..-.-.+++-++----|...+-..-++.++.+. +.|.-...+...+.-.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 76 (279) T protein:vir:40 1 MSLFNLSRRAEDVSFSTFTVQDPT---TDLLLGKLLGLVSYFDNVDYSEASKLEDLFYWA-LQGKEVYRVWYGGFKYYAQ 76 (279) T ss_pred CcccccchhhcccceeeeeecCcc---hhHHHHHHHHHHHHhhcccchhhhhhhhhhhhh-hccceeehhhhhhHHHHHh Confidence 000112334444543344331000 000001111112223455555555556554433 3343322222221111111 Q ss_pred EEEEcCceEEEEEcCCCEEEEEEEcCceEEEEehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccC Q lcl|NC_019422. 121 IYPLNALNVEAIYENEVLFLKFLLRNGKIVSYPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSN 200 (384) Q Consensus 121 l~~l~~~~v~~~~~~~~~~~~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~ 200 (384) -...||..+-+ ..++-.-+++|-.|+..+- ++.+|.-+- ....-++.. .+...+-+.+.+ T Consensus 77 ~~~~d~fn~~v-----------r~~~~~~vtVP~~Dv~Iie-----NPlv~v~~e-e~~kM~~la---~nai~~KLD~~~ 136 (279) T protein:vir:40 77 RVNADQFNIVV-----------REPNRREVTIRTNDYEMLL-----NPFYGANPQ-RFGVMFGMA---SNGIGRRLDSQA 136 (279) T ss_pred hcCcchhhhhe-----------ecCCcceeEeecchhhhhh-----cchheeccc-hhhHHHHHH---Hhhhhhhhcccc Confidence 11112222111 1112233455655665553 223443333 222222222 444555557788 Q ss_pred CcceEEeeCCCCC-hHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHH Q lcl|NC_019422. 201 TIKWLLKFKTALR-PDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKI 279 (384) Q Consensus 201 ~p~~il~~~~~~~-~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~ 279 (384) ..+++++.+.+.. .+..++.+..+++... ..+.-+++.+++.|.++++|..+-+.....++..++.+.+..+|||..+ T Consensus 137 qIk~fIKTd~d~glee~kekaR~rIk~mla-lAk~~nGityid~~ddItQL~kDYStslk~die~lkS~l~Sq~GinekI 215 (279) T protein:vir:40 137 QIKIYWKTKVSSGLKEVWDRIRERLTQQQQ-LAREFNGVSVIGSDDDIKQIQPDYSGSLQNDANLAIEIALSEYGMPREL 215 (279) T ss_pred eeeeEEecCcchhHHHHHHHHHHHHHHHHH-HHHhcCCeeeecCCceeEeeccccccccHHHHHHHHHHHHhhcCCchhh Confidence 8999999987744 4456666777766554 4444468999999999999999888888888889999999999999999 Q ss_pred hccccHHHHHHHHHHHHHHHHHHHHHHHHhh------cccCcccccCcceEEeechhhhccCHHHHHHHHHHHhCCCCCH Q lcl|NC_019422. 280 IQSKYSEDEWNAYYESEIEPVGLQLSNQYTE------KLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTP 353 (384) Q Consensus 280 l~~~~~e~~~~~~~~~~i~P~~~~i~~~l~~------~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~ 353 (384) |.|+.+|.+..+|+..+|.|++++.+..|.. +.++... .+|.+. T Consensus 216 L~GsAtE~q~iAyy~rtVePILkQyek~liY~~E~fv~y~ttta-----------------------------~gg~~~- 265 (279) T protein:vir:40 216 LYGQSNEVTIIAFAIQKVLPLLKQHDKNIIFNQENFVAYISTTA-----------------------------KGGAIE- 265 (279) T ss_pred ccccCchhhhhhHHHhhHHHHHHHhcccccchhhhhhhhheecc-----------------------------cCcccc- Confidence 9999999999999999999999997664332 2222111 111100 Q ss_pred HHHHHHhCCCCCCCCC Q lcl|NC_019422. 354 NEWRKIMNLSPIENGD 369 (384) Q Consensus 354 NE~R~~lG~~p~~~gd 369 (384) -.-.+-+-+|+ |.| T Consensus 266 -s~~~~~~~~~~-~~~ 279 (279) T protein:vir:40 266 -SKSSKRDSEPV-GND 279 (279) T ss_pred -cccccccCCCC-CCC Confidence 00001112222 111 No 149 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=98.75 E-value=1.9e-08 Score=62.91 Aligned_cols=375 Identities=10% Similarity=0.082 Sum_probs=191.3 Q ss_pred Ccc---hhhhcccCCC-cch------hHHH-hhccccCccee--------chhhhhhcHHHHHHHHHHHHhhccC----- Q lcl|NC_019422. 1 MNI---FKSKKKNKEA-PGK------VMME-LISDSGNGFYS--------WHGNLYKSDIVRSIIRPKAKAVGKM----- 56 (384) Q Consensus 1 M~~---f~~~~~~~~~-~~~------~~~~-~~~~~~~~~~~--------~~~~~~~~~~v~~~i~~ia~~ia~~----- 56 (384) |+= |...+....+ +-+ .... ..+..+.+-.. ..+.++.+|.|.+||+.|++.+.-+ T Consensus 20 ~~~~~~~~~p~~~dG~s~i~~~~~~~~~~~~~~~~~~gg~~~n~~eLI~~YR~ma~~~pEVd~AideIvneaiv~d~~~~ 99 (533) T protein:vir:58 20 LSPMYGMGAPHGAGGSSMIPINMYHPFATAGYASRFYGGIEFNRFFLYDMYDRMDYTDPLISTVLDIIADECTIPNENGN 99 (533) T ss_pred hchhhcccCccCCCCCccccCCCCcchhhhhhhhhhhccccccHHHHHHHHHHhhccCcchhhHHHhhhceeeEecCCCc Confidence 221 2211111111 111 0011 01111111111 1122356799999999999887543 Q ss_pred ceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeC-CCCceeeEEEEcCceEEEEEcC Q lcl|NC_019422. 57 TAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKD-DYNMPTQIYPLNALNVEAIYEN 135 (384) Q Consensus 57 ~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~-~~g~~~~l~~l~~~~v~~~~~~ 135 (384) |+.+-..+.+-.+...+.+.. .+++..--+.++..|+++|+.|..++-+ .++-..+|..|||..++.+++. T Consensus 100 pV~v~l~~~e~s~~iK~kI~~--------lldf~~~~~~~fR~WYVDGriy~Hkiik~~k~GI~elr~lDPr~i~~vr~~ 171 (533) T protein:vir:58 100 IVDVVTKDIELAKAILSYLDY--------VINIEKNAYPIIRNMIKYGDMFLHILEKGSDGTIEKFQVVSPYIFSKRYNP 171 (533) T ss_pred eeEeecccccccHHHHHHHHH--------HhcchhhhhHHHHhhhhcceeEEEeccCCcccchhhheecCCeeeEEEEee Confidence 233322222222222222222 3344455566778899999999998743 4455779999999999998876 Q ss_pred CCEEEEEEEcC--------ceEEEEehhheEEEeccC-CCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEE Q lcl|NC_019422. 136 EVLFLKFLLRN--------GKIVSYPYSDIIHLRKDF-NENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLL 206 (384) Q Consensus 136 ~~~~~~~~~~~--------g~~~~~~~~evih~~~~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il 206 (384) .+...+|.++. +..+.++.+.|+|+.+.. ..++.+++|-+..+.+.+..+.-...+..-+--..+.-+-++ T Consensus 172 ~t~~eyyvy~~~~~~~~s~~~~~kI~~daI~y~~SGl~d~~~~~iisyLhkAiKp~NQLkmiEDAlVIYRisRAPeRRvF 251 (533) T protein:vir:58 172 ETDTWYYVITDVYRNVVSGYFNEDIPEEDVIHFSHKIDTNFFPYGRSYLESARAIWNQLRLMEDALMLYRVVRSVDRRVF 251 (533) T ss_pred ccceEEEeecccccccccCccccccchhheeeeeeccccCCCCceehhhhHHHHHHHHHHHHHHHHHHHhhcCChhheEE Confidence 55444444331 234667889999998774 346778999999999999888887777666655555445566 Q ss_pred eeC-CCCChHHHHHHHHHHHHHhcc---ccccCCcc----------eec----------CCCceeeecccchhHHHHHHH Q lcl|NC_019422. 207 KFK-TALRPDDIKKEVKSFEKNYLQ---IDSEAGGA----------AAT----------DSKYDAEQVKAESYVPNAAQM 262 (384) Q Consensus 207 ~~~-~~~~~e~~~~~~~~~~~~~~~---~~~~~~~~----------~v~----------~~g~~~~~l~~~~~~~~~~~~ 262 (384) .++ +.+.+..++.....+-.+++. ...++|.+ ..+ ..|.+++.|.- ..-.++... T Consensus 252 YIDVGNlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRReGgrgTEI~TLpG-g~lgemeDV 330 (533) T protein:vir:58 252 YVDVGNVPPDKINEYLTNIAMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRRGDRRAVEIDILQG-SKVDLAEDV 330 (533) T ss_pred EEeecCCCccCHHHHHHHHHHhcccceEEeccCCeEeeccchhhhhhhHhhhcccccCCCccceeeecCC-CCCCcHHHH Confidence 665 334333344444444444432 11222322 111 23456666653 233445667 Q ss_pred HHHHHHHHHHhCCCHHHhccc-----cHHHHHHH-HHHHHHHHHHHHHHHHHhhcccCccccc-CcceEEeechh----h Q lcl|NC_019422. 263 DKAIQRLYSFFNTNEKIIQSK-----YSEDEWNA-YYESEIEPVGLQLSNQYTEKLFTRKARS-FGNEIVFEASN----L 331 (384) Q Consensus 263 ~~~~~~I~~~fgvp~~~l~~~-----~~e~~~~~-~~~~~i~P~~~~i~~~l~~~l~~~~~~~-~~~~i~fd~~~----~ 331 (384) ++..+.+..+++||.+-|+.. .+|-.+.. =....|.-+-..+.+.|..+|....... ..+.+.|..|. + T Consensus 331 ~YF~kkLy~ALnVP~sRl~~e~~fgr~~eItRDEiKF~KFI~rLR~rF~~ll~~qLilk~iit~eew~~~f~~Dn~f~El 410 (533) T protein:vir:58 331 EYMLNRLISALKVPKAFIGYEGDVNAKNTLATQDIKFNNTIKRIQGFFVEELERMVRMNKEFADQDFRLVMNRSNSIVEG 410 (533) T ss_pred HHHHHHHHHHhCCCeeecCCCCCCccchhhhHHHHHHHHHHHHHHHHHHHHHhcccccccCcchhheeeeeeccchHHHH Confidence 889999999999999988532 22221111 1344455566677888888877653222 22345554443 2 Q ss_pred hccC-HHHHHHHHHHHh---------CCC--CC-----HHHHHHHhCCCCC---CCCCeeeecCcee-----ecCC---- Q lcl|NC_019422. 332 QYAS-MSTKLNLVQMVD---------RGS--LT-----PNEWRKIMNLSPI---ENGDKPVRRLDTA-----VVEG---- 382 (384) Q Consensus 332 ~~~d-~~~~~~~~~~~~---------~g~--~t-----~NE~R~~lG~~p~---~~gd~~~~~~n~~-----~~~~---- 382 (384) +... +..++.+++.+. .-+ +| -.|+-+..+..++ |+-+.=..|..+- |++. T Consensus 411 Ke~Eil~~Ri~~l~~~dpyvgk~yi~k~ILr~tdei~~q~e~ie~E~~~~~~~~~~~~~e~~~~~~~~~~~~p~~~~~~~ 490 (533) T protein:vir:58 411 ERFAVIEQRIGIAERLKGWVREDWIYSNILQIPYDLKPQEEVAEAAGGGGLFDTGGFGEETTPADFLGERGSPIESPRGR 490 (533) T ss_pred HHHHHHHHHHHHHHHhcchhhHHHHHHHHhcCChhhhHHHHHHHHhhcCCCCCCCCcccccCCcccCccccCcccCCCCh Confidence 2211 223333322111 111 12 1122222222222 1111111222211 1211 Q ss_pred -----CC Q lcl|NC_019422. 383 -----GE 384 (384) Q Consensus 383 -----ge 384 (384) |+ T Consensus 491 ~~~~~~~ 497 (533) T protein:vir:58 491 TEFDFGT 497 (533) T ss_pred hhHhccc Confidence 00 No 150 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=98.68 E-value=8.8e-08 Score=59.26 Aligned_cols=362 Identities=12% Similarity=0.011 Sum_probs=160.3 Q ss_pred Ccch----------------hhhcccCCCcchhHHHhhccccCc--c--eech---hhhhhcHHHHHHHHHHHHhhccCc Q lcl|NC_019422. 1 MNIF----------------KSKKKNKEAPGKVMMELISDSGNG--F--YSWH---GNLYKSDIVRSIIRPKAKAVGKMT 57 (384) Q Consensus 1 M~~f----------------~~~~~~~~~~~~~~~~~~~~~~~~--~--~~~~---~~~~~~~~v~~~i~~ia~~ia~~~ 57 (384) |.-. -+........-.....+..+.... . .... .......+...+|+..+..+--.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~rl~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g 80 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTERTQDLGDNTAYYESERRPDAVGVTVPQQMQKLLAHVGYPRLYIDAIAARQELEG 80 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccccchhHHhhhhhcCcHHHHHHHHHhhhccCc Confidence 1100 000000000000011111111000 0 0000 001123455667777766554334 Q ss_pred eEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCce-------eeEEEEcCceEE Q lcl|NC_019422. 58 AKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMP-------TQIYPLNALNVE 130 (384) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~-------~~l~~l~~~~v~ 130 (384) |.+ .++ . ..+..+..++.+ -........+..+.+.+|.||+.+.++..|.+ ..+.+++|..+. T Consensus 81 ~~~---~~~-~--~~~~~l~~i~~~----N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~ 150 (484) T protein:vir:77 81 FRL---GGA-D--KADEQLWDWWQA----NDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVEPPTNLY 150 (484) T ss_pred eec---CCc-c--hhHHHHHHHHHh----cCHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEEEeccceeE Confidence 432 211 1 112223334332 23456778889999999999999999887753 247778888887 Q ss_pred EEEcCCCE------E---------------------EEEEEcCceEEEEe-----hh--heEEEeccCCCCCccCccHHH Q lcl|NC_019422. 131 AIYENEVL------F---------------------LKFLLRNGKIVSYP-----YS--DIIHLRKDFNENDLFGTSPAK 176 (384) Q Consensus 131 ~~~~~~~~------~---------------------~~~~~~~g~~~~~~-----~~--evih~~~~~~~~~~~G~s~~~ 176 (384) +..+.... . ++|...+|...... .. -|++|.++....+.+|.|.+. T Consensus 151 ~~~D~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~N~~~~~~~~G~s~i~ 230 (484) T protein:vir:77 151 AQIDPRTRQVMRAIRAIEDEEGNEVIGATLYLPNNTVIWNREDGQWVQVANVAHNLEMVPVIPIPNRTRLSDLYGTTEIT 230 (484) T ss_pred EEecCCCCceEEEEEEEEeecCCcEEEEEEEecCeEEEEEecCCceEeeccccCCCCCcceEEeccccccCccCCcccch Confidence 76654211 0 11111112111100 01 146666444455668888765 Q ss_pred H-HHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHH--HHHHHHHHHHhccccccCCcceecC-CCceeeeccc Q lcl|NC_019422. 177 V-LEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDI--KKEVKSFEKNYLQIDSEAGGAAATD-SKYDAEQVKA 252 (384) Q Consensus 177 ~-~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~--~~~~~~~~~~~~~~~~~~~~~~v~~-~g~~~~~l~~ 252 (384) . +...++..............-.+.|..++. +........ .+....|+. ..+.++.++ ++.++.++.. T Consensus 231 ~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~-G~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~q~~~ 302 (484) T protein:vir:77 231 PELRSVTDAAARTLMLMQATAELMGVPQRLLF-GVKGEELGVDPETGQTLFDA-------YLARILAFEDHESKAQQFSA 302 (484) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHh-CCCcchhcccccccchhhhh-------hhhhhcccCCCCceeEeecC Confidence 3 333334443333333333333344543332 111111110 111111211 123444444 5677877776 Q ss_pred chhHHHHHHHHHHHHHHHHHhCCCHHHhcccc----HHHHHHH--------------HHHHHHHHHHHHHHHHHhhcccC Q lcl|NC_019422. 253 ESYVPNAAQMDKAIQRLYSFFNTNEKIIQSKY----SEDEWNA--------------YYESEIEPVGLQLSNQYTEKLFT 314 (384) Q Consensus 253 ~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~~~----~e~~~~~--------------~~~~~i~P~~~~i~~~l~~~l~~ 314 (384) .+-+.-...++..+.+|+..-++|+..+|+.. +..+.+. .+...+.-++..+....+ . T Consensus 303 ~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~~~----~ 378 (484) T protein:vir:77 303 AELRNFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEAIRSSESRLVKTVERKNKIFGGAWEQAMRVAYKVMN----G 378 (484) T ss_pred CChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC----C Confidence 55444455678889999999999999998642 2211111 111222222222211111 1 Q ss_pred cccccCcceEEeechhhhccCHHHHHHHH-HHHhCC--CCCHHHHHHHhCCCCCC--CC----Ce-------eeecC-ce Q lcl|NC_019422. 315 RKARSFGNEIVFEASNLQYASMSTKLNLV-QMVDRG--SLTPNEWRKIMNLSPIE--NG----DK-------PVRRL-DT 377 (384) Q Consensus 315 ~~~~~~~~~i~fd~~~~~~~d~~~~~~~~-~~~~~g--~~t~NE~R~~lG~~p~~--~g----d~-------~~~~~-n~ 377 (384) .........+++.+.+....+..+.++.+ |++..| +++..-+++++|+.+.+ .. ++ .+.+. .. T Consensus 379 ~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~s~et~~~~l~~~~~~~~e~~~~~~ee~~~~~~~~~~~~~~ 458 (484) T protein:vir:77 379 GDIPPEYYRMESIWRDPSTPTYAAKADAATKLYNNGQGVIPKERARIDMGYSITEREEMRKWDEEEQAQGLGLMGTMFGT 458 (484) T ss_pred CCcccccccceEEecCCCCCCHHHHHHHHHHHHhccCCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHhhhccc Confidence 11111223455556666667777777654 566654 88888888888876542 10 00 00000 00 Q ss_pred eecCCCC Q lcl|NC_019422. 378 AVVEGGE 384 (384) Q Consensus 378 ~~~~~ge 384 (384) .+-.+|+ T Consensus 459 ~~~~~~~ 465 (484) T protein:vir:77 459 DPSGGGN 465 (484) T ss_pred cccCCCC Confidence 0001111 No 151 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=98.67 E-value=1e-07 Score=58.94 Aligned_cols=339 Identities=10% Similarity=0.013 Sum_probs=162.3 Q ss_pred hhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHH Q lcl|NC_019422. 21 LISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQL 100 (384) Q Consensus 21 ~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~ 100 (384) ++.......+-.........+...+|+.+++.+---.|. .. ++ ..+..+..++.+ | ........+..+. T Consensus 1 ~l~~~~~~~~~~~~~~~v~n~~~~ivd~~~~~l~~~gf~---~~-d~---~~~~~~~~i~~~-N---~~d~~~~~~~~~a 69 (434) T protein:vir:98 1 MLPKNAEQAFLDFQRKARTNFCGLIANASVHRLLALGVT---GP-DG---EPDTRASRWWQA-N---RLDSRQKLVWRMA 69 (434) T ss_pred CCCCCccHHHHHhhhhhhccchHHHHHHHHhhhccCcee---cC-CC---chHHHHHHHHHh-c---ChhHHHHHHHHHH Confidence 111000000000000012235677777777655333332 11 11 112223334332 3 3445777788999 Q ss_pred HHhCCeeEEEeeCCCCc------eeeEEEEcCceEEEEEcCCC------EEEEEEEcCceE---EEE------------- Q lcl|NC_019422. 101 ELNSNAFAVIIKDDYNM------PTQIYPLNALNVEAIYENEV------LFLKFLLRNGKI---VSY------------- 152 (384) Q Consensus 101 l~~G~~~~~~~~~~~g~------~~~l~~l~~~~v~~~~~~~~------~~~~~~~~~g~~---~~~------------- 152 (384) +.+|.+|+.+.++..+. ...+.+++|..+.+..+... ..++....++.. +.+ T Consensus 70 ~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 149 (434) T protein:vir:98 70 MAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHNDIDGFGYARVFFDDTSFPYRTRERT 149 (434) T ss_pred hhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEeccCCceEEEEEEeCcEEEEEEeecc Confidence 99999999998775543 22366789988887776432 111111111100 000 Q ss_pred -----------------------eh--hheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEe Q lcl|NC_019422. 153 -----------------------PY--SDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLK 207 (384) Q Consensus 153 -----------------------~~--~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~ 207 (384) +. =-|+||.++...++ .|.|-++.+...++.............+-.+.|..++. T Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~-~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~ 228 (434) T protein:vir:98 150 GARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGE-DPEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKWIK 228 (434) T ss_pred ccccccccccceecccccccccCCCCccceEEeccCCCcCc-CCcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc Confidence 00 01455543332223 68999998888888887777666666665566654543 Q ss_pred -eCC-CCChHHHHHHHHHHHHHhccccccCCcceecC-CCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhccc- Q lcl|NC_019422. 208 -FKT-ALRPDDIKKEVKSFEKNYLQIDSEAGGAAATD-SKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQSK- 283 (384) Q Consensus 208 -~~~-~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~-~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~~- 283 (384) .+. ....+. ......+ +.+.. ..+.+..++ ++.++.++.......-...++..+.+++..=++|+..++++ T Consensus 229 G~~~~~~~~~~-~~~~~~~-~~~~~---~~~~i~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~~~~~ 303 (434) T protein:vir:98 229 GHKFAKRTDPA-TGMTVVD-QPFVP---SPSAVWASEGENTQFGQLDATDLSGFLKEHASDVRDMLTISQTPTYLYATDL 303 (434) T ss_pred CCCcccccccc-cccchhh-hhhhc---cccccccCCCCCceEEEecCcchHHHHHHHHHHHHHHhcccCCCHHHhcccc Confidence 111 111111 1111111 11211 123444444 45777777655444444556788999999999999999863 Q ss_pred -cH--HHHHH-------------HHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HH Q lcl|NC_019422. 284 -YS--EDEWN-------------AYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MV 346 (384) Q Consensus 284 -~~--e~~~~-------------~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~ 346 (384) +. ++... ..+...+.-+++.+. ++.... .....+++.+......+..+.++.+. +. T Consensus 304 ~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~-----~~~g~~--~~~~~~~v~w~~~~~~s~~~~ada~~kl~ 376 (434) T protein:vir:98 304 VNISADTIGALDILHVAKVREHIASFSEGLESVLALAA-----AQAGVP--EDYTEAEVRWANPAHVTMAVKADAATKLK 376 (434) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhcCCC--hhheeeeEEecCCCCCCHHHHHHHHHHHH Confidence 21 11111 111222222222111 111111 12224555556666778888888764 55 Q ss_pred hCCCCCHHHHHHHhCCCCCCC------CCe------eeec--Cc----eeecCCCC Q lcl|NC_019422. 347 DRGSLTPNEWRKIMNLSPIEN------GDK------PVRR--LD----TAVVEGGE 384 (384) Q Consensus 347 ~~g~~t~NE~R~~lG~~p~~~------gd~------~~~~--~n----~~~~~~ge 384 (384) ..|+ +..-+++++|+++-+- .++ ...+ .. ..+-+++- T Consensus 377 ~~g~-~~e~~~~~lg~~~~e~~r~~~e~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 431 (434) T protein:vir:98 377 SIGY-PLDVIAEELDESPARVRRIVAGAASQALLAASLLPAPGAPSAGNVPDSGGA 431 (434) T ss_pred hcCC-cHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCcccCC Confidence 5564 7777788888866320 000 0000 10 01111110 No 152 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=98.52 E-value=3.7e-07 Score=55.86 Aligned_cols=362 Identities=12% Similarity=0.066 Sum_probs=161.7 Q ss_pred Ccc------------hhhhcccCCCcchhHHHhhccccCc--cee-----chhhhhhcHHHHHHHHHHHHhhccCceEEE Q lcl|NC_019422. 1 MNI------------FKSKKKNKEAPGKVMMELISDSGNG--FYS-----WHGNLYKSDIVRSIIRPKAKAVGKMTAKHI 61 (384) Q Consensus 1 M~~------------f~~~~~~~~~~~~~~~~~~~~~~~~--~~~-----~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~ 61 (384) +|+ +-+.-......-.....+..+.... ... .........+...+|+..+..+.-.+|.+ T Consensus 6 ~~~~~~~~~~~~~~~L~~~~~~~~~r~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~- 84 (485) T protein:vir:24 6 PGQEEIADPAIARDEMVSAFEDQNQNLRSNTSYYEAERRPEAIGVTVPVQMQSLLAHVGYPRLYVDSIAERQAVEGFRL- 84 (485) T ss_pred CCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCchhhcCcccchhhhhhhhccchHHHHHHHHhhhhccCceec- Confidence 111 1000000000000011111111000 000 00011113455666776666654444542 Q ss_pred EecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCce-------eeEEEEcCceEEEEEc Q lcl|NC_019422. 62 RSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMP-------TQIYPLNALNVEAIYE 134 (384) Q Consensus 62 ~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~-------~~l~~l~~~~v~~~~~ 134 (384) .+ + ...+..+..++.. | ........+..+.+.+|.||+.+.++..+.. ..+.+++|..+.+..+ T Consensus 85 --~~-~--~~~~~~l~~i~~~-N---~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~i~D 155 (485) T protein:vir:24 85 --GD-A--DEADEELWQWWQA-N---NLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIRVEPPTRMYAEID 155 (485) T ss_pred --CC-C--chhHHHHHHHHHh-c---ChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceEEEeccceeEEEee Confidence 21 1 1112223334332 2 3456788899999999999999988766532 2477788888877665 Q ss_pred CCCE------E---------------------EEEEEcCceEEEE-----ehh--heEEEeccCCCCCccCccHHHH-HH Q lcl|NC_019422. 135 NEVL------F---------------------LKFLLRNGKIVSY-----PYS--DIIHLRKDFNENDLFGTSPAKV-LE 179 (384) Q Consensus 135 ~~~~------~---------------------~~~~~~~g~~~~~-----~~~--evih~~~~~~~~~~~G~s~~~~-~~ 179 (384) .... . +++...+|+.... +.. -|+||+++....+.+|.|.+.. +. T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~ 235 (485) T protein:vir:24 156 PRIGRPAKAIRVAYDAEGNEIQAATLYTPNETFGWFRAEGEWVEWFSDPHGLGAVPVVPLPNRTRLSDLYGTSEITPELR 235 (485) T ss_pred CCcCceeEEEEEEEeecCCeEEEEEEEcCCcEEEEEecCCceEeecccccCCCcccEEEeccCcccCCcCCcccchhhHH Confidence 4311 0 1111112211110 001 2466655444566789887763 44 Q ss_pred HHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHH--HHHHHHHHHHhccccccCCcceec-CCCceeeecccchhH Q lcl|NC_019422. 180 PIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDI--KKEVKSFEKNYLQIDSEAGGAAAT-DSKYDAEQVKAESYV 256 (384) Q Consensus 180 ~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~--~~~~~~~~~~~~~~~~~~~~~~v~-~~g~~~~~l~~~~~~ 256 (384) ..++..............-.+.|..++. +........ +.....|.. ..+.+..+ +++.++.++...+.+ T Consensus 236 ~liDa~~~~~s~~~~~~~~~a~p~~~i~-G~~~~~~~~~~~~~~~~~~~-------~~~~i~~~~~~~~~~~q~~~~~~e 307 (485) T protein:vir:24 236 SMTDAAARILMLMQATAELMGVPQRLIF-GIKPEEIGVDPETGQTLFDA-------YLARILAFEDAEGKIQQFSAAELA 307 (485) T ss_pred HHHHHHHHHHHHHHHHHHhhcchhhhhc-cCCccccccccccccchhhh-------cccceeccCCCCceEEeecccchH Confidence 4455444444444344444445544443 111111100 111111111 12334444 356777777655544 Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHhcccc----HHHHH--------------HHHHHHHHHHHHHHHHHHHhhcccCcccc Q lcl|NC_019422. 257 PNAAQMDKAIQRLYSFFNTNEKIIQSKY----SEDEW--------------NAYYESEIEPVGLQLSNQYTEKLFTRKAR 318 (384) Q Consensus 257 ~~~~~~~~~~~~I~~~fgvp~~~l~~~~----~e~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~ 318 (384) .-...++..+.+++..=++|+..+|++. +..+. ...+...+.-+++.+....+. .... T Consensus 308 ~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~----~~~~ 383 (485) T protein:vir:24 308 NFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNAIFGGAWEEAMRLAYRLMKG----GDVP 383 (485) T ss_pred HHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC----CCCc Confidence 4445667788889999999999998652 22111 112333344333333322111 1111 Q ss_pred cCcceEEeechhhhccCHHHHHHHH-HHHhC--CCCCHHHHHHHhCCCCCC--CCCe-----------eeec-Cceeec- Q lcl|NC_019422. 319 SFGNEIVFEASNLQYASMSTKLNLV-QMVDR--GSLTPNEWRKIMNLSPIE--NGDK-----------PVRR-LDTAVV- 380 (384) Q Consensus 319 ~~~~~i~fd~~~~~~~d~~~~~~~~-~~~~~--g~~t~NE~R~~lG~~p~~--~gd~-----------~~~~-~n~~~~- 380 (384) .....+++.+......+..+.++.+ +++.. |+++..-+++++|+.+.+ .... ..-. .+..+. T Consensus 384 ~d~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l~~~~d~~~e~~~~~ee~~~~~~~~~~~~~~~~~~~ 463 (485) T protein:vir:24 384 PDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLLGTMVDADPTV 463 (485) T ss_pred cccceeeEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHhhCCCCHhHHHHHHHHHHHHhhhhhhHHHhhcccCCCC Confidence 1223455555555566777777654 56554 477777777777776532 0000 0000 001111 Q ss_pred ----------------CCCC Q lcl|NC_019422. 381 ----------------EGGE 384 (384) Q Consensus 381 ----------------~~ge 384 (384) ++|| T Consensus 464 ~~~~~~~e~~~~~~~~~~~~ 483 (485) T protein:vir:24 464 PGSPNPTPAPKPQPAIEGGD 483 (485) T ss_pred CCCCCCCCCCCCccCCCCCC Confidence 1111 No 153 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=98.44 E-value=6.2e-07 Score=54.60 Aligned_cols=362 Identities=8% Similarity=0.001 Sum_probs=170.6 Q ss_pred CcchhhhcccCCCcc---hhHHHhhccccCc------ceech---hhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_019422. 1 MNIFKSKKKNKEAPG---KVMMELISDSGNG------FYSWH---GNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEF 68 (384) Q Consensus 1 M~~f~~~~~~~~~~~---~~~~~~~~~~~~~------~~~~~---~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~ 68 (384) ..+..+....-.... .....+..+.... ..... ..-..+.+...+|+..+..+-.-||.+-. .++. T Consensus 7 ~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~-~~d~- 84 (456) T protein:vir:10 7 AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG-SADS- 84 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCC-CCCc- Confidence 122221111100000 1111111111100 00000 11123457788888888888878887521 1111 Q ss_pred eeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCE------EEEE Q lcl|NC_019422. 69 KTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVL------FLKF 142 (384) Q Consensus 69 ~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~------~~~~ 142 (384) .....+..++.+ | ....+...+..+++.+|.+|+.+..+..|.+. +..++|..+.+..+.... ..+| T Consensus 85 --~~~~~~~~i~~~-N---~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~-i~~~~p~~~~~i~d~~~~~~~~~~i~~~ 157 (456) T protein:vir:10 85 --DLALRARRIWRD-N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTAT-ITADSPETMVVSVDPLQPWRIRAAMRWW 157 (456) T ss_pred --chHHHHHHHHHh-c---ChhhHHHHHHHHHhhcCeeEEEEeeCCCCceE-EEEEccceeEEEEcCCCCcceEEEEEEE Confidence 112233444433 3 34456677889999999999999998888654 677888888877764421 1111 Q ss_pred EEcCceEE----------------------------EEehhh------eEEEeccC---CCCCccCccHHHHHHHHHHHH Q lcl|NC_019422. 143 LLRNGKIV----------------------------SYPYSD------IIHLRKDF---NENDLFGTSPAKVLEPIMEVV 185 (384) Q Consensus 143 ~~~~g~~~----------------------------~~~~~e------vih~~~~~---~~~~~~G~s~~~~~~~~i~~~ 185 (384) ...++... ...... .-|.-... +.+...|+|.++.....++.. T Consensus 158 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~g~gd~e~vi~liDa~ 237 (456) T protein:vir:10 158 RDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRI 237 (456) T ss_pred EecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecCCCCCchhhhhHHHHHHH Confidence 11111000 000000 00000000 011236888888887777776 Q ss_pred HHHHHHHHHHHHccCCcceEEeeC-CC--CChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHH Q lcl|NC_019422. 186 NTTDQGVVKAIKNSNTIKWLLKFK-TA--LRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM 262 (384) Q Consensus 186 ~~~~~~~~~~~~ng~~p~~il~~~-~~--~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~ 262 (384) .....-........+.|..++.-. .. ..++.-..+. ..+.+. ...+.++.++++.++.+++..+-..-...+ T Consensus 238 ~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~--~~~~~~---~~~~~~~~~~~~~~~~q~~~~~~~~~~~~l 312 (456) T protein:vir:10 238 NRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAID--YASIFE---AAPGALWELPPGVDIWESQANDFTPMLSAI 312 (456) T ss_pred HHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccc--hhhhhh---hhccccccCCCCcceEEecccChhHHHHHH Confidence 655444333333333443333210 00 0011111110 011111 122456667888888887654433334456 Q ss_pred HHHHHHHHHHhCCCHHHhccc--cH-HHHHH--------------HHHHHHHHHHHHHHHHHHhhcccCcccccCcceEE Q lcl|NC_019422. 263 DKAIQRLYSFFNTNEKIIQSK--YS-EDEWN--------------AYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIV 325 (384) Q Consensus 263 ~~~~~~I~~~fgvp~~~l~~~--~~-e~~~~--------------~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~ 325 (384) +..+.+|++.=++|+..+++. +. ..+.+ ..+...+.-+++.+. + +.+.. ....++ T Consensus 313 ~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~-~----~~g~~---~~~~~~ 384 (456) T protein:vir:10 313 KEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKAL-Q----IEGES---VEDTVD 384 (456) T ss_pred HHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-H----hcCCC---ccccee Confidence 788999999999999998763 21 11111 111222222222211 1 11111 112345 Q ss_pred eechhhhccCHHHHHHHH-HHHhCCCCCHHHHHHHhCCCCCC----CCCeeeec-----Cc--eeecCCCC Q lcl|NC_019422. 326 FEASNLQYASMSTKLNLV-QMVDRGSLTPNEWRKIMNLSPIE----NGDKPVRR-----LD--TAVVEGGE 384 (384) Q Consensus 326 fd~~~~~~~d~~~~~~~~-~~~~~g~~t~NE~R~~lG~~p~~----~gd~~~~~-----~n--~~~~~~ge 384 (384) +.+.+....+..+.++.+ ++...|+++..-+++++|+.+.+ ..+...-. .+ -.|-++|. T Consensus 385 v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~~~~~~~ 455 (456) T protein:vir:10 385 VSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQEDGS 455 (456) T ss_pred EEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhcCCCCCC Confidence 555555666777887765 56778999998889999987641 11111111 11 11233333 No 154 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=98.44 E-value=6.2e-07 Score=54.60 Aligned_cols=362 Identities=8% Similarity=0.001 Sum_probs=170.6 Q ss_pred CcchhhhcccCCCcc---hhHHHhhccccCc------ceech---hhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_019422. 1 MNIFKSKKKNKEAPG---KVMMELISDSGNG------FYSWH---GNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEF 68 (384) Q Consensus 1 M~~f~~~~~~~~~~~---~~~~~~~~~~~~~------~~~~~---~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~ 68 (384) ..+..+....-.... .....+..+.... ..... ..-..+.+...+|+..+..+-.-||.+-. .++. T Consensus 7 ~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~-~~d~- 84 (456) T protein:vir:10 7 AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG-SADS- 84 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCC-CCCc- Confidence 122221111100000 1111111111100 00000 11123457788888888888878887521 1111 Q ss_pred eeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCE------EEEE Q lcl|NC_019422. 69 KTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVL------FLKF 142 (384) Q Consensus 69 ~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~------~~~~ 142 (384) .....+..++.+ | ....+...+..+++.+|.+|+.+..+..|.+. +..++|..+.+..+.... ..+| T Consensus 85 --~~~~~~~~i~~~-N---~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~-i~~~~p~~~~~i~d~~~~~~~~~~i~~~ 157 (456) T protein:vir:10 85 --DLALRARRIWRD-N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTAT-ITADSPETMVVSVDPLQPWRIRAAMRWW 157 (456) T ss_pred --chHHHHHHHHHh-c---ChhhHHHHHHHHHhhcCeeEEEEeeCCCCceE-EEEEccceeEEEEcCCCCcceEEEEEEE Confidence 112233444433 3 34456677889999999999999998888654 677888888877764421 1111 Q ss_pred EEcCceEE----------------------------EEehhh------eEEEeccC---CCCCccCccHHHHHHHHHHHH Q lcl|NC_019422. 143 LLRNGKIV----------------------------SYPYSD------IIHLRKDF---NENDLFGTSPAKVLEPIMEVV 185 (384) Q Consensus 143 ~~~~g~~~----------------------------~~~~~e------vih~~~~~---~~~~~~G~s~~~~~~~~i~~~ 185 (384) ...++... ...... .-|.-... +.+...|+|.++.....++.. T Consensus 158 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~g~gd~e~vi~liDa~ 237 (456) T protein:vir:10 158 RDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRI 237 (456) T ss_pred EecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecCCCCCchhhhhHHHHHHH Confidence 11111000 000000 00000000 011236888888887777776 Q ss_pred HHHHHHHHHHHHccCCcceEEeeC-CC--CChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHHH Q lcl|NC_019422. 186 NTTDQGVVKAIKNSNTIKWLLKFK-TA--LRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQM 262 (384) Q Consensus 186 ~~~~~~~~~~~~ng~~p~~il~~~-~~--~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~ 262 (384) .....-........+.|..++.-. .. ..++.-..+. ..+.+. ...+.++.++++.++.+++..+-..-...+ T Consensus 238 ~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~--~~~~~~---~~~~~~~~~~~~~~~~q~~~~~~~~~~~~l 312 (456) T protein:vir:10 238 NRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAID--YASIFE---AAPGALWELPPGVDIWESQANDFTPMLSAI 312 (456) T ss_pred HHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccc--hhhhhh---hhccccccCCCCcceEEecccChhHHHHHH Confidence 655444333333333443333210 00 0011111110 011111 122456667888888887654433334456 Q ss_pred HHHHHHHHHHhCCCHHHhccc--cH-HHHHH--------------HHHHHHHHHHHHHHHHHHhhcccCcccccCcceEE Q lcl|NC_019422. 263 DKAIQRLYSFFNTNEKIIQSK--YS-EDEWN--------------AYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIV 325 (384) Q Consensus 263 ~~~~~~I~~~fgvp~~~l~~~--~~-e~~~~--------------~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~ 325 (384) +..+.+|++.=++|+..+++. +. ..+.+ ..+...+.-+++.+. + +.+.. ....++ T Consensus 313 ~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~-~----~~g~~---~~~~~~ 384 (456) T protein:vir:10 313 KEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKAL-Q----IEGES---VEDTVD 384 (456) T ss_pred HHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-H----hcCCC---ccccee Confidence 788999999999999998763 21 11111 111222222222211 1 11111 112345 Q ss_pred eechhhhccCHHHHHHHH-HHHhCCCCCHHHHHHHhCCCCCC----CCCeeeec-----Cc--eeecCCCC Q lcl|NC_019422. 326 FEASNLQYASMSTKLNLV-QMVDRGSLTPNEWRKIMNLSPIE----NGDKPVRR-----LD--TAVVEGGE 384 (384) Q Consensus 326 fd~~~~~~~d~~~~~~~~-~~~~~g~~t~NE~R~~lG~~p~~----~gd~~~~~-----~n--~~~~~~ge 384 (384) +.+.+....+..+.++.+ ++...|+++..-+++++|+.+.+ ..+...-. .+ -.|-++|. T Consensus 385 v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~~~~~~~ 455 (456) T protein:vir:10 385 VSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQEDGS 455 (456) T ss_pred EEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhcCCCCCC Confidence 555555666777887765 56778999998889999987641 11111111 11 11233333 No 155 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=98.36 E-value=1.1e-06 Score=53.26 Aligned_cols=361 Identities=9% Similarity=0.031 Sum_probs=166.6 Q ss_pred CcchhhhcccCCCc---chhHHHhhccccCc------cee---chhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_019422. 1 MNIFKSKKKNKEAP---GKVMMELISDSGNG------FYS---WHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEF 68 (384) Q Consensus 1 M~~f~~~~~~~~~~---~~~~~~~~~~~~~~------~~~---~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~ 68 (384) +.+.+......... -.....+..+.... ... .........+...+|+..+..+-.-||.+- ..++. T Consensus 7 ~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~-~~~d~- 84 (456) T protein:vir:79 7 AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVG-GSADS- 84 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhccCCeecC-CCCCc- Confidence 11111110000000 00011111111000 000 000112234678888888888877788752 11111 Q ss_pred eeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCE------EEEE Q lcl|NC_019422. 69 KTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVL------FLKF 142 (384) Q Consensus 69 ~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~------~~~~ 142 (384) ........++.+ | ....+.+.+..+++.+|.||+.+..+..|.+ .+..++|..+.+..+.... ..++ T Consensus 85 --~~~~~~~~~~~~-n---~~d~~~~~~~~~a~~~G~a~~~~~~~edg~~-~i~~~~p~~~~~i~d~~~~~~~~~~~~~~ 157 (456) T protein:vir:79 85 --DLALRARRIWRD-N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWW 157 (456) T ss_pred --cHHHHHHHHHHh-c---ChhHHHHHHHHHHhhcCeeEEEEeeCCCCce-EEEEeccceeEEEEcCCCCCceEEEEEEE Confidence 111223334433 2 3446677889999999999999999888876 4778889888877664221 0111 Q ss_pred EEcCceE-----------EE-----------------------EehhheEEEeccC---CCCCccCccHHHHHHHHHHHH Q lcl|NC_019422. 143 LLRNGKI-----------VS-----------------------YPYSDIIHLRKDF---NENDLFGTSPAKVLEPIMEVV 185 (384) Q Consensus 143 ~~~~g~~-----------~~-----------------------~~~~evih~~~~~---~~~~~~G~s~~~~~~~~i~~~ 185 (384) ...++.. +. .+..++-|.-... +.+...|+|.+..+...++.. T Consensus 158 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~~~gd~e~v~~liD~~ 237 (456) T protein:vir:79 158 RDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRI 237 (456) T ss_pred EecCCceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeecccccCCCCceeEEEecCCCCCchhhhhHHHHHHH Confidence 1111100 00 0000110100000 011235778888777766665 Q ss_pred HHHHHHHHHHHHccCCcceEEeeCCCCC----hHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHH Q lcl|NC_019422. 186 NTTDQGVVKAIKNSNTIKWLLKFKTALR----PDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQ 261 (384) Q Consensus 186 ~~~~~~~~~~~~ng~~p~~il~~~~~~~----~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~ 261 (384) .....-........+.|..++. +...+ ++.-+.+ . ..+.+. ...+.++.++++.++.++...+-..-... T Consensus 238 ~~~~s~~~~~~~~~a~~~~~~~-G~~~~~~~~d~~g~~i-~-~~~~~~---~~~~~~~~~~~~~~~~q~~~~~~~~~~~~ 311 (456) T protein:vir:79 238 NRAELQLLSTMAIQAFRQRALK-SSEHRLPKVDENGNAI-D-YASIFE---AAPGALWELPPGVDIWESQTNDFTPMLSA 311 (456) T ss_pred HHHHHHHHHHHHHHhhHHHHHh-cCCccccccccccccc-c-hhhhhh---hhccccccCCCCcceeeecccChHHHHHH Confidence 5444333333333333332321 11111 1100110 0 011111 12245666788888877765544443445 Q ss_pred HHHHHHHHHHHhCCCHHHhccc--c-HHHHHH--------------HHHHHHHHHHHHHHHHHHhhcccCcccccCcceE Q lcl|NC_019422. 262 MDKAIQRLYSFFNTNEKIIQSK--Y-SEDEWN--------------AYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEI 324 (384) Q Consensus 262 ~~~~~~~I~~~fgvp~~~l~~~--~-~e~~~~--------------~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i 324 (384) ++..+.+|++.=++|+..+++. + +..+.+ ..+...+.-+++.+. + +.+. .....+ T Consensus 312 l~~~i~~i~~~t~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~-~----~~g~---~~~~~i 383 (456) T protein:vir:79 312 IKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKAL-Q----IEGE---SVEDTV 383 (456) T ss_pred HHHHHHHHHhhcCCChhHhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-H----hcCC---Cccccc Confidence 6788999999999999998753 2 211111 122222332222221 1 1111 111234 Q ss_pred EeechhhhccCHHHHHHHH-HHHhCCCCCHHHHHHHhCCCCCC----CCCeeeecCce---eecCCCC Q lcl|NC_019422. 325 VFEASNLQYASMSTKLNLV-QMVDRGSLTPNEWRKIMNLSPIE----NGDKPVRRLDT---AVVEGGE 384 (384) Q Consensus 325 ~fd~~~~~~~d~~~~~~~~-~~~~~g~~t~NE~R~~lG~~p~~----~gd~~~~~~n~---~~~~~ge 384 (384) +..+.+....+..+.++++ +++..|+++..-+++.+|+.+.+ ..+......+. .++..++ T Consensus 384 ~v~w~~~~~~s~~~~ada~~kl~~~G~~~~~~~~~~lg~~~~~i~~~e~~r~~~e~~~~~~~~~~~~~ 451 (456) T protein:vir:79 384 DVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQ 451 (456) T ss_pred eEEeCCCCCcCHHHHHHHHHHHHhcCCChHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhhhHhhcCC Confidence 4444555566777777765 56778999998888999997641 11111110110 1122222 No 156 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=98.34 E-value=1.2e-06 Score=53.06 Aligned_cols=359 Identities=9% Similarity=0.047 Sum_probs=173.2 Q ss_pred CcchhhhcccCCC-cc---hhHHHhhccccC-------cceech------------------------hhhhhcHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEA-PG---KVMMELISDSGN-------GFYSWH------------------------GNLYKSDIVRSI 45 (384) Q Consensus 1 M~~f~~~~~~~~~-~~---~~~~~~~~~~~~-------~~~~~~------------------------~~~~~~~~v~~~ 45 (384) |.+...+.+.... .+ ..+..+++.... .|+... .+=..++....+ T Consensus 13 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~~~i 92 (503) T protein:vir:59 13 EELNEIIVESAKEIAEPDTTMIQKLIDEHNPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKTNNRTSHAWHKLF 92 (503) T ss_pred HhHHHhhhhhhhhccchhHHHHHHHHHhhcHHHHHHHHHHhccccchhhccchhcccccccccccccccceeecchHHHH Confidence 2222222221111 00 011111111000 000000 001124567788 Q ss_pred HHHHHHhhccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEc Q lcl|NC_019422. 46 IRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLN 125 (384) Q Consensus 46 i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~ 125 (384) ++..+.-+..-|+++- .++ . ...+....++. | ........+..+.+.+|.+|+.+..+..|++. +..++ T Consensus 93 vd~~~~yl~g~~~~~~-~~d--~--~~~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~-i~~~~ 161 (503) T protein:vir:59 93 VDQKTQYLVGEPVTFT-SDN--K--TLLEYVNELAD--D---DFDDILNETVKNMSNKGIEYWHPFVDEEGEFD-YVIFP 161 (503) T ss_pred HHHHHhhhhcCCeeec-cCc--H--HHHHHHHHHHh--c---CHHHHHHHHHHHHhhCCeEEEEEeecCCCceE-EEEEc Confidence 8888888888787751 111 1 11122233322 2 45567777889999999999999999888754 78889 Q ss_pred CceEEEEEcCCC--E----EEEEEEc--Cce----EEEEehhheEEEecc-------------------------CC--- Q lcl|NC_019422. 126 ALNVEAIYENEV--L----FLKFLLR--NGK----IVSYPYSDIIHLRKD-------------------------FN--- 165 (384) Q Consensus 126 ~~~v~~~~~~~~--~----~~~~~~~--~g~----~~~~~~~evih~~~~-------------------------~~--- 165 (384) |..+.+..+... . +.+|... .++ ...+.++.+.++... ++ T Consensus 162 p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 241 (503) T protein:vir:59 162 AEEMIVVYKDNTRRDILFALRYYSYKGIMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQAIGWGR 241 (503) T ss_pred cceeEEEEeCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEEcCCcccccccccccccccceeecceeccCCc Confidence 988887766532 1 1112211 111 112233333333211 00 Q ss_pred ------CCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcce Q lcl|NC_019422. 166 ------ENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAA 239 (384) Q Consensus 166 ------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~ 239 (384) .+...|.|.+..+...++..........+.++..+.|-.+++- .... ..+.....+. ..+++ T Consensus 242 vPiv~~~nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g-~~~~--~~~~~~~~~~---------~~~~~ 309 (503) T protein:vir:59 242 VPIIPFKNNEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKN-YDGE--NPKEFTANLR---------YHSVI 309 (503) T ss_pred cceEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeec-CCcc--ccchhhhhhh---------cccce Confidence 1223588888888888888777776677777777777555432 1111 1111111111 12345 Q ss_pred ecCCCceeeecccchhHHHH-HH---HHHHHHHHHHHhCCCHHHhccccHHHH--------------HHHHHHHHHHHHH Q lcl|NC_019422. 240 ATDSKYDAEQVKAESYVPNA-AQ---MDKAIQRLYSFFNTNEKIIQSKYSEDE--------------WNAYYESEIEPVG 301 (384) Q Consensus 240 v~~~g~~~~~l~~~~~~~~~-~~---~~~~~~~I~~~fgvp~~~l~~~~~e~~--------------~~~~~~~~i~P~~ 301 (384) .++++.+++.+........+ .. ++..+.++|...++++..++++.+..+ ....+...+.-++ T Consensus 310 ~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~ 389 (503) T protein:vir:59 310 KVSGDGGVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSPETIGGGATGPALENLYALLDLKANMAERKIRAGLRLFF 389 (503) T ss_pred eccCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 55555555555443322222 22 333444556555555555544333222 1223445555555 Q ss_pred HHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHH-HHHhCCCCCHHHHHHHhCCCCCCCC--Ceee------ Q lcl|NC_019422. 302 LQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLV-QMVDRGSLTPNEWRKIMNLSPIENG--DKPV------ 372 (384) Q Consensus 302 ~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~-~~~~~g~~t~NE~R~~lG~~p~~~g--d~~~------ 372 (384) ..+...++..-.. .......+.+.+..-...|..+.++.+ +++.+|+++...+.++++.-+.+.. ...- T Consensus 390 ~~i~~~~~~~~~~--~~~~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~GiiS~et~l~~l~~v~d~~~E~~ri~~E~~~~ 467 (503) T protein:vir:59 390 WFFAEYLRNTGKG--DFNPDKELTMTFTRTRIQNDSEIVQSLVQGVTGGIMSKETAVARNPFVQDPEEELARIEEEMNQY 467 (503) T ss_pred HHHHHHHHhccCc--ccccccceeEEeCCCCCCCHHHHHHHHHHHHhCCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHH Confidence 5555555432211 112223355666666677888888765 6888999999888888876442110 0000 Q ss_pred --ecCcee---e-cCCCC Q lcl|NC_019422. 373 --RRLDTA---V-VEGGE 384 (384) Q Consensus 373 --~~~n~~---~-~~~ge 384 (384) ...+.. + .++++ T Consensus 468 ~~~~~~~~~~~~~~~~~~ 485 (503) T protein:vir:59 468 AEMQGNLLDDEGGDDDLE 485 (503) T ss_pred HhhhccccCccCCCCCCC Confidence 000000 0 00111 No 157 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=98.26 E-value=2e-06 Score=51.87 Aligned_cols=331 Identities=10% Similarity=0.021 Sum_probs=149.7 Q ss_pred Cc--chhhhc---ccCCCcchhHHHhhccccCc--cee-chhh---h--hhcHHHHHHHHHHHHhhccCceEEEEecCCc Q lcl|NC_019422. 1 MN--IFKSKK---KNKEAPGKVMMELISDSGNG--FYS-WHGN---L--YKSDIVRSIIRPKAKAVGKMTAKHIRSNETE 67 (384) Q Consensus 1 M~--~f~~~~---~~~~~~~~~~~~~~~~~~~~--~~~-~~~~---~--~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 67 (384) |- +.+.+. ......-.....+..+...- ... .... . .-..+...+|+.+|+.+.=-.|. . T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~---~---- 73 (409) T protein:vir:94 1 MTEKGIGYLRFKLSVHKRRAEMRYDQYAMKYVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFREFE---N---- 73 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHhcccCchhhcChhhhHHHHHHHhhhcchhHHHHHHhHhhcccCccc---C---- Confidence 21 111111 00111111112222211110 000 0000 0 11235566666666544322221 1 Q ss_pred ceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEE----EEEE Q lcl|NC_019422. 68 FKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLF----LKFL 143 (384) Q Consensus 68 ~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~----~~~~ 143 (384) .+..++.++.. | +.......+..+.+.+|.+|+.+..++.|.+ .+.+++|..+.+..+..... +.+. T Consensus 74 ----~d~~l~~i~~~-N---~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~-~i~~~sp~~~~~i~D~~~~~~~~a~~~~ 144 (409) T protein:vir:94 74 ----DDFTVNEIFEE-N---NPDIFFDSAVLSSLIASCSFTYISKGENDAV-RLQVIEAVNATGIIDPITGLLTEGYAVL 144 (409) T ss_pred ----CchHHHHHHHh-c---ChhHHHHHHHHHHHHhcceeEEEecCCCCce-EEEEeccceEEEEEecCCCceeeeEEEE Confidence 11223344332 2 2345666788899999999999999988875 57788898888776653211 1111 Q ss_pred Ec--CceE---EEEehhh----------------------eEEEeccCCCCCccCccHH----HHHHHHHHHHHHHHHHH Q lcl|NC_019422. 144 LR--NGKI---VSYPYSD----------------------IIHLRKDFNENDLFGTSPA----KVLEPIMEVVNTTDQGV 192 (384) Q Consensus 144 ~~--~g~~---~~~~~~e----------------------vih~~~~~~~~~~~G~s~~----~~~~~~i~~~~~~~~~~ 192 (384) .. ++.. ..+.+++ |++|.+....++.+|.|.+ ..+.+.+.....-.... T Consensus 145 ~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~ 224 (409) T protein:vir:94 145 ERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVT 224 (409) T ss_pred EecCCCceEEEEEEecCcEEEEEecCceeEeeeCCCCCcceEEeccccccccccCccccchhHHHHHHHHHHHHHHHHHH Confidence 11 1110 0111111 4555444344567888865 33444444433333344 Q ss_pred HHHHHccCCc-ceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecC-----CCceeeecccchhHHHHHHHHHHH Q lcl|NC_019422. 193 VKAIKNSNTI-KWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATD-----SKYDAEQVKAESYVPNAAQMDKAI 266 (384) Q Consensus 193 ~~~~~ng~~p-~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~-----~g~~~~~l~~~~~~~~~~~~~~~~ 266 (384) ..++.+ | ..++.++.+..+ .+ .|+... ++++.++ .+.++.++...+-..-...++.++ T Consensus 225 ~e~~a~---pqr~i~G~d~d~~~--~~----~~~~~~-------~~i~~~~~d~dg~~~~v~q~~~~~l~~~~~~l~~~~ 288 (409) T protein:vir:94 225 AEFYSF---PQKYVTGLSDDAEP--ME----TWKATV-------SSMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAA 288 (409) T ss_pred HHHhcC---hhheeEecCCCCcc--cc----hhhhhH-------HHhhcCCCCCCCCCceEEecCCCChhHHHHHHHHHH Confidence 444433 4 344444332222 12 232211 1222222 335566665444333334567889 Q ss_pred HHHHHHhCCCHHHhccccH----HHHHH----H----------HHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeec Q lcl|NC_019422. 267 QRLYSFFNTNEKIIQSKYS----EDEWN----A----------YYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEA 328 (384) Q Consensus 267 ~~I~~~fgvp~~~l~~~~~----e~~~~----~----------~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~ 328 (384) .++|+.=++|+..+|+... .++.. . .+...+.-+++... ++.... . ........+++.+ T Consensus 289 ~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~-~i~~~~-~-~~~~~~~~~~v~W 365 (409) T protein:vir:94 289 AGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAA-CLRDDA-P-YLREQFRKTKPKW 365 (409) T ss_pred HHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHhCCC-C-ccccccccceEEe Confidence 9999999999999986322 11111 1 11112222221111 111100 0 0111222334444 Q ss_pred hhhhccCH---HHHHHH-HHHHhCC--CCCHHHHHHHhCCCCCC Q lcl|NC_019422. 329 SNLQYASM---STKLNL-VQMVDRG--SLTPNEWRKIMNLSPIE 366 (384) Q Consensus 329 ~~~~~~d~---~~~~~~-~~~~~~g--~~t~NE~R~~lG~~p~~ 366 (384) ..+...+. ...++. .|++..| +..-+-+++++|+..-+ T Consensus 366 ~p~~~~~~~~~a~~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~~d 409 (409) T protein:vir:94 366 EPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIEGGE 409 (409) T ss_pred ccCCCcchHHHHHHHHHHHHHHHhcccccchhHHHHHcCCCCCC Confidence 44433333 333444 3677776 66678899999999776 No 158 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=98.22 E-value=2.4e-06 Score=51.40 Aligned_cols=362 Identities=10% Similarity=0.039 Sum_probs=159.9 Q ss_pred Ccc----------hhhh---cccCCCcchhHHHhhccccCc--ce-ech----hhhhhcHHHHHHHHHHHHhhccCceEE Q lcl|NC_019422. 1 MNI----------FKSK---KKNKEAPGKVMMELISDSGNG--FY-SWH----GNLYKSDIVRSIIRPKAKAVGKMTAKH 60 (384) Q Consensus 1 M~~----------f~~~---~~~~~~~~~~~~~~~~~~~~~--~~-~~~----~~~~~~~~v~~~i~~ia~~ia~~~~~~ 60 (384) +.. ...+ -......-.....+..+-... .. ... .......+...+|+..+..+--..|.+ T Consensus 5 i~~~~~~~~~~~~~~~l~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~ 84 (485) T protein:vir:10 5 LPGQEEIEDPAIARDEMVSAFEDSTQNLKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGYPRLYVDSIAERQAVEGFRF 84 (485) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcchhcCCCCChhhhhhhhhcCcHHHHHHHHHhhhcccceec Confidence 000 0000 000000000011111110000 00 000 001112355677777776553333332 Q ss_pred EEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCc-------eeeEEEEcCceEEEEE Q lcl|NC_019422. 61 IRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNM-------PTQIYPLNALNVEAIY 133 (384) Q Consensus 61 ~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~-------~~~l~~l~~~~v~~~~ 133 (384) .++. ..+..+..++.+ -....+...+..+.+.+|.||+.+.++..+. ...+.+++|..+.+.. T Consensus 85 ---~~~~---~~~~~~~~i~~~----N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~~~ 154 (485) T protein:vir:10 85 ---GDAD---EADEELWQWWQA----NNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRVEPPTRMYAEI 154 (485) T ss_pred ---CCCc---hhHHHHHHHHHh----cCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEEEccceeEEEE Confidence 2111 112223334332 2345677888999999999999998876542 2247778888888776 Q ss_pred cCCCE------EEEEEEcCceE---EEEehh-------------------------heEEEeccCCCCCccCccHHHH-H Q lcl|NC_019422. 134 ENEVL------FLKFLLRNGKI---VSYPYS-------------------------DIIHLRKDFNENDLFGTSPAKV-L 178 (384) Q Consensus 134 ~~~~~------~~~~~~~~g~~---~~~~~~-------------------------evih~~~~~~~~~~~G~s~~~~-~ 178 (384) +.... .+++...++.. ..+.++ -|++|.++....+.+|.|-+.. + T Consensus 155 D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~i~~~v 234 (485) T protein:vir:10 155 DPRIGRVSKAIRVAYDAEGNEIQAATLYTPNDIFGWYRVENEWQEWFNNPHGLGVVPVVPIPNRTRLSDLYGTSEITPEL 234 (485) T ss_pred cCCCCceeEEEEEEEeeCCCeEEEEEEEeCCeEEEEEEcCCceEEeccccCCCCcccEEEeccccccCCCCCccchhHHH Confidence 54321 11111111110 011111 2455554434455678886653 3 Q ss_pred HHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHH--HHHHHHHHHHhccccccCCcceecC-CCceeeecccchh Q lcl|NC_019422. 179 EPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDI--KKEVKSFEKNYLQIDSEAGGAAATD-SKYDAEQVKAESY 255 (384) Q Consensus 179 ~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~--~~~~~~~~~~~~~~~~~~~~~~v~~-~g~~~~~l~~~~~ 255 (384) ...++.............+-.+.|..++. +........ .+....|.. ..+.++.++ ++.++.++...+- T Consensus 235 ~~liDa~~~~~s~~~~~~~~~a~p~~~i~-G~~~~~~~~~~~~~~~~~~~-------~~~~i~~~~~~d~k~~q~~~~~~ 306 (485) T protein:vir:10 235 RSMTDAAARILMLMQATAELMGVPQRLIF-GIKPEEIGVDPETGQTLFDA-------YLARILAFEDAEGKIQQFSAAEL 306 (485) T ss_pred HHHHHHHHHHHHHHHHHHHhhcchHHHHh-cCCcccccccccccchhhhh-------cccceeccCCCCceEEeecccch Confidence 34444443333333333333344544432 111111000 000111111 123444443 5667777765544 Q ss_pred HHHHHHHHHHHHHHHHHhCCCHHHhcccc----HHHHH--------------HHHHHHHHHHHHHHHHHHHhhcccCccc Q lcl|NC_019422. 256 VPNAAQMDKAIQRLYSFFNTNEKIIQSKY----SEDEW--------------NAYYESEIEPVGLQLSNQYTEKLFTRKA 317 (384) Q Consensus 256 ~~~~~~~~~~~~~I~~~fgvp~~~l~~~~----~e~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~ 317 (384) +.....++..+.+|+..=++|+..+|+.. +..+. ...+...+..+++.+.. +.. .... T Consensus 307 ~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~-~~~---~~~~ 382 (485) T protein:vir:10 307 ANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYR-MMK---GGDV 382 (485) T ss_pred HHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HhC---CCCC Confidence 44445567889999999999999987542 11111 11222333333332221 111 1111 Q ss_pred ccCcceEEeechhhhccCHHHHHHHH-HHHhCC--CCCHHHHHHHhCCCCCC--CC---------------CeeeecCce Q lcl|NC_019422. 318 RSFGNEIVFEASNLQYASMSTKLNLV-QMVDRG--SLTPNEWRKIMNLSPIE--NG---------------DKPVRRLDT 377 (384) Q Consensus 318 ~~~~~~i~fd~~~~~~~d~~~~~~~~-~~~~~g--~~t~NE~R~~lG~~p~~--~g---------------d~~~~~~n~ 377 (384) ......+++.+......+..+.++.+ +++..| +++-.-+++.+|+.+.+ .. |.+..+.+. T Consensus 383 ~~~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~~s~et~~~~lg~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~ 462 (485) T protein:vir:10 383 PPDMLRMETVWRDPSTPTYAAKADAASKLYNGGTGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLIGTMVDPNPT 462 (485) T ss_pred cccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCCHhHHHHHHHHHHHHHHHHHHHHHHhhccCCC Confidence 11223556666666677788887765 566644 88888888888887642 10 011111111 Q ss_pred ee--------------cCCCC Q lcl|NC_019422. 378 AV--------------VEGGE 384 (384) Q Consensus 378 ~~--------------~~~ge 384 (384) .+ -.+|+ T Consensus 463 ~~~~~~~~~~~~~~~~~~~~~ 483 (485) T protein:vir:10 463 VPGSPSPAPAPKPAALESGGD 483 (485) T ss_pred CCCCCCccccccCcCCCCCCC Confidence 10 01111 No 159 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=98.22 E-value=2.5e-06 Score=51.34 Aligned_cols=347 Identities=11% Similarity=0.078 Sum_probs=150.0 Q ss_pred Ccc-----hhhhcccCCCcchhHHHhhccccCc--cee-chh---hhhh--cHHHHHHHHHHHHhhccCceEEEEecCCc Q lcl|NC_019422. 1 MNI-----FKSKKKNKEAPGKVMMELISDSGNG--FYS-WHG---NLYK--SDIVRSIIRPKAKAVGKMTAKHIRSNETE 67 (384) Q Consensus 1 M~~-----f~~~~~~~~~~~~~~~~~~~~~~~~--~~~-~~~---~~~~--~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 67 (384) |-. +-++-......-.....+..+.... ... ... ..++ ..+...+|+.+|+.+.=-.| +.. T Consensus 1 m~~~~i~~L~~~~~~~~~r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vd~~a~rl~~~Gf---~~~--- 74 (422) T protein:vir:97 1 MNYMGMGYLRRKLALFKTGVDKRYRYYAMDDRDDTRSIVMPNNVREMYRSVLEWTAKGVDSLADRIIFREF---TND--- 74 (422) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhcCCChhhcCccccHHHHHHHHhhcchhHHHHHHHHhcccccee---eCC--- Confidence 222 1111111111111122222221110 000 001 1111 13445555555553321122 111 Q ss_pred ceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCC-CCceeeEEEEcCceEEEEEcCCCEE----E-E Q lcl|NC_019422. 68 FKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDD-YNMPTQIYPLNALNVEAIYENEVLF----L-K 141 (384) Q Consensus 68 ~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~-~g~~~~l~~l~~~~v~~~~~~~~~~----~-~ 141 (384) +..++.++.. |. .......+..+.+.+|.||+.+..+. .|.+ .+.+++|..+....+..... + . T Consensus 75 -----d~~l~~~w~~-N~---ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p-~i~~~sp~~~~~i~D~~~~~~~~a~~~ 144 (422) T protein:vir:97 75 -----DFNAWEIFKA-NN---PDIFFDTAIQSALIASCCFVYIMPGAEDGLP-KMQVIEASKATGILDPTTFLLTEGYAI 144 (422) T ss_pred -----chhHHHHHHh-cC---hHHHHHHHHHHHHHhcceeEEEeeCCCCCee-EEEEechhhEEEEEeCCCCcceeeEEE Confidence 1123334333 32 34455577889999999999999874 5654 58888999998877654211 0 1 Q ss_pred EE-EcCceE--E-EEehh---------------------heEEEeccCCCCCccCccHH-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 142 FL-LRNGKI--V-SYPYS---------------------DIIHLRKDFNENDLFGTSPA-KVLEPIMEVVNTTDQGVVKA 195 (384) Q Consensus 142 ~~-~~~g~~--~-~~~~~---------------------evih~~~~~~~~~~~G~s~~-~~~~~~i~~~~~~~~~~~~~ 195 (384) +. ..+|.. . .++.. =|++|.+.....+.+|.|.+ ..+...++............ T Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~ 224 (422) T protein:vir:97 145 LESDSNGNPTLEAYFTDKDIWYYPKKGKPYNIKNPTGHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAAKRTLERAEVT 224 (422) T ss_pred EEecCCCcEEEEEEEcCceEEEEcCCCccccccCCCCCcceEEecccCCCccccCccccchhHHHHHHHHHHHHHHHHHH Confidence 11 111211 0 01111 14555544445567888865 34444444443333222222 Q ss_pred HHccCCcc-eEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecC-----CCceeeecccchhHHHHHHHHHHHHHH Q lcl|NC_019422. 196 IKNSNTIK-WLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATD-----SKYDAEQVKAESYVPNAAQMDKAIQRL 269 (384) Q Consensus 196 ~~ng~~p~-~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~-----~g~~~~~l~~~~~~~~~~~~~~~~~~I 269 (384) .+=.+.|. .++.++....+ .+.|+... ++++.++ ++.++.++...+-..-...++.++..+ T Consensus 225 ~e~~a~pqr~i~G~d~d~~~------~~~~~~~~-------~~i~~~~~de~~~~~~v~q~~~~~l~~~~~~l~~~~~~~ 291 (422) T protein:vir:97 225 AEFYSFPQKYVLGMDPDAKP------MEKWRATV-------STLLEISKDEDGDKPTVGQFTTASMAPFMEHLKMYASLF 291 (422) T ss_pred HHHhcchhhhhcccCccccc------Cchhhhhh-------hhhhccCCCCCCCcceeeecCCCChhHHHHHHHHHHHHH Confidence 22223343 33333322211 11232221 1333332 234666665554443345668899999 Q ss_pred HHHhCCCHHHhccccH----HHHHH----H----------HHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhh Q lcl|NC_019422. 270 YSFFNTNEKIIQSKYS----EDEWN----A----------YYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNL 331 (384) Q Consensus 270 ~~~fgvp~~~l~~~~~----e~~~~----~----------~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~ 331 (384) ++.=++|+..+|+... .++.. . .+...+.-+++.+. ++....- ...+....+++.+... T Consensus 292 a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~-~~~~~~~--~~~~~~~~~~~~w~p~ 368 (422) T protein:vir:97 292 AGGSGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQRSFSSGFLNVAYIAV-CLRDEFP--YLRNQFMDTVIKWEPL 368 (422) T ss_pred hcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHhcCCc--ccchhhccceEEEccC Confidence 9999999999987432 11111 1 11122222222211 1111110 0111222233444444 Q ss_pred hccCHH---HHHHH-HHHHhC--CCCCHHHHHHHhCCCCCCCCCeeeecCceeecCC Q lcl|NC_019422. 332 QYASMS---TKLNL-VQMVDR--GSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVEG 382 (384) Q Consensus 332 ~~~d~~---~~~~~-~~~~~~--g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~~ 382 (384) ...+.. ..++. .|++.. |+++-+-+++++|++..+ .-.....-+.+++ T Consensus 369 ~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~~~~~---~~~~~~~~~~~d~ 422 (422) T protein:vir:97 369 FEADANMLTLVGDGAIKLNQAIPGFMDADVIRDLTGVKGAD---KPIPAITEVTTDG 422 (422) T ss_pred CCCChHHHHHHHHHHHHHHhhccccccHHHHHHHcCCCchh---HHHHHHHhhhccC Confidence 334433 33443 355555 788889999999996532 1111111112223 No 160 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=98.17 E-value=3.2e-06 Score=50.70 Aligned_cols=364 Identities=10% Similarity=0.057 Sum_probs=167.4 Q ss_pred CcchhhhcccCCCc--------------------chhH-------HHhhccccCc--ceec-----hhhhhhcHHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAP--------------------GKVM-------MELISDSGNG--FYSW-----HGNLYKSDIVRSII 46 (384) Q Consensus 1 M~~f~~~~~~~~~~--------------------~~~~-------~~~~~~~~~~--~~~~-----~~~~~~~~~v~~~i 46 (384) ||+|.+.|...... .+.. ..+..+-... +... ..+..+.+....++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 99999754331110 0000 0111110000 0000 11222334556666 Q ss_pred HHHHHhhccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcC Q lcl|NC_019422. 47 RPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNA 126 (384) Q Consensus 47 ~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~ 126 (384) +..|+.+..-+-.+ ..+ +......+.+--..-.....++..+.+.+..|.+++.+..+.. . +.+-.+++ T Consensus 81 ~~~A~lv~~e~~~i-~~~--------d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~-~-~~I~~v~a 149 (500) T protein:vir:98 81 KKIASLVFNEQAEI-KVD--------DDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDGD-K-VRVAFVQA 149 (500) T ss_pred HHHhhhhcCCcceE-ecC--------ChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCC-c-eEEEEEcC Confidence 67776665433332 111 1122222222222233455667777888889999998888743 3 33555666 Q ss_pred ceEEEEEc-CCC------------------EEE---EEE-EcCce-------------------EEEE---e---hhh-- Q lcl|NC_019422. 127 LNVEAIYE-NEV------------------LFL---KFL-LRNGK-------------------IVSY---P---YSD-- 156 (384) Q Consensus 127 ~~v~~~~~-~~~------------------~~~---~~~-~~~g~-------------------~~~~---~---~~e-- 156 (384) ..+.+... .++ .+| +++ ..+|. .+.+ . +.+ T Consensus 150 d~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~ 229 (500) T protein:vir:98 150 PVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAK 229 (500) T ss_pred CeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceE Confidence 66655322 111 111 011 11111 0000 0 001 Q ss_pred --------eEEEeccCC----CCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEE-----eeCCCC-ChHHHH Q lcl|NC_019422. 157 --------IIHLRKDFN----ENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLL-----KFKTAL-RPDDIK 218 (384) Q Consensus 157 --------vih~~~~~~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il-----~~~~~~-~~e~~~ 218 (384) ..|++.+.+ .+...|+|.+..+...++.++.......+-++.|.. ..++ ...... +.+... T Consensus 230 ~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~-~i~v~~~~l~~~~~~~~g~~~~ 308 (500) T protein:vir:98 230 VTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQR-RVAVPESLTALTVRTTDGDVVP 308 (500) T ss_pred eccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcc-eeeechHHhcccCCCCCccccC Confidence 234443322 134579999999999999998888777788876543 3333 111111 000000 Q ss_pred HHHHH-HHHHhccccccCCcceecCCCceeeecccchhHHHH-HHHHHHHHHHHHHhCCCHHHhcccc-----HHH---- Q lcl|NC_019422. 219 KEVKS-FEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNA-AQMDKAIQRLYSFFNTNEKIIQSKY-----SED---- 287 (384) Q Consensus 219 ~~~~~-~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~-~~~~~~~~~I~~~fgvp~~~l~~~~-----~e~---- 287 (384) ...-. -+..|..... -.+++..++.++....+.+. ..++...++|+...|+++..++.+. +.+ T Consensus 309 ~~~~d~~~~~~~~~~~------~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~ 382 (500) T protein:vir:98 309 RPRFESDQNVYIRMGG------RDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSE 382 (500) T ss_pred CcccCCCcceEEEcCC------CCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHH Confidence 00000 0001111000 01233456777666555554 4457788899999999999886321 111 Q ss_pred ---------HHHHHHHHHHHHHHHHHHHHHhh-cccCcccccCcceEEeechhhhccCHHHHH-HHHHHHhCCCCCHHHH Q lcl|NC_019422. 288 ---------EWNAYYESEIEPVGLQLSNQYTE-KLFTRKARSFGNEIVFEASNLQYASMSTKL-NLVQMVDRGSLTPNEW 356 (384) Q Consensus 288 ---------~~~~~~~~~i~P~~~~i~~~l~~-~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~-~~~~~~~~g~~t~NE~ 356 (384) .....++.++..++..+.+..+. .+.. ........+.+++++-...|.++.. ...+++..|+|+.-++ T Consensus 383 ~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~-~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~ 461 (500) T protein:vir:98 383 NSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQ-SEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMA 461 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC-CCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHH Confidence 11123445555555555433221 1111 1122334566777765556655554 4457888999999998 Q ss_pred HHH-hCCCCCCCCCeeee-------cCceeecC----CCC Q lcl|NC_019422. 357 RKI-MNLSPIENGDKPVR-------RLDTAVVE----GGE 384 (384) Q Consensus 357 R~~-lG~~p~~~gd~~~~-------~~n~~~~~----~ge 384 (384) +.+ .|++.-+ .++.+. +..-.+-+ -|| T Consensus 462 i~~~~g~~eee-a~~~l~~i~~E~~~~~~~~~~~~~~~g~ 500 (500) T protein:vir:98 462 IQKVLNVTEEK-AQEIAAEINTGIVDEINQQRTDTHLYGE 500 (500) T ss_pred HHhcCCCCHHH-HHHHHHHHHHhccccCCCCCccccccCC Confidence 754 4665321 111110 11101111 122 No 161 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=98.17 E-value=3.2e-06 Score=50.70 Aligned_cols=364 Identities=10% Similarity=0.057 Sum_probs=167.4 Q ss_pred CcchhhhcccCCCc--------------------chhH-------HHhhccccCc--ceec-----hhhhhhcHHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAP--------------------GKVM-------MELISDSGNG--FYSW-----HGNLYKSDIVRSII 46 (384) Q Consensus 1 M~~f~~~~~~~~~~--------------------~~~~-------~~~~~~~~~~--~~~~-----~~~~~~~~~v~~~i 46 (384) ||+|.+.|...... .+.. ..+..+-... +... ..+..+.+....++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 99999754331110 0000 0111110000 0000 11222334556666 Q ss_pred HHHHHhhccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcC Q lcl|NC_019422. 47 RPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNA 126 (384) Q Consensus 47 ~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~ 126 (384) +..|+.+..-+-.+ ..+ +......+.+--..-.....++..+.+.+..|.+++.+..+.. . +.+-.+++ T Consensus 81 ~~~A~lv~~e~~~i-~~~--------d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~-~-~~I~~v~a 149 (500) T protein:vir:30 81 KKIASLVFNEQAEI-KVD--------DDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDGD-K-VRVAFVQA 149 (500) T ss_pred HHHhhhhcCCcceE-ecC--------ChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCC-c-eEEEEEcC Confidence 67776665433332 111 1122222222222233455667777888889999998888743 3 33555666 Q ss_pred ceEEEEEc-CCC------------------EEE---EEE-EcCce-------------------EEEE---e---hhh-- Q lcl|NC_019422. 127 LNVEAIYE-NEV------------------LFL---KFL-LRNGK-------------------IVSY---P---YSD-- 156 (384) Q Consensus 127 ~~v~~~~~-~~~------------------~~~---~~~-~~~g~-------------------~~~~---~---~~e-- 156 (384) ..+.+... .++ .+| +++ ..+|. .+.+ . +.+ T Consensus 150 d~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~ 229 (500) T protein:vir:30 150 PVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAK 229 (500) T ss_pred CeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceE Confidence 66655322 111 111 011 11111 0000 0 001 Q ss_pred --------eEEEeccCC----CCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEE-----eeCCCC-ChHHHH Q lcl|NC_019422. 157 --------IIHLRKDFN----ENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLL-----KFKTAL-RPDDIK 218 (384) Q Consensus 157 --------vih~~~~~~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il-----~~~~~~-~~e~~~ 218 (384) ..|++.+.+ .+...|+|.+..+...++.++.......+-++.|.. ..++ ...... +.+... T Consensus 230 ~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~-~i~v~~~~l~~~~~~~~g~~~~ 308 (500) T protein:vir:30 230 VTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQR-RVAVPESLTALTVRTTDGDVVP 308 (500) T ss_pred eccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcc-eeeechHHhcccCCCCCccccC Confidence 234443322 134579999999999999998888777788876543 3333 111111 000000 Q ss_pred HHHHH-HHHHhccccccCCcceecCCCceeeecccchhHHHH-HHHHHHHHHHHHHhCCCHHHhcccc-----HHH---- Q lcl|NC_019422. 219 KEVKS-FEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNA-AQMDKAIQRLYSFFNTNEKIIQSKY-----SED---- 287 (384) Q Consensus 219 ~~~~~-~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~-~~~~~~~~~I~~~fgvp~~~l~~~~-----~e~---- 287 (384) ...-. -+..|..... -.+++..++.++....+.+. ..++...++|+...|+++..++.+. +.+ T Consensus 309 ~~~~d~~~~~~~~~~~------~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~ 382 (500) T protein:vir:30 309 RPRFESDQNVYIRMGG------RDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSE 382 (500) T ss_pred CcccCCCcceEEEcCC------CCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHH Confidence 00000 0001111000 01233456777666555554 4457788899999999999886321 111 Q ss_pred ---------HHHHHHHHHHHHHHHHHHHHHhh-cccCcccccCcceEEeechhhhccCHHHHH-HHHHHHhCCCCCHHHH Q lcl|NC_019422. 288 ---------EWNAYYESEIEPVGLQLSNQYTE-KLFTRKARSFGNEIVFEASNLQYASMSTKL-NLVQMVDRGSLTPNEW 356 (384) Q Consensus 288 ---------~~~~~~~~~i~P~~~~i~~~l~~-~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~-~~~~~~~~g~~t~NE~ 356 (384) .....++.++..++..+.+..+. .+.. ........+.+++++-...|.++.. ...+++..|+|+.-++ T Consensus 383 ~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~-~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~ 461 (500) T protein:vir:30 383 NSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQ-SEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMA 461 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC-CCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHH Confidence 11123445555555555433221 1111 1122334566777765556655554 4457888999999998 Q ss_pred HHH-hCCCCCCCCCeeee-------cCceeecC----CCC Q lcl|NC_019422. 357 RKI-MNLSPIENGDKPVR-------RLDTAVVE----GGE 384 (384) Q Consensus 357 R~~-lG~~p~~~gd~~~~-------~~n~~~~~----~ge 384 (384) +.+ .|++.-+ .++.+. +..-.+-+ -|| T Consensus 462 i~~~~g~~eee-a~~~l~~i~~E~~~~~~~~~~~~~~~g~ 500 (500) T protein:vir:30 462 IQKVLNVTEEK-AQEIAAEINTGIVDEINQQRTDTHLYGE 500 (500) T ss_pred HHhcCCCCHHH-HHHHHHHHHHhccccCCCCCccccccCC Confidence 754 4665321 111110 11101111 122 No 162 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=98.15 E-value=3.6e-06 Score=50.46 Aligned_cols=362 Identities=10% Similarity=0.047 Sum_probs=157.6 Q ss_pred Ccc---------hhhhccc---CCCcchhHHHhhccccCc--cee-chhh----hhhcHHHHHHHHHHHHhhccCceEEE Q lcl|NC_019422. 1 MNI---------FKSKKKN---KEAPGKVMMELISDSGNG--FYS-WHGN----LYKSDIVRSIIRPKAKAVGKMTAKHI 61 (384) Q Consensus 1 M~~---------f~~~~~~---~~~~~~~~~~~~~~~~~~--~~~-~~~~----~~~~~~v~~~i~~ia~~ia~~~~~~~ 61 (384) +++ ....... +...-.....+..+-... ... .... .....+...+|+..+..+--..|.+ T Consensus 6 ~~~~e~~~~~~~~~~l~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~~~~~~~~~~v~n~~~~iVd~~~~~l~~~g~~~- 84 (486) T protein:vir:42 6 PGMEEIEDPAVVREEMISAFEDASKDLASNTSYYDAERRPEAIGVTVPREMQQLLAHVGYPRLYVDSVAERQAVEGFRL- 84 (486) T ss_pred CCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcccccchhHhhhhhccchHHHHHHHHHhhhcccceec- Confidence 111 0000000 000000011111111000 000 0000 0123456677777776654344432 Q ss_pred EecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCce-------eeEEEEcCceEEEEEc Q lcl|NC_019422. 62 RSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMP-------TQIYPLNALNVEAIYE 134 (384) Q Consensus 62 ~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~-------~~l~~l~~~~v~~~~~ 134 (384) ++ ....+..+..++.. | ........+..+++.+|.||+.+.++..|.. ..+.+++|..+.+..+ T Consensus 85 --~~---~~~~~~~~~~i~~~-N---~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~i~d 155 (486) T protein:vir:42 85 --GD---ADEADEELWQWWQA-N---NLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVEPPTRMHAEID 155 (486) T ss_pred --CC---CchhHHHHHHHHHh-c---ChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEecccceEEEEe Confidence 21 11122333444432 3 2345667888999999999999988764432 3566788888877765 Q ss_pred CCC-EE--------------------------EEEEEcCceEEEE-----ehh--heEEEeccCCCCCccCccHHHH-HH Q lcl|NC_019422. 135 NEV-LF--------------------------LKFLLRNGKIVSY-----PYS--DIIHLRKDFNENDLFGTSPAKV-LE 179 (384) Q Consensus 135 ~~~-~~--------------------------~~~~~~~g~~~~~-----~~~--evih~~~~~~~~~~~G~s~~~~-~~ 179 (384) ... .. ++|...+|..... +.. -|++|.++....+.+|.|-+.. +. T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~ 235 (486) T protein:vir:42 156 PRINRVSKAIRVAYDKEGNEIQAATLYTPMETIGWFRADGEWAEWFNVPHGLGVVPVVPLPNRTRLSDLYGTSEITPELR 235 (486) T ss_pred CCCCCeEEEEEEEEecCCCeEEEEEEEcCCcEEEEEecCCcEEeecceecCCCCceEEEeccccccCCCCCcccchhhHH Confidence 321 00 1111111111110 000 1455554434455678886653 33 Q ss_pred HHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHH--HHHHHHHHHHhccccccCCcceecC-CCceeeecccchhH Q lcl|NC_019422. 180 PIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDI--KKEVKSFEKNYLQIDSEAGGAAATD-SKYDAEQVKAESYV 256 (384) Q Consensus 180 ~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~--~~~~~~~~~~~~~~~~~~~~~~v~~-~g~~~~~l~~~~~~ 256 (384) ..++.............+-.+.|..++. +........ .+....|.. ..+.++.++ ++.++.++.....+ T Consensus 236 ~liDa~~~~~s~~~~~~e~~a~p~~~i~-G~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~q~~~~~~e 307 (486) T protein:vir:42 236 SMTDAAARILMLMQATAELMGVPQRLIF-GIKPEEIGVDSETGQTLFDA-------YLARILAFEDAEGKIQQFSAAELA 307 (486) T ss_pred HHHHHHHHHHHHHHHHHHhhcchHHHhh-cCCccccccccccccchhhh-------hhchhcccCCCCceEEeecccCHH Confidence 3344443333333333333344544432 111111000 011111111 123444443 55677676655444 Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHhcccc----HHHHH--------------HHHHHHHHHHHHHHHHHHHhhcccCcccc Q lcl|NC_019422. 257 PNAAQMDKAIQRLYSFFNTNEKIIQSKY----SEDEW--------------NAYYESEIEPVGLQLSNQYTEKLFTRKAR 318 (384) Q Consensus 257 ~~~~~~~~~~~~I~~~fgvp~~~l~~~~----~e~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~ 318 (384) .....++..+.+++..=++|+..+|++. +..+. ...+...+.-+++.+....+. .... T Consensus 308 ~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~----~~~~ 383 (486) T protein:vir:42 308 NFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNLMFGGAWEEAMRIAYRIMKG----GDVP 383 (486) T ss_pred HHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC----CCcc Confidence 4455667888999999999999988652 22111 112233333333322221111 1111 Q ss_pred cCcceEEeechhhhccCHHHHHHHH-HHHhC--CCCCHHHHHHHhCCCCCCC--CCe--------eeecCc-e------- Q lcl|NC_019422. 319 SFGNEIVFEASNLQYASMSTKLNLV-QMVDR--GSLTPNEWRKIMNLSPIEN--GDK--------PVRRLD-T------- 377 (384) Q Consensus 319 ~~~~~i~fd~~~~~~~d~~~~~~~~-~~~~~--g~~t~NE~R~~lG~~p~~~--gd~--------~~~~~n-~------- 377 (384) .....+++.+......+..+.++.+ ++++. |+++-.-+++.+|+.+.+- ... .....+ + T Consensus 384 ~d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~~s~et~~~~lg~~~d~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~ 463 (486) T protein:vir:42 384 PDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARIDMGYSVKEREEMRRWDEEEAAMGLGLLGTMVDADPTV 463 (486) T ss_pred ccceeeeEEecCCCCCCHHHHHHHHHHHHhcccCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCCC Confidence 1123455555665667777777765 56554 6788777788888755421 000 000000 0 Q ss_pred --ee-------------cCCCC Q lcl|NC_019422. 378 --AV-------------VEGGE 384 (384) Q Consensus 378 --~~-------------~~~ge 384 (384) .+ ..+|+ T Consensus 464 ~~~~~~~~~~~~~~~~~~~~~~ 485 (486) T protein:vir:42 464 PGSPSPTAPPKPQPAIESSGGD 485 (486) T ss_pred CCCCCCCCCCCCCcccCCCCCC Confidence 00 01111 No 163 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=98.07 E-value=5.4e-06 Score=49.45 Aligned_cols=354 Identities=9% Similarity=0.021 Sum_probs=159.4 Q ss_pred CcchhhhcccCC---CcchhHHHhhccccCc----ceec---hhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_019422. 1 MNIFKSKKKNKE---APGKVMMELISDSGNG----FYSW---HGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKT 70 (384) Q Consensus 1 M~~f~~~~~~~~---~~~~~~~~~~~~~~~~----~~~~---~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~ 70 (384) --+..++..... ..-.....+..+.... .... ...-....+...+|+..+..+--..|. .. + T Consensus 6 ~~~i~~l~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~g~~---~~-d---- 77 (441) T protein:vir:80 6 LALIEGMYDRIQRLSSWHCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVDALEERLDWLGWT---NG-D---- 77 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHHHHHhhhcccccc---CC-C---- Confidence 000110000000 0000011111110000 0000 011112335556666665554211221 11 1 Q ss_pred ccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCE------EEEEEE Q lcl|NC_019422. 71 NPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVL------FLKFLL 144 (384) Q Consensus 71 ~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~------~~~~~~ 144 (384) +.....++.. | +.......+..+.+.+|.||+.+..+..|.+ .+.+++|..+.++.+.... .+++.. T Consensus 78 --~~~l~~i~~~-n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~ 150 (441) T protein:vir:80 78 --GYGLDGVYAA-N---RLATASCDVHLDALIFGLSFVAIIPHGDGTV-SVRPQSPKNCTGKFSADGSRLDAGLVVQQTC 150 (441) T ss_pred --hHHHHHHHHh-c---CHHHHHHHHHHHHhhcCeeEEEEEeCCCCce-EEEEEccceEEEEEeCCCCceeEEEEEEEEe Confidence 1123333322 2 4567888889999999999999999988876 4788999998887664321 011111 Q ss_pred cC----------ceEEEE-----------eh-----h--heEEEeccCCCCCccCccHHHH-HHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 145 RN----------GKIVSY-----------PY-----S--DIIHLRKDFNENDLFGTSPAKV-LEPIMEVVNTTDQGVVKA 195 (384) Q Consensus 145 ~~----------g~~~~~-----------~~-----~--evih~~~~~~~~~~~G~s~~~~-~~~~i~~~~~~~~~~~~~ 195 (384) .+ +..+.+ .. . -|+|+.+.......+|.|.+.. +...++............ T Consensus 151 ~~~~~~~~vy~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa~~~~~s~~~~~ 230 (441) T protein:vir:80 151 DPEVVEAELLLPDVIVQVERRGSREWVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDEAVRTLLGQSVN 230 (441) T ss_pred cCceEEEEEEecCeEEEEEEcCCcceeeccccccCCCceeEEEeeccccCCccCCcccchhhHHHHHHHHHHHHHHHHHH Confidence 11 111110 00 0 2566665544566688886643 444455444444444444 Q ss_pred HHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCC-----CceeeecccchhHHHHHHHHHHHHHHH Q lcl|NC_019422. 196 IKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDS-----KYDAEQVKAESYVPNAAQMDKAIQRLY 270 (384) Q Consensus 196 ~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~-----g~~~~~l~~~~~~~~~~~~~~~~~~I~ 270 (384) .+-.+.|..++. +...+....+ .+.. ..++++.++. +.++.++.....+.....++..+..|+ T Consensus 231 ~~~~~~~~~~i~-G~~~~~~~~~----~~~~-------~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~ 298 (441) T protein:vir:80 231 RDFYAYPQRWVT-GVSADEFSQP----GWVL-------SMASVWAVDKDDDGDTPNVGSFPVNSPTPYSDQMRLLAQLTA 298 (441) T ss_pred HHhhcCceeeee-cCCccccccc----hhhh-------cccccccCCCCCCCCcceeEecCccchHHHHHHHHHHHHHHh Confidence 444455644443 2222222111 1111 1123333332 234544444333333455678889999 Q ss_pred HHhCCCHHHhcccc----HHHHHH--------------HHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhh Q lcl|NC_019422. 271 SFFNTNEKIIQSKY----SEDEWN--------------AYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQ 332 (384) Q Consensus 271 ~~fgvp~~~l~~~~----~e~~~~--------------~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~ 332 (384) ..-++|+..+|++. +..+.+ ..+...+.-+++.+...++...- .......+++.+.... T Consensus 299 ~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~---~~~~~~~i~~~f~~~~ 375 (441) T protein:vir:80 299 GEAAVPERYFGFITSNPPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKALDSRVD---EADFFGDVGLRWRDAS 375 (441) T ss_pred cccCCCHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc---ccccceeeeEEeCCCC Confidence 99999999887642 221111 12223333333333333322111 1112234566666666 Q ss_pred ccCHHHHHHHH-HHHhCCCC--CHHHHHHHhCCCCCCC---------CCeeeecCce-eecCCCC Q lcl|NC_019422. 333 YASMSTKLNLV-QMVDRGSL--TPNEWRKIMNLSPIEN---------GDKPVRRLDT-AVVEGGE 384 (384) Q Consensus 333 ~~d~~~~~~~~-~~~~~g~~--t~NE~R~~lG~~p~~~---------gd~~~~~~n~-~~~~~ge 384 (384) ..+..+.++.+ +++..|++ +-.-+++.+|+.+.|- ....+...+- ..-+..| T Consensus 376 ~~~~~e~ad~~~kl~~~g~~~~s~~~~~~~l~~~~~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~ 440 (441) T protein:vir:80 376 TPTRAATADAVTKLVGAGILPADSRTVLEMLGLDDVQVEAVMRHRAESSDPLAVLAGAISRQTNE 440 (441) T ss_pred CcCHHHHHHHHHHHHhcCcccccHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 77888888765 57777765 3445777888765421 0001111010 0011111 No 164 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=98.01 E-value=7.1e-06 Score=48.82 Aligned_cols=332 Identities=10% Similarity=0.012 Sum_probs=150.2 Q ss_pred Ccc--hhhhccc---CCCcchhHHHhhccccCc--cee-chh---hh--hhcHHHHHHHHHHHHhhccCceEEEEecCCc Q lcl|NC_019422. 1 MNI--FKSKKKN---KEAPGKVMMELISDSGNG--FYS-WHG---NL--YKSDIVRSIIRPKAKAVGKMTAKHIRSNETE 67 (384) Q Consensus 1 M~~--f~~~~~~---~~~~~~~~~~~~~~~~~~--~~~-~~~---~~--~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 67 (384) |-. .+.+... ....-.....+..+...- ... ... .. .-..+..-+|+.+|+.+.=-.|. . T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~---~---- 73 (409) T protein:vir:16 1 MTEKGIGYLRFKLSVHKRRAEMRYEQYAMKHVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFREFE---N---- 73 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHHhccCchhhcchhhhHHHHHHHhhhcChhHHHHHHhHhhccccccc---C---- Confidence 211 1111100 001111111111111100 000 000 00 11235556666666544322221 1 Q ss_pred ceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEE----EEEE Q lcl|NC_019422. 68 FKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLF----LKFL 143 (384) Q Consensus 68 ~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~----~~~~ 143 (384) .+..++.++.+ -+.......+..+.+.+|.+|+.+..++.|.+ .+.+++|..+....+..... +.+. T Consensus 74 ----~d~~l~~i~~~----N~ld~~~~~~~~~al~yG~sf~~v~~~~dg~~-~i~~~sP~~~~~i~D~~~~~~~~a~~~~ 144 (409) T protein:vir:16 74 ----DDFTVNEIFEE----NNPDIFFDSTVLSALIASCSFTYISKGENDAV-RLQVIEATNATGIIDPITGLLTEGYAVL 144 (409) T ss_pred ----cchHHHHHHHh----cChhHHHHHHHHHHHHhCceeEEEecCCCCce-EEEEEcccceEEEeecccccceeeeEEE Confidence 11223344432 23445666788899999999999999888864 57788888887766543211 1111 Q ss_pred --EcCceE---EEEehhh----------------------eEEEeccCCCCCccCccHH----HHHHHHHHHHHHHHHHH Q lcl|NC_019422. 144 --LRNGKI---VSYPYSD----------------------IIHLRKDFNENDLFGTSPA----KVLEPIMEVVNTTDQGV 192 (384) Q Consensus 144 --~~~g~~---~~~~~~e----------------------vih~~~~~~~~~~~G~s~~----~~~~~~i~~~~~~~~~~ 192 (384) ...|.. ..+.+++ |++|.+....++.+|.|-+ ..+.+.+.....-.... T Consensus 145 ~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~~r~~~~~~~~ 224 (409) T protein:vir:16 145 ERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVT 224 (409) T ss_pred EecCCCceEEEEEEecCcEEEEEecCccccceecCCCCcceEEecccccccccCCccccchhHHHHHHHHHHHHHHHHHH Confidence 111110 0111111 4555444344566888854 44445554444444445 Q ss_pred HHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecC-----CCceeeecccchhHHHHHHHHHHHH Q lcl|NC_019422. 193 VKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATD-----SKYDAEQVKAESYVPNAAQMDKAIQ 267 (384) Q Consensus 193 ~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~-----~g~~~~~l~~~~~~~~~~~~~~~~~ 267 (384) ..++.+ .-..++.++.+..+. + .|+... ++++.++ .+.++.+++..+-..-...++.++. T Consensus 225 ~e~~a~--pqr~i~G~d~d~~~~--~----~~~~~~-------~~i~~~~~d~~g~~~~v~q~~~~~l~~~~~~l~~~~~ 289 (409) T protein:vir:16 225 AEFYSF--PQKYVTGLSDDAEPM--E----TWKATV-------SSMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAA 289 (409) T ss_pred HHHhcC--hhheeEecCCCCCcc--c----hhhhhh-------hHhhccCCCCCCCCceEEecCCCChhHHHHHHHHHHH Confidence 555544 223444444332221 1 232211 1233332 3356666655443333455678899 Q ss_pred HHHHHhCCCHHHhccccHH----HHHH----H----------HHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeech Q lcl|NC_019422. 268 RLYSFFNTNEKIIQSKYSE----DEWN----A----------YYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEAS 329 (384) Q Consensus 268 ~I~~~fgvp~~~l~~~~~e----~~~~----~----------~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~ 329 (384) ++|+.=++|+..+|+...+ ++.. . .+...++-+++......+. .+ ........+++.+. T Consensus 290 ~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~--~~-~~~~~~~~~~v~W~ 366 (409) T protein:vir:16 290 GFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDD--VP-YLREQFSKTKPKWE 366 (409) T ss_pred HHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CC-ccchhhccceEEec Confidence 9999999999999965321 1111 1 1111112111111111111 00 01111123333344 Q ss_pred hhhcc---CHHHHHHHH-HHHhCC--CCCHHHHHHHhCCCCCC Q lcl|NC_019422. 330 NLQYA---SMSTKLNLV-QMVDRG--SLTPNEWRKIMNLSPIE 366 (384) Q Consensus 330 ~~~~~---d~~~~~~~~-~~~~~g--~~t~NE~R~~lG~~p~~ 366 (384) +.... +....++.+ |++..| +..-+-+++++|++.-+ T Consensus 367 ~~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~v~~~~~g~~~~d 409 (409) T protein:vir:16 367 PLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIKGAE 409 (409) T ss_pred CCCCcchhhHHHHHHHHHHHHhhcccccchhHHHHhccCCCCC Confidence 43322 345555554 566665 44457779999998766 No 165 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=97.92 E-value=1.1e-05 Score=47.78 Aligned_cols=364 Identities=11% Similarity=0.088 Sum_probs=169.7 Q ss_pred CcchhhhcccCCC------c---------------chh-------HHHhhccccCcc---eech----hhhhhcHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEA------P---------------GKV-------MMELISDSGNGF---YSWH----GNLYKSDIVRSI 45 (384) Q Consensus 1 M~~f~~~~~~~~~------~---------------~~~-------~~~~~~~~~~~~---~~~~----~~~~~~~~v~~~ 45 (384) ||+|.+.|..... . .+. +..+..+-...+ .+.. ....+......+ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~~~~~~sln~~~~i 80 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQASDGIKKKRLKNTINMAKTA 80 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccCCCCccccceeecchHHHH Confidence 9999875432100 0 000 001111110000 0000 011123455666 Q ss_pred HHHHHHhhccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEc Q lcl|NC_019422. 46 IRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLN 125 (384) Q Consensus 46 i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~ 125 (384) ++..|+.+..=+..+- .+++. ..+..+..++.. -.....++..+.+.+..|.+++.+..+..+ ..+-.++ T Consensus 81 ~~~~A~lv~~e~~~i~-v~~~~---~~~e~l~~il~~----n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~--~~i~~v~ 150 (508) T protein:vir:15 81 ARRIASVVFNEKAEIH-VKDNN---EADKFLNDVLED----NDFKNKFEEALEKGVALGGFAMRPYIDGNH--IKIAWVR 150 (508) T ss_pred HHHHHhhhhCCCceEE-eCCch---HHHHHHHHHHHh----ccHHHHHHHHHHHHhhcCceEEEEEEeCCe--eEEEEEc Confidence 6777766654343331 11111 111122222221 223445566678888999999998887543 3455566 Q ss_pred CceEEEEE-cCCC------------------EEEE---EEE--cC------------------ceEEEE---e-----hh Q lcl|NC_019422. 126 ALNVEAIY-ENEV------------------LFLK---FLL--RN------------------GKIVSY---P-----YS 155 (384) Q Consensus 126 ~~~v~~~~-~~~~------------------~~~~---~~~--~~------------------g~~~~~---~-----~~ 155 (384) +..+-+.. +.++ .+|. ++. .+ |..+.+ + .+ T Consensus 151 ad~~~P~~~d~~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~ 230 (508) T protein:vir:15 151 ADQFYPLQSNTNDISEAAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKELAP 230 (508) T ss_pred CCeeEEEEEcCCCeEEEEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhcccccCCCc Confidence 66655421 2211 1110 110 00 111110 0 00 Q ss_pred h----------eEEEeccCCC----CCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEe---eCCCCChHHHH Q lcl|NC_019422. 156 D----------IIHLRKDFNE----NDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLK---FKTALRPDDIK 218 (384) Q Consensus 156 e----------vih~~~~~~~----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~---~~~~~~~e~~~ 218 (384) + ..||+.+-+. +...|.|.+..+...++.++.......+-++. +.+..++. ++.+.... T Consensus 231 ~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~-~~~~i~v~~~~l~~d~~~~--- 306 (508) T protein:vir:15 231 QVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEIRL-GQKHIAVQPGMLRFDDEHK--- 306 (508) T ss_pred ceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHHHh-cccceeechHHhcCCCCCc--- Confidence 1 2344433221 34579999999999999998888777777764 44544441 11111100 Q ss_pred HHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHH-HHHHHHHHHHHHhCCCHHHhccc-----cHHHH---- Q lcl|NC_019422. 219 KEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAA-QMDKAIQRLYSFFNTNEKIIQSK-----YSEDE---- 288 (384) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~-~~~~~~~~I~~~fgvp~~~l~~~-----~~e~~---- 288 (384) .....-.+.|.+... -.++|..++.++....+.+.. .++...+.|....|++|.-+|.. ++.+. T Consensus 307 ~~~~~~~~~~~~~~~------~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei~s~~ 380 (508) T protein:vir:15 307 PTFDTEQNVYVGVLS------DDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVKTATEVVSNN 380 (508) T ss_pred cccCCCCeeEEeccC------CCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCccccHHHHHHHH Confidence 000000011111100 012344577777765555544 45777889999999999987622 11111 Q ss_pred ---------HHHHHHHHHHHHHHHHHHHHhh-cccCc-------ccccCcceEEeechhhhccCHHHHHH-HHHHHhCCC Q lcl|NC_019422. 289 ---------WNAYYESEIEPVGLQLSNQYTE-KLFTR-------KARSFGNEIVFEASNLQYASMSTKLN-LVQMVDRGS 350 (384) Q Consensus 289 ---------~~~~~~~~i~P~~~~i~~~l~~-~l~~~-------~~~~~~~~i~fd~~~~~~~d~~~~~~-~~~~~~~g~ 350 (384) ....++.++..++..+....+. .+... ........+.+++++-...|.++..+ ..+++..|+ T Consensus 381 ~~~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v~aGi 460 (508) T protein:vir:15 381 SMTYQTRSSYLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVLAIGA 460 (508) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHHHHHHHHHHHhcCC Confidence 1123444444444444433221 11111 01112345677787766677666654 456888999 Q ss_pred CCHHHHHHHh-CCCCCC-----------CCCeeeecCceeecCCCC Q lcl|NC_019422. 351 LTPNEWRKIM-NLSPIE-----------NGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 351 ~t~NE~R~~l-G~~p~~-----------~gd~~~~~~n~~~~~~ge 384 (384) ++.-+++++. |++.-+ .....-.-..+.++++++ T Consensus 461 ~s~e~~i~~~~g~~deea~~el~ri~~E~~~~~~~~~~~~~~~g~~ 506 (508) T protein:vir:15 461 LSKQTFLQRNYGMTDEQAAEELAKIQSEAPTDTFEGGRSAILNGGD 506 (508) T ss_pred CCHHHHHHhcCCCChHHHHHHHHHHHHhccccCccccccccCCCCC Confidence 9999987654 764310 111111112344554444 No 166 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=97.91 E-value=1.1e-05 Score=47.69 Aligned_cols=371 Identities=11% Similarity=0.054 Sum_probs=176.4 Q ss_pred CcchhhhcccC-----------CCcc-hh---HHHhhccccCcce-----ec---hhhhhhcHHHHHHHHHHHHhhccCc Q lcl|NC_019422. 1 MNIFKSKKKNK-----------EAPG-KV---MMELISDSGNGFY-----SW---HGNLYKSDIVRSIIRPKAKAVGKMT 57 (384) Q Consensus 1 M~~f~~~~~~~-----------~~~~-~~---~~~~~~~~~~~~~-----~~---~~~~~~~~~v~~~i~~ia~~ia~~~ 57 (384) |..+....... .... +. ...+..+-..... .. ...-...+....+++..+.-+-+-| T Consensus 31 ~~~~~~~~~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p 110 (502) T protein:vir:48 31 ADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNP 110 (502) T ss_pred ccchhhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccceeecchHHHHHHHHhhhhcccC Confidence 22221111000 0000 00 1111111100000 00 0011234577788888888888888 Q ss_pred eEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC Q lcl|NC_019422. 58 AKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV 137 (384) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~ 137 (384) +++--.+++. +.....++.+....-........+..+++.+|.||+.+.++..|.+ .+..++|..+.++.+... T Consensus 111 ~~~~~~d~~~-----~~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~-~i~~~~p~~~~~vydd~~ 184 (502) T protein:vir:48 111 IRVEYDDNED-----NSQNDDAIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYRSEYDET-RIKRLSPLETFVIYDNSL 184 (502) T ss_pred eeEecCCccc-----hhHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCce-EEEEEcccceEEEEcCCC Confidence 8763222111 1112222233333345667888899999999999999999888865 467788998888776431 Q ss_pred ---EE---EEEEE--cCc-e--EEEEehhheEEEeccC----------C---------CCCccCccHHHHHHHHHHHHHH Q lcl|NC_019422. 138 ---LF---LKFLL--RNG-K--IVSYPYSDIIHLRKDF----------N---------ENDLFGTSPAKVLEPIMEVVNT 187 (384) Q Consensus 138 ---~~---~~~~~--~~g-~--~~~~~~~evih~~~~~----------~---------~~~~~G~s~~~~~~~~i~~~~~ 187 (384) .. .+|.. ..+ . ...+.++.++++.... + .+...|.|.+..+...++.... T Consensus 185 ~~~~~~~ir~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~ 264 (502) T protein:vir:48 185 EDNSIAAVRYYNRGTLQNAKDVVEIYTNQHIYTLDASDSFNEISVTPHAFGTVPITEFLNNADGIGDYETELYLIDLYDS 264 (502) T ss_pred CCceEEEEEEEEEeecCCcEEEEEEEeCCeEEEEEeCCceeeccceecCCCccceEEecCCCCCCCchhhhHHHHHHHHH Confidence 11 11111 111 1 1122333333332111 1 1123688999998888888888 Q ss_pred HHHHHHHHHHccCCcceEEeeCCCCCh-HHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHH-HHHHH Q lcl|NC_019422. 188 TDQGVVKAIKNSNTIKWLLKFKTALRP-DDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAA-QMDKA 265 (384) Q Consensus 188 ~~~~~~~~~~ng~~p~~il~~~~~~~~-e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~-~~~~~ 265 (384) ......+.++..+.|-.++.-...... +....+++. ...... ..+..--.+.+.++..+.......... .++.+ T Consensus 265 ~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~-~~~~~~---~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L 340 (502) T protein:vir:48 265 AESDTANHMSDMADAILAIYGDLALPQGMQASDMKRT-RLMQLK---PPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRL 340 (502) T ss_pred HHHHHHHHHHHhcCceeeeecCcccccccchhhhhhc-ceeecc---ccccccccccCcceeEeeecCCHHHHHHHHHHH Confidence 777777777777777555543222211 111111110 000000 000011123455666665544333333 35667 Q ss_pred HHHHHHHhCCCHHHh---ccccHHHHH--------------HHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeec Q lcl|NC_019422. 266 IQRLYSFFNTNEKII---QSKYSEDEW--------------NAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEA 328 (384) Q Consensus 266 ~~~I~~~fgvp~~~l---~~~~~e~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~ 328 (384) .+.|+..-++|+... +++.+..+. ...+...+.-+++.+...++..--.. ......+++.+ T Consensus 341 ~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~--~~d~~~i~i~f 418 (502) T protein:vir:48 341 NKDIHVFTNTPDMSDNHFSGNASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFK--DFDESRLKITF 418 (502) T ss_pred HHHHHHHhCCCCcCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--ccccccceEEe Confidence 788888888886433 222222211 12444555555555555544321111 11122345555 Q ss_pred hhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--C----------C-eeeecCceeecCCC------------ Q lcl|NC_019422. 329 SNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN--G----------D-KPVRRLDTAVVEGG------------ 383 (384) Q Consensus 329 ~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~--g----------d-~~~~~~n~~~~~~g------------ 383 (384) ......|..+.++.+... .|+++..-+.+++++-..+. . + ....+....-..+| T Consensus 419 ~~~~p~d~~e~a~~~~kl-~g~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~d~~~e~~~~~~ 497 (502) T protein:vir:48 419 TPNLPKSLYEQVSILNDL-GGQVSQETALSLSGLVENPTEELDKINEESSKIDFKGYPSYFYDNVGKYTDEVKETHTDDF 497 (502) T ss_pred CCCCCcCHHHHHHHHHHH-hccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhhhcccccccccccccCCCccCCCCcCc Confidence 666667788777766433 37888888888887643210 0 0 00010111000011 Q ss_pred C Q lcl|NC_019422. 384 E 384 (384) Q Consensus 384 e 384 (384) | T Consensus 498 ~ 498 (502) T protein:vir:48 498 E 498 (502) T ss_pred C Confidence 1 No 167 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=97.88 E-value=1.3e-05 Score=47.35 Aligned_cols=366 Identities=10% Similarity=0.004 Sum_probs=168.2 Q ss_pred Ccchhhhcc---cCC--Ccchh------HHHhhccccCcce----ec-----hhhhhhcHHHHHHHHHHHHhhccCceEE Q lcl|NC_019422. 1 MNIFKSKKK---NKE--APGKV------MMELISDSGNGFY----SW-----HGNLYKSDIVRSIIRPKAKAVGKMTAKH 60 (384) Q Consensus 1 M~~f~~~~~---~~~--~~~~~------~~~~~~~~~~~~~----~~-----~~~~~~~~~v~~~i~~ia~~ia~~~~~~ 60 (384) |+.=+..+. ... .+... +-.+..+-...+. .. ....+.......+++..|+-+..-|..+ T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~~~~~~n~~k~i~~~~a~~l~~~p~~i 95 (496) T protein:vir:38 16 MGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNRRQLSMNLPKVTAKYMSKLLFNEKVKI 95 (496) T ss_pred hccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccccceeecchHHHHHHHHhhhhhCCcceE Confidence 544222111 111 11111 1111111101000 00 0122334566788888888887766665 Q ss_pred EEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEE- Q lcl|NC_019422. 61 IRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLF- 139 (384) Q Consensus 61 ~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~- 139 (384) -- + +......+.+-...-....-.+.++.+...+|.+|+.+..+.+|.+ .+-.++|..+-+.....+.. T Consensus 96 ~~-~--------d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~-~i~~v~~~~~~P~~~~~~~~~ 165 (496) T protein:vir:38 96 NI-D--------DKAAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNV-KVSFATADCMYPLSNDSENVD 165 (496) T ss_pred ee-C--------ChHHHHHHHHHHhccCHHHHHHHHHHHHhhhCcEEEEEEEcCCCcE-EEEEEcccceEEEEecCCcEE Confidence 21 1 1111122222222234566777888999999999999999887764 35667777766543332211 Q ss_pred --------------E---EEEE---------------cC----ceEEEE-------ehh------h---eEEEeccC--- Q lcl|NC_019422. 140 --------------L---KFLL---------------RN----GKIVSY-------PYS------D---IIHLRKDF--- 164 (384) Q Consensus 140 --------------~---~~~~---------------~~----g~~~~~-------~~~------e---vih~~~~~--- 164 (384) | ++.. .+ |..+.+ .+. + +.|++.+- T Consensus 166 ~~~f~~~~~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~N~ 245 (496) T protein:vir:38 166 ECVIANSFHKNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFDDIEPVVPLPDFTRPTFIYIKPNIANN 245 (496) T ss_pred EEEEEEEEEeCCeEEEEEEEEEEeCceEEEEEEEEecCCccccCccccccccccccccceeecCCCcceEEEecCCcccc Confidence 1 0000 00 111100 000 1 22333221 Q ss_pred -CCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEe-----eCCCCChHHHHHHHHHHHHHhccccccCCcc Q lcl|NC_019422. 165 -NENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLK-----FKTALRPDDIKKEVKSFEKNYLQIDSEAGGA 238 (384) Q Consensus 165 -~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~-----~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ 238 (384) ......|+|.+..+...++..........+-++.+ ++..++. .....+.+..... ..-.+.+... .. T Consensus 246 ~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~~-~~~i~v~~~~l~~~~~~~g~~~~~~-~~~~~~~~~~-----~~ 318 (496) T protein:vir:38 246 KNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLG-KKKVLVPSSFVKTAVNLDGSTTQYF-DSTDEAFFLY-----QG 318 (496) T ss_pred cccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhhc-ccceecchHHhhccCCCCCccccCC-CCccceEEEe-----ec Confidence 12345799999999999999887777777777653 4443331 1111110000000 0000000000 00 Q ss_pred eecCCCceeeecccchhHHHH-HHHHHHHHHHHHHhCCCHHHhccccH----HHHH--------------HHHHHHHHHH Q lcl|NC_019422. 239 AATDSKYDAEQVKAESYVPNA-AQMDKAIQRLYSFFNTNEKIIQSKYS----EDEW--------------NAYYESEIEP 299 (384) Q Consensus 239 ~v~~~g~~~~~l~~~~~~~~~-~~~~~~~~~I~~~fgvp~~~l~~~~~----e~~~--------------~~~~~~~i~P 299 (384) .-.+++..++.++........ ..++...++|+..-|+||..+|.+.+ ..+. ...++.++.. T Consensus 319 ~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~ 398 (496) T protein:vir:38 319 DQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQLIEQGIKE 398 (496) T ss_pred CCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111233346666655444443 45677888999999999998863211 1111 1123344444 Q ss_pred HHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHH-HHHHhCCCCCHHHHHHHh-CCCCCCCCCeee----- Q lcl|NC_019422. 300 VGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNL-VQMVDRGSLTPNEWRKIM-NLSPIENGDKPV----- 372 (384) Q Consensus 300 ~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~-~~~~~~g~~t~NE~R~~l-G~~p~~~gd~~~----- 372 (384) ++..+.+..+..............+.+.+++-...|..+.++. .+++..|+++.-.+++.+ |.+.. ..++-+ T Consensus 399 l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~GiiS~et~l~~~~~~~d~-ea~~el~ri~~ 477 (496) T protein:vir:38 399 MIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRYTNAKNQGMIPLKIALQRAWNITEA-EADEWAEMLAK 477 (496) T ss_pred HHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCChH-HHHHHHHHHHH Confidence 4444443322111111111233456666666556666666554 467888999988887654 55331 111000 Q ss_pred -----ecCceeecCCCC Q lcl|NC_019422. 373 -----RRLDTAVVEGGE 384 (384) Q Consensus 373 -----~~~n~~~~~~ge 384 (384) +|..-..--+|| T Consensus 478 E~~~~~~~~d~~~~~~~ 494 (496) T protein:vir:38 478 EKQAEMPNNDMNGIFGE 494 (496) T ss_pred hhhccCccccccCCCCC Confidence 111100011222 No 168 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=97.84 E-value=1.5e-05 Score=47.00 Aligned_cols=341 Identities=10% Similarity=0.046 Sum_probs=153.1 Q ss_pred CcchhhhcccCCCcchhHHHhhccccCc--cee-ch---hhh--hhcHHHHHHHHHHHHhhccCceEEEEecCCcceecc Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDSGNG--FYS-WH---GNL--YKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNP 72 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~--~~~-~~---~~~--~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 72 (384) |+++..+-. ....+..+...- ... .. +.. .-..+...+|+.+|+.+.=-.| +.. T Consensus 1 l~~~~~r~~-------~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vds~a~rl~~~Gf---~~~-------- 62 (410) T protein:vir:95 1 MNLYQSRVN-------LRYKHYAMQHYEAPTGITIPAHIRAKYQAVLGWAAKGVDSLADRLIFRAF---AND-------- 62 (410) T ss_pred CCcchhhHH-------HHHHHhcCCCCccccchhccHHHHhHHHhhcchhHHHHHHhHhhhccccc---cCC-------- Confidence 555532211 111111111100 000 00 000 1123555666666654432222 111 Q ss_pred chHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEE----EEEE--EcC Q lcl|NC_019422. 73 EIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLF----LKFL--LRN 146 (384) Q Consensus 73 ~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~----~~~~--~~~ 146 (384) +..++.++.. -+.......+..+.+.+|.+|+.+..++.|.+ .+.+++|..+....+..... +.+. ..+ T Consensus 63 d~~l~~i~~~----N~ld~~~~~~~~~al~~G~sf~~v~~~~d~~~-~i~~~sP~~~~~i~Dp~~~~~~~al~~~~~~~~ 137 (410) T protein:vir:95 63 DFNVTEIFDR----NNPDIFFDSAILSALIGSCSFVYISKGEDDEV-RLQVIESSNATGVIDPITGLLVEGYAVLARDDY 137 (410) T ss_pred CchHHHHHhh----cChHHHHHHHHHHHHHhCceeEEEecCCCCce-EEEEEcccceEEEEeCCCCceEEEEEEEEecCC Confidence 1123334332 23445666788899999999999999888875 57888999888777653211 1111 111 Q ss_pred ceE---EEEehhh---------------------eEEEeccCCCCCccCccHH----HHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_019422. 147 GKI---VSYPYSD---------------------IIHLRKDFNENDLFGTSPA----KVLEPIMEVVNTTDQGVVKAIKN 198 (384) Q Consensus 147 g~~---~~~~~~e---------------------vih~~~~~~~~~~~G~s~~----~~~~~~i~~~~~~~~~~~~~~~n 198 (384) |.. ..+.++. |++|.+....++.+|.|-+ ..+.+.+.....-......++.+ T Consensus 138 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~a~ 217 (410) T protein:vir:95 138 NRPTLEAYFEPNATHFIPKDGEPYSVTNETGIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRTLERADITAEFYSW 217 (410) T ss_pred CeEEEEEEEeCCcEEEEeeCCccccccCCCCCcceEEecccccCCccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcc Confidence 111 1122222 3555433334556788844 44444444444444444555433 Q ss_pred cCCc-ceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCC-----CceeeecccchhHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 199 SNTI-KWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDS-----KYDAEQVKAESYVPNAAQMDKAIQRLYSF 272 (384) Q Consensus 199 g~~p-~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~-----g~~~~~l~~~~~~~~~~~~~~~~~~I~~~ 272 (384) | ..++.++.+..+. +.|+... ++++.++. +.++-+++..+-..-...++.++..||.. T Consensus 218 ---pqr~i~G~d~d~~~~------~~~~~~~-------~~i~~~~~~~~~~~~~v~q~~~~~l~~~~~~l~~l~~~~a~~ 281 (410) T protein:vir:95 218 ---PQKYILGLDPDAEPM------EKWKATV-------SSLLTISSSDKGVKPSVGQFTTASMSPFTEQLRTAAAGFAGE 281 (410) T ss_pred ---hhheeeccCCCCCcC------chhhhhh-------hhheeccCCCCCCcceEEecCCCChHHHHHHHHHHHHHHhhh Confidence 4 3444443222211 1232221 23444432 34565665444333334568889999999 Q ss_pred hCCCHHHhccccHH----HHHHH---HHHHHHHHHHHHHHHHHhh------cccCccc--ccCcceEEeech---hhhcc Q lcl|NC_019422. 273 FNTNEKIIQSKYSE----DEWNA---YYESEIEPVGLQLSNQYTE------KLFTRKA--RSFGNEIVFEAS---NLQYA 334 (384) Q Consensus 273 fgvp~~~l~~~~~e----~~~~~---~~~~~i~P~~~~i~~~l~~------~l~~~~~--~~~~~~i~fd~~---~~~~~ 334 (384) =++|+..+|+...+ ++..+ -+...+.-..+.+.+.+.+ .+..... ......+++.+. +.... T Consensus 282 s~lP~~~lg~~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~d~~~~ 361 (410) T protein:vir:95 282 MGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRYTRSQFVRTAVKWEPLFEADAN 361 (410) T ss_pred cCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccccceeeEEeeecCCcchh Confidence 99999999863321 11110 0111111111111111111 1111111 111222333333 22233 Q ss_pred CHHHHHHHH-HHHhC--CCCCHHHHHHHhCCCCCCCCCeeeecCceeecCCCC Q lcl|NC_019422. 335 SMSTKLNLV-QMVDR--GSLTPNEWRKIMNLSPIENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 335 d~~~~~~~~-~~~~~--g~~t~NE~R~~lG~~p~~~gd~~~~~~n~~~~~~ge 384 (384) +....++.+ |++.. |+.+-.-++++||+.+.+-. ......---.|| T Consensus 362 s~a~~aDa~~Kl~~a~~g~~~~~~~~~~lg~~~~~~~----~~~~~e~~~~g~ 410 (410) T protein:vir:95 362 TMTMIGDGVVKLNQALPGYINAETIRDLTGIAGDMSA----KPVVSEGGSNGE 410 (410) T ss_pred hHHHHHHHHHHHHHhccCCccHHHHHHhcCCChHHHH----HHHHHHHHhCCC Confidence 455566554 56655 78788889999999864211 011111112333 No 169 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=97.80 E-value=1.8e-05 Score=46.60 Aligned_cols=368 Identities=13% Similarity=0.034 Sum_probs=153.1 Q ss_pred CcchhhhcccCC---CcchhHHHhhccccC----cceec---hhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCc--- Q lcl|NC_019422. 1 MNIFKSKKKNKE---APGKVMMELISDSGN----GFYSW---HGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETE--- 67 (384) Q Consensus 1 M~~f~~~~~~~~---~~~~~~~~~~~~~~~----~~~~~---~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~--- 67 (384) .-+.+++..... ..-.....+..+... +.... ...-....+...+|+.+++.+---.|.+-...+.. T Consensus 10 ~~~i~~L~~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~a~~l~~~Gf~~~~~~~~~~~~ 89 (488) T protein:vir:23 10 EKLRDQLLDAFENKQNELKSSKAYYDAERRPDAIGLAVPLDMRKYLAHVGYPRTYVDAIAERQELEGFRIPSANGEEPES 89 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhcCcccchhhhhhhhhcchHHHHHHHHHHhhhccceeccCCccccccc Confidence 111111100000 000001111111000 00000 01112244666777777765543334331111000 Q ss_pred -ceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCc-------eeeEEEEcCceEEEEEcCCC-E Q lcl|NC_019422. 68 -FKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNM-------PTQIYPLNALNVEAIYENEV-L 138 (384) Q Consensus 68 -~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~-------~~~l~~l~~~~v~~~~~~~~-~ 138 (384) ........+..++. -| ........+..+.+.+|.||+.+..+..+. ...+.+++|..+.+..+... . T Consensus 90 ~~d~~~~~~l~~i~~-~N---~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~~~d~~~~~ 165 (488) T protein:vir:23 90 GGENDPASELWDWWQ-AN---NLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVPLIRVEPPTALYAEVDPRTRK 165 (488) T ss_pred ccchhHHHHHHHHHH-hc---ChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcceEEEeccceeEEEEecCCCc Confidence 00011112222322 12 355677778899999999999987643211 12356778877776655321 0 Q ss_pred -----EEEEEEcCceE---EEEehh-------------------------heEEEeccCCCCCccCccHHHH-HHHHHHH Q lcl|NC_019422. 139 -----FLKFLLRNGKI---VSYPYS-------------------------DIIHLRKDFNENDLFGTSPAKV-LEPIMEV 184 (384) Q Consensus 139 -----~~~~~~~~g~~---~~~~~~-------------------------evih~~~~~~~~~~~G~s~~~~-~~~~i~~ 184 (384) .+++...++.. ..+.++ -|++|+++....+.+|.|-+.. +...++. T Consensus 166 ~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da 245 (488) T protein:vir:23 166 VLYAIRAIYGADGNEIVSATLYLPDTTMTWLRAEGEWEAPTSTPHGLEMVPVIPISNRTRLSDLYGTSEISPELRSVTDA 245 (488) T ss_pred eEEEEEEEEecCCCcEEEEEEEecCcEEEEEecCCceEeccccccCCCCcceEEeccccccCCcCCccchhhhHHHHHHH Confidence 11111111110 011111 1566654444556688887653 3333333 Q ss_pred HHHHHHHHHHHHHccCCcceEEeeCCCCChHHH--HHHHHHHHHHhccccccCCcceecCCC--ceeeecccchhHHHHH Q lcl|NC_019422. 185 VNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDI--KKEVKSFEKNYLQIDSEAGGAAATDSK--YDAEQVKAESYVPNAA 260 (384) Q Consensus 185 ~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~--~~~~~~~~~~~~~~~~~~~~~~v~~~g--~~~~~l~~~~~~~~~~ 260 (384) .............-.+.|..++. +........ ++....|+. ..+++..++.| .++.++...+...... T Consensus 246 ~~~~~s~~~~~~~~~a~p~~~i~-G~~~~~~~~~~~~~~~~~~~-------~~~~v~~~~~g~~~~~~q~~~~~~~~~~~ 317 (488) T protein:vir:23 246 AAQILMNMQGTANLMAIPQRLIF-GAKPEELGINAETGQRMFDA-------YMARILAFEGGEGAHAEQFSAAELRNFVD 317 (488) T ss_pred HHHHHHHHHHHHHHhhhHHHHHh-CCCcccccccccccchhhhh-------hhhhhccCCCCCCceeEecCCCChHHHHH Confidence 33333333333232333433332 111111100 011111111 12345555555 4566665544444445 Q ss_pred HHHHHHHHHHHHhCCCHHHhcccc----HHHHH--------------HHHHHHHHHHHHHHHHHHHhhcccCcccccCcc Q lcl|NC_019422. 261 QMDKAIQRLYSFFNTNEKIIQSKY----SEDEW--------------NAYYESEIEPVGLQLSNQYTEKLFTRKARSFGN 322 (384) Q Consensus 261 ~~~~~~~~I~~~fgvp~~~l~~~~----~e~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~ 322 (384) .++..+.+|+..=++|+..+|++. +..+. ...+...+.-++..+....+.. ....... T Consensus 318 ~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~----~~~~~~~ 393 (488) T protein:vir:23 318 ALDALDRKAASYSGLPPQYLSSSSDNPASAEAIKAAESRLVKKVERKNKIFGGAWEQAMRLAYKMVKGG----DIPTEYY 393 (488) T ss_pred HHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC----Ccchhhc Confidence 577889999999999999987642 11111 1122233333333332222211 0011112 Q ss_pred eEEeechhhhccCHHHHHHH-HHHHhCC--CCCHHHHHHHhCCCCCCC--CC----ee-------eecC--ce-e----- Q lcl|NC_019422. 323 EIVFEASNLQYASMSTKLNL-VQMVDRG--SLTPNEWRKIMNLSPIEN--GD----KP-------VRRL--DT-A----- 378 (384) Q Consensus 323 ~i~fd~~~~~~~d~~~~~~~-~~~~~~g--~~t~NE~R~~lG~~p~~~--gd----~~-------~~~~--n~-~----- 378 (384) .+++.+.+....+..+.++. .|++..| +++..-+++++|+.+.+- .+ +- +-+. .. . T Consensus 394 ~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 473 (488) T protein:vir:23 394 RMETVWRDPSTPTYAAKADAAAKLFANGAGLIPRERGWVDMGYTIVEREQMRQWLEQDQKQGLGLIGSLYGASTPEGKPG 473 (488) T ss_pred cceEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHHhCCCCchHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccCC Confidence 34444555555667777665 4566654 788888889998765421 10 00 0000 00 0 Q ss_pred ecCCCC Q lcl|NC_019422. 379 VVEGGE 384 (384) Q Consensus 379 ~~~~ge 384 (384) ....|+ T Consensus 474 ~~~~~~ 479 (488) T protein:vir:23 474 EAPVGE 479 (488) T ss_pred CCCCCC Confidence 001111 No 170 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=97.80 E-value=1.8e-05 Score=46.57 Aligned_cols=356 Identities=10% Similarity=0.046 Sum_probs=160.8 Q ss_pred CcchhhhcccCCCcchhHHHhhccccC------cceechhh----hhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDSGN------GFYSWHGN----LYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKT 70 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~----~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~ 70 (384) =-++........ .-.....+..+-.. ........ ...+.+...+|+..++.+--..|.+ .++. T Consensus 20 ~~l~~~~~~~~~-r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~gf~~---~d~~--- 92 (479) T protein:vir:99 20 TKVFPKMNTECE-RLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMVNSFAQQLIVDGYRK---TGTN--- 92 (479) T ss_pred HHHHHHHHHHhH-HHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHHHHHHhhcccccccC---CCch--- Confidence 011111100000 00001111111000 00000001 1123456677777776553223321 2111 Q ss_pred ccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEee-----CCCCceeeEEEEcCceEEEEEcCCCE----EEE Q lcl|NC_019422. 71 NPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIK-----DDYNMPTQIYPLNALNVEAIYENEVL----FLK 141 (384) Q Consensus 71 ~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~-----~~~g~~~~l~~l~~~~v~~~~~~~~~----~~~ 141 (384) .......++.. | ........+..+.+.+|.+|+++.. ++.|.+ .+..++|..+.+..+.... .|. T Consensus 93 -~~~~~~~i~~~-N---~~d~~~~~~~~~a~~~G~af~~v~~~~~~~d~~g~~-~i~~~~p~~~~~iydd~~~~~~~~~~ 166 (479) T protein:vir:99 93 -ENAKGWDTWRL-N---QMDKQQFWLNRAVLTFGYAFIKVTSGISPLDGTTVA-RIKCIDPRDAFAIWEDPYWDEWPKYL 166 (479) T ss_pred -hhHHHHHHHHh-c---ChhHHHHHHHHHHhhcCceEEEEecCCCCcCCCCce-EEEEechhheEEEecCCcccceeeEE Confidence 12223334332 3 2335667788999999999998874 344543 4667788888776543221 111 Q ss_pred -------------------EEEcCceEEEEeh-----h--heEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 142 -------------------FLLRNGKIVSYPY-----S--DIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKA 195 (384) Q Consensus 142 -------------------~~~~~g~~~~~~~-----~--evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~ 195 (384) +...+|....... . -|++|.++... .-+|.|.+..+...++............ T Consensus 167 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~-~~~g~sd~e~v~~liDa~~~~~s~~~~~ 245 (479) T protein:vir:99 167 LERQPNGQYWWWTEEDYSIFEFKQGKFIYRETVSHDYGHIPFVRYVNVMDL-RGVCYGDVEPLVTVAKAIDKTGLDILLV 245 (479) T ss_pred EeecCceeEEEEecceEEEEEecCCceeeccccccCCCCcceEEeecCCCc-CcCCcchhHHHHHHHHHHHHHHHHHHHH Confidence 1111121111000 1 15666544322 3479999988888888877766666666 Q ss_pred HHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcc-eecCCCceeeecccchhHHHHHHHHHHHHHHHHHhC Q lcl|NC_019422. 196 IKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGA-AATDSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFN 274 (384) Q Consensus 196 ~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~-~v~~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fg 274 (384) .+-.+.|..++.- ....++. ......+.. ..+++ ...+++.++.++...+.......++..+.+|+..=+ T Consensus 246 ~~~~a~p~~~i~G-~~~~~~~-~~~~~~~~~-------~~~~i~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~t~ 316 (479) T protein:vir:99 246 QHHQSFQIRWATG-LMLPEGA-NADQEKMRF-------AQESMLISQNEKASFGAIPAAPLDGLLNAYKESLLEFLALAQ 316 (479) T ss_pred HHHhhchhhhhcC-CCccccc-ccchhcccc-------ccccceeecCCCceEEEecccchHHHHHHHHHHHHHHhccCC Confidence 6666666544431 1111110 000011111 11223 334566677666644433334556778889999999 Q ss_pred CCHHHhcc-ccHH-HHH--------------HHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHH Q lcl|NC_019422. 275 TNEKIIQS-KYSE-DEW--------------NAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMST 338 (384) Q Consensus 275 vp~~~l~~-~~~e-~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~ 338 (384) +|+..+|. ++.. .+. ...+...+.-+++.+.... ..........+++.+.+....+..+ T Consensus 317 ~p~~~~g~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~~-----~~~~~~~~~~i~~~w~~~~~~s~~~ 391 (479) T protein:vir:99 317 LPPHIAGQIVNVAADALAAGTRQTMQKLFEKQATWKASHNQTMRLVNKIE-----GRTEEATDLDFTITWQDVTIQSLAQ 391 (479) T ss_pred CCHHHcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc-----CCCccccceeeeEEecCCCCCCHHH Confidence 99998873 2221 111 1122233333333332211 1111111223455555555566777 Q ss_pred HHHHH-HHHhCCCCCHHHHHHHh-CCCCCC--CC----------CeeeecCc--eeec-----CCCC Q lcl|NC_019422. 339 KLNLV-QMVDRGSLTPNEWRKIM-NLSPIE--NG----------DKPVRRLD--TAVV-----EGGE 384 (384) Q Consensus 339 ~~~~~-~~~~~g~~t~NE~R~~l-G~~p~~--~g----------d~~~~~~n--~~~~-----~~ge 384 (384) .++.+ |++..|+++...+.+++ |+++.+ .. +....... ..+. .+|+ T Consensus 392 ~ad~~~kl~~ag~is~et~l~~l~gv~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (479) T protein:vir:99 392 FADAWAKMVESLKIPAEGVWDMIPNLDQSTVNGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGPNGA 458 (479) T ss_pred HHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCCCCCC Confidence 77654 67778888888888877 776531 00 00000000 0000 1111 No 171 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=97.79 E-value=1.9e-05 Score=46.47 Aligned_cols=372 Identities=10% Similarity=0.077 Sum_probs=175.9 Q ss_pred Cc------chhhhcccCCCcchhHHHhhccccCcce-----e---chhhhhhcHHHHHHHHHHHHhhccCceEEEEecCC Q lcl|NC_019422. 1 MN------IFKSKKKNKEAPGKVMMELISDSGNGFY-----S---WHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNET 66 (384) Q Consensus 1 M~------~f~~~~~~~~~~~~~~~~~~~~~~~~~~-----~---~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~ 66 (384) +. +.++-.......-.....+..+...... . ....-...+....+++..+.-+-+-|+++- .+++ T Consensus 40 ~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~-~~d~ 118 (511) T protein:vir:93 40 QNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQ-DDDK 118 (511) T ss_pred ccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccCcceeecchHHHHHHHHhhhhcccCeeec-cCCh Confidence 11 1111000000000011122211111000 0 000112245667788888888877777752 1111 Q ss_pred cceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC--E-E---E Q lcl|NC_019422. 67 EFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV--L-F---L 140 (384) Q Consensus 67 ~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~-~---~ 140 (384) .....+.+-+...........+..+.+.+|.||+++..+..|.+. +..++|..+.++.+... . . . T Consensus 119 --------~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~de~~~~~-i~~~~p~~~~~vydd~~~~~~~~~vr 189 (511) T protein:vir:93 119 --------DVLEVIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETR-LYKSDAMSTFVIYDNTIERNSIAGVR 189 (511) T ss_pred --------HHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCceE-EEEEccceeEEEEcCCCCCceEEEEE Confidence 111122222233356677888889999999999999998888654 77889999988776532 1 1 1 Q ss_pred EEEEc--Cc---e----EEEEehhheEEEeccCC-------------------------CCCccCccHHHHHHHHHHHHH Q lcl|NC_019422. 141 KFLLR--NG---K----IVSYPYSDIIHLRKDFN-------------------------ENDLFGTSPAKVLEPIMEVVN 186 (384) Q Consensus 141 ~~~~~--~g---~----~~~~~~~evih~~~~~~-------------------------~~~~~G~s~~~~~~~~i~~~~ 186 (384) +|... .+ + ...+.++.+.+++.... .+...|.|.+..+...++... T Consensus 190 ~~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~gd~e~v~~liDa~d 269 (511) T protein:vir:93 190 YLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYD 269 (511) T ss_pred EEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccccccccccccCCCccceEEecCCCCCCCchhhHHHHHHHHH Confidence 11111 11 0 01234444544432211 012368888998888888887 Q ss_pred HHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHH-HHHHHH Q lcl|NC_019422. 187 TTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNA-AQMDKA 265 (384) Q Consensus 187 ~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~-~~~~~~ 265 (384) .+.....+.+...+.|-.+++-....+.++....++...-.........+...-.+++.++..++.......+ ..++.+ T Consensus 270 ~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L 349 (511) T protein:vir:93 270 NAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRL 349 (511) T ss_pred HHHHHHHHHHHHhhCcceeeecCcccCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHH Confidence 7777777777776666555543223333332222211110000000000111223456666666654333333 334566 Q ss_pred HHHHHHHhCCCHHHh---ccccHHHH--------------HHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeec Q lcl|NC_019422. 266 IQRLYSFFNTNEKII---QSKYSEDE--------------WNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEA 328 (384) Q Consensus 266 ~~~I~~~fgvp~~~l---~~~~~e~~--------------~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~ 328 (384) .+.|+..-++|..-. +++.+..+ ....+...+.-.++.+...+...--.... .....+++.+ T Consensus 350 ~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~~-~d~~~i~~~f 428 (511) T protein:vir:93 350 NSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDAN-KDFNTVRYVY 428 (511) T ss_pred HHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccc-cccccceEEe Confidence 777877778876432 23322211 12244555555555555544432211111 1112345555 Q ss_pred hhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--C--------Cee-eecCce----eecCCCC Q lcl|NC_019422. 329 SNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN--G--------DKP-VRRLDT----AVVEGGE 384 (384) Q Consensus 329 ~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~--g--------d~~-~~~~n~----~~~~~ge 384 (384) ..-...+..+.++.+... .|+++..-+++++++-+.|. . +.. ....+. -..++++ T Consensus 429 ~~~~p~n~~e~~~~~~kl-~g~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 498 (511) T protein:vir:93 429 NRNLPKSLIEELKAYIDS-GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDE 498 (511) T ss_pred CCCCCCCHHHHHHHHHHH-hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhhcccCCCCCCCCC Confidence 665666777777765433 47899888888887644211 0 000 000011 1111111 No 172 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=97.79 E-value=1.9e-05 Score=46.46 Aligned_cols=369 Identities=12% Similarity=0.044 Sum_probs=149.6 Q ss_pred CcchhhhcccCC---CcchhHHHhhccccCc--cee-chh----hhhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_019422. 1 MNIFKSKKKNKE---APGKVMMELISDSGNG--FYS-WHG----NLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKT 70 (384) Q Consensus 1 M~~f~~~~~~~~---~~~~~~~~~~~~~~~~--~~~-~~~----~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~ 70 (384) +-+.+.+..... ..-.....+..+-... ... ... ......+...+|+.+|+.+.--.|. ..+ +. T Consensus 23 ~~~i~~L~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~~~v~n~~~~iVd~~a~rl~~~Gf~---~~d-~~-- 96 (504) T protein:vir:99 23 VDKVNGLYQQLVDRTPRNLLRASFYDGKYAIRQIGNLIPPEYLRTATVLGWSAKAVDTLARRCNLESFV---WPD-GD-- 96 (504) T ss_pred HHHHHHHHHHHHHHhHHHHHHHHHHhccccchhccccccHHHHHHhhccCcHHHHHHHHHhhhccceee---CCC-CC-- Confidence 222221111100 0001111111111000 000 000 1112345566777777654322232 221 11 Q ss_pred ccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCcee-eEEEEcCceEEEEEcCCCE------EEEEE Q lcl|NC_019422. 71 NPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPT-QIYPLNALNVEAIYENEVL------FLKFL 143 (384) Q Consensus 71 ~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~-~l~~l~~~~v~~~~~~~~~------~~~~~ 143 (384) ..+..++.++.. |. .......+..+.+.+|.||+.+..++.|.+. -+.+++|..+.+..+.... .++.. T Consensus 97 ~~~~~l~~i~~~-N~---ld~~~~~~~~~a~iyG~af~~v~~~~d~~~~~~I~~~sP~~~~~iyD~~~~~~~~a~~~~~~ 172 (504) T protein:vir:99 97 YGSIGGPDVWDE-NF---FATKANNAMVSSLIHGPAFLINTEGGAGEPDSLIHVKSAMQATGEWNSRRNAMDSLLSITSR 172 (504) T ss_pred hhhHHHHHHHHh-cC---hhhHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceeEEEEeCCCCceeEEEEEEEe Confidence 112223344432 32 3456778889999999999999998888764 5678899988877664321 11111 Q ss_pred EcCceEE---EEehhh------------------------eEEEeccCCCCCccCccHHH-HHHHHHHHHH---HHHHHH Q lcl|NC_019422. 144 LRNGKIV---SYPYSD------------------------IIHLRKDFNENDLFGTSPAK-VLEPIMEVVN---TTDQGV 192 (384) Q Consensus 144 ~~~g~~~---~~~~~e------------------------vih~~~~~~~~~~~G~s~~~-~~~~~i~~~~---~~~~~~ 192 (384) ..+|+.. .+.++. |+++.++...+..+|.|.+. .+...++..+ .-.... T Consensus 173 d~~g~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~gvPvV~~~n~~~~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~ 252 (504) T protein:vir:99 173 DAEGHPTGIALYEDGVTVTADMDDDGDWHADVRTHKLGVPVEVLPYKPREDRPLGSSRITRPVMSLQQRALKGCIRMDGH 252 (504) T ss_pred cCCCeEEEEEEEcCCcEEEEEEcCCceeeeccccCCCCcceEEecccccCccccCcccchhhHHHHHHHHHHHHHHHHHH Confidence 2222211 112222 44554333335567888553 3333333333 222333 Q ss_pred HHHHHccCCcce-EEeeCC-CCChHHHHHHHHHHHHHhc---cccccCCcceecCCCceeeecccchhHHHHHHHHHHHH Q lcl|NC_019422. 193 VKAIKNSNTIKW-LLKFKT-ALRPDDIKKEVKSFEKNYL---QIDSEAGGAAATDSKYDAEQVKAESYVPNAAQMDKAIQ 267 (384) Q Consensus 193 ~~~~~ng~~p~~-il~~~~-~~~~e~~~~~~~~~~~~~~---~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~~~~~~ 267 (384) ..++. .|.. ++.++. ....++. +....|+.... ....+....+.-....++-++....-..-...++.++. T Consensus 253 ~e~~a---~p~r~i~G~~~~~~~~~d~-~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~q~~~~~l~~~~~~l~~~i~ 328 (504) T protein:vir:99 253 ADVYS---FPQLILLGADAKNFRNKDG-SMKPAWQIALARVFALPDDEDEPDAARARADVKQFPASSPQPHIEMLEQIAM 328 (504) T ss_pred HHHhc---chhhhhccCCccccccccc-cccchhhhhhhhhhcCCCccccccccCccceeeecCCCChHHHHHHHHHHHH Confidence 33333 3322 222211 1111110 11122222111 11111111111223455666655443333345688899 Q ss_pred HHHHHhCCCHHHhccc-----cHHHHHH---HHHHHHHHHHHHHHHHHHhh------cccCc--ccccCcceEEeechhh Q lcl|NC_019422. 268 RLYSFFNTNEKIIQSK-----YSEDEWN---AYYESEIEPVGLQLSNQYTE------KLFTR--KARSFGNEIVFEASNL 331 (384) Q Consensus 268 ~I~~~fgvp~~~l~~~-----~~e~~~~---~~~~~~i~P~~~~i~~~l~~------~l~~~--~~~~~~~~i~fd~~~~ 331 (384) +|+..=++|+..+|-. .+.++.+ .-+...+.-..+.+.+.|.+ .+... ........+++.+.+. T Consensus 329 ~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l~~~~rla~~~~~~~~~~~~~~~~~~v~w~d~ 408 (504) T protein:vir:99 329 MFSGETSIPVESLGFSNRANPTSADAYIASREDLIAEAEGATDDWSPAFRRSMIRALAIKNGLDRIPPEWKTIDSKFRSP 408 (504) T ss_pred HHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceeEecCC Confidence 9999999999988621 1111111 01111111111222222211 01110 0112223344455666 Q ss_pred hccCHHHHHHHH-HHHhCCC--CCH-HHHHHHhCCCCCC-----------CC----CeeeecCceeecCCC--------C Q lcl|NC_019422. 332 QYASMSTKLNLV-QMVDRGS--LTP-NEWRKIMNLSPIE-----------NG----DKPVRRLDTAVVEGG--------E 384 (384) Q Consensus 332 ~~~d~~~~~~~~-~~~~~g~--~t~-NE~R~~lG~~p~~-----------~g----d~~~~~~n~~~~~~g--------e 384 (384) ...+..+.++++ |++..|. +.. .-+++++|+.+-+ .+ |.+....+.. ..++ | T Consensus 409 ~~~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg~~~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~-~~~~~~~~~~~~e 487 (504) T protein:vir:99 409 LYLSKAAQADAGAKMLGAGPEWLKETEVGLELLGLTPQQAKRALAERRRASSVSIIEALNRRQQEA-ATAGEDQDQGAGE 487 (504) T ss_pred CccCHHHHHHHHHHHHhhccccccchHHHHhhcCCCHHHHHHHHHHHHHHhhHHHHHHHhcccCCC-CCCCCCCCcCCCC Confidence 667777887765 5666553 222 2344556665431 00 1111111111 1111 1 No 173 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=97.76 E-value=2.1e-05 Score=46.19 Aligned_cols=360 Identities=10% Similarity=0.004 Sum_probs=156.2 Q ss_pred Ccchh----hhccc---CCCcchhHHHhhccccC----cceech---hhhhhcHHHHHHHHHHHHhhccCceEEEEecCC Q lcl|NC_019422. 1 MNIFK----SKKKN---KEAPGKVMMELISDSGN----GFYSWH---GNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNET 66 (384) Q Consensus 1 M~~f~----~~~~~---~~~~~~~~~~~~~~~~~----~~~~~~---~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~ 66 (384) |.-=. ..... ....-.....+..+-.. +..... ..-....+...+|+..+..+--..|.+ .++ T Consensus 1 ~~t~~d~i~~L~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~---~~d 77 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SED 77 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccccchhhhhhhhhcchHHHHHHHHHhhhccCceec---CCC Confidence 21100 00000 00000000111111000 000000 000123355666776666553333322 211 Q ss_pred cceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeC------CCCceeeEEEEcCceEEEEEcCCC--E Q lcl|NC_019422. 67 EFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKD------DYNMPTQIYPLNALNVEAIYENEV--L 138 (384) Q Consensus 67 ~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~------~~g~~~~l~~l~~~~v~~~~~~~~--~ 138 (384) . .....+..++.. | ........+..+.+.+|.||+.+.++ ..|.+ .+.+++|..+.+..+... . T Consensus 78 -~--~~~~~l~~i~~~-N---~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~i~D~~~~~~ 149 (480) T protein:vir:78 78 -S--EGLEELWNWWQA-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRR 149 (480) T ss_pred -c--hhHHHHHHHHHh-c---CHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCee-EEEEEcccceEEEEcCCCccc Confidence 1 112233344432 2 34567788889999999999988753 34443 477888888887776431 1 Q ss_pred E---EE-EEEc-C-ce---EEEEehh-----------------------------heEEEeccCCCCCccCccHHHH-HH Q lcl|NC_019422. 139 F---LK-FLLR-N-GK---IVSYPYS-----------------------------DIIHLRKDFNENDLFGTSPAKV-LE 179 (384) Q Consensus 139 ~---~~-~~~~-~-g~---~~~~~~~-----------------------------evih~~~~~~~~~~~G~s~~~~-~~ 179 (384) . +. |... + +. ...+.++ -|+||.++...++.+|.|-+.. +. T Consensus 150 ~~~~i~~~~~~d~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sdi~~~i~ 229 (480) T protein:vir:78 150 VTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELR 229 (480) T ss_pred eEEEEEEEEeecCCcceEEEEEEeCCeEEEEEecCCCcccccccccccccCCCCcceEEeecccccCCccCccchhHHHH Confidence 0 11 1000 0 00 0001111 2455654444455678887653 44 Q ss_pred HHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecC-CCceeeecccchhHHH Q lcl|NC_019422. 180 PIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATD-SKYDAEQVKAESYVPN 258 (384) Q Consensus 180 ~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~-~g~~~~~l~~~~~~~~ 258 (384) ..++..............-.+.|..++. +....+...+.....|.. ..+.++.++ ++.++.++.....+.- T Consensus 230 ~l~Da~~~~~s~~~~~~~~~a~p~~~i~-G~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~ 301 (480) T protein:vir:78 230 KVTDAASRTLMNLQSASQILGTPLRVIS-GVTTDELTNDGENTTLDI-------YYGRILTLASEAAKISEFKAAELRNF 301 (480) T ss_pred HHHHHHHHHHHHHHHHHHhhcchhhhhh-CCCccccccccccchhhh-------hhhhhccCCCCCceEEecCccCHHHH Confidence 5555544444444444444445544442 222211111111111111 112334433 4566766665544433 Q ss_pred HHHHHHHHHHHHHHhCCCHHHhcccc----HHHHHH--------------HHHHHHHHHHHHHHHHHHhhcccCcccccC Q lcl|NC_019422. 259 AAQMDKAIQRLYSFFNTNEKIIQSKY----SEDEWN--------------AYYESEIEPVGLQLSNQYTEKLFTRKARSF 320 (384) Q Consensus 259 ~~~~~~~~~~I~~~fgvp~~~l~~~~----~e~~~~--------------~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~ 320 (384) ...++..+.+++..=++|+..+|+.. +..+.+ ..+...+.-+++.+... ........ T Consensus 302 ~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~-----~~~~~~~~ 376 (480) T protein:vir:78 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQI-----MGREVTEE 376 (480) T ss_pred HHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----cCCCcccc Confidence 44568889999999999999998642 221111 11222232233322221 22111222 Q ss_pred cceEEeechhhhccCHHHHHHH-HHHHhCC--CCCHHHHHHHhCCCCCCC----------C----CeeeecC--c--eee Q lcl|NC_019422. 321 GNEIVFEASNLQYASMSTKLNL-VQMVDRG--SLTPNEWRKIMNLSPIEN----------G----DKPVRRL--D--TAV 379 (384) Q Consensus 321 ~~~i~fd~~~~~~~d~~~~~~~-~~~~~~g--~~t~NE~R~~lG~~p~~~----------g----d~~~~~~--n--~~~ 379 (384) ...+++.+.+....+..+.++. .|++..| +++-.-+++++|+.+.+- + +.+..+. . -++ T Consensus 377 ~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 456 (480) T protein:vir:78 377 YTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATP 456 (480) T ss_pred ceeeeEEecCCCCCCHHHHHHHHHHHHHhcccCCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccccCCCcccc Confidence 3345666655555666666654 4666544 677777788888876420 1 1111100 0 000 Q ss_pred -cCCCC Q lcl|NC_019422. 380 -VEGGE 384 (384) Q Consensus 380 -~~~ge 384 (384) -..|| T Consensus 457 ~~~~~~ 462 (480) T protein:vir:78 457 KPTVTE 462 (480) T ss_pred CCCCCC Confidence 01111 No 174 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=97.76 E-value=2.2e-05 Score=46.16 Aligned_cols=367 Identities=10% Similarity=0.019 Sum_probs=176.6 Q ss_pred chhhhcccCCCcchhHHHhhccccCccee--------chhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccch Q lcl|NC_019422. 3 IFKSKKKNKEAPGKVMMELISDSGNGFYS--------WHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEI 74 (384) Q Consensus 3 ~f~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~ 74 (384) +...........-.....+..+....... ....-+..+....+|+..+.-+-+-|.++- ..+++... T Consensus 1 ~~~~~~~~~~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~-~~~~~~~~---- 75 (440) T protein:vir:95 1 MLAAFLGSQKQRLAILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGYVIGNPVSIG-VMEGGSAD---- 75 (440) T ss_pred ChhhHHHHHHHHHHHHHHHhccCCcccccccccccccCCcceeecchHHHHHHhhhhheeccCceEe-eCCCccHH---- Confidence 33322211111111111122111110000 000112345667788888877766666652 12211111 Q ss_pred HHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCE------EEEEEEcCce Q lcl|NC_019422. 75 YIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVL------FLKFLLRNGK 148 (384) Q Consensus 75 ~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~------~~~~~~~~g~ 148 (384) ....| .+-............+..+.+.+|.+|+.+..+..|.+. +..++|..+.++.+.... +.++...+.. T Consensus 76 ~~~~l-~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~-i~~~~p~~~~~~~d~~~~~~~~~~i~~~~~~~~~ 153 (440) T protein:vir:95 76 QLSTI-KDIEWQNDINALNSDLAFDASVYGRAYEYHFRDKDKVDR-VVLISPLEMFVIRDLTVEQNIIAAVHLPIYADKV 153 (440) T ss_pred HHHHH-HHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCceE-EEEEcccceEEEEcCCCCCceEEEEEEEEecCce Confidence 11111 222122245566777888999999999999998888754 677899999888765431 1122222221 Q ss_pred EE-EEehhheEEEecc--------------CCC---------CCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcce Q lcl|NC_019422. 149 IV-SYPYSDIIHLRKD--------------FNE---------NDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKW 204 (384) Q Consensus 149 ~~-~~~~~evih~~~~--------------~~~---------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~ 204 (384) .. .+..+.+++++.. ++. +...|.|.+..+...++..........+..+..+.|.. T Consensus 154 ~~~vyt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~lida~~~~~s~~~~~~~~~~~~~~ 233 (440) T protein:vir:95 154 NMTVYTKDKVITYKPYSNNSVRLVVDDVKKHSYNDVPVVEWWNNRFRMGDYESEISLIDAYDAGQSDTANYMSDLNDAML 233 (440) T ss_pred EEEEEeCCeEEEEEEecCCccceeecceeeccCceeeEEEeeCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhccee Confidence 11 2333333333211 111 12358888888888888877777777777777777766 Q ss_pred EEeeC---CCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHH-HHHHHHHHHHHHHhCCCHHH- Q lcl|NC_019422. 205 LLKFK---TALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNA-AQMDKAIQRLYSFFNTNEKI- 279 (384) Q Consensus 205 il~~~---~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~-~~~~~~~~~I~~~fgvp~~~- 279 (384) +++-. ...++++....++.-.-... ........+++.+++.+........+ ..++.+.+.|+..-++|..- T Consensus 234 v~~g~~~~~~~~~e~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~ 309 (440) T protein:vir:95 234 LVKGDLDGIKLSPEDAAKMKDANMLFLK----TGISTTGQQTTADASYIYKQYDVNGTEAYKNRLANDIHRFSRIPNLDD 309 (440) T ss_pred eeecccccCCCCccchhhhhhccceecc----cccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccc Confidence 66432 22344444443322111110 01111222344445555443333333 34566778888888888633 Q ss_pred --hccccHHH--------------HHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHH Q lcl|NC_019422. 280 --IQSKYSED--------------EWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLV 343 (384) Q Consensus 280 --l~~~~~e~--------------~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~ 343 (384) ++++.+.. .....+...+..+++.+...++..--... ....+++.+..-...+..+.++.+ T Consensus 310 ~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~---~~~~v~i~f~~~~p~~~~~~ad~~ 386 (440) T protein:vir:95 310 DRFNSTSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAINGPVI---EANKLTFTFHPNIPQDVWTEIKAY 386 (440) T ss_pred ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccc---ccccceEEeCCCCCCCHHHHHHHH Confidence 22222221 11234455555555555555443211111 122345555666667777777765 Q ss_pred HHHhCCCCCHHHHHHHhCCCCCC--------C--CCeeeecCceeecCCCC Q lcl|NC_019422. 344 QMVDRGSLTPNEWRKIMNLSPIE--------N--GDKPVRRLDTAVVEGGE 384 (384) Q Consensus 344 ~~~~~g~~t~NE~R~~lG~~p~~--------~--gd~~~~~~n~~~~~~ge 384 (384) ... .|+++..-+.++++....+ . .+..-...-.-..++|+ T Consensus 387 ~kl-~g~iS~et~~~~l~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 436 (440) T protein:vir:95 387 IEA-GGEISQETLMENASFTDYKTEHSRILKQGGSSDLEIGQIVGDADVGQ 436 (440) T ss_pred HHH-hccCcHHHHHHhCCCCCcHHHHHHHHHHHHHhhhhHHhhccCCCCCC Confidence 433 4788888887887764321 0 11111111111223333 No 175 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=97.72 E-value=2.5e-05 Score=45.83 Aligned_cols=372 Identities=10% Similarity=0.075 Sum_probs=169.5 Q ss_pred Ccchhh----------hc-------ccCCCcchhHHHhhccccCcceec---h-----hhhhhcHHHHHHHHHHHHhhcc Q lcl|NC_019422. 1 MNIFKS----------KK-------KNKEAPGKVMMELISDSGNGFYSW---H-----GNLYKSDIVRSIIRPKAKAVGK 55 (384) Q Consensus 1 M~~f~~----------~~-------~~~~~~~~~~~~~~~~~~~~~~~~---~-----~~~~~~~~v~~~i~~ia~~ia~ 55 (384) ..+|.. .+ ......-.....+..+........ . ..-...+.....++..+.-+.+ T Consensus 29 ~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g 108 (511) T protein:vir:99 29 YTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLG 108 (511) T ss_pred cccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHHhhhcc Confidence 111110 00 000000001111221111110000 0 0012235566777777777777 Q ss_pred CceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcC Q lcl|NC_019422. 56 MTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYEN 135 (384) Q Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~ 135 (384) -|+++- .+++ .....+..++.. -........+..+.+.+|.+|+++.++..|.+ .+..++|..+.++.+. T Consensus 109 ~p~~~~-~~d~----~~~~~l~~~~~~----n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~-~i~~~~p~~~~~vyd~ 178 (511) T protein:vir:99 109 NPIQYQ-DDDK----DVLEAIEAFNDL----NDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDN 178 (511) T ss_pred cCceee-cCch----HHHHHHHHHHhh----cCHhHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEccceeEEEEcC Confidence 777752 1111 111222333322 24556777888999999999999999888864 5778899998887765 Q ss_pred CC--E----EEEEEEc--C-c--e----EEEEehhheEEEeccCC-------------------------CCCccCccHH Q lcl|NC_019422. 136 EV--L----FLKFLLR--N-G--K----IVSYPYSDIIHLRKDFN-------------------------ENDLFGTSPA 175 (384) Q Consensus 136 ~~--~----~~~~~~~--~-g--~----~~~~~~~evih~~~~~~-------------------------~~~~~G~s~~ 175 (384) .. . +.+|... . + . ...+.++.+.+++.... .+...|.|.+ T Consensus 179 ~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~ 258 (511) T protein:vir:99 179 TIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDY 258 (511) T ss_pred CCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCccccccccccccccCCCCccceEEecCCCCCCCch Confidence 42 1 1111111 1 1 0 11234444555432110 0113688888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchh Q lcl|NC_019422. 176 KVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESY 255 (384) Q Consensus 176 ~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~ 255 (384) ..+...++....+.....+.+...+.|-.+++-....++++....++.-.-.........+...-.++|.+++.+..... T Consensus 259 e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~ 338 (511) T protein:vir:99 259 EKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYD 338 (511) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcccCchhhcccccccceecccccccccccccCCCCcceeEEeecCC Confidence 88888888777766666666666556654443323333333222221100001000000111223345667776665444 Q ss_pred HHHHH-HHHHHHHHHHHHhCCCHHHh---ccccHHHHH--------------HHHHHHHHHHHHHHHHHHHhhcccCccc Q lcl|NC_019422. 256 VPNAA-QMDKAIQRLYSFFNTNEKII---QSKYSEDEW--------------NAYYESEIEPVGLQLSNQYTEKLFTRKA 317 (384) Q Consensus 256 ~~~~~-~~~~~~~~I~~~fgvp~~~l---~~~~~e~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~ 317 (384) ...+. .++.+.+.|+..-++|..-. +++.+..+. ...+...+.-.++.+...+...--... T Consensus 339 ~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~gn~Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~- 417 (511) T protein:vir:99 339 VQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDV- 417 (511) T ss_pred HHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccc- Confidence 43333 34667778888888876432 233222211 123334444444444444433211111 Q ss_pred ccCcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC----------CC---eeeec--CceeecCC Q lcl|NC_019422. 318 RSFGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN----------GD---KPVRR--LDTAVVEG 382 (384) Q Consensus 318 ~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~----------gd---~~~~~--~n~~~~~~ 382 (384) ......+++.+..-...+..+.++.+... .|+++..-++++++.-+.+. .+ ....+ ...-+.++ T Consensus 418 ~~~~~~i~i~f~~~~p~n~~e~~~~~~kl-~GiiS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 496 (511) T protein:vir:99 418 SKDFNTVRYVYNRNLPKSLIEELKAYIDS-GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKNMYQDPRNIND 496 (511) T ss_pred ccccccceEEeCCCCCcCHHHHHHHHHHH-hccCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccccCCCCCC Confidence 11112344445555556777777765433 38889888888876543210 00 00000 00011111 Q ss_pred CC Q lcl|NC_019422. 383 GE 384 (384) Q Consensus 383 ge 384 (384) ++ T Consensus 497 ~~ 498 (511) T protein:vir:99 497 DE 498 (511) T ss_pred CC Confidence 11 No 176 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=97.72 E-value=2.1e-06 Score=51.77 Aligned_cols=165 Identities=13% Similarity=0.153 Sum_probs=84.5 Q ss_pred EEeeCC---CCChHHHHHHHHHHH--HHhccccccCCcceecCCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHH Q lcl|NC_019422. 205 LLKFKT---ALRPDDIKKEVKSFE--KNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKI 279 (384) Q Consensus 205 il~~~~---~~~~e~~~~~~~~~~--~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~ 279 (384) ++++++ .++.+. .+.++++. ..+++ ..+.+.+...+.+|+.++.+...+.-. +......||++-|||... T Consensus 1 V~k~~~l~~~~~~~~-~~~~~r~~~~~~~~~---~~~~~~ld~~~e~~e~~~~~lsGl~d~-l~~~~~~iaa~s~iP~t~ 75 (201) T protein:vir:10 1 MWKAKGLADLCDDSD-GAARLRLAQVDNNSG---VGQAIGIDADSEEYNVLNSDIGGIDTF-LSQKFDRIVALSGIHEII 75 (201) T ss_pred CccchHHHHHhcCCh-HHHHHHHHHHHHhhh---hhhhheeecCCcceeeeecCcCChHHH-HHHHHHHHHhHhcCchhh Confidence 444332 111111 12222222 22222 223455555667788887766544321 245677899999999987 Q ss_pred hccc-------cHHHHHHHHH-------HHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHH-- Q lcl|NC_019422. 280 IQSK-------YSEDEWNAYY-------ESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLV-- 343 (384) Q Consensus 280 l~~~-------~~e~~~~~~~-------~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~-- 343 (384) |-|. +.+....+|| ...+.|.++.+-+.+. +. ..+.|.+..|...+.++++++. T Consensus 76 LfG~sp~Glnatge~d~~nyyd~i~~~Qe~~l~p~le~l~~~~~--------~~--~~~~~~f~pL~~~s~kekAei~~~ 145 (201) T protein:vir:10 76 LKGKNVGGVSASQNTALETFYGYVDRKRKAELLPLLEFLLPFIV--------TE--QEWSVEFNPLSQVSDKDKSEILEK 145 (201) T ss_pred hcCCCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--------CC--CCceEeeCCCCCCCHHHHHHHHHH Confidence 7431 2233333333 3446666665443221 12 2456667888777777776542 Q ss_pred -----H-HHhCCCCCHHHHHHHhCCCCCCC--CCeee-----ecCceeecCCCC Q lcl|NC_019422. 344 -----Q-MVDRGSLTPNEWRKIMNLSPIEN--GDKPV-----RRLDTAVVEGGE 384 (384) Q Consensus 344 -----~-~~~~g~~t~NE~R~~lG~~p~~~--gd~~~-----~~~n~~~~~~ge 384 (384) + ++..|+++++|+|+.|--.+..+ ++... ..-.--|.+.+| T Consensus 146 ~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~~~e~~dp~~~~~ 199 (201) T protein:vir:10 146 NVNSVAALIAAGIIDADEARDTLRAISTEVKIGEGSIQTEVVINESEDPLDVSA 199 (201) T ss_pred HHHHHHHHHHcCCCCHHHHHHHHHhcCCcCCCCCCCCCccccccccCCCCCCCC Confidence 2 46789999999999875544321 11110 001111222233 No 177 >protein:vir:105154 Length: 525 # NCBI annotation: conserved phage-related protein # Family: family:all:6660 # MgeID: mge:1466 # MgeName: C-St # Cross-refs: genbank:acc:YP_398597;genbank:gi:80159853;genbank:GeneID:3772992 Probab=97.72 E-value=6.4e-06 Score=49.05 Aligned_cols=366 Identities=12% Similarity=0.095 Sum_probs=164.4 Q ss_pred Ccchhhhc-cc---------CCCcchhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEE--EEecCCcc Q lcl|NC_019422. 1 MNIFKSKK-KN---------KEAPGKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKH--IRSNETEF 68 (384) Q Consensus 1 M~~f~~~~-~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~--~~~~~~~~ 68 (384) |++|+.-+ +. .+.|...... |......|+...+ .|+...+.|- ++-.+.+++ .++.++- T Consensus 51 ~~~~~ng~i~~v~~~~l~~~f~npd~~~~~-i~~l~~y~yi~~~------~v~ql~~li~-~lp~l~y~i~~~~~~k~~- 121 (525) T protein:vir:10 51 MDLCNNGKIKTVNLDTLQLWFNNPDKYINN-IVNLLTYYYIIDG------NVFQLYDLIF-SLPPLDYQIKVLKRDKDY- 121 (525) T ss_pred HHhhcCCceeeeeHHHHHhhhcChHHHHHH-HHHHHHHhhhhcc------hHHHHHHHHH-hcCCcceeehhhhhccch- Confidence 77776421 11 1111111111 1111111221111 2233333332 334444444 2222211 Q ss_pred eeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEE--------------------EeeCCCCceeeEEEEcCce Q lcl|NC_019422. 69 KTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAV--------------------IIKDDYNMPTQIYPLNALN 128 (384) Q Consensus 69 ~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~--------------------~~~~~~g~~~~l~~l~~~~ 128 (384) +.-...++..-..-.-+.++-+.+..++...|.-... +....+|..+. +++-.+ T Consensus 122 ----~~~~s~~n~~l~k~i~hk~ltrdll~q~a~~gtlig~wlg~~~~py~~vf~~~kyvfp~~r~~g~~v~--vid~~~ 195 (525) T protein:vir:10 122 ----KEDLSTINLYLEKKIQHKQLTRDLLVQLAHSGTLIGTWLGSKREPYFNVFNNLKYVFPYGRAKGKMVA--VIDLQW 195 (525) T ss_pred ----hhHHHHHHHHHHHhHHHHHHHHHHHHHhhccCceeEeeecCCCCcchhhhhhhhhhccccccCCceEE--EEehHH Confidence 1123334433333344556666666666666643211 11111221111 112221 Q ss_pred EEEEEcCCC--------------EEEEEEEcCc------eEEEEehhheEEEeccCCC-CCccCccHHHHHHHHHHHHHH Q lcl|NC_019422. 129 VEAIYENEV--------------LFLKFLLRNG------KIVSYPYSDIIHLRKDFNE-NDLFGTSPAKVLEPIMEVVNT 187 (384) Q Consensus 129 v~~~~~~~~--------------~~~~~~~~~g------~~~~~~~~evih~~~~~~~-~~~~G~s~~~~~~~~i~~~~~ 187 (384) ++-..+... .+..+...+| ..+.+|.+.++|.|...+. +.-.|.|....+...|..... T Consensus 196 f~~~~~~~r~~~~~~lsp~i~~~~y~~~~~~~~~~~~~~r~i~LP~e~t~~lr~~tl~rnqrlG~s~vtp~l~dI~hk~k 275 (525) T protein:vir:10 196 FDEMSELERKLTFENLSPLITENKYKKWKEYNGENEDALRYIMLPISKTLVARIHTLSRNQRLGIPYGTQTLFDIQHKQK 275 (525) T ss_pred hhhhhHHHHHHHHHhhchhhhhhhhhHHhhcccccchhheeeecccceeEEeeecccccCcccCcchhhhHHHHHHHHHH Confidence 111000000 0000111122 2456788999999988775 445699999999999999888 Q ss_pred HHHHHHHHHHccCCcceEEeeCCCCC------hHHHHHHHHHHHHHhccccccCCcceecCCCceeeecc---cch--hH Q lcl|NC_019422. 188 TDQGVVKAIKNSNTIKWLLKFKTALR------PDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVK---AES--YV 256 (384) Q Consensus 188 ~~~~~~~~~~ng~~p~~il~~~~~~~------~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~---~~~--~~ 256 (384) ......+...+-..|-.++++++.-+ +...+++.+..+...........++.++.-. +|..+. .+. .- T Consensus 276 lrd~EqsIA~kii~a~avLk~gg~~gn~mk~p~~~kqkil~gVk~aleK~~kdK~Gi~vi~~P-dfa~~efp~ik~~~~g 354 (525) T protein:vir:10 276 LRDLEQSIADKIIKAMAVLKFRGKDDNDSKVKESAKRKVLAGVKRALEKGVKDKNGIACIAMP-DFATFEFPEIKNGDKT 354 (525) T ss_pred HHHHHHHHHHHhhhhheeeeeccccCccccCchHHHHHHHHHHHHHHhcccccccCeEEEecc-ceeecccccccCcccC Confidence 88877777777778888999976533 2244556666666555433344455553211 233322 111 11 Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHhccccHH--HHH--HHHHHHHHHHHHHHHHHHHhhcccCc-ccccCcceEEeechhh Q lcl|NC_019422. 257 PNAAQMDKAIQRLYSFFNTNEKIIQSKYSE--DEW--NAYYESEIEPVGLQLSNQYTEKLFTR-KARSFGNEIVFEASNL 331 (384) Q Consensus 257 ~~~~~~~~~~~~I~~~fgvp~~~l~~~~~e--~~~--~~~~~~~i~P~~~~i~~~l~~~l~~~-~~~~~~~~i~fd~~~~ 331 (384) .+-.-.+..-.+|-.|+|++-.+++|+..+ .+. .......|.=+++.|++..+ +|+.- ..-..+.-+-|+.+.- T Consensus 355 lDg~K~d~I~~DI~~A~GlS~sL~nGdggNyAtaslnld~fykkigVm~e~Iee~y~-kL~d~Vl~~~k~~nyifnydkd 433 (525) T protein:vir:10 355 LDPKKYDSIDNDITNATGISQVLTNGTKGNYASAKLNLDVFYKKIGVMLEIIEEIYN-QLIDIILGEEKGCNYIFQYNKD 433 (525) T ss_pred CCchhhhhhhhhhhhhhccceeeecCCCCceeeeeeeHHHHHHHHHHHHHHHHHHHH-HHHhhhcCcccCcceEEecCCC Confidence 111122345668999999999999876443 111 12233445556777774433 44431 1112233344555444 Q ss_pred hccCHHHHHHH-HHHHhCCCCCHHHHHHHhCCC--CC-C----------CCCeeeecCceeecC------------CCC Q lcl|NC_019422. 332 QYASMSTKLNL-VQMVDRGSLTPNEWRKIMNLS--PI-E----------NGDKPVRRLDTAVVE------------GGE 384 (384) Q Consensus 332 ~~~d~~~~~~~-~~~~~~g~~t~NE~R~~lG~~--p~-~----------~gd~~~~~~n~~~~~------------~ge 384 (384) ...+.+++.+. +++...|+. .--+....|.. |. + --++..-|.+...+. +++ T Consensus 434 ~pi~~kkk~d~LIkL~d~g~s-~k~vldl~gis~e~y~E~s~yEtE~lkl~EKi~pp~~~~v~SGk~~n~iG~P~~dd~ 511 (525) T protein:vir:10 434 TPIEREKKLDTLIKLEAQGYS-AKYVLDILGISSEEYFEESIYEIEKLKLREKIMPPLNTNVLSGKDGNDIGSPKLDDS 511 (525) T ss_pred chhhhhhhhhhhhhhhccchh-hhhhhhhhccCcchHHHHHHHHHHHHHHhhhccccccceeeeccccccccCCccCCC Confidence 44455555544 344444442 11111122211 10 0 012222333322222 222 No 178 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=97.69 E-value=2.8e-05 Score=45.56 Aligned_cols=364 Identities=10% Similarity=0.046 Sum_probs=179.8 Q ss_pred CcchhhhcccCCC--cchhHHHhhcccc-------------Cc------------------cee---------chhhhhh Q lcl|NC_019422. 1 MNIFKSKKKNKEA--PGKVMMELISDSG-------------NG------------------FYS---------WHGNLYK 38 (384) Q Consensus 1 M~~f~~~~~~~~~--~~~~~~~~~~~~~-------------~~------------------~~~---------~~~~~~~ 38 (384) |++.+....-... ....+..+|.... .+ +.. ....=+. T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~ 80 (474) T protein:vir:94 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLN 80 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccc Confidence 7776643222111 1111111111100 00 000 0000122 Q ss_pred cHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCce Q lcl|NC_019422. 39 SDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMP 118 (384) Q Consensus 39 ~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~ 118 (384) ++....+|+..+.-+-+-|+++--.++. ..++.+...+.+-............+..+...+|.||..+..+..|.+ T Consensus 81 ~n~~~~ivd~~~~yl~g~pv~~~~~~~~----~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~ 156 (474) T protein:vir:94 81 NSFDSEIVDTRVGYLHGVPVTYDLDENA----EKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGDI 156 (474) T ss_pred cchHHHHHHhHhhheeccceeEeeCCCC----cchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCee Confidence 5566777888887777778775221111 111222223333333345667888889999999999999988888864 Q ss_pred eeEEEEcCceEEEEEcCCCEEE----EEEEc---CceE---E-EEehhheEEEecc------------CC---------C Q lcl|NC_019422. 119 TQIYPLNALNVEAIYENEVLFL----KFLLR---NGKI---V-SYPYSDIIHLRKD------------FN---------E 166 (384) Q Consensus 119 ~~l~~l~~~~v~~~~~~~~~~~----~~~~~---~g~~---~-~~~~~evih~~~~------------~~---------~ 166 (384) .+..++|..+.++.+..+... .|... ++.. + .+....+.+++.. ++ . T Consensus 157 -~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~ 235 (474) T protein:vir:94 157 -RIKNIDPYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYNPLFGVP 235 (474) T ss_pred -EEEEEcccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCccceEEec Confidence 577788888877765543211 11111 1111 0 1112222222211 00 1 Q ss_pred CCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCce Q lcl|NC_019422. 167 NDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYD 246 (384) Q Consensus 167 ~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~ 246 (384) +...|.|.+..+...++..........+.+...+.|-.+++ +..++++.....+ ..+.+.+.+++.+ T Consensus 236 n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~-g~~~~~~~~~~~~------------~~~~i~~~~~~~~ 302 (474) T protein:vir:94 236 NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLR-GMGMSEEMIQETQ------------KSGAFELFDKDMD 302 (474) T ss_pred CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc-cCCCCchhhhhhh------------hcceeEecCCCCc Confidence 12358888888888888777766666666666666654442 2333333322211 1234455566777 Q ss_pred eeecccchhHHHH-HHHHHHHHHHHHHhCCCHHHh---ccccHHHH--------------HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 247 AEQVKAESYVPNA-AQMDKAIQRLYSFFNTNEKII---QSKYSEDE--------------WNAYYESEIEPVGLQLSNQY 308 (384) Q Consensus 247 ~~~l~~~~~~~~~-~~~~~~~~~I~~~fgvp~~~l---~~~~~e~~--------------~~~~~~~~i~P~~~~i~~~l 308 (384) ++.+........+ ..++.+.+.|+..-++|.... +++.+..+ ....+...+.-+++.+...+ T Consensus 303 ~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l 382 (474) T protein:vir:94 303 VKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSAL 382 (474) T ss_pred eeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7777655443333 345667778888778876432 23222221 12244555555555555555 Q ss_pred hhcccCcccccCcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--C-----CeeeecCceeecC Q lcl|NC_019422. 309 TEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN--G-----DKPVRRLDTAVVE 381 (384) Q Consensus 309 ~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~--g-----d~~~~~~n~~~~~ 381 (384) +.+-..... .....+++.+..-...|..+.++.+... .|+++..-+.+++++-+.+. . ++--......... T Consensus 383 ~~~~~~~~~-~~~~~i~~~f~~~~p~d~~e~a~~~~kl-~g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~~~~ 460 (474) T protein:vir:94 383 KRKGYNLDD-DSYLNLIFKFTRNIPVNKLEESQVLINL-KGQVSERTRLGQSQLVDDVDYELDEMEKESLEFNDKLPDID 460 (474) T ss_pred hhccCCCCc-cccccceEEeCCCCCCCHHHHHHHHHHH-hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccc Confidence 443111111 1122455666666667778787776433 48899998999887644210 0 0000011111111 Q ss_pred CCC Q lcl|NC_019422. 382 GGE 384 (384) Q Consensus 382 ~ge 384 (384) +|+ T Consensus 461 ~~~ 463 (474) T protein:vir:94 461 EGD 463 (474) T ss_pred CCC Confidence 111 No 179 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=97.69 E-value=2.8e-05 Score=45.56 Aligned_cols=364 Identities=10% Similarity=0.046 Sum_probs=179.8 Q ss_pred CcchhhhcccCCC--cchhHHHhhcccc-------------Cc------------------cee---------chhhhhh Q lcl|NC_019422. 1 MNIFKSKKKNKEA--PGKVMMELISDSG-------------NG------------------FYS---------WHGNLYK 38 (384) Q Consensus 1 M~~f~~~~~~~~~--~~~~~~~~~~~~~-------------~~------------------~~~---------~~~~~~~ 38 (384) |++.+....-... ....+..+|.... .+ +.. ....=+. T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~ 80 (474) T protein:vir:10 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLN 80 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccc Confidence 7776643222111 1111111111100 00 000 0000122 Q ss_pred cHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCce Q lcl|NC_019422. 39 SDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMP 118 (384) Q Consensus 39 ~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~ 118 (384) ++....+|+..+.-+-+-|+++--.++. ..++.+...+.+-............+..+...+|.||..+..+..|.+ T Consensus 81 ~n~~~~ivd~~~~yl~g~pv~~~~~~~~----~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~ 156 (474) T protein:vir:10 81 NSFDSEIVDTRVGYLHGVPVTYDLDENA----EKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGDI 156 (474) T ss_pred cchHHHHHHhHhhheeccceeEeeCCCC----cchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCee Confidence 5566777888887777778775221111 111222223333333345667888889999999999999988888864 Q ss_pred eeEEEEcCceEEEEEcCCCEEE----EEEEc---CceE---E-EEehhheEEEecc------------CC---------C Q lcl|NC_019422. 119 TQIYPLNALNVEAIYENEVLFL----KFLLR---NGKI---V-SYPYSDIIHLRKD------------FN---------E 166 (384) Q Consensus 119 ~~l~~l~~~~v~~~~~~~~~~~----~~~~~---~g~~---~-~~~~~evih~~~~------------~~---------~ 166 (384) .+..++|..+.++.+..+... .|... ++.. + .+....+.+++.. ++ . T Consensus 157 -~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~ 235 (474) T protein:vir:10 157 -RIKNIDPYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYNPLFGVP 235 (474) T ss_pred -EEEEEcccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCccceEEec Confidence 577788888877765543211 11111 1111 0 1112222222211 00 1 Q ss_pred CCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCce Q lcl|NC_019422. 167 NDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYD 246 (384) Q Consensus 167 ~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~ 246 (384) +...|.|.+..+...++..........+.+...+.|-.+++ +..++++.....+ ..+.+.+.+++.+ T Consensus 236 n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~-g~~~~~~~~~~~~------------~~~~i~~~~~~~~ 302 (474) T protein:vir:10 236 NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLR-GMGMSEEMIQETQ------------KSGAFELFDKDMD 302 (474) T ss_pred CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc-cCCCCchhhhhhh------------hcceeEecCCCCc Confidence 12358888888888888777766666666666666654442 2333333322211 1234455566777 Q ss_pred eeecccchhHHHH-HHHHHHHHHHHHHhCCCHHHh---ccccHHHH--------------HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 247 AEQVKAESYVPNA-AQMDKAIQRLYSFFNTNEKII---QSKYSEDE--------------WNAYYESEIEPVGLQLSNQY 308 (384) Q Consensus 247 ~~~l~~~~~~~~~-~~~~~~~~~I~~~fgvp~~~l---~~~~~e~~--------------~~~~~~~~i~P~~~~i~~~l 308 (384) ++.+........+ ..++.+.+.|+..-++|.... +++.+..+ ....+...+.-+++.+...+ T Consensus 303 ~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l 382 (474) T protein:vir:10 303 VKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSAL 382 (474) T ss_pred eeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7777655443333 345667778888778876432 23222221 12244555555555555555 Q ss_pred hhcccCcccccCcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--C-----CeeeecCceeecC Q lcl|NC_019422. 309 TEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN--G-----DKPVRRLDTAVVE 381 (384) Q Consensus 309 ~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~--g-----d~~~~~~n~~~~~ 381 (384) +.+-..... .....+++.+..-...|..+.++.+... .|+++..-+.+++++-+.+. . ++--......... T Consensus 383 ~~~~~~~~~-~~~~~i~~~f~~~~p~d~~e~a~~~~kl-~g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~~~~ 460 (474) T protein:vir:10 383 KRKGYNLDD-DSYLNLIFKFTRNIPVNKLEESQVLINL-KGQVSERTRLGQSQLVDDVDYELDEMEKESLEFNDKLPDID 460 (474) T ss_pred hhccCCCCc-cccccceEEeCCCCCCCHHHHHHHHHHH-hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccc Confidence 443111111 1122455666666667778787776433 48899998999887644210 0 0000011111111 Q ss_pred CCC Q lcl|NC_019422. 382 GGE 384 (384) Q Consensus 382 ~ge 384 (384) +|+ T Consensus 461 ~~~ 463 (474) T protein:vir:10 461 EGD 463 (474) T ss_pred CCC Confidence 111 No 180 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=97.69 E-value=2.8e-05 Score=45.52 Aligned_cols=367 Identities=10% Similarity=-0.008 Sum_probs=161.3 Q ss_pred CcchhhhcccCC-------------------Ccch-h-------HHHhhccccCcce--ec-----hhhhhhcHHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKE-------------------APGK-V-------MMELISDSGNGFY--SW-----HGNLYKSDIVRSII 46 (384) Q Consensus 1 M~~f~~~~~~~~-------------------~~~~-~-------~~~~~~~~~~~~~--~~-----~~~~~~~~~v~~~i 46 (384) ||+|.+.+.... ...+ . +..+..+...... .. ....++.+....+. T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~~~~~i~ 80 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLNLRKLSA 80 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccccceeecCcHHHHH Confidence 999886542211 0001 0 0111111111000 00 00112223334444 Q ss_pred HHHHHhhccC--ceEEEEecCCcce----eccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceee Q lcl|NC_019422. 47 RPKAKAVGKM--TAKHIRSNETEFK----TNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQ 120 (384) Q Consensus 47 ~~ia~~ia~~--~~~~~~~~~~~~~----~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~ 120 (384) ..+|+.+..= .+.+-..+..... ....+.+..++. .......++..+.+.+..|.+++.+..+..+ .. T Consensus 81 ~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~----~n~f~~~~~~~~e~a~a~G~~a~k~~~d~~~--~~ 154 (517) T protein:vir:98 81 DVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQ----HNKFIKNLSDYLEPTFALGGLTVRPYVDNGE--IE 154 (517) T ss_pred HHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHH----hccHHHHHHHHHHHHhhhCCEEEEEEEeCCe--eE Confidence 4555444332 2332100000000 111112222222 2234456666778888899999998886433 23 Q ss_pred EEEEcCceEEEEEc-CCC------------------EEE---EEEEcC------------------------ceEEEE-- Q lcl|NC_019422. 121 IYPLNALNVEAIYE-NEV------------------LFL---KFLLRN------------------------GKIVSY-- 152 (384) Q Consensus 121 l~~l~~~~v~~~~~-~~~------------------~~~---~~~~~~------------------------g~~~~~-- 152 (384) +-.+++..+-+... .++ .+| +++..+ |..+.+ T Consensus 155 I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v~L~~ 234 (517) T protein:vir:98 155 FSWALANAFYPLRSNSNGISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEGEIGKRIPLEE 234 (517) T ss_pred EEEEcCCeeEEEEecCCCeEEEEEEEEEEEeecCCceEEEEEEEEecCceeccCCcEEEEEEEEecCCCccccccccccc Confidence 44555555544211 111 111 111000 111110 Q ss_pred -e---hhh----------eEEEeccCCC----CCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCC-- Q lcl|NC_019422. 153 -P---YSD----------IIHLRKDFNE----NDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTAL-- 212 (384) Q Consensus 153 -~---~~e----------vih~~~~~~~----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~-- 212 (384) + +++ ..|++.+-+. +...|+|.+..+...++.++........-++.|.. +.++ +..+ T Consensus 235 ~~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~-~i~v--p~~~l~ 311 (517) T protein:vir:98 235 LYEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKMGQR-TVFV--SDVMLR 311 (517) T ss_pred cccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCc-ceec--Chhhhc Confidence 0 011 2245443222 34579999999999999988877777777777544 2222 2111 Q ss_pred ---ChHH--HHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHH-HHHHHHHHHHHHHhCCCHHHhccc--- Q lcl|NC_019422. 213 ---RPDD--IKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNA-AQMDKAIQRLYSFFNTNEKIIQSK--- 283 (384) Q Consensus 213 ---~~e~--~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~-~~~~~~~~~I~~~fgvp~~~l~~~--- 283 (384) +... ....-+.-...|.+.... +++..++.++....+.+. ..++...++|+...|+|+..++-. T Consensus 312 ~~~~~~g~~~~~~~d~~~~~y~~~~~~-------~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~ 384 (517) T protein:vir:98 312 TVPDESGMPPPQVFDPDVNVYKSIRMG-------TDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRS 384 (517) T ss_pred cccCCCCcccCCCCCcccceeeeccCC-------CCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCcccccccccc Confidence 0000 000000000111111100 122235666665555454 456788999999999999988732 Q ss_pred --cH-HH-HHHH-----------HHHHHHHHHHHHHHHHHhh-cccCcccccCcceEEeechhhhccCHHHHHHH-HHHH Q lcl|NC_019422. 284 --YS-ED-EWNA-----------YYESEIEPVGLQLSNQYTE-KLFTRKARSFGNEIVFEASNLQYASMSTKLNL-VQMV 346 (384) Q Consensus 284 --~~-e~-~~~~-----------~~~~~i~P~~~~i~~~l~~-~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~-~~~~ 346 (384) ++ |. ...+ .+..+|.-++..+.....- .+++. .......+.+++++-...|.++..+. .+++ T Consensus 385 ~kTATEi~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~-~~~~~~~v~v~f~D~i~~D~~~~~~~~~~~v 463 (517) T protein:vir:98 385 MKTATEIVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGG-EIPSAEHIGVDFDDGVFQDRSALLRFYGQAK 463 (517) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC-CCCCCcceEEEcCCCCCCCHHHHHHHHHHHH Confidence 12 21 0111 2223333333333221111 12221 12234467788888777777766665 4688 Q ss_pred hCCCCCHHHHHHH-hCCCCCC-----------C--CC-eeeecCceeecC-CCC Q lcl|NC_019422. 347 DRGSLTPNEWRKI-MNLSPIE-----------N--GD-KPVRRLDTAVVE-GGE 384 (384) Q Consensus 347 ~~g~~t~NE~R~~-lG~~p~~-----------~--gd-~~~~~~n~~~~~-~ge 384 (384) ..|+|++-+++.+ .|++.-+ . .| ....+..-.++. ++| T Consensus 464 ~aG~ms~~~~i~~~~g~~eeeA~~e~~~i~~E~~~~~~~~~~~~~~~~~~gd~e 517 (517) T protein:vir:98 464 TFGFIPTVEAIQRIFKVPKKTAEQWLEEIRKDQIELDPVTISQRAQKRMFGDEE 517 (517) T ss_pred hcCCCCHHHHHHHhCCCChHHHHHHHHHHHHhccccCCCCccccccCCCCCCCC Confidence 8999999998665 4775421 0 01 111111112222 333 No 181 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=97.65 E-value=3.3e-05 Score=45.18 Aligned_cols=360 Identities=11% Similarity=0.024 Sum_probs=154.3 Q ss_pred Ccchhh----hcc---cCCCcchhHHHhhccccC----cceech---hhhhhcHHHHHHHHHHHHhhccCceEEEEecCC Q lcl|NC_019422. 1 MNIFKS----KKK---NKEAPGKVMMELISDSGN----GFYSWH---GNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNET 66 (384) Q Consensus 1 M~~f~~----~~~---~~~~~~~~~~~~~~~~~~----~~~~~~---~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~ 66 (384) |.-..- +.. .....-.....+..+... +..... ..-....+...+|+..+..+--..|.+ .++ T Consensus 1 ~~t~~~~i~~L~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~---~~d 77 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SED 77 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHhhhhhhcchHHHHHHHHHhhhccCceec---CCC Confidence 222110 000 000000001111111000 000000 000123355667777766553333322 211 Q ss_pred cceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeC------CCCceeeEEEEcCceEEEEEcCCC--E Q lcl|NC_019422. 67 EFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKD------DYNMPTQIYPLNALNVEAIYENEV--L 138 (384) Q Consensus 67 ~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~------~~g~~~~l~~l~~~~v~~~~~~~~--~ 138 (384) . .....+..++.+ | ........+..+.+.+|.||+.+.++ ..|.+ .+.+++|..+.+..+... . T Consensus 78 -~--~~~~~l~~i~~~-N---~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~-~i~~~~p~~~~~~~D~~~~~~ 149 (480) T protein:vir:78 78 -S--EGLEELWNWWQA-N---DLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRR 149 (480) T ss_pred -c--hhHHHHHHHHHh-c---CHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCee-EEEEEcccceEEEEcCCCccc Confidence 1 112223333332 2 34567788889999999999998763 34443 367788888887776431 1 Q ss_pred E---E-EEEEc--C-----------ceEEEEe-------------------hh--heEEEeccCCCCCccCccHHHH-HH Q lcl|NC_019422. 139 F---L-KFLLR--N-----------GKIVSYP-------------------YS--DIIHLRKDFNENDLFGTSPAKV-LE 179 (384) Q Consensus 139 ~---~-~~~~~--~-----------g~~~~~~-------------------~~--evih~~~~~~~~~~~G~s~~~~-~~ 179 (384) . + +|... . +..+.+. -. -|++|.++...++.+|.|-+.. +. T Consensus 150 ~~~~i~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~ 229 (480) T protein:vir:78 150 VTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELR 229 (480) T ss_pred eEEEEEEEEeecCCCceEEEEEEeCCeEEEEEecCCCccccccccccccCCCCCcceEEeecccccCCccCcccchhhHH Confidence 0 0 00000 0 1111110 00 2455554444455688887654 44 Q ss_pred HHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceec-CCCceeeecccchhHHH Q lcl|NC_019422. 180 PIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAAT-DSKYDAEQVKAESYVPN 258 (384) Q Consensus 180 ~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~-~~g~~~~~l~~~~~~~~ 258 (384) ..++.............+-.+.|..++. +....+...+.....|.. . .+.++.+ +++.++.++.....+.- T Consensus 230 ~l~Da~~~~~s~~~~~~~~~a~p~~~i~-G~~~~~~~~~~~~~~~~~-~------~~~~~~~~~~~~~~~~~~~~~~~~~ 301 (480) T protein:vir:78 230 KVTDAASRTLMNLQSASQILGTPLRVIS-GVTTDELTNDGENTTLDI-Y------YGRILTLASEAAKISEFKAAELRNF 301 (480) T ss_pred HHHHHHHHHHHHHHHHHHhhcchhhhhh-cCCccccccccccchhhh-h------hhhhccCCCCCceEEecCccCHHHH Confidence 5555544444444444444445544442 222211111111111111 1 1223333 34567777765544444 Q ss_pred HHHHHHHHHHHHHHhCCCHHHhcccc----HHHHHHH--------------HHHHHHHHHHHHHHHHHhhcccCcccccC Q lcl|NC_019422. 259 AAQMDKAIQRLYSFFNTNEKIIQSKY----SEDEWNA--------------YYESEIEPVGLQLSNQYTEKLFTRKARSF 320 (384) Q Consensus 259 ~~~~~~~~~~I~~~fgvp~~~l~~~~----~e~~~~~--------------~~~~~i~P~~~~i~~~l~~~l~~~~~~~~ 320 (384) ...++..+.+|+..=++|+..+|+.. +..+.+. .+...+.-+++.+.. +.+...... T Consensus 302 ~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~-----~~g~~~~~~ 376 (480) T protein:vir:78 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQ-----IMGREVTEE 376 (480) T ss_pred HHHHHHHHHHHhcccCCChHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HcCCCcccc Confidence 55678888999999999999998642 2211111 111122222222221 112111122 Q ss_pred cceEEeechhhhccCHHHHHHH-HHHHhCC--CCCHHHHHHHhCCCCCCC----------CCeeeecCc----e----ee Q lcl|NC_019422. 321 GNEIVFEASNLQYASMSTKLNL-VQMVDRG--SLTPNEWRKIMNLSPIEN----------GDKPVRRLD----T----AV 379 (384) Q Consensus 321 ~~~i~fd~~~~~~~d~~~~~~~-~~~~~~g--~~t~NE~R~~lG~~p~~~----------gd~~~~~~n----~----~~ 379 (384) ...+++.+.+....+..+.++. .|++..| +++..-+++.+|+.+.+- +...+-..+ - .+ T Consensus 377 ~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~ 456 (480) T protein:vir:78 377 YTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATP 456 (480) T ss_pred ceeeeEEecCCCCCCHHHHHHHHHHHHHhccccCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccccccCCCCC Confidence 2345555555555566666654 4666544 667776777777765421 100000000 0 00 Q ss_pred -cCCCC Q lcl|NC_019422. 380 -VEGGE 384 (384) Q Consensus 380 -~~~ge 384 (384) -+.|| T Consensus 457 ~~~~~~ 462 (480) T protein:vir:78 457 KPTVTE 462 (480) T ss_pred CCCCCC Confidence 00111 No 182 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=97.64 E-value=3.4e-05 Score=45.09 Aligned_cols=364 Identities=11% Similarity=0.070 Sum_probs=169.3 Q ss_pred CcchhhhcccCCC--------------------c-chhH-------HHhhccccCcc---ee----chhhhhhcHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEA--------------------P-GKVM-------MELISDSGNGF---YS----WHGNLYKSDIVRSI 45 (384) Q Consensus 1 M~~f~~~~~~~~~--------------------~-~~~~-------~~~~~~~~~~~---~~----~~~~~~~~~~v~~~ 45 (384) ||+|.+.+..... . .+.. -.+..+-.... .. ......+......+ T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~~~~~~slnl~~~i 80 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNSYGDTQKHELQSVNVTKLA 80 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccccCCCccccceeecchHHHH Confidence 9999975422110 0 0000 01111100000 00 00111223455666 Q ss_pred HHHHHHhhccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEc Q lcl|NC_019422. 46 IRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLN 125 (384) Q Consensus 46 i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~ 125 (384) ++..|+.+..=|-.+- .++ . ...+.+..++ ..-.....++..+.+.+..|.+++.+..+. |. ..+-.++ T Consensus 81 ~~~~A~ll~~e~~~i~-~~d---~-~~~e~l~~i~----~~n~f~~~~~~~~e~a~a~G~~~~k~~~D~-~~-~~i~~v~ 149 (505) T protein:vir:79 81 SAKLASLIFNEQCQVT-VSD---E-TANDFLDDVF----QQNDFYTTFEEKLEEWIALGSGCVRPYVDS-GK-IKLAWAT 149 (505) T ss_pred HHHHHhhhcCCCceee-cCC---h-HHHHHHHHHH----HhccHHHHHHHHHHHHhhcCCeEEEEEEeC-Cc-eEEEEEc Confidence 6777776654443331 111 1 1111222222 222345566777888899999999988864 33 3345556 Q ss_pred CceEEEEE-cCCCE------------------EE---EEEE-cC------------------ceEEEEe--------hh- Q lcl|NC_019422. 126 ALNVEAIY-ENEVL------------------FL---KFLL-RN------------------GKIVSYP--------YS- 155 (384) Q Consensus 126 ~~~v~~~~-~~~~~------------------~~---~~~~-~~------------------g~~~~~~--------~~- 155 (384) |..+-+.. +.++. +| ++.. .+ |..+.+. .+ T Consensus 150 ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~ 229 (505) T protein:vir:79 150 ADQVYPLQADTNQVNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQYEGLEPQ 229 (505) T ss_pred CCeeEEEEEcCCCeEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhcccccccCcc Confidence 66655432 22110 11 1110 00 1111000 00 Q ss_pred ---------heEEEeccCCC----CCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceE-----EeeCCCCChHHH Q lcl|NC_019422. 156 ---------DIIHLRKDFNE----NDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWL-----LKFKTALRPDDI 217 (384) Q Consensus 156 ---------evih~~~~~~~----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~i-----l~~~~~~~~e~~ 217 (384) -..||+.+.+. ....|+|.+..+...++.++.......+-++.|... .+ ++.......+.. T Consensus 230 ~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~-i~v~~~~l~~~~~~~~~~~ 308 (505) T protein:vir:79 230 VKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQRR-LIVPAEWLKTGSSYGGQAS 308 (505) T ss_pred eeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhcccc-eeechHHhcccCCCCcccc Confidence 13455543222 235799999999999999988888888888775442 22 222222111100 Q ss_pred HH---HHHHHHHHhccccccCCcceecCCCceeeecccchhHHHH-HHHHHHHHHHHHHhCCCHHHhccc-----cHHHH Q lcl|NC_019422. 218 KK---EVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNA-AQMDKAIQRLYSFFNTNEKIIQSK-----YSEDE 288 (384) Q Consensus 218 ~~---~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~-~~~~~~~~~I~~~fgvp~~~l~~~-----~~e~~ 288 (384) .. .-+.-...|.+...+ +++..++.++....+.+. ..++...++|+..-|+++..++.. ++.+. T Consensus 309 ~~~~~~fd~~~~~y~~~~~~-------~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei 381 (505) T protein:vir:79 309 ETHPPMFDPDETVYQAMYGD-------ASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQTATEV 381 (505) T ss_pred cccccCCCccceeeeeccCC-------CCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccchHHHH Confidence 00 000000111111111 223457777776555554 456788889999999999988632 11111 Q ss_pred -------------HHHHHHHHHHHHHHHHHHHHhhcccCcc------cccCcceEEeechhhhccCHHHHHH-HHHHHhC Q lcl|NC_019422. 289 -------------WNAYYESEIEPVGLQLSNQYTEKLFTRK------ARSFGNEIVFEASNLQYASMSTKLN-LVQMVDR 348 (384) Q Consensus 289 -------------~~~~~~~~i~P~~~~i~~~l~~~l~~~~------~~~~~~~i~fd~~~~~~~d~~~~~~-~~~~~~~ 348 (384) ....++.+|..++..+........+... .......+.+++++-...|.++..+ ..+++.+ T Consensus 382 ~s~~~~l~~t~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~v~~ 461 (505) T protein:vir:79 382 VTNNSQTYQTRSSYITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQESKRAADLQAVQA 461 (505) T ss_pred HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHHc Confidence 1123455555555555443322111111 1112246777888766677665554 4568889 Q ss_pred CCCCHHHHHHHh-CCCCCCCCCeeeecC---c--e---eecCCCC Q lcl|NC_019422. 349 GSLTPNEWRKIM-NLSPIENGDKPVRRL---D--T---AVVEGGE 384 (384) Q Consensus 349 g~~t~NE~R~~l-G~~p~~~gd~~~~~~---n--~---~~~~~ge 384 (384) |+++.-+++... |++.- ..++-+.-. + . -.--+|| T Consensus 462 Gi~s~e~~l~~~~~~~ee-ea~~el~ri~~E~~~~~p~~~~~gg~ 505 (505) T protein:vir:79 462 QVMPKKQFLMRNYGLDEE-EADEWLAQIDAENSTAEPEFNQFGGD 505 (505) T ss_pred CCCCHHHHHHhcCCCChH-HHHHHHHHHHHhccccCCCchhccCC Confidence 999998887653 55431 111000000 0 0 0112555 No 183 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=97.63 E-value=3.6e-05 Score=44.98 Aligned_cols=361 Identities=7% Similarity=-0.024 Sum_probs=176.1 Q ss_pred CcchhhhcccCC------------CcchhHHHhhcc--------------ccCc----ceec-----hhhhhhcHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKE------------APGKVMMELISD--------------SGNG----FYSW-----HGNLYKSDIVRSI 45 (384) Q Consensus 1 M~~f~~~~~~~~------------~~~~~~~~~~~~--------------~~~~----~~~~-----~~~~~~~~~v~~~ 45 (384) |-.++...-... .....+-.+++. .+.+ .... ...-+..+....+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~ki~~n~~~~I 80 (470) T protein:vir:99 1 MKDINYGRDKVTGNSSFIFPKGEKLTSNELLGFIAYNETVLKPRYRENMKLYLGKHKILTAPEKETGADNRIVVNSAKYV 80 (470) T ss_pred CccccCCcccccCCceEEeCCCCCcCHHHHHHHHHHHHHhhHHHHHHHHHHhccccccccCcccccCCcceeecchHHHH Confidence 666553221100 000111111111 0000 0000 0011224566778 Q ss_pred HHHHHHhhccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEc Q lcl|NC_019422. 46 IRPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLN 125 (384) Q Consensus 46 i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~ 125 (384) ++..+.-+-+-|+++.-.++ .+ . ...+..++. ..........+..+.+.+|.+|+.+..+..|.+ .+..++ T Consensus 81 vd~~~~~l~g~p~~~~~~~d--~~-~-~~~l~~~~~----~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~-~i~~~~ 151 (470) T protein:vir:99 81 VDVYNGYFCGIEPKLALLND--SS-K-IDEIARWNR----QENFFDTINEISKQCDIFGRSIASIYQGEDARP-HLMYSS 151 (470) T ss_pred HHHHhhhhccCCeeEeeCCc--hh-H-HHHHHHHHH----hcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCeE-EEEEEc Confidence 88887777666766522111 11 0 111222222 235667888899999999999999999888875 477789 Q ss_pred CceEEEEEcCCCE------EEEEEEcCc-eE----EEEehhheEEEecc-------------CC---------CCCccCc Q lcl|NC_019422. 126 ALNVEAIYENEVL------FLKFLLRNG-KI----VSYPYSDIIHLRKD-------------FN---------ENDLFGT 172 (384) Q Consensus 126 ~~~v~~~~~~~~~------~~~~~~~~g-~~----~~~~~~evih~~~~-------------~~---------~~~~~G~ 172 (384) |..+.+..+.... +.++...++ .. ..+..+.++++... ++ .+...|. T Consensus 152 p~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~ 231 (470) T protein:vir:99 152 PNHAFIIYDDTVQRQPLAFVHYQIDNSNNWTDAYGVIQYADKFYKFKGYDIEEDTNAAGYAINPYGLVPAVEFFENEERQ 231 (470) T ss_pred cceeEEEEcCCCCcceEEEEEEEEEecCCeeEEEEEEEecCeEEEEEecccccccccccccccCCCccceEeecCCCCCC Confidence 9998887765431 112222211 11 11223333332211 01 1224688 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCccee-----cCCCcee Q lcl|NC_019422. 173 SPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAA-----TDSKYDA 247 (384) Q Consensus 173 s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v-----~~~g~~~ 247 (384) |.+..+...++............+...+.|..++.--.....+.-+ ....+.. .+++. .+.+.++ T Consensus 232 sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~g~-~~~~~~~---------~~~~~~~~~~~~~~~~~ 301 (470) T protein:vir:99 232 GIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFKLPEDDEGN-PKFDFKN---------NRVLYVSQLDPDTNPQI 301 (470) T ss_pred cchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccccccc-hhhhhhh---------cceeeecCCCCCCCCcc Confidence 8898888888888777777777777777776555332111111111 1111111 11221 2345566 Q ss_pred eecccchhHHHHH-HHHHHHHHHHHHhCCCHHHhc---cccHHHH--------------HHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019422. 248 EQVKAESYVPNAA-QMDKAIQRLYSFFNTNEKIIQ---SKYSEDE--------------WNAYYESEIEPVGLQLSNQYT 309 (384) Q Consensus 248 ~~l~~~~~~~~~~-~~~~~~~~I~~~fgvp~~~l~---~~~~e~~--------------~~~~~~~~i~P~~~~i~~~l~ 309 (384) ..+........+. .++.+.+.|+..-++|+...+ ++.+..+ ....+...+.-+++.+...+. T Consensus 302 ~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~ 381 (470) T protein:vir:99 302 GFIAKPDADQMQENLIQHLTDFIFMMAMVPNIQDKNFAGNSSGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLF 381 (470) T ss_pred eEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 6666554443333 356678889888899874332 2222211 122344455555555555444 Q ss_pred hcccCcccccCcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC-CC--------------Ceeeec Q lcl|NC_019422. 310 EKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIE-NG--------------DKPVRR 374 (384) Q Consensus 310 ~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~-~g--------------d~~~~~ 374 (384) ..--... ....+++.+..-...+..+.++.+... .|+++...++++++.-... .. +....+ T Consensus 382 ~~~~~~~---~~~~i~v~f~~~~p~~~~e~a~~~~kl-~giis~et~l~~l~~vd~~~E~eri~~E~~~~~~~~~~~~~~ 457 (470) T protein:vir:99 382 NNKQDQE---LWSELDFKFTRNLPEDMASAIDNAKNA-EGIVSKKTQLGMIPDIEPDAEMKQIAKEKADAIKQTQQLSMP 457 (470) T ss_pred ccCCccc---ccccceEEeCCCCCcCHHHHHHHHHHH-hccCCHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 3322211 122445555666666777777765433 2788988888888764210 00 111112 Q ss_pred CceeecC-CCC Q lcl|NC_019422. 375 LDTAVVE-GGE 384 (384) Q Consensus 375 ~n~~~~~-~ge 384 (384) ......+ ++| T Consensus 458 ~d~~~~d~~~e 468 (470) T protein:vir:99 458 IDILKRDNNAE 468 (470) T ss_pred CCcCCCCCCcc Confidence 2221111 111 No 184 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=97.60 E-value=3.9e-05 Score=44.77 Aligned_cols=372 Identities=10% Similarity=0.072 Sum_probs=175.4 Q ss_pred Ccc--h----------hhhc----c--cCCCc-chhHHHhhccccCccee---c-----hhhhhhcHHHHHHHHHHHHhh Q lcl|NC_019422. 1 MNI--F----------KSKK----K--NKEAP-GKVMMELISDSGNGFYS---W-----HGNLYKSDIVRSIIRPKAKAV 53 (384) Q Consensus 1 M~~--f----------~~~~----~--~~~~~-~~~~~~~~~~~~~~~~~---~-----~~~~~~~~~v~~~i~~ia~~i 53 (384) +.+ | +..+ . ....+ -.....+..+....... . ...-...+....+++..+.-+ T Consensus 27 ~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl 106 (511) T protein:vir:96 27 VVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYF 106 (511) T ss_pred CccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccCcceeecchHHHHHHHHHhhh Confidence 100 0 0000 0 00000 00011111111110000 0 001122456677788888777 Q ss_pred ccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEE Q lcl|NC_019422. 54 GKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIY 133 (384) Q Consensus 54 a~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~ 133 (384) -+-|+++- .+++ .....+. .-+..-........+..++..+|.+|+++.++..|.+ .+.+++|..+.++. T Consensus 107 ~g~p~~~~-~~~~----~~~~~l~----~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~-~i~~~~p~~~~~vy 176 (511) T protein:vir:96 107 LGNPIQYQ-DDDK----DVLEAIE----AFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIY 176 (511) T ss_pred ccCCceee-cCch----HHHHHHH----HHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEccceeEEEE Confidence 77777752 1111 1112222 2233334566778888999999999999999888864 57788999988877 Q ss_pred cCCC---E---EEEEEEc--Cc---e---E-EEEehhheEEEeccCC-------------------------CCCccCcc Q lcl|NC_019422. 134 ENEV---L---FLKFLLR--NG---K---I-VSYPYSDIIHLRKDFN-------------------------ENDLFGTS 173 (384) Q Consensus 134 ~~~~---~---~~~~~~~--~g---~---~-~~~~~~evih~~~~~~-------------------------~~~~~G~s 173 (384) +... . +++|... .+ . . ..+.++.+.++..... .+.-.|.| T Consensus 177 dd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~g 256 (511) T protein:vir:96 177 DNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKG 256 (511) T ss_pred cCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCCceeeEEecCCCCCCC Confidence 6532 1 1122111 10 0 0 1233344444322110 01236889 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccc Q lcl|NC_019422. 174 PAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAE 253 (384) Q Consensus 174 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~ 253 (384) .+..+...++....+.....+.+...+.|-.+++-....+.++.....+...-.........+...-.+++.++..|... T Consensus 257 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 336 (511) T protein:vir:96 257 DYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQ 336 (511) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccCCchhhcccccccceecccccccccccccCCCCcceeEEeec Confidence 99988888888877777777777776666555543333333332222111111111100011111223455666666654 Q ss_pred hhHHHHH-HHHHHHHHHHHHhCCCHHHh---ccccHHHH--------------HHHHHHHHHHHHHHHHHHHHhhcccCc Q lcl|NC_019422. 254 SYVPNAA-QMDKAIQRLYSFFNTNEKII---QSKYSEDE--------------WNAYYESEIEPVGLQLSNQYTEKLFTR 315 (384) Q Consensus 254 ~~~~~~~-~~~~~~~~I~~~fgvp~~~l---~~~~~e~~--------------~~~~~~~~i~P~~~~i~~~l~~~l~~~ 315 (384) .....+. .++.+.+.|+..-++|..-. +++.+..+ ....+...+.-.++.+...+..+--.. T Consensus 337 ~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~ 416 (511) T protein:vir:96 337 YDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSID 416 (511) T ss_pred CCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcc Confidence 4333333 34566777777777776433 23322221 122444555555555555444321111 Q ss_pred ccccCcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC----------CCee-e----ecCceeec Q lcl|NC_019422. 316 KARSFGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN----------GDKP-V----RRLDTAVV 380 (384) Q Consensus 316 ~~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~----------gd~~-~----~~~n~~~~ 380 (384) ... ....+++.+..-...|..+.++.+... .|+++...+.+++++-..|. .+.. . .....-+. T Consensus 417 ~~~-d~~~i~~~f~~~~p~n~~e~~~~~~kl-~G~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 494 (511) T protein:vir:96 417 ANK-DFNTVRYVYNRNLPKSLIEELKAYIDS-GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDI 494 (511) T ss_pred ccc-ccccceEEeCCCCCCCHHHHHHHHHHH-hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCCCC Confidence 111 112345555555566777777765433 58899999888887644210 1100 0 00111112 Q ss_pred CCCC Q lcl|NC_019422. 381 EGGE 384 (384) Q Consensus 381 ~~ge 384 (384) +++| T Consensus 495 ~~~~ 498 (511) T protein:vir:96 495 NDDE 498 (511) T ss_pred CCCC Confidence 2222 No 185 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=97.57 E-value=4.3e-05 Score=44.52 Aligned_cols=353 Identities=10% Similarity=0.023 Sum_probs=165.7 Q ss_pred Ccchhhhccc------CCCcchhHHHhhccccC-------------cceechhhhhhcHHHHHHHHHHHHhhccCceEEE Q lcl|NC_019422. 1 MNIFKSKKKN------KEAPGKVMMELISDSGN-------------GFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHI 61 (384) Q Consensus 1 M~~f~~~~~~------~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~ 61 (384) |......++. ....-.....+..+-.. ........-+..+....+++..+.-+-+-|+++- T Consensus 43 ~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~G~p~~~~ 122 (492) T protein:vir:94 43 ETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFK 122 (492) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHHhhhcccCceec Confidence 2222111100 00000011111111000 0000001112356777888888888877777652 Q ss_pred EecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC--E- Q lcl|NC_019422. 62 RSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV--L- 138 (384) Q Consensus 62 ~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~- 138 (384) - + ... .......++. | ........+..+.+.+|.+|+++..+..|.+ .+..++|..+.+..+... . T Consensus 123 ~-~--d~~--~~~~l~~~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~~~~~~p~~~~~v~d~~~~~~~ 191 (492) T protein:vir:94 123 H-T--DDE--VVKRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEEL 191 (492) T ss_pred c-C--chH--HHHHHHHHHh--c---cHHHHHHHHHHHHhhCCeEEEEEEecCCCce-EEEEEcccceEEEEcCCCCCce Confidence 1 1 111 1112222322 2 3446677788999999999999999888865 477789988887765321 1 Q ss_pred ---EEEEEEcCc-eEEEEehhheEEEecc---------------------CCC---------CCccCccHHHHHHHHHHH Q lcl|NC_019422. 139 ---FLKFLLRNG-KIVSYPYSDIIHLRKD---------------------FNE---------NDLFGTSPAKVLEPIMEV 184 (384) Q Consensus 139 ---~~~~~~~~g-~~~~~~~~evih~~~~---------------------~~~---------~~~~G~s~~~~~~~~i~~ 184 (384) ..+|...+. ....+....+.++... ++. +...|.|.+..+...++. T Consensus 192 ~a~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~sd~e~v~~liDa 271 (492) T protein:vir:94 192 EAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDA 271 (492) T ss_pred EEEEEEEeeccceeEEEEecCeEEEEEEecCeeeeccccccccccccccccCCCccceEEecCCCCCCCchHHHHHHHHH Confidence 111211111 1111122222222110 111 123588889888888888 Q ss_pred HHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHH-HHHH Q lcl|NC_019422. 185 VNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNA-AQMD 263 (384) Q Consensus 185 ~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~-~~~~ 263 (384) ...+.....+.+...+.|..+++-- +.++....... .. ..+++.++++.++..+........+ ..++ T Consensus 272 ~d~~~S~~~~~~~~~~~p~lv~~g~---~~~~~~~~~~~----~~-----~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 339 (492) T protein:vir:94 272 YNRRLSDLSNTFKDSNELTYVLKNY---DDQELPEFKRL----LR-----YYGAIKVSDNGGVDTIQVEVPVENSKKYLD 339 (492) T ss_pred HHHHHHHHHHHHHHhcCceeeeecC---CcccchhhHHH----Hh-----hccceecCCCCcceeEeccCCHHHHHHHHH Confidence 8777777777777777775555321 11111121111 11 1234444555555544433333222 2334 Q ss_pred HHHHHHHHHhCCCH---HHhccccHHHHH--------------HHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEe Q lcl|NC_019422. 264 KAIQRLYSFFNTNE---KIIQSKYSEDEW--------------NAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVF 326 (384) Q Consensus 264 ~~~~~I~~~fgvp~---~~l~~~~~e~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~f 326 (384) .+.+.|+..-++|. .-++++.+..+. ...+...+..+++.+...++.+ .....+++ T Consensus 340 ~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~-------~~~~~i~v 412 (492) T protein:vir:94 340 ELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK-------GEHKDVDI 412 (492) T ss_pred HHHHHHHHHhCCcCCCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-------cccceeeE Confidence 45555655556553 344444332222 1234444555555555444321 12234555 Q ss_pred echhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCee-----eecCceeecCCCC Q lcl|NC_019422. 327 EASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN--GDKP-----VRRLDTAVVEGGE 384 (384) Q Consensus 327 d~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~--gd~~-----~~~~n~~~~~~ge 384 (384) .+..-...+..+.++.+... .|+++..-++++++.-+.+. .+.. -.......+.+++ T Consensus 413 ~f~~~~p~~~~e~~~~~~kl-~giiS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~ 476 (492) T protein:vir:94 413 SFNYNKVANTELQVQTAQQS-MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGG 476 (492) T ss_pred EecCCCCCCHHHHHHHHHHH-hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccccc Confidence 55666667777777766433 38899988888887744211 1000 0001111111111 No 186 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=97.55 E-value=4.7e-05 Score=44.30 Aligned_cols=353 Identities=10% Similarity=0.005 Sum_probs=167.0 Q ss_pred CcchhhhcccC---CCcchh---HHHhhccccCc-------------ceechhhhhhcHHHHHHHHHHHHhhccCceEEE Q lcl|NC_019422. 1 MNIFKSKKKNK---EAPGKV---MMELISDSGNG-------------FYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHI 61 (384) Q Consensus 1 M~~f~~~~~~~---~~~~~~---~~~~~~~~~~~-------------~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~ 61 (384) +.+........ ...-+. ...+..+.... ......+-+..+....+++..+.-+-+-|+++- T Consensus 34 e~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~G~p~~~~ 113 (483) T protein:vir:12 34 ETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFK 113 (483) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHhhhhcccCceec Confidence 12211110000 000000 11111111000 000000112356778888888888877777652 Q ss_pred EecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC--E- Q lcl|NC_019422. 62 RSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV--L- 138 (384) Q Consensus 62 ~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~- 138 (384) - + +.. .......++. | ........+..+.+.+|.+|+.+..+..|.+. +..++|..+.++.+... . T Consensus 114 ~-~--d~~--~~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~y~~v~~d~d~~~~-i~~~~p~~~~~v~d~~~~~~~ 182 (483) T protein:vir:12 114 H-T--DDE--VVKRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFK-LFRVPAEQGIPIWTDKEHEEL 182 (483) T ss_pred c-C--ChH--HHHHHHHHHh--c---cHHHHHHHHHHHHhhCCeEEEEEEEcCCCceE-EEEEcccceEEEEcCCCCCce Confidence 1 1 111 1112222322 2 23456667788999999999999999888754 77889998888765321 1 Q ss_pred ---EEEEEEcCc-eEEEEehhheEEEecc---------------------CCC---------CCccCccHHHHHHHHHHH Q lcl|NC_019422. 139 ---FLKFLLRNG-KIVSYPYSDIIHLRKD---------------------FNE---------NDLFGTSPAKVLEPIMEV 184 (384) Q Consensus 139 ---~~~~~~~~g-~~~~~~~~evih~~~~---------------------~~~---------~~~~G~s~~~~~~~~i~~ 184 (384) ..+|...+. ....+.+..+.|+... ++. +...|.|.+..+...++. T Consensus 183 ~~~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa 262 (483) T protein:vir:12 183 EAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDA 262 (483) T ss_pred EEEEEEEEeecceEEEEEecCeEEEEEEeCCeeeecccccccccccccccCCCCccceEEecCCCCCCCchhhHHHHHHH Confidence 111211111 1111222222222100 000 123688888888888887 Q ss_pred HHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHH-HHHH Q lcl|NC_019422. 185 VNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNA-AQMD 263 (384) Q Consensus 185 ~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~-~~~~ 263 (384) .........+.+...+.|..+++-.. .+..+...... . ..+++-++++.++..+........+ ..++ T Consensus 263 ~d~~~S~~~~~~~~~~~~~lv~~g~~---~~~~~~~~~~~----~-----~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 330 (483) T protein:vir:12 263 YNRRLSDLSNTFKDSNELTYVLTNYD---DQELPEFKRLL----R-----YYGAIKVSDNGGVDTIQVEVPVENSKKYLD 330 (483) T ss_pred HHHHHHHHHHHHHHhcCceeeeecCC---cccchhHHHhh----h-----hccccccCCCCcceEEeecCCHHHHHHHHH Confidence 77767666666666667755553211 11111211111 1 1224444555555555544333333 3345 Q ss_pred HHHHHHHHHhCCCHH---HhccccHHHHH--------------HHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEe Q lcl|NC_019422. 264 KAIQRLYSFFNTNEK---IIQSKYSEDEW--------------NAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVF 326 (384) Q Consensus 264 ~~~~~I~~~fgvp~~---~l~~~~~e~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~f 326 (384) .+.+.|+..-++|.. -++++.+..+. ...+...+..+++.+...++.+ .....+++ T Consensus 331 ~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~~~-------~~~~~i~v 403 (483) T protein:vir:12 331 ELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK-------GEHKDVDI 403 (483) T ss_pred HHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-------CccceeeE Confidence 566677777777753 23333332221 2234445555555555444321 12234555 Q ss_pred echhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CC-----eeeecCceeecCCCC Q lcl|NC_019422. 327 EASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN--GD-----KPVRRLDTAVVEGGE 384 (384) Q Consensus 327 d~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~--gd-----~~~~~~n~~~~~~ge 384 (384) .+..-...|..+.++.+... .|++|..-++++++.-+.+. .+ +--...+...+.+++ T Consensus 404 ~f~~~~p~~~~~~a~~~~kl-~GiiS~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~ 467 (483) T protein:vir:12 404 SFNYNKVANTELQVQTAQQS-MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGG 467 (483) T ss_pred EeCCCCCCCHHHHHHHHHHH-hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccccc Confidence 56666667777777766433 48899988888887643210 00 000001111111111 No 187 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=97.52 E-value=5.1e-05 Score=44.10 Aligned_cols=366 Identities=10% Similarity=0.064 Sum_probs=168.3 Q ss_pred Ccchh---------h-hcccCCCc---chhHHHhhccccCcce-------e---chhhhhhcHHHHHHHHHHHHhhccCc Q lcl|NC_019422. 1 MNIFK---------S-KKKNKEAP---GKVMMELISDSGNGFY-------S---WHGNLYKSDIVRSIIRPKAKAVGKMT 57 (384) Q Consensus 1 M~~f~---------~-~~~~~~~~---~~~~~~~~~~~~~~~~-------~---~~~~~~~~~~v~~~i~~ia~~ia~~~ 57 (384) |.=+. + ........ -.....+..+...... . ....-+..+....+++..+.-+.+-| T Consensus 23 ~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~ 102 (481) T protein:vir:10 23 VSDLAELLKEENLRNFISRHQTEQVPRLEMLESYYLNRNTDILAGERRLQKYGDKADHRAVHNYAKYVSRFIVGYLTGNP 102 (481) T ss_pred eecchhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCccccccccccccceeecchHHHHHHHHHhhhccCC Confidence 11111 0 00000000 0001111111100000 0 00011345677888888888777777 Q ss_pred eEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC Q lcl|NC_019422. 58 AKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV 137 (384) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~ 137 (384) +.+-- +++ ...+.+..++.. .....+...+..+.+.+|.+|+.+..+..|.+ .+..++|..+.++.+... T Consensus 103 ~~~~~-~d~----~~~~~l~~~~~~----n~~~~~~~~~~~~~~~~G~~~~~~~~d~dg~~-~i~~~~p~~~~~v~d~~~ 172 (481) T protein:vir:10 103 ITITH-QDN----QTNDKIIELNDL----NDADEVNSDLALNLSIYGRAYEIVYRDFEDRD-TFKVLDPKSTFVVYDQTL 172 (481) T ss_pred ceEec-CCh----hHHHHHHHHHHh----cChhHHHHHHHHHHHhcCeEEEEEEeCCCCeE-EEEEEcccceEEEEcCCC Confidence 66421 111 112233334433 24557888899999999999999999888865 477889998888766532 Q ss_pred E------EEEEEE--cCceE----EEEehhheEEEeccC-----------C---------CCCccCccHHHHHHHHHHHH Q lcl|NC_019422. 138 L------FLKFLL--RNGKI----VSYPYSDIIHLRKDF-----------N---------ENDLFGTSPAKVLEPIMEVV 185 (384) Q Consensus 138 ~------~~~~~~--~~g~~----~~~~~~evih~~~~~-----------~---------~~~~~G~s~~~~~~~~i~~~ 185 (384) . ..+|.. ..+.. ..+.++.+.|++... + .+...|.|.+..+...++.. T Consensus 173 ~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~lida~ 252 (481) T protein:vir:10 173 DKKVVAGVRYFEKQDKDKVPVQHVEVYTTDKIYYIEIKGGTYHRVEEVEHYYNDVPIIEYLNDQFKQGDFENVIALIDLY 252 (481) T ss_pred CCceEEEEEEEEEeeCCCceEEEEEEEecCeEEEEEecCCceeecccccccCCceeEEEeecCCCCCCchhhHHHHHHHH Confidence 1 111111 11111 123344444442211 0 11236788887777777766 Q ss_pred HHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHH-HHHH Q lcl|NC_019422. 186 NTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAA-QMDK 264 (384) Q Consensus 186 ~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~-~~~~ 264 (384) ..........+...+.|..++.-....+++..+..+..-. +.. . ........+++.++..+........+. .++. T Consensus 253 ~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~--~~~-~-~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 328 (481) T protein:vir:10 253 DSAQSDTANYMTDLNDAMLAIIGNVDLDSEDAKAFRDANM--IHL-E-PGTNANGSEGKAEVKYVYKQYDVAGVEAYKKR 328 (481) T ss_pred HHHHHHHHHHHHHhcCceeEeecCcCCCccchhhhhhccc--eec-c-ccccccCCCCCcceeEEeecCCHHHHHHHHHH Confidence 6655555555555566655554322333333333222100 000 0 000011122344455454443333333 3466 Q ss_pred HHHHHHHHhCCCHHHhc---cccHHHHHH--------------HHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEee Q lcl|NC_019422. 265 AIQRLYSFFNTNEKIIQ---SKYSEDEWN--------------AYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFE 327 (384) Q Consensus 265 ~~~~I~~~fgvp~~~l~---~~~~e~~~~--------------~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd 327 (384) +.+.|+..-++|....+ ++.+..+.. ..+...+.-+++.+...++..-.... ....+++. T Consensus 329 l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~---~~~~i~v~ 405 (481) T protein:vir:10 329 LQNDIHKYTNTPDLNDEQFSGVQSGESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLTGLKQH---NYAELTIT 405 (481) T ss_pred HHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcc---ccceeeEE Confidence 77778888888865432 222221111 12223333333333333332211111 12244555 Q ss_pred chhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--C--------------CCeeee---cCceeecCCCC Q lcl|NC_019422. 328 ASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIE--N--------------GDKPVR---RLDTAVVEGGE 384 (384) Q Consensus 328 ~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~--~--------------gd~~~~---~~n~~~~~~ge 384 (384) +......|..+.++.+... .|+++...+.+++++-..+ . .+.... ..+....++|+ T Consensus 406 f~~~~~~~~~~~a~~~~kl-~g~is~et~~~~l~~i~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~dd~~ 480 (481) T protein:vir:10 406 FTPNLPKSMMESINAFNAL-SGGVSESTRLSLLDFIDNPKEELEKMQEEEAQREKQADKRGYGEAFENHLNVDDSN 480 (481) T ss_pred eCCCCCcCHHHHHHHHHHH-hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhhhccCCccCCCCCCCCCCC Confidence 5666667777777765433 3788887778877763211 0 011111 11222334444 No 188 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=97.51 E-value=5.3e-05 Score=44.04 Aligned_cols=366 Identities=10% Similarity=0.002 Sum_probs=168.7 Q ss_pred CcchhhhcccC---C--Ccchh------HHHhhccccCcce---------echhhhhhcHHHHHHHHHHHHhhccCceEE Q lcl|NC_019422. 1 MNIFKSKKKNK---E--APGKV------MMELISDSGNGFY---------SWHGNLYKSDIVRSIIRPKAKAVGKMTAKH 60 (384) Q Consensus 1 M~~f~~~~~~~---~--~~~~~------~~~~~~~~~~~~~---------~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~ 60 (384) |+.-+..+... . .+... +-.+..+....+. ......+.......+++..|+-+..=|..+ T Consensus 16 ~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~s~n~~~~iv~~~a~~l~~ep~~i 95 (499) T protein:vir:80 16 MGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVNRRQLSMNLPKVTAKYMSKLLFNEKVKI 95 (499) T ss_pred hccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCccccceeecchHHHHHHHHHHhhhCCcceE Confidence 55433322111 1 11100 1111111111000 001223344566778888888887665554 Q ss_pred EEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEE- Q lcl|NC_019422. 61 IRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLF- 139 (384) Q Consensus 61 ~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~- 139 (384) -- + +......+.+-...-.....++.++.+.+..|.+|+.+..|.+|.+. +-.++|..+-+...+.+.. T Consensus 96 ~~-~--------d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~-i~~v~a~~~~Pi~~d~~~~~ 165 (499) T protein:vir:80 96 NI-D--------DETAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVK-VSFATADCMYPLSNDSENVD 165 (499) T ss_pred ee-C--------CHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCcEEEEEEECCCCcEE-EEEEcCCceEEEEecCCCeE Confidence 11 1 11122222222222345566777778888999999999998887643 5667777766543222211 Q ss_pred --------------E---EEEEc-C---------------------ceEEEEe-------h-------h--heEEEeccC Q lcl|NC_019422. 140 --------------L---KFLLR-N---------------------GKIVSYP-------Y-------S--DIIHLRKDF 164 (384) Q Consensus 140 --------------~---~~~~~-~---------------------g~~~~~~-------~-------~--evih~~~~~ 164 (384) | ++..- + |..+.+. + . -+.|++.+- T Consensus 166 ~~~f~~~~~~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~p~f~~~~~~~ 245 (499) T protein:vir:80 166 ECLIANSFHKNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVVPLPSLTRPTFIYIKPNI 245 (499) T ss_pred EEEEEEEEeecCeEEEEEEEEEecccceeeEEEEEEEEeccCccccCcccchhhhccCcCCceeecCCCccceEeecCCc Confidence 1 01000 0 1111000 0 0 034454432 Q ss_pred C----CCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEE-----eeCCCCChHHHHHHHHHHHHHhccccccC Q lcl|NC_019422. 165 N----ENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLL-----KFKTALRPDDIKKEVKSFEKNYLQIDSEA 235 (384) Q Consensus 165 ~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il-----~~~~~~~~e~~~~~~~~~~~~~~~~~~~~ 235 (384) + .....|.|.+..+...++.++.......+-++.+ ....++ ......+.+..... ..-.+.|.... T Consensus 246 ~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~-~~~i~v~~~~l~~~~~~~g~~~~~~-~~~~~~~~~~~--- 320 (499) T protein:vir:80 246 ANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLG-KKKVLVPSSFVKTAVNLDGSTTQYF-DSTDEAFFLYQ--- 320 (499) T ss_pred cccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhc-ccceecchhhhhccCCCCCCcccCC-CcccceeeEee--- Confidence 1 1335699999999999999888877777777764 333333 11111110000000 00001111100 Q ss_pred CcceecCCCceeeecccchhHHH-HHHHHHHHHHHHHHhCCCHHHhccccH----HHHH--------------HHHHHHH Q lcl|NC_019422. 236 GGAAATDSKYDAEQVKAESYVPN-AAQMDKAIQRLYSFFNTNEKIIQSKYS----EDEW--------------NAYYESE 296 (384) Q Consensus 236 ~~~~v~~~g~~~~~l~~~~~~~~-~~~~~~~~~~I~~~fgvp~~~l~~~~~----e~~~--------------~~~~~~~ 296 (384) ...-+++-.++.++....+.. ...++...++|...-|+++..+|.+.. ..+. ...++.+ T Consensus 321 --~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~~~~l~~~~~~~~~~~~~~ 398 (499) T protein:vir:80 321 --GEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQLIEQG 398 (499) T ss_pred --ccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 001122234666666554443 456677888999999999998873211 1111 1122333 Q ss_pred HHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHH-HHHHhCCCCCHHHHHHHh-CCCCCCCCCeee-- Q lcl|NC_019422. 297 IEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNL-VQMVDRGSLTPNEWRKIM-NLSPIENGDKPV-- 372 (384) Q Consensus 297 i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~-~~~~~~g~~t~NE~R~~l-G~~p~~~gd~~~-- 372 (384) +..++..+....+..............+.+++++-...|.++.++. .+++..|+++.-.++... |.+.. .+++-+ T Consensus 399 l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d~-ea~~el~~ 477 (499) T protein:vir:80 399 IKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTINRYTTAKNQGMIPLKIALQRAWNITEA-EADEWAEM 477 (499) T ss_pred HHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHhhcCCCChH-HHHHHHHH Confidence 3333333332222111111112223456777776555666655554 567888999998887654 55431 111110 Q ss_pred --------ecCceeecCCCC Q lcl|NC_019422. 373 --------RRLDTAVVEGGE 384 (384) Q Consensus 373 --------~~~n~~~~~~ge 384 (384) .|.+-..--.|| T Consensus 478 i~~E~~~~~~~~d~~g~~ge 497 (499) T protein:vir:80 478 LAKEKQAEIPNNDMTGIFGE 497 (499) T ss_pred HHHHhhcCCCCCCccccCCC Confidence 111100111233 No 189 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=97.51 E-value=5.3e-05 Score=44.04 Aligned_cols=376 Identities=10% Similarity=0.052 Sum_probs=178.3 Q ss_pred Ccchh--------hhcccCCCcchh----HHH------hhccccCccee--------------chhhhhhcHHHHHHHHH Q lcl|NC_019422. 1 MNIFK--------SKKKNKEAPGKV----MME------LISDSGNGFYS--------------WHGNLYKSDIVRSIIRP 48 (384) Q Consensus 1 M~~f~--------~~~~~~~~~~~~----~~~------~~~~~~~~~~~--------------~~~~~~~~~~v~~~i~~ 48 (384) +.-=. .+.....+|... ..+ ..++....+++ .-+....+|.|.+||+. T Consensus 17 ~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~y~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~e 96 (524) T protein:vir:98 17 AREDEIELEQQLKNDTGSVAPPKNNDGAYEIETDLNNQKYAGVFQQFYSGQDPAIQNKEQLINTYRGIMSYPEVENAVSE 96 (524) T ss_pred hhhhhhhHhhhhcCCcccccCCCCCCCceeecCCCCcceecceeeeeccccccccchHHHHHHHHHHHhhccchhhHHHh Confidence 11111 111111111000 000 00000000010 00233568999999999 Q ss_pred HHHhhccC-----ceEEEEecCCcceeccchH---HHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCc--e Q lcl|NC_019422. 49 KAKAVGKM-----TAKHIRSNETEFKTNPEIY---IKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNM--P 118 (384) Q Consensus 49 ia~~ia~~-----~~~~~~~~~~~~~~~~~~~---~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~--~ 118 (384) |.+.+.-+ |+.+--.+-+-.+...... +..++ ..+++..--+.++..|.+.|..|+.++-+++.. . T Consensus 97 IVneaIv~~~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il----~ll~F~~~~~~~fR~WYVDgRi~fhkiid~~~~kGI 172 (524) T protein:vir:98 97 IIDDAIVNEQGKDIITMDLAKTNFSKAIQDKIVEEFDNVL----NIYDFDNMGARLFRDWYVDSRIYFHKIMHKDESKGI 172 (524) T ss_pred hhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHH----HHhccchhhhHHHhhhhhcceeEEEEEEcCCCCcce Confidence 99887533 2222111111111111111 11111 123344445566778889999999999664433 8 Q ss_pred eeEEEEcCceEEEEEc-------CCCE-------EEEEEEc------------CceEEEEehhheEEEeccCC-CCCccC Q lcl|NC_019422. 119 TQIYPLNALNVEAIYE-------NEVL-------FLKFLLR------------NGKIVSYPYSDIIHLRKDFN-ENDLFG 171 (384) Q Consensus 119 ~~l~~l~~~~v~~~~~-------~~~~-------~~~~~~~------------~g~~~~~~~~evih~~~~~~-~~~~~G 171 (384) .+|+.|||..++.++. .+.. ++.|... -++.+.++.+-|.|...+-- .+.. = T Consensus 173 ~ELr~lDPr~i~~vr~~~~~~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~dAIvy~hSGL~d~~~~-i 251 (524) T protein:vir:98 173 RELRQLDPRCMELIRESITETLDGGVKVFRGYREFFVYSAPKAGYTYNGQIYQANQKIKIPRSAIVYAHSGLEDCSNN-I 251 (524) T ss_pred eeeeeeCCccceeeeeccccccccchhhccceeeeeeeccCCCccccccceecCCCceeechhheeeeccCcccCCCC-e Confidence 8999999999987641 1111 1222211 12346677888888765532 2222 2 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHHHHHHHHHHHHhc---------cccccCCcce-- Q lcl|NC_019422. 172 TSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDIKKEVKSFEKNYL---------QIDSEAGGAA-- 239 (384) Q Consensus 172 ~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~~~~~~~~~~---------~~~~~~~~~~-- 239 (384) +|-+..|.+.+..+.-...+..=+--..+.-+-++.++ |.+....+++....+-.+|+ |...+..+.+ T Consensus 252 isyLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGevrddrk~msM 331 (524) T protein:vir:98 252 IGYLHRAVKPANQLRLLEDAMVIYRITRAPERRVFYIDVGQMGGNKATQYVNNIAQGLKNRVVYDARTGTVKNQQNNLSM 331 (524) T ss_pred eeehhHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeeccCceeeccccccch Confidence 58889999988888877777666655555556677776 44554455555555555554 2122222221 Q ss_pred ------ec---CCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhccc--------cHHHHH-----HHHHHHHH Q lcl|NC_019422. 240 ------AT---DSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQSK--------YSEDEW-----NAYYESEI 297 (384) Q Consensus 240 ------v~---~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~~--------~~e~~~-----~~~~~~~i 297 (384) +- +.|.+++.|.-...-.++...++..+.+..+++||.+-|... .+|-.+ .-|+..-= T Consensus 332 lEDyWLpRReGgrgTEItTLpggqnlgem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItRDEiKF~KFI~rLR 411 (524) T protein:vir:98 332 TEDYWLMRRDGKAITEVSTLPGGQNFSDMDDIKWFNRKLYEALRVPLSRMPRDDGGMQIGGGGEITRDELKFSKFIRTLQ 411 (524) T ss_pred hhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCceeccCCCCccccccccchhHHHHHHHHHHHHHH Confidence 11 234555555545555567778899999999999999877421 122222 22332222 Q ss_pred HHHHHHHHHHHhhcccCccc--------ccCcceEEeechh----hhccC-HHHHHHHHHH---HhCCCCCHHHHHHH-h Q lcl|NC_019422. 298 EPVGLQLSNQYTEKLFTRKA--------RSFGNEIVFEASN----LQYAS-MSTKLNLVQM---VDRGSLTPNEWRKI-M 360 (384) Q Consensus 298 ~P~~~~i~~~l~~~l~~~~~--------~~~~~~i~fd~~~----~~~~d-~~~~~~~~~~---~~~g~~t~NE~R~~-l 360 (384) .-+...+.+.|..+|+.+.- ......+.|..|. ++... +..|+.+++. +.+-+++.+=+|+. | T Consensus 412 ~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~IL 491 (524) T protein:vir:98 412 IQFSPVLSDPLKTNLIAKKIITEDEWEENVSKISFVFQQDSYYAEVKDIEILERRLNLMSQVEGVVGKYVSHKYIMKEIL 491 (524) T ss_pred HHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhccccccccchHHHHHHHh Confidence 22333344444444444311 1122344443333 22222 3344444432 23346677666543 3 Q ss_pred CCCCCC----------CCCeeeecCceeec-CCCC Q lcl|NC_019422. 361 NLSPIE----------NGDKPVRRLDTAVV-EGGE 384 (384) Q Consensus 361 G~~p~~----------~gd~~~~~~n~~~~-~~ge 384 (384) .+.-.+ ..+....+ .|- +.+| T Consensus 492 r~tDeei~~~~k~I~~E~k~~~~~---~p~~e~~~ 523 (524) T protein:vir:98 492 RMSDEDIDEQAKLIEEESKEERFK---NPEAEEEN 523 (524) T ss_pred ccCHHHHHHHHHHHHHHHhCCCCc---CCcccccc Confidence 332110 01111111 111 1111 No 190 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=97.49 E-value=5.7e-05 Score=43.84 Aligned_cols=372 Identities=10% Similarity=0.082 Sum_probs=175.3 Q ss_pred Ccchhhhccc------CCCc-chhHHHhhccccCcce---ec-----hhhhhhcHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_019422. 1 MNIFKSKKKN------KEAP-GKVMMELISDSGNGFY---SW-----HGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNE 65 (384) Q Consensus 1 M~~f~~~~~~------~~~~-~~~~~~~~~~~~~~~~---~~-----~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 65 (384) |.......+. ...+ -.....+..+...... .. ...-...+....+++..+.-+-+-|+++-- ++ T Consensus 39 ~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~-~d 117 (511) T protein:vir:10 39 LQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQD-DD 117 (511) T ss_pred ccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeec-Cc Confidence 1111100000 0000 0011112221111000 00 001122456677788888777777777521 11 Q ss_pred CcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC---EE--- Q lcl|NC_019422. 66 TEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV---LF--- 139 (384) Q Consensus 66 ~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~---~~--- 139 (384) + .....+..++. .-........+..+++.+|.||.++..++.|.+ .+..++|..+.++.+... .. T Consensus 118 ~----~~~~~l~~~~~----~n~~~~~~~~~~~~~~i~G~ay~~vy~dedg~~-~i~~~~p~~~~~vydd~~~~~~~~~v 188 (511) T protein:vir:10 118 K----DVLEAIEAFND----LNDVESHNRSLGLDLSIYGKAYEIMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGV 188 (511) T ss_pred h----HHHHHHHHHHh----hcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEccceeEEEEcCCCCCceEEEE Confidence 1 11122233322 224556777788999999999999999888864 467788998888776532 11 Q ss_pred EEEEEc--Cc---e---E-EEEehhheEEEeccC----------------C---------CCCccCccHHHHHHHHHHHH Q lcl|NC_019422. 140 LKFLLR--NG---K---I-VSYPYSDIIHLRKDF----------------N---------ENDLFGTSPAKVLEPIMEVV 185 (384) Q Consensus 140 ~~~~~~--~g---~---~-~~~~~~evih~~~~~----------------~---------~~~~~G~s~~~~~~~~i~~~ 185 (384) .+|... .+ . . ..+.++.+.++.... + .+.-.|.|.+..+...++.. T Consensus 189 r~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f~nn~~g~gd~e~v~~liDa~ 268 (511) T protein:vir:10 189 RYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLY 268 (511) T ss_pred EEEEeeecccCccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcceeEEEecCCCCCCCchhhhHHHHHHH Confidence 111111 10 0 1 123344444442211 0 01135888899888888888 Q ss_pred HHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHH-HHHHH Q lcl|NC_019422. 186 NTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNA-AQMDK 264 (384) Q Consensus 186 ~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~-~~~~~ 264 (384) ..+.....+.+...+.|-.+++-....++++....++...-.........+...-.+++.+++.+........+ ..++. T Consensus 269 d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~ 348 (511) T protein:vir:10 269 DNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDR 348 (511) T ss_pred HHHHHHHHHHHHHhhCceeeeeccccCCchhhccchhccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHH Confidence 77776667777776666555543333333333322221111111100011112223456677766654444333 33456 Q ss_pred HHHHHHHHhCCCHHH---hccccHHHH--------------HHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEee Q lcl|NC_019422. 265 AIQRLYSFFNTNEKI---IQSKYSEDE--------------WNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFE 327 (384) Q Consensus 265 ~~~~I~~~fgvp~~~---l~~~~~e~~--------------~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd 327 (384) +.+.|+..-++|..- ++++.+..+ ....+...+.-.++.+...+...--... ......+++. T Consensus 349 L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~-~~d~~~i~i~ 427 (511) T protein:vir:10 349 LNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA-NKDFNTVRYV 427 (511) T ss_pred HHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCccc-ccccceeeEE Confidence 677777777777633 223322221 1224444555555555554443211111 1112345666 Q ss_pred chhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC----------CCee-eecCce----eecCCCC Q lcl|NC_019422. 328 ASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN----------GDKP-VRRLDT----AVVEGGE 384 (384) Q Consensus 328 ~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~----------gd~~-~~~~n~----~~~~~ge 384 (384) +..-...|..+.++.+... .|+++..-+.+++++-+.|. .+.. ....+. -..++++ T Consensus 428 f~~~~p~d~~~~~~~~~kl-~G~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 498 (511) T protein:vir:10 428 YNRNLPKSLIEELKAYIDS-GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDE 498 (511) T ss_pred eCCCCCcCHHHHHHHHHHH-hccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhhcccCCCCCCCCC Confidence 6766677788887765433 37889888888887643210 1100 000111 1112222 No 191 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=97.46 E-value=6.2e-05 Score=43.65 Aligned_cols=379 Identities=10% Similarity=0.045 Sum_probs=179.0 Q ss_pred Ccchhh-----hcccCCCcchhH------HHhhcc-ccCcceech-------------hhhhhcHHHHHHHHHHHHhhcc Q lcl|NC_019422. 1 MNIFKS-----KKKNKEAPGKVM------MELISD-SGNGFYSWH-------------GNLYKSDIVRSIIRPKAKAVGK 55 (384) Q Consensus 1 M~~f~~-----~~~~~~~~~~~~------~~~~~~-~~~~~~~~~-------------~~~~~~~~v~~~i~~ia~~ia~ 55 (384) --||+. ++..+....... .....+ .+..+.... +....+|.|.+||+.|.+.+.- T Consensus 3 ~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~g~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv 82 (537) T protein:vir:10 3 QQLFGFSLQRAKKVPKGPSFVQKDSLDGSQPIVGGGYFGYSVDFDGTIRNDHELITRYREMVLNPECDSAVDDVVNETIC 82 (537) T ss_pred cccccceeecccccccCCcccCCCcccccceeecccccccccccccccchHHHHHHHHHHHhhccchhhHHHHhhcceeE Confidence 223442 222222111010 011111 111111111 1234589999999999988864 Q ss_pred Cc-----eEEEEecCCcceeccchH---HHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC---CceeeEEEE Q lcl|NC_019422. 56 MT-----AKHIRSNETEFKTNPEIY---IKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDY---NMPTQIYPL 124 (384) Q Consensus 56 ~~-----~~~~~~~~~~~~~~~~~~---~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~---g~~~~l~~l 124 (384) +. +.+--.+-+..+...... +..++ ..+++..--+.++..|.+.|..|+.++-+.. .-..+|+.| T Consensus 83 ~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il----~ll~F~~~~~e~fR~WYVDgRi~fhKiid~k~pk~GI~ELr~l 158 (537) T protein:vir:10 83 GNFDDVPISIDLHNLKQSEKIKKLIRSEFDEIL----RLLDFDNRAYEIFRRWYVDGRLFFHKVIDPKKPRQGLVELRYV 158 (537) T ss_pred ecCCCceEEEEecccccchHHHHHHHHHHHHHH----HHhccchhhhHHHhhheeeeEEEEEEEEeCCCccccceeeeee Confidence 42 222111111122111111 11111 1233444455667788899999999977533 248899999 Q ss_pred cCceEEEEEcC---C--CEE------------E-EEEEc-------CceEEEEehhheEEEecc--CCCCCccCccHHHH Q lcl|NC_019422. 125 NALNVEAIYEN---E--VLF------------L-KFLLR-------NGKIVSYPYSDIIHLRKD--FNENDLFGTSPAKV 177 (384) Q Consensus 125 ~~~~v~~~~~~---~--~~~------------~-~~~~~-------~g~~~~~~~~evih~~~~--~~~~~~~G~s~~~~ 177 (384) ||..++.++.. . +.. . +|.++ .+..+.++. +.|++-+. -..+..+.+|-+.. T Consensus 159 DPr~i~~vR~i~~~~~~~~~~~~~~~~v~~~~~eyf~ynp~g~~~~~~~~vkI~~-dAI~y~hSGl~d~n~~~i~syLhk 237 (537) T protein:vir:10 159 DPRKIRKVTEYEAKRPEALRTQDLNQQLTQQSASYFLYNPKGLKNSTNQGMKIAP-DSIAYCHSGIQDLNKNMVLSHLHK 237 (537) T ss_pred CCccceeeEeecccCCccceEEecceeeeecccceeeeccccccccCCCceeccH-hheeeecccceeCCCCeeeeeehh Confidence 99999766542 1 110 0 11111 122334444 55555432 12345678899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHHHHHHHHHHHHhcc---------ccccCCcc--------e Q lcl|NC_019422. 178 LEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDIKKEVKSFEKNYLQ---------IDSEAGGA--------A 239 (384) Q Consensus 178 ~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~~~~~~~~~~~---------~~~~~~~~--------~ 239 (384) |.+.+..+.-...+..=+--..+.-+-++.++ +.+....+++....+-.+|+. ...+..+. + T Consensus 238 AiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWL 317 (537) T protein:vir:10 238 AIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWL 317 (537) T ss_pred hhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhhhhhcc Confidence 99999988888777666655555556677776 445444455555555554442 11111111 1 Q ss_pred ec---CCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhcc-------ccHHHHH-----HHHHHHHHHHHHHHH Q lcl|NC_019422. 240 AT---DSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQS-------KYSEDEW-----NAYYESEIEPVGLQL 304 (384) Q Consensus 240 v~---~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~-------~~~e~~~-----~~~~~~~i~P~~~~i 304 (384) +- +.|.+++.|.-...-.++...++..+.+..+++||.+-|.. ..+|-.+ .-|+..-=.-+...+ T Consensus 318 PRReGgrgTEItTLpGgqnlgem~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF 397 (537) T protein:vir:10 318 PRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELF 397 (537) T ss_pred cccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHH Confidence 11 23455555554455556777889999999999999987642 1222222 123222112223333 Q ss_pred HHHHhhcccCc-----ccc---cCcceEEeechh----hhccC-HHHHHHHHHH---HhCCCCCHHHHHHH-h------- Q lcl|NC_019422. 305 SNQYTEKLFTR-----KAR---SFGNEIVFEASN----LQYAS-MSTKLNLVQM---VDRGSLTPNEWRKI-M------- 360 (384) Q Consensus 305 ~~~l~~~l~~~-----~~~---~~~~~i~fd~~~----~~~~d-~~~~~~~~~~---~~~g~~t~NE~R~~-l------- 360 (384) .+.|..+|+.+ .+. .....+.|..|. ++... +..|+.+++. +.+-.++.+=+|+. | T Consensus 398 ~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~dyi~k~ILr~tDeeI 477 (537) T protein:vir:10 398 VDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSANYIRTKVLKQTESEI 477 (537) T ss_pred HHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccchHHHHHHHhccCHHHH Confidence 44444443332 111 123334443333 22221 3334443322 22234444444321 1 Q ss_pred -------------CCCCCCC---------C-CeeeecCceeec------------CCCC Q lcl|NC_019422. 361 -------------NLSPIEN---------G-DKPVRRLDTAVV------------EGGE 384 (384) Q Consensus 361 -------------G~~p~~~---------g-d~~~~~~n~~~~------------~~ge 384 (384) |+=+.|. | ..++.|....|- ++|| T Consensus 478 ~~~~k~I~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 536 (537) T protein:vir:10 478 KEIDKEIKQEIADGVIMDPQAMQAMEMGIGDEEPVPEGGEEPQTDPNSAVSPADQKRGE 536 (537) T ss_pred HHHHHHHHHHhhCCCCCCcccccccccCCCCcccCCCCCCCcccCCccCCCCCCccCCC Confidence 1111111 1 233444444332 2233 No 192 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=97.44 E-value=6.8e-05 Score=43.44 Aligned_cols=353 Identities=10% Similarity=0.033 Sum_probs=165.4 Q ss_pred CcchhhhcccCCC---cchh---HHHhhccccCcce-------------echhhhhhcHHHHHHHHHHHHhhccCceEEE Q lcl|NC_019422. 1 MNIFKSKKKNKEA---PGKV---MMELISDSGNGFY-------------SWHGNLYKSDIVRSIIRPKAKAVGKMTAKHI 61 (384) Q Consensus 1 M~~f~~~~~~~~~---~~~~---~~~~~~~~~~~~~-------------~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~ 61 (384) +.+.......... .-.. ...+..+-..... .....-+..+....+|+..+.-+-+-|+++- T Consensus 23 ~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~~~~~~ 102 (472) T protein:vir:93 23 ETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFK 102 (472) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhccccccccccccccccchHHHHHHHHhhhhcccCeeec Confidence 1112211110000 0000 1111111100000 0000012346778888888888877776652 Q ss_pred EecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC--E- Q lcl|NC_019422. 62 RSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV--L- 138 (384) Q Consensus 62 ~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~- 138 (384) - + +.. ..+....++. | ........+..+.+.+|.||+.+..++.|.+ .+..++|..+.++.+... . T Consensus 103 ~-~--d~~--~~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~d~~~-~i~~~~p~~~~~i~d~~~~~~~ 171 (472) T protein:vir:93 103 H-T--DDE--VVKRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEEL 171 (472) T ss_pred c-C--ChH--HHHHHHHHHh--c---cHHHHHHHHHHHHhhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCCCce Confidence 1 1 111 1112222322 2 2446666778999999999999999888865 477789988888765321 1 Q ss_pred ---EEEEEEcCce-EEEEehhheEEEecc---------------------CC---------CCCccCccHHHHHHHHHHH Q lcl|NC_019422. 139 ---FLKFLLRNGK-IVSYPYSDIIHLRKD---------------------FN---------ENDLFGTSPAKVLEPIMEV 184 (384) Q Consensus 139 ---~~~~~~~~g~-~~~~~~~evih~~~~---------------------~~---------~~~~~G~s~~~~~~~~i~~ 184 (384) ..+|...+.. ...+.+..+.++... ++ .+...|.|.+..+...++. T Consensus 172 ~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~s~~e~v~~liDa 251 (472) T protein:vir:93 172 EAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDA 251 (472) T ss_pred EEEEEEEEeecceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecCCCCCCCchhhhHHHHHH Confidence 1111111111 111111222121100 00 0123688999988888888 Q ss_pred HHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHH-HHHH Q lcl|NC_019422. 185 VNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNA-AQMD 263 (384) Q Consensus 185 ~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~-~~~~ 263 (384) .........+.+...+.|..+++-- . .++..... .... ..+++.++++.++..+........+ ..++ T Consensus 252 ~~~~~s~~~~~~~~~~~~~~~~~g~-~--~~~~~~~~----~~~~-----~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 319 (472) T protein:vir:93 252 YNRRLSDLSNTFKDSNELTYVLTNY-D--DQELPEFK----RLLR-----YYGAIKVSDNGGVDTIQVEVPVENSKKYLD 319 (472) T ss_pred HHHHHHHHHHHHHHhcCceeEeecC-C--cccchhhH----HHHh-----hccccccCCCCcceeEeecCCHHHHHHHHH Confidence 7777766677777777775555321 1 11111111 1111 1234444555555555544333333 3345 Q ss_pred HHHHHHHHHhCCCHH---HhccccHHHHH--------------HHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEe Q lcl|NC_019422. 264 KAIQRLYSFFNTNEK---IIQSKYSEDEW--------------NAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVF 326 (384) Q Consensus 264 ~~~~~I~~~fgvp~~---~l~~~~~e~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~f 326 (384) .+.+.|+..-++|.. .++++.+..+. ...+...+.-+++.+...++.. .....+++ T Consensus 320 ~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~-------~~~~~i~v 392 (472) T protein:vir:93 320 ELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK-------GEHKDVDI 392 (472) T ss_pred HHHHHHHHHhCCCCCCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-------cccceeeE Confidence 566677777777753 33343332221 1233344444444444443321 12234555 Q ss_pred echhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--C----------CeeeecCceeecCCCC Q lcl|NC_019422. 327 EASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN--G----------DKPVRRLDTAVVEGGE 384 (384) Q Consensus 327 d~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~--g----------d~~~~~~n~~~~~~ge 384 (384) .+..-...|..+.++.+... .|+++..-+.+++++-..+. . -+...+.+..-.++++ T Consensus 393 ~f~~~~p~~~~~~~~~~~k~-~giis~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~ 461 (472) T protein:vir:93 393 SFNYNKVANTELQVQTAQQS-MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQ 461 (472) T ss_pred EeCCCCCCCHHHHHHHHHHH-hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhccCcCcccCCCCC Confidence 55666666777777765433 47888888888887643210 0 0111111111112221 No 193 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=97.43 E-value=6.9e-05 Score=43.39 Aligned_cols=353 Identities=10% Similarity=0.021 Sum_probs=165.2 Q ss_pred Ccchh-hhccc---CCCcchh---HHHhhccccCc-------------ceechhhhhhcHHHHHHHHHHHHhhccCceEE Q lcl|NC_019422. 1 MNIFK-SKKKN---KEAPGKV---MMELISDSGNG-------------FYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKH 60 (384) Q Consensus 1 M~~f~-~~~~~---~~~~~~~---~~~~~~~~~~~-------------~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~ 60 (384) +-+.. ...+- -...-+. ...+..+-... .......-..++....+++..+.-+-+-|+++ T Consensus 42 ~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~ 121 (492) T protein:vir:97 42 PETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAF 121 (492) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHHHhhhhcccCcee Confidence 11111 00000 0000000 11111111000 00000011235677788888888887777765 Q ss_pred EEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC--E Q lcl|NC_019422. 61 IRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV--L 138 (384) Q Consensus 61 ~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~ 138 (384) -- ++ .. .......++. | ...+....+..+.+.+|.||+++..+..|.+ .+..++|..+.+..+... . T Consensus 122 ~~-~d--~~--~~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~d~dg~~-~~~~~~p~~~~~i~d~~~~~~ 190 (492) T protein:vir:97 122 KH-TD--DE--VVKRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEE 190 (492) T ss_pred cc-Cc--hH--HHHHHHHHHh--c---cHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEEcccceEEEEcCCCCCc Confidence 21 11 11 1122222322 2 2345666778999999999999999888865 477789988888766421 1 Q ss_pred ----EEEEEEcCceE-EEEehhheEEEecc---------------------CCC---------CCccCccHHHHHHHHHH Q lcl|NC_019422. 139 ----FLKFLLRNGKI-VSYPYSDIIHLRKD---------------------FNE---------NDLFGTSPAKVLEPIME 183 (384) Q Consensus 139 ----~~~~~~~~g~~-~~~~~~evih~~~~---------------------~~~---------~~~~G~s~~~~~~~~i~ 183 (384) ..+|...+... ..+.+..+.++... +++ +...|.|.+..+...++ T Consensus 191 ~~~~vr~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liD 270 (492) T protein:vir:97 191 LEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLID 270 (492) T ss_pred eEEEEEEEeeccceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecCCCCCCCchHhHHHHHH Confidence 11111111111 11122222222110 001 12358888988888888 Q ss_pred HHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHH-HH Q lcl|NC_019422. 184 VVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAA-QM 262 (384) Q Consensus 184 ~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~-~~ 262 (384) ....+.....+.+.....|..+++-. ..++... +..... ..+++.++++.++..+........+. .+ T Consensus 271 a~d~~~S~~~~~~~~~~~~~l~~~g~---~~~~~~~----~~~~~~-----~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 338 (492) T protein:vir:97 271 AYNRRLSDLSNTFKDSNELTYVLKNY---DDQELPE----FKRLLR-----YYGAIKVSDNGGVDTIQVEVPVENSKKYL 338 (492) T ss_pred HHHHHHHHHHHHHHHhccceeeeecC---Ccccchh----HHHHHh-----hccceecCCCCcceeEeccCCHHHHHHHH Confidence 87777766677777766665554321 1111112 222111 12244455555555555443333332 34 Q ss_pred HHHHHHHHHHhCCCH---HHhccccHHHHH--------------HHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEE Q lcl|NC_019422. 263 DKAIQRLYSFFNTNE---KIIQSKYSEDEW--------------NAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIV 325 (384) Q Consensus 263 ~~~~~~I~~~fgvp~---~~l~~~~~e~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~ 325 (384) +.+.+.|+..-++|. .-++++.+..+. ...+...+..+++.+...++.+ .....++ T Consensus 339 ~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~-------~~~~~i~ 411 (492) T protein:vir:97 339 DELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK-------GEHKDVD 411 (492) T ss_pred HHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-------cccceee Confidence 555666666666664 334443332222 1233444555555554443321 1223455 Q ss_pred eechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCee-----eecCceeecCCCC Q lcl|NC_019422. 326 FEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN--GDKP-----VRRLDTAVVEGGE 384 (384) Q Consensus 326 fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~--gd~~-----~~~~n~~~~~~ge 384 (384) +.+..-...+..+.++.+... .|+++..-+.++++.-+.+. .+.+ -...+...+.+++ T Consensus 412 v~f~~~~p~~~~e~a~~~~kl-~G~iS~et~l~~l~~v~d~~~Eleri~~E~~~~~~~~~~~~~~~ 476 (492) T protein:vir:97 412 ISFNYNKVANTELQVQTAQQS-MGIVSHETVLENHPFVEDLQAELERIEQEQTEYNKQLPNLDDGG 476 (492) T ss_pred EEecCCCCCCHHHHHHHHHHH-hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCC Confidence 555665666777777765433 47889888888887643211 1000 0011111111111 No 194 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=97.39 E-value=7.9e-05 Score=43.08 Aligned_cols=369 Identities=9% Similarity=0.000 Sum_probs=156.2 Q ss_pred Ccchhhhccc---CCCcchhHHHhhccccCc--ce-echhhh----hhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_019422. 1 MNIFKSKKKN---KEAPGKVMMELISDSGNG--FY-SWHGNL----YKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKT 70 (384) Q Consensus 1 M~~f~~~~~~---~~~~~~~~~~~~~~~~~~--~~-~~~~~~----~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~ 70 (384) +.+...+... ....-.....+..+.... .. .....+ ....+...||+.+|+.+.--.|.+ .+ +. T Consensus 17 ~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~~~v~nw~~~~Vd~~a~rl~~~Gf~~---~d-~~-- 90 (474) T protein:vir:81 17 NALINGLLAQIENLRWKNLLRTSYYENKRTIQYVGTLIPPQYFNLGLVLGWTGKAVDALARRCNLEGFVW---PD-GD-- 90 (474) T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHhccCCChhhccccccHHHHHHHhhcChHHHHHHHHHhhhcccceEC---CC-CC-- Confidence 1111110000 000000111111111000 00 000110 123455667777776554333432 21 11 Q ss_pred ccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCce-eeEEEEcCceEEEEEcCCCEE------EEEE Q lcl|NC_019422. 71 NPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMP-TQIYPLNALNVEAIYENEVLF------LKFL 143 (384) Q Consensus 71 ~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~-~~l~~l~~~~v~~~~~~~~~~------~~~~ 143 (384) ..+..++.++. -| ........+..+.+.+|.+|+.+..++.|.+ ..+.+++|..+....|..... +... T Consensus 91 ~~~~~l~~iw~-~N---~ld~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp~~~~~~~D~~~~~~~~al~~~~~ 166 (474) T protein:vir:81 91 LDSLGGTEVVD-DN---HLLSEIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDASEATGEWNRRRRGLNNLLSIIDK 166 (474) T ss_pred ccchHHHHHHH-hc---ChhHHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceEEEEEeCCCCcceeeeEEEEE Confidence 11222334443 22 2335666778899999999999998877764 457788998888766543211 0111 Q ss_pred EcCceE---EEEehhh-------------------------eEEEeccCCCCCccCccHH----HHHHHHHHHHHHHHHH Q lcl|NC_019422. 144 LRNGKI---VSYPYSD-------------------------IIHLRKDFNENDLFGTSPA----KVLEPIMEVVNTTDQG 191 (384) Q Consensus 144 ~~~g~~---~~~~~~e-------------------------vih~~~~~~~~~~~G~s~~----~~~~~~i~~~~~~~~~ 191 (384) ..+|+. ..+.++. |+++.+....++.+|.|.+ ..+.+.+.....-... T Consensus 167 ~~~g~~~~~~ly~~~~~~~~~~~~~~~~w~~~~~~~~~gvPvV~~~n~~~~~~~~G~s~i~e~v~~l~da~~r~~~~~~~ 246 (474) T protein:vir:81 167 DKEGKVLSLALYLDNETVTAQRDKATLKWQVDRDEHVYGVPAQVLPYKPAPKRPFGQSRITKPMMGLQDAGVRELARREG 246 (474) T ss_pred cCCCcEEEEEEEeCCcEEEEEEcCccceeeeccCCCCCCcceEEecccccccCcCCccccchhHHHHHHHHHHHHHHHHH Confidence 112211 0111122 4555444444556788855 3333433333333333 Q ss_pred HHHHHHccCCcc-eEEeeCCC-CChHHHHHHHHHHHHHhc---cccccCCcceecCCCceeeecccchhHHHHHHHHHHH Q lcl|NC_019422. 192 VVKAIKNSNTIK-WLLKFKTA-LRPDDIKKEVKSFEKNYL---QIDSEAGGAAATDSKYDAEQVKAESYVPNAAQMDKAI 266 (384) Q Consensus 192 ~~~~~~ng~~p~-~il~~~~~-~~~e~~~~~~~~~~~~~~---~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~~~~~~ 266 (384) ...++. .|. +++..... ..+++ .+....|+.... ....+..+-.....+.++-++...+-......++.++ T Consensus 247 ~~e~~a---~pqr~i~G~~~~~~~d~d-~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~~a~l~~~~~~l~~~~ 322 (474) T protein:vir:81 247 HMDVFS---YPEFWLLGADESALKNAD-GTIKSVWEARLGRIKGLPDDADADIPQLARADVKQFPAASPDAHWSDINGLA 322 (474) T ss_pred HHHHhc---chhheeecCChhhccccc-ccccchhhhhHHHHhcCCCcccccccccccccccccCCCChhHHHHHHHHHH Confidence 444433 343 34333211 11111 111122333221 1222222222233455666766554444445578889 Q ss_pred HHHHHHhCCCHHHhcc---cc---HHHHHH---H----------HHHHHHHHHHHHHHHHHhhcccCcccccCcceEEee Q lcl|NC_019422. 267 QRLYSFFNTNEKIIQS---KY---SEDEWN---A----------YYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFE 327 (384) Q Consensus 267 ~~I~~~fgvp~~~l~~---~~---~e~~~~---~----------~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd 327 (384) ..+|+.=++|+..||- +| ++.... . .+...+.-+++.........-. .........+++. T Consensus 323 ~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~~-~~~~~~~~~~~v~ 401 (474) T protein:vir:81 323 KLFAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKVAI-DEIPDEWKSIDAK 401 (474) T ss_pred HHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCc-cccchhhccceeE Confidence 9999999999999872 22 111111 1 1112222222111111111000 0001122344555 Q ss_pred chhhhccCHHHHHHHH-HHHhCC--CCCHHHHHHHhCCCCCC---CC---Ce--eeecCceeec-----CCCC Q lcl|NC_019422. 328 ASNLQYASMSTKLNLV-QMVDRG--SLTPNEWRKIMNLSPIE---NG---DK--PVRRLDTAVV-----EGGE 384 (384) Q Consensus 328 ~~~~~~~d~~~~~~~~-~~~~~g--~~t~NE~R~~lG~~p~~---~g---d~--~~~~~n~~~~-----~~ge 384 (384) +.+....+..++++.+ |++..| +.+-.=+++++|+.+.+ +. ++ ...+..-... ..+| T Consensus 402 W~d~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~t~~~i~~~~~~~~~~~~~~~~~~l~~~~~~~~~aq 474 (474) T protein:vir:81 402 WRDPRYLSKSAQADAGMKQLAAVPWLAETEVGLELIGLTPQQARRAMADKRRVQGRGTLQALIDRSNNGATAQ 474 (474) T ss_pred ecCCCccCHHHHHHHHHHHHhcccCCCcHHHHHhhcCCCHHHHHHHHHHHHHHhHHHHHHHHHhcCCCCCCCC Confidence 5666667778887765 567665 44445568888998742 11 00 0001110000 1111 No 195 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=97.34 E-value=8.9e-05 Score=42.79 Aligned_cols=361 Identities=7% Similarity=0.051 Sum_probs=165.7 Q ss_pred CcchhhhcccCCC---cchhHHHhhccccCccee------chhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceec Q lcl|NC_019422. 1 MNIFKSKKKNKEA---PGKVMMELISDSGNGFYS------WHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTN 71 (384) Q Consensus 1 M~~f~~~~~~~~~---~~~~~~~~~~~~~~~~~~------~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~ 71 (384) ..+..+..+.... .-.....+..+....... ....=+..+....+|+..+.-+-+-|+.+--.+ + . T Consensus 19 ~~~l~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~d----~-~ 93 (453) T protein:vir:39 19 NEVVTKFMEKHRLEVARYEYLKNMYRGIMAIDAEPTKDLWKPDNRLTVNFTKYIVDTFTGYFNGIPVKKSHSD----K-E 93 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccCchhcCCCccccCccceeecchHHHHHHHHhhhhcccCceeccCC----h-H Confidence 0001100000000 000011111110000000 000012345777888888888777777652111 1 1 Q ss_pred cchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCE--E---EEEEEcC Q lcl|NC_019422. 72 PEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVL--F---LKFLLRN 146 (384) Q Consensus 72 ~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~--~---~~~~~~~ 146 (384) ....+..++.. .........+..+.+.+|.||+.+..+..|.+ .+..++|..+.++.+.... . +++.... T Consensus 94 ~~~~l~~i~~~----N~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~ 168 (453) T protein:vir:39 94 TLSKLQEFDNL----NDMEDEESELAKMACIYGRAFELLYQNEETQT-NVIYNTPENMFMVYDDTIKQEPLFAVRYGYDD 168 (453) T ss_pred HHHHHHHHHHh----cChhHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEEcccceEEEecCCCCCeEEEEEEEEEeC Confidence 11223333322 24456778888999999999999999988865 4667888888887764321 1 1111111 Q ss_pred ceE---EEEehhheEEEeccC-----------C---------CCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcc Q lcl|NC_019422. 147 GKI---VSYPYSDIIHLRKDF-----------N---------ENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIK 203 (384) Q Consensus 147 g~~---~~~~~~evih~~~~~-----------~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~ 203 (384) +.. ..+.++.+.++.... + .+...|.|.+..+...++..........+.+...+.|. T Consensus 169 ~~~~~~~~yt~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~ 248 (453) T protein:vir:39 169 DYKLYGEVYTKETTYALNGTMGFYNMTEQAPNPFDDLPVVEFYFNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQY 248 (453) T ss_pred CeEEEEEEEeCCeEEEEEecCCceeeecccccCCCceeEEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCce Confidence 110 112222222222110 0 11236888888888888777776666666666666675 Q ss_pred eEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHH-HHHHHHHHHHHHhCCCH---HH Q lcl|NC_019422. 204 WLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAA-QMDKAIQRLYSFFNTNE---KI 279 (384) Q Consensus 204 ~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~-~~~~~~~~I~~~fgvp~---~~ 279 (384) .++. +..++++..+..+.. .......+ ...+.+.++..+........+. .++.+.+.|+..-++|. .. T Consensus 249 ~~~~-g~~~~~~~~~~~~~~---~~~~~~~~----~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~ 320 (453) T protein:vir:39 249 LTFL-GAAVEEEDLKNIRSN---RVINYYGE----SSEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDES 320 (453) T ss_pred eeee-cCCCCchhhhhhhhc---ceeeecCC----CCCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc Confidence 5543 344454444332211 01111000 1112344444454433333332 34555666666666653 22 Q ss_pred hccccHHH-------------HHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHHHH Q lcl|NC_019422. 280 IQSKYSED-------------EWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQMV 346 (384) Q Consensus 280 l~~~~~e~-------------~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~ 346 (384) +|....++ .....+...+..+++.+...++..- .. .....+.+.+..-...|..+.++.+... T Consensus 321 ~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~-~~---~~~~~i~v~f~~~~p~~~~~~a~~~~kl 396 (453) T protein:vir:39 321 FGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVS-NK---EAWKDIEYTFTRNEPKDIKEQAETANIL 396 (453) T ss_pred ccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-Cc---cccccceEEeCCCCCcCHHHHHHHHHHH Confidence 33222221 1123445556666665555444321 11 1112344445555667777777766433 Q ss_pred hCCCCCHHHHHHHhCCCCCCC----------CCeeee-cCceeecCCCC Q lcl|NC_019422. 347 DRGSLTPNEWRKIMNLSPIEN----------GDKPVR-RLDTAVVEGGE 384 (384) Q Consensus 347 ~~g~~t~NE~R~~lG~~p~~~----------gd~~~~-~~n~~~~~~ge 384 (384) .|+++..-+.++++.-+.+. .+.... ..+....++.+ T Consensus 397 -~g~is~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 444 (453) T protein:vir:39 397 -MGITSQETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQPSEKGTD 444 (453) T ss_pred -hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCC Confidence 47899988888887644210 100000 11111112111 No 196 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=97.23 E-value=0.00012 Score=42.01 Aligned_cols=371 Identities=12% Similarity=0.071 Sum_probs=172.8 Q ss_pred Ccchhhhccc----CCCcc---hhHHHhhccccCcc-----ee---chhhhhhcHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_019422. 1 MNIFKSKKKN----KEAPG---KVMMELISDSGNGF-----YS---WHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNE 65 (384) Q Consensus 1 M~~f~~~~~~----~~~~~---~~~~~~~~~~~~~~-----~~---~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 65 (384) ++.+...++. ....- .....+..+..... .. ....-...+....+++..+.-+-+-|+++--.++ T Consensus 38 ~~~~~~i~~~i~~~~~~~~~r~~~~~~yY~g~~~~i~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~~~ 117 (501) T protein:vir:96 38 VNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDN 117 (501) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccCccccCccccccceeecchHHHHHHHHhhhhcccCeeEeeCCc Confidence 3332211111 00000 01111111110000 00 0011134567788888888888777877632222 Q ss_pred CcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCC--CEE---- Q lcl|NC_019422. 66 TEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENE--VLF---- 139 (384) Q Consensus 66 ~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~--~~~---- 139 (384) +. .+....++.+....-........+..+.+.+|.+|+.+.++..|.+ .+..++|..+.++.+.. +.. T Consensus 118 ~~-----~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~-~i~~~~p~~~~~v~d~~~~~~~~~~v 191 (501) T protein:vir:96 118 DD-----NSQNDDAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRSEYDET-RIKRLSPLETFVIYDNSLEDNSIAAV 191 (501) T ss_pred cc-----hhHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcCCCce-EEEEEccceeEEEEcCCCCCceEEEE Confidence 11 1222233333333345667888899999999999999999988865 47778999998887653 111 Q ss_pred EEEEEc--Cce--EE-EEehhheEEEecc----------CC---------CCCccCccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 140 LKFLLR--NGK--IV-SYPYSDIIHLRKD----------FN---------ENDLFGTSPAKVLEPIMEVVNTTDQGVVKA 195 (384) Q Consensus 140 ~~~~~~--~g~--~~-~~~~~evih~~~~----------~~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~ 195 (384) .+|... .+. .+ .+.++.+.++... ++ .+...|.|.+..+...++..........+. T Consensus 192 ~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~s~~~~~ 271 (501) T protein:vir:96 192 RYYNRGTLQSAKDVVEIYTDEHIYTLDASDDFNEISVTTHAFGTVPITEYLNNIDGIGDYETELYLIDLYDSAESDTANH 271 (501) T ss_pred EEEEeecCCCcEEEEEEEcCCcEEEEeeCCCceeccccccCCCccceEEecCCccCCCchhhhHHHHHHHHHHHHHHHHH Confidence 111111 111 11 1223333333211 00 112368899988888888887777777777 Q ss_pred HHccCCcceEEeeCCCCC-hHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHH-HHHHHHHHHHHHh Q lcl|NC_019422. 196 IKNSNTIKWLLKFKTALR-PDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAA-QMDKAIQRLYSFF 273 (384) Q Consensus 196 ~~ng~~p~~il~~~~~~~-~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~-~~~~~~~~I~~~f 273 (384) +...+.|..++.-..... ++.....+. ...-.. ...+.......+.++..+........+. .++.+.+.|+..- T Consensus 272 ~~~~~~~~l~i~G~~~~~~~~~~~~~~~---~~~~~~-~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s 347 (501) T protein:vir:96 272 MSDMADAILAIYGDLALPKGMQASDMKR---TRLMQL-KPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFT 347 (501) T ss_pred HHHhcCceeeeecccccCcccchhhhhh---cCeeee-cccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHh Confidence 776666655553221111 111111111 000000 0111111223444555555443333333 3455667777777 Q ss_pred CCCHHHh---ccccHHHHH--------------HHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCH Q lcl|NC_019422. 274 NTNEKII---QSKYSEDEW--------------NAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASM 336 (384) Q Consensus 274 gvp~~~l---~~~~~e~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~ 336 (384) ++|.... +++.+..+. ...+...+.-+++.+...++..--.. ......+++.+......+. T Consensus 348 ~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~--~~d~~~i~i~f~~~~p~n~ 425 (501) T protein:vir:96 348 NTPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVNEFK--DFDESLLKITFTPNLPKSL 425 (501) T ss_pred CCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc--ccccccceEEeCCCCCcCH Confidence 7775433 233222211 12344445555554444443321111 1111234455566666777 Q ss_pred HHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC------------CCeeeecCceee-------------cCCCC Q lcl|NC_019422. 337 STKLNLVQMVDRGSLTPNEWRKIMNLSPIEN------------GDKPVRRLDTAV-------------VEGGE 384 (384) Q Consensus 337 ~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~------------gd~~~~~~n~~~-------------~~~ge 384 (384) .+.++.+... .|+++..-+.+++++-..|. .+.-..+..+.+ .++|| T Consensus 426 ~e~ad~~~kl-~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~e~~~d~~e 497 (501) T protein:vir:96 426 NEQVSILTGL-GGQVSQETALSLSGLVESPNEELDKINKEMSEIDFKGYSNDFNEHVGKYTDEVKETHTDDFE 497 (501) T ss_pred HHHHHHHHHH-hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHhhccccccchhhcccccCCcCCCCCCCccc Confidence 7777765433 27777777777765532110 111111122211 12222 No 197 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=97.17 E-value=0.00014 Score=41.64 Aligned_cols=366 Identities=11% Similarity=0.077 Sum_probs=168.9 Q ss_pred CcchhhhcccCCC-------------c-------chhHHHhhcc----ccCc-----cee-----chhhhhhcHHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEA-------------P-------GKVMMELISD----SGNG-----FYS-----WHGNLYKSDIVRSII 46 (384) Q Consensus 1 M~~f~~~~~~~~~-------------~-------~~~~~~~~~~----~~~~-----~~~-----~~~~~~~~~~v~~~i 46 (384) ||+|.+.+..... . ......-|.. +-+. +.. ......+......++ T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIKSRPMNHLPIARTAS 80 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccccCcchhcccceecchHHHHH Confidence 9998864322110 0 0000000100 0000 000 001223334555666 Q ss_pred HHHHHhhccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcC Q lcl|NC_019422. 47 RPKAKAVGKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNA 126 (384) Q Consensus 47 ~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~ 126 (384) +..|+.+..=+-.+- .++ ....+.+..++. .-.+...++..+...+..|.+++.+..+..+ ..+-.+++ T Consensus 81 ~~~A~lv~~e~~~i~-v~d----~~~~~~l~~~l~----~n~f~~~~~~~~e~a~a~G~~a~k~~~d~~~--~~i~~v~a 149 (522) T protein:vir:47 81 KKIASLVYNEQATIT-TKN----EILQKFLDDMLT----NDRFNKNFERYLESCLALGGLAMRPYIDGDK--VRVAFIQA 149 (522) T ss_pred HHHhhhhcCCcceee-cCC----hHHHHHHHHHHh----hcchHHHHHHHHHHhhccCCEEEEEEEcCCc--eEEEEEcC Confidence 666666654333321 111 011122222322 2234456667778888889998888876432 22334444 Q ss_pred ceEEEEE-cCC------------------CEEE---EEE-------------------------Ec--C----ceEEEEe Q lcl|NC_019422. 127 LNVEAIY-ENE------------------VLFL---KFL-------------------------LR--N----GKIVSYP 153 (384) Q Consensus 127 ~~v~~~~-~~~------------------~~~~---~~~-------------------------~~--~----g~~~~~~ 153 (384) ..+-+.. +.+ +.+| +++ +. + |..+.+. T Consensus 150 d~~~P~~~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~ 229 (522) T protein:vir:47 150 PVFFPLESNTQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRVNLS 229 (522) T ss_pred CceEEEEEcCCceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCcccccc Confidence 4444321 111 1111 110 00 0 1111100 Q ss_pred --------hhh----------eEEEeccCCC----CCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCc----ceEEe Q lcl|NC_019422. 154 --------YSD----------IIHLRKDFNE----NDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTI----KWLLK 207 (384) Q Consensus 154 --------~~e----------vih~~~~~~~----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p----~~il~ 207 (384) +++ ..||+.+-+. +..+|+|.+..+...++.++........-++-|... ..+++ T Consensus 230 ~~~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~ 309 (522) T protein:vir:47 230 ELDKYKNLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQRRVIVPEHLTQ 309 (522) T ss_pred ccccccCCCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccceeecchHHhc Confidence 001 2355543221 346899999999999999888777777777766542 11122 Q ss_pred eCCCCChH--HHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHH-HHHHHHHHHHHHhCCCHHHhcccc Q lcl|NC_019422. 208 FKTALRPD--DIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAA-QMDKAIQRLYSFFNTNEKIIQSKY 284 (384) Q Consensus 208 ~~~~~~~e--~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~-~~~~~~~~I~~~fgvp~~~l~~~~ 284 (384) ........ .....-+.-...|.+.... .+++.+++.++....+.++. .++...+.|+...|+++..++.+. T Consensus 310 ~~~~~~~g~~~~~~~fd~~~~~f~~~~~~------~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~ 383 (522) T protein:vir:47 310 RQYQRPDGTIDFRPRFDVEQNVYMQIGGS------SMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDG 383 (522) T ss_pred cCCCCCCcccccccccCcccceEeecCCC------CCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCccc Confidence 21111100 0000000001112221111 12334567777666666654 457788899999999998876321 Q ss_pred -----H-HH------------HHHHHHHHHHHHHHHHHHHHHhhc-ccCcccccCcceEEeechhhhccCHHHHHHH-HH Q lcl|NC_019422. 285 -----S-ED------------EWNAYYESEIEPVGLQLSNQYTEK-LFTRKARSFGNEIVFEASNLQYASMSTKLNL-VQ 344 (384) Q Consensus 285 -----~-e~------------~~~~~~~~~i~P~~~~i~~~l~~~-l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~-~~ 344 (384) + |. .....++.+|..++..+.+..+.. ++.. .......+.+++++-...|.++..+. .+ T Consensus 384 ~~~kTAtEi~s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~-~~~~~~~i~v~f~D~i~~D~~~~~~~~~~ 462 (522) T protein:vir:47 384 QGMKTATEIVSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSG-EIPELDDISVNLDDGVFTDRHAELDYWAK 462 (522) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccC-CCCCcceeEEEcCCCCCCCHHHHHHHHHH Confidence 1 11 112244555555555555443321 1111 12234457777877766776666554 56 Q ss_pred HHhCCCCCHHHHHHH-hCCCCCC--------------CCC--eeeecCceeecCCCC Q lcl|NC_019422. 345 MVDRGSLTPNEWRKI-MNLSPIE--------------NGD--KPVRRLDTAVVEGGE 384 (384) Q Consensus 345 ~~~~g~~t~NE~R~~-lG~~p~~--------------~gd--~~~~~~n~~~~~~ge 384 (384) ++..|+|++-+++.+ .|++.-+ .++ .-+.+++...-+.|+ T Consensus 463 ~v~aG~~s~e~~i~~~~g~~eeea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~d 519 (522) T protein:vir:47 463 MVAAGFSTKKRAIGKTLNISGVEAEKELNAINSELLPMNDAELAIYGMHDQNEEKAD 519 (522) T ss_pred HHhcCCCCHHHHHHhcCCCChHHHHHHHHHHHHhhccCCCCCCCCCCCCCcccccCC Confidence 788899999997655 3664310 011 112233333333333 No 198 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=97.01 E-value=0.00021 Score=40.73 Aligned_cols=376 Identities=10% Similarity=0.053 Sum_probs=179.6 Q ss_pred Ccchhh--------hcccCCCcchhH----------HHhhccc-cCccee------------chhhhhhcHHHHHHHHHH Q lcl|NC_019422. 1 MNIFKS--------KKKNKEAPGKVM----------MELISDS-GNGFYS------------WHGNLYKSDIVRSIIRPK 49 (384) Q Consensus 1 M~~f~~--------~~~~~~~~~~~~----------~~~~~~~-~~~~~~------------~~~~~~~~~~v~~~i~~i 49 (384) +.-=.. +.....+|.... .....+. ...+.. .-+....+|.|.+||+.| T Consensus 5 ~~~de~~~~~~~~~~~~S~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~Av~eI 84 (511) T protein:vir:56 5 TKEEEQDIQKIEKNPVRSFSAPDNVDGAKEIHTNLLAPQLGHAIIPSDAQSEGTIPVKELIKSYRALAEYHEVDDAIQEI 84 (511) T ss_pred cchhhhhhhhhccCCcccccCCCCCCCceEEecccccceecceeccccccccCccchHHHHHHHHHHhhccchhhHHHHh Confidence 111100 000000000000 0000000 000000 012335689999999999 Q ss_pred HHhhccCc-----eEEEEecCCcceeccchH---HHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeE Q lcl|NC_019422. 50 AKAVGKMT-----AKHIRSNETEFKTNPEIY---IKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQI 121 (384) Q Consensus 50 a~~ia~~~-----~~~~~~~~~~~~~~~~~~---~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l 121 (384) .+.+.-+. +.+--.+-+-.+...... +..++ ..+++..--+.++..|.+.|..|..++-+.+.-..+| T Consensus 85 vne~iv~d~~~~pV~l~ld~~~~s~~iK~kI~eeF~~Il----~ll~F~~~~~~~fR~WYVDgRi~fHkiid~k~GI~eL 160 (511) T protein:vir:56 85 VDEAIVYENDKEVVWLNLDNTDFSENIKAKINEEFDRVV----SLLQMRKHGYKWFRKWYVDSRIYFHKILDKDNNIIEL 160 (511) T ss_pred hcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHH----HHhccchhhhHHHhhhhhcceEEEEEEeccccceeeh Confidence 98886442 222111111111111111 11111 1233444455667778899999999987766568999 Q ss_pred EEEcCceEEEEEcC-----CC------E--EEEEEEcC-------------ceEEEEehhheEEEeccC---CCCCccCc Q lcl|NC_019422. 122 YPLNALNVEAIYEN-----EV------L--FLKFLLRN-------------GKIVSYPYSDIIHLRKDF---NENDLFGT 172 (384) Q Consensus 122 ~~l~~~~v~~~~~~-----~~------~--~~~~~~~~-------------g~~~~~~~~evih~~~~~---~~~~~~G~ 172 (384) +.|||..++.++.- ++ . ++.|...+ ...+.++.+.|.|....- +.+..+.+ T Consensus 161 r~lDPr~i~~vr~i~~~~~~~~~v~~~~~ey~~Y~~~~~~~~~~~~~~~~~~~~vkI~~daI~y~hSGL~d~~~~~g~i~ 240 (511) T protein:vir:56 161 RPLNPMKMELVREIQKETIDGVEVVKGTLEYYVYKQSDYKMPSWMSATNRAQTSFRIPKDAIVFAHSGLMRGCADDPYII 240 (511) T ss_pred hhcCcccchhhhhhhcccccccccccceeeeeEecCCCcccCcccccccccccceeechhheeeecccceeccCCCCeee Confidence 99999999865432 11 1 12222111 133567888887765553 24555788 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHHHHHHHHHHHHhcc---------ccccCCcc---- Q lcl|NC_019422. 173 SPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDIKKEVKSFEKNYLQ---------IDSEAGGA---- 238 (384) Q Consensus 173 s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~~~~~~~~~~~---------~~~~~~~~---- 238 (384) |-+..|.+.+..+.-...+..=+--..+.-+-++.++ |.+....+++....+-..++. ...+..+. T Consensus 241 syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~msMl 320 (511) T protein:vir:56 241 GYLDRAIKPANQLKMLEDALVIYRLARAPERRVFYVDVGNLPTQKAQQYVNGIMQNVKNRVVYDTQTGQVKNTTNAMSML 320 (511) T ss_pred ccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccchhhhhhH Confidence 9999999999998888877766655555556677776 445444455555555444432 11111221 Q ss_pred ----eec---CCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhccc----------cHHHHHH-----HHHHHH Q lcl|NC_019422. 239 ----AAT---DSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQSK----------YSEDEWN-----AYYESE 296 (384) Q Consensus 239 ----~v~---~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~~----------~~e~~~~-----~~~~~~ 296 (384) ++- +.|.+++.|.-...-.++...++..+.+..+++||.+-|... .+|-.+. -|+..- T Consensus 321 EDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kKLy~aLnVP~SRl~~e~q~~~f~~Gr~~EItRDEiKF~KFI~RL 400 (511) T protein:vir:56 321 EDYYLPRREGSKGTEVSTLPGGQSLGDIEDVLYFNRKLYKAMRIPTSRAASEDQTGGINFGQGAEITRDELKFTKFVKRL 400 (511) T ss_pred hhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCccccccccchhhhHHHHHHHHHHHHH Confidence 111 234555555545555567778899999999999999887522 2232221 222221 Q ss_pred HHHHHHHHHHHHhhcccCc-----cc---ccCcceEEeechh----hhccC-HHHHHHHHHH---HhCCCCCHHHHHHH- Q lcl|NC_019422. 297 IEPVGLQLSNQYTEKLFTR-----KA---RSFGNEIVFEASN----LQYAS-MSTKLNLVQM---VDRGSLTPNEWRKI- 359 (384) Q Consensus 297 i~P~~~~i~~~l~~~l~~~-----~~---~~~~~~i~fd~~~----~~~~d-~~~~~~~~~~---~~~g~~t~NE~R~~- 359 (384) =.-+...+.+.|..+|+.. .+ ......+.|..|. ++... +..|+.+++. +.+-.++.+=+|+. T Consensus 401 R~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~~yi~k~I 480 (511) T protein:vir:56 401 QTKFETVITDPLKHQLIVNNIITEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDYAGKYYSHKYIQKNI 480 (511) T ss_pred HHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcchhccccchHHHHHHH Confidence 1222333344444444332 11 1123334443333 22222 3344444332 33346677666643 Q ss_pred hCCCCCC----------CCCeeeecCceeecCCCC Q lcl|NC_019422. 360 MNLSPIE----------NGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 360 lG~~p~~----------~gd~~~~~~n~~~~~~ge 384 (384) |.+.-.+ +...++.+.. +.|= T Consensus 481 Lr~tDeei~~~~k~I~~E~k~~~~~~~----e~~f 511 (511) T protein:vir:56 481 LRLSDDQITAMQSEIDEEETNPRFQQD----DQGF 511 (511) T ss_pred hccCHHHHHHHHHHHHHhhcCCCCCCc----ccCC Confidence 3332210 0111111000 0000 No 199 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=96.97 E-value=0.00023 Score=40.50 Aligned_cols=348 Identities=11% Similarity=0.063 Sum_probs=165.4 Q ss_pred CcchhhhcccCCCcchhHHHhhcccc-------------CcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCc Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDSG-------------NGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETE 67 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 67 (384) .+-+.... ..-.....+..+.. ........+-+..+....+++..+.-+-+-|+++- .+++ T Consensus 36 i~~~~~~~----~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~-~~d~- 109 (474) T protein:vir:95 36 IDDHRKQL----DKITVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTYS-CEDE- 109 (474) T ss_pred HHHHHHHH----HHHHHHHHHhcccCchhccccccccccccccccccceeccchHHHHHHHHHhhhccCCceec-cCch- Confidence 11111000 00000001111000 00000001112355677788888888877777752 1111 Q ss_pred ceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC------EEEE Q lcl|NC_019422. 68 FKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV------LFLK 141 (384) Q Consensus 68 ~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~------~~~~ 141 (384) ........++. | ........+..+...+|.+|+.+..+..|++ .+..++|..+-++.+... ...+ T Consensus 110 ---~~~~~l~~~~~--n---~~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~i~~ 180 (474) T protein:vir:95 110 ---SVLKIIHDVLD--T---RWDNKLIDILTATSNKGIDWLQVYINENGEM-KLFRVPAEQAIPIWVDKEREELKSFIRY 180 (474) T ss_pred ---HHHHHHHHHHh--c---cHHHHHHHHHHHHhhcCcEEEEEEecCCCce-EEEEEcccceEEEEcCCCCCceEEEEEE Confidence 11222333332 2 2445667778999999999999999888865 477788888887765431 1122 Q ss_pred EEEcCce-EEEEehhheEEEeccC---------------------C---------CCCccCccHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 142 FLLRNGK-IVSYPYSDIIHLRKDF---------------------N---------ENDLFGTSPAKVLEPIMEVVNTTDQ 190 (384) Q Consensus 142 ~~~~~g~-~~~~~~~evih~~~~~---------------------~---------~~~~~G~s~~~~~~~~i~~~~~~~~ 190 (384) |...+.. ...+..+.+.+++... + .+...|.|.+..+...++....+.. T Consensus 181 ~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~e~v~~liDa~d~~~S 260 (474) T protein:vir:95 181 YKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHIQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKSLIDAIDKRLS 260 (474) T ss_pred EEEcCeeEEEEEeCCeEEEEEEcCCccccccccCcccccccccccCCCccceEeecCCCCCCCcHHHHHHHHHHHHHHHH Confidence 2222211 1122233333332110 0 1123588888888888888777666 Q ss_pred HHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHH-HHHHHHHHHH Q lcl|NC_019422. 191 GVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNA-AQMDKAIQRL 269 (384) Q Consensus 191 ~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~-~~~~~~~~~I 269 (384) ...+.++....|..++. +..+.+ .+. +.... ...+++.++++.+++.+........+ ..++.+.+.| T Consensus 261 ~~~~~~~~~~~p~lv~~-g~~~~~--~~~----~~~~~-----~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i 328 (474) T protein:vir:95 261 DAQNMFDESVELIYILK-GYEGQD--LEE----FMRGL-----KYYKAINVDGDGGVETIQVEVPVSSTKEYIDLMRAYI 328 (474) T ss_pred HHHHHHHHhcCceeeee-cCCccc--chh----hhhhh-----hccceeeccCCCceeEEeecCCHHHHHHHHHHHHHHH Confidence 66666776667754443 222211 111 11111 12345666667676666654433333 3456677788 Q ss_pred HHHhCCCHHH---hccccHHHHH--------------HHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhh Q lcl|NC_019422. 270 YSFFNTNEKI---IQSKYSEDEW--------------NAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQ 332 (384) Q Consensus 270 ~~~fgvp~~~---l~~~~~e~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~ 332 (384) +..-++|..- ++++.+..+. ...+...+..+++.+.+.+... . ....+++.++.-. T Consensus 329 ~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~----~---d~~~i~v~f~~~~ 401 (474) T protein:vir:95 329 MEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNLK----M---DVKDIEISFNFNR 401 (474) T ss_pred HHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC----c---ccceeeEEeccCC Confidence 8888887532 2232222111 2234444555555554443221 1 1123344444444 Q ss_pred ccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC-------CC-----eeeecCceeecCCCC Q lcl|NC_019422. 333 YASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN-------GD-----KPVRRLDTAVVEGGE 384 (384) Q Consensus 333 ~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~-------gd-----~~~~~~n~~~~~~ge 384 (384) ..|..+.++.+ ...|+++...+.+++++-+.+. .+ ......+-.-.++.+ T Consensus 402 p~d~~e~a~~~--~~~g~iS~et~i~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~ 463 (474) T protein:vir:95 402 MMNDAEQSQII--AQSQYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDGGADGAQ 463 (474) T ss_pred CcCHHHHHHHH--HhcCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcccccccccCCCCc Confidence 45566555543 3468999888888887643210 00 011000100011111 No 200 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=96.97 E-value=0.00023 Score=40.50 Aligned_cols=352 Identities=10% Similarity=0.036 Sum_probs=162.1 Q ss_pred Ccc--------hhhhcccCCCcch---hHHHhhcccc-------------CcceechhhhhhcHHHHHHHHHHHHhhccC Q lcl|NC_019422. 1 MNI--------FKSKKKNKEAPGK---VMMELISDSG-------------NGFYSWHGNLYKSDIVRSIIRPKAKAVGKM 56 (384) Q Consensus 1 M~~--------f~~~~~~~~~~~~---~~~~~~~~~~-------------~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~ 56 (384) +.. ..++...-...-+ ....+..+-. ........+-...+....+++..+.-+-+- T Consensus 21 ~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g~ 100 (474) T protein:vir:97 21 LKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVDQKVSYVASK 100 (474) T ss_pred hhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHHHHHHHhhhhcC Confidence 000 0000000000000 0011111000 000000011123556778888888888888 Q ss_pred ceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCC Q lcl|NC_019422. 57 TAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENE 136 (384) Q Consensus 57 ~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~ 136 (384) |+++-- +++ ........++. | ........+..+.+.+|.+|+.+..+..|.+ .+..++|..+.++.+.. T Consensus 101 p~~~~~-~d~----~~~~~l~~~~~--n---~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~ 169 (474) T protein:vir:97 101 PVTYSC-EDE----NVLKVIHDVLD--T---RWDNKLIDILTATSNKGIDWLQVYINENGEM-KLFRVPAEQAIPIWVDK 169 (474) T ss_pred Cceecc-CcH----HHHHHHHHHHh--c---cHHHHHHHHHHHHhhcCceEEEEEecCCCee-EEEEEcccceEEEEcCC Confidence 877521 111 11122222332 2 3456667778999999999999999888864 47778888888876643 Q ss_pred C--E----EEEEEEcCce-EEEEehhheEEEeccC---------------------C---------CCCccCccHHHHHH Q lcl|NC_019422. 137 V--L----FLKFLLRNGK-IVSYPYSDIIHLRKDF---------------------N---------ENDLFGTSPAKVLE 179 (384) Q Consensus 137 ~--~----~~~~~~~~g~-~~~~~~~evih~~~~~---------------------~---------~~~~~G~s~~~~~~ 179 (384) . . ..+|...+.. ...+..+.+.+++... + .+...|.|.+..+. T Consensus 170 ~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~ 249 (474) T protein:vir:97 170 EREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYK 249 (474) T ss_pred CCCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCCccceEEecCCcCCCCcHHHHH Confidence 2 1 1122211111 1112222222221100 0 11236888888888 Q ss_pred HHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHH Q lcl|NC_019422. 180 PIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNA 259 (384) Q Consensus 180 ~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~ 259 (384) ..++....+.....+.++..+.|..+++- .... ..+... ... ...+++.+++|.+++.+......... T Consensus 250 ~liDa~n~~~s~~~~~~~~~~~~~lv~~g-~~~~--~~~~~~----~~~-----~~~~~i~~~~~~~~~~l~~~~~~~~~ 317 (474) T protein:vir:97 250 SIIDAIDKRLSDAQNMFDESVELIYILKG-YEGE--DLEEFM----RGL-----KYYKAINVDGDGGVETIQVEVPVSST 317 (474) T ss_pred HHHHHHHHHHHHHHHHHHHhcCceeeeec-CCcc--cchhhh----hhh-----hccceeeccCCCceeEEeecCCHHHH Confidence 88888777776666666666666554432 2221 111111 111 12345666666666666654333333 Q ss_pred -HHHHHHHHHHHHHhCCCH---HHhccccHHHHH--------------HHHHHHHHHHHHHHHHHHHhhcccCcccccCc Q lcl|NC_019422. 260 -AQMDKAIQRLYSFFNTNE---KIIQSKYSEDEW--------------NAYYESEIEPVGLQLSNQYTEKLFTRKARSFG 321 (384) Q Consensus 260 -~~~~~~~~~I~~~fgvp~---~~l~~~~~e~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~ 321 (384) ..++.+.+.|...-++|. .-++++.+..+. ...+...+..++..+.+.++.. . .. T Consensus 318 ~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~----~---d~ 390 (474) T protein:vir:97 318 KEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNLK----T---DV 390 (474) T ss_pred HHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC----c---cc Confidence 334556666766666664 223333232221 2244445555555554433321 1 11 Q ss_pred ceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC-------CCeeeecCceeecCCCC Q lcl|NC_019422. 322 NEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN-------GDKPVRRLDTAVVEGGE 384 (384) Q Consensus 322 ~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~-------gd~~~~~~n~~~~~~ge 384 (384) ..+++.++.-...+..+.++.+ ...|+++..-++++++.-+.+. .++--...+..+..+++ T Consensus 391 ~~i~v~f~~~~p~~~~e~a~~~--~~~g~iS~et~l~~l~~v~D~~~E~eri~~E~~~~~~~~~~~~~~~ 458 (474) T protein:vir:97 391 KDIEISFNFNRMMNDAEQSQII--AQSQYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDGG 458 (474) T ss_pred ceeeEEeccCcccCHHHHHHHH--HHcCCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccCCCC Confidence 2334444443344555554443 4468999998988887633210 00000011111111111 No 201 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=96.97 E-value=0.00023 Score=40.50 Aligned_cols=352 Identities=10% Similarity=0.036 Sum_probs=162.1 Q ss_pred Ccc--------hhhhcccCCCcch---hHHHhhcccc-------------CcceechhhhhhcHHHHHHHHHHHHhhccC Q lcl|NC_019422. 1 MNI--------FKSKKKNKEAPGK---VMMELISDSG-------------NGFYSWHGNLYKSDIVRSIIRPKAKAVGKM 56 (384) Q Consensus 1 M~~--------f~~~~~~~~~~~~---~~~~~~~~~~-------------~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~ 56 (384) +.. ..++...-...-+ ....+..+-. ........+-...+....+++..+.-+-+- T Consensus 21 ~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g~ 100 (474) T protein:vir:94 21 LKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVDQKVSYVASK 100 (474) T ss_pred hhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHHHHHHHhhhhcC Confidence 000 0000000000000 0011111000 000000011123556778888888888888 Q ss_pred ceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCC Q lcl|NC_019422. 57 TAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENE 136 (384) Q Consensus 57 ~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~ 136 (384) |+++-- +++ ........++. | ........+..+.+.+|.+|+.+..+..|.+ .+..++|..+.++.+.. T Consensus 101 p~~~~~-~d~----~~~~~l~~~~~--n---~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~ 169 (474) T protein:vir:94 101 PVTYSC-EDE----NVLKVIHDVLD--T---RWDNKLIDILTATSNKGIDWLQVYINENGEM-KLFRVPAEQAIPIWVDK 169 (474) T ss_pred Cceecc-CcH----HHHHHHHHHHh--c---cHHHHHHHHHHHHhhcCceEEEEEecCCCee-EEEEEcccceEEEEcCC Confidence 877521 111 11122222332 2 3456667778999999999999999888864 47778888888876643 Q ss_pred C--E----EEEEEEcCce-EEEEehhheEEEeccC---------------------C---------CCCccCccHHHHHH Q lcl|NC_019422. 137 V--L----FLKFLLRNGK-IVSYPYSDIIHLRKDF---------------------N---------ENDLFGTSPAKVLE 179 (384) Q Consensus 137 ~--~----~~~~~~~~g~-~~~~~~~evih~~~~~---------------------~---------~~~~~G~s~~~~~~ 179 (384) . . ..+|...+.. ...+..+.+.+++... + .+...|.|.+..+. T Consensus 170 ~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~ 249 (474) T protein:vir:94 170 EREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYK 249 (474) T ss_pred CCCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCCccceEEecCCcCCCCcHHHHH Confidence 2 1 1122211111 1112222222221100 0 11236888888888 Q ss_pred HHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHH Q lcl|NC_019422. 180 PIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNA 259 (384) Q Consensus 180 ~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~ 259 (384) ..++....+.....+.++..+.|..+++- .... ..+... ... ...+++.+++|.+++.+......... T Consensus 250 ~liDa~n~~~s~~~~~~~~~~~~~lv~~g-~~~~--~~~~~~----~~~-----~~~~~i~~~~~~~~~~l~~~~~~~~~ 317 (474) T protein:vir:94 250 SIIDAIDKRLSDAQNMFDESVELIYILKG-YEGE--DLEEFM----RGL-----KYYKAINVDGDGGVETIQVEVPVSST 317 (474) T ss_pred HHHHHHHHHHHHHHHHHHHhcCceeeeec-CCcc--cchhhh----hhh-----hccceeeccCCCceeEEeecCCHHHH Confidence 88888777776666666666666554432 2221 111111 111 12345666666666666654333333 Q ss_pred -HHHHHHHHHHHHHhCCCH---HHhccccHHHHH--------------HHHHHHHHHHHHHHHHHHHhhcccCcccccCc Q lcl|NC_019422. 260 -AQMDKAIQRLYSFFNTNE---KIIQSKYSEDEW--------------NAYYESEIEPVGLQLSNQYTEKLFTRKARSFG 321 (384) Q Consensus 260 -~~~~~~~~~I~~~fgvp~---~~l~~~~~e~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~ 321 (384) ..++.+.+.|...-++|. .-++++.+..+. ...+...+..++..+.+.++.. . .. T Consensus 318 ~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~----~---d~ 390 (474) T protein:vir:94 318 KEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNLK----T---DV 390 (474) T ss_pred HHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC----c---cc Confidence 334556666766666664 223333232221 2244445555555554433321 1 11 Q ss_pred ceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC-------CCeeeecCceeecCCCC Q lcl|NC_019422. 322 NEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN-------GDKPVRRLDTAVVEGGE 384 (384) Q Consensus 322 ~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~-------gd~~~~~~n~~~~~~ge 384 (384) ..+++.++.-...+..+.++.+ ...|+++..-++++++.-+.+. .++--...+..+..+++ T Consensus 391 ~~i~v~f~~~~p~~~~e~a~~~--~~~g~iS~et~l~~l~~v~D~~~E~eri~~E~~~~~~~~~~~~~~~ 458 (474) T protein:vir:94 391 KDIEISFNFNRMMNDAEQSQII--AQSQYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDGG 458 (474) T ss_pred ceeeEEeccCcccCHHHHHHHH--HHcCCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccCCCC Confidence 2334444443344555554443 4468999998988887633210 00000011111111111 No 202 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=96.95 E-value=0.00024 Score=40.43 Aligned_cols=377 Identities=10% Similarity=0.060 Sum_probs=180.3 Q ss_pred CcchhhhcccCCCcchh------------------HHHhhccccCccee------chhhhhhcHHHHHHHHHHHHhhccC Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKV------------------MMELISDSGNGFYS------WHGNLYKSDIVRSIIRPKAKAVGKM 56 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~------------------~~~~~~~~~~~~~~------~~~~~~~~~~v~~~i~~ia~~ia~~ 56 (384) -..++.+.....+|... ....+-+..+.+.+ .-+....+|.|.+||+.|.+.+.-+ T Consensus 21 ~~~~~~~~~s~~~p~~~dGa~~I~~~~~~~~~~~~~~~~~~~~~~~~~n~~eLI~~YR~ma~~pEvd~Av~eIvneaiv~ 100 (521) T protein:vir:10 21 QSDLSDRIDSFAVPDTADGAIEVDKQIDTTAPKTAIVQSVLGYAPKIQNTKDLINQYRSLSKYHEVDNAIDEIINDAIVQ 100 (521) T ss_pred hhhhccCccccccccCCCCceeeccCCCccccccchhhhhhccccccchHHHHHHHHHHHhhccchhhHHHhhhcceEEe Confidence 22222111111111000 00000000000000 0123456899999999999888644 Q ss_pred c-----eEEEEecCCcceeccchHH---HHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCC---CCceeeEEEEc Q lcl|NC_019422. 57 T-----AKHIRSNETEFKTNPEIYI---KFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDD---YNMPTQIYPLN 125 (384) Q Consensus 57 ~-----~~~~~~~~~~~~~~~~~~~---~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~---~g~~~~l~~l~ 125 (384) . ..+--.+-+..+..+.... ..++ ..+++..--+.++..|.+.|..|..++-+. +.-..+|+.|| T Consensus 101 d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il----~ll~F~~~~~~~fR~WYVDgRi~fHkiid~~~pk~GI~Elr~lD 176 (521) T protein:vir:10 101 EDNRDTVYLDLDKTDWNESVKEMVREEFRTIL----KLLKFEREGKRHFRRWYVDSRIYFHKMIDPARPKDGIKELRLLD 176 (521) T ss_pred cCCCceEEEEecCcccchHHHHHHHHHHHHHH----HHhccchhhhHHHhhheeeeeEEEEEEeeCCCccccceeeeeeC Confidence 3 3322122222222221111 1111 123344445566777889999999987652 22488999999 Q ss_pred CceEEEEEcC-----CC--------EEEEEEEc-------C---ceEEEEehhheEEEeccC-CCCCccCccHHHHHHHH Q lcl|NC_019422. 126 ALNVEAIYEN-----EV--------LFLKFLLR-------N---GKIVSYPYSDIIHLRKDF-NENDLFGTSPAKVLEPI 181 (384) Q Consensus 126 ~~~v~~~~~~-----~~--------~~~~~~~~-------~---g~~~~~~~~evih~~~~~-~~~~~~G~s~~~~~~~~ 181 (384) |..++.++.. ++ .++.|... + +..+.++.+-|.|...+- ..+....+|-+..|.+. T Consensus 177 Pr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~~g~~~~~vkI~~daI~y~hSGL~d~~~~~i~syLhkAiKp 256 (521) T protein:vir:10 177 PRNVEYYRVNLKSNENGNDVYKGVKEFFTYGATEDNRYNISGNSNNLVQIPIDAIVYSHSGKVDIDGKTIVGYLHNVIKP 256 (521) T ss_pred CcceeeeeeecCCCCCcchhhccceeeeeeccCCCceecCCCCCCcceeechhheeeecccceeCCCCceeccchhhhHh Confidence 9999765432 11 11122111 1 122445654444443222 34567889999999999 Q ss_pred HHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHHHHHHHHHHHHhcc---------ccccCCcc--------eec-- Q lcl|NC_019422. 182 MEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDIKKEVKSFEKNYLQ---------IDSEAGGA--------AAT-- 241 (384) Q Consensus 182 i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~~~~~~~~~~~---------~~~~~~~~--------~v~-- 241 (384) +..+.-...+..=+--..+.-+-++.++ |.+....+++....+-..++. ...+..+. ++- T Consensus 257 ~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~msMlEDyWLpRRe 336 (521) T protein:vir:10 257 ANQLKMLEDAMVIYRITRAPERRVFYIDVGTMPNKKATQHLNNVMQGLKNRVVYDSSTGKVKNSSNNLAMTEDYWLMRRD 336 (521) T ss_pred HHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccchhhhhhHhhhcccccC Confidence 9998888877766655555556677776 445444455555555444432 11111221 111 Q ss_pred -CCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhccc--------cHHHHH-----HHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 242 -DSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQSK--------YSEDEW-----NAYYESEIEPVGLQLSNQ 307 (384) Q Consensus 242 -~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~~--------~~e~~~-----~~~~~~~i~P~~~~i~~~ 307 (384) +.|.+++.|.-...-.++...++..+.+..+++||.+-|... .+|-.+ .-|+..-=.-+...+.+. T Consensus 337 GgrgTEI~TLpggqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~f~~Gr~~EItRDEikF~KFI~rLR~rFs~~f~~~ 416 (521) T protein:vir:10 337 GKATTEVSTLPGAQSMGEMDDVRWFNRKLYESMKIPLSRLPQEGAGVTFGAGNDITRDELQFTKYIRGLQQQFEPIFLNP 416 (521) T ss_pred CCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCCCCceecccccchhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 234555555545555567778899999999999999877432 122222 123222212223334444 Q ss_pred HhhcccCc-----cc---ccCcceEEeechh----hhccC-HHHHHHHHHH-----HhCCCCCHHHHHHH-hCCCCCC-- Q lcl|NC_019422. 308 YTEKLFTR-----KA---RSFGNEIVFEASN----LQYAS-MSTKLNLVQM-----VDRGSLTPNEWRKI-MNLSPIE-- 366 (384) Q Consensus 308 l~~~l~~~-----~~---~~~~~~i~fd~~~----~~~~d-~~~~~~~~~~-----~~~g~~t~NE~R~~-lG~~p~~-- 366 (384) |..+|+.. .+ ......+.|..|. ++... +..|+.+++. +.+-+++.+=+|+. |.+.-.+ T Consensus 417 L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp~~yvGky~s~dyi~k~ILr~tDeeik 496 (521) T protein:vir:10 417 LRTNLMLKGKMSVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTLASAEVTGKYLSHEYVMKNILRMSDEDIK 496 (521) T ss_pred HHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhhcCccccccccchHHHHHHHhcCCHhHHH Confidence 44444332 11 1123334443333 22222 3444444432 23347777777653 4544210 Q ss_pred --------CCCeeeecCceeecCC-CC Q lcl|NC_019422. 367 --------NGDKPVRRLDTAVVEG-GE 384 (384) Q Consensus 367 --------~gd~~~~~~n~~~~~~-ge 384 (384) +....+.+ .|-++ +| T Consensus 497 ~~~k~I~~E~~~~~~~---~p~~e~~d 520 (521) T protein:vir:10 497 TEREKIDGELKDSVYK---NPEDPMEE 520 (521) T ss_pred HHHHHHHHhhhCCCCC---CCcchhhc Confidence 11111111 11111 11 No 203 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=96.90 E-value=0.00027 Score=40.17 Aligned_cols=372 Identities=10% Similarity=0.077 Sum_probs=172.7 Q ss_pred Ccchhhh----cccCCCcchhHHHhhccccCccee---c-----hhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_019422. 1 MNIFKSK----KKNKEAPGKVMMELISDSGNGFYS---W-----HGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEF 68 (384) Q Consensus 1 M~~f~~~----~~~~~~~~~~~~~~~~~~~~~~~~---~-----~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~ 68 (384) +....+. .......-.....+..+....... . ...-...+....+++..+.-+.+-|+++-- +++ T Consensus 42 ~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~-~d~-- 118 (512) T protein:vir:97 42 INEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQD-DDK-- 118 (512) T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccCceecc-CCh-- Confidence 1111110 000000001112222211111000 0 001122456677888888888777777521 111 Q ss_pred eeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC--E----EEEE Q lcl|NC_019422. 69 KTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV--L----FLKF 142 (384) Q Consensus 69 ~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~----~~~~ 142 (384) .....+..++ ..-........+..+.+.+|.+|+++.+++.|.+ .+..++|..+.++.+... . +.+| T Consensus 119 --~~~~~l~~~~----~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded~~~-~i~~~~p~~~~~iyd~~~~~~~~~~vr~~ 191 (512) T protein:vir:97 119 --DVLEAIEAFN----DLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYL 191 (512) T ss_pred --HHHHHHHHHH----hhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEE Confidence 1112223332 2234556777888999999999999999888864 477889998888876542 1 1111 Q ss_pred EE--cCc------eE-EEEehhheEEEeccC----------------CC---------CCccCccHHHHHHHHHHHHHHH Q lcl|NC_019422. 143 LL--RNG------KI-VSYPYSDIIHLRKDF----------------NE---------NDLFGTSPAKVLEPIMEVVNTT 188 (384) Q Consensus 143 ~~--~~g------~~-~~~~~~evih~~~~~----------------~~---------~~~~G~s~~~~~~~~i~~~~~~ 188 (384) .. ..+ .. ..+.++.+.+++... +. +...|.|.+..+...++....+ T Consensus 192 ~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~gd~e~v~~liDa~d~~ 271 (512) T protein:vir:97 192 RTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNA 271 (512) T ss_pred EeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcccceEeecCCCCCCCchhhhHHHHHHHHHH Confidence 11 111 01 123444455543211 10 1236889999888888888877 Q ss_pred HHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhcccc-ccCCcceecCCCceeeecccchhHHHH-HHHHHHH Q lcl|NC_019422. 189 DQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQID-SEAGGAAATDSKYDAEQVKAESYVPNA-AQMDKAI 266 (384) Q Consensus 189 ~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~-~~~~~~~v~~~g~~~~~l~~~~~~~~~-~~~~~~~ 266 (384) ..-..+.+...+.|-.++.-....+++.....+....-...... .+.....-.++|.+++.+........+ ..++.+. T Consensus 272 ~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~ 351 (512) T protein:vir:97 272 ESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLN 351 (512) T ss_pred HHHHHHHHHHhcCceeeeecCccCCchhhhhhhhcccccccccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHHHH Confidence 77777777776667555543233333333322221111111100 011111223456666666654333333 2345566 Q ss_pred HHHHHHhCCCHHH---hccccHHHHH--------------HHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeech Q lcl|NC_019422. 267 QRLYSFFNTNEKI---IQSKYSEDEW--------------NAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEAS 329 (384) Q Consensus 267 ~~I~~~fgvp~~~---l~~~~~e~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~ 329 (384) +.|+..-++|..- ++++.+..+. ...+...+.-+++.+...+...--.... .....+++.+. T Consensus 352 ~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~-~d~~~i~~~f~ 430 (512) T protein:vir:97 352 SDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDAN-KDFNTVRYVYN 430 (512) T ss_pred HHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccc-cccccceEEeC Confidence 7777777777643 3333222221 1233344444444444433321111111 11123444455 Q ss_pred hhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC----------CC---eeeecC--ceeecCCCC Q lcl|NC_019422. 330 NLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN----------GD---KPVRRL--DTAVVEGGE 384 (384) Q Consensus 330 ~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~----------gd---~~~~~~--n~~~~~~ge 384 (384) .-...+..+.++.+... .|+++..-++++++.-+.|. .+ ....+. ..-..++++ T Consensus 431 ~~~p~~~~e~~~~~~kl-~giiS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~ 499 (512) T protein:vir:97 431 RNLPKSLIEELKAYIDS-GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDE 499 (512) T ss_pred CCCCcCHHHHHHHHHHH-hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCC Confidence 55556677777665433 38899988888887643210 00 000000 011111111 No 204 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=96.87 E-value=0.00029 Score=40.01 Aligned_cols=372 Identities=10% Similarity=0.070 Sum_probs=170.7 Q ss_pred Ccchhhhccc------CCCc-chhHHHhhccccCccee---c-----hhhhhhcHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_019422. 1 MNIFKSKKKN------KEAP-GKVMMELISDSGNGFYS---W-----HGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNE 65 (384) Q Consensus 1 M~~f~~~~~~------~~~~-~~~~~~~~~~~~~~~~~---~-----~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 65 (384) |.......+. ...+ -.....+..+....... . ...-...+....+++..+.-+-+-|+++- .++ T Consensus 39 ~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~-~~d 117 (511) T protein:vir:96 39 LQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQ-DDD 117 (511) T ss_pred hcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccCceee-cCc Confidence 2222211100 0000 01112222221111100 0 00112235667778888887777777752 111 Q ss_pred CcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC---EE--- Q lcl|NC_019422. 66 TEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV---LF--- 139 (384) Q Consensus 66 ~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~---~~--- 139 (384) + .....+..++.. -....+...+..+++.+|.+|.++.+++.|.+ .+..++|..+.++.+... .. T Consensus 118 ~----~~~~~l~~~~~~----n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~-~i~~~~p~~~~~v~dd~~~~~~~~~v 188 (511) T protein:vir:96 118 K----DVLEAIEAFNDL----NDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFIIYDNTVERNSIAGV 188 (511) T ss_pred h----HHHHHHHHHHhh----cChhHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEcccceEEEEcCCCCCceEEEE Confidence 1 111222333322 34556777788999999999999999888864 477789998888776532 11 Q ss_pred EEEEEc--Cc---e---E-EEEehhheEEEeccCC----------------C---------CCccCccHHHHHHHHHHHH Q lcl|NC_019422. 140 LKFLLR--NG---K---I-VSYPYSDIIHLRKDFN----------------E---------NDLFGTSPAKVLEPIMEVV 185 (384) Q Consensus 140 ~~~~~~--~g---~---~-~~~~~~evih~~~~~~----------------~---------~~~~G~s~~~~~~~~i~~~ 185 (384) .+|... .+ . . ..+.++.+.++..... . +...|.|.+..+...++.. T Consensus 189 r~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~~liDa~ 268 (511) T protein:vir:96 189 RYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERRKGDYEKVITLIDLY 268 (511) T ss_pred EEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCCCCCCchhhhHHHHHHH Confidence 112111 11 0 1 1234445555432210 0 1235888888888888877 Q ss_pred HHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHH-HHHH Q lcl|NC_019422. 186 NTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAA-QMDK 264 (384) Q Consensus 186 ~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~-~~~~ 264 (384) ..+.....+.+...+.|-.+++-....++++.+...+...-......--.....-.+.+.++..+........+. .++. T Consensus 269 ~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~ 348 (511) T protein:vir:96 269 DNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDR 348 (511) T ss_pred HHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHH Confidence 776666666666656665555433333333322221111000000000000001123344555555443333333 3456 Q ss_pred HHHHHHHHhCCCHHHh---ccccHHHHH--------------HHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEee Q lcl|NC_019422. 265 AIQRLYSFFNTNEKII---QSKYSEDEW--------------NAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFE 327 (384) Q Consensus 265 ~~~~I~~~fgvp~~~l---~~~~~e~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd 327 (384) +.+.|+..-++|.... +++.+..+. ...+...+.-.++.+...+...--... ......+++. T Consensus 349 L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~-~~~~~~i~~~ 427 (511) T protein:vir:96 349 LNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA-NKDFNTVRYV 427 (511) T ss_pred HHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc-ccccccceEE Confidence 6777888888876433 233222221 223444455555555444433211111 1111234555 Q ss_pred chhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC----------CCe-eeecCce----eecCCCC Q lcl|NC_019422. 328 ASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN----------GDK-PVRRLDT----AVVEGGE 384 (384) Q Consensus 328 ~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~----------gd~-~~~~~n~----~~~~~ge 384 (384) +..-...|..+.++.+... .|+++..-+.+++++-+.+. .+. .....+. -..+++| T Consensus 428 f~~~~p~n~~e~~d~~~kl-~G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 498 (511) T protein:vir:96 428 YNRNLPKSLIEELKAYIDS-GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDE 498 (511) T ss_pred eCCCCCcCHHHHHHHHHHH-hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCC Confidence 5665666777777765433 37889888888886643210 010 0001111 1111222 No 205 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=96.87 E-value=0.00029 Score=40.01 Aligned_cols=372 Identities=10% Similarity=0.070 Sum_probs=170.7 Q ss_pred Ccchhhhccc------CCCc-chhHHHhhccccCccee---c-----hhhhhhcHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_019422. 1 MNIFKSKKKN------KEAP-GKVMMELISDSGNGFYS---W-----HGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNE 65 (384) Q Consensus 1 M~~f~~~~~~------~~~~-~~~~~~~~~~~~~~~~~---~-----~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 65 (384) |.......+. ...+ -.....+..+....... . ...-...+....+++..+.-+-+-|+++- .++ T Consensus 39 ~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~-~~d 117 (511) T protein:vir:78 39 LQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQ-DDD 117 (511) T ss_pred hcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccCceee-cCc Confidence 2222211100 0000 01112222221111100 0 00112235667778888887777777752 111 Q ss_pred CcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC---EE--- Q lcl|NC_019422. 66 TEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV---LF--- 139 (384) Q Consensus 66 ~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~---~~--- 139 (384) + .....+..++.. -....+...+..+++.+|.+|.++.+++.|.+ .+..++|..+.++.+... .. T Consensus 118 ~----~~~~~l~~~~~~----n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~-~i~~~~p~~~~~v~dd~~~~~~~~~v 188 (511) T protein:vir:78 118 K----DVLEAIEAFNDL----NDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFIIYDNTVERNSIAGV 188 (511) T ss_pred h----HHHHHHHHHHhh----cChhHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEcccceEEEEcCCCCCceEEEE Confidence 1 111222333322 34556777788999999999999999888864 477789998888776532 11 Q ss_pred EEEEEc--Cc---e---E-EEEehhheEEEeccCC----------------C---------CCccCccHHHHHHHHHHHH Q lcl|NC_019422. 140 LKFLLR--NG---K---I-VSYPYSDIIHLRKDFN----------------E---------NDLFGTSPAKVLEPIMEVV 185 (384) Q Consensus 140 ~~~~~~--~g---~---~-~~~~~~evih~~~~~~----------------~---------~~~~G~s~~~~~~~~i~~~ 185 (384) .+|... .+ . . ..+.++.+.++..... . +...|.|.+..+...++.. T Consensus 189 r~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~~liDa~ 268 (511) T protein:vir:78 189 RYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERRKGDYEKVITLIDLY 268 (511) T ss_pred EEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCCCCCCchhhhHHHHHHH Confidence 112111 11 0 1 1234445555432210 0 1235888888888888877 Q ss_pred HHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHH-HHHH Q lcl|NC_019422. 186 NTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAA-QMDK 264 (384) Q Consensus 186 ~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~-~~~~ 264 (384) ..+.....+.+...+.|-.+++-....++++.+...+...-......--.....-.+.+.++..+........+. .++. T Consensus 269 ~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~ 348 (511) T protein:vir:78 269 DNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDR 348 (511) T ss_pred HHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHH Confidence 776666666666656665555433333333322221111000000000000001123344555555443333333 3456 Q ss_pred HHHHHHHHhCCCHHHh---ccccHHHHH--------------HHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEee Q lcl|NC_019422. 265 AIQRLYSFFNTNEKII---QSKYSEDEW--------------NAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFE 327 (384) Q Consensus 265 ~~~~I~~~fgvp~~~l---~~~~~e~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd 327 (384) +.+.|+..-++|.... +++.+..+. ...+...+.-.++.+...+...--... ......+++. T Consensus 349 L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~-~~~~~~i~~~ 427 (511) T protein:vir:78 349 LNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA-NKDFNTVRYV 427 (511) T ss_pred HHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc-ccccccceEE Confidence 6777888888876433 233222221 223444455555555444433211111 1111234555 Q ss_pred chhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC----------CCe-eeecCce----eecCCCC Q lcl|NC_019422. 328 ASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN----------GDK-PVRRLDT----AVVEGGE 384 (384) Q Consensus 328 ~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~----------gd~-~~~~~n~----~~~~~ge 384 (384) +..-...|..+.++.+... .|+++..-+.+++++-+.+. .+. .....+. -..+++| T Consensus 428 f~~~~p~n~~e~~d~~~kl-~G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 498 (511) T protein:vir:78 428 YNRNLPKSLIEELKAYIDS-GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDE 498 (511) T ss_pred eCCCCCcCHHHHHHHHHHH-hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCC Confidence 5665666777777765433 37889888888886643210 010 0001111 1111222 No 206 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=96.81 E-value=0.00033 Score=39.70 Aligned_cols=357 Identities=9% Similarity=-0.030 Sum_probs=169.7 Q ss_pred Ccchhh--hcccCCCcc----hhHHHhhccccCcce------------e---chhhhhhcHHHHHHHHHHHHhhccCceE Q lcl|NC_019422. 1 MNIFKS--KKKNKEAPG----KVMMELISDSGNGFY------------S---WHGNLYKSDIVRSIIRPKAKAVGKMTAK 59 (384) Q Consensus 1 M~~f~~--~~~~~~~~~----~~~~~~~~~~~~~~~------------~---~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 59 (384) +..|-. ......... .....+..+...... . ...+-..++....+++..+.-+.+-|.+ T Consensus 20 ~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~~~~~~~Ivd~~~~~l~g~p~~ 99 (479) T protein:vir:79 20 STINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAINNYHKLLVDQKVGYSVGNPIV 99 (479) T ss_pred ChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCcccccccccccccccccccccCcceeecchHHHHHHHHHhhhhcCCce Confidence 111111 011000000 011111111100000 0 0001123556777888888888777777 Q ss_pred EEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCE- Q lcl|NC_019422. 60 HIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVL- 138 (384) Q Consensus 60 ~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~- 138 (384) +-- +.. ........+.. | ........++.+.+.+|.+|+.+..+..|.+. +..++|..+.++.+.... T Consensus 100 ~~~---~~~--~~~~~~~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~-i~~~~p~~~~~v~d~~~~~ 168 (479) T protein:vir:79 100 FNA---DDD--NLTKLLNDLLG--E---EFDDTITELYLNASNKGVEWLHPYINRKGEFK-YVIIPAEEAIPIWDSKRQR 168 (479) T ss_pred ecc---CCH--HHHHHHHHHHh--c---CHHHHHHHHHHHHHhcCeEEEEEEeCCCCceE-EEEEccceeEEEEeCCCCC Confidence 521 111 11122233322 2 45566777889999999999999998888654 777888888887654321 Q ss_pred -----EEEEEE--cCceE----EEEehhheEEEeccC------------------------------C---------CCC Q lcl|NC_019422. 139 -----FLKFLL--RNGKI----VSYPYSDIIHLRKDF------------------------------N---------END 168 (384) Q Consensus 139 -----~~~~~~--~~g~~----~~~~~~evih~~~~~------------------------------~---------~~~ 168 (384) ...|.. .+++. ..+.++.+.|++... + .+. T Consensus 169 ~~~~~ir~y~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn 248 (479) T protein:vir:79 169 ELVAFIRFYYIEDIDGNKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKVPFIPFKNN 248 (479) T ss_pred ceEEEEEEEEEeecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCcccEEEecCC Confidence 111111 11211 112233333332110 0 012 Q ss_pred ccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceee Q lcl|NC_019422. 169 LFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAE 248 (384) Q Consensus 169 ~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~ 248 (384) ..|.|.+..+...++..........+.++..+.|-.+++- . +.+..+... ... ..++++.++++.+++ T Consensus 249 ~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g--~-~~~~~~~~~----~~~-----~~~~~i~~~~~~~~~ 316 (479) T protein:vir:79 249 EKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKE--Y-PGTSLQEFI----DNI-----RYYKSIKVDGGGGVD 316 (479) T ss_pred CCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeec--C-Cccccccch----hhh-----hhccceecCCCCcce Confidence 3578888888888888777776777777777777555432 1 111111111 111 123455566666666 Q ss_pred ecccchhHHHH-HHHHHHHHHHHHHhCCCHHHhc--cccHHH--------------HHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019422. 249 QVKAESYVPNA-AQMDKAIQRLYSFFNTNEKIIQ--SKYSED--------------EWNAYYESEIEPVGLQLSNQYTEK 311 (384) Q Consensus 249 ~l~~~~~~~~~-~~~~~~~~~I~~~fgvp~~~l~--~~~~e~--------------~~~~~~~~~i~P~~~~i~~~l~~~ 311 (384) .+......... ..++.+.+.|+..-++|..-.+ ++.+.. .....+...+..+++.+...++.. T Consensus 317 ~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~ 396 (479) T protein:vir:79 317 KLEINIPVEAKKELLDRLEKNIIIFGQGVNPESQNTGDKSGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKIS 396 (479) T ss_pred EEeccCCHHHHHHHHHHHHHHHHHHhCccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 66544433333 2345566677777777654332 111211 112244455555555555554432 Q ss_pred ccCcccccCcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--C-------Ceee-ecCceeecC Q lcl|NC_019422. 312 LFTRKARSFGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN--G-------DKPV-RRLDTAVVE 381 (384) Q Consensus 312 l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~--g-------d~~~-~~~n~~~~~ 381 (384) -.... ....+++.+..-...|.++.++.+... .|+++...+.++++.-..+. . +.-. ....+...+ T Consensus 397 ~~~~~---~~~~i~i~f~~~~p~~~~~~a~~~~kl-~g~iS~et~l~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~ 472 (479) T protein:vir:79 397 GNKSY---DYKTVQITFNHSMIINEAEKIDMAAKS-TGIVSDETIVSNHPWVEDVNDELERLKKQEDTQKEYDDLIPNNQ 472 (479) T ss_pred CCCcc---ccccceEEeCCCCCcCHHHHHHHHHHH-hccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhccCccc Confidence 21111 122345555555566777777765433 48899988888887643211 0 0000 001111112 Q ss_pred CCC Q lcl|NC_019422. 382 GGE 384 (384) Q Consensus 382 ~ge 384 (384) ++. T Consensus 473 ~~~ 475 (479) T protein:vir:79 473 DGV 475 (479) T ss_pred CCC Confidence 222 No 207 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=96.79 E-value=0.00033 Score=39.64 Aligned_cols=371 Identities=11% Similarity=0.062 Sum_probs=174.2 Q ss_pred Ccchhhhccc----CCCcc---hhHHHhhccccCcce-----e---chhhhhhcHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_019422. 1 MNIFKSKKKN----KEAPG---KVMMELISDSGNGFY-----S---WHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNE 65 (384) Q Consensus 1 M~~f~~~~~~----~~~~~---~~~~~~~~~~~~~~~-----~---~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 65 (384) ++.+...++. ..... .....+..+-..... . ....-+..+....+++..+.-+-+-|+++--.++ T Consensus 38 ~~~~~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~ 117 (501) T protein:vir:27 38 VNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLQFGRRKDREMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDN 117 (501) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccCccCccccccceeccchHHHHHHHHhhhhcccCeeEecCCc Confidence 2322211110 00000 011111111000000 0 0011133567788888888888877877632222 Q ss_pred CcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC------EE Q lcl|NC_019422. 66 TEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV------LF 139 (384) Q Consensus 66 ~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~------~~ 139 (384) +.. ......+.+-...-........+..+++.+|.+|+++.++..|.+. +..++|..+.++.+... .+ T Consensus 118 ~~~-----~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~ded~~~~-i~~~~p~~~~~v~d~~~~~~~~~~i 191 (501) T protein:vir:27 118 DNN-----SQNDDTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYRNEYDETR-IKRLNPLETFVIYDNSLEDNSIAAV 191 (501) T ss_pred cch-----HHHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEeCCCCceE-EEEEccceeEEEecCCCCCceEEEE Confidence 111 1122222222223356678888999999999999999998888653 67788998888776531 11 Q ss_pred EEEEEc--Cce---EEEEehhheEEEecc----------CC---------CCCccCccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 140 LKFLLR--NGK---IVSYPYSDIIHLRKD----------FN---------ENDLFGTSPAKVLEPIMEVVNTTDQGVVKA 195 (384) Q Consensus 140 ~~~~~~--~g~---~~~~~~~evih~~~~----------~~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~ 195 (384) .+|... .+. ...+.++.+.++... ++ .+...|.|.+..+...++..........+. T Consensus 192 r~~~~~~~~~~~~~~~vyt~~~v~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~ 271 (501) T protein:vir:27 192 RYYNRGTLQNAKDVVEIYTNEHIYTLDASDDFNEISVTTHAFGTVPITEFLNNVDGIGDYETELYLIDLYDSAESDTANH 271 (501) T ss_pred EEEEeeecCCcEEEEEEEeCCeEEEEEeCCceeeccccccCCCcccEEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHH Confidence 112211 111 011222222222211 00 112368899998888888887777777777 Q ss_pred HHccCCcceEEeeCCC-CChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHH-HHHHHHHHHHHHh Q lcl|NC_019422. 196 IKNSNTIKWLLKFKTA-LRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAA-QMDKAIQRLYSFF 273 (384) Q Consensus 196 ~~ng~~p~~il~~~~~-~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~-~~~~~~~~I~~~f 273 (384) +.....|-.++.-... ..++.....+.. ..... ...+.....+.+.++..+.....+..+. .++.+.+.|+..- T Consensus 272 ~~~~~~~~~v~~g~~~~~~~~~~~~~~~~---~~~~~-~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s 347 (501) T protein:vir:27 272 MSDMADAILAIYGDLALPKGMQASDMKRT---RLMQL-KPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFT 347 (501) T ss_pred HHHhcCceeeeecCccCCcccchhhhhhc---Cceee-cccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHh Confidence 7766666555432211 112222222111 00010 1112222234555666665554444343 3456677787777 Q ss_pred CCCHHHh---ccccHHHHH--------------HHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCH Q lcl|NC_019422. 274 NTNEKII---QSKYSEDEW--------------NAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASM 336 (384) Q Consensus 274 gvp~~~l---~~~~~e~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~ 336 (384) ++|.... +++.+..+. ...+...+.-+++.+...++..- .........+++.+......+. T Consensus 348 ~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~--~~~~~d~~~i~v~f~~~~p~n~ 425 (501) T protein:vir:27 348 NIPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVN--EFKDFDESLLKITFTPNLPKSL 425 (501) T ss_pred CCcccCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc--cccccccccceEEeCCCCCcCH Confidence 7775332 232222111 12344445555555554443221 1111112234555566666777 Q ss_pred HHHHHHHHHHhCCCCCHHHHHHHhCCCCCC------------CCCeeeecCcee--ecCCCC Q lcl|NC_019422. 337 STKLNLVQMVDRGSLTPNEWRKIMNLSPIE------------NGDKPVRRLDTA--VVEGGE 384 (384) Q Consensus 337 ~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~------------~gd~~~~~~n~~--~~~~ge 384 (384) .+.++.+... .|+++..-+.+++++-..| ..+.-.++..+. .-+.++ T Consensus 426 ~e~ad~~~kl-~g~iS~et~l~~l~~v~D~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~d 486 (501) T protein:vir:27 426 NEQVSILTGL-GGQVSQETALSLSGLVESPNEELDKINKEVSEIDFKGYSNDFNEHVGKYTD 486 (501) T ss_pred HHHHHHHHHH-hccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhHhhhcCccccccccccC Confidence 7777765433 4788888888877653321 011111121111 011111 No 208 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=96.76 E-value=0.00035 Score=39.50 Aligned_cols=348 Identities=8% Similarity=0.048 Sum_probs=145.9 Q ss_pred CcchhhhcccCCCcchhHHHhhccccCc--cee-chhh------hhhcHHHHHHHHHHHHhhccCceEEEEecCCcceec Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDSGNG--FYS-WHGN------LYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTN 71 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~--~~~-~~~~------~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~ 71 (384) =.++....... ..-.....+..+.... ... .... .....+...+++..+..+--..|. ..++. T Consensus 33 ~~l~~~~~~~~-~rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~~v~n~~~~ivd~~a~~l~~~gf~---~~d~~---- 104 (501) T protein:vir:25 33 ADMWRLHISER-QWLDRIYEYTKGLRGRPEVPEGASDEVKELAKLSVKNVLSLVRDSFAQNLSVVGYR---NALAK---- 104 (501) T ss_pred HHHHHHHHHHH-HHHHHHHHHHhcCCCchhccccCChhhhhhHhhhhcChHHHHHHHHHhhhccccee---cCCcc---- Confidence 01222111100 0001111221111100 000 0000 011235566677666544322232 22211 Q ss_pred cchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcC-CC--EE---E-EEEE Q lcl|NC_019422. 72 PEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYEN-EV--LF---L-KFLL 144 (384) Q Consensus 72 ~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~-~~--~~---~-~~~~ 144 (384) .......++.. |. .......+..+.+.+|.||+.+.+++.|. .+..++|..+.+..+. .. .. + ++.. T Consensus 105 ~~~~l~~i~~~-N~---~d~~~~~~~~~a~i~G~ay~~v~~de~~~--~i~~~sp~~~~~iy~D~~~~~~~~~ai~~~~~ 178 (501) T protein:vir:25 105 ENDPAWEMWQR-NR---MDARQAEVHRPALTYGASYVTVTPTDEGP--VFRTRSPRQILAVYADPSVDAWPQYALETWVA 178 (501) T ss_pred chHHHHHHHHh-cC---hhHHHHHHHHHHhhcCceEEEEecCCCCC--eEEEeccccEEEEEecCCCCcceeEEEEEEee Confidence 11223333322 32 34566678899999999999999988874 4667788888765432 11 01 1 1111 Q ss_pred cC--ce---EEEEehh----------------------------------------------heEEEeccCCCCCccCcc Q lcl|NC_019422. 145 RN--GK---IVSYPYS----------------------------------------------DIIHLRKDFNENDLFGTS 173 (384) Q Consensus 145 ~~--g~---~~~~~~~----------------------------------------------evih~~~~~~~~~~~G~s 173 (384) .. +. ...+.+. -|+|+.+.. ...-+|.| T Consensus 179 ~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~-~~~~~g~s 257 (501) T protein:vir:25 179 QKDAKPHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNGR-DADDMIVG 257 (501) T ss_pred ccccCcceeEEEecCeeEEEEecCceeeeeccccccccccccccccccccccccccCCccceeeEeccCcc-ccCccccc Confidence 10 00 0000010 123332211 11235788 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHccCCcceEE-eeCCCCChHHHHHHHHHHHHHhccccccCCcceecC-CCceeeecc Q lcl|NC_019422. 174 PAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLL-KFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATD-SKYDAEQVK 251 (384) Q Consensus 174 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il-~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~-~g~~~~~l~ 251 (384) -++.+...++.............+-.+.|..++ .++ .++.+ .|+. ..++++.++ ++.++.++. T Consensus 258 die~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~G~~----~~~~~----~~~~-------~~~~i~~~~~~~~~~~q~~ 322 (501) T protein:vir:25 258 EVAPLILLQQAINSVNFDRLIVSRFGANPQRVISGWT----GSKAE----VLKA-------SALRVWTFEDPEVKAQAFP 322 (501) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHhCCC----CCccc----hhhh-------cccceeccCCCCceEEEec Confidence 776666555555554444444444444443332 221 11111 1211 123455554 456666665 Q ss_pred cchhHHHHHHHHHHHHHHHHHhCCCHHHhccccH--H-HHH--------------HHHHHHHHHHHHHHHHHHHhhcccC Q lcl|NC_019422. 252 AESYVPNAAQMDKAIQRLYSFFNTNEKIIQSKYS--E-DEW--------------NAYYESEIEPVGLQLSNQYTEKLFT 314 (384) Q Consensus 252 ~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~~~~--e-~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~ 314 (384) ..........++..+.+|++.=++|+..+++... . .+. ...+...+.-+++.+... .. T Consensus 323 ~~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~-----~~ 397 (501) T protein:vir:25 323 PASVEPYNLILEEMLQHVAMVAQISPAQVTGKMINVSAEALAAAEANQQRKLAAKRESFGESWEQLLRLAAEM-----DD 397 (501) T ss_pred ccChHHHHHHHHHHHHHHHhhcCCChhhhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hC Confidence 4433333345678899999999999999885321 1 111 112222222222222211 11 Q ss_pred cccccCcceEEeechhhhccCHHHHHHHH-HHHhCCCCCHHHHHH-HhCCCCCC-----------CCCeee---ecCcee Q lcl|NC_019422. 315 RKARSFGNEIVFEASNLQYASMSTKLNLV-QMVDRGSLTPNEWRK-IMNLSPIE-----------NGDKPV---RRLDTA 378 (384) Q Consensus 315 ~~~~~~~~~i~fd~~~~~~~d~~~~~~~~-~~~~~g~~t~NE~R~-~lG~~p~~-----------~gd~~~---~~~n~~ 378 (384) .........+++.+.+....+..+.++.+ |++..|+ +.-.+.. +.|+++.+ .++..+ .+..-. T Consensus 398 ~~~~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~gi-s~et~~~~~~g~~~~~ie~~~~~~~e~~~~~~~~~~~~~~~~ 476 (501) T protein:vir:25 398 DPDTAADSGAEVLWRDTEARSFGAVVDGITKLASAGI-PIEHLLSMVPGMTQQTIQAIKDSLRGGEVKSLVDKLLSNEPA 476 (501) T ss_pred CCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCC-CHHHHHHHcCCCCHHHHHHHHHHHHHHhHHHHHHHhhccCcC Confidence 11112223556666777777888888765 4555553 3333332 23544311 011000 000000 Q ss_pred ec------------CCCC Q lcl|NC_019422. 379 VV------------EGGE 384 (384) Q Consensus 379 ~~------------~~ge 384 (384) +. +.|+ T Consensus 477 ~~~~~~~~~~~~~~~~~~ 494 (501) T protein:vir:25 477 PVPPPPPQAAAQALNEGG 494 (501) T ss_pred CCCCCCCCCCcccccccc Confidence 00 0011 No 209 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=96.63 E-value=0.00045 Score=38.93 Aligned_cols=365 Identities=8% Similarity=0.076 Sum_probs=168.1 Q ss_pred Cc------------chhhhcccCCCcchhHHHhhccccCcce------echhhhhhcHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_019422. 1 MN------------IFKSKKKNKEAPGKVMMELISDSGNGFY------SWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIR 62 (384) Q Consensus 1 M~------------~f~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~ 62 (384) |. +.+..... ...-.....+..+...... .....-+..+....+|+..+.-+-.-|+++-- T Consensus 11 ~~~~~~~~~~~i~~~i~~~~~~-~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~ 89 (453) T protein:vir:73 11 YSRDEEITDKVVNDFMKKHQEE-VERYEYLGNMYKGIMEISSQKAKDSWKPDNRLTNNFAKYIVDTFVGYFNGIPIKKTH 89 (453) T ss_pred ccccccCCHHHHHHHHHHHHHH-HHHHHHHHHHhccccchhcCCCCCccCccceeecchHHHHHHHhhhhhcccCceeec Confidence 11 10000000 0000000111111000000 00001122456677788777777666666421 Q ss_pred ecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC-E--- Q lcl|NC_019422. 63 SNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV-L--- 138 (384) Q Consensus 63 ~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~-~--- 138 (384) +++ ...+....++ ...........+..+.+.+|.+|+.+..++.|.+. +..++|..+.+..+... . T Consensus 90 -~d~----~~~~~l~~~~----~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~-i~~~~p~~~~~v~dd~~~~~~~ 159 (453) T protein:vir:73 90 -DDK----SVLEAMQLFD----NLNDMEDEESELAKIACVYGRAYELMYQNESTESE-VIYCSPLNVFMVYDDSIKQKPL 159 (453) T ss_pred -CCh----HHHHHHHHHH----HhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCceE-EEEEcccceEEEEeCCCCceeE Confidence 111 1111222222 22345567777889999999999999999888764 66788888877765532 1 Q ss_pred --EEEEEEcCceE--EEEehhheEEEeccC-----------C---------CCCccCccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 139 --FLKFLLRNGKI--VSYPYSDIIHLRKDF-----------N---------ENDLFGTSPAKVLEPIMEVVNTTDQGVVK 194 (384) Q Consensus 139 --~~~~~~~~g~~--~~~~~~evih~~~~~-----------~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~ 194 (384) .+++...+|.. ..+..+.++++.... + .+...|.|.+..+...++..........+ T Consensus 160 ~~i~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~S~~~~ 239 (453) T protein:vir:73 160 FAVYYGFDEEGNLSGTVYTLLETISITGKAGEVKFGESTYNVYSDLPIVEYNFNEERQSIFEPVHSLINSYNKVTSEKAN 239 (453) T ss_pred EEEEEEEecCceEEEEEEeCCeEEEEEecCCceEEccceeccCCceeEEEecCCCCCCcchhhHHHHHHHHHHHHHHHHH Confidence 11122222221 112333333332111 1 11236888888888888877776666666 Q ss_pred HHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHH-HHHHHHHHHHHHh Q lcl|NC_019422. 195 AIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAA-QMDKAIQRLYSFF 273 (384) Q Consensus 195 ~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~-~~~~~~~~I~~~f 273 (384) ..+..+.|..+++ +..+.++..+..+..-. ........+.....+.+.++..+.....+..+. .++.+.+.|+..- T Consensus 240 ~~~~~~~~~l~~~-g~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s 316 (453) T protein:vir:73 240 DVEYFSDQYLVFL-GAEVDEEDAKNIKDNRL--INFFDKNSNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFT 316 (453) T ss_pred HHHHhccceeeee-cCCCCchhhhccccccc--ccccccccccccccccCceeEEeeecCCHHHHHHHHHHHHHHHHHHh Confidence 6666666755553 33444444343322110 011111122233344555666555443333333 3455666676666 Q ss_pred CCCHH---HhccccHHH-------------HHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHH Q lcl|NC_019422. 274 NTNEK---IIQSKYSED-------------EWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMS 337 (384) Q Consensus 274 gvp~~---~l~~~~~e~-------------~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~ 337 (384) ++|.. ..|..+.++ .....+...+..+++.+...++..- ... ....+++.+..-...|.. T Consensus 317 ~~p~~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~-~~~---~~~~i~v~f~~~~p~~~~ 392 (453) T protein:vir:73 317 MAANISDENFGNSSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNAS-NKD---AWKDIEYTFTRNEPKDIK 392 (453) T ss_pred CCcccCcccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-Ccc---ccccceEEeCCCCCCCHH Confidence 66643 222222221 1123444555555555544333221 111 112344445555566777 Q ss_pred HHHHHHHHHhCCCCCHHHHHHHhCCCCCCC------------CCeeeecCceeec--CCCC Q lcl|NC_019422. 338 TKLNLVQMVDRGSLTPNEWRKIMNLSPIEN------------GDKPVRRLDTAVV--EGGE 384 (384) Q Consensus 338 ~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~------------gd~~~~~~n~~~~--~~ge 384 (384) +.++.+.... |+++..-+.++++.-+.+. ....-...+.+.. +-|+ T Consensus 393 ~~a~~~~k~~-giis~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 452 (453) T protein:vir:73 393 EQAETANILK-GITSEETALSVISVIPDVQAEMEKIKKKKLLQLSLTRTSNLVRMKQMRGN 452 (453) T ss_pred HHHHHHHHHh-ccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhccCCcchhhhcC Confidence 7777654332 7888877888887743211 1111111222111 1122 No 210 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=96.62 E-value=0.00046 Score=38.86 Aligned_cols=355 Identities=8% Similarity=0.040 Sum_probs=163.8 Q ss_pred Ccchh----------hhcccCCCcc---hhHHHhhccccCc------ceechhhhhhcHHHHHHHHHHHHhhccCceEEE Q lcl|NC_019422. 1 MNIFK----------SKKKNKEAPG---KVMMELISDSGNG------FYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHI 61 (384) Q Consensus 1 M~~f~----------~~~~~~~~~~---~~~~~~~~~~~~~------~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~ 61 (384) +.++. +.-+.-...- .....+..+.... .......-+..+....+++..+.-+-+-|+++- T Consensus 9 ~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~ 88 (452) T protein:vir:36 9 MTFSKDEPITVEVVTKFMEKHKLEVARYEYLKNMYLGIMAIDDEPAKDSWKPDNRLAVNFTKYIVDTFTGYFNGIPVKKS 88 (452) T ss_pred EEcCCccCCCHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCccccccCccceeecchHHHHHHHHhhhhcccCceee Confidence 11111 0000000000 0011111110000 000000112345677788888877777777652 Q ss_pred EecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC---E Q lcl|NC_019422. 62 RSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV---L 138 (384) Q Consensus 62 ~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~---~ 138 (384) - +++ + ....+..++. .-........+..+.+.+|.+|+.+..+..|.+ .+..++|..+.++.+... . T Consensus 89 ~-~d~--~--~~~~l~~~~~----~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~ 158 (452) T protein:vir:36 89 H-SDK--E--ILTKLQEFDN----LNDMEDEESELAKMACIYGRAFEFLYQDEDTQT-NVVYNSPENMFMVYDDTVKQEP 158 (452) T ss_pred c-CCh--h--HHHHHHHHHh----hcChhHHHHHHHHHHHhcCeEEEEEEecCCCee-EEEEEcccceEEEEcCCCCCce Confidence 1 111 1 1122222222 234556778888999999999999999888865 477788888887766532 1 Q ss_pred E---EEEEEcCce-E-EEEehhheEEEecc-----------CC---------CCCccCccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 139 F---LKFLLRNGK-I-VSYPYSDIIHLRKD-----------FN---------ENDLFGTSPAKVLEPIMEVVNTTDQGVV 193 (384) Q Consensus 139 ~---~~~~~~~g~-~-~~~~~~evih~~~~-----------~~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~ 193 (384) . .++...++. . ..+.++.++++... ++ .+...|.|.+..+...++.......... T Consensus 159 ~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~sd~e~v~~liDa~d~~~s~~~ 238 (452) T protein:vir:36 159 LFAVRYGVDEDKKLQGEVYTLLETIKISGENDEISFGEGTYNPYPDLPVVEFYFNEERMSIFESVISLVNAFNKAISEKA 238 (452) T ss_pred EEEEEEEEecCceEEEEEEecCeEEEEEEcCCceEEecceeccCCcccEEEecCCCCCCcchHHHHHHHHHHHHHHHHHH Confidence 1 111111111 1 11222222222111 00 1123688888888888777777776677 Q ss_pred HHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecC-----CCceeeecccchhHHHH-HHHHHHHH Q lcl|NC_019422. 194 KAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATD-----SKYDAEQVKAESYVPNA-AQMDKAIQ 267 (384) Q Consensus 194 ~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~-----~g~~~~~l~~~~~~~~~-~~~~~~~~ 267 (384) +.+...+.|..++. +..+.++.....+. ++.+.++ .+.++..+........+ ..++.+.+ T Consensus 239 ~~~~~~~~p~~~~~-g~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~ 304 (452) T protein:vir:36 239 NDVDYFSDQYLTFL-GAAVEEEDLKNIRS-------------NRVINYYADGEGKNVDVKFLEKPDSDSQTENLLDRLTK 304 (452) T ss_pred HHHHHhcCceeEee-cCCcCchhhhhhhh-------------cceEEecCCCCccCCcceeEeecCCHHHHHHHHHHHHH Confidence 76766677755543 33444433322211 1112211 22334444433333333 33456667 Q ss_pred HHHHHhCCCHHH---hccccHHH-------------HHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhh Q lcl|NC_019422. 268 RLYSFFNTNEKI---IQSKYSED-------------EWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNL 331 (384) Q Consensus 268 ~I~~~fgvp~~~---l~~~~~e~-------------~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~ 331 (384) .|+..-++|..- +|....++ .....+...+..+++.+...++.. -.. .....+++.+..- T Consensus 305 ~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~-~~~---~~~~~i~i~f~~~ 380 (452) T protein:vir:36 305 LIFQTTMVANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNV-SNK---DSWKDIEYTFTRN 380 (452) T ss_pred HHHHHhCccccCcccccCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-CCc---cccccceEEeCCC Confidence 777777777532 22222221 112344455555555555444332 111 1112344444555 Q ss_pred hccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC----------CCeeeecCceeecCCCC Q lcl|NC_019422. 332 QYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN----------GDKPVRRLDTAVVEGGE 384 (384) Q Consensus 332 ~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~----------gd~~~~~~n~~~~~~ge 384 (384) ...|..+.++.+... .|+++..-+.++++.-..+. ....-...+..+-+.|+ T Consensus 381 ~p~d~~~~a~~~~k~-~g~iS~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 442 (452) T protein:vir:36 381 EPKDIKEQAETANIL-MGITSQETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQPSEKGT 442 (452) T ss_pred CCcCHHHHHHHHHHH-hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccCCCCcc Confidence 566777777765432 47889888888887643211 01001111111111111 No 211 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=96.57 E-value=0.0005 Score=38.68 Aligned_cols=379 Identities=11% Similarity=0.052 Sum_probs=174.8 Q ss_pred Cc-chhh----hcccCCCcchhHH-------HhhccccCc-ceech-------------hhhhhcHHHHHHHHHHHHhhc Q lcl|NC_019422. 1 MN-IFKS----KKKNKEAPGKVMM-------ELISDSGNG-FYSWH-------------GNLYKSDIVRSIIRPKAKAVG 54 (384) Q Consensus 1 M~-~f~~----~~~~~~~~~~~~~-------~~~~~~~~~-~~~~~-------------~~~~~~~~v~~~i~~ia~~ia 54 (384) |. ||+. ++.....++.... ...++...+ +.... +....+|.|.+||+.|.+.+. T Consensus 1 m~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~i~~~~~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneai 80 (533) T protein:vir:10 1 MSQLFGFSLERAKKAPKGPSFVQKDNLDGSQPVSGGGYYGYTVDFDGQVRNEYQLISRYREMVLQPECDSAVDDIVNETI 80 (533) T ss_pred CccccccccccccccccCCCCCCCCcccccceeecccccceeeecccccchHHHHHHHHHHHhhccchhhHHHHhhccee Confidence 43 3442 1111111111111 111111111 11111 123458899999999998886 Q ss_pred cCc-----eEEEEecCCcceeccchHHHHHHhhcc---ccCCHHHHHHHHHHHHHHhCCeeEEEeeCC---CCceeeEEE Q lcl|NC_019422. 55 KMT-----AKHIRSNETEFKTNPEIYIKFLLENPN---PFMSGQILQEKMVTQLELNSNAFAVIIKDD---YNMPTQIYP 123 (384) Q Consensus 55 ~~~-----~~~~~~~~~~~~~~~~~~~~~l~~~PN---~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~---~g~~~~l~~ 123 (384) -+. ..+--.+-+-.+... ..++..-+ ..+++..--+.++..|.+.|..|..++-+. +.-..+|+. T Consensus 81 v~d~~~~pV~i~Ld~~~~s~~iK----~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~pk~GI~ELr~ 156 (533) T protein:vir:10 81 CGNFDDVPVSVELSNLKVSDKIK----KLIREEFGEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPDNPQGGLIELRY 156 (533) T ss_pred eecCCCceEEEEecccccchHHH----HHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCCCccccceeeee Confidence 442 222111111111111 11111111 123344445566778889999999987653 334889999 Q ss_pred EcCceEEEEEcC-----CCE------------EE-EEEEc-C------ceEEEEehhheEEEeccC--CCCCccCccHHH Q lcl|NC_019422. 124 LNALNVEAIYEN-----EVL------------FL-KFLLR-N------GKIVSYPYSDIIHLRKDF--NENDLFGTSPAK 176 (384) Q Consensus 124 l~~~~v~~~~~~-----~~~------------~~-~~~~~-~------g~~~~~~~~evih~~~~~--~~~~~~G~s~~~ 176 (384) |||..++.++.. ++. .. +|.++ . ++.+.++. +.|++-+.. +.++..-+|-+. T Consensus 157 lDPr~i~~vr~i~~~~~~~~~~~~~~~~v~~~~~eyf~Ynp~g~~~~~~~~vkI~~-dAI~y~hSGl~d~~~~~i~syLh 235 (533) T protein:vir:10 157 IDPRKIRKINETEQKRPEQLRGLPLNQQLSPKSAEYFLYDPKGLKNSTTQGLKIAP-DSICYVHSGIMDLNKNMTLSHLH 235 (533) T ss_pred ccccceeeeeeeeccCCCccceeecchhhhccceeeeeeccccccccCCCceecch-hheeeeeccceeCCCCceeccch Confidence 999999875332 110 00 11111 1 23344554 555554332 233444568899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHHHHHHHHHHHHhcc---------ccccCCcc-------- Q lcl|NC_019422. 177 VLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDIKKEVKSFEKNYLQ---------IDSEAGGA-------- 238 (384) Q Consensus 177 ~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~~~~~~~~~~~---------~~~~~~~~-------- 238 (384) .|.+.+..+.-......=+--..+.-+-++.++ |.+....+++....+-.+|+. ...+..+. T Consensus 236 kAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyW 315 (533) T protein:vir:10 236 KAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFW 315 (533) T ss_pred HhHHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhhhc Confidence 999998888877777666655555556677776 445444455555555544442 11111111 Q ss_pred eec---CCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhcc-------ccHHHHH-----HHHHHHHHHHHHHH Q lcl|NC_019422. 239 AAT---DSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQS-------KYSEDEW-----NAYYESEIEPVGLQ 303 (384) Q Consensus 239 ~v~---~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~-------~~~e~~~-----~~~~~~~i~P~~~~ 303 (384) ++- +.|.+++.|.-...-.++...++..+.+..+++||.+-|.. ..+|-.+ .-|+..-=.-+... T Consensus 316 LPRReGgrgTEItTLpGgqnLgem~DV~YF~kKLY~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~l 395 (533) T protein:vir:10 316 LPRREGGRGTEITTLPGGQNLGELEDVKYFQKKLYKSLNVPGSRLETETTFNVGRAAEITRDEVKFQKFVARLRKRFSEL 395 (533) T ss_pred ccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHH Confidence 111 23455555554555556777889999999999999987742 1223222 12322221222333 Q ss_pred HHHHHhhcccCc-----cc---ccCcceEEeechh----hhccC-HHHHHHHHHH---HhCCCCCHHHHHHH-hCC---- Q lcl|NC_019422. 304 LSNQYTEKLFTR-----KA---RSFGNEIVFEASN----LQYAS-MSTKLNLVQM---VDRGSLTPNEWRKI-MNL---- 362 (384) Q Consensus 304 i~~~l~~~l~~~-----~~---~~~~~~i~fd~~~----~~~~d-~~~~~~~~~~---~~~g~~t~NE~R~~-lG~---- 362 (384) +.+.|..+|+.+ .+ ......+.|..|. ++... +..|+.+++. +.+-.++.+=+|+. |.+ T Consensus 396 F~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDee 475 (533) T protein:vir:10 396 FTDLLKTQLVLKGVISIEEWDQMKEHIQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFSVEYMRRQVLKQTDVE 475 (533) T ss_pred HHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHH Confidence 344444444332 11 1123334443333 22222 3344444332 23335565555432 111 Q ss_pred --------------C--CCCCCCeeeecCceeecCCCC Q lcl|NC_019422. 363 --------------S--PIENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 363 --------------~--p~~~gd~~~~~~n~~~~~~ge 384 (384) + +.|..+.-..++...|=-+|. T Consensus 476 i~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~ 513 (533) T protein:vir:10 476 MKEIDKQIESEMESGIIADPAAEMDPAMAAGDPDAGGA 513 (533) T ss_pred HHHHHHHHHHHHhCCCCCCCcchhhHHhcCCCCCcCCc Confidence 1 112111111111111111111 No 212 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=96.56 E-value=0.00051 Score=38.64 Aligned_cols=368 Identities=11% Similarity=0.019 Sum_probs=169.5 Q ss_pred Ccchhhhccc----CCC-c---chhHH-Hh---hccc--------------cCccee-chhhhhhcHHHHHHHHHHHHhh Q lcl|NC_019422. 1 MNIFKSKKKN----KEA-P---GKVMM-EL---ISDS--------------GNGFYS-WHGNLYKSDIVRSIIRPKAKAV 53 (384) Q Consensus 1 M~~f~~~~~~----~~~-~---~~~~~-~~---~~~~--------------~~~~~~-~~~~~~~~~~v~~~i~~ia~~i 53 (384) ||+|...++. ... + +.... .. +.+. ...+.. .....++.+....+++.+|+.+ T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~l~~~i~~~~A~ll 80 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTVHDKLMNSGTGNEIVVVAAEYI 80 (518) T ss_pred CcchhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCccccccccCChHHHHHHHHHHhh Confidence 9999975433 111 1 11111 11 1111 111110 1112234445567788888877 Q ss_pred ccCceEEEEecCCc-ceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEE Q lcl|NC_019422. 54 GKMTAKHIRSNETE-FKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAI 132 (384) Q Consensus 54 a~~~~~~~~~~~~~-~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~ 132 (384) ..=+..+--.+.+. ..+.....+..++.. -.+..-++..+.+.+..|.+++.+..+. |++ .+-.+++..+-+. T Consensus 81 ~~e~~~i~v~~~~~~d~e~~~~~l~~il~~----n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~-~i~~v~ad~~~P~ 154 (518) T protein:vir:78 81 SGKPLSIDVTGVNGSKDENLTKQLKEALRI----DNFDSKSVKIVELAGGSGVSAVKINILN-GRP-SISVHSSSQFWID 154 (518) T ss_pred cCCCceEEecCccccCcHHHHHHHHHHHHh----ccHHHHHHHHHHHhhccCceEEEEEEEC-Cee-EEEEEcCCeeEEE Confidence 65444331011110 111111122222221 2234455566778888999998887653 443 4555666666553 Q ss_pred EcCC----------------CEEEE---EE-----------------------EcCceEEEE-------------ehh-- Q lcl|NC_019422. 133 YENE----------------VLFLK---FL-----------------------LRNGKIVSY-------------PYS-- 155 (384) Q Consensus 133 ~~~~----------------~~~~~---~~-----------------------~~~g~~~~~-------------~~~-- 155 (384) ...+ ..+|. ++ ...+..+.. ..+ T Consensus 155 ~~~g~~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~~~~ 234 (518) T protein:vir:78 155 FKNNEPFRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGDKTTPISAERLPEQITSYLHTNDI 234 (518) T ss_pred eecCcEEEEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEeeecCcccccccccccccccccccccccC Confidence 2211 11110 10 000100000 000 Q ss_pred -------------heEEEeccCCC----CCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEe-----eCCCCC Q lcl|NC_019422. 156 -------------DIIHLRKDFNE----NDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLK-----FKTALR 213 (384) Q Consensus 156 -------------evih~~~~~~~----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~-----~~~~~~ 213 (384) -+.|++...+. +...|+|.+..+...++.++........-|+. +++..++. ...... T Consensus 235 ~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~ 313 (518) T protein:vir:78 235 QLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEK-TKTKIAASERMFRKKVNKS 313 (518) T ss_pred ccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHh-CCceeeechhHhccCCCCC Confidence 01222221111 23569999999999999998888888888876 45544441 111110 Q ss_pred -hHHHHHHHHHHHHHhccccccCCcceecCCCc----eeeecccchhHHHHH-HHHHHHHHHHHHhCCCHHHhcccc--- Q lcl|NC_019422. 214 -PDDIKKEVKSFEKNYLQIDSEAGGAAATDSKY----DAEQVKAESYVPNAA-QMDKAIQRLYSFFNTNEKIIQSKY--- 284 (384) Q Consensus 214 -~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~----~~~~l~~~~~~~~~~-~~~~~~~~I~~~fgvp~~~l~~~~--- 284 (384) ....-... .=.+.|..... -.++|. .++.++....+.+.. .++...++|....|++|..+|.++ T Consensus 314 ~~~~~~~fd-~~~~~y~~i~~------~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~~~ 386 (518) T protein:vir:78 314 TDKEEWSMN-VDEDYFMQFKG------TLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGNREV 386 (518) T ss_pred CCccccccC-CCCceEEEecC------cCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCcccccc Confidence 00000000 00011111000 012222 266666666565554 457788999999999999886432 Q ss_pred --HHHH------------HHHHHHHHHHHHHHHHHHHHhhcccC--cccccCcceEEeechhhhccCHHHHHHHHH-HHh Q lcl|NC_019422. 285 --SEDE------------WNAYYESEIEPVGLQLSNQYTEKLFT--RKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVD 347 (384) Q Consensus 285 --~e~~------------~~~~~~~~i~P~~~~i~~~l~~~l~~--~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~ 347 (384) +|-. ....+...+.-++..+.+.+...... .........+.+++++....|.++.++..+ ++. T Consensus 387 TATei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~~~~~~v~ 466 (518) T protein:vir:78 387 KATEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSSTLNNMNS 466 (518) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCCCCCCHHHHHHHHHHHHh Confidence 1110 01233333433444443333221111 111122345778888888888888887764 788 Q ss_pred CCCCCHHHHHHHh--CCCCCCCCCeee---------------ecCceeecCCC Q lcl|NC_019422. 348 RGSLTPNEWRKIM--NLSPIENGDKPV---------------RRLDTAVVEGG 383 (384) Q Consensus 348 ~g~~t~NE~R~~l--G~~p~~~gd~~~---------------~~~n~~~~~~g 383 (384) .|+|++.++-+++ |++. ++.++-+ .+..-..-++| T Consensus 467 aGimS~e~~i~~~~~~~~d-eea~~e~~ri~~E~~~~~~~~p~~~~g~~~~~g 518 (518) T protein:vir:78 467 ALAMSVEEKVKLIHPKWED-EEIQAEVKRIYLENAIGEVPDPEAIGGMETKGG 518 (518) T ss_pred cCCCCHHHHHHHhCCCCCH-HHHHHHHHHHHHHhcccCCCCCccccCCCCCCC Confidence 8999999866554 3332 1111000 01111233444 No 213 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=96.56 E-value=0.00051 Score=38.63 Aligned_cols=349 Identities=8% Similarity=0.039 Sum_probs=165.6 Q ss_pred CcchhhhcccCCCcchhHHHhhccccCcc------eechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccch Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDSGNGF------YSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTNPEI 74 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~ 74 (384) ..+.+...... ..-.....+..+..... ......-+..+....+|+..+.-+-+-|+.+- .+++ .... T Consensus 7 ~~~i~~~~~~~-~r~~~l~~yy~g~~~il~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~-~~~~----~~~~ 80 (429) T protein:vir:98 7 SELIQKHRSFN-LSYSAYKQLYEGDHAILQQKQKEQYKPDNRLVVNFAKYIVDTFNGYFIGVPVQTS-HENK----QVSN 80 (429) T ss_pred HHHHHHHHHHH-HHHHHHHHHhccccccccccccccCCCcceeecchHHHHHHHHhhhhcccCceee-cCCh----HHHH Confidence 11111100000 00000111111100000 00001123456778888888888877777652 1111 1112 Q ss_pred HHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCE------EEEEEEcCce Q lcl|NC_019422. 75 YIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVL------FLKFLLRNGK 148 (384) Q Consensus 75 ~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~------~~~~~~~~g~ 148 (384) ....++.. .........+..+.+.+|.+|+.+..+..|.+. +..++|..+.+..+.... ..++...++. T Consensus 81 ~l~~~~~~----n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~-~~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~ 155 (429) T protein:vir:98 81 YLELLDGY----NDQDDNNAELSKICSIYGHGYELVFNDENAEAG-ITYLTPLEAFIVYDDSIRQKPLFAVRYFYNKGGV 155 (429) T ss_pred HHHHHHhh----cCHhHHHHHHHHHHhhcCeEEEEEEecCCCcEE-EEEEcccceEEEEeCCCCCceEEEEEEEEecCce Confidence 23333322 234567788889999999999999999888754 677888888777654221 1122212211 Q ss_pred EE-EEehhh--------------------------eEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCC Q lcl|NC_019422. 149 IV-SYPYSD--------------------------IIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNT 201 (384) Q Consensus 149 ~~-~~~~~e--------------------------vih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~ 201 (384) .. .+...+ |++++ +...|.|.+..+...++..........+..+..+. T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~ 230 (429) T protein:vir:98 156 LEGSYSDASNITYFKDGEKGIEIGESEPHPFDGVPMIEYV-----ENEERQSLLASVVTLINAFNKAISEKANDVEYFAD 230 (429) T ss_pred EEEEEEeCceEEEEEecCCceEecccccccCCccceEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 10 111111 22222 22468889998888888887777777777777777 Q ss_pred cceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecC----CCceeeecccchhHHHHH-HHHHHHHHHHHHhCCC Q lcl|NC_019422. 202 IKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATD----SKYDAEQVKAESYVPNAA-QMDKAIQRLYSFFNTN 276 (384) Q Consensus 202 p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~----~g~~~~~l~~~~~~~~~~-~~~~~~~~I~~~fgvp 276 (384) |-.+++ +...+++..+..+. .+++.++ .+.++..+........+. .++.+.+.|+..-++| T Consensus 231 p~~~i~-g~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p 296 (429) T protein:vir:98 231 AYLKIL-GAELDDETLKSLRD-------------TRIINLKDTDAQQLTVEFLQKPDADATQEHLLDRLENLIFRTAMVA 296 (429) T ss_pred ceeeee-cCCCCcchhhhHhh-------------CceeeccCCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCcc Confidence 765554 33333332222111 1222221 123444444433333232 3466777777777776 Q ss_pred HHHh---ccccHHH-------------HHHHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHH Q lcl|NC_019422. 277 EKII---QSKYSED-------------EWNAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKL 340 (384) Q Consensus 277 ~~~l---~~~~~e~-------------~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~ 340 (384) ..-. |..+.++ .....+...+.-+++.+...++..-- . .....+++.+......|..+.+ T Consensus 297 ~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~-~---~d~~~i~v~f~~~~p~~~~~~a 372 (429) T protein:vir:98 297 NISDESFGTASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSKIG-P---KDWIGIKYKFTRNLPANLLEES 372 (429) T ss_pred ccCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC-c---cccccceEEeCCCCCcCHHHHH Confidence 4322 2211111 11123334444444444444332211 1 1112345555666667777777 Q ss_pred HHHHHHhCCCCCHHHHHHHhCCCCCCCC---------Ceee-ecCceeecCCCC Q lcl|NC_019422. 341 NLVQMVDRGSLTPNEWRKIMNLSPIENG---------DKPV-RRLDTAVVEGGE 384 (384) Q Consensus 341 ~~~~~~~~g~~t~NE~R~~lG~~p~~~g---------d~~~-~~~n~~~~~~ge 384 (384) +.+... .|+++..-+.+++|.-+.|.. +... ...+....++++ T Consensus 373 ~~~~kl-~g~is~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 425 (429) T protein:vir:98 373 QIAGNL-AGIVSEETQVGVLSIVENPQKEIERKNSDKSTLISRQAGGLNGQNTT 425 (429) T ss_pred HHHHHH-hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhcCCCCC Confidence 765433 578998888888887543210 0000 001111111222 No 214 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=96.53 E-value=0.00054 Score=38.51 Aligned_cols=352 Identities=9% Similarity=0.021 Sum_probs=164.6 Q ss_pred Ccchh-hhc---ccCCCcchh---HHHhhccccCc-------------ceechhhhhhcHHHHHHHHHHHHhhccCceEE Q lcl|NC_019422. 1 MNIFK-SKK---KNKEAPGKV---MMELISDSGNG-------------FYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKH 60 (384) Q Consensus 1 M~~f~-~~~---~~~~~~~~~---~~~~~~~~~~~-------------~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~ 60 (384) ..+.. ... +.....-+. ...+..+.... ......+-...+.....++..+.-+-+-|+++ T Consensus 25 ~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~ 104 (474) T protein:vir:96 25 VETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSYVAGKPVTY 104 (474) T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhhhcccCcee Confidence 00000 000 000000000 01111110000 00000111234567788888888888888775 Q ss_pred EEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC--E Q lcl|NC_019422. 61 IRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV--L 138 (384) Q Consensus 61 ~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~ 138 (384) -- ++ .+ .......++. | ........+..+++.+|.+|..+.++..|.+ .+..++|..+.++.+... . T Consensus 105 ~~-~~--~~--~~~~l~~~~~--n---~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~ 173 (474) T protein:vir:96 105 AH-DD--DK--VLDVIHQVLD--T---RWDNKLIDILTAASNKGIDWLQVYINEDGEL-KLFRVPAEQAIPIWTDKEREQ 173 (474) T ss_pred cc-CC--hH--HHHHHHHHHh--c---cHHHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEEcCCCCCc Confidence 21 11 11 1122222322 2 3556677788999999999999999888865 577788888888765431 1 Q ss_pred ----EEEEEEcCce-EEEEehhheEEEeccC---------------------C---------CCCccCccHHHHHHHHHH Q lcl|NC_019422. 139 ----FLKFLLRNGK-IVSYPYSDIIHLRKDF---------------------N---------ENDLFGTSPAKVLEPIME 183 (384) Q Consensus 139 ----~~~~~~~~g~-~~~~~~~evih~~~~~---------------------~---------~~~~~G~s~~~~~~~~i~ 183 (384) +.+|...+.. ...+.++.+.++..-. + .+...|.|.+..+...++ T Consensus 174 ~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d~e~v~~liD 253 (474) T protein:vir:96 174 LNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNNPEEVSDIWMYKSFVD 253 (474) T ss_pred eEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccceEEecCCCCCCCchHHHHHHHH Confidence 1122221111 1112233333332110 0 012357888888888888 Q ss_pred HHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHH-HHH Q lcl|NC_019422. 184 VVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNA-AQM 262 (384) Q Consensus 184 ~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~-~~~ 262 (384) ....+.....+.+...+.|-.+++ +. +.+........+. ..+++.++++.++..+........+ ..+ T Consensus 254 a~d~~~S~~~~~~~~~~~p~lv~~--g~-~~~~~~~~~~~~~---------~~~~i~~~~~~~~~~l~~~~~~~~~~~~~ 321 (474) T protein:vir:96 254 AIDKRLSDVQNMFDESVELIYILR--GY-EGEDLSEFMEGLK---------YYKAINVSSDGGVETIQVEVPVASTKEYL 321 (474) T ss_pred HHHHHHHHHHHHHHHhhcchhhhc--CC-Ccccccchhhhhh---------ccceeeccCCCceeEEeccCCHHHHHHHH Confidence 777766666666666666644442 21 1111111111111 2245556666666666554444333 344 Q ss_pred HHHHHHHHHHhCCCHHH---hccccHHHHH--------------HHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEE Q lcl|NC_019422. 263 DKAIQRLYSFFNTNEKI---IQSKYSEDEW--------------NAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIV 325 (384) Q Consensus 263 ~~~~~~I~~~fgvp~~~---l~~~~~e~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~ 325 (384) +.+.+.|+..-++|..- ++++.+..+. ...+...+..+++.+...+.. .. ....++ T Consensus 322 ~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~----~~---d~~~i~ 394 (474) T protein:vir:96 322 DMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKI----KL---DAKEIE 394 (474) T ss_pred HHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----Cc---ccceee Confidence 56667787777777533 2333332221 123334444444444433221 11 122344 Q ss_pred eechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCee-----eecCceeecCCCC Q lcl|NC_019422. 326 FEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN--GDKP-----VRRLDTAVVEGGE 384 (384) Q Consensus 326 fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~--gd~~-----~~~~n~~~~~~ge 384 (384) +.+..-...+..+.++.++ ..|+++...+++++++-..+. .+.+ -...++..+.+++ T Consensus 395 i~f~~~~p~~~~e~a~~~~--~~giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~~~~~~~~~~~ 458 (474) T protein:vir:96 395 ITFNFNVMVNDLEQSQIGA--QSQYLSKETLVRHHPWVDDPKAELERLDEEQLELNKQLPNLDDGG 458 (474) T ss_pred EEecCCCccCHHHHHHHHH--HcCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccccc Confidence 4455555556666665543 469999988888887644321 1100 0001111111111 No 215 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=96.53 E-value=0.00054 Score=38.51 Aligned_cols=352 Identities=9% Similarity=0.021 Sum_probs=164.6 Q ss_pred Ccchh-hhc---ccCCCcchh---HHHhhccccCc-------------ceechhhhhhcHHHHHHHHHHHHhhccCceEE Q lcl|NC_019422. 1 MNIFK-SKK---KNKEAPGKV---MMELISDSGNG-------------FYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKH 60 (384) Q Consensus 1 M~~f~-~~~---~~~~~~~~~---~~~~~~~~~~~-------------~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~ 60 (384) ..+.. ... +.....-+. ...+..+.... ......+-...+.....++..+.-+-+-|+++ T Consensus 25 ~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~ 104 (474) T protein:vir:95 25 VETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSYVAGKPVTY 104 (474) T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhhhcccCcee Confidence 00000 000 000000000 01111110000 00000111234567788888888888888775 Q ss_pred EEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC--E Q lcl|NC_019422. 61 IRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV--L 138 (384) Q Consensus 61 ~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~ 138 (384) -- ++ .+ .......++. | ........+..+++.+|.+|..+.++..|.+ .+..++|..+.++.+... . T Consensus 105 ~~-~~--~~--~~~~l~~~~~--n---~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~ 173 (474) T protein:vir:95 105 AH-DD--DK--VLDVIHQVLD--T---RWDNKLIDILTAASNKGIDWLQVYINEDGEL-KLFRVPAEQAIPIWTDKEREQ 173 (474) T ss_pred cc-CC--hH--HHHHHHHHHh--c---cHHHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEEcCCCCCc Confidence 21 11 11 1122222322 2 3556677788999999999999999888865 577788888888765431 1 Q ss_pred ----EEEEEEcCce-EEEEehhheEEEeccC---------------------C---------CCCccCccHHHHHHHHHH Q lcl|NC_019422. 139 ----FLKFLLRNGK-IVSYPYSDIIHLRKDF---------------------N---------ENDLFGTSPAKVLEPIME 183 (384) Q Consensus 139 ----~~~~~~~~g~-~~~~~~~evih~~~~~---------------------~---------~~~~~G~s~~~~~~~~i~ 183 (384) +.+|...+.. ...+.++.+.++..-. + .+...|.|.+..+...++ T Consensus 174 ~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d~e~v~~liD 253 (474) T protein:vir:95 174 LNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNNPEEVSDIWMYKSFVD 253 (474) T ss_pred eEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccceEEecCCCCCCCchHHHHHHHH Confidence 1122221111 1112233333332110 0 012357888888888888 Q ss_pred HHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHH-HHH Q lcl|NC_019422. 184 VVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNA-AQM 262 (384) Q Consensus 184 ~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~-~~~ 262 (384) ....+.....+.+...+.|-.+++ +. +.+........+. ..+++.++++.++..+........+ ..+ T Consensus 254 a~d~~~S~~~~~~~~~~~p~lv~~--g~-~~~~~~~~~~~~~---------~~~~i~~~~~~~~~~l~~~~~~~~~~~~~ 321 (474) T protein:vir:95 254 AIDKRLSDVQNMFDESVELIYILR--GY-EGEDLSEFMEGLK---------YYKAINVSSDGGVETIQVEVPVASTKEYL 321 (474) T ss_pred HHHHHHHHHHHHHHHhhcchhhhc--CC-Ccccccchhhhhh---------ccceeeccCCCceeEEeccCCHHHHHHHH Confidence 777766666666666666644442 21 1111111111111 2245556666666666554444333 344 Q ss_pred HHHHHHHHHHhCCCHHH---hccccHHHHH--------------HHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEE Q lcl|NC_019422. 263 DKAIQRLYSFFNTNEKI---IQSKYSEDEW--------------NAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIV 325 (384) Q Consensus 263 ~~~~~~I~~~fgvp~~~---l~~~~~e~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~ 325 (384) +.+.+.|+..-++|..- ++++.+..+. ...+...+..+++.+...+.. .. ....++ T Consensus 322 ~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~----~~---d~~~i~ 394 (474) T protein:vir:95 322 DMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKI----KL---DAKEIE 394 (474) T ss_pred HHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----Cc---ccceee Confidence 56667787777777533 2333332221 123334444444444433221 11 122344 Q ss_pred eechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCee-----eecCceeecCCCC Q lcl|NC_019422. 326 FEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN--GDKP-----VRRLDTAVVEGGE 384 (384) Q Consensus 326 fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~--gd~~-----~~~~n~~~~~~ge 384 (384) +.+..-...+..+.++.++ ..|+++...+++++++-..+. .+.+ -...++..+.+++ T Consensus 395 i~f~~~~p~~~~e~a~~~~--~~giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~~~~~~~~~~~ 458 (474) T protein:vir:95 395 ITFNFNVMVNDLEQSQIGA--QSQYLSKETLVRHHPWVDDPKAELERLDEEQLELNKQLPNLDDGG 458 (474) T ss_pred EEecCCCccCHHHHHHHHH--HcCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccccc Confidence 4455555556666665543 469999988888887644321 1100 0001111111111 No 216 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=96.45 E-value=0.00061 Score=38.20 Aligned_cols=370 Identities=9% Similarity=0.050 Sum_probs=160.3 Q ss_pred CcchhhhcccCCCcchhHHHhhccccCcc------e---echhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcceec Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDSGNGF------Y---SWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFKTN 71 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~------~---~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~ 71 (384) -.+.++........-.....+..+..... . .....-...+....+++..+.-+-+-|+++-- +++ . T Consensus 28 ~~li~~~~~~~~~r~~~l~~YY~g~~~~i~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~G~p~~~~~-~d~----~ 102 (506) T protein:vir:94 28 MKFITHHFNYQRPRLEMLDDYYQGYNLKILDKQSRRHEDGKADHRATHSFAKYIADFQTSYSVGNPINVKL-PDD----G 102 (506) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccCCcceeecchHHHHHHHhhhhhcccCceeec-Ccc----h Confidence 11111111100000001111221111000 0 00011134567788888888887777776521 111 1 Q ss_pred cchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCE------EEEEEE- Q lcl|NC_019422. 72 PEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVL------FLKFLL- 144 (384) Q Consensus 72 ~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~------~~~~~~- 144 (384) .......++. ..........+..+.+.+|.+|+.+..+..|.+ .+..++|..+.++.+.... +.+|.. T Consensus 103 ~~~~l~~~~~----~N~~~~~~~~~~~~~~~~G~a~~~v~~ded~~~-~i~~~~p~~~~~v~dd~~~~~~~~~v~~~~~~ 177 (506) T protein:vir:94 103 SNSGFDTFNK----ANDVDAENYDLFLDMSRYGRAYEYVYRGEDNEE-HLAKLDPLDTFVIYSTDVDPKPIMAVRYHQIE 177 (506) T ss_pred HHHHHHHHHh----ccCHhHHHHHHHHHHHhcCeEEEEEEecCCCee-EEEEEcccceEEEecCCCCCceEEEEEEEeee Confidence 1122333332 234556677788899999999999999888865 4667888888887764321 111111 Q ss_pred -cCce----E----EEEehhheEEEec-----------cCCC---------CCccCccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 145 -RNGK----I----VSYPYSDIIHLRK-----------DFNE---------NDLFGTSPAKVLEPIMEVVNTTDQGVVKA 195 (384) Q Consensus 145 -~~g~----~----~~~~~~evih~~~-----------~~~~---------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~ 195 (384) ..+. . ..+.+..+.++.. .++. +.-.|.|.+......++....+.....+. T Consensus 178 ~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~~~~liDa~d~~~S~~~~~ 257 (506) T protein:vir:94 178 LVDDNQVSTINYVPETWTADTYTLYNPTPIMGKMQVDTTKPITTFPVVEFKNSNFRLGDFENVLPLIDLYDAAQSDTANY 257 (506) T ss_pred eccCCceeEEEEEEEEEeCceEEEeccccCccceeccccccCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHH Confidence 0110 0 0011111111100 0000 11246677776666666655544444433 Q ss_pred HHccCCcceEEeeCC------------------CC---ChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccch Q lcl|NC_019422. 196 IKNSNTIKWLLKFKT------------------AL---RPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAES 254 (384) Q Consensus 196 ~~ng~~p~~il~~~~------------------~~---~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~ 254 (384) ......|-.+++-.. .. ..+........+...-.-.....+.+...+.+.+++.+.... T Consensus 258 ~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~ 337 (506) T protein:vir:94 258 MTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKEMKDANMLLLKSGMTVNGTQTSVDAKYINKTY 337 (506) T ss_pred HHHhhhHHHHHhcCccccccchhccccccccccccccccccchhHHHhhhhhcCeeeecccccccCccccccceeeeecC Confidence 333222322221100 00 011111111111110000011112222233445566665554 Q ss_pred hHHHH-HHHHHHHHHHHHHhCCCHHH---hccccHHH--------------HHHHHHHHHHHHHHHHHHHHHhhcccCcc Q lcl|NC_019422. 255 YVPNA-AQMDKAIQRLYSFFNTNEKI---IQSKYSED--------------EWNAYYESEIEPVGLQLSNQYTEKLFTRK 316 (384) Q Consensus 255 ~~~~~-~~~~~~~~~I~~~fgvp~~~---l~~~~~e~--------------~~~~~~~~~i~P~~~~i~~~l~~~l~~~~ 316 (384) ....+ ..++.+...|+..-++|..- ++++.+.. .....+...+..+++.+...++.. ... T Consensus 338 ~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Aik~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~--~~~ 415 (506) T protein:vir:94 338 DVVGSEAYKKRVAGDIHKFSHTPDLTDENFASNSSGVAMQYKVLGTVELASTKRRMFERGLYARYQIISDIENSI--HGD 415 (506) T ss_pred CHHHHHHHHHHHHHHHHHHhCccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCc Confidence 44443 34566778888888888632 22222221 122345555666665555554421 110 Q ss_pred cccCcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC------------CCeeeecCceeecCCCC Q lcl|NC_019422. 317 ARSFGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN------------GDKPVRRLDTAVVEGGE 384 (384) Q Consensus 317 ~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~------------gd~~~~~~n~~~~~~ge 384 (384) .......+++.+..-...|..+.++.+... .|+++...++++++.-..|. .++...... ..-+.++ T Consensus 416 ~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl-~g~iS~et~~~~lp~v~d~~~E~~ri~~E~~~~~~~~~~~~-~~~~~~~ 493 (506) T protein:vir:94 416 WTFDPQELTFTFRDNLPADNISQIKALVQA-GATLPQKYLYQQLPGVTNPQDIVDMMKEQSANGDYSFDQNG-VISNDGQ 493 (506) T ss_pred cccccccceEEeCCCCCcCHHHHHHHHHHH-hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHhhcchhhc-CCCcccC Confidence 011112344555665667777777765433 48999999999987644211 111111110 0001111 No 217 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=96.28 E-value=0.00079 Score=37.58 Aligned_cols=379 Identities=9% Similarity=0.037 Sum_probs=177.2 Q ss_pred Cc-chhhh----cccC-CCcchhH----HHhh-ccccCcceech---------------hhhhhcHHHHHHHHHHHHhhc Q lcl|NC_019422. 1 MN-IFKSK----KKNK-EAPGKVM----MELI-SDSGNGFYSWH---------------GNLYKSDIVRSIIRPKAKAVG 54 (384) Q Consensus 1 M~-~f~~~----~~~~-~~~~~~~----~~~~-~~~~~~~~~~~---------------~~~~~~~~v~~~i~~ia~~ia 54 (384) |. +|+.+ ...+ .++.++. ...+ ++..+.+.... +....+|.|.+||+.|.+.+. T Consensus 1 m~~lfgf~i~~~~~~~~~S~vpp~~~~~~~~i~~g~~g~~v~~~g~~~~~n~~eLI~~YR~ma~~pEVd~Av~eIVneaI 80 (564) T protein:vir:10 1 MSQLFGFLINEKEGQKGQSPVPPNDEASVSTVAGGYFGTYVDTSGGQNSRNEYELIRRYRDMSLHPEVDSAIDEIVNEFV 80 (564) T ss_pred CcchhcceeeeeccCCCCCcccCCcCCChhhhhccccceeeecccccchhhHHHHHHHHHHHhhccchhhHHHHhhccee Confidence 43 34432 1111 1111110 1111 11111111111 123468999999999998864 Q ss_pred cC-----ceEEEEecCCcceeccchHHHHHHhhcc---ccCCHHHHHHHHHHHHHHhCCeeEEEeeCC----CCceeeEE Q lcl|NC_019422. 55 KM-----TAKHIRSNETEFKTNPEIYIKFLLENPN---PFMSGQILQEKMVTQLELNSNAFAVIIKDD----YNMPTQIY 122 (384) Q Consensus 55 ~~-----~~~~~~~~~~~~~~~~~~~~~~l~~~PN---~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~----~g~~~~l~ 122 (384) -+ |+.|--.+.+-.+... ..++..-+ ..+++..--+.++..|.+.|..|+.++-+. .| ..+|+ T Consensus 81 v~d~~~~pV~vdL~~~~~s~siK----~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~pk~G-I~eLr 155 (564) T protein:vir:10 81 VNDGDDKPVEVDLQNLEIGSGVK----KKIRDEFNRILRMMNFNVNAHEIIRNWYVDGRSHYHKVIDLDNPKKG-ILELR 155 (564) T ss_pred EecCCCceEEEEecccCcchHHH----HHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeeCCChhhh-hhhhh Confidence 33 2222111111111111 11111111 123344445566778889999999987652 24 88999 Q ss_pred EEcCceEEEEEcCC------CE-----------------EEEEEEc--C--------------ceEEEEehhheEEEecc Q lcl|NC_019422. 123 PLNALNVEAIYENE------VL-----------------FLKFLLR--N--------------GKIVSYPYSDIIHLRKD 163 (384) Q Consensus 123 ~l~~~~v~~~~~~~------~~-----------------~~~~~~~--~--------------g~~~~~~~~evih~~~~ 163 (384) .|||..++.++..- +. +|.|... . +..+.++.+-|.|.... T Consensus 156 ~lDPr~i~~vr~i~~~~~~~~~~v~k~~~~~~~y~~~~Eyy~Ynp~~~~g~~~~~~~~~~~~~~~~ikI~~daI~y~hSG 235 (564) T protein:vir:10 156 YIDSLKIRKVRQKLKDVDPNRKEIEKGTALQYDYGDFIEYYIYNPKGFAGNIPMVTGSMDWSNQEGIKIASDAIAQSTSG 235 (564) T ss_pred hhcccceeeeeeeccccccccceeeeeeeeeccccccccceeeccccccCcccccccccccccccceeechhhcceeccc Confidence 99999888765211 10 1112111 0 12245566666665543 Q ss_pred C-CCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHHHHHHHHHHHHhcc---------cc Q lcl|NC_019422. 164 F-NENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDIKKEVKSFEKNYLQ---------ID 232 (384) Q Consensus 164 ~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~~~~~~~~~~~---------~~ 232 (384) - ..+...-+|-+..|.+.+..+.-......=+--..+.-+-++.++ |.+....+++....+-.+|+. .. T Consensus 236 L~d~~~~~i~gyLhkAIKp~NQLkmlEDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev 315 (564) T protein:vir:10 236 LMDLNKKMTLSFLHKAIKSLNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLRDVMSRYRNKLVYDGQTGEI 315 (564) T ss_pred ceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCcee Confidence 2 224445677889999988888877777666655555556677776 445444455555555554442 11 Q ss_pred ccCCcc--------eec---CCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhccc--------cHHHHH---- Q lcl|NC_019422. 233 SEAGGA--------AAT---DSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQSK--------YSEDEW---- 289 (384) Q Consensus 233 ~~~~~~--------~v~---~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~~--------~~e~~~---- 289 (384) .+..+. ++- +.|.+++.|.-...-.++...++..+.+..+++||.+-|... .+|-.+ T Consensus 316 rddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~DV~YF~kKLY~aLnVP~SRl~~e~~~f~~Gr~~EItRDEiK 395 (564) T protein:vir:10 316 RDDKKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELKDVEYFKKKLYNSLNLPPSRLTDDNKAFNLGKSTEILRDELK 395 (564) T ss_pred cccchhhhhHhhhcccccCCCcccceeeccccCCcchHHHHHHHHHHHHHHhCCCcccccCCCceeecccccchhHHHHH Confidence 111111 111 234555556544555566777899999999999999877532 122221 Q ss_pred -HHHHHHHHHHHHHHHHHHHhhcccCc-----cc---ccCcceEEeechh----hhccC-HHHHHHHHHH---HhCCCCC Q lcl|NC_019422. 290 -NAYYESEIEPVGLQLSNQYTEKLFTR-----KA---RSFGNEIVFEASN----LQYAS-MSTKLNLVQM---VDRGSLT 352 (384) Q Consensus 290 -~~~~~~~i~P~~~~i~~~l~~~l~~~-----~~---~~~~~~i~fd~~~----~~~~d-~~~~~~~~~~---~~~g~~t 352 (384) .-|+..-=.-+...+.+.|..+|+.. .+ ......+.|..|. ++... +..|+.+++. +.+-.++ T Consensus 396 F~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S 475 (564) T protein:vir:10 396 FTKFIGRLRKRFAQLFHDILKTQLILKGIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFVGKYFS 475 (564) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 12332221222333344444444332 11 1123334443333 22222 3334444332 2233445 Q ss_pred HHHHHHH-----------------------hCCCCC--CCCCe-eeecCceeecCCCC Q lcl|NC_019422. 353 PNEWRKI-----------------------MNLSPI--ENGDK-PVRRLDTAVVEGGE 384 (384) Q Consensus 353 ~NE~R~~-----------------------lG~~p~--~~gd~-~~~~~n~~~~~~ge 384 (384) .+=+|+. +..+|. +.||. ...+..++|.+.|= T Consensus 476 ~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P~e~~~~~~~~~~~~~~~p~~~~~ 533 (564) T protein:vir:10 476 TEYIRRKILMQTENEFKEIDKQMKSDIESGLAIDPIQVNMLDDMEKQNQAFAPELQAA 533 (564) T ss_pred hHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCchhhhcCCCccCCCCcCCcchhhh Confidence 5444421 112232 12432 22222233332211 No 218 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=95.95 E-value=0.0012 Score=36.59 Aligned_cols=352 Identities=10% Similarity=-0.007 Sum_probs=163.5 Q ss_pred CcchhhhcccCCC--------------cchh---HHHhhccccCc-------------ceechhhhhhcHHHHHHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEA--------------PGKV---MMELISDSGNG-------------FYSWHGNLYKSDIVRSIIRPKA 50 (384) Q Consensus 1 M~~f~~~~~~~~~--------------~~~~---~~~~~~~~~~~-------------~~~~~~~~~~~~~v~~~i~~ia 50 (384) =.++.+....... .-+. ...+..+.... .......=+.++....+++..+ T Consensus 14 ~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~ 93 (474) T protein:vir:96 14 ERVVEQIKPKYETQEEMIIRLINDHKPKIDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKPDWRMFTNYHQNLVDQKV 93 (474) T ss_pred hhHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhccCCcchhccchhcccccccccccchhcccchHHHHHHhhh Confidence 0000000000000 0000 11111110000 0000001123456677888888 Q ss_pred HhhccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEE Q lcl|NC_019422. 51 KAVGKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVE 130 (384) Q Consensus 51 ~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~ 130 (384) .-+-+-|.++- .+++ .....+..++. | ........+..+...+|.+|+.+..+..|++. +..++|..+. T Consensus 94 ~~l~g~p~~~~-~~d~----~~~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~~y~d~~~~~~-i~~~~p~~~~ 162 (474) T protein:vir:96 94 AYAVANPVTFS-SDDD----KSLKTIQEVLN--H---KWDDKLVDILTAASNKGIEWLQPYIDENGEFK-TFRVPAEQAI 162 (474) T ss_pred hhhcccCceee-cCch----HHHHHHHHHHh--c---CHHHHHHHHHHHHHhcCeeEEEEEecCCCceE-EEEEcccceE Confidence 88877777652 1111 11122222332 2 33455666778899999999999998888754 7788999988 Q ss_pred EEEcCCC--E----EEEEEEcCceE-EEEehhheEEEecc-------------------------CCC---------CCc Q lcl|NC_019422. 131 AIYENEV--L----FLKFLLRNGKI-VSYPYSDIIHLRKD-------------------------FNE---------NDL 169 (384) Q Consensus 131 ~~~~~~~--~----~~~~~~~~g~~-~~~~~~evih~~~~-------------------------~~~---------~~~ 169 (384) ++.+.+. . +.+|...+... ..+..+.+.|+... ++. +.. T Consensus 163 ~v~d~~~~~~~~~~vr~~~~~~~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~ 242 (474) T protein:vir:96 163 PIWTNKERDTLKAFIRYYRLDGAERVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQSHYYVGNKRVSWGRVPFIPFKNNP 242 (474) T ss_pred EEEcCCCCCceEEEEEEEeecCceEEEEEeCCeEEEEEecCCceeeccccccccccccccccccccCCCceeEEEeccCC Confidence 8876431 1 11222221111 11222222222110 110 123 Q ss_pred cCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecC-CCceee Q lcl|NC_019422. 170 FGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATD-SKYDAE 248 (384) Q Consensus 170 ~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~-~g~~~~ 248 (384) .|.|.+..+...++....+.....+.+...+.|-.+++- ... ++.+. +..... ..+++.++ .|.+++ T Consensus 243 ~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g-~~~--~~~~~----~~~~~~-----~~~~i~~~~~~~~~~ 310 (474) T protein:vir:96 243 QEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKG-YEG--QDLDE----FMRNLK-----YYKAINVDGDGSGVD 310 (474) T ss_pred CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeec-CCc--ccccc----hhhhhh-----cCceEEecCCCCcee Confidence 588888888888888877777777777777777554432 111 11111 111111 13444443 456666 Q ss_pred ecccchhHHHH-HHHHHHHHHHHHHhCCCHHHh---ccccHHHHH--------------HHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019422. 249 QVKAESYVPNA-AQMDKAIQRLYSFFNTNEKII---QSKYSEDEW--------------NAYYESEIEPVGLQLSNQYTE 310 (384) Q Consensus 249 ~l~~~~~~~~~-~~~~~~~~~I~~~fgvp~~~l---~~~~~e~~~--------------~~~~~~~i~P~~~~i~~~l~~ 310 (384) .+........+ ..++.+.+.|+..-++|..-. +++.+..+. ...+...+..+++.+...+.. T Consensus 311 ~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~ 390 (474) T protein:vir:96 311 TIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKL 390 (474) T ss_pred EEeecCChHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Confidence 66654333333 344566778888888875432 232222221 123344444444444433221 Q ss_pred cccCcccccCcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--C-----CeeeecCceeecCCC Q lcl|NC_019422. 311 KLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN--G-----DKPVRRLDTAVVEGG 383 (384) Q Consensus 311 ~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~--g-----d~~~~~~n~~~~~~g 383 (384) .. ....+.+.+..-...+..+.++.. ...|+++...++++++.-..+. . ++--....+.+++++ T Consensus 391 ----~~---~~~~i~i~f~~~~p~~~~e~~~~~--~~ag~iS~et~~~~~~~v~d~~~E~~ri~~E~~e~~~~~~~~~~~ 461 (474) T protein:vir:96 391 ----NI---KVQDVEITFNFNVMVNELEQSQIG--VQSQYLSKETVVTNHPWVDDPVAELERIEQDNIDFNKQLPPLEGD 461 (474) T ss_pred ----Cc---ccceeeEEeccCCCcCHHHHHHHH--HhcCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcccccccc Confidence 11 112334444444455666665543 4569999999999887643211 0 000011111122111 Q ss_pred C Q lcl|NC_019422. 384 E 384 (384) Q Consensus 384 e 384 (384) + T Consensus 462 ~ 462 (474) T protein:vir:96 462 A 462 (474) T ss_pred c Confidence 1 No 219 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=95.91 E-value=0.0013 Score=36.48 Aligned_cols=377 Identities=10% Similarity=0.032 Sum_probs=177.1 Q ss_pred Cc-----chhhhc---ccCCCcchhH--------HHhhcc--ccCccee--------------chhhhhhcHHHHHHHHH Q lcl|NC_019422. 1 MN-----IFKSKK---KNKEAPGKVM--------MELISD--SGNGFYS--------------WHGNLYKSDIVRSIIRP 48 (384) Q Consensus 1 M~-----~f~~~~---~~~~~~~~~~--------~~~~~~--~~~~~~~--------------~~~~~~~~~~v~~~i~~ 48 (384) +. ..+..+ ....+|.... ...+.. ...+++. .-+....+|.|.+||+. T Consensus 15 ~~~de~~~~~~~~~~~~S~~~p~~~dGa~~I~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~e 94 (524) T protein:vir:10 15 ANEDEKEYKQQINNNLESVTAPKLDDGAREIETQEQNIPYNALMQQMFGSNEPEVKNTRELIDTYRNLMNNYEVDNAVQE 94 (524) T ss_pred hcchhhhhhhhhccCCCccccCCCCCCceeeccCcccccchhhhhhhhhcccchhhhHHHHHHHHHHHhhccchhhHHHH Confidence 11 111111 1111110000 000000 0000010 00234568999999999 Q ss_pred HHHhhccCc-----eEEEEecCCcceeccchHHHHHHhhcc---ccCCHHHHHHHHHHHHHHhCCeeEEEeeCC---CCc Q lcl|NC_019422. 49 KAKAVGKMT-----AKHIRSNETEFKTNPEIYIKFLLENPN---PFMSGQILQEKMVTQLELNSNAFAVIIKDD---YNM 117 (384) Q Consensus 49 ia~~ia~~~-----~~~~~~~~~~~~~~~~~~~~~l~~~PN---~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~---~g~ 117 (384) |.+.+.-+. +.+--.+-+-.+... ..++..-+ ..+++..--+.++..|.+.|..|..++-+. +.- T Consensus 95 IVneaiv~d~~~~pV~l~Ld~~~~s~siK----~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~~~pk~G 170 (524) T protein:vir:10 95 IVSDAIVYEDDKEVVALNLDGTDFSQSIK----DKILAEFSEVLNLLNFQRKGTDHFQRWYVDSRIFFHKIINPKKMKDG 170 (524) T ss_pred hhcceeEecCCCceEEEEecccCcchHHH----HHHHHHHHHHHHHhccchhhhHHHhhheeeceEEEEEEeeCCCcccc Confidence 998875432 222111111111111 11111111 123344445566778889999999987652 224 Q ss_pred eeeEEEEcCceEEEEEcC-----CCE--------EEEEE------------EcCceEEEEehhheEEEeccC-CCCCccC Q lcl|NC_019422. 118 PTQIYPLNALNVEAIYEN-----EVL--------FLKFL------------LRNGKIVSYPYSDIIHLRKDF-NENDLFG 171 (384) Q Consensus 118 ~~~l~~l~~~~v~~~~~~-----~~~--------~~~~~------------~~~g~~~~~~~~evih~~~~~-~~~~~~G 171 (384) ..+|+.|||..++.++.- ++. .+.|. +..++.+.++.+.|.|...+- +.++-.- T Consensus 171 I~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~~~~~~~~~~~~~~ikI~~dAIvy~~SGL~d~~~~~i 250 (524) T protein:vir:10 171 VQELRRLDPRQVQYIREIVTRMEDGVKIVDGYREFFVYDTGHESYCADGRIYSAGTKVKIPRAAVVYAHSGLLDCCGKNI 250 (524) T ss_pred ceeeeeeCCccceeeeeecccCcccchhhcchhhheeecCCCcccccCcceecCCcceecchhheeeeccCcccCCCCce Confidence 889999999999764321 110 11111 112345667888888876553 2344466 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHHHHHHHHHHHHhcc---------ccccCCcc--- Q lcl|NC_019422. 172 TSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDIKKEVKSFEKNYLQ---------IDSEAGGA--- 238 (384) Q Consensus 172 ~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~~~~~~~~~~~---------~~~~~~~~--- 238 (384) +|-+..|.+.+..+.-...+..=+--..+.-+-++.++ |.+....+++....+-..++. ...+..+. T Consensus 251 ~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnlPk~KAeqYl~~im~k~kNKlvYDa~TGev~ddrk~msM 330 (524) T protein:vir:10 251 IGYLQRAIKPANQLKLMEDAMVIYRITRAPDRRVFYIDTGNMPSRKAAAQMQHIMNTMKNRVVYDASTGKIKNQQHNMSM 330 (524) T ss_pred eccchHhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeccCCeeccchhhhhh Confidence 78899999999888887777666655555556677776 445444455555555444432 11122221 Q ss_pred -----eec---CCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhccc---------cHHHHH-----HHHHHHH Q lcl|NC_019422. 239 -----AAT---DSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQSK---------YSEDEW-----NAYYESE 296 (384) Q Consensus 239 -----~v~---~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~~---------~~e~~~-----~~~~~~~ 296 (384) ++- +.|.+++.|.-...-.++...++..+.+..+++||.+-|... .+|-.+ .-|+..- T Consensus 331 lEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~f~~gr~~EItRDEiKF~KFI~rL 410 (524) T protein:vir:10 331 TEDYWLQRRDGKAVTEVDTMPGATGMSDMDDVLYFRTALYRALRIPESRIPSESNSGVMFDAGTAITRDELKFAKWIRQL 410 (524) T ss_pred HhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCchhccCCCCccccccccchhhHHHHHHHHHHHHH Confidence 111 234555555545555567778899999999999999887321 222222 1232221 Q ss_pred HHHHHHHHHHHHhhcccCc-----cc---ccCcceEEeechh----hhccC-HHHHHHHHHH---HhCCCCCHHHHHHH- Q lcl|NC_019422. 297 IEPVGLQLSNQYTEKLFTR-----KA---RSFGNEIVFEASN----LQYAS-MSTKLNLVQM---VDRGSLTPNEWRKI- 359 (384) Q Consensus 297 i~P~~~~i~~~l~~~l~~~-----~~---~~~~~~i~fd~~~----~~~~d-~~~~~~~~~~---~~~g~~t~NE~R~~- 359 (384) =.-+...+.+.|..+|+.+ .+ ......+.|..|. ++... +..|+.+++. +.+-.++.+=+|+. T Consensus 411 R~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~I 490 (524) T protein:vir:10 411 QNKFEEIFLDPLKTNLILKKIITEDEWEREINNIKVTFNRDSYFSEMKDAEIMERRINMLTMAEPFIGKYISHQTAMKDF 490 (524) T ss_pred HHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHH Confidence 1222333444444444332 11 1123334443333 22222 3344444332 23345566666542 Q ss_pred hCCCCCC----------CCCeeeecCceeecCCCC Q lcl|NC_019422. 360 MNLSPIE----------NGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 360 lG~~p~~----------~gd~~~~~~n~~~~~~ge 384 (384) |.+.-.+ +.+..+.+.. -++-| T Consensus 491 Lr~tDeei~~~~k~I~~E~k~~~~~~~---~~~~~ 522 (524) T protein:vir:10 491 LQMTDEEINQEAKQIEEESKEARFQNP---DEEEE 522 (524) T ss_pred hccCHHHHHHHHHHHHHHhhcCCCCCC---Chhhh Confidence 3332110 0111111110 01111 No 220 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=95.88 E-value=0.0013 Score=36.37 Aligned_cols=377 Identities=11% Similarity=0.033 Sum_probs=171.7 Q ss_pred CcchhhhcccCCCcchh------HHHh--------------hccccCccee------chhhhhhcHHHHHHHHHHHHhhc Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKV------MMEL--------------ISDSGNGFYS------WHGNLYKSDIVRSIIRPKAKAVG 54 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~------~~~~--------------~~~~~~~~~~------~~~~~~~~~~v~~~i~~ia~~ia 54 (384) =...+.......+|... .... ..+.-..+.+ .-+....+|.|.+||+.|.+.+. T Consensus 21 ~~~~~~~~~S~~~p~~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneai 100 (523) T protein:vir:68 21 KDQEKENLESITSPKLDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPGLKSTRELIDTYRNLMTNYEVDNAVSEIVSDAI 100 (523) T ss_pred hhhhhccCCCccccCCCCcceeeeccccccccccchhhhhhhhccccccchHHHHHHHHHHHhhccchhhHHHHhhccee Confidence 11111111110110000 0000 0000000100 01234568999999999998886 Q ss_pred cCc-----eEEEEecCCcceeccchHHHHHHhhcc---ccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC---CceeeEEE Q lcl|NC_019422. 55 KMT-----AKHIRSNETEFKTNPEIYIKFLLENPN---PFMSGQILQEKMVTQLELNSNAFAVIIKDDY---NMPTQIYP 123 (384) Q Consensus 55 ~~~-----~~~~~~~~~~~~~~~~~~~~~l~~~PN---~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~---g~~~~l~~ 123 (384) -+. ..+--.+-+-.+ ..-..++..-+ ..+++..--+.++..|.+.|..|+.++-+.. .-..+|+. T Consensus 101 v~d~~~~pV~i~Ld~~~~s~----~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~ 176 (523) T protein:vir:68 101 VYEDDTEVVSINLDNTKFSP----NIKSMMLDEFNEVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKELRR 176 (523) T ss_pred eecCCCceEEEEecccccch----HHHHHHHHHHHHHHHHhccchhhhHHHHhheeeeEEEEEEEeeCCCccccceeeee Confidence 443 222111111111 11111111111 1233444455667788899999999987632 24889999 Q ss_pred EcCceEEEEEc-----CCCE--------EEEEEEc------C------ceEEEEehhheEEEeccC-CCCCccCccHHHH Q lcl|NC_019422. 124 LNALNVEAIYE-----NEVL--------FLKFLLR------N------GKIVSYPYSDIIHLRKDF-NENDLFGTSPAKV 177 (384) Q Consensus 124 l~~~~v~~~~~-----~~~~--------~~~~~~~------~------g~~~~~~~~evih~~~~~-~~~~~~G~s~~~~ 177 (384) |||..|+.++. ..+. .|.|... + |..+.++.+-|.|...+- +.++..-+|-+.. T Consensus 177 lDPr~i~~vr~i~~~~~~g~~vi~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~hSGL~d~~~~~i~gyLhk 256 (523) T protein:vir:68 177 LDPRQVQYVREVITTTEAGVKIVKGYKEYFIYDTSHESYACDGRIYEAGTKIKIPKAAIVYAHSGLVDCCGKNIIGYLHR 256 (523) T ss_pred eCCcceeEEEeecCCCCcchhhhhhhhhheeeccccccccccccccCCCcceecchhheeeeeccceeCCCCceeccchh Confidence 99999977432 1111 1122211 1 233444444443333221 2344455788999 Q ss_pred HHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHHHHHHHHHHHHhcc---------ccccCCcc--------e Q lcl|NC_019422. 178 LEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDIKKEVKSFEKNYLQ---------IDSEAGGA--------A 239 (384) Q Consensus 178 ~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~~~~~~~~~~~---------~~~~~~~~--------~ 239 (384) |.+.+..+.-...+..=+--..+.-+-++.++ |.+....+++....+-..++. ...+..+. + T Consensus 257 AiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNKlvYDa~TGev~ddrk~msMlEDyWL 336 (523) T protein:vir:68 257 AIKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPSRKAAEHMQHVMNTMKNRIAYDATTGKIKNQQHIMSMTEDYWL 336 (523) T ss_pred hhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhhcceeEEeccCCeeccchhhhhhHhhhcc Confidence 99998888877777666655555556677776 445444455555555444432 11122221 1 Q ss_pred ec---CCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhccc--------cHHHHH-----HHHHHHHHHHHHHH Q lcl|NC_019422. 240 AT---DSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQSK--------YSEDEW-----NAYYESEIEPVGLQ 303 (384) Q Consensus 240 v~---~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~~--------~~e~~~-----~~~~~~~i~P~~~~ 303 (384) +- +.|.+++.|.-...-.++...++..+.+..+++||.+-|... .+|-.+ .-|+..-=.-+... T Consensus 337 pRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItRDEikF~KFI~rLR~rFs~l 416 (523) T protein:vir:68 337 QRRDGKAVTEVDTLPGADNTGNMEDVRWFRNALYMALRIPITRIPSDQGGIQFDAGTSITRDELSFGKFIRELQHKFEEI 416 (523) T ss_pred cccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCcceeecCCCcceecccccchhHHHHHHHHHHHHHHHHHHHH Confidence 11 234555555555555667778899999999999999877321 122222 12322211222333 Q ss_pred HHHHHhhcccCc-----cc---ccCcceEEeechh----hhccC-HHHHHHHHHH---HhCCCCCHHHHHHH-hCCCCCC Q lcl|NC_019422. 304 LSNQYTEKLFTR-----KA---RSFGNEIVFEASN----LQYAS-MSTKLNLVQM---VDRGSLTPNEWRKI-MNLSPIE 366 (384) Q Consensus 304 i~~~l~~~l~~~-----~~---~~~~~~i~fd~~~----~~~~d-~~~~~~~~~~---~~~g~~t~NE~R~~-lG~~p~~ 366 (384) +.+.|..+|+.. .+ ......+.|..|. ++... +..|+.+++. +.+-.++.+=+|+. |.+.-.+ T Consensus 417 f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDee 496 (523) T protein:vir:68 417 FLDPLKTNLILKGIITEDEWNDEINNIKIKFHRDSYFSELKDAEILERRINMLQMAEPFIGKYISHRTAMKDILQMSDEE 496 (523) T ss_pred HHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHH Confidence 344444444332 21 1123334443333 22222 3344444332 23345566666542 3332110 Q ss_pred ----------CCCeeeecCceeecCCCC Q lcl|NC_019422. 367 ----------NGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 367 ----------~gd~~~~~~n~~~~~~ge 384 (384) +....+.+.. -++-| T Consensus 497 i~~~~kqI~~E~k~~~~~~p---~~e~~ 521 (523) T protein:vir:68 497 IEQEAKQIEEESKEARFQDP---DQEQE 521 (523) T ss_pred HHHHHHHHHHHhhcCCCCCC---chhhh Confidence 0111111110 01111 No 221 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=95.76 E-value=0.0015 Score=36.06 Aligned_cols=379 Identities=11% Similarity=0.050 Sum_probs=172.3 Q ss_pred Ccchhhhc--ccCCCcchhHHH-------hhccccC-cceech-------------hhhhhcHHHHHHHHHHHHhhccCc Q lcl|NC_019422. 1 MNIFKSKK--KNKEAPGKVMME-------LISDSGN-GFYSWH-------------GNLYKSDIVRSIIRPKAKAVGKMT 57 (384) Q Consensus 1 M~~f~~~~--~~~~~~~~~~~~-------~~~~~~~-~~~~~~-------------~~~~~~~~v~~~i~~ia~~ia~~~ 57 (384) |||.=..+ ......++.... +..+... .++... +....+|.|.+||+.|.+.+.-+. T Consensus 5 fgf~~~~~~~~~~~~~s~~~p~~ddg~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d 84 (558) T protein:vir:10 5 FGFSIEETQKKSTSIISPVPKNNEDGVDNFISSGFYGQYVDIEGAYRSEYDLIRRYREMALHPEADGAIEDVVNEAIVSD 84 (558) T ss_pred hcchhhhhhhhccCCccccCCCccccccceeccceeeeeecccchhhhHHHHHHHHHHHhhccchhhHHHHhhcceeEec Confidence 55533111 111111111111 1111101 111100 123568999999999998886442 Q ss_pred -----eEEEEecCCcceeccchHHHHHHhhcc---ccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC---CceeeEEEEcC Q lcl|NC_019422. 58 -----AKHIRSNETEFKTNPEIYIKFLLENPN---PFMSGQILQEKMVTQLELNSNAFAVIIKDDY---NMPTQIYPLNA 126 (384) Q Consensus 58 -----~~~~~~~~~~~~~~~~~~~~~l~~~PN---~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~---g~~~~l~~l~~ 126 (384) ..+--.+-+..+.. ...++..-+ ..+++..--+.++..|.+.|..|+.++-+.. .-..+|+.||| T Consensus 85 ~~~~pV~i~Ld~~~~s~~i----K~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRiyfHKiid~k~pk~GI~ELr~lDP 160 (558) T protein:vir:10 85 LYDSPVEVELSNLNASNTL----KKKIREEFRYIKEMMDFDKKSHEIFRNWYVDGRVFYLKVIDTKNPQEGIQDLRYIDP 160 (558) T ss_pred CCCceEEEEecccCcchHH----HHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCccccceeeeeeCc Confidence 22211111111111 111222111 1233444455667788999999999977533 24889999999 Q ss_pred ceEEEEEcC----------------CC------EEEEEEEcC--------------ceEEEEehhheEEEeccC--CCCC Q lcl|NC_019422. 127 LNVEAIYEN----------------EV------LFLKFLLRN--------------GKIVSYPYSDIIHLRKDF--NEND 168 (384) Q Consensus 127 ~~v~~~~~~----------------~~------~~~~~~~~~--------------g~~~~~~~~evih~~~~~--~~~~ 168 (384) ..++.++.- .+ ...+|.+.. +..+.++ .+.|++-+.- ..+. T Consensus 161 r~i~~Vr~i~~~~~~~~~~~~~~~~~~~~~~~~~~eyy~Y~~~~~~~~~~~~~~~~~~~vkI~-~dAI~y~hSGL~d~~~ 239 (558) T protein:vir:10 161 LKIKFIRQEKRKPGNQDPAIRVRSEQDVVPNPEFEEFYIYTPKVQHPTGMVGQMGGKNSIKIA-KDSITMCTSGLVDRNK 239 (558) T ss_pred ccceeeeeeccccccccceeeeecccceeeccceeEeeeecCCcccccccceeecCCCceeec-hhheeeecccceecCC Confidence 999765442 11 111121111 1122233 3445543221 1233 Q ss_pred ccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHHHHHHHHHHHHhcc---------ccccCCcc Q lcl|NC_019422. 169 LFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDIKKEVKSFEKNYLQ---------IDSEAGGA 238 (384) Q Consensus 169 ~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~~~~~~~~~~~---------~~~~~~~~ 238 (384) -.-+|-+..|.+.+..+.-......=+--..+.-+-++.++ |.+....+++....+-.+|+. ...+..+. T Consensus 240 ~~i~syLhkAIKp~NQLkmlEDAlVIYRitRAPERRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~ 319 (558) T protein:vir:10 240 NRVLSYLHKAIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLKEVMSRYRNKLVYDANTGEVRDDRKF 319 (558) T ss_pred CeeeecchHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchh Confidence 45568899999988888877777666655555556677776 445444455555555544442 11111111 Q ss_pred --------eec---CCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhcc-------ccHHHHH-----HHHHHH Q lcl|NC_019422. 239 --------AAT---DSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQS-------KYSEDEW-----NAYYES 295 (384) Q Consensus 239 --------~v~---~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~-------~~~e~~~-----~~~~~~ 295 (384) ++- +.|.+++.|.-...-.++...++..+.+..+++||.+-|.. ..+|-.+ .-|+.. T Consensus 320 msMlEDyWLpRReGgrgTEItTLpGgqnLgem~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~R 399 (558) T protein:vir:10 320 MSMMEDFWLPRREGGRGTEITTLPGGQNLGELSDVDYFQKKLYRALGVPESRIAAEGGFNLGRSSEILRDELKFAKFVGR 399 (558) T ss_pred hhhHhhhcccccCCCCccceeeccccCCcchHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHH Confidence 111 23455555554445556677789999999999999987742 1223222 123322 Q ss_pred HHHHHHHHHHHHHhhcccCc-----cc---ccCcceEEeechh----hhccC-HHHHHHHHHH---HhCCCCCHHHHHHH Q lcl|NC_019422. 296 EIEPVGLQLSNQYTEKLFTR-----KA---RSFGNEIVFEASN----LQYAS-MSTKLNLVQM---VDRGSLTPNEWRKI 359 (384) Q Consensus 296 ~i~P~~~~i~~~l~~~l~~~-----~~---~~~~~~i~fd~~~----~~~~d-~~~~~~~~~~---~~~g~~t~NE~R~~ 359 (384) -=.-+...+.+.|..+|+.. .+ ......+.|..|. ++... +..|+.+.+. +.+-.++.+=+|+. T Consensus 400 LR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ 479 (558) T protein:vir:10 400 LRKRFAAMFNDMLKTQLVLKNIVTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYYSTEYVRKR 479 (558) T ss_pred HHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHH Confidence 21222333344444444332 11 1123334443333 22221 3334443322 22335555555432 Q ss_pred -hCC------------------CCC--CCCCeeee-------------cCceeecCC-CC Q lcl|NC_019422. 360 -MNL------------------SPI--ENGDKPVR-------------RLDTAVVEG-GE 384 (384) Q Consensus 360 -lG~------------------~p~--~~gd~~~~-------------~~n~~~~~~-ge 384 (384) |.+ +.. |....++. .....|.++ .+ T Consensus 480 ILr~tDeeI~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 539 (558) T protein:vir:10 480 VLRQTDMEIEEIDTQIEDEIQKGIIPDPSQIDPITGEPLPQEGDPAMEGMGEQPVDPDLE 539 (558) T ss_pred HhccCHHHHHHHHHHHHHHHhCCCCCCccccChhhccccCccCCchhccCCCCCcccccc Confidence 111 111 21111111 111111110 00 No 222 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=95.75 E-value=0.0015 Score=36.04 Aligned_cols=362 Identities=9% Similarity=-0.012 Sum_probs=161.5 Q ss_pred Cc---chhhhcccCCC-----cchhHHHhhccccC------------ccee----chhhhhhcHHHHHHHHHHHHhhccC Q lcl|NC_019422. 1 MN---IFKSKKKNKEA-----PGKVMMELISDSGN------------GFYS----WHGNLYKSDIVRSIIRPKAKAVGKM 56 (384) Q Consensus 1 M~---~f~~~~~~~~~-----~~~~~~~~~~~~~~------------~~~~----~~~~~~~~~~v~~~i~~ia~~ia~~ 56 (384) |. +|......... .-...-.+..+-.. +... ...+=+.+......++..+.-+-+- T Consensus 11 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~k~Ivd~~~~yl~G~ 90 (537) T protein:vir:78 11 DQLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFFTELVDQLAQYLLSN 90 (537) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccchHHHHHHHHhhhhccc Confidence 11 11110000000 00000000000000 0000 0000122345566777777777777 Q ss_pred ceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCC Q lcl|NC_019422. 57 TAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENE 136 (384) Q Consensus 57 ~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~ 136 (384) |.++- ..+++.+ .....|.. -+. .........+..++..+|.+|.++..+..|.+. +..++|..+-++.+.. T Consensus 91 Pv~~~-~~d~~~~----e~~~~l~~-~~~-~~~~~~~~el~~~~s~~G~ay~~~y~de~~~~~-~~~i~p~~~~pv~d~~ 162 (537) T protein:vir:78 91 GVEVK-VKDEDNT----QLDEILQE-YFD-EDFQATIDTLVTNASKKGFEGIFARTTSEGKLK-FQTVDGLTLIPVFDDY 162 (537) T ss_pred Cceee-cCcchhH----HHHHHHHH-Hhh-ccHHHHHHHHHHHHhhcCeeEEEeeecCCCceE-EEEEccceeEEEEcCC Confidence 87752 2222111 12222222 221 233456677788999999999999999888654 6777888877776644 Q ss_pred CEE----EEEE--Ec-----Cce----EEEEehhheEEEeccC------------------------------------- Q lcl|NC_019422. 137 VLF----LKFL--LR-----NGK----IVSYPYSDIIHLRKDF------------------------------------- 164 (384) Q Consensus 137 ~~~----~~~~--~~-----~g~----~~~~~~~evih~~~~~------------------------------------- 164 (384) +.. ..|. .. ++. ...+.++.+.+++... T Consensus 163 ~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 242 (537) T protein:vir:78 163 GVLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAIEESTDADFEDTDG 242 (537) T ss_pred CCceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccceeeecccccccccccccc Confidence 321 1110 00 011 1122334444432110 Q ss_pred ------CC---------CCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhc Q lcl|NC_019422. 165 ------NE---------NDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYL 229 (384) Q Consensus 165 ------~~---------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~ 229 (384) ++ +.-.|+|.+..+...++....+....++.+...+.|-.++. +..+.. ....+..++. T Consensus 243 ~~~~~~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~-g~~~~~--~~~~~~~l~~--- 316 (537) T protein:vir:78 243 YQVLGRSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVK-GFSGDS--TDKLRQNIKA--- 316 (537) T ss_pred ccccccCCcceeEEEeccCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeee-cCCCcc--chhHHHHHhh--- Confidence 00 12358888888888888888877777777776655544432 112211 1122222211 Q ss_pred cccccCCcceecCCCceeeecccchhHH----HHHHHHHHHHHHHHHhCCCHHHhccccHHH-------------HHHHH Q lcl|NC_019422. 230 QIDSEAGGAAATDSKYDAEQVKAESYVP----NAAQMDKAIQRLYSFFNTNEKIIQSKYSED-------------EWNAY 292 (384) Q Consensus 230 ~~~~~~~~~~v~~~g~~~~~l~~~~~~~----~~~~~~~~~~~I~~~fgvp~~~l~~~~~e~-------------~~~~~ 292 (384) .+-+.+-+.+.++..+....... .+..++..+-+++.+...+....|.....+ ..... T Consensus 317 -----~~~i~v~~d~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~~~~~~~gn~SGvAlk~~~~~l~~ka~~ke~~ 391 (537) T protein:vir:78 317 -----KKMIGVNGDNAGMEIQTVSIPYEARKAKMDIDVENIYRSGMGFNSTAVGDGNVTNVVIKSRYTLLAMKARKMETS 391 (537) T ss_pred -----cCceeecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHhcCCCCCccccccCCcHHHHHHHHhhHHHHHHHHHHH Confidence 11122223344444444332222 222233333344444444444333222211 12234 Q ss_pred HHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHH-HHhCCCCCHHHHHHHhCCCCCCC---- Q lcl|NC_019422. 293 YESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQ-MVDRGSLTPNEWRKIMNLSPIEN---- 367 (384) Q Consensus 293 ~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~-~~~~g~~t~NE~R~~lG~~p~~~---- 367 (384) +...+.-+++.|...++.+-.... ....+++.+..-...|..+.++.++ ++..|+++..-+.+.+++-..+. T Consensus 392 f~~~l~~~~~~i~~~~~~~~~~~~---d~~~i~i~f~~~~P~n~~e~a~~~~~l~~~giiS~eT~l~~~p~vdd~e~ek~ 468 (537) T protein:vir:78 392 LRKVLRWCADMVVSDIALRGLGEY---DSNDICFEIEPHVLANELDIATTRKTEAETEALKIGNIMTVAPRIGDDETLKL 468 (537) T ss_pred HHHHHHHHHHHHHHHHhhcCCccc---ccceeeEEeccCCCCCHHHHHHHHHHHHhcCcchHHHHHHhCCCCCCHHHHHH Confidence 555566666666665554321111 1234566666666778888887764 67789999887777765422110 Q ss_pred ----------------CCeeeecC----ceeecCCCC Q lcl|NC_019422. 368 ----------------GDKPVRRL----DTAVVEGGE 384 (384) Q Consensus 368 ----------------gd~~~~~~----n~~~~~~ge 384 (384) .+.-.... ...+..+|+ T Consensus 469 ~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (537) T protein:vir:78 469 IAEELDLDYNELKDALAEQDAQSLDVSPDVQAMLDGL 505 (537) T ss_pred HHHHHHhhhhhhhhhhhhhcccccCcCcchhhhcCCC Confidence 00000000 001111111 No 223 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=95.70 E-value=0.0016 Score=35.92 Aligned_cols=376 Identities=9% Similarity=0.006 Sum_probs=173.7 Q ss_pred CcchhhhcccCCCcchhH-----------HHhhccccCccee-----------ch-------hhhhhcHHHHHHHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVM-----------MELISDSGNGFYS-----------WH-------GNLYKSDIVRSIIRPKAK 51 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~-----------~~-------~~~~~~~~v~~~i~~ia~ 51 (384) --+.+..+....+..++. .........+.++ +. +....+|.|.+||+.|.+ T Consensus 18 ~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVn 97 (524) T protein:vir:72 18 RNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTYRNLMNNYEVDNAVSEIVS 97 (524) T ss_pred hhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHHHHHhhccchhhHHHHhhc Confidence 233332221111111110 0001001111110 00 123568999999999998 Q ss_pred hhccCc-----eEEEEecCCcceeccchH---HHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC---Cceee Q lcl|NC_019422. 52 AVGKMT-----AKHIRSNETEFKTNPEIY---IKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDY---NMPTQ 120 (384) Q Consensus 52 ~ia~~~-----~~~~~~~~~~~~~~~~~~---~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~---g~~~~ 120 (384) .+.-+. +.+--.+-+..+..+... +..++ ..+++..--+.++..|.+.|..|+.++-+.. .-..+ T Consensus 98 eaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il----~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~E 173 (524) T protein:vir:72 98 DAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFSDVL----NHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKE 173 (524) T ss_pred ceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHH----HHhccchhhhHHHhhheeeeEEEEEEEEeCCCcccccee Confidence 876432 222111111111111111 11111 1233444455667788899999999977533 24889 Q ss_pred EEEEcCceEEEEEcC-----CCE--------EEEEEEc------C------ceEEEEehhheEEEeccC--CCCCccCcc Q lcl|NC_019422. 121 IYPLNALNVEAIYEN-----EVL--------FLKFLLR------N------GKIVSYPYSDIIHLRKDF--NENDLFGTS 173 (384) Q Consensus 121 l~~l~~~~v~~~~~~-----~~~--------~~~~~~~------~------g~~~~~~~~evih~~~~~--~~~~~~G~s 173 (384) |+.|||..++.++.- ++. .|.|... + +..+.++. +.|++-+.. +.++..-+| T Consensus 174 lr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~-dAI~y~hSGL~d~~~~~i~g 252 (524) T protein:vir:72 174 LRRLDPRQVQYVREIITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPK-AAVVYAHSGLVDCCGKNIIG 252 (524) T ss_pred eeeeCCccceeeeeeccCCCccchhhcchhhheeeccCccccccCccccCCCcceecch-hheeeeeccceeCCCCceec Confidence 999999999764321 111 1122211 1 22334444 444443322 233445578 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHHHHHHHHHHHHhcc---------ccccCCcc----- Q lcl|NC_019422. 174 PAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDIKKEVKSFEKNYLQ---------IDSEAGGA----- 238 (384) Q Consensus 174 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~~~~~~~~~~~---------~~~~~~~~----- 238 (384) -+..|.+.+..+.-...+..=+--..+.-+-++.++ |.+....+++....+-.+|+. ...+..+. T Consensus 253 yLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYDa~TGev~ddrk~msMlE 332 (524) T protein:vir:72 253 YLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYDASTGKIKNQQHNMSMTE 332 (524) T ss_pred cchhhhHhHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHh Confidence 899999988888877777666655555556677776 445544455555555554442 11111121 Q ss_pred ---eec---CCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhcc---------ccHHHHH-----HHHHHHHHH Q lcl|NC_019422. 239 ---AAT---DSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQS---------KYSEDEW-----NAYYESEIE 298 (384) Q Consensus 239 ---~v~---~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~---------~~~e~~~-----~~~~~~~i~ 298 (384) ++- +.|.+++.|.-...-.++...++..+.+..+++||.+-|.. ..+|-.+ .-|+..-=. T Consensus 333 DyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~rLR~ 412 (524) T protein:vir:72 333 DYWLQRRDGKAVTEVDTLPGADNTGNMEDIRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIRELQH 412 (524) T ss_pred hhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHHHHHH Confidence 111 23455555555555566777889999999999999987721 1222222 123222112 Q ss_pred HHHHHHHHHHhhcccCc-----cc---ccCcceEEeechh----hhccC-HHHHHHHHHH---HhCCCCCHHHHHHH-hC Q lcl|NC_019422. 299 PVGLQLSNQYTEKLFTR-----KA---RSFGNEIVFEASN----LQYAS-MSTKLNLVQM---VDRGSLTPNEWRKI-MN 361 (384) Q Consensus 299 P~~~~i~~~l~~~l~~~-----~~---~~~~~~i~fd~~~----~~~~d-~~~~~~~~~~---~~~g~~t~NE~R~~-lG 361 (384) -+...+.+.|..+|+.. .+ ......+.|..|. ++... +..|+.+++. +.+-.++.+=+|+. |. T Consensus 413 rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr 492 (524) T protein:vir:72 413 KFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFAELKEAEILERRINMLTMAEPFIGKYISHRTAMKDILQ 492 (524) T ss_pred HHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhc Confidence 22333344444444332 11 1123334443333 22222 3344444332 23345566666542 33 Q ss_pred CCCCC----------CCCeeeecCceeecCCCC Q lcl|NC_019422. 362 LSPIE----------NGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 362 ~~p~~----------~gd~~~~~~n~~~~~~ge 384 (384) +.-.+ +....+.+.. -++-| T Consensus 493 ~tDeei~~~~k~I~~E~k~~~~~~~---~~~~~ 522 (524) T protein:vir:72 493 MTDEEIEQEAKQIEEESKEARFQDP---DQEQE 522 (524) T ss_pred cCHHHHHHHHHHHHHHhhcCCCCCC---chhhh Confidence 32110 0111111110 01111 No 224 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=95.62 E-value=0.0017 Score=35.72 Aligned_cols=376 Identities=10% Similarity=0.012 Sum_probs=173.9 Q ss_pred CcchhhhcccCCCcchhH-----------HHhhccccCccee-----------ch-------hhhhhcHHHHHHHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVM-----------MELISDSGNGFYS-----------WH-------GNLYKSDIVRSIIRPKAK 51 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~-----------~~-------~~~~~~~~v~~~i~~ia~ 51 (384) --+.+..+....+..++. .........+.++ +. +....+|.|.+||+.|.+ T Consensus 18 ~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVn 97 (524) T protein:vir:10 18 RNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTYRNLMNNYEVDNAVSEIVS 97 (524) T ss_pred hhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHHHHHhhccchhhHHHHhhc Confidence 233332221111111110 0001001111110 00 123568999999999998 Q ss_pred hhccCc-----eEEEEecCCcceeccchHHHHHHhhcc---ccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC---Cceee Q lcl|NC_019422. 52 AVGKMT-----AKHIRSNETEFKTNPEIYIKFLLENPN---PFMSGQILQEKMVTQLELNSNAFAVIIKDDY---NMPTQ 120 (384) Q Consensus 52 ~ia~~~-----~~~~~~~~~~~~~~~~~~~~~l~~~PN---~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~---g~~~~ 120 (384) .+.-+. +.+--.+-+..+..+. .++..-+ ..+++..--+.++..|.+.|..|+.++-+.. .-..+ T Consensus 98 eaiv~d~~~~pV~l~L~~~~~s~~iK~----kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~E 173 (524) T protein:vir:10 98 DAIVYEDDTEVVALNLDKSKFSPKIKN----MMLDEFNDVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKE 173 (524) T ss_pred ceeEecCCCceEEEEecCcCcchHHHH----HHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEeeCCCcccccee Confidence 876432 2221111111111111 1111111 1233444455667788899999999987632 24889 Q ss_pred EEEEcCceEEEEEcC-----CCE--------EEEEEEc------C------ceEEEEehhheEEEeccC--CCCCccCcc Q lcl|NC_019422. 121 IYPLNALNVEAIYEN-----EVL--------FLKFLLR------N------GKIVSYPYSDIIHLRKDF--NENDLFGTS 173 (384) Q Consensus 121 l~~l~~~~v~~~~~~-----~~~--------~~~~~~~------~------g~~~~~~~~evih~~~~~--~~~~~~G~s 173 (384) |+.|||..++.++.- ++. .|.|... + +..+.++. +.|++-+.. +.++..-+| T Consensus 174 lr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~-dAI~y~hSGL~d~~~~~i~g 252 (524) T protein:vir:10 174 LRRLDPRQVQYVREIITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPK-AAIVYAHSGLVDCCGKNIIG 252 (524) T ss_pred eeeeCCccceeeeeeccCCCccchhhcchhhheeeccCccccccCccccCCCcceecch-hheeeeeccceeCCCCceec Confidence 999999999764321 111 1122211 1 22334444 444443322 233445578 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHHHHHHHHHHHHhcc---------ccccCCcc----- Q lcl|NC_019422. 174 PAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDIKKEVKSFEKNYLQ---------IDSEAGGA----- 238 (384) Q Consensus 174 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~~~~~~~~~~~---------~~~~~~~~----- 238 (384) -+..|.+.+..+.-...+..=+--..+.-+-++.++ |.+....+++....+-.+|+. ...+..+. T Consensus 253 yLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYDa~TGev~ddrk~msMlE 332 (524) T protein:vir:10 253 YLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYDASTGKIKNQQHNMSMTE 332 (524) T ss_pred cchhhhHHHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHh Confidence 899999988888877777666655555556677776 445544455555555554442 11111121 Q ss_pred ---eec---CCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhcc---------ccHHHHH-----HHHHHHHHH Q lcl|NC_019422. 239 ---AAT---DSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQS---------KYSEDEW-----NAYYESEIE 298 (384) Q Consensus 239 ---~v~---~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~---------~~~e~~~-----~~~~~~~i~ 298 (384) ++- +.|.+++.|.-...-.++...++..+.+..+++||.+-|.. ..+|-.+ .-|+..-=. T Consensus 333 DyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~rLR~ 412 (524) T protein:vir:10 333 DYWLQRRDGKAVTEVDTLPGADNTGNMEDVRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIRELQH 412 (524) T ss_pred hhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHHHHHH Confidence 111 23455555555555566777889999999999999987721 1222222 123222112 Q ss_pred HHHHHHHHHHhhcccCc-----cc---ccCcceEEeechh----hhccC-HHHHHHHHHH---HhCCCCCHHHHHHH-hC Q lcl|NC_019422. 299 PVGLQLSNQYTEKLFTR-----KA---RSFGNEIVFEASN----LQYAS-MSTKLNLVQM---VDRGSLTPNEWRKI-MN 361 (384) Q Consensus 299 P~~~~i~~~l~~~l~~~-----~~---~~~~~~i~fd~~~----~~~~d-~~~~~~~~~~---~~~g~~t~NE~R~~-lG 361 (384) -+...+.+.|..+|+.. .+ ......+.|..|. ++... +..|+.+++. +.+-.++.+=+|+. |. T Consensus 413 rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr 492 (524) T protein:vir:10 413 KFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFTELKEAEILERRINMLTMAEPFIGKYISHRTAMKDILQ 492 (524) T ss_pred HHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhc Confidence 22333344444444332 11 1123334443333 22222 3344444332 23345566666542 33 Q ss_pred CCCCC----------CCCeeeecCceeecCCCC Q lcl|NC_019422. 362 LSPIE----------NGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 362 ~~p~~----------~gd~~~~~~n~~~~~~ge 384 (384) +.-.+ +....+.+.. -++-| T Consensus 493 ~tDeei~~~~k~I~~E~k~~~~~~~---~~~~~ 522 (524) T protein:vir:10 493 MTDEEIEQEAKQIEEESKEARFQDP---DQEQE 522 (524) T ss_pred cCHHHHHHHHHHHHHHhhcCCCCCC---chhhh Confidence 32110 0111111110 01111 No 225 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=95.51 E-value=0.0019 Score=35.46 Aligned_cols=377 Identities=12% Similarity=0.087 Sum_probs=174.5 Q ss_pred Ccchhh----hcccCCCcc----hhHHH----hhccccCc----ceec-------------hhhhhhcHHHHHHHHHHHH Q lcl|NC_019422. 1 MNIFKS----KKKNKEAPG----KVMME----LISDSGNG----FYSW-------------HGNLYKSDIVRSIIRPKAK 51 (384) Q Consensus 1 M~~f~~----~~~~~~~~~----~~~~~----~~~~~~~~----~~~~-------------~~~~~~~~~v~~~i~~ia~ 51 (384) =.-+.. +.....+|. ....+ ......++ +... -+....+|.|.+||+.|.+ T Consensus 16 ~~~~~e~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~~~g~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVn 95 (521) T protein:vir:65 16 NDKYEEQIKDKAESIAAPKNNDGATEVEINDNSPASSWNSLTQQFYSTDQKISTTKQLVNTYRGLMNNHEVENAVQNIVN 95 (521) T ss_pred hhHHHhhhccCCCcccCCCCCCCceeecccCCccccccccceeeeccccchhhhHHHHHHHHHHHhhccchhhHHHHhhc Confidence 111110 100000000 00000 00000000 0000 0234568999999999998 Q ss_pred hhccCc-----eEEEEecCCcceeccchH---HHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC--CceeeE Q lcl|NC_019422. 52 AVGKMT-----AKHIRSNETEFKTNPEIY---IKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDY--NMPTQI 121 (384) Q Consensus 52 ~ia~~~-----~~~~~~~~~~~~~~~~~~---~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~--g~~~~l 121 (384) .+.-+. +.+--.+-+-.+...... +..++ ..+++..--+.++..|.+.|..|+.++-+++ .-..+| T Consensus 96 eaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il----~ll~F~~~~~~~fR~WYVDgRi~fhkiid~~pk~GI~EL 171 (521) T protein:vir:65 96 DAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLL----NTIQFDRRGQDMFRRWYVDSRIFFHKIIGKNPKDGIVEL 171 (521) T ss_pred ceeEecCCCceEEEEecccccchHHHHHHHHHHHHHH----HHhccchhhhHHHhhhhhcceeEEEEEEcCCccccceee Confidence 886432 222111111111111111 11111 1233444455667788899999999995533 348899 Q ss_pred EEEcCceEEEEEcCC-----C--------EEEEEEEcC------------ceEEEEehhheEEEeccC--CCCCccCccH Q lcl|NC_019422. 122 YPLNALNVEAIYENE-----V--------LFLKFLLRN------------GKIVSYPYSDIIHLRKDF--NENDLFGTSP 174 (384) Q Consensus 122 ~~l~~~~v~~~~~~~-----~--------~~~~~~~~~------------g~~~~~~~~evih~~~~~--~~~~~~G~s~ 174 (384) +.|||..++.++... + ..+.|...+ +..+.++ .+.|++-+.. +.++..-+|- T Consensus 172 r~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~-~dAI~y~hSGl~d~~~~~i~sy 250 (521) T protein:vir:65 172 RQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIP-RSAITYAHSGLMDCDDKYIIGY 250 (521) T ss_pred eeeCCcceeeeeeecccccCCcceecceeeeeeeecCCcceeccceeecCCcceeec-hhheeeeeccceeCCCCeeeec Confidence 999999998765321 1 012222111 1223333 3445553322 2334455688 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHHHHHHHHHHHHhcc---------ccccCCcce----- Q lcl|NC_019422. 175 AKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDIKKEVKSFEKNYLQ---------IDSEAGGAA----- 239 (384) Q Consensus 175 ~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~~~~~~~~~~~---------~~~~~~~~~----- 239 (384) +..|.+.+..+.-...+..=+--..+.-+-++.++ |.+....+++....+-.+|+. ...+..+.+ T Consensus 251 LhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlED 330 (521) T protein:vir:65 251 LHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLSMTED 330 (521) T ss_pred chhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccccccccchhhh Confidence 99999999888887777666655555556677776 445544455555555554543 111222211 Q ss_pred ---ec---CCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhc---------cccHHHHH-----HHHHHHHHHH Q lcl|NC_019422. 240 ---AT---DSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQ---------SKYSEDEW-----NAYYESEIEP 299 (384) Q Consensus 240 ---v~---~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~---------~~~~e~~~-----~~~~~~~i~P 299 (384) +- +.|.+++.|.--..-.++...++..+.+..+++||.+-+. |..+|-.+ .-|+..-=.- T Consensus 331 yWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr~~EItRDEiKF~KFI~rLR~r 410 (521) T protein:vir:65 331 YWLQRRDGKAITDVTTLPGASGMSDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIRTLQSQ 410 (521) T ss_pred hcccccCCCCccceeecccCCCcChHHHHHHHHHHHHHHhCCCceeccCCCCcceeccccchhhHHHHHHHHHHHHHHHH Confidence 11 2345555555445555667778999999999999998753 11223222 1233222222 Q ss_pred HHHHHHHHHhhcccCccc--------ccCcceEEeechh----hhccC-HHHHHHHHHH---HhCCCCCHHHHHHH-hCC Q lcl|NC_019422. 300 VGLQLSNQYTEKLFTRKA--------RSFGNEIVFEASN----LQYAS-MSTKLNLVQM---VDRGSLTPNEWRKI-MNL 362 (384) Q Consensus 300 ~~~~i~~~l~~~l~~~~~--------~~~~~~i~fd~~~----~~~~d-~~~~~~~~~~---~~~g~~t~NE~R~~-lG~ 362 (384) +...+.+.|..+|+.+.- ......+.|..|. ++... +..|+.+++. +.+-.++.+=+|+. |.+ T Consensus 411 Fs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~S~dyi~k~ILr~ 490 (521) T protein:vir:65 411 FSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDILKY 490 (521) T ss_pred HHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcc Confidence 333344444444443311 1122344443333 22222 3344444432 33346677766643 333 Q ss_pred CCCC----------CCCeeeecCceeecCCCC Q lcl|NC_019422. 363 SPIE----------NGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 363 ~p~~----------~gd~~~~~~n~~~~~~ge 384 (384) .-.+ +....+.+..-. +.++ T Consensus 491 tDeei~~~~k~I~~E~~~~~~~~p~~--~~~~ 520 (521) T protein:vir:65 491 TDDQMDTEKKQIEEEANDPRFKQTPD--EIED 520 (521) T ss_pred CHHHHHHHHHHHHHhhhCCCCCCCcc--cccC Confidence 2210 111111111100 1111 No 226 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=95.45 E-value=0.002 Score=35.33 Aligned_cols=352 Identities=9% Similarity=0.010 Sum_probs=157.2 Q ss_pred CcchhhhcccCCCcchhHHHhhcccc-------------CcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCc Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDSG-------------NGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETE 67 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 67 (384) ..+.+...... ..-.....+..+-. .........-+.++....+++..+.-+-+-|+++- .+++ T Consensus 32 ~~~i~~~~~~~-~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~-~~~d- 108 (478) T protein:vir:10 32 LRLVREHKENI-DNITMGERYYNHHPDILDAPPKRDVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVANPVTFG-VDND- 108 (478) T ss_pred HHHHHHHHHHH-HHHHHHHHHhcCCCchhccccccccccccccccccceeccchHHHHHHHHHhhhccCCeeee-cCCh- Confidence 00000000000 00000000000000 00000000112345667888888888877777752 1111 Q ss_pred ceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC------EEEE Q lcl|NC_019422. 68 FKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV------LFLK 141 (384) Q Consensus 68 ~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~------~~~~ 141 (384) + .......++. | ...+....++.+.+.+|.+|+.+..+..|.+ .+..++|..+.++.+.+. ...+ T Consensus 109 -~--~~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~~~~d~~g~~-~~~~~~p~~~~~i~d~~~~~~~~~~v~~ 179 (478) T protein:vir:10 109 -K--ALKQIQHTLN--H---KWDDKLVDILTAASNKGIEWVQPYVDEEGEF-KTFRVPAEQAVPIWTNKERDELQAFIRV 179 (478) T ss_pred -H--HHHHHHHHHh--c---CHHHHHHHHHHHHHhcCeEEEEEEecCCCee-EEEEEcccceEEEEcCCCCCceEEEEEE Confidence 1 1111222222 2 3556777788999999999999999888865 467788888888765431 1112 Q ss_pred EEEcCceEE-EEehhheEEEecc-------------------------CCC---------CCccCccHHHHHHHHHHHHH Q lcl|NC_019422. 142 FLLRNGKIV-SYPYSDIIHLRKD-------------------------FNE---------NDLFGTSPAKVLEPIMEVVN 186 (384) Q Consensus 142 ~~~~~g~~~-~~~~~evih~~~~-------------------------~~~---------~~~~G~s~~~~~~~~i~~~~ 186 (384) |...+...+ .+.++.+.+++.. ++. +...|.|.+..+...++... T Consensus 180 ~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~sd~~~v~~liDa~~ 259 (478) T protein:vir:10 180 YELDGAERVEYWTKDDVTYYELKEGQLIPDFYRSDDHIQPHYYQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALD 259 (478) T ss_pred EEecCceEEEEEeCCeEEEEEEcCCeeeccccccccccccceecccccccCCccceEEeccCCCCCCcHHHHHHHHHHHH Confidence 222211111 1122222222110 111 22468888888888887777 Q ss_pred HHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceec--CCCceeeecccchhHHHH-HHHH Q lcl|NC_019422. 187 TTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAAT--DSKYDAEQVKAESYVPNA-AQMD 263 (384) Q Consensus 187 ~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~--~~g~~~~~l~~~~~~~~~-~~~~ 263 (384) .......+.++..+.|..+++ +....+ .+........ .+++.+ ++|.++..+........+ ..++ T Consensus 260 ~~~S~~~~~~~~~~~p~~~~~-g~~~~~--~~~~~~~~~~---------~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 327 (478) T protein:vir:10 260 KRLSDTQNTFDESVELIYILK-GYEGED--MKDFMHNLKY---------YKAISVAGESGSGVDTIKVEVPIDSVKEYTK 327 (478) T ss_pred HHHHHHHHHHHHhhCceeeee-cCCccc--cchhhhhhhh---------cceEEecCCCCCcceEEeecCChHHHHHHHH Confidence 766666666666666754442 222211 1111111110 122222 234444444433333333 3345 Q ss_pred HHHHHHHHHhCCCHH---HhccccHHHHH--------------HHHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEe Q lcl|NC_019422. 264 KAIQRLYSFFNTNEK---IIQSKYSEDEW--------------NAYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVF 326 (384) Q Consensus 264 ~~~~~I~~~fgvp~~---~l~~~~~e~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~f 326 (384) .+.+.|+..-++|.. -++++.+..+. ...+...+.-+++.+...+.. .. ....+++ T Consensus 328 ~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~g~----~~---~~~~i~i 400 (478) T protein:vir:10 328 MLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRL----DV---KVQDIEI 400 (478) T ss_pred HHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----Cc---ccccceE Confidence 555666666666642 23333232221 123334444444444333211 11 1123344 Q ss_pred echhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC---------CC---eeeecCceeecCCCC Q lcl|NC_019422. 327 EASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN---------GD---KPVRRLDTAVVEGGE 384 (384) Q Consensus 327 d~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~---------gd---~~~~~~n~~~~~~ge 384 (384) .+..-...|..+.++.+... .|+++...+++++++-..+. -+ .......-...++.+ T Consensus 401 ~f~~~~p~d~~e~a~~~~kl-~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 469 (478) T protein:vir:10 401 TFNFNVMVNELENSQIAMNS-TGLLSKETILSNHAWVEDPVAEMERIEQENIELNQQLPDIEEGLNGEQQ 469 (478) T ss_pred EecCCCCCCHHHHHHHHHHH-hCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccccccCCCCC Confidence 44555556777777765433 68899999999998743211 01 111111111111111 No 227 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=95.20 E-value=0.0025 Score=34.81 Aligned_cols=377 Identities=10% Similarity=0.057 Sum_probs=175.0 Q ss_pred CcchhhhcccCCCcchhH----H-----------HhhccccCcc----ee------chhhhhhcHHHHHHHHHHHHhhcc Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVM----M-----------ELISDSGNGF----YS------WHGNLYKSDIVRSIIRPKAKAVGK 55 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~----~-----------~~~~~~~~~~----~~------~~~~~~~~~~v~~~i~~ia~~ia~ 55 (384) =.-.+.+.....+|.... . ..+..+..+. .+ .-+....+|.|.+||+.|.+.+.- T Consensus 20 ~~~~~~~~~s~~~P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv 99 (521) T protein:vir:81 20 EEQIKDKAESIAAPKNNDGATEVEINDNLPASAWNSLTQQFYSTDQKISTTKQLVNTYRGLMNNHEVENAVQNIVNDAIV 99 (521) T ss_pred HhhhccCccccccCCCCCCceEecccCCCcceeecceeeeecccccchhhHHHHHHHHHHHhhccchhhHHHHhhcceeE Confidence 011111111111111000 0 0000000000 00 012345689999999999988864 Q ss_pred Cc-----eEEEEecCCcceeccchH---HHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC--CceeeEEEEc Q lcl|NC_019422. 56 MT-----AKHIRSNETEFKTNPEIY---IKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDY--NMPTQIYPLN 125 (384) Q Consensus 56 ~~-----~~~~~~~~~~~~~~~~~~---~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~--g~~~~l~~l~ 125 (384) +. +.+--.+-+-.+...... +..++ ..+++..--+.++..|.+.|..|+.++-+++ .-..+|+.|| T Consensus 100 ~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il----~ll~F~~~~~~~fR~WYVDgRi~fhkiid~~pk~GI~Elr~lD 175 (521) T protein:vir:81 100 FEEGHEVVSLNLEATGFSESVKERIHEEFKDLL----NTIQFDRRGQDMFRRWYVDSRIFFHKIIGKNPKDGIVELRQLD 175 (521) T ss_pred ecCCCceEEEEecccccchHHHHHHHHHHHHHH----HHhccchhhhHHHhhhhhcceEEEEEEEcCCccccceeeeeeC Confidence 42 222111111111111111 11111 1233444455667788899999999995533 3488999999 Q ss_pred CceEEEEEcCC-----C--------EEEEEEEc------------CceEEEEehhheEEEeccC--CCCCccCccHHHHH Q lcl|NC_019422. 126 ALNVEAIYENE-----V--------LFLKFLLR------------NGKIVSYPYSDIIHLRKDF--NENDLFGTSPAKVL 178 (384) Q Consensus 126 ~~~v~~~~~~~-----~--------~~~~~~~~------------~g~~~~~~~~evih~~~~~--~~~~~~G~s~~~~~ 178 (384) |..++.++... + ..+.|... .+..+.++ .+.|++-+.. +.++..-+|-+..| T Consensus 176 Pr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~-~dAI~y~hSGl~d~~~~~i~syLhkA 254 (521) T protein:vir:81 176 PRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIP-RSAITYAHSGLMDCDDKYIIGYLHRA 254 (521) T ss_pred CcceeeeeeecccccCccceecceeeeeeeecCCccccccceeecCCcceeec-hhheeeeeccceeCCCCeeeecchhh Confidence 99998765321 1 01222221 11223333 4445553322 23344456889999 Q ss_pred HHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHHHHHHHHHHHHhcc---------ccccCCcce--------e Q lcl|NC_019422. 179 EPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDIKKEVKSFEKNYLQ---------IDSEAGGAA--------A 240 (384) Q Consensus 179 ~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~~~~~~~~~~~---------~~~~~~~~~--------v 240 (384) .+.+..+.-...+..=+--..+.-+-++.++ |.+....+++....+-.+|+. ...+..+.+ + T Consensus 255 iKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLp 334 (521) T protein:vir:81 255 VKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLSMTEDYWLQ 334 (521) T ss_pred hHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccccccccchhhhhccc Confidence 9999888887777666655555556677776 445544455555555554543 111222211 1 Q ss_pred c---CCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhcc---------ccHHHHH-----HHHHHHHHHHHHHH Q lcl|NC_019422. 241 T---DSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQS---------KYSEDEW-----NAYYESEIEPVGLQ 303 (384) Q Consensus 241 ~---~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~---------~~~e~~~-----~~~~~~~i~P~~~~ 303 (384) - +.|.+++.|.--..-.++...++..+.+..+++||.+-|+. ..+|-.+ .-|+..-=.-+... T Consensus 335 RReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~l 414 (521) T protein:vir:81 335 RRDGKAITDVTTLPGASGMSDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIRTRQSQFSEV 414 (521) T ss_pred ccCCCcccceeecccCCCCChHHHHHHHHHHHHHHhCCccccccCCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHH Confidence 1 23455555554455556677789999999999999988731 1222222 12332222223333 Q ss_pred HHHHHhhcccCccc--------ccCcceEEeechh----hhccC-HHHHHHHHHH---HhCCCCCHHHHHHH-hCCCCCC Q lcl|NC_019422. 304 LSNQYTEKLFTRKA--------RSFGNEIVFEASN----LQYAS-MSTKLNLVQM---VDRGSLTPNEWRKI-MNLSPIE 366 (384) Q Consensus 304 i~~~l~~~l~~~~~--------~~~~~~i~fd~~~----~~~~d-~~~~~~~~~~---~~~g~~t~NE~R~~-lG~~p~~ 366 (384) +.+.|..+|+.+.- ......+.|..|. ++... +..|+.+++. +.+-.++.+=+|+. |.+.-.+ T Consensus 415 f~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~ILr~tDee 494 (521) T protein:vir:81 415 LRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDILKYTDDQ 494 (521) T ss_pred HHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHH Confidence 44444444444311 1122344443333 22222 3344444332 23345666666543 3332210 Q ss_pred ----------CCCeeeecCceeecCCCC Q lcl|NC_019422. 367 ----------NGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 367 ----------~gd~~~~~~n~~~~~~ge 384 (384) +....+.+..-. +.++ T Consensus 495 i~~~~k~I~~E~~~~~~~~p~~--~~~~ 520 (521) T protein:vir:81 495 MDTEKKQIEEEANDPRFKQTPD--EIED 520 (521) T ss_pred HHHHHHHHHHHhhCCCCCCCcc--cccC Confidence 111122111100 0111 No 228 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=95.08 E-value=0.0028 Score=34.57 Aligned_cols=376 Identities=7% Similarity=0.011 Sum_probs=176.9 Q ss_pred Ccc----hhhhcc-cCCCcchhH-------H--HhhccccCcce-----------e------chhhhhhcHHHHHHHHHH Q lcl|NC_019422. 1 MNI----FKSKKK-NKEAPGKVM-------M--ELISDSGNGFY-----------S------WHGNLYKSDIVRSIIRPK 49 (384) Q Consensus 1 M~~----f~~~~~-~~~~~~~~~-------~--~~~~~~~~~~~-----------~------~~~~~~~~~~v~~~i~~i 49 (384) +.- ...+.+ ...++.++. . ...+...++++ + .-+....+|.|.+||+.| T Consensus 11 ~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~Av~eI 90 (516) T protein:vir:10 11 DRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINNPEVERAVANI 90 (516) T ss_pred cchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhccchhhHHHHh Confidence 211 111100 000000000 0 00000001110 0 012345689999999999 Q ss_pred HHhhccCc-----eEEEEecCCcceeccchHHHHHHhhcc---ccCCHHHHHHHHHHHHHHhCCeeEEEeeC-CCCceee Q lcl|NC_019422. 50 AKAVGKMT-----AKHIRSNETEFKTNPEIYIKFLLENPN---PFMSGQILQEKMVTQLELNSNAFAVIIKD-DYNMPTQ 120 (384) Q Consensus 50 a~~ia~~~-----~~~~~~~~~~~~~~~~~~~~~l~~~PN---~~~s~~~f~~~~~~~~l~~G~~~~~~~~~-~~g~~~~ 120 (384) .+.+.-+. +.+--.+-+-.+... ..++..-+ ..+++..--+.++..|.+.|..|+.++-+ ++.-..+ T Consensus 91 Vneaiv~d~~~~pV~l~L~~~~~s~~ik----~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~~k~GI~E 166 (516) T protein:vir:10 91 VNEAIVYERGHKVVSLDLDDTDFGSNVK----EKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMPNPKKGIAE 166 (516) T ss_pred hcceeEecCCCceEEEEecccCcchHHH----HHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCcccccee Confidence 98876432 222111111111111 11111111 12333444555677788999999997654 3444889 Q ss_pred EEEEcCceEEEEEcC-----CC--------EEEEEEEc------Cc------eEEEEehhheEEEecc-C-CCCCccCcc Q lcl|NC_019422. 121 IYPLNALNVEAIYEN-----EV--------LFLKFLLR------NG------KIVSYPYSDIIHLRKD-F-NENDLFGTS 173 (384) Q Consensus 121 l~~l~~~~v~~~~~~-----~~--------~~~~~~~~------~g------~~~~~~~~evih~~~~-~-~~~~~~G~s 173 (384) |+.|||..++.++.- ++ .++.|... +| ..+.++. +.|++-+. - ..+...-+| T Consensus 167 lr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~-dAI~y~hSGL~d~~~~~i~s 245 (516) T protein:vir:10 167 LRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPR-SAVVYASSGLMDCSDRGIIG 245 (516) T ss_pred eeeeCCcceeeEeeecccccccchhhhhhhheeeeccCccccccccceeCCCcceeech-hheeeecccceeCCCCceee Confidence 999999999876542 11 11222211 12 2334444 44444332 1 223334478 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHHHHHHHHHHHHhcc---------ccccCCcc----- Q lcl|NC_019422. 174 PAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDIKKEVKSFEKNYLQ---------IDSEAGGA----- 238 (384) Q Consensus 174 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~~~~~~~~~~~---------~~~~~~~~----- 238 (384) -+..|.+.+..+.-...+..=+--..+.-+-++.++ |.+....+++....+-.+|+. ...+..+. T Consensus 246 yLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlE 325 (516) T protein:vir:10 246 YLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTE 325 (516) T ss_pred eehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHh Confidence 899999999888887777666655555556677776 445444455555555554442 11111121 Q ss_pred ---eec---CCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhcc---------ccHHHHHH-----HHHHHHHH Q lcl|NC_019422. 239 ---AAT---DSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQS---------KYSEDEWN-----AYYESEIE 298 (384) Q Consensus 239 ---~v~---~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~---------~~~e~~~~-----~~~~~~i~ 298 (384) ++- +.|.+++.|.-...-.++...++..+.+..+++||.+-|.. ..+|-.+. -|+..-=. T Consensus 326 DyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~ 405 (516) T protein:vir:10 326 DYWLMRRDGKSVTEVSSLPGAQTMGDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQH 405 (516) T ss_pred hhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHH Confidence 111 23455555554555556777889999999999999987642 22332222 22222112 Q ss_pred HHHHHHHHHHhhcccCc-----cc---ccCcceEEeechh----hhccC-HHHHHHHHHH---HhCCCCCHHHHHHH-hC Q lcl|NC_019422. 299 PVGLQLSNQYTEKLFTR-----KA---RSFGNEIVFEASN----LQYAS-MSTKLNLVQM---VDRGSLTPNEWRKI-MN 361 (384) Q Consensus 299 P~~~~i~~~l~~~l~~~-----~~---~~~~~~i~fd~~~----~~~~d-~~~~~~~~~~---~~~g~~t~NE~R~~-lG 361 (384) -+...+.+.|..+|+.+ .+ ......+.|..|. ++... +..|+.+++. +.+.+++.+=+|+. |. T Consensus 406 rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr 485 (516) T protein:vir:10 406 DFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQ 485 (516) T ss_pred HHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhc Confidence 22333344444444332 21 1123334443333 32222 4444554432 34568888888754 44 Q ss_pred CCCCC----------CCCeeeecCceeecCCCC Q lcl|NC_019422. 362 LSPIE----------NGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 362 ~~p~~----------~gd~~~~~~n~~~~~~ge 384 (384) +.-.+ +....+.+ .|-+..| T Consensus 486 ~tDeei~~e~k~I~~E~~~~~~~---~p~~~~~ 515 (516) T protein:vir:10 486 MTEEQIAQEEKQIEQEAGIKRFQ---NPENEDD 515 (516) T ss_pred CCHhhHHHHHHHHHHhhhCCCCC---CCCcccc Confidence 44221 11111111 0111111 No 229 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=95.08 E-value=0.0028 Score=34.57 Aligned_cols=376 Identities=7% Similarity=0.011 Sum_probs=176.9 Q ss_pred Ccc----hhhhcc-cCCCcchhH-------H--HhhccccCcce-----------e------chhhhhhcHHHHHHHHHH Q lcl|NC_019422. 1 MNI----FKSKKK-NKEAPGKVM-------M--ELISDSGNGFY-----------S------WHGNLYKSDIVRSIIRPK 49 (384) Q Consensus 1 M~~----f~~~~~-~~~~~~~~~-------~--~~~~~~~~~~~-----------~------~~~~~~~~~~v~~~i~~i 49 (384) +.- ...+.+ ...++.++. . ...+...++++ + .-+....+|.|.+||+.| T Consensus 11 ~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~Av~eI 90 (516) T protein:vir:10 11 DRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINNPEVERAVANI 90 (516) T ss_pred cchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhccchhhHHHHh Confidence 211 111100 000000000 0 00000001110 0 012345689999999999 Q ss_pred HHhhccCc-----eEEEEecCCcceeccchHHHHHHhhcc---ccCCHHHHHHHHHHHHHHhCCeeEEEeeC-CCCceee Q lcl|NC_019422. 50 AKAVGKMT-----AKHIRSNETEFKTNPEIYIKFLLENPN---PFMSGQILQEKMVTQLELNSNAFAVIIKD-DYNMPTQ 120 (384) Q Consensus 50 a~~ia~~~-----~~~~~~~~~~~~~~~~~~~~~l~~~PN---~~~s~~~f~~~~~~~~l~~G~~~~~~~~~-~~g~~~~ 120 (384) .+.+.-+. +.+--.+-+-.+... ..++..-+ ..+++..--+.++..|.+.|..|+.++-+ ++.-..+ T Consensus 91 Vneaiv~d~~~~pV~l~L~~~~~s~~ik----~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~~k~GI~E 166 (516) T protein:vir:10 91 VNEAIVYERGHKVVSLDLDDTDFGSNVK----EKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMPNPKKGIAE 166 (516) T ss_pred hcceeEecCCCceEEEEecccCcchHHH----HHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCcccccee Confidence 98876432 222111111111111 11111111 12333444555677788999999997654 3444889 Q ss_pred EEEEcCceEEEEEcC-----CC--------EEEEEEEc------Cc------eEEEEehhheEEEecc-C-CCCCccCcc Q lcl|NC_019422. 121 IYPLNALNVEAIYEN-----EV--------LFLKFLLR------NG------KIVSYPYSDIIHLRKD-F-NENDLFGTS 173 (384) Q Consensus 121 l~~l~~~~v~~~~~~-----~~--------~~~~~~~~------~g------~~~~~~~~evih~~~~-~-~~~~~~G~s 173 (384) |+.|||..++.++.- ++ .++.|... +| ..+.++. +.|++-+. - ..+...-+| T Consensus 167 lr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~-dAI~y~hSGL~d~~~~~i~s 245 (516) T protein:vir:10 167 LRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPR-SAVVYASSGLMDCSDRGIIG 245 (516) T ss_pred eeeeCCcceeeEeeecccccccchhhhhhhheeeeccCccccccccceeCCCcceeech-hheeeecccceeCCCCceee Confidence 999999999876542 11 11222211 12 2334444 44444332 1 223334478 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHHHHHHHHHHHHhcc---------ccccCCcc----- Q lcl|NC_019422. 174 PAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDIKKEVKSFEKNYLQ---------IDSEAGGA----- 238 (384) Q Consensus 174 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~~~~~~~~~~~---------~~~~~~~~----- 238 (384) -+..|.+.+..+.-...+..=+--..+.-+-++.++ |.+....+++....+-.+|+. ...+..+. T Consensus 246 yLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlE 325 (516) T protein:vir:10 246 YLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTE 325 (516) T ss_pred eehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHh Confidence 899999999888887777666655555556677776 445444455555555554442 11111121 Q ss_pred ---eec---CCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhcc---------ccHHHHHH-----HHHHHHHH Q lcl|NC_019422. 239 ---AAT---DSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQS---------KYSEDEWN-----AYYESEIE 298 (384) Q Consensus 239 ---~v~---~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~~---------~~~e~~~~-----~~~~~~i~ 298 (384) ++- +.|.+++.|.-...-.++...++..+.+..+++||.+-|.. ..+|-.+. -|+..-=. T Consensus 326 DyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~ 405 (516) T protein:vir:10 326 DYWLMRRDGKSVTEVSSLPGAQTMGDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQH 405 (516) T ss_pred hhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHH Confidence 111 23455555554555556777889999999999999987642 22332222 22222112 Q ss_pred HHHHHHHHHHhhcccCc-----cc---ccCcceEEeechh----hhccC-HHHHHHHHHH---HhCCCCCHHHHHHH-hC Q lcl|NC_019422. 299 PVGLQLSNQYTEKLFTR-----KA---RSFGNEIVFEASN----LQYAS-MSTKLNLVQM---VDRGSLTPNEWRKI-MN 361 (384) Q Consensus 299 P~~~~i~~~l~~~l~~~-----~~---~~~~~~i~fd~~~----~~~~d-~~~~~~~~~~---~~~g~~t~NE~R~~-lG 361 (384) -+...+.+.|..+|+.+ .+ ......+.|..|. ++... +..|+.+++. +.+.+++.+=+|+. |. T Consensus 406 rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr 485 (516) T protein:vir:10 406 DFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQ 485 (516) T ss_pred HHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhc Confidence 22333344444444332 21 1123334443333 32222 4444554432 34568888888754 44 Q ss_pred CCCCC----------CCCeeeecCceeecCCCC Q lcl|NC_019422. 362 LSPIE----------NGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 362 ~~p~~----------~gd~~~~~~n~~~~~~ge 384 (384) +.-.+ +....+.+ .|-+..| T Consensus 486 ~tDeei~~e~k~I~~E~~~~~~~---~p~~~~~ 515 (516) T protein:vir:10 486 MTEEQIAQEEKQIEQEAGIKRFQ---NPENEDD 515 (516) T ss_pred CCHhhHHHHHHHHHHhhhCCCCC---CCCcccc Confidence 44221 11111111 0111111 No 230 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=94.93 E-value=0.0031 Score=34.30 Aligned_cols=352 Identities=9% Similarity=0.037 Sum_probs=158.5 Q ss_pred Ccc--hhh----hc---ccCCCcchhHHHhhcccc---------------------CcceechhhhhhcHHHHHHHHHHH Q lcl|NC_019422. 1 MNI--FKS----KK---KNKEAPGKVMMELISDSG---------------------NGFYSWHGNLYKSDIVRSIIRPKA 50 (384) Q Consensus 1 M~~--f~~----~~---~~~~~~~~~~~~~~~~~~---------------------~~~~~~~~~~~~~~~v~~~i~~ia 50 (384) |-+ ... .. +.....-.....+..+-. .........-+..+....+++..+ T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~ 80 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKK 80 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhh Confidence 111 000 00 000000000000110000 000000011123456677777777 Q ss_pred HhhccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCC-CCceeeEEEEcCceE Q lcl|NC_019422. 51 KAVGKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDD-YNMPTQIYPLNALNV 129 (384) Q Consensus 51 ~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~-~g~~~~l~~l~~~~v 129 (384) .-+.+-|.++-- +..+ ..+....+.. | ........+...+..+|.+|..+.++. .|. ..+..++|..+ T Consensus 81 ~yl~G~p~~~~~---~~~~--~~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~~g~-~~~~~~~p~~~ 149 (471) T protein:vir:10 81 AYALTYPPTFDV---DDKK--VNDMIVDVLG--D---DYERISKQLCVNAGNAGIAWLHVWKDASDNS-FRYACVDSKEV 149 (471) T ss_pred hhhcccCceecc---CChH--HHHHHHHHHh--c---CHHHHHHHHHHHHhhCCeEEEEEEeeCCCCe-eEEEEEcccce Confidence 777777776521 1111 1112222221 2 344566777889999999999998875 454 45777899988 Q ss_pred EEEEcCCC---E---EEEEEE---cCceE----EEEehhheEEEeccC-------------------------------C Q lcl|NC_019422. 130 EAIYENEV---L---FLKFLL---RNGKI----VSYPYSDIIHLRKDF-------------------------------N 165 (384) Q Consensus 130 ~~~~~~~~---~---~~~~~~---~~g~~----~~~~~~evih~~~~~-------------------------------~ 165 (384) -++.+... . ..+|.. .+++. ..+..+.+.|++... + T Consensus 150 ~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (471) T protein:vir:10 150 IPIYSKSLDKKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSFKHD 229 (471) T ss_pred EEEEcCCCCCceEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCC Confidence 88776542 1 112221 11111 112334444443211 1 Q ss_pred C---------CCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCC Q lcl|NC_019422. 166 E---------NDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAG 236 (384) Q Consensus 166 ~---------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~ 236 (384) . +...|.|.+..+...++....+.....+.+...+.|-.+++-...... +.....+.. . T Consensus 230 ~g~iPvv~~~n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~---~~~~~~~~~---------~ 297 (471) T protein:vir:10 230 FGLVPFIPFKNNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDK---QEFLEDLKR---------Y 297 (471) T ss_pred CCceeEEEeccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCcccc---chhHHHhhc---------C Confidence 1 112477888888777777776666666666666666444432111111 111111111 1 Q ss_pred cceec-----CCCceeeecccchhHHH-HHHHHHHHHHHHHHhCCCHH---HhccccHHH-------------HHHHHHH Q lcl|NC_019422. 237 GAAAT-----DSKYDAEQVKAESYVPN-AAQMDKAIQRLYSFFNTNEK---IIQSKYSED-------------EWNAYYE 294 (384) Q Consensus 237 ~~~v~-----~~g~~~~~l~~~~~~~~-~~~~~~~~~~I~~~fgvp~~---~l~~~~~e~-------------~~~~~~~ 294 (384) +.+.+ +.+.+++.+........ ...++.+.+.|+..-++|.. -+|.....+ .....+. T Consensus 298 ~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~ 377 (471) T protein:vir:10 298 KMIKMDNDGMGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVNPETDKLGNSSGVALKFLYSLLELKAGNMETQFR 377 (471) T ss_pred CeEEecCCCCccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcCCCcccccCccHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111 12234444443322222 23445566677776676643 222221111 1122333 Q ss_pred HHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--C---- Q lcl|NC_019422. 295 SEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN--G---- 368 (384) Q Consensus 295 ~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~--g---- 368 (384) ..+.-+++.+...+.. . ....+++.+......|..+.++.+... .|++|..-++++++.-..+. . T Consensus 378 ~~l~~~~~li~~~~~~-----~---d~~~i~i~f~~~~p~n~~e~~~~~~kl-~g~iS~et~~~~~p~v~D~~~E~eri~ 448 (471) T protein:vir:10 378 SGYATLVKMILKHLGL-----S---DKLKIKQTWTRNSINNDTEMAQVVSTL-ATITSRENVAKSNPIVEDWQDELRLQK 448 (471) T ss_pred HHHHHHHHHHHHHhcc-----C---CCceeEEEeCCCCCCCHHHHHHHHHHH-hccCchHHHHHhCCCCCCHHHHHHHHH Confidence 4444444444433321 1 123456666666677888888776543 48899988888887643221 0 Q ss_pred -CeeeecCceeecCCCC Q lcl|NC_019422. 369 -DKPVRRLDTAVVEGGE 384 (384) Q Consensus 369 -d~~~~~~n~~~~~~ge 384 (384) ++-........+.+++ T Consensus 449 ~E~~~~~~~~~~~~~~~ 465 (471) T protein:vir:10 449 AEQEGRSEKLYDMEEVE 465 (471) T ss_pred HHHHHHHhcccccCCCC Confidence 1111122222333333 No 231 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=94.68 E-value=0.0038 Score=33.87 Aligned_cols=372 Identities=13% Similarity=0.066 Sum_probs=155.9 Q ss_pred Cc--------chhhhcccCCCcchhHHHhhccccCcce----ech---hhhhhcHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_019422. 1 MN--------IFKSKKKNKEAPGKVMMELISDSGNGFY----SWH---GNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNE 65 (384) Q Consensus 1 M~--------~f~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~---~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 65 (384) |. +.++........-.....+..+...... ... ..=+..+....+|+..+.-+-.-|+++-- ++ T Consensus 13 ~~~~~~~~~~~i~~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~n~~~~iv~~~~~~l~g~~~~~~~-~d 91 (489) T protein:vir:99 13 SKLWIDQLKNYISRFKAEQLERLKELKRYYLGDNNIKYRPAKTDKYAADNRIASDFAKYITVFEQGYMLGVPVEYKN-EN 91 (489) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccCCcceeecchHHHHHHHHhhhhccCCceeec-CC Confidence 21 1111110000000111112211111000 000 00123556678888888777666766421 11 Q ss_pred CcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEee----CCCCceeeEEEEcCceEEEEEcCCC---E Q lcl|NC_019422. 66 TEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIK----DDYNMPTQIYPLNALNVEAIYENEV---L 138 (384) Q Consensus 66 ~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~----~~~g~~~~l~~l~~~~v~~~~~~~~---~ 138 (384) + ........++.. .....+...+..+.+.+|.+|..+.. ++.|. ..+..++|..+.++.+... . T Consensus 92 ~----~~~~~l~~~~~~----n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~-~~i~~~~p~~~~~v~dd~~~~~~ 162 (489) T protein:vir:99 92 K----DLQAAIDLMSVR----NNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKTE-VKLYQLPAEQTFVIYDDTYQRNS 162 (489) T ss_pred h----hHHHHHHHHHhh----cChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCcc-eEEEEEcccceEEEEcCCCCCce Confidence 1 111222233322 24446778888999999999987764 33343 4577888888887766432 1 Q ss_pred E---EEEEEc--CceE----EEEehhheEEEeccC--------------CC---------CCccCccHHHHHHHHHHHHH Q lcl|NC_019422. 139 F---LKFLLR--NGKI----VSYPYSDIIHLRKDF--------------NE---------NDLFGTSPAKVLEPIMEVVN 186 (384) Q Consensus 139 ~---~~~~~~--~g~~----~~~~~~evih~~~~~--------------~~---------~~~~G~s~~~~~~~~i~~~~ 186 (384) . .+|... .+.. ..+.++.+.+++... +. +...|.|.+..+...++... T Consensus 163 ~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~s~~~~v~~liDa~d 242 (489) T protein:vir:99 163 LMAVHFYDIDYGSGKRKQIIKAYTSDTIYTYEDYNLETKGMRLKDYEGHFFKGVPVNEYANNEERTGAYESVLDNIDAYD 242 (489) T ss_pred EEEEEEEEEecCCCceEEEEEEEeCCcEEEEEecCCCcccceecccccccCCceeEEEeecCCCCCCchhhhHHHHHHHH Confidence 1 111111 1111 112233333332211 00 11247777777777666666 Q ss_pred HHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhcc-----ccccCCcceecCCC-------ceeeecccch Q lcl|NC_019422. 187 TTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQ-----IDSEAGGAAATDSK-------YDAEQVKAES 254 (384) Q Consensus 187 ~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~-----~~~~~~~~~v~~~g-------~~~~~l~~~~ 254 (384) .+.....+.....+.|-.+++- .....+...+....+.....+ .....++++.++.+ .+.+.+.... T Consensus 243 ~~~s~~~~~~~~~~~~~l~i~g-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 321 (489) T protein:vir:99 243 LSQSELANFQQDSVNALLVIAG-NAYTGADENDYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAYFLKKEY 321 (489) T ss_pred HHHHHHHHHHHHhhhhhhhhcc-CCcccccchhhhhhcccccccccccccccccceeeeeccccCccccccceeeeeecC Confidence 5555555544444444333321 112111111111111110000 00111233333322 2334444332 Q ss_pred hHHHH-HHHHHHHHHHHHHhCCCHHH---hccccHHHHH--------------HHHHHHHHHHHHHHHHHHHhhcccCcc Q lcl|NC_019422. 255 YVPNA-AQMDKAIQRLYSFFNTNEKI---IQSKYSEDEW--------------NAYYESEIEPVGLQLSNQYTEKLFTRK 316 (384) Q Consensus 255 ~~~~~-~~~~~~~~~I~~~fgvp~~~---l~~~~~e~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~ 316 (384) ..... ..++.+.+.|+..-++|..- .+++.+..+. ...+...+.-+++.+...+...-.... T Consensus 322 ~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~ 401 (489) T protein:vir:99 322 DTAGSEAYKNRLVADILRFTFTPDTQDMKFSGVQSGESMKYKLMASDNYREKQERLFKKGLMRRLRLAANIWAIKGNEAT 401 (489) T ss_pred ChHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccc Confidence 22222 34466777888888877532 2222222211 123444555555555554432111111 Q ss_pred cccCcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC----CC-------CeeeecCceeec--CCC Q lcl|NC_019422. 317 ARSFGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIE----NG-------DKPVRRLDTAVV--EGG 383 (384) Q Consensus 317 ~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~----~g-------d~~~~~~n~~~~--~~g 383 (384) .......+.+.++.-...|..+.++.+... .|+++...+.++++.-..+ .. +........... .++ T Consensus 402 ~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl-~giis~et~~~~l~~v~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 480 (489) T protein:vir:99 402 TYSLVNDTSIVFTPNLPQNDNEIVTAAQNL-YGIVSDQTIFEILNTVTGVDAEAELKRLKEEADKKQSLPEPRLVGDASG 480 (489) T ss_pred cccccccceEEeCCCCCcCHHHHHHHHHHH-hccCCHHHHHHhcCCCCchhHHHHHHHHHHHHHHHhccccccccCCCCC Confidence 111112344555655566777777765432 3789988888887542110 00 000000011101 111 Q ss_pred C Q lcl|NC_019422. 384 E 384 (384) Q Consensus 384 e 384 (384) | T Consensus 481 ~ 481 (489) T protein:vir:99 481 Q 481 (489) T ss_pred C Confidence 1 No 232 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=94.57 E-value=0.004 Score=33.70 Aligned_cols=353 Identities=9% Similarity=0.084 Sum_probs=161.9 Q ss_pred Cc------chhhhcccCCCcchh---HHHhhccc-----------------cCcceechhhhhhcHHHHHHHHHHHHhhc Q lcl|NC_019422. 1 MN------IFKSKKKNKEAPGKV---MMELISDS-----------------GNGFYSWHGNLYKSDIVRSIIRPKAKAVG 54 (384) Q Consensus 1 M~------~f~~~~~~~~~~~~~---~~~~~~~~-----------------~~~~~~~~~~~~~~~~v~~~i~~ia~~ia 54 (384) |= +-.+........-+. ...+..+- .........+-+.++.....++..+.-+- T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 80 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGYVA 80 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhhee Confidence 10 000000000000000 00000000 00000000111234566677888888887 Q ss_pred cCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEc Q lcl|NC_019422. 55 KMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYE 134 (384) Q Consensus 55 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~ 134 (384) +-|.++--.+ + + ..+. ...++. .+..+-...+..++..+|.+|.++..+..|.+ .+..++|..+.++.+ T Consensus 81 G~p~~~~~~d-~--~-~~~~-l~~~~~-----~~~~~~~~~l~~~~~~~G~a~~~~y~d~~~~~-~~~~~~p~~~~~v~d 149 (470) T protein:vir:10 81 SVFPDIDVGK-D--A-DNKK-IIDVLG-----DDRALTLNGLLVDSSNAGRAWLHYWIDEDGNF-RYGIIQPDQITPIYA 149 (470) T ss_pred ccceeeecCc-h--H-HHHH-HHHHHh-----hhHHHHHHHHHHHHhhcCeeEEEEEecCCCce-EEEEEcccceEEEEc Confidence 7787752111 1 1 1111 222222 12345556677899999999999999988865 467788888888876 Q ss_pred CCC--E----EEEEEE--cCc-eE----EEEehhheEEEeccC-------------------------------C----- Q lcl|NC_019422. 135 NEV--L----FLKFLL--RNG-KI----VSYPYSDIIHLRKDF-------------------------------N----- 165 (384) Q Consensus 135 ~~~--~----~~~~~~--~~g-~~----~~~~~~evih~~~~~-------------------------------~----- 165 (384) ... . +.+|.. ..+ .. ..+.+..+.|++... + T Consensus 150 ~~~~~~~~a~ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP 229 (470) T protein:vir:10 150 TTLDNKLLGILRSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVP 229 (470) T ss_pred CCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccccccccccccccCCCeee Confidence 532 1 111111 111 11 112223333332110 0 Q ss_pred ----CCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceec Q lcl|NC_019422. 166 ----ENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAAT 241 (384) Q Consensus 166 ----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~ 241 (384) .+...|.|.+..+...++..........+.+...+.|-.+++-- .+. ...+....+.. .+.+.+ T Consensus 230 vv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~-~~~--~~~~~~~~~~~---------~~~i~~ 297 (470) T protein:vir:10 230 FIEFSKNKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNY-GGA--DLHQFMNDLRK---------YKSIKI 297 (470) T ss_pred EEEeecCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecC-Ccc--ccchhhhhhhh---------cCeEec Confidence 01235888888888888888777777777777666665555321 111 11112221111 111222 Q ss_pred -----CCCceeeecccchhHH-HHHHHHHHHHHHHHHhCCCHHH-hc-cccHHH--------------HHHHHHHHHHHH Q lcl|NC_019422. 242 -----DSKYDAEQVKAESYVP-NAAQMDKAIQRLYSFFNTNEKI-IQ-SKYSED--------------EWNAYYESEIEP 299 (384) Q Consensus 242 -----~~g~~~~~l~~~~~~~-~~~~~~~~~~~I~~~fgvp~~~-l~-~~~~e~--------------~~~~~~~~~i~P 299 (384) +.+.+++-+....... ....++.+.+.|+..-++|..- .+ |+.+.. .....+..++.- T Consensus 298 ~~~~~~~~~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~ 377 (470) T protein:vir:10 298 NNTGNGDNSGVDKLQIDIPVEARDDALKITRKNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINE 377 (470) T ss_pred cCCCCCcCceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCCCCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1233444444333222 2234455666776666666422 12 111111 122344555555 Q ss_pred HHHHHHHHHhhcccCcccccCcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--C--------- Q lcl|NC_019422. 300 VGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN--G--------- 368 (384) Q Consensus 300 ~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~--g--------- 368 (384) +++.+...++.. . .....+.+.+..-...|..+.++.+... .|+++..-+++++++-..+. . T Consensus 378 ~~~~i~~~l~~~-----~-~d~~~i~i~f~~~~p~d~~e~~~~~~~~-~g~iS~et~l~~~p~v~D~~~E~eri~~E~~e 450 (470) T protein:vir:10 378 LVRAIMRYLNFS-----D-ADKRHISQHWTRTKVEDSLTKAQIVSTV-ANYSSKEAVAKANPIVDDWQQELKDLAKDKEE 450 (470) T ss_pred HHHHHHHHhccc-----C-cccceeeEEeccCCCCCHHHHHHHHHHH-hccCcHHHHHHhCCCCCCHHHHHHHHHHHHHH Confidence 555555444321 1 1123556666766777888888776543 58899988888887643221 0 Q ss_pred CeeeecCceeecC-----CCC Q lcl|NC_019422. 369 DKPVRRLDTAVVE-----GGE 384 (384) Q Consensus 369 d~~~~~~n~~~~~-----~ge 384 (384) +.... .+.-..+ +.| T Consensus 451 ~~~~~-~~~~~~~~~~~dde~ 470 (470) T protein:vir:10 451 NDPYS-NQADELNGKGVNDEQ 470 (470) T ss_pred HHHhh-ccccccCCCCCCCCC Confidence 11111 1111111 111 No 233 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=93.83 E-value=0.0063 Score=32.67 Aligned_cols=376 Identities=8% Similarity=0.009 Sum_probs=177.5 Q ss_pred Ccc----hhhhcc-cCCCcchhH--HH-------hhccccCcc-----------ee------chhhhhhcHHHHHHHHHH Q lcl|NC_019422. 1 MNI----FKSKKK-NKEAPGKVM--ME-------LISDSGNGF-----------YS------WHGNLYKSDIVRSIIRPK 49 (384) Q Consensus 1 M~~----f~~~~~-~~~~~~~~~--~~-------~~~~~~~~~-----------~~------~~~~~~~~~~v~~~i~~i 49 (384) +.. ...+.+ ...+..++. .. ..+...+++ .+ .-+....+|.|.+||+.| T Consensus 11 ~~~d~~~~~~~~~~~~~s~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~d~~~~~~~~~~LI~~YR~ma~~pEvd~Av~eI 90 (516) T protein:vir:10 11 DRVDQNEYDERLKQGHESIATPKKDDGATEIEAREGESSYNALMQQFFGIDNNISGTKDLINTYRQLTNNPEVERAVANI 90 (516) T ss_pred cchhhHHHHhhhcCCCCcccCCCCccCceeeecCcccccccceeeeeecccCccccHHHHHHHHHHhhhccchhHHHHHh Confidence 111 111110 000000000 00 000000010 00 012334588999999999 Q ss_pred HHhhccCc-----eEEEEecCCcceeccchHHHHHHhhcc---ccCCHHHHHHHHHHHHHHhCCeeEEEeeC-CCCceee Q lcl|NC_019422. 50 AKAVGKMT-----AKHIRSNETEFKTNPEIYIKFLLENPN---PFMSGQILQEKMVTQLELNSNAFAVIIKD-DYNMPTQ 120 (384) Q Consensus 50 a~~ia~~~-----~~~~~~~~~~~~~~~~~~~~~l~~~PN---~~~s~~~f~~~~~~~~l~~G~~~~~~~~~-~~g~~~~ 120 (384) .+.+.-+. +.+- -+..+ .....-..++..-+ ..+++..--+.++..|.+.|..|+.++-+ .+.-..+ T Consensus 91 vneaiv~d~~~~pV~l~---l~~~e-~s~sik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~~k~GI~e 166 (516) T protein:vir:10 91 VNEAVVYEKGHKVVSLD---LDDTE-FSSSIKDKILEEFDEICRLLDASRKLDTLFRRWYIDSRIFFHKIMPNPKEGIVE 166 (516) T ss_pred hcceeEecCCCceEEEE---ecccc-cchHHHHHHHHHHHHHHHHhccchhhhHHHHhhhhcceEEEEEEecCcccceee Confidence 98886442 2221 11110 11111111111111 12333444555677788999999997654 3445889 Q ss_pred EEEEcCceEEEEEcC-----CC------E--EEEEEEc------Cc------eEEEEehhheEEEeccC--CCCCccCcc Q lcl|NC_019422. 121 IYPLNALNVEAIYEN-----EV------L--FLKFLLR------NG------KIVSYPYSDIIHLRKDF--NENDLFGTS 173 (384) Q Consensus 121 l~~l~~~~v~~~~~~-----~~------~--~~~~~~~------~g------~~~~~~~~evih~~~~~--~~~~~~G~s 173 (384) |+.|||..++.++.- ++ . ++.|... +| ..+.++ .+.|++-+.. +.++..-+| T Consensus 167 lr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~-~daI~y~hSGl~d~~~~~i~s 245 (516) T protein:vir:10 167 LRRLDPRHVEYYREIVTSDVGGTSVVKGYREFFVYTTGNEGYAYNGRLFEPNTRIKIP-RSAIVYAHSGLQDCSDRGIVG 245 (516) T ss_pred eeeeCCcceeeEEeeecccCcchhhhhceeeeeeeecCccceeccccccCCCCceecc-hhheeeeecCcccCCCCceec Confidence 999999999876532 11 1 1222211 12 223333 4455553322 223333478 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHHHHHHHHHHHHhcc---------ccccCCcc----- Q lcl|NC_019422. 174 PAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDIKKEVKSFEKNYLQ---------IDSEAGGA----- 238 (384) Q Consensus 174 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~~~~~~~~~~~---------~~~~~~~~----- 238 (384) -+..|.+.+..+.-...+..=+--..+.-+-++.++ |.+....+++....+-.+|+. ...+..+. T Consensus 246 yLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~KNklvYDa~TGev~ddrk~msMlE 325 (516) T protein:vir:10 246 YLHNAVKPANQLKLLEDALVIYRITRAPERRVFYIDVGNMPNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTE 325 (516) T ss_pred eehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHh Confidence 899999988888877777666655555556677776 445444455555555554442 11111121 Q ss_pred ---eec---CCCceeeecccchhHHHHHHHHHHHHHHHHHhCCCHHHhc---------cccHHHHHH-----HHHHHHHH Q lcl|NC_019422. 239 ---AAT---DSKYDAEQVKAESYVPNAAQMDKAIQRLYSFFNTNEKIIQ---------SKYSEDEWN-----AYYESEIE 298 (384) Q Consensus 239 ---~v~---~~g~~~~~l~~~~~~~~~~~~~~~~~~I~~~fgvp~~~l~---------~~~~e~~~~-----~~~~~~i~ 298 (384) ++- +.|.+++.|.-...-.++...++..+.+..+++||.+-|. |..+|-.+. -|+..-=. T Consensus 326 DyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~ 405 (516) T protein:vir:10 326 DYWLMRRDGKSVTEVTSLPGAQTMGEMDDVRWFNKKLYEALRIPLSRMPRDDGGMVIGGQDMAITRDELDFRKFIVQLQH 405 (516) T ss_pred hhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHH Confidence 111 2345555555455555677788999999999999998764 222332222 23222212 Q ss_pred HHHHHHHHHHhhcccCc-----cc---ccCcceEEeechh----hhccC-HHHHHHHHHH---HhCCCCCHHHHHHH-hC Q lcl|NC_019422. 299 PVGLQLSNQYTEKLFTR-----KA---RSFGNEIVFEASN----LQYAS-MSTKLNLVQM---VDRGSLTPNEWRKI-MN 361 (384) Q Consensus 299 P~~~~i~~~l~~~l~~~-----~~---~~~~~~i~fd~~~----~~~~d-~~~~~~~~~~---~~~g~~t~NE~R~~-lG 361 (384) -+...+.+.|..+|+.. .+ ......+.|..|. ++... +..|+.+++. +.+.+++.+=+|+. |. T Consensus 406 rFs~lF~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~~yi~k~ILr 485 (516) T protein:vir:10 406 NFEEIFLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELKDIETLRQRVDALSQIEPYVGKYVSHDYVMKNILQ 485 (516) T ss_pred HHHHHHHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhc Confidence 23334455555544443 21 1123334443333 32222 4444554432 34568888888764 44 Q ss_pred CCCCC----------CCCeeeecCceeecCCCC Q lcl|NC_019422. 362 LSPIE----------NGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 362 ~~p~~----------~gd~~~~~~n~~~~~~ge 384 (384) +.-.+ +.+..+.+. |-++.| T Consensus 486 ~tDeei~~~~k~I~~E~~~~~~~~---p~~e~~ 515 (516) T protein:vir:10 486 MTDEQIAQEEKQIEKEANVKRFQN---PENEDD 515 (516) T ss_pred CCHhHHHHHHHHHHHhhhCCCCCC---CCcccc Confidence 44221 111111111 122222 No 234 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=93.82 E-value=0.0063 Score=32.67 Aligned_cols=352 Identities=9% Similarity=-0.018 Sum_probs=155.0 Q ss_pred Ccchhh----hcccCCCcchh---HHHhhccccCc-------------ceechhhhhhcHHHHHHHHHHHHhhccCceEE Q lcl|NC_019422. 1 MNIFKS----KKKNKEAPGKV---MMELISDSGNG-------------FYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKH 60 (384) Q Consensus 1 M~~f~~----~~~~~~~~~~~---~~~~~~~~~~~-------------~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~ 60 (384) ..+... ..+.-....+. ...+..+.... .-....+=+..+....+++..+.-+-+-|.++ T Consensus 24 ~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~ 103 (468) T protein:vir:96 24 YETQEEMILRLITKHKENVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDWRMYTNYHQNLVDQKVAYAVANPVTY 103 (468) T ss_pred ccCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhccCCcee Confidence 011000 00000000000 00111110000 00000111234566777887777777777765 Q ss_pred EEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCC--E Q lcl|NC_019422. 61 IRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEV--L 138 (384) Q Consensus 61 ~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~ 138 (384) - .++ .+ .......++. | +.......+..+...+|.+|+.+..+..|.+ .+..++|..+.++.+... . T Consensus 104 ~-~~d--~~--~~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~v~~~~~~~~ 172 (468) T protein:vir:96 104 G-TED--EK--SLKTIQEVLN--H---KWDDKLVDILTAASNKGVEWIQPYVDEQGEF-KTFRVPAEQAIPIWTNKERDE 172 (468) T ss_pred c-cCC--hH--HHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEEEcCCCce-EEEEEcccceEEEEcCCCCCc Confidence 2 111 11 1122223332 2 3456667788999999999999988888864 467788888877765321 1 Q ss_pred ----EEEEEEcCceEE-EEehhheEEEecc-------------------------CC---------CCCccCccHHHHHH Q lcl|NC_019422. 139 ----FLKFLLRNGKIV-SYPYSDIIHLRKD-------------------------FN---------ENDLFGTSPAKVLE 179 (384) Q Consensus 139 ----~~~~~~~~g~~~-~~~~~evih~~~~-------------------------~~---------~~~~~G~s~~~~~~ 179 (384) .++|...+.... .+.+..+.|++.. ++ .+...|.|.+..+. T Consensus 173 ~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~g~sd~e~v~ 252 (468) T protein:vir:96 173 LKAFIRLYELDGGERVEYWTANDVTFYELKDGQLIPDYYQGEEHVQAHYYVGNKSMSWNRVPFIPFKNNPQEVSDLFMYK 252 (468) T ss_pred eEEEEEEEEecCceEEEEEeCCeEEEEEEcCCceeecccccccccccceeeccccccCCcccEEEecCCCCCCCchHHHH Confidence 112222211111 1122223222211 00 01235888888888 Q ss_pred HHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceec--CCCceeeecccchhHH Q lcl|NC_019422. 180 PIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAAT--DSKYDAEQVKAESYVP 257 (384) Q Consensus 180 ~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~--~~g~~~~~l~~~~~~~ 257 (384) ..++..........+.++..+.|..+++- ..+.+ .+.... ...+ ++++.+ +++.+++.+....... T Consensus 253 ~liDa~d~~~S~~~~~~~~~~~p~lv~~g-~~~~~--~~~~~~----~~~~-----~~~i~~~~d~~~~~~~l~~~~~~~ 320 (468) T protein:vir:96 253 TIIDAMDKRLSDTQNTFDEATELIYVLKG-YEGED--LEEFMY----NLKY-----YKAINVDGDGSGGVDTIQIDVPVQ 320 (468) T ss_pred HHHHHHHHHHHHHHHHHHHhcCceeeeec-CCccc--cchhhh----hhhc-----CceEEecCCCCCcceEEeecCChH Confidence 87777777666666677766677555432 12211 111111 1111 223333 2344455454433333 Q ss_pred HH-HHHHHHHHHHHHHhCCCHHH---hccccHHHHH--------------HHHHHHHHHHHHHHHHHHHhhcccCccccc Q lcl|NC_019422. 258 NA-AQMDKAIQRLYSFFNTNEKI---IQSKYSEDEW--------------NAYYESEIEPVGLQLSNQYTEKLFTRKARS 319 (384) Q Consensus 258 ~~-~~~~~~~~~I~~~fgvp~~~---l~~~~~e~~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~ 319 (384) .+ ..++.+.+.|+..-++|... .+++.+..+. ...+...+.-+++.+...+.. .. T Consensus 321 ~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~-----~~-- 393 (468) T protein:vir:96 321 SAKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKL-----SI-- 393 (468) T ss_pred HHHHHHHHHHHHHHHHhCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-----Cc-- Confidence 32 34456667777777776532 2222222211 123334444444444333221 11 Q ss_pred CcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCee-----eecCceeecCCCC Q lcl|NC_019422. 320 FGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN--GDKP-----VRRLDTAVVEGGE 384 (384) Q Consensus 320 ~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~--gd~~-----~~~~n~~~~~~ge 384 (384) ....+.+.++.-...|..+.++.+ ...|++|.-.++++++.-..+. .+.. -.......+.+++ T Consensus 394 d~~~i~i~f~~~~p~d~~e~a~~~--~~~g~iS~et~i~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~ 463 (468) T protein:vir:96 394 KVQDVEITFNFNVMVNELEQSQIG--VNSQYLSKETVVTNHPWVDDPVAEMERIDQEELALPSIEEGLNGKE 463 (468) T ss_pred ccceeeEEecCCCCcCHHHHHHHH--HhcCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhccCCCC Confidence 112334444444445565555543 4569999988988886633210 0000 0000111223333 No 235 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=93.22 E-value=0.0083 Score=31.98 Aligned_cols=358 Identities=9% Similarity=0.000 Sum_probs=160.7 Q ss_pred Ccc--hhhh---cccCCCcchhHHHhhccccCcce------echhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_019422. 1 MNI--FKSK---KKNKEAPGKVMMELISDSGNGFY------SWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEFK 69 (384) Q Consensus 1 M~~--f~~~---~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 69 (384) |.+ ..+. .......-.....+..+...... .....-+..+....+|+..+.-+-+-|+.+--.++ T Consensus 16 ~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~~~---- 91 (499) T protein:vir:10 16 PNIEAINYAIRELQNRKKRLDKLSDYYNGKQEIEKHEFDNATVEAANVMVNHAKYITDMNVGFMTGNPVKYVAEKG---- 91 (499) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcCCcCcCCCCcceeecchHHHHHHHHhhhhcccCceeecCCh---- Confidence 110 0000 00000000011111111110000 00011123456677888888877777776521111 Q ss_pred eccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCce----------------eeEEEEcCceEEEEE Q lcl|NC_019422. 70 TNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMP----------------TQIYPLNALNVEAIY 133 (384) Q Consensus 70 ~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~----------------~~l~~l~~~~v~~~~ 133 (384) . ....+..++.. .....+...+..+.+.+|.+|.++..+..|.+ ..+..++|..+.++. T Consensus 92 ~-~~~~l~~~~~~----n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~v~p~~~~~v~ 166 (499) T protein:vir:10 92 K-NIDDILEVFNQ----IDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLTPNTELKIEVIDPRATVVVC 166 (499) T ss_pred h-HHHHHHHHHhh----cCHhHHHHHHHHHHHhcCceEEEEEecccccccccccccccccccccceEEEEEcccceEEEe Confidence 1 11112222222 24556788888999999999999988877642 346677888777666 Q ss_pred cCCCE------EEEEEEc---CceE----EEEehhheEEEeccC----------------CC---------CCccCccHH Q lcl|NC_019422. 134 ENEVL------FLKFLLR---NGKI----VSYPYSDIIHLRKDF----------------NE---------NDLFGTSPA 175 (384) Q Consensus 134 ~~~~~------~~~~~~~---~g~~----~~~~~~evih~~~~~----------------~~---------~~~~G~s~~ 175 (384) +.... +++|... +++. ..+.++.+.+++... ++ +...|.|.+ T Consensus 167 ~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~d~ 246 (499) T protein:vir:10 167 DDTVEHDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTMEVSANDPIVYDGENLFGAVPIIEFRNNEERQGDF 246 (499) T ss_pred cCCCCcceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCCccccCcceecccccCCCCccceEEecCCCCCCCch Confidence 54321 1111111 1111 123334444432111 11 123577888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCccee--cCCCceeeecccc Q lcl|NC_019422. 176 KVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAA--TDSKYDAEQVKAE 253 (384) Q Consensus 176 ~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v--~~~g~~~~~l~~~ 253 (384) ..+...++..........+.+...+.|..+++- ..+..+. . ....+. .+.+.. .+++.+++.+... T Consensus 247 e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G-~~~~~~~-~-~~~~~~---------~~~~~~~~~~~~~d~~~l~~~ 314 (499) T protein:vir:10 247 EQLISLIDAYNLLQTDRISDKEAFVDALLVTFG-FGLGDDK-D-DIQRLK---------RGAIEAPPREEGADIEWLTKS 314 (499) T ss_pred HhHHHHHHHHHHHHHHHHHHHHHhcCceeeeec-Ccccccc-c-hhhhhh---------hcceeccCCCCCCcceEEecc Confidence 888888887777766666677676777555532 2222111 0 011110 111222 3456666666654 Q ss_pred hhHHHH-HHHHHHHHHHHHHhCCC---HHHhccccHHHH--------------HHHHHHHHHHHHHHHHHHHHhhcccCc Q lcl|NC_019422. 254 SYVPNA-AQMDKAIQRLYSFFNTN---EKIIQSKYSEDE--------------WNAYYESEIEPVGLQLSNQYTEKLFTR 315 (384) Q Consensus 254 ~~~~~~-~~~~~~~~~I~~~fgvp---~~~l~~~~~e~~--------------~~~~~~~~i~P~~~~i~~~l~~~l~~~ 315 (384) .....+ ..++.+.+.|+..-++| +.-++++.+..+ ....+...+.-+++.+...++.. .. T Consensus 315 ~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~--~~ 392 (499) T protein:vir:10 315 FDETQVNLLSQSIENDIHKISYVPNMNDEKFMGNVSGEAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTIVNIK--GA 392 (499) T ss_pred CCHHHHHHHHHHHHHHHHHHhCcccCCchhhcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--CC Confidence 333332 23445555665555555 333433322211 12244444555555555444422 11 Q ss_pred ccccCcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--C--------------CeeeecCceee Q lcl|NC_019422. 316 KARSFGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN--G--------------DKPVRRLDTAV 379 (384) Q Consensus 316 ~~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~--g--------------d~~~~~~n~~~ 379 (384) . .....+++.+..-...|..+.++.+... .|+++..-++++++.-..+. . .+.....+... T Consensus 393 -~-~d~~~i~i~f~~~~p~n~~e~~~~~~kl-~g~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 469 (499) T protein:vir:10 393 -N-DDASGCKISLVANIPSNLSDVVNNVKNA-DGIIPRKYTYSWLPDVDNPQDVIDEMNQQDAETIKKNQEALRGQDPDR 469 (499) T ss_pred -c-cccccceEEeCCCCCCCHHHHHHHHHHH-hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccCCCCC Confidence 1 1112334444554556777777766433 47888887887776633210 0 01111111111 Q ss_pred cCCCC Q lcl|NC_019422. 380 VEGGE 384 (384) Q Consensus 380 ~~~ge 384 (384) .+.++ T Consensus 470 ~~~~~ 474 (499) T protein:vir:10 470 LELED 474 (499) T ss_pred CCCCC Confidence 11111 No 236 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=93.13 E-value=0.0087 Score=31.89 Aligned_cols=353 Identities=9% Similarity=-0.016 Sum_probs=156.1 Q ss_pred CcchhhhcccCCCcc--------------hh---HHHhhccccCc-------------ceechhhhhhcHHHHHHHHHHH Q lcl|NC_019422. 1 MNIFKSKKKNKEAPG--------------KV---MMELISDSGNG-------------FYSWHGNLYKSDIVRSIIRPKA 50 (384) Q Consensus 1 M~~f~~~~~~~~~~~--------------~~---~~~~~~~~~~~-------------~~~~~~~~~~~~~v~~~i~~ia 50 (384) =-+|.++........ +. ...+..+.... .......-+.++....+++..+ T Consensus 14 ~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~ivd~~~ 93 (478) T protein:vir:10 14 EQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPFKRDVNGDYDETKPDWRMYTNYHQNLVDQKV 93 (478) T ss_pred hHHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccchhhhcccccccccccceeccchHHHHHHHHh Confidence 001111110000000 00 01111110000 0000001123567778888888 Q ss_pred HhhccCceEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEE Q lcl|NC_019422. 51 KAVGKMTAKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVE 130 (384) Q Consensus 51 ~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~ 130 (384) .-+-+-|+++-- +++ + . ......++. | ........+..+...+|.+|+.+..+..|.+ .+..++|..+. T Consensus 94 ~yl~g~p~~~~~-~~~--~-~-~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~~~~~~p~~~~ 162 (478) T protein:vir:10 94 AYAVANPVTFGV-DND--K-A-LKQIQHTLN--H---KWDDKLVDILTAASNKGIEWVQPYVDEEGEF-KTFRVPAEQAV 162 (478) T ss_pred hhhcccCceeec-CCh--H-H-HHHHHHHHh--c---cHHHHHHHHHHHHhhCCeEEEEEEecCCCce-EEEEEcccceE Confidence 888777877521 111 1 1 111222222 2 4556777778999999999999998888865 47778888888 Q ss_pred EEEcCC--CE----EEEEEEcCceE-EEEehhheEEEecc-------------------------CCC---------CCc Q lcl|NC_019422. 131 AIYENE--VL----FLKFLLRNGKI-VSYPYSDIIHLRKD-------------------------FNE---------NDL 169 (384) Q Consensus 131 ~~~~~~--~~----~~~~~~~~g~~-~~~~~~evih~~~~-------------------------~~~---------~~~ 169 (384) ++.+.. +. .++|...+-.. ..+.++.+.+++.. ++. +.. T Consensus 163 ~v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~ 242 (478) T protein:vir:10 163 PIWTNKERDELQAFIRVYELDGAERVEYWTKDDVTFYELKEGQLIPDFYRSEDHIQPHYYQGNKLMSWGRVPFIPFKNNP 242 (478) T ss_pred EEEcCCCCCceEEEEEEEeeeCceEEEEEeCCcEEEEEecCCeeeccccccccccccceecccccccCCcceEEEeccCC Confidence 776532 11 12222221111 11223333333221 000 123 Q ss_pred cCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceec--CCCcee Q lcl|NC_019422. 170 FGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAAT--DSKYDA 247 (384) Q Consensus 170 ~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~--~~g~~~ 247 (384) .|.|.+..+...++....+.....+.+...+.|..+++- ....+ .......+. + .+++.+ +.|.++ T Consensus 243 ~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g-~~~~~--~~~~~~~~~----~-----~~~~~~~~~~~~~~ 310 (478) T protein:vir:10 243 QEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKG-YEGED--MKDFMHNLK----Y-----YKAISVAGESGSGV 310 (478) T ss_pred CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeec-CCccc--ccchhhhhh----h-----CceeEecCCCCCcc Confidence 588888888888887777666666666665666444422 12111 111111111 1 112222 233444 Q ss_pred eecccchhHHHH-HHHHHHHHHHHHHhCCCHH---HhccccHHHHH--------------HHHHHHHHHHHHHHHHHHHh Q lcl|NC_019422. 248 EQVKAESYVPNA-AQMDKAIQRLYSFFNTNEK---IIQSKYSEDEW--------------NAYYESEIEPVGLQLSNQYT 309 (384) Q Consensus 248 ~~l~~~~~~~~~-~~~~~~~~~I~~~fgvp~~---~l~~~~~e~~~--------------~~~~~~~i~P~~~~i~~~l~ 309 (384) ..+........+ ..++.+.+.|+..-++|.. -++++.+..+. ...+...+.-+++.+.+.+. T Consensus 311 ~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~ 390 (478) T protein:vir:10 311 DTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYR 390 (478) T ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCCcCcCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Confidence 444443333333 3345566677777777643 23333332221 12333334444444433322 Q ss_pred hcccCcccccCcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC---------Ce--eeecCcee Q lcl|NC_019422. 310 EKLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIENG---------DK--PVRRLDTA 378 (384) Q Consensus 310 ~~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~g---------d~--~~~~~n~~ 378 (384) . .. . ...+++.++.-...+..+.++..... .|+++...+.++++.-..+.. ++ ...+.... T Consensus 391 ~----~~-d--~~~i~i~f~~~~p~~~~e~~~~~~~~-~g~iS~et~i~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~ 462 (478) T protein:vir:10 391 L----DV-R--VQDIEITFNFNVMVNELENSQIAMNS-TGLLSKETILGNHSWVQDPVAEMERIEQENIELNQQLPDIEE 462 (478) T ss_pred C----Cc-c--cccceEEeCCCCCCCHHHHHHHHHHH-hCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhccccCC Confidence 1 11 1 12334444555556677776655322 588888888888776322100 00 00010000 Q ss_pred ec--------CCCC Q lcl|NC_019422. 379 VV--------EGGE 384 (384) Q Consensus 379 ~~--------~~ge 384 (384) +. ++++ T Consensus 463 ~~~d~~~~~~~d~~ 476 (478) T protein:vir:10 463 GLNDEQQRQSEDNQ 476 (478) T ss_pred CCcccccccCcCCC Confidence 11 1111 No 237 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=87.94 E-value=0.035 Score=28.53 Aligned_cols=353 Identities=9% Similarity=-0.031 Sum_probs=148.1 Q ss_pred CcchhhhcccCCCcchhHHHhhccc--------------cCcceech----hhhhh----cHHHHHHHHHHHHhhccCce Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDS--------------GNGFYSWH----GNLYK----SDIVRSIIRPKAKAVGKMTA 58 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~----~~~~~----~~~v~~~i~~ia~~ia~~~~ 58 (384) |++-.+-- .... ....+..+.+. .+....-. ..+++ .++....++.++..+-+-+. T Consensus 1 m~V~~~hp-~y~a-~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~~G~vf~k~p 78 (452) T protein:vir:94 1 MPIETKHP-EYLA-YENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSALSGMVLDQPP 78 (452) T ss_pred CCCCCcCH-HHHH-HHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHHhchhhcCCc Confidence 66433110 0000 00111111111 00000000 12222 34555666655555544444 Q ss_pred EEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEE------- Q lcl|NC_019422. 59 KHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEA------- 131 (384) Q Consensus 59 ~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~------- 131 (384) .+ .....+.++ +. =-...+-.+|.+.++...+.+|.+++.+..+..|.-.-+..++|..|-= T Consensus 79 ~~---------~~p~~l~~~-~~-D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy~~~~~~~~Ii~W~~~~~g 147 (452) T protein:vir:94 79 VI---------THPDAMSKY-FE-DQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPYISVYTTENILNWEEDEDG 147 (452) T ss_pred ee---------cccHHHHHH-Hh-cccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceEEEEechhhhcCccccccC Confidence 33 112223332 22 1357889999999999999999999999887666433334444433311 Q ss_pred ------------EEcC-CCE------EEE-------------EEEcCceEEE----Eehh------heEEEe--ccCCCC Q lcl|NC_019422. 132 ------------IYEN-EVL------FLK-------------FLLRNGKIVS----YPYS------DIIHLR--KDFNEN 167 (384) Q Consensus 132 ------------~~~~-~~~------~~~-------------~~~~~g~~~~----~~~~------evih~~--~~~~~~ 167 (384) ..+. +.+ .|+ |...++.... .+++ ..|-+- +....+ T Consensus 148 ~l~~v~lre~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~IP~v~~~~~~~~ 227 (452) T protein:vir:94 148 RLLMVVLREFYTVRDTADRYVQNIRVRYRCLELVDGLLQITVHETQDGKVWELAKTSTIQNVGVTMDYIPFFCITPSGLS 227 (452) T ss_pred CeeEEEEEEEEEEecCCCcccceeEEEEEEEEEeCCeEEEEEEEccCCceeeeccceeecCCCcccceeEEEEEcCCCCC Confidence 1111 110 000 0001111100 0000 111111 112234 Q ss_pred CccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCC-Cce Q lcl|NC_019422. 168 DLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDS-KYD 246 (384) Q Consensus 168 ~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~-g~~ 246 (384) ...|.||+..++..--.......-....+...+.|-.++.-..... +..-.++.++.+++ |.+ T Consensus 228 ~~~~~pPLl~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~g~~~~~----------------~i~iG~~~~~~lpe~~~~ 291 (452) T protein:vir:94 228 MTPAKPPMIDIVDINYSHYRTSADLEHGRHFTGLPTPWITGAESQS----------------TMHIGSTKAWVIPEVAAK 291 (452) T ss_pred CCCCccchHHHHHHHHHHhcchhHHHHHHHHcccceeEeecCcCCC----------------ceEecccccccCCCCCCc Confidence 4578888887666433333333334444455567755553211110 11112344566664 655 Q ss_pred eeecccchh--HHHHHHHHHHHHH---HHHHhCCCHHHhccccHHHHHH--HHHHHHHHHHHHHHHHHHhhcc--cCcc- Q lcl|NC_019422. 247 AEQVKAESY--VPNAAQMDKAIQR---LYSFFNTNEKIIQSKYSEDEWN--AYYESEIEPVGLQLSNQYTEKL--FTRK- 316 (384) Q Consensus 247 ~~~l~~~~~--~~~~~~~~~~~~~---I~~~fgvp~~~l~~~~~e~~~~--~~~~~~i~P~~~~i~~~l~~~l--~~~~- 316 (384) +.-++.+.. ......++....+ +.+.+ ++..-.+.+..++... +-.+..+.-+...++++++..| ...+ T Consensus 292 ~~yie~~g~~i~~~~~~l~~le~~m~~~Ga~l-l~~~~~~~~s~ea~~~~~~~~~s~L~~~a~~~e~al~~~l~~~a~w~ 370 (452) T protein:vir:94 292 VGFLEFTGQGLQSLEKALSEKQAQLASLSARL-IDNSTRGSEATETVKLRYMSETASLKSVTRAVEALLNKAYSCIMDME 370 (452) T ss_pred ceEEccCchhHHHHHHHHHHHHHHHHHHHHHh-hccCCCcchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHc Confidence 554444433 3333333333322 22211 1111111222232211 2223445555555555554321 1111 Q ss_pred cccCcceEEeechhh-hccCHHHHHHHHHHHhCCCCCHHHHHHHh---CCCCCCC-----CCeee----ecCceeecCCC Q lcl|NC_019422. 317 ARSFGNEIVFEASNL-QYASMSTKLNLVQMVDRGSLTPNEWRKIM---NLSPIEN-----GDKPV----RRLDTAVVEGG 383 (384) Q Consensus 317 ~~~~~~~i~fd~~~~-~~~d~~~~~~~~~~~~~g~~t~NE~R~~l---G~~p~~~-----gd~~~----~~~n~~~~~~g 383 (384) +.+....|+.+.+=. ...+........+++..|.+|-...++.| |....+. -++.- .+.| .|.++| T Consensus 371 g~~~~~~v~~n~dF~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~gvl~~~~e~~~i~~E~~~~~~~~~~-~~~~~~ 449 (452) T protein:vir:94 371 SMGGTLNIKLNSAFLDSKLTAAELKAWVEAYLSGGISKEIYIHALKVGKVLPPPGESMGVIPDPPAPEPSPSN-TPPNPS 449 (452) T ss_pred CCCCceEEEeccccccccCCHHHHHHHHHHHhcCCCcHHHHHHHHHhCCCCCCccCHHHHHHHhhccCcccCC-CCCCCc Confidence 112233344433322 33456666666788999999998888877 5443321 01111 1223 455555 Q ss_pred C Q lcl|NC_019422. 384 E 384 (384) Q Consensus 384 e 384 (384) . T Consensus 450 ~ 450 (452) T protein:vir:94 450 S 450 (452) T ss_pred c Confidence 5 No 238 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=83.93 E-value=0.064 Score=27.11 Aligned_cols=363 Identities=13% Similarity=0.058 Sum_probs=164.5 Q ss_pred CcchhhhcccCCCcchh--------------------HHHhhccccCc----cee--chhhh-hhcHHHHHHHHHHHHhh Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKV--------------------MMELISDSGNG----FYS--WHGNL-YKSDIVRSIIRPKAKAV 53 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~--------------------~~~~~~~~~~~----~~~--~~~~~-~~~~~v~~~i~~ia~~i 53 (384) |+.=.+-- ..+.+. ...+..+..++ +.+ ..+.+ ...+.+.-.. +..+ T Consensus 1 ~~~~~~~~---~~~~~~~~g~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~---~~~~ 74 (527) T protein:vir:10 1 MGQDKRQY---GSTQQLRAGEANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNG---EKLI 74 (527) T ss_pred CCcccccc---CCCcCcCCccccCcccCCHHHHHHHHHHHHHHHHhcCchhheeeecCCccccccceeeehhh---HHhh Confidence 55322110 000000 00000000000 000 00000 0000000000 1111 Q ss_pred cc-CceEEEEec--CCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCC---CCceeeEEEEcCc Q lcl|NC_019422. 54 GK-MTAKHIRSN--ETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDD---YNMPTQIYPLNAL 127 (384) Q Consensus 54 a~-~~~~~~~~~--~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~---~g~~~~l~~l~~~ 127 (384) .+ ..|.+-.-+ .++.-+.....+..+.++ .+.....++...+.++.|.+.+.+.+|. .|.-+.+..+||. T Consensus 75 ~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~----e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP~ 150 (527) T protein:vir:10 75 EAKMRFLGQGLKWEFSKKDAKVDDAIKVLFDR----ENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDPS 150 (527) T ss_pred CCcceeeccCccccccchhHHHHHHHHHHHHH----hhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCcc Confidence 11 111110000 011111122233334433 3444566777788889999999999884 3445677788877 Q ss_pred eEEEEEcCCCEE----EE----EEE---------------------------cCceEE-EE--------------e--h- Q lcl|NC_019422. 128 NVEAIYENEVLF----LK----FLL---------------------------RNGKIV-SY--------------P--Y- 154 (384) Q Consensus 128 ~v~~~~~~~~~~----~~----~~~---------------------------~~g~~~-~~--------------~--~- 154 (384) .+.+..+.++.. ++ |.. ..|... +. + . T Consensus 151 ~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e~p~~~~ 230 (527) T protein:vir:10 151 TYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPESPLEPD 230 (527) T ss_pred eeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccccccccchh Confidence 776665543211 00 000 001000 00 0 0 Q ss_pred --------------------hheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCCh Q lcl|NC_019422. 155 --------------------SDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRP 214 (384) Q Consensus 155 --------------------~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~ 214 (384) =-|+|++.-++.+..+|.|-+.-+...+........-.....+=+|.|-.+++- ..+ T Consensus 231 ~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg---~~~ 307 (527) T protein:vir:10 231 DIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDS---APP 307 (527) T ss_pred hhhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeecc---ccc Confidence 026788766777888999999988888887766665555555557777544422 111 Q ss_pred HHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHH-HHHHHHHHHHHhCCCHHHhcccc-HHHHHHHH Q lcl|NC_019422. 215 DDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQ-MDKAIQRLYSFFNTNEKIIQSKY-SEDEWNAY 292 (384) Q Consensus 215 e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~-~~~~~~~I~~~fgvp~~~l~~~~-~e~~~~~~ 292 (384) -+. ........-..|.++-++++.++..++..+....+.. +.++.+.|+..-++|..-+|.-. +....-.- T Consensus 308 vd~-------~G~~~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~A 380 (527) T protein:vir:10 308 RDS-------RGNMVPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAAVAESGIA 380 (527) T ss_pred ccc-------cCCcCccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHH Confidence 110 0000111112345666888899998887665555543 46777899999999999888221 11111111 Q ss_pred HHHHHHHHHHHH----------HHHHh-----hcc-----cCcccccCcceEEeechhhhccCHHHHHHHH-HHHhCCCC Q lcl|NC_019422. 293 YESEIEPVGLQL----------SNQYT-----EKL-----FTRKARSFGNEIVFEASNLQYASMSTKLNLV-QMVDRGSL 351 (384) Q Consensus 293 ~~~~i~P~~~~i----------~~~l~-----~~l-----~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~-~~~~~g~~ 351 (384) +.-.+.|++... ...+. +.| +.-........+.+.+...+..|.++.++.. +++.+|++ T Consensus 381 LeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~aGi~ 460 (527) T protein:vir:10 381 LDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNSEKRFNQLLQLWEAGLI 460 (527) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHHHHHHHcCch Confidence 112222222211 11100 000 1101111122345666777788888888765 68999999 Q ss_pred CHHHHHHHh----CCCCCCCCC---------------------eeeecCceeecCCCC Q lcl|NC_019422. 352 TPNEWRKIM----NLSPIENGD---------------------KPVRRLDTAVVEGGE 384 (384) Q Consensus 352 t~NE~R~~l----G~~p~~~gd---------------------~~~~~~n~~~~~~ge 384 (384) +.-=+-++| |.+.. ..+ .=.+-....-++.+| T Consensus 461 S~~tAv~~L~~~~g~eD~-E~E~~~I~~era~~a~a~a~A~~~~~a~~~~~~g~~~~~ 517 (527) T protein:vir:10 461 PAKKLTEELSKIMGFELT-EEDFKQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEE 517 (527) T ss_pred hHHHHHHHHHhccCCCCh-HHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCC Confidence 988887777 43221 111 000001111112222 No 239 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=83.53 E-value=0.068 Score=26.99 Aligned_cols=363 Identities=12% Similarity=0.054 Sum_probs=164.2 Q ss_pred CcchhhhcccCCCcchh--------------------HHHhhccccCc----cee--chhhh-hhcHHHHHHHHHHHHhh Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKV--------------------MMELISDSGNG----FYS--WHGNL-YKSDIVRSIIRPKAKAV 53 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~--------------------~~~~~~~~~~~----~~~--~~~~~-~~~~~v~~~i~~ia~~i 53 (384) |+.=.+-- ..+.+. ...+..+..++ +.+ ..+.+ ...+.+.-.. +..+ T Consensus 1 ~~~~~~~~---~~~~~~~~g~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~---~~~~ 74 (527) T protein:vir:10 1 MGQDKRQY---GSTQQLRAGEANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNG---EKLI 74 (527) T ss_pred CCcccccc---CCCcCcCCccccCcccCCHHHHHHHHHHHHHHHHhcCchhheeeecCCccccccceeeehhh---HHhh Confidence 55322110 000000 00000000000 000 00000 0000000000 1111 Q ss_pred cc-CceEEEEec--CCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCC---CCceeeEEEEcCc Q lcl|NC_019422. 54 GK-MTAKHIRSN--ETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDD---YNMPTQIYPLNAL 127 (384) Q Consensus 54 a~-~~~~~~~~~--~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~---~g~~~~l~~l~~~ 127 (384) .+ ..|.+-.-+ .++.-+.....+..+.++ .+.....++...+.++.|.+.+.+.+|. .|.-+.+..+||. T Consensus 75 ~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~----e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP~ 150 (527) T protein:vir:10 75 EAKMRFLGQGLKWEFSKKDAKVDDAIRVLFDR----ENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDPS 150 (527) T ss_pred CCcceeeccCccccccchhHHHHHHHHHHHHH----hhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCcc Confidence 11 111110000 011111122333334443 3444566777788889999999999884 3445677788877 Q ss_pred eEEEEEcCCCEE----EE----EEE---------------------------cCceEE-EE--------------e--h- Q lcl|NC_019422. 128 NVEAIYENEVLF----LK----FLL---------------------------RNGKIV-SY--------------P--Y- 154 (384) Q Consensus 128 ~v~~~~~~~~~~----~~----~~~---------------------------~~g~~~-~~--------------~--~- 154 (384) .+.+..+.++.. ++ |.. ..|... +. + . T Consensus 151 ~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e~p~~~~ 230 (527) T protein:vir:10 151 TYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPESPLEPD 230 (527) T ss_pred eeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccccccccchh Confidence 776665543211 00 000 001000 00 0 0 Q ss_pred --------------------hheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCCh Q lcl|NC_019422. 155 --------------------SDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRP 214 (384) Q Consensus 155 --------------------~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~ 214 (384) =-|+|++.-++.+..+|.|-+.-+...+........-.....+=+|.|-.+++- ..+ T Consensus 231 ~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg---~~~ 307 (527) T protein:vir:10 231 DIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDS---APP 307 (527) T ss_pred hhhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeecc---ccc Confidence 026788766777888999999988888887766665555555557777544422 111 Q ss_pred HHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHHHHH-HHHHHHHHHHHhCCCHHHhcccc-HHHHHHHH Q lcl|NC_019422. 215 DDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPNAAQ-MDKAIQRLYSFFNTNEKIIQSKY-SEDEWNAY 292 (384) Q Consensus 215 e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~~~~-~~~~~~~I~~~fgvp~~~l~~~~-~e~~~~~~ 292 (384) -+. ........-..|.++-++++.++..++..+....+.. +.++.+.|+..-++|..-+|.-. +....-.- T Consensus 308 vd~-------~G~~~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~A 380 (527) T protein:vir:10 308 RDS-------RGNMVPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAAVAESGIA 380 (527) T ss_pred ccc-------cCCcCccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHH Confidence 110 0000111112345666888899998887665555544 46777899999999999888221 11111111 Q ss_pred HHHHHHHHHHHH----------HHHHh-----hcc-----cCcccccCcceEEeechhhhccCHHHHHHHH-HHHhCCCC Q lcl|NC_019422. 293 YESEIEPVGLQL----------SNQYT-----EKL-----FTRKARSFGNEIVFEASNLQYASMSTKLNLV-QMVDRGSL 351 (384) Q Consensus 293 ~~~~i~P~~~~i----------~~~l~-----~~l-----~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~-~~~~~g~~ 351 (384) +.-.+.|++... ...+. +.| +.-........+.+.+...+..|.++.++.. +++.+|++ T Consensus 381 LeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~aGii 460 (527) T protein:vir:10 381 LDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNNEKRFAQLLELWEAGLI 460 (527) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHHHHHHHcCch Confidence 112222222211 11100 000 1101111122345566777778888887765 58999999 Q ss_pred CHHHHHHHh----CCCCCCCCC---------------------eeeecCceeecCCCC Q lcl|NC_019422. 352 TPNEWRKIM----NLSPIENGD---------------------KPVRRLDTAVVEGGE 384 (384) Q Consensus 352 t~NE~R~~l----G~~p~~~gd---------------------~~~~~~n~~~~~~ge 384 (384) +.-=+-++| |.+.. ..+ .=.+-....-++.+| T Consensus 461 S~etAv~~L~~~~g~eD~-E~E~~~I~~era~~a~a~a~a~~~~~a~~~~~~g~~~~~ 517 (527) T protein:vir:10 461 PAKKLTEELSKIMGFELT-EEDFRQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEE 517 (527) T ss_pred hHHHHHHHHHhccCCCch-HHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCC Confidence 988887777 43221 111 000101111112222 No 240 >protein:vir:101418 Length: 569 # NCBI annotation: Prt # Family: family:all:9458 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006480;genbank:gi:46401636;genbank:GeneID:2777482 Probab=81.43 E-value=0.086 Score=26.43 Aligned_cols=370 Identities=10% Similarity=0.059 Sum_probs=147.3 Q ss_pred CcchhhhcccCCCcchhHHHhhccccCcc---------eechhhhhhcHHHHHHHHHHHHhh-c-----cCceEEEEe-- Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVMMELISDSGNGF---------YSWHGNLYKSDIVRSIIRPKAKAV-G-----KMTAKHIRS-- 63 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~v~~~i~~ia~~i-a-----~~~~~~~~~-- 63 (384) |++...+..............++-.++.. ++--......|.+-++.++--... + ..-+.+... T Consensus 53 ~~~~~~~~~~~t~~~D~~~~g~~~~~~~~~~pr~R~qiY~~~eeM~~~p~Ia~AlniHVtaALggde~TGd~vfI~p~~~ 132 (569) T protein:vir:10 53 SGFLGGKPGDSGMAGDGLVDGSRFIFDEVQLPEDRLQRYPLLEEMAVYSTIATALNIHITHALSFDKKTGQTFSIVPVHN 132 (569) T ss_pred hhhhccCccccchhhhhHHHHHHHHhhhccCchhHHHHHHHHHHHhcCchhhhhhhhhhheeecccccccceEEEEeecC Confidence 44444333221111111111110000000 000011223555556555543221 1 112333211 Q ss_pred cC----Ccc----eeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCC-CCceeeE---EEEcCceEEE Q lcl|NC_019422. 64 NE----TEF----KTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDD-YNMPTQI---YPLNALNVEA 131 (384) Q Consensus 64 ~~----~~~----~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~-~g~~~~l---~~l~~~~v~~ 131 (384) .+ +.. ++..+++...+| .-...+..++..+|.+|+.+.-++ .| +..+ +...|+-+++ T Consensus 133 ~~~a~~daakai~~el~~dl~~~iN----------r~~~~lA~~~~aFGdsYaRiY~~~~~G-V~dl~~s~yt~PsfIqp 201 (569) T protein:vir:10 133 GNDSDYDAAQALCGELMNDIGRTIN----------KEVAGWAFIMSVFGVAYVRPYAKEGIG-ITSFECSYYTLPSFIKE 201 (569) T ss_pred CCCCcchHHHHHHHHHHHHHHHHHH----------HHhhHHHHHHHhhhhhheeeeccCCce-eEEEEecccccccccch Confidence 11 111 111222333333 345678889999999999998653 34 3322 3345555555 Q ss_pred EEcCCCEEE---EEEEcC-ceEEEEehhheEEEeccC---------------------------CC-CCccCccHHHHHH Q lcl|NC_019422. 132 IYENEVLFL---KFLLRN-GKIVSYPYSDIIHLRKDF---------------------------NE-NDLFGTSPAKVLE 179 (384) Q Consensus 132 ~~~~~~~~~---~~~~~~-g~~~~~~~~evih~~~~~---------------------------~~-~~~~G~s~~~~~~ 179 (384) ..-.+...- -|..+. ++.....+-+++.+|.+. |. -..+|-|-+..+- T Consensus 202 FE~g~~tvGF~~~~~~~~~~ti~~l~p~qm~rmKmPrm~~i~q~~~v~~g~~~~~L~~d~~~~~Pi~psn~GgSFL~~ae 281 (569) T protein:vir:10 202 FEVSGNLAGFSGDYLKDASGKMVFADPWAIIPMKIPYWRPKSNLMPVHTGHKAYSLLDNPEERTPIETQNYGTSLLEYAY 281 (569) T ss_pred hhhcCceEEeecccCCccccceeeechhhhhhhcccceeeccccchhhhhhhheeecccccccccccchhhhhHHHHHHH Confidence 433332211 111111 122222333333333222 00 1237888888888 Q ss_pred HHHHHHHHHHHHHHHHHHccCCcceEEeeC-CCCChHHH-----------HHHHHHHHHHhccccccCC---cc-eecCC Q lcl|NC_019422. 180 PIMEVVNTTDQGVVKAIKNSNTIKWLLKFK-TALRPDDI-----------KKEVKSFEKNYLQIDSEAG---GA-AATDS 243 (384) Q Consensus 180 ~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~-----------~~~~~~~~~~~~~~~~~~~---~~-~v~~~ 243 (384) +.......+.....+.--+......+|.+. ..+++.++ ++.++.+++...|+..-.. .+ .+.++ T Consensus 282 ~pf~~l~~Al~sL~~qri~dSv~~~~Itlnm~gM~p~qr~~y~r~lt~~LKr~~d~ie~a~~gg~~~~~~~~H~LPv~ge 361 (569) T protein:vir:10 282 EPYMNLRSAIRSLKATRFNASKIDRIIGLAMNSLDPVKAADYSRTITQTLKRAADLMERRARGANNMPTVTNTLLPIMGD 361 (569) T ss_pred hHHHHHHHHHHhccchhhHHHHHhHHhhccccCCCHHHHhHHHHHHHHHHHHHHHHHHHHhccCccccccceeeeeeecC Confidence 777666544433332222223334455553 34555544 3444555555544321111 12 23344 Q ss_pred Cceeeecccch--hHHH-HHHHHHHHHHHHHHhCCCHHHhccc---------------cHHH-HHHHHHHHHHHHHHHHH Q lcl|NC_019422. 244 KYDAEQVKAES--YVPN-AAQMDKAIQRLYSFFNTNEKIIQSK---------------YSED-EWNAYYESEIEPVGLQL 304 (384) Q Consensus 244 g~~~~~l~~~~--~~~~-~~~~~~~~~~I~~~fgvp~~~l~~~---------------~~e~-~~~~~~~~~i~P~~~~i 304 (384) +--..+++... .+.. ...+-+..+.+|.++|+.+.|||.. .++. .+...++..+.-++..+ T Consensus 362 kq~~~tvDt~~~~A~~~gIEdvM~~~R~LagaLGlD~SMlGwAD~LsGGLGeGG~frtSaQaa~RS~~iRqa~~e~in~i 441 (569) T protein:vir:10 362 GKGQMTIDTQTIQADINGIEDILTYMRQLAAALGLDYTLLGWADQMSGGLGEGGFLRTAIQAAMRASWIQQGVEEFIQRA 441 (569) T ss_pred ccccccccccccccCcccHHHHHHHHHHHHhhhccchhHhhHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 43233333322 2222 2234456889999999999998721 2222 23345555555544443 Q ss_pred HHH-Hhh---cccCcccccCcceEEee--chhhhccCH---HHHHHHHHH-H-------hCCCCCHHHH--HH----HhC Q lcl|NC_019422. 305 SNQ-YTE---KLFTRKARSFGNEIVFE--ASNLQYASM---STKLNLVQM-V-------DRGSLTPNEW--RK----IMN 361 (384) Q Consensus 305 ~~~-l~~---~l~~~~~~~~~~~i~fd--~~~~~~~d~---~~~~~~~~~-~-------~~g~~t~NE~--R~----~lG 361 (384) -+- +.. +.|++.++ -+.++|. .+++..... .++++.+.. + ++..+--||. |. .+| T Consensus 442 idiH~~fKYgevf~~~dr--P~~V~F~s~~tAl~~E~~~n~~~raN~a~i~~Q~la~l~e~n~Lg~de~~m~y~l~d~~~ 519 (569) T protein:vir:10 442 IDIHLAFKYGKVYPEGDR--PYKIEFHSVNTALQQEHNDNRDSQANYATIVTQILDAVSNNSVLANSDAFKRYLFSDVLE 519 (569) T ss_pred HHHHhhhhcCcccCCCCc--ceEEEeccchHHHHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccHHHHHHHHHHHhh Confidence 322 111 23444444 3456663 455543332 233332221 1 1223322332 21 133 Q ss_pred CC------------CCCCCCeeeecCceeecCCCC Q lcl|NC_019422. 362 LS------------PIENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 362 ~~------------p~~~gd~~~~~~n~~~~~~ge 384 (384) ++ +.|.-+++++-.-. ..-+.| T Consensus 520 ~De~~~e~l~ae~~akp~DEe~~~~~~~-~~~~~~ 553 (569) T protein:vir:10 520 IDEKISEALVNELKAKSEDDDHLMDSII-KTPPQE 553 (569) T ss_pred cchhHHHHHHhhcCCCcchhHHHHHHHh-cCChHH Confidence 32 11111122211100 000111 No 241 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=76.35 E-value=0.14 Score=25.33 Aligned_cols=337 Identities=12% Similarity=0.041 Sum_probs=138.8 Q ss_pred CcchhhhcccCCC-cchhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecC--------Ccce-e Q lcl|NC_019422. 1 MNIFKSKKKNKEA-PGKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNE--------TEFK-T 70 (384) Q Consensus 1 M~~f~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~--------~~~~-~ 70 (384) +.-.....+.... ..... +.+....+.++ -|-.+.+...-.-||-=....+ +... . T Consensus 38 lP~~~~~~~~~~~~~~~~~----------~dst~~~a~~~----Las~l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~ 103 (522) T protein:vir:94 38 IPSLFPKESDNSSTEYTTP----------WQAVGARCLNN----LAAKLMLALFPQSPWMRLTVSEYEAKTLSQDSEAAA 103 (522) T ss_pred cccccCCCCCccccccccc----------ccccHHHHHHH----HHHHHHhhcCCCCcccccccchhhhhccCcccchhH Confidence 2211111110000 00000 00000000000 0000000111111221111110 0000 0 Q ss_pred -------ccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEEEE-- Q lcl|NC_019422. 71 -------NPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLFLK-- 141 (384) Q Consensus 71 -------~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~-- 141 (384) ...+..+..+.+- +++.-+..++.++..+|++++++..+..|.+..+..++-..+-+..+..|.+.. T Consensus 104 ~v~~~L~~ve~~~~~~~~~s----nf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~~~~~pl~~y~v~~d~~G~vd~i~ 179 (522) T protein:vir:94 104 RVDEGLAMVERVLMAYMETN----SFRVPLFEALKQLIVSGNCLLYIPEPEQGTYSPMRMYRLVSYVVQRDAFGNILQIV 179 (522) T ss_pred HHHHHHHHHHHHHHHHHHhc----CcHHHHHHHHHHHHhhCcEeEeeeccCCCceeeEEEEEcceEEEeeCCCcCeEEEe Confidence 0011112222222 245556667888999999999998877666544433444445555554442210 Q ss_pred --EE------------------------------------------EcCceEEEEe-------hhheEEEeccCCCCCcc Q lcl|NC_019422. 142 --FL------------------------------------------LRNGKIVSYP-------YSDIIHLRKDFNENDLF 170 (384) Q Consensus 142 --~~------------------------------------------~~~g~~~~~~-------~~evih~~~~~~~~~~~ 170 (384) +. ...|+.+... .--.+..|+....+..| T Consensus 180 r~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~Y 259 (522) T protein:vir:94 180 TIDKVAFSALPEDVKSQLNADDYEPDTELEVYTHIYRQDDEYLRYEEVEGIEVTGTDGSYPLTACPYIPVRMVRLDGEDY 259 (522) T ss_pred eeeeccHHhcchHHHHHHhcccCCccceEEEEEEEEeeCCceeEEeeccCceecccCCCCccccCCceeeeeeecCCCcc Confidence 00 0011111000 00133444444456689 Q ss_pred CccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCc--eee Q lcl|NC_019422. 171 GTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKY--DAE 248 (384) Q Consensus 171 G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~--~~~ 248 (384) |.||...++..+..++...+...........|.+++.-++...+... ..+ ..+.++.+..- ... T Consensus 260 Grgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~~----------~~~----~~g~~v~g~~~~v~~~ 325 (522) T protein:vir:94 260 GRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVNPNGITQPRRL----------NKA----ATGEFVAGRVEDINFL 325 (522) T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccccchhe----------ecc----CCceeecCCcccceee Confidence 99999999999999999999999999998888876655544443221 111 11223333333 333 Q ss_pred ecccchhHHHH--HHHHHHHHHHHHHhCCCHHH-hcc---ccHHHH-------------HHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019422. 249 QVKAESYVPNA--AQMDKAIQRLYSFFNTNEKI-IQS---KYSEDE-------------WNAYYESEIEPVGLQLSNQYT 309 (384) Q Consensus 249 ~l~~~~~~~~~--~~~~~~~~~I~~~fgvp~~~-l~~---~~~e~~-------------~~~~~~~~i~P~~~~i~~~l~ 309 (384) ++.. ..+.+. ..++.....|-.+|-+.... .++ |..|-. ...+...-+.|++.+.-..+. T Consensus 326 ~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~ 404 (522) T protein:vir:94 326 QLTK-GQDFTIAKSVADAIEQRLGWAFLLNSAVQRNAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPIVRVLMNQLQ 404 (522) T ss_pred eccc-ccchhHHHHHHHHHHHHHHHHHhhhhhccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHH Confidence 4332 223433 33456677888888765322 222 222211 112455667787766555553 Q ss_pred h-cccCcccccCcceEEeechh-hhc----cCHHHHHHHHHHHhC-C------CCCHH----HHHHHhCCCCCCCCCeee Q lcl|NC_019422. 310 E-KLFTRKARSFGNEIVFEASN-LQY----ASMSTKLNLVQMVDR-G------SLTPN----EWRKIMNLSPIENGDKPV 372 (384) Q Consensus 310 ~-~l~~~~~~~~~~~i~fd~~~-~~~----~d~~~~~~~~~~~~~-g------~~t~N----E~R~~lG~~p~~~gd~~~ 372 (384) + .++++.... .++.++.. |-. .+......+++.+.. + -+..+ ++.+.+|.+|. T Consensus 405 r~g~lP~~p~~---~v~v~~~s~La~~qr~~~~~~l~~~~~~ia~l~P~~~~~~id~d~~~~~~a~~~Gv~~~------- 474 (522) T protein:vir:94 405 SAGMIPDLPKE---AVEPTVSTGLEALGRGQDLEKLTQAVNMMTGLQPLSQDPDINLPTLKLRLLNALGIDTA------- 474 (522) T ss_pred hcCCCCCCCcc---cEEeeEecHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHcCCChh------- Confidence 3 355543222 23333332 211 112222222221111 0 01111 12223444321 Q ss_pred ecCceeecCCCC Q lcl|NC_019422. 373 RRLDTAVVEGGE 384 (384) Q Consensus 373 ~~~n~~~~~~ge 384 (384) ++. -...| T Consensus 475 ---~iv-r~~ee 482 (522) T protein:vir:94 475 ---GLL-LTQDE 482 (522) T ss_pred ---hcc-CCHHH Confidence 010 01111 No 242 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=67.20 E-value=0.26 Score=23.83 Aligned_cols=350 Identities=8% Similarity=-0.001 Sum_probs=155.2 Q ss_pred Ccc------hhhhcccCCCcchhHHHhhcccc-------------CcceechhhhhhcHHHHHHHHHHHHhhccCceEEE Q lcl|NC_019422. 1 MNI------FKSKKKNKEAPGKVMMELISDSG-------------NGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHI 61 (384) Q Consensus 1 M~~------f~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~ 61 (384) |.+ ..+.... ...-.....+..+-. ........+-+.++....+++..+.-+-+-|+++- T Consensus 1 l~~~~i~~~i~~~~~~-~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~p~~~~ 79 (451) T protein:vir:10 1 MELEKIRAIISADAAR-RQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMFTYPVLFD 79 (451) T ss_pred CCHHHHHHHHHHHHHH-HHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhheecccceee Confidence 111 1100000 000000001110000 00000001112245667788888888877777652 Q ss_pred EecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCC-------ceeeEEEEcCceEEEEEc Q lcl|NC_019422. 62 RSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYN-------MPTQIYPLNALNVEAIYE 134 (384) Q Consensus 62 ~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g-------~~~~l~~l~~~~v~~~~~ 134 (384) ..++ +. .......++. | ........+..+.+.+|.||..+.++... ....+..++|..+-++.+ T Consensus 80 -~~~~--~~-~~~~~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i~p~~~~~vyd 150 (451) T protein:vir:10 80 -IDNN--KE-LNEKVTDVLG--N---EFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVVNTEEIIPIYR 150 (451) T ss_pred -cCCc--HH-HHHHHHHHhc--c---CHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEEEcccceEEEEc Confidence 1111 11 0111111211 2 35567777889999999999998887542 133477788888877765 Q ss_pred CCC--E----EEEEEE-c--Cce--------EEEEehhheEEEeccC---------------CC---------CCccCcc Q lcl|NC_019422. 135 NEV--L----FLKFLL-R--NGK--------IVSYPYSDIIHLRKDF---------------NE---------NDLFGTS 173 (384) Q Consensus 135 ~~~--~----~~~~~~-~--~g~--------~~~~~~~evih~~~~~---------------~~---------~~~~G~s 173 (384) ... . +.+|.. . +|. ...+.++.+.+++... +. +.-.|.| T Consensus 151 d~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~~ 230 (451) T protein:vir:10 151 NGIERELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITVQHRFNSVPFVEFSNNIKKQS 230 (451) T ss_pred CCCCCceEEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCccccccccccccCCCCeeeEEEeccCCCCCC Confidence 431 1 111111 1 110 1112333444433111 00 1224778 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEee-CCCCChHHHHHHHHHHHHHhccccccCCcceecC-----CCcee Q lcl|NC_019422. 174 PAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKF-KTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATD-----SKYDA 247 (384) Q Consensus 174 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~-~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~-----~g~~~ 247 (384) -+..+...++....+.....+.+...+.|-.+++- ++....+.. ..+.. .+++.+. .|.++ T Consensus 231 d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~----~~~~~---------~~~i~~~~~~~~~~~~~ 297 (451) T protein:vir:10 231 DLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGEDTSEFL----KELKR---------YKTIKTETDSEGDSGGL 297 (451) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhhH----HHHhh---------CCeEEecCcCCccCCcc Confidence 88887777777776666666666655556444431 111122211 11111 1122221 23334 Q ss_pred eecccchhHHHH-HHHHHHHHHHHHHhCCCHH---HhccccHHH-------------HHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019422. 248 EQVKAESYVPNA-AQMDKAIQRLYSFFNTNEK---IIQSKYSED-------------EWNAYYESEIEPVGLQLSNQYTE 310 (384) Q Consensus 248 ~~l~~~~~~~~~-~~~~~~~~~I~~~fgvp~~---~l~~~~~e~-------------~~~~~~~~~i~P~~~~i~~~l~~ 310 (384) ..+......... ..++.+.+.|+..-++|.. .+|.....+ .....+...+.-+++.+...++ T Consensus 298 ~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~- 376 (451) T protein:vir:10 298 KTMQIEIPTEARKIILEILKKQIYESGQGLQQDTENFGNASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFLG- 376 (451) T ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC- Confidence 444433222222 3455667777777777642 233222221 1122334444444444444332 Q ss_pred cccCcccccCcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--C---------CeeeecCceee Q lcl|NC_019422. 311 KLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWRKIMNLSPIEN--G---------DKPVRRLDTAV 379 (384) Q Consensus 311 ~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~~~--g---------d~~~~~~n~~~ 379 (384) .. ....+++.+..-...|..+.++.+... .|+++..-+.+++++-..+. . ..--...++-+ T Consensus 377 ----~~---d~~~i~i~f~~~~p~n~~e~~~~~~kl-~g~iS~et~~~~~p~v~d~~~e~~~~~ee~~~~~~~~~~~~~~ 448 (451) T protein:vir:10 377 ----VT---DYKKIQQTYTRNMMSNDLEDADIATKS-VGIIPTKIILRHHPWVDDVEEAEKLYLEEKKIQASKVSDDYNN 448 (451) T ss_pred ----CC---CccceeEEecCCCCCCHHHHHHHHHHH-hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 11 122344555665667787777776433 37889888888886643221 0 00011122222 Q ss_pred cCC Q lcl|NC_019422. 380 VEG 382 (384) Q Consensus 380 ~~~ 382 (384) +.+ T Consensus 449 ~~~ 451 (451) T protein:vir:10 449 FTE 451 (451) T ss_pred CCC Confidence 323 No 243 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=60.78 E-value=0.37 Score=22.98 Aligned_cols=335 Identities=12% Similarity=0.036 Sum_probs=124.9 Q ss_pred CcchhhhcccCCCc---chh--------HHHhhccccCcceec-----h----hhhhhcHHHHHHHHHHHHhhccCceEE Q lcl|NC_019422. 1 MNIFKSKKKNKEAP---GKV--------MMELISDSGNGFYSW-----H----GNLYKSDIVRSIIRPKAKAVGKMTAKH 60 (384) Q Consensus 1 M~~f~~~~~~~~~~---~~~--------~~~~~~~~~~~~~~~-----~----~~~~~~~~v~~~i~~ia~~ia~~~~~~ 60 (384) =.+|.......... ..+ ...+.+..++....+ . ......+..++-++.. ++.+ T Consensus 42 P~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~---l~~v---- 114 (515) T protein:vir:70 42 PYLMNNKGDNETSQNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATI---FARV---- 114 (515) T ss_pred ccccCCCCCcccccccccchHHHHHHHHHHHHHHhhcCCCCcccccccChhhhhccccchhHHHHHHHH---HHHH---- Confidence 00111100000000 000 011111111100000 0 0001111111111100 0000 Q ss_pred EEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEE- Q lcl|NC_019422. 61 IRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLF- 139 (384) Q Consensus 61 ~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~- 139 (384) .+..+.-+.+- +++.-+..++.++..+|++++++.. .+ ....||+.. +-+..+..|.+ T Consensus 115 ------------e~~~~~~l~~s----nf~~~~~~~~~~L~~~G~a~l~~d~--~~-~~~~~pl~~--y~v~~d~~G~v~ 173 (515) T protein:vir:70 115 ------------ETTAMKALEQR----QFRPAIVEVFKHLIVAGNCLLYKPS--KG-AMSAVPMHH--YVVNRDTNGDLM 173 (515) T ss_pred ------------HHHHHHHHHhc----CchHHHHHHHHHHHhHCeEEEEEeC--CC-CeEEEEcCe--EEEeeCCCcCee Confidence 00111111121 2344555667778899999888743 22 144555533 33333333311 Q ss_pred --------------------------------------------------EEEEEcCceEEEEehh-------heEEEec Q lcl|NC_019422. 140 --------------------------------------------------LKFLLRNGKIVSYPYS-------DIIHLRK 162 (384) Q Consensus 140 --------------------------------------------------~~~~~~~g~~~~~~~~-------evih~~~ 162 (384) .++...+|... .... -.+..|+ T Consensus 174 ~i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~~~e~d~~~~-~~es~y~~~e~P~~~~Rw 252 (515) T protein:vir:70 174 DVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPV-GKESRIKSEKLPFIPLTW 252 (515) T ss_pred EEEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEEEEecCCCceEEEEecCceee-ccccccccccCCceeeee Confidence 11111111110 0000 1233344 Q ss_pred cCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecC Q lcl|NC_019422. 163 DFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATD 242 (384) Q Consensus 163 ~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~ 242 (384) ....+..||.||...++..+..++...+...........|.+++.-++...+. .+..+ ..+.++.+ T Consensus 253 ~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~-----------~l~~~---~~g~iv~g 318 (515) T protein:vir:70 253 KRSYGEDWGRPLAEDYSGDLFVIQFLSEAMARGAALMADIKYLIRPGSQTDVD-----------HFVNS---GTGEVITG 318 (515) T ss_pred eecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCeeeCcccccchh-----------hcccc---CCceeecC Confidence 44456689999999999999999999999999888888887766444443321 12111 11223333 Q ss_pred CCceeeecccc-hhHHHH--HHHHHHHHHHHHHhCCCHHHh-cc---ccHHH-HHHH------------HHHHHHHHHHH Q lcl|NC_019422. 243 SKYDAEQVKAE-SYVPNA--AQMDKAIQRLYSFFNTNEKII-QS---KYSED-EWNA------------YYESEIEPVGL 302 (384) Q Consensus 243 ~g~~~~~l~~~-~~~~~~--~~~~~~~~~I~~~fgvp~~~l-~~---~~~e~-~~~~------------~~~~~i~P~~~ 302 (384) ..-++.++... ..+.+. ..+......|-.+|-+..... .+ |..|- .+.. +...-+.|++. T Consensus 319 ~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~srL~~Ell~Pli~ 398 (515) T protein:vir:70 319 VAEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAM 398 (515) T ss_pred CcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhhhccCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHH Confidence 33445555432 344443 234567778888887754332 21 22331 1112 33444555544 Q ss_pred HHHHHHhhcccCccccc-CcceEEeechhhhcc-CHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC-CC-CCeeeecCcee Q lcl|NC_019422. 303 QLSNQYTEKLFTRKARS-FGNEIVFEASNLQYA-SMSTKLNLVQMVDRGSLTPNEWRKIMNLSPI-EN-GDKPVRRLDTA 378 (384) Q Consensus 303 ~i~~~l~~~l~~~~~~~-~~~~i~fd~~~~~~~-d~~~~~~~~~~~~~g~~t~NE~R~~lG~~p~-~~-gd~~~~~~n~~ 378 (384) ... +. ++++.... ....+.--+..+.+. +......+.+.+..-..-+.++...++.+.. ++ ++..-+|.++. T Consensus 399 r~~---~~-~~p~~P~~~v~~~~vs~l~~L~r~q~~~~i~~~~q~i~~~~~~~p~~~~~id~d~~~~~~a~~~g~p~~~~ 474 (515) T protein:vir:70 399 WGL---QE-AGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFL 474 (515) T ss_pred HHH---Hh-hCCCCChhhcccceehhHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhhCCHHHHHHHHHHHhCCCcccc Confidence 332 21 23332111 111111011222211 1222222222221100011223333322221 10 11222222221 Q ss_pred ecCCCC Q lcl|NC_019422. 379 VVEGGE 384 (384) Q Consensus 379 ~~~~ge 384 (384) ...| T Consensus 475 --rs~e 478 (515) T protein:vir:70 475 --KSEE 478 (515) T ss_pred --CCHH Confidence 1222 No 244 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=54.39 E-value=0.5 Score=22.21 Aligned_cols=335 Identities=12% Similarity=0.067 Sum_probs=134.6 Q ss_pred Ccchhhhcc-----------cCCCcchhHHHhhccccCcce--echhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCc Q lcl|NC_019422. 1 MNIFKSKKK-----------NKEAPGKVMMELISDSGNGFY--SWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETE 67 (384) Q Consensus 1 M~~f~~~~~-----------~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 67 (384) -.+|+..-. ..-.|..+|+.+ ...... ...........+..-.+.+.+ T Consensus 56 ~~~~dst~~~a~~~Laa~l~~~ltP~~~WF~l---~~~d~~~~~~~~~~~~~~~v~~~L~~ve~---------------- 116 (535) T protein:vir:15 56 TTPWQAVGARGLNNLASKLMLALFPMQSWMKL---TISEYEAKQLVGDPDGLAKVDEGLSMVER---------------- 116 (535) T ss_pred cccccccHHHHHHHHHHHHHHhhcCCCccccc---ccChHHHhccCCCcchHHHHHHHHHHHHH---------------- Confidence 222221000 000000111111 000000 000000001111111111111 Q ss_pred ceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCC-ceeeEEEEcCceEEEEEcCCCEE------- Q lcl|NC_019422. 68 FKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYN-MPTQIYPLNALNVEAIYENEVLF------- 139 (384) Q Consensus 68 ~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g-~~~~l~~l~~~~v~~~~~~~~~~------- 139 (384) ..+..+.+- +++.-+..++.+++.+|++++++..+..+ .....|++.. +-+..+..|.+ T Consensus 117 -------~~~~~l~~s----nf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl~~--~~v~~d~~G~vd~i~r~~ 183 (535) T protein:vir:15 117 -------IIMNYIESN----SYRVTLFECLKQLIVAGNALLYLPEPEGSYNPMKLYRLSS--YVVQRDAYGNVLQIVTRD 183 (535) T ss_pred -------HHHHHHHhc----CcHHHHHHHHHHHHhhCceeEEeecCCCCceeeEEEEcCe--eEEeeCCCCCeeEEEEeE Confidence 111122222 24445666678899999999998765432 2344555543 33333333311 Q ss_pred ------------------------------E-------------EEEEcCceEEE-------EehhheEEEeccCCCCCc Q lcl|NC_019422. 140 ------------------------------L-------------KFLLRNGKIVS-------YPYSDIIHLRKDFNENDL 169 (384) Q Consensus 140 ------------------------------~-------------~~~~~~g~~~~-------~~~~evih~~~~~~~~~~ 169 (384) | .|....|..+. +..--.+..|+....+.. T Consensus 184 ~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~g~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~ 263 (535) T protein:vir:15 184 QIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHVYLDEESGDYLKYEEVEDVEIDGSDATYPTDAMPYIPVRMVRIDGES 263 (535) T ss_pred eecHHHHHHHHhHhhhccccccCCCCceeEEEEEEEecCCCcEEEEEEeeCccccccccccccccCCceeeeeeecCCCc Confidence 1 01100111100 000013444444455668 Q ss_pred cCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecC--CCcee Q lcl|NC_019422. 170 FGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATD--SKYDA 247 (384) Q Consensus 170 ~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~--~g~~~ 247 (384) ||.||...++..+..++...+...........|.+++.-++...+.. + ..+. .+.++.+ +++.. T Consensus 264 YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~-------l---~~~~----~g~~v~g~~~~v~~ 329 (535) T protein:vir:15 264 YGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRR-------L---TKAQ----TGDFVPGRREDIDF 329 (535) T ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccccchh-------c---ccCC----ceeeecCCccccee Confidence 99999999999999999999999999999888887765444443321 1 1111 1223323 33334 Q ss_pred eecccchhHHHH--HHHHHHHHHHHHHhCCCHHH-hcc---ccHHHH-------------HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 248 EQVKAESYVPNA--AQMDKAIQRLYSFFNTNEKI-IQS---KYSEDE-------------WNAYYESEIEPVGLQLSNQY 308 (384) Q Consensus 248 ~~l~~~~~~~~~--~~~~~~~~~I~~~fgvp~~~-l~~---~~~e~~-------------~~~~~~~~i~P~~~~i~~~l 308 (384) .++.... +.+. ..++.....|-.+|-+.... .++ |..|-. ...+...-+.|++.+....+ T Consensus 330 ~~~~~~~-~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il 408 (535) T protein:vir:15 330 LQLEKQA-DFTVAKAVSDQIEARLSYAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQL 408 (535) T ss_pred eeccccc-chhHHHHHHHHHHHHHHHHHhhhhcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHH Confidence 4444332 3443 23456777888888554221 222 222211 11255667788887766666 Q ss_pred hhc-ccCcccccCcceEEeechhhhc----cCHHHHHHHHHHHhC-C------CCCHHH----HHHHhCCCCCC---CCC Q lcl|NC_019422. 309 TEK-LFTRKARSFGNEIVFEASNLQY----ASMSTKLNLVQMVDR-G------SLTPNE----WRKIMNLSPIE---NGD 369 (384) Q Consensus 309 ~~~-l~~~~~~~~~~~i~fd~~~~~~----~d~~~~~~~~~~~~~-g------~~t~NE----~R~~lG~~p~~---~gd 369 (384) .+. ++++... ....+++ .+.|-. .+......++..+.. + .+..++ +.+.+|.|+.. .-+ T Consensus 409 ~r~g~lP~~p~-~~v~~~y-is~La~aqr~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~~~~e 486 (535) T protein:vir:15 409 QATSQIPELPK-EAVEPTI-STGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDE 486 (535) T ss_pred HhcCCCCCCCc-cceeEEE-ecHHHHHHHHHHHHHHHHHHHHHHhcChhhhhccCCHHHHHHHHHHHcCCChhhhcCCHH Confidence 544 5554322 2233343 233211 112222222222211 0 111222 22234444310 000 Q ss_pred ee---ee-----------------cCceeecCCCC Q lcl|NC_019422. 370 KP---VR-----------------RLDTAVVEGGE 384 (384) Q Consensus 370 ~~---~~-----------------~~n~~~~~~ge 384 (384) +. .. .....+..++| T Consensus 487 ev~~~~~q~~~~~~~~~~a~~~g~~~~~~~~~~p~ 521 (535) T protein:vir:15 487 QKQALMMQDAAQTGIENAAATGGAGVGALATSSPE 521 (535) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccchhccChH Confidence 00 00 00000111111 No 245 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=53.56 E-value=0.53 Score=22.12 Aligned_cols=335 Identities=13% Similarity=0.081 Sum_probs=135.2 Q ss_pred Ccchhhhccc-----------CCCcchhHHHhhccccCcce--echhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCc Q lcl|NC_019422. 1 MNIFKSKKKN-----------KEAPGKVMMELISDSGNGFY--SWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETE 67 (384) Q Consensus 1 M~~f~~~~~~-----------~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 67 (384) ..+|+..-.. .-.|..+|+.+ ...... ...+.......+..-.+.+. T Consensus 56 ~~~~dst~~~a~~~Laa~l~~~ltP~~~WF~l---~~~d~~~~~~~~~~~~~~~v~~~l~~ve----------------- 115 (535) T protein:vir:33 56 TTPWQAVGARGLNNLASKLMLALFPMQSWMKL---TISEYEAKQLVGDPDGLAKVDEGLSMVE----------------- 115 (535) T ss_pred cccccccHHHHHHHHHHHHHHhhcCCCccccc---ccChHHHhccccCcchHHHHHHHHHHHH----------------- Confidence 3333321000 00000111111 000000 00000000001111111111 Q ss_pred ceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC-CceeeEEEEcCceEEEEEcCCCEE------- Q lcl|NC_019422. 68 FKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDY-NMPTQIYPLNALNVEAIYENEVLF------- 139 (384) Q Consensus 68 ~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~-g~~~~l~~l~~~~v~~~~~~~~~~------- 139 (384) +..+..+.+-| ++.-+..++.+++.+|++++++..+.. +.....|++. ++-+..+..|.+ T Consensus 116 ------~~~~~~~~~sn----f~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl~--~~~v~~d~~G~vd~i~r~~ 183 (535) T protein:vir:33 116 ------RIIMNYIESNS----YRVTLFECLKQLIVAGNALLYLPEPEGSYNPMKLYRLS--SYVVQRDAYGNVLQIVTRD 183 (535) T ss_pred ------HHHHHHHHhcC----cHHHHHHHHHHHHhhCceeEEeecCCCCceeeEEEEcC--eeEEeeCCCCCeeEEEeeE Confidence 11111222222 444556667889999999999876533 2234445554 344444443321 Q ss_pred ------------------------------EE--EEEc-CceEEEE-----------------ehhheEEEeccCCCCCc Q lcl|NC_019422. 140 ------------------------------LK--FLLR-NGKIVSY-----------------PYSDIIHLRKDFNENDL 169 (384) Q Consensus 140 ------------------------------~~--~~~~-~g~~~~~-----------------~~~evih~~~~~~~~~~ 169 (384) |. +... ++....+ ..--.+..|+....+.. T Consensus 184 ~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~ 263 (535) T protein:vir:33 184 QIAFGALPEDVRSAVEKSGGEKKMDEMVDVYTHVYLDEESGDYLKYEEVEDVEIDGSDATYPTDAMPYIPVRMVRIDGES 263 (535) T ss_pred eecHHHHHHHhhhhhcccccccccccCCeEEEEEEeeCCCCcEEEEEEEeCccccccccccccccCCceeeeeeecCCCc Confidence 00 0000 1111000 00013444544455668 Q ss_pred cCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecC--CCcee Q lcl|NC_019422. 170 FGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATD--SKYDA 247 (384) Q Consensus 170 ~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~--~g~~~ 247 (384) ||.||...++..+..++...+...........|.+++.-++...+.. + ..+. .+.++.+ +++.. T Consensus 264 YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~-------~---~~~~----~g~~v~g~~~~v~~ 329 (535) T protein:vir:33 264 YGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRR-------L---TKAQ----TGDFVPGRREDIDF 329 (535) T ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhh-------c---ccCC----ceeeecCCccccee Confidence 99999999999999999999999999999888887765444443321 1 1111 1223323 33334 Q ss_pred eecccchhHHHH--HHHHHHHHHHHHHhCCCHHH-hcc---ccHHHH-------------HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 248 EQVKAESYVPNA--AQMDKAIQRLYSFFNTNEKI-IQS---KYSEDE-------------WNAYYESEIEPVGLQLSNQY 308 (384) Q Consensus 248 ~~l~~~~~~~~~--~~~~~~~~~I~~~fgvp~~~-l~~---~~~e~~-------------~~~~~~~~i~P~~~~i~~~l 308 (384) .++.... +.+. ..++.....|-.+|-+.... .++ |..|-. ...+...-+.|++.+....+ T Consensus 330 ~~~~~~~-~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il 408 (535) T protein:vir:33 330 LQLEKQA-DFTVAKAVSDQIEARLSYAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQL 408 (535) T ss_pred eeccccc-chhHHHHHHHHHHHHHHHHHhhhhcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHH Confidence 4444332 3433 23456777888888554221 222 222211 11255667788887766666 Q ss_pred hhc-ccCcccccCcceEEeechhhhc----cCHHHHHHHHHHHhC------C-CCCHHH----HHHHhCCCCCC---CCC Q lcl|NC_019422. 309 TEK-LFTRKARSFGNEIVFEASNLQY----ASMSTKLNLVQMVDR------G-SLTPNE----WRKIMNLSPIE---NGD 369 (384) Q Consensus 309 ~~~-l~~~~~~~~~~~i~fd~~~~~~----~d~~~~~~~~~~~~~------g-~~t~NE----~R~~lG~~p~~---~gd 369 (384) .+. ++++... ....+++ .+.|-. .+......++..+.. . .+..++ +.+.+|.|+.. .-+ T Consensus 409 ~r~g~lP~~p~-~~v~~~y-is~La~aqr~~~~~~l~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~Gvp~~~i~~~~e 486 (535) T protein:vir:33 409 QATSQIPELPK-EAVEPTI-STGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDE 486 (535) T ss_pred HhcCCCCCCCc-cceeEEE-ecHHHHHHHHHHHHHHHHHHHHHHhhChhhhhccCCHHHHHHHHHHHcCCCHhHhcCCHH Confidence 544 5554322 2233343 233211 112222222222211 0 111122 12234444210 000 Q ss_pred ee---ee-------cCce----------eecCCCC Q lcl|NC_019422. 370 KP---VR-------RLDT----------AVVEGGE 384 (384) Q Consensus 370 ~~---~~-------~~n~----------~~~~~ge 384 (384) +. .. ..+. .+.++.| T Consensus 487 e~~~~~~q~~~~~~~~~~~~~~g~~~~~~~~~~~~ 521 (535) T protein:vir:33 487 QKQALMMQDAAQTGVENAAAAGGAGVGALATSSPE 521 (535) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhcchhhcCCh Confidence 00 00 0000 0000000 No 246 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=53.22 E-value=0.53 Score=22.08 Aligned_cols=320 Identities=10% Similarity=0.040 Sum_probs=141.8 Q ss_pred Ccchhhh------------cccCCCcchhHHHhhccccCcceechhhhhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_019422. 1 MNIFKSK------------KKNKEAPGKVMMELISDSGNGFYSWHGNLYKSDIVRSIIRPKAKAVGKMTAKHIRSNETEF 68 (384) Q Consensus 1 M~~f~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~ 68 (384) =.+|... -+.--.+..+++.+- .......+...+..-++.+.+.+ T Consensus 54 ~~i~dst~~~a~~~Las~L~~~ltPp~~~WF~l~--------~~d~~~~~~~~v~~~L~~ve~~i--------------- 110 (547) T protein:vir:10 54 REVFDSTAGDGLETLSSSLHGSLTSPATKWFELA--------FRDKELNSDDECRKWLENATHDV--------------- 110 (547) T ss_pred cccccchHHHHHHHHHHHHHHhhcCCCCcccccc--------cCCccccchHHHHHHHHHHHHHH--------------- Confidence 0011100 000000111111110 00000111112222222221111 Q ss_pred eeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCC-CCceeeEEEEcCceEEEEEcCCCEEEEEE---- Q lcl|NC_019422. 69 KTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDD-YNMPTQIYPLNALNVEAIYENEVLFLKFL---- 143 (384) Q Consensus 69 ~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~-~g~~~~l~~l~~~~v~~~~~~~~~~~~~~---- 143 (384) +..+.+-| .+.-+..++.+++.+|++.+++..+. ......+..++..++-+..+..|.+..+. T Consensus 111 --------~~~l~~sn----f~~~~~~~~~~L~~~G~a~l~~~~d~~~~~~~r~~~~pl~~~~v~~d~~G~v~~i~r~~~ 178 (547) T protein:vir:10 111 --------YSALQDSN----FNLEANETYIDLCGYGNAIMVEEEDEDEEGSVVFQSSPIQDSYFEEDSRGQVVNFYRVFR 178 (547) T ss_pred --------HHHHHhcC----cHHHHHHHHHHHHhHCcEeEEeccCCCCCCceeEEEeecceEEEeeCCCcCeeeeeeeee Confidence 12222222 33446667789999999999987653 22233344445455555555444321100 Q ss_pred ----------------------------------------Ec--Cc-----------------eEEEEehh--------- Q lcl|NC_019422. 144 ----------------------------------------LR--NG-----------------KIVSYPYS--------- 155 (384) Q Consensus 144 ----------------------------------------~~--~g-----------------~~~~~~~~--------- 155 (384) .. ++ .++.+..+ T Consensus 179 ~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~~~~~~~~~~~~~~~~~~~~p~~s~~~e~~~~~~~l~es 258 (547) T protein:vir:10 179 WTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDKKQNRNAGTVLAPTERPFGKKWILKEGAVQLGEEG 258 (547) T ss_pred ccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEeeccCCCCCccccceeeccccceeEEEEEecCceeeeecC Confidence 00 00 00000101 Q ss_pred -----heEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhcc Q lcl|NC_019422. 156 -----DIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQ 230 (384) Q Consensus 156 -----evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~ 230 (384) -.+..|+....+..||.||...++..+..++...+...........|.+.+.-++...+ +. T Consensus 259 g~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~---------~~----- 324 (547) T protein:vir:10 259 GYYEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVIDPAIMVTERGLISD---------ID----- 324 (547) T ss_pred CcccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeccccccccc---------ce----- Confidence 13334444445668999999999999999999999988888888888766543333321 00 Q ss_pred ccccCCcceecCCCceeeecccchhHHHH--HHHHHHHHHHHHHhCCCHHHh-ccc---cHHHH-------------HHH Q lcl|NC_019422. 231 IDSEAGGAAATDSKYDAEQVKAESYVPNA--AQMDKAIQRLYSFFNTNEKII-QSK---YSEDE-------------WNA 291 (384) Q Consensus 231 ~~~~~~~~~v~~~g~~~~~l~~~~~~~~~--~~~~~~~~~I~~~fgvp~~~l-~~~---~~e~~-------------~~~ 291 (384) ...|++.+.+..-.++++.... +.+. ..+......|-.+|-+....+ ++. ..|-. ... T Consensus 325 --~~pgg~~~~~~~~~v~pl~~~~-~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~TAtEV~~r~~E~~~~LG~v~~r 401 (547) T protein:vir:10 325 --LGASGLTVVRDMESMKPFESRA-RFDVSSIQLTDLRSAVRRIYYVDQLQMKDSPAMTATEVQVRYELMQRLLGPTLGR 401 (547) T ss_pred --ecCCeeeecCCcccceeeeccc-chHHHHHHHHHHHHHHHHHhhhhhhhcCCCccccHHHHHHHHHHHHHHhhHHHHH Confidence 1135566667777788886554 3333 234566778888888766433 332 22211 112 Q ss_pred HHHHHHHHHHHHHHHHHhhc-ccCccccc----CcceEEeec-hhhhcc----CHHHHHHHHHHHhC--C-------CCC Q lcl|NC_019422. 292 YYESEIEPVGLQLSNQYTEK-LFTRKARS----FGNEIVFEA-SNLQYA----SMSTKLNLVQMVDR--G-------SLT 352 (384) Q Consensus 292 ~~~~~i~P~~~~i~~~l~~~-l~~~~~~~----~~~~i~fd~-~~~~~~----d~~~~~~~~~~~~~--g-------~~t 352 (384) +....+.|++.+.-..+.+. ++++.... .+..++... +.|-+. +......+++.+.. + .+. T Consensus 402 l~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~~~~~~v~~is~Laraq~~~~~~~i~~~~~~v~~laq~~P~vld~id 481 (547) T protein:vir:10 402 LENDFLSPMIQRTFNIRFRAGKLGELPSKLLESGKAAMDIVYTGPLSRAQKIDQAASIERWAGSTAQLAEINPEVLDIPD 481 (547) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCCCCchhhhccCcceEEEEeccHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhhcCC Confidence 45567788877665555543 44442111 122344433 333222 11111122222211 1 122 Q ss_pred HHHH----HHHhCCCCCCCCCeeeecCceeecCCCC Q lcl|NC_019422. 353 PNEW----RKIMNLSPIENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 353 ~NE~----R~~lG~~p~~~gd~~~~~~n~~~~~~ge 384 (384) .+++ .+.+|.|+ +++ ..+.| T Consensus 482 ~d~~~~~~a~~~Gvp~-----------~~i-rs~ee 505 (547) T protein:vir:10 482 WDEMVRMLGSLLGAPQ-----------TLM-RPKAK 505 (547) T ss_pred HHHHHHHHHHHhCCCh-----------hcc-CCHHH Confidence 2222 22234432 111 01111 No 247 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=50.28 E-value=0.61 Score=21.74 Aligned_cols=358 Identities=10% Similarity=0.088 Sum_probs=145.0 Q ss_pred Ccchh---hhcccCCCcchhHHHhhccccC----------cceechhhhhhcHHHHHHHHHHHHhhc------cCceEEE Q lcl|NC_019422. 1 MNIFK---SKKKNKEAPGKVMMELISDSGN----------GFYSWHGNLYKSDIVRSIIRPKAKAVG------KMTAKHI 61 (384) Q Consensus 1 M~~f~---~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~v~~~i~~ia~~ia------~~~~~~~ 61 (384) |.+=. ..++.+..-...+-+....+.+ .........+.+.. -.|++.+|..+. +-||-=. T Consensus 1 m~~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~~~~~~~~~~~~~~~~~dstg-~~a~~~LAa~l~~~ltpp~~~WF~l 79 (522) T protein:vir:10 1 MKARERYNQLTTARQMFLDKAVECSELTLPYLIDDDISSRPNHKSLTVPWQSVG-AKCCVTLAAKLMLAVLPPQTSFFKL 79 (522) T ss_pred CchHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCCCcccccccccccchH-HHHHHHHHHHHHHhhcCCCCccccc Confidence 77633 3333333322222111111111 00011123344433 455666665552 2233211 Q ss_pred EecCCc-cee-------ccchH-------HHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcC Q lcl|NC_019422. 62 RSNETE-FKT-------NPEIY-------IKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNA 126 (384) Q Consensus 62 ~~~~~~-~~~-------~~~~~-------~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~ 126 (384) .-.+.. .+. ..+.+ .+..+.+ -+++.-+..++.++..+|++++++..+. ...|++.. T Consensus 80 ~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~~----snf~~~~~~~~~~L~~~G~a~ly~~~~~----~~~~pl~~ 151 (522) T protein:vir:10 80 QVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIAA----SNDRVAVHQALKHLIVGGNALIFMGKDG----LKTFPLTR 151 (522) T ss_pred cCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHHh----cCcHHHHHHHHHHHHhHCceeEEEcCCC----ceEEEcce Confidence 111110 000 00111 1122222 3355566777888999999998876542 23344432 Q ss_pred ceEEEEEcCCCEEE---------------------------------------EEE--EcC-ceEEEE--ehhh------ Q lcl|NC_019422. 127 LNVEAIYENEVLFL---------------------------------------KFL--LRN-GKIVSY--PYSD------ 156 (384) Q Consensus 127 ~~v~~~~~~~~~~~---------------------------------------~~~--~~~-g~~~~~--~~~e------ 156 (384) +-+..+..|.+. ... ..+ +....+ ..+. T Consensus 152 --y~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~~~~~~ 229 (522) T protein:vir:10 152 --YVINRDGDGNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTYVKLDKSSGRWVWHQEAFDKIIPDSR 229 (522) T ss_pred --EEEeeCCCCCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEEEEeeccCCceEEEEccCCccccccc Confidence 333333333211 000 000 000000 0011 Q ss_pred ---------eEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHH Q lcl|NC_019422. 157 ---------IIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKN 227 (384) Q Consensus 157 ---------vih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~ 227 (384) .+..|+....+..||.||...++..+..++...+...........|.+++.-++...+.. . T Consensus 230 s~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~~~~~~----------l 299 (522) T protein:vir:10 230 STAPKNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSSTTKPAT----------I 299 (522) T ss_pred cccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeecccccccccc----------c Confidence 222333334456899999999999999999999999999888888876665444443321 1 Q ss_pred hccccccCCcceecCCCceeeeccc-chhHHHHH--HHHHHHHHHHHHhCCCHHHhcc---ccHHHH------------- Q lcl|NC_019422. 228 YLQIDSEAGGAAATDSKYDAEQVKA-ESYVPNAA--QMDKAIQRLYSFFNTNEKIIQS---KYSEDE------------- 288 (384) Q Consensus 228 ~~~~~~~~~~~~v~~~g~~~~~l~~-~~~~~~~~--~~~~~~~~I~~~fgvp~~~l~~---~~~e~~------------- 288 (384) ..+ ..+.++.+..-++.++.. +..+.+.. .++.....|..+|-+-. .-++ |..|-. T Consensus 300 ~~~----~~~~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~ri~~aFl~~~-~~d~~rvTAtEV~~r~~E~~~~LGpv 374 (522) T protein:vir:10 300 AKA----GNGAIVQGRPEDVAVIQVGKTADFSTAANMATAIEKRLLEAFLVMN-VRNAERVTAEEVRLTQLELEQQLGGI 374 (522) T ss_pred cCC----CCcceecCCCccceeecccccccchHHHHHHHHHHHHHHHHHhhcc-CCCCCCCCHHHHHHHHHHHHHHhhHH Confidence 111 122333333333444432 23444432 23556677877873211 1111 222211 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhc-ccCcccccCcceEEe-echhhhcc-CHHHHHHHHHHHhC--C------CCCHHH-- Q lcl|NC_019422. 289 WNAYYESEIEPVGLQLSNQYTEK-LFTRKARSFGNEIVF-EASNLQYA-SMSTKLNLVQMVDR--G------SLTPNE-- 355 (384) Q Consensus 289 ~~~~~~~~i~P~~~~i~~~l~~~-l~~~~~~~~~~~i~f-d~~~~~~~-d~~~~~~~~~~~~~--g------~~t~NE-- 355 (384) ...+...-+.|++.+.-..+.+. +|++.......-... -++.|-+. +......+++.+.. | .+..++ T Consensus 375 ~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~~v~~is~Laraq~~~~l~~~~~~i~~~~~p~~~~~~id~d~~~ 454 (522) T protein:vir:10 375 FSLLVIEFLIPYLNRTLLVLQRSNQIPKLPKDIVRPTIVAGVNALGRGQDRESLTAFVGTIAQTLGPEALMQYLNPLEAI 454 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccccccccccchhHHHHHHHHHHHHHHHHHHHHhhCchhhhhcCCHHHHH Confidence 11245667788877777666655 555432211111111 12333211 12222233332211 1 111111 Q ss_pred --HHHHhCCCCC---CCCCeeee----------cCc---------eeecCC-CC Q lcl|NC_019422. 356 --WRKIMNLSPI---ENGDKPVR----------RLD---------TAVVEG-GE 384 (384) Q Consensus 356 --~R~~lG~~p~---~~gd~~~~----------~~n---------~~~~~~-ge 384 (384) +.+.+|.|+. -.-++.-. ... -.++.+ .+ T Consensus 455 ~~~a~~~Gvp~~~ivrt~eev~~~~q~~q~~~~~~~~~~~a~~~~~~~~~~~~~ 508 (522) T protein:vir:10 455 KRLAAAQGIDVLNLVKTEQQLAEEQQAAQQQAAQQSLVDQAGQMTGSPLMDPTK 508 (522) T ss_pred HHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCccc Confidence 1223343311 00000000 000 000000 00 No 248 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=46.62 E-value=0.73 Score=21.34 Aligned_cols=328 Identities=15% Similarity=0.177 Sum_probs=132.5 Q ss_pred CcchhhhcccCC------Ccchh----HHHhhccccCc---ce-e-chhh----hhhcHHHHHHHHHHHHhhccCceEEE Q lcl|NC_019422. 1 MNIFKSKKKNKE------APGKV----MMELISDSGNG---FY-S-WHGN----LYKSDIVRSIIRPKAKAVGKMTAKHI 61 (384) Q Consensus 1 M~~f~~~~~~~~------~~~~~----~~~~~~~~~~~---~~-~-~~~~----~~~~~~v~~~i~~ia~~ia~~~~~~~ 61 (384) |..-+...+.+. +.... ...+.+..++. +. . ..+. +-..+.+++-++ ..++.+ T Consensus 35 ~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~---~~l~~v----- 106 (555) T protein:vir:17 35 LTDEGHVQGGYLPTPWQSVGSKGVNVLASKLMLSLFPVNTSFFKLQINDAEIDNLGMDEQARSEID---LSLSRI----- 106 (555) T ss_pred cCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCCCCcccccccCHHHHhhccCCHHHHHHHH---HHHHHH----- Confidence 222111111100 00000 01111111110 00 0 0000 001112221111 111100 Q ss_pred EecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCceEEEEEcCCCEE-- Q lcl|NC_019422. 62 RSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNALNVEAIYENEVLF-- 139 (384) Q Consensus 62 ~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~-- 139 (384) .+..+.-+.+- +++.-+..++.++..+|++++++..+ +...|+|.. +-+..+..|.+ T Consensus 107 -----------e~~~~~~l~~s----nf~~~~~~~~~~L~~~G~a~ly~~~~----~~~~~pl~~--y~v~~d~~G~vd~ 165 (555) T protein:vir:17 107 -----------ERIVTQDIAES----SDRVHLEMAMKHLIVTGNALLYQGKK----NLKLYPLDR--FVVSRDGEGNVME 165 (555) T ss_pred -----------HHHHHHHHHhc----CcHHHHHHHHHHHHhHCeEEEEecCC----ceeEEEcCe--EEEeeCCCcCeeE Confidence 01111111222 24445556677888899998876543 133455533 33333333211 Q ss_pred -------------------------------------------------------------------EEEEEcCceEEE- Q lcl|NC_019422. 140 -------------------------------------------------------------------LKFLLRNGKIVS- 151 (384) Q Consensus 140 -------------------------------------------------------------------~~~~~~~g~~~~- 151 (384) +++...+|+.+. T Consensus 166 v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~v~t~~~~~~~~~~~~~e~~~~~v~~ 245 (555) T protein:vir:17 166 IVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRDKGKSNDALVYTYVCRKDGQVKWHQECDGKVIPG 245 (555) T ss_pred EEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhcccccCCCcceeEeecccccCCeeEEEEecCceeccc Confidence 111111111110 Q ss_pred ------EehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHH Q lcl|NC_019422. 152 ------YPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFE 225 (384) Q Consensus 152 ------~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~ 225 (384) +..--.+..|+....+..||.||...++..+..++...+...........|.+++.-++...+.. + T Consensus 246 ~l~e~g~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~lv~~~g~~~~~~-------l- 317 (555) T protein:vir:17 246 SNSSAPYTHNPWIPLRFNIVDGEAYGRGRVEEFMGDLKSLEALSQAMVEGSAASAKVVFMVSPSATTKPQN-------L- 317 (555) T ss_pred cccccCcccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccCcce-------e- Confidence 00001245555555677899999999999999999999999999888888887775554444321 0 Q ss_pred HHhccccccCCcceecCCCceeeeccc-chhHHHHH--HHHHHHHHHHHHhCCCHHHhcc---ccHHHH----------- Q lcl|NC_019422. 226 KNYLQIDSEAGGAAATDSKYDAEQVKA-ESYVPNAA--QMDKAIQRLYSFFNTNEKIIQS---KYSEDE----------- 288 (384) Q Consensus 226 ~~~~~~~~~~~~~~v~~~g~~~~~l~~-~~~~~~~~--~~~~~~~~I~~~fgvp~~~l~~---~~~e~~----------- 288 (384) ..+ ..+.++.+..-++.++.. ++.+.+.. .+......|-.+|.+- ..-++ |..|-. T Consensus 318 --~~~----~~g~v~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm~~-~~~d~~r~TAtEV~~r~~E~~~~LG 390 (555) T protein:vir:17 318 --ALA----ANGAIIQGRPDDVSVVQANKAADFRTVLEMIQKLEQRISDAFLML-QVRQSERTTATEVQATVQELNEQIG 390 (555) T ss_pred --ecC----CCceeecCCcccceeeeccccchhhHHHHHHHHHHHHHHHHHhhc-CCCCcccchHHHHHHHHHHHHHHHh Confidence 111 123344343444555543 23445532 2345556677777541 11121 122211 Q ss_pred --HHHHHHHHHHHHHHHHHHHHhhc-ccCccccc-CcceEEeechhhhcc-CHHHHHHHHHHH-hC-C------CCCHHH Q lcl|NC_019422. 289 --WNAYYESEIEPVGLQLSNQYTEK-LFTRKARS-FGNEIVFEASNLQYA-SMSTKLNLVQMV-DR-G------SLTPNE 355 (384) Q Consensus 289 --~~~~~~~~i~P~~~~i~~~l~~~-l~~~~~~~-~~~~i~fd~~~~~~~-d~~~~~~~~~~~-~~-g------~~t~NE 355 (384) ...+...-+.|++.+.-..+.+. ++++.... .+..+..-+..+.+. +......+++.+ +. | .+..++ T Consensus 391 pv~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~~v~~~i~~~l~~l~r~~~~~~l~~~~~~laq~~~~p~~~d~id~d~ 470 (555) T protein:vir:17 391 GIYSNLTTELLQPYLARKLHLLQKQRKLPQLPKDLVQPTVVAGLWGVGRGQDKQQLMEFITTLAQTMGPEIAMKYINPTE 470 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCHhhhccceeehHHHHHHHHHHHHHHHHHHHHHhhcCchhHhhcCCHHH Confidence 01244566778777655555443 44432211 223333333333322 222222333322 11 1 111111 Q ss_pred ----HHHHhCCCCCCCCCeeeecCceeecCCCC Q lcl|NC_019422. 356 ----WRKIMNLSPIENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 356 ----~R~~lG~~p~~~gd~~~~~~n~~~~~~ge 384 (384) +-+.+|.+|. +++ ...| T Consensus 471 ~~~~~a~~~Gv~p~----------~iv--rs~e 491 (555) T protein:vir:17 471 FIKRLAAAQGIDTL----------QLI--NSPE 491 (555) T ss_pred HHHHHHHHcCCChh----------hhc--CCHH Confidence 1223444332 111 1111 No 249 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=44.66 E-value=0.8 Score=21.12 Aligned_cols=341 Identities=12% Similarity=0.018 Sum_probs=142.0 Q ss_pred Ccc--------------hhhhcccCCCcchhHHHhhccccCc--------ceechhhhhhcHHHHHHHHHHHHhhccC-- Q lcl|NC_019422. 1 MNI--------------FKSKKKNKEAPGKVMMELISDSGNG--------FYSWHGNLYKSDIVRSIIRPKAKAVGKM-- 56 (384) Q Consensus 1 M~~--------------f~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~v~~~i~~ia~~ia~~-- 56 (384) |-= |...+..+..-...+-++...+.+. -.....+.+.+. .-.|++.+|..+-+. T Consensus 1 ~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst-~~~a~~~Laa~l~~~lt 79 (543) T protein:vir:88 1 MAETKREGLAEEGAKAVYERLKNDRVPYETRAENCAKVTIPSLFPKDSDNSSTDYTTPWQAV-GARGLNNLSAKVMLALF 79 (543) T ss_pred CcccccCcchHHHHHHHHHHHHHHHhHHHHHHHHHHHHhccccCCCCCCcccccccccccch-HHHHHHHHHHHHHHhhc Confidence 221 2222222222111111111111111 011111234433 345666666555321 Q ss_pred ---ceEEEEecCCcce---------ecc-------chHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC-C Q lcl|NC_019422. 57 ---TAKHIRSNETEFK---------TNP-------EIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDY-N 116 (384) Q Consensus 57 ---~~~~~~~~~~~~~---------~~~-------~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~-g 116 (384) ||-=..-.+.... ... .+..+..+.+ -+++.-+..++.++..+|++++++..+.. + T Consensus 80 P~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~----snf~~~~~~~~~~L~~~G~a~ly~~~~~~~~ 155 (543) T protein:vir:88 80 PLQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEA----NSYRVTLFELIRQLALAGTALIYLPPPDASS 155 (543) T ss_pred CCCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHh----cCcHHHHHHHHHHHHhhCceeeeeccCcccc Confidence 2321111111100 000 1111222222 23555666678889999999998876532 1 Q ss_pred ---ceeeEEEEcCceEEEEEcCCCEE------------------------------------EEEEE--cC--------- Q lcl|NC_019422. 117 ---MPTQIYPLNALNVEAIYENEVLF------------------------------------LKFLL--RN--------- 146 (384) Q Consensus 117 ---~~~~l~~l~~~~v~~~~~~~~~~------------------------------------~~~~~--~~--------- 146 (384) .+...||+.... +..+..|.+ |.... .+ T Consensus 156 ~~~~~~~~~pl~~y~--v~~d~~G~v~~i~r~~~~~~~~l~~~~~~~v~~~~~~~p~~~~~v~~~V~pr~~~~~~~~~~~ 233 (543) T protein:vir:88 156 NSYNPMKLYTLHNHV--VQRDAFGNVLQIVTLDKVAYAALPEDVRNSLSGGQEYKPEQELEVYTHIYIDDESGDFLSYQE 233 (543) T ss_pred ceecceEEeEcceEE--EeeCCCCCeeeeeeeeeccHHHHhHHhhHHHHHHhhcCCccceEEEEEEEeecCCCccccccc Confidence 123445554333 223332211 11110 00 Q ss_pred --ceEEEEeh-------hheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHH Q lcl|NC_019422. 147 --GKIVSYPY-------SDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDI 217 (384) Q Consensus 147 --g~~~~~~~-------~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~ 217 (384) |..+.... --.+..|+....+..||.||...++..+..++...+...........|.+++.-++...+.. T Consensus 234 ~~~~~v~~~~~~~~~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~~~- 312 (543) T protein:vir:88 234 IEGVEVDGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVGLVNPNGITQVRR- 312 (543) T ss_pred ccCeeeecCCCccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhh- Confidence 11110000 11344454445566899999999999999999999999999998888887765554444321 Q ss_pred HHHHHHHHHHhccccccCCcceecC--CCceeeecccchhHHHH--HHHHHHHHHHHHHhCCCHHH-hcc---ccHHHH- Q lcl|NC_019422. 218 KKEVKSFEKNYLQIDSEAGGAAATD--SKYDAEQVKAESYVPNA--AQMDKAIQRLYSFFNTNEKI-IQS---KYSEDE- 288 (384) Q Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~v~~--~g~~~~~l~~~~~~~~~--~~~~~~~~~I~~~fgvp~~~-l~~---~~~e~~- 288 (384) ...++ .+.++.+ +++...++.... +.+. ..++.....|-.+|-+.... .++ |..|-. T Consensus 313 ---------~~~~~----~g~~v~g~~~~v~~~~~~~~~-~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEV~~ 378 (543) T protein:vir:88 313 ---------LVKAQ----TGDFVAGRKADIEFLQLEKTA-DFTVAKSVADAIEARLSYVFMLNSAVQRSGERVTAEEIRY 378 (543) T ss_pred ---------cccCC----CceeecCCCCcceeeeccccc-chhHHHHHHHHHHHHHHHHHhhhhhccCCCCcccHHHHHH Confidence 11121 1223333 333344444332 3332 23456777888888665322 222 222211 Q ss_pred H------------HHHHHHHHHHHHHHHHHHHhh-cccCcccccCcceEEeechhhhccCHHHHHHHHHHHh-CCCCCHH Q lcl|NC_019422. 289 W------------NAYYESEIEPVGLQLSNQYTE-KLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQMVD-RGSLTPN 354 (384) Q Consensus 289 ~------------~~~~~~~i~P~~~~i~~~l~~-~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~-~g~~t~N 354 (384) + ..+...-+.|++.+.-..+.+ .++++.... .++.++... +.. +.++.. .++.+.- T Consensus 379 r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~---~v~~~~vs~----l~~---l~r~~~~~~l~~~~ 448 (543) T protein:vir:88 379 VASELEDTLGGVYSILSQELQLPIVRVLLNQLQATQQIPNLPQE---AVEPTVTTG----AEA---LGRGQDLDKLTQFL 448 (543) T ss_pred HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchh---ceeeeEEec----HHH---HHHHHHHHHHHHHH Confidence 1 124455667777665555544 355443211 122221110 111 111111 1232222 Q ss_pred HHHHHhCCCCCCCCCeeeecCceeecCCCC Q lcl|NC_019422. 355 EWRKIMNLSPIENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 355 E~R~~lG~~p~~~gd~~~~~~n~~~~~~ge 384 (384) ..-..++ + |. -+..++..+ T Consensus 449 ~~v~~~~--~-p~--------vld~id~d~ 467 (543) T protein:vir:88 449 NAVATVS--Q-LN--------GDPDLNVNN 467 (543) T ss_pred HHHHhcc--c-hh--------hhccCCHHH Confidence 2222122 1 11 111122222 No 250 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=42.15 E-value=0.9 Score=20.84 Aligned_cols=367 Identities=8% Similarity=-0.014 Sum_probs=151.5 Q ss_pred CcchhhhcccCCCcchhH-------------HHhhccccCccee----chhhhhhcHHHHHHHHHHHHhhccC-----c- Q lcl|NC_019422. 1 MNIFKSKKKNKEAPGKVM-------------MELISDSGNGFYS----WHGNLYKSDIVRSIIRPKAKAVGKM-----T- 57 (384) Q Consensus 1 M~~f~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~----~~~~~~~~~~v~~~i~~ia~~ia~~-----~- 57 (384) -..|.+....+..-...+ ..+..+....-.. ......-++.++.+|+.+...+... + T Consensus 26 ~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~rs~~~~~~v~~~ve~~~~~l~~~~~~~~~~ 105 (651) T protein:vir:80 26 KKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSVGDVNADWRHKITTGKAFEAIETIHAYLMSATFPNKNW 105 (651) T ss_pred HHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccccccCCCCCCCCccccChhHHHHHHHHHHHHHHhhcCCCce Confidence 111111111111100001 0111000000000 0112234568888887665555432 2 Q ss_pred eEEEEecCCcceeccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCC----------------Cc---- Q lcl|NC_019422. 58 AKHIRSNETEFKTNPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDY----------------NM---- 117 (384) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~----------------g~---- 117 (384) |++.+..++..-.........++...=...........++.+.+..|++++.+.++.. |. T Consensus 106 ~~~~p~~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~G~~i~kv~we~~~~~~~~~~~~~~~~~~~~~~~~ 185 (651) T protein:vir:80 106 FDVVPAKPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLITGNSVLALPWRVETAEVKKKVQVRTPLFEDEPTFE 185 (651) T ss_pred eEeccCCchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcccCceEEEEeecceeeeeehheecccccccccccee Confidence 2222221111101111111112211001223556677788999999999887654311 00 Q ss_pred ----------eeeEEEEcCceEEEEEcCC----C---------------------------------------------- Q lcl|NC_019422. 118 ----------PTQIYPLNALNVEAIYENE----V---------------------------------------------- 137 (384) Q Consensus 118 ----------~~~l~~l~~~~v~~~~~~~----~---------------------------------------------- 137 (384) ...+..++|..+-+.+... . T Consensus 186 v~~~~~~~~~~~~i~~v~p~~~~~dp~a~~~~d~~~v~~~~~t~~~l~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 265 (651) T protein:vir:80 186 VVSEEREVKSSPDFEVLDMFDCFYDPNVTDPNRGAFIRKLTKTKADILNLLSEGYYYGVDPLDVVEHKCKDTSDTKQDML 265 (651) T ss_pred eeccceeeeceeEEEEecHHHeeecCCCcCccccceeeeeeeeHHHHHHHHhcccccchhhHHHHhhhccccccCCcccc Confidence 0112223322221110000 0 Q ss_pred -----------------EEEEE----EEcCceE----EEEehhheEEEe--------------ccCCCCCccCccHHHHH Q lcl|NC_019422. 138 -----------------LFLKF----LLRNGKI----VSYPYSDIIHLR--------------KDFNENDLFGTSPAKVL 178 (384) Q Consensus 138 -----------------~~~~~----~~~~g~~----~~~~~~evih~~--------------~~~~~~~~~G~s~~~~~ 178 (384) ..|.+ ...++.. +......|+|.. +....+..||.|++..+ T Consensus 266 ~~~~~~d~~~~~~~~~v~v~E~~~~~d~e~~~~~~~~v~~~g~~il~~~~~~~~~~~Pf~~~~~~~~~~~~yG~g~~~~~ 345 (651) T protein:vir:80 266 STFQGVTTSLWSPHQNVELLEYWGDIHLENKTYHDVVVTIMGNEVLRFEQNPYWCGRPFVIGTYIPTARQPYAMGALQPN 345 (651) T ss_pred ccccCCCccccccccceEEEEEEEEeeccCCceEEEEEEEcCcEEecccccCCCCCCCeeeecceecCccccCCChHHHH Confidence 01111 0011110 111112333332 22223456999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhccccccCCcceecCCCceeeecccchhHHH Q lcl|NC_019422. 179 EPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQIDSEAGGAAATDSKYDAEQVKAESYVPN 258 (384) Q Consensus 179 ~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~~~~~~ 258 (384) +......+...+.........+.|.+++.-++..++++ +. ...|++++...+.++.++.....+.+ T Consensus 346 ~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~~~-------l~-------~~pg~vi~~~~~~~~~~l~~~~~~~~ 411 (651) T protein:vir:80 346 LGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQPED-------VY-------TEPGKVFLVSDHGDLQPLANQSSNFS 411 (651) T ss_pred hHHHHHHHHHHHHHHHHHHHHhCCcEEecCCccccHHH-------hh-------cCCCceEEecCCCCceeeccCcccch Confidence 99999999988888888887778887776665555433 11 12356777777777888765433332 Q ss_pred --HHHHHHHHHHHHHHhCCCHHHhccc-------c-HHHH-H------------HHHHHHHHHHHHHHHHHHHhhcccCc Q lcl|NC_019422. 259 --AAQMDKAIQRLYSFFNTNEKIIQSK-------Y-SEDE-W------------NAYYESEIEPVGLQLSNQYTEKLFTR 315 (384) Q Consensus 259 --~~~~~~~~~~I~~~fgvp~~~l~~~-------~-~e~~-~------------~~~~~~~i~P~~~~i~~~l~~~l~~~ 315 (384) +..+......+-..+||+...-|.+ + .+-+ . ..|....+.|+++.+-..+-+..-.+ T Consensus 412 ~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~l~pl~~r~l~l~~~~~~~~ 491 (651) T protein:vir:80 412 ITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQQFTDQP 491 (651) T ss_pred hHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcc Confidence 2334566678888999998775421 1 1211 1 11223345666555544333221110 Q ss_pred c-cc------c---------CcceEEeechhhhccCHHHH----HHHHHHHhCCCCC------------HHHHHHHhCCC Q lcl|NC_019422. 316 K-AR------S---------FGNEIVFEASNLQYASMSTK----LNLVQMVDRGSLT------------PNEWRKIMNLS 363 (384) Q Consensus 316 ~-~~------~---------~~~~i~fd~~~~~~~d~~~~----~~~~~~~~~g~~t------------~NE~R~~lG~~ 363 (384) . .+ . ......+++..+-.....++ ....++++.+.-. .-++.+.+|++ T Consensus 492 ~~~ri~~~~~~~~~~~~i~~~dl~~~~~iv~~g~~~~~~r~~~~~~l~~~~q~~~~~p~~~~~~~~~~~~~~l~~~~g~~ 571 (651) T protein:vir:80 492 GMVRVAGDEAGAYEYYELDVEDLQKEVRLVPIGSDHVIERKQYIEDRLTFIQAVAQVPEMGQLVDYKRILVDLLQHWGFE 571 (651) T ss_pred cceeecccccccccccccCccceeeeeeeeeccHHHHHHHHHHHHHHHHHHHhhccCCccchhhhHHHHHHHHHHHcCCC Confidence 0 00 0 01112222221111111111 1222223221101 12233445654 Q ss_pred CCCCCCeeeecCceeecCCCC Q lcl|NC_019422. 364 PIENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 364 p~~~gd~~~~~~n~~~~~~ge 384 (384) ..+.++.+..-++....+ T Consensus 572 ---~~~~~l~~~~q~~~~~~~ 589 (651) T protein:vir:80 572 ---EPEAYLKQQDQQAPANPQ 589 (651) T ss_pred ---CcHHhcCCCccchhhhhh Confidence 234444433322222212 No 251 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=36.34 E-value=1.2 Score=20.19 Aligned_cols=360 Identities=12% Similarity=0.036 Sum_probs=140.1 Q ss_pred Ccc------------hhhhcccCCCcchhHHHhhccccCcc------eechhhhhhcHHHHHHHHHHHHhhc------cC Q lcl|NC_019422. 1 MNI------------FKSKKKNKEAPGKVMMELISDSGNGF------YSWHGNLYKSDIVRSIIRPKAKAVG------KM 56 (384) Q Consensus 1 M~~------------f~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~v~~~i~~ia~~ia------~~ 56 (384) |-. |...++.++.-...+-++...+.+.. .......+.+.. -.|++.+|..+- +- T Consensus 1 ~~~~~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~dstg-~~a~~~LAa~l~~~ltpp~~ 79 (517) T protein:vir:10 1 MDMRFAGNKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVNDDLSSQNAWQDDG-ASATNFLSNKLSQVLFPAQR 79 (517) T ss_pred CcccccccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccccccCCCCCccccccccchH-HHHHHHHHHHHHHhhcCCCC Confidence 322 22223333222222222111111111 112233444433 455666655552 22 Q ss_pred ceEEEEecCCcc-e--------eccchHHHHHHhhcc---ccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEE Q lcl|NC_019422. 57 TAKHIRSNETEF-K--------TNPEIYIKFLLENPN---PFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPL 124 (384) Q Consensus 57 ~~~~~~~~~~~~-~--------~~~~~~~~~l~~~PN---~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l 124 (384) ||-=..-++... + .....++..+-.... ..-+++.-+..++.++..+|+++++.. +.+.....|++ T Consensus 80 ~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~--~~~~~~~~~pl 157 (517) T protein:vir:10 80 SFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNVMMYHP--DKTSPIQAVPL 157 (517) T ss_pred ccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEe--CCCCcEEEEEc Confidence 332111111100 0 001111111111111 122455666777888999999988753 33334556666 Q ss_pred cCceEEEEEcCCCEEE---------------------------------------------------EEEEcCceEEE-- Q lcl|NC_019422. 125 NALNVEAIYENEVLFL---------------------------------------------------KFLLRNGKIVS-- 151 (384) Q Consensus 125 ~~~~v~~~~~~~~~~~---------------------------------------------------~~~~~~g~~~~-- 151 (384) .. +-+..+..|.+. .|...+|+..- T Consensus 158 ~~--y~v~~d~~G~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~~d~~~~~~~ 235 (517) T protein:vir:10 158 HH--YCVRRDNNGTVLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTHAKRTKDGKYLIRQSADDVPVGKE 235 (517) T ss_pred Ce--EEEeeCCCcCeEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEEEEEeCCCceEEEEEeCceeeccc Confidence 43 333333333210 01111111100 Q ss_pred ----EehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHH Q lcl|NC_019422. 152 ----YPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKN 227 (384) Q Consensus 152 ----~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~ 227 (384) +..--.+..|+....+..||.||...++..+..++...+...........|.+++.-++...+.. T Consensus 236 s~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~~~~~~~~~~----------- 304 (517) T protein:vir:10 236 STVTEDKSPFLILTWKRSYGEDYGRGMAEDHAGAFFVIQFLSEALARGMALMADVKYLVKPGSYTDINQ----------- 304 (517) T ss_pred cccccccCCeeeeeeeecCCCCcccchHHHhHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhh----------- Confidence 00001233444444566899999999999999999998888888888777776654444433211 Q ss_pred hccccccCCcceecCCCceeeeccc-chhHHHH--HHHHHHHHHHHHHhCCCHHHh-cc---ccHHHH-HH--------- Q lcl|NC_019422. 228 YLQIDSEAGGAAATDSKYDAEQVKA-ESYVPNA--AQMDKAIQRLYSFFNTNEKII-QS---KYSEDE-WN--------- 290 (384) Q Consensus 228 ~~~~~~~~~~~~v~~~g~~~~~l~~-~~~~~~~--~~~~~~~~~I~~~fgvp~~~l-~~---~~~e~~-~~--------- 290 (384) +..+. .+.++.+..-++.++.. +..+.+. ..+......|-.+|-+..... ++ |..|-. +. T Consensus 305 l~~~~---~g~~~~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~~~~~rvTAtEV~~r~~E~~~~LGp 381 (517) T protein:vir:10 305 FVEGG---SGAVLHGVEGDIHIVQLGKYADYTPIQAVLNDYRQRIGRVFMMEAMTRRDAERVTAYEIQRDAMLVEQSLGG 381 (517) T ss_pred ccCCC---ccccccCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhhccCCccccHHHHHHHHHHHHHHhhh Confidence 11111 11222222233444432 2334442 234566778888886654222 22 222311 11 Q ss_pred ---HHHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhcc-CHHHHHHHHHHHhCCCCC-HHHHHHHhCCCCC Q lcl|NC_019422. 291 ---AYYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYA-SMSTKLNLVQMVDRGSLT-PNEWRKIMNLSPI 365 (384) Q Consensus 291 ---~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~-d~~~~~~~~~~~~~g~~t-~NE~R~~lG~~p~ 365 (384) .+...-+.|++.+.-..+-..+ +.... ...+---+..+.+. +......+.+.+.. +.- +..+...++.+.. T Consensus 382 v~~rl~~Ell~Pli~r~~~~l~~~l-~~~~v--~~~~~s~la~l~r~~~~~~i~~~~~~i~~-~a~~~~~~~~~id~d~~ 457 (517) T protein:vir:10 382 VYSLFATTFQGPLARWFMNGISSIL-TSKNV--SPTILTGIEALGRMAELDKLGTFNGYVSM-TAQWPEPLQQAIKWPDF 457 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhc-CCCCc--cceeeccHHHHHHHHHHHHHHHHHHHHHH-hhcCChHHHhcCCHHHH Confidence 1344556666655544443322 11111 11111011222221 12222222221211 100 1111111222111 Q ss_pred -CC-CCeeeecCceeecCCCC Q lcl|NC_019422. 366 -EN-GDKPVRRLDTAVVEGGE 384 (384) Q Consensus 366 -~~-gd~~~~~~n~~~~~~ge 384 (384) ++ ++.+=+|.++.- .+.| T Consensus 458 ~~~~a~~~Gvp~~~ir-s~~e 477 (517) T protein:vir:10 458 TDWVQGQISANFPFFK-TQDE 477 (517) T ss_pred HHHHHHHhCCChhhcC-CHHH Confidence 00 111112221111 0111 No 252 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=34.79 E-value=1.3 Score=20.02 Aligned_cols=354 Identities=11% Similarity=0.003 Sum_probs=139.4 Q ss_pred Ccc------------hhhhcccCCCcchhHHHhhccccCcce------echhhhhhcHHHHHHHHHHHHhhc------cC Q lcl|NC_019422. 1 MNI------------FKSKKKNKEAPGKVMMELISDSGNGFY------SWHGNLYKSDIVRSIIRPKAKAVG------KM 56 (384) Q Consensus 1 M~~------------f~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~v~~~i~~ia~~ia------~~ 56 (384) |+- |...+..+..-...+-+....+.+... ......+.+. --.|++.+|..+- .- T Consensus 5 ~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~dst-g~~a~~~LAa~l~~~ltpp~~ 83 (516) T protein:vir:96 5 IDLEYGGKRSKIPKLWEKFSNKRSSFLDRAKHYSKLTLPYLMNDKGDNETSQNGWQGV-GAQATNHLANKLAQVLFPAQR 83 (516) T ss_pred hhhhhhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHhhcccccCCCCCccccCCcccch-HHHHHHHHHHHHHhhhcCCCC Confidence 221 112222222212222221111111110 1112344443 3456666665552 22 Q ss_pred ceEEEEecCCc--------cee-ccchH-------HHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceee Q lcl|NC_019422. 57 TAKHIRSNETE--------FKT-NPEIY-------IKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQ 120 (384) Q Consensus 57 ~~~~~~~~~~~--------~~~-~~~~~-------~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~ 120 (384) ||-=..-+++. .+. ....+ .+..+.+ -+++.-+..++.++..+|++++++.. .+. .. T Consensus 84 ~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~----snf~~~~~~~~~~L~~~G~a~l~~d~--~~~-~~ 156 (516) T protein:vir:96 84 SFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQ----RQFRPAVVEAFKHLIVAGSCMLYKPS--KGA-IS 156 (516) T ss_pred cccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHh----cCcHHHHHHHHHHHHhHCeEeEEecC--CCC-EE Confidence 34321211110 000 01111 1112222 23555566677889999999988753 222 34 Q ss_pred EEEEcCceEEEEEcCCCEE---------------------------------------------------EEEEEcCceE Q lcl|NC_019422. 121 IYPLNALNVEAIYENEVLF---------------------------------------------------LKFLLRNGKI 149 (384) Q Consensus 121 l~~l~~~~v~~~~~~~~~~---------------------------------------------------~~~~~~~g~~ 149 (384) .||+.. +-+..+..|.+ .+|...+|.. T Consensus 157 ~~pl~~--y~v~~d~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~~d~~~ 234 (516) T protein:vir:96 157 AIPMHH--YVVNRDTNGDLLDIILLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYTHAKYLGDGFWELKQSADDIP 234 (516) T ss_pred EEEcCe--EEEeeCCCCCeeeehhhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEeeeeeCCceeEEEEEeCcee Confidence 555533 33333333211 1111111111 Q ss_pred EE----Eehh--heEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHH Q lcl|NC_019422. 150 VS----YPYS--DIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKS 223 (384) Q Consensus 150 ~~----~~~~--evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~ 223 (384) .. ++.+ -.+..|+....+..||.||...++..+..++...+.......-...|.+.+.-++...+.. T Consensus 235 ~~~es~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~------- 307 (516) T protein:vir:96 235 VGKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDH------- 307 (516) T ss_pred eccccccccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCccccCcccccchhh------- Confidence 10 0001 1244455545567899999999999999999999888888877777766554433333211 Q ss_pred HHHHhccccccCCcceecCCCceeeecccc-hhHHHH--HHHHHHHHHHHHHhCCCHHHh-c---cccHHH-HHHH---- Q lcl|NC_019422. 224 FEKNYLQIDSEAGGAAATDSKYDAEQVKAE-SYVPNA--AQMDKAIQRLYSFFNTNEKII-Q---SKYSED-EWNA---- 291 (384) Q Consensus 224 ~~~~~~~~~~~~~~~~v~~~g~~~~~l~~~-~~~~~~--~~~~~~~~~I~~~fgvp~~~l-~---~~~~e~-~~~~---- 291 (384) ...+. .+.++.+..-++.++... ..+.+. ..+......|-.+|-+..... + .|..|- .+.. T Consensus 308 ---l~~~~----~g~i~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~r~~~rvTAtEV~~r~~E~~~ 380 (516) T protein:vir:96 308 ---FVNSG----TGEVVTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMETMTRRDAERVTAVEIQRDALEIEQ 380 (516) T ss_pred ---hccCC----CceeecCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhhccCCCccccHHHHHHHHHHHHH Confidence 11121 123333333445555433 334443 234566778888886643222 2 122231 1111 Q ss_pred --------HHHHHHHHHHHHHHHHHhhcccCcccccCcceEEeechhhhcc-CHHHHHHHHHHHhCCCC-CHHHHHHHhC Q lcl|NC_019422. 292 --------YYESEIEPVGLQLSNQYTEKLFTRKARSFGNEIVFEASNLQYA-SMSTKLNLVQMVDRGSL-TPNEWRKIMN 361 (384) Q Consensus 292 --------~~~~~i~P~~~~i~~~l~~~l~~~~~~~~~~~i~fd~~~~~~~-d~~~~~~~~~~~~~g~~-t~NE~R~~lG 361 (384) +...-+.|++.+....+... +++.. ....+.--++.+.+. +........+.+.. +. -+-++...++ T Consensus 381 ~LGpv~~rl~~Ell~Pli~r~l~~~~p~-lp~~~--v~~~~vs~l~~l~r~~~~~~i~~~~~~i~~-~~~~~p~v~d~id 456 (516) T protein:vir:96 381 NMGGVYSLFATTMQSPVAMWGLLEAGES-FTSDL--VDPVIITGIEALGRMAELDKLANFAQYMSL-PLQWPEPVLAAVK 456 (516) T ss_pred HhhhHHHHHHHHHHHHHHHHHHHhcCCC-Ccccc--ccceeechHHHHHHHHHHHHHHHHHHHHHH-HhcCChhHHhcCC Confidence 34445566555432222111 11111 011111112222221 12222222221110 11 1112232222 Q ss_pred CCCC-CC-CCeeeecCceeecCCCC Q lcl|NC_019422. 362 LSPI-EN-GDKPVRRLDTAVVEGGE 384 (384) Q Consensus 362 ~~p~-~~-gd~~~~~~n~~~~~~ge 384 (384) .+.. ++ ++.+=+|.++. ...| T Consensus 457 ~d~~~~~~a~~~Gvp~~~i--rs~e 479 (516) T protein:vir:96 457 WPDYMDWVRGQISAELPFL--KSAE 479 (516) T ss_pred HHHHHHHHHHHhCCCcccc--CCHH Confidence 2221 10 11111222211 2222 No 253 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=34.01 E-value=1.3 Score=19.92 Aligned_cols=346 Identities=10% Similarity=-0.027 Sum_probs=139.7 Q ss_pred Cc--------chhhhcccCCCcchhHHHhhccccCc------ceechhhhhhcHHHHHHHHHHHHhhc------cCceEE Q lcl|NC_019422. 1 MN--------IFKSKKKNKEAPGKVMMELISDSGNG------FYSWHGNLYKSDIVRSIIRPKAKAVG------KMTAKH 60 (384) Q Consensus 1 M~--------~f~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~v~~~i~~ia~~ia------~~~~~~ 60 (384) -+ .|...+..++.-...+-+....+.+. .....+..+.+. --.|++.+|..+- .-||-= T Consensus 9 ~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~dst-g~~a~~~LAa~l~~~ltpp~~~WF~ 87 (516) T protein:vir:10 9 YGGKRSKIPKLWEKFSTKRSSFLDRAKHYSKLTLPYLMNDKGDNETSQNGWQGV-GAQATNHLANKLAQVLFPAQRSFFR 87 (516) T ss_pred hhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccccCCCCCcccccccccch-HHHHHHHHHHHHHhhhcCCCCcccc Confidence 11 11222222222222222111111111 011112344443 3456666666552 223332 Q ss_pred EEecCCc--------cee-ccchHHH----HHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCceeeEEEEcCc Q lcl|NC_019422. 61 IRSNETE--------FKT-NPEIYIK----FLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMPTQIYPLNAL 127 (384) Q Consensus 61 ~~~~~~~--------~~~-~~~~~~~----~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~~~l~~l~~~ 127 (384) ..-++.. ... ....++. .+...- ..-+++.-+..++.++..+|++++++.. .+. ...||+.. T Consensus 88 L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~l~~d~--~~~-~~~~pl~~- 162 (516) T protein:vir:10 88 VDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKEL-EQRQFRPAVVEAFKHLIVAGSCMLYKPS--KGA-ISAIPMHH- 162 (516) T ss_pred ccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCeEeEEecC--CCC-eEEEEcCe- Confidence 1211111 000 0111111 111111 1124555566677889999999988743 222 34555533 Q ss_pred eEEEEEcCCCEEE---------------------------------------------------EEEEcCceEEE----E Q lcl|NC_019422. 128 NVEAIYENEVLFL---------------------------------------------------KFLLRNGKIVS----Y 152 (384) Q Consensus 128 ~v~~~~~~~~~~~---------------------------------------------------~~~~~~g~~~~----~ 152 (384) +-+..+..|.+. ++...++.... . T Consensus 163 -y~v~~d~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~~~~~~~~~i~t~v~~~~~~~~~~~~~~d~~~~~~~s~~ 241 (516) T protein:vir:10 163 -YVVNRDTNGDLLDIILLQEKSLRTFDPATRAVVEVGLKGKKCKEDDSIKLYTHAKYLGEGFWELKQSADDIPVGKVSKI 241 (516) T ss_pred -EEEeeCCCCCeEEEeeeecccHHHHHHHhhhhhhhhhhhhccCCCCceEEEEEEEecCCCceEEEEeeCceeecccccc Confidence 333333333211 01111111100 0 Q ss_pred ehh--heEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHHHHHHHhcc Q lcl|NC_019422. 153 PYS--DIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVKSFEKNYLQ 230 (384) Q Consensus 153 ~~~--evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~ 230 (384) +.. -.+..|+....+..||.||...++..+..++...+...........|.+++.-++...+.. ...+ T Consensus 242 ~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~----------l~~~ 311 (516) T protein:vir:10 242 KSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDH----------FVNS 311 (516) T ss_pred ccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCcccCcccccchhh----------hccC Confidence 001 1233444444566899999999999999999999998888888777766654434333211 1112 Q ss_pred ccccCCcceecCCCceeeecccc-hhHHHH--HHHHHHHHHHHHHhCCCHHHh-cc---ccHHHH-HHH----------- Q lcl|NC_019422. 231 IDSEAGGAAATDSKYDAEQVKAE-SYVPNA--AQMDKAIQRLYSFFNTNEKII-QS---KYSEDE-WNA----------- 291 (384) Q Consensus 231 ~~~~~~~~~v~~~g~~~~~l~~~-~~~~~~--~~~~~~~~~I~~~fgvp~~~l-~~---~~~e~~-~~~----------- 291 (384) . .+.++.+..-++.++... ..+.+. ..+......|-.+|-+..... ++ |..|-. +.. T Consensus 312 ~----~g~~~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~~ 387 (516) T protein:vir:10 312 G----TGEVVTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYS 387 (516) T ss_pred C----CceeecCCcccceeeecCcccchHHHHHHHHHHHHHHHHHHhhhhhhccCCccccHHHHHHHHHHHHHHhhhHHH Confidence 1 122333333445555443 334443 234566778888887754332 22 233311 111 Q ss_pred -HHHHHHHHHHHHHHHHHhhcccCcc-cccCcceEEeechhhhcc-CHHHHHHHHHHHhC--CC-------CC----HHH Q lcl|NC_019422. 292 -YYESEIEPVGLQLSNQYTEKLFTRK-ARSFGNEIVFEASNLQYA-SMSTKLNLVQMVDR--GS-------LT----PNE 355 (384) Q Consensus 292 -~~~~~i~P~~~~i~~~l~~~l~~~~-~~~~~~~i~fd~~~~~~~-d~~~~~~~~~~~~~--g~-------~t----~NE 355 (384) +...-+.|++.+.. .. ++++. +...+..+.--++.+.+. +......+.+.+.. ++ +. .++ T Consensus 388 rl~~Ell~Pli~r~~---~~-~~p~~P~~lv~~~~v~~i~~L~raq~~~~i~~~~q~i~~~~q~~p~v~d~id~d~~~~~ 463 (516) T protein:vir:10 388 LFATTMQSPVAMWGL---LE-AGDSFTSDLVDPVIITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDW 463 (516) T ss_pred HHHHHHHHHHHHHHH---Hh-hCCCCChhhcCcceehhHHHHHHHHHHHHHHHHHHHHHHHhcCChHHHhhcCHHHHHHH Confidence 34445566654432 21 12221 111111111112222221 22222222222210 00 00 233 Q ss_pred HHHHhCCCCCCCCCeeeecCceeecCCCC Q lcl|NC_019422. 356 WRKIMNLSPIENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 356 ~R~~lG~~p~~~gd~~~~~~n~~~~~~ge 384 (384) +.+.+|.|+ ++ +...| T Consensus 464 ~a~~~gvp~-----------~~--irs~e 479 (516) T protein:vir:10 464 VRGQISAEL-----------PF--LKSAE 479 (516) T ss_pred HHHHhCCCh-----------hc--cCCHH Confidence 333444432 11 12222 No 254 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=23.97 E-value=2.2 Score=18.67 Aligned_cols=342 Identities=14% Similarity=0.055 Sum_probs=137.2 Q ss_pred Cc-------------chhhhcccCCCcchhHHH---hhccc-----cCcceechhhhhhcHHHHHHHHHHHHhhc-cC-- Q lcl|NC_019422. 1 MN-------------IFKSKKKNKEAPGKVMME---LISDS-----GNGFYSWHGNLYKSDIVRSIIRPKAKAVG-KM-- 56 (384) Q Consensus 1 M~-------------~f~~~~~~~~~~~~~~~~---~~~~~-----~~~~~~~~~~~~~~~~v~~~i~~ia~~ia-~~-- 56 (384) |. .|.+.++.+..-...+-+ +.-.+ ...-.....+.+.+ +.-.|++.+|..+- .+ T Consensus 1 m~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~ds-t~~~a~~~Laa~l~~~ltP 79 (536) T protein:vir:21 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQA-VGARGLNNLASKLMLALFP 79 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccc-cHHHHHHHHHHHHHHhhcC Confidence 21 122222222211111111 11111 01111111233444 33445555555552 22 Q ss_pred --ceEEEEecCCccee----------------ccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCce Q lcl|NC_019422. 57 --TAKHIRSNETEFKT----------------NPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMP 118 (384) Q Consensus 57 --~~~~~~~~~~~~~~----------------~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~ 118 (384) ||-=....+.+-.. ...+..+..+.+- +++.-+..++.+++.+||+++++..+..+.+ T Consensus 80 ~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~s----nf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~ 155 (536) T protein:vir:21 80 MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESN----SYRVTLFEALKQLVVAGNVLLYLPEPEGSNY 155 (536) T ss_pred CCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhc----CcHHHHHHHHHHHHhHCcEeEEEeeCCCCce Confidence 33211111111000 0011222222332 3555566778889999999999876654433 Q ss_pred --eeEEEEcCceEEEEEcCCCEEE--------------------------------------------------EEEEcC Q lcl|NC_019422. 119 --TQIYPLNALNVEAIYENEVLFL--------------------------------------------------KFLLRN 146 (384) Q Consensus 119 --~~l~~l~~~~v~~~~~~~~~~~--------------------------------------------------~~~~~~ 146 (384) ...||+ ..+-+..+..|.+. .|...+ T Consensus 156 ~~f~~~pl--~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~ 233 (536) T protein:vir:21 156 NPMKLYRL--SSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYLRYEEVE 233 (536) T ss_pred eeEEEEEc--CeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEEEEEecCCCcEEEEeccC Confidence 344555 44444444433211 011111 Q ss_pred ceEEE-------EehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHH Q lcl|NC_019422. 147 GKIVS-------YPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKK 219 (384) Q Consensus 147 g~~~~-------~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~ 219 (384) |..+. |..--.+..|+....+..||.||...++..+..++...+...........|...+.-++...+.. T Consensus 234 g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~--- 310 (536) T protein:vir:21 234 GMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRR--- 310 (536) T ss_pred CeeeccccCccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhh--- Confidence 11110 00011244455545566899999999999999999988888887666666655554333333211 Q ss_pred HHHHHHHHhccccccCCcceecC--CCceeeecccchhHHHH--HHHHHHHHHHHHHhCCCHHH-hcc---ccHHHH--- Q lcl|NC_019422. 220 EVKSFEKNYLQIDSEAGGAAATD--SKYDAEQVKAESYVPNA--AQMDKAIQRLYSFFNTNEKI-IQS---KYSEDE--- 288 (384) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~v~~--~g~~~~~l~~~~~~~~~--~~~~~~~~~I~~~fgvp~~~-l~~---~~~e~~--- 288 (384) + ..+. .| .++.+ +.....++.. ..+.+. ..+......|-.+|-+.... .++ |..|-. T Consensus 311 ----~---~~~~---~g-~~v~g~~~~v~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~l~~~~~~r~TAtEV~~r~ 378 (536) T protein:vir:21 311 ----L---TKAQ---TG-DFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVA 378 (536) T ss_pred ----h---ccCC---Cc-ceecCCcccceeeeccc-cccchHHHHHHHHHHHHHHHHHhhhhcccCCCCCccHHHHHHHH Confidence 1 1111 11 12222 2233344443 233332 23456677888888654321 122 222211 Q ss_pred ----------HHHHHHHHHHHHHHHHHHHHhh-cccCcccccCcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHH Q lcl|NC_019422. 289 ----------WNAYYESEIEPVGLQLSNQYTE-KLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWR 357 (384) Q Consensus 289 ----------~~~~~~~~i~P~~~~i~~~l~~-~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R 357 (384) ...+...-+.|++.+....+.+ .++++.... . ++.++... +...... +.+. .++..- - T Consensus 379 ~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~-~--v~~~~vs~----l~~l~r~-~~~~-~l~~~~--~ 447 (536) T protein:vir:21 379 SELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKE-A--VEPTISTG----LEAIGRG-QDLD-KLERCV--T 447 (536) T ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChh-h--ccceEEec----HHHHHHH-HHHH-HHHHHH--H Confidence 1124556677777665555533 355432211 1 12111111 1111111 1111 111111 1 Q ss_pred HHhCCCCCCCCCeeeecCceeecCCCC Q lcl|NC_019422. 358 KIMNLSPIENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 358 ~~lG~~p~~~gd~~~~~~n~~~~~~ge 384 (384) ..-++.|. -.|+. ++..+ T Consensus 448 ~la~~~Pe-~ld~~--------id~d~ 465 (536) T protein:vir:21 448 AWAALAPM-RDDPD--------INLAM 465 (536) T ss_pred HHHhhchh-hhccc--------CCHHH Confidence 11222221 01221 22222 No 255 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=23.92 E-value=2.2 Score=18.66 Aligned_cols=350 Identities=13% Similarity=0.052 Sum_probs=141.0 Q ss_pred Ccc---------------hhhhcccCCCcchhHHH---hhccc-----cCcceechhhhhhcHHHHHHHHHHHHhhc-cC Q lcl|NC_019422. 1 MNI---------------FKSKKKNKEAPGKVMME---LISDS-----GNGFYSWHGNLYKSDIVRSIIRPKAKAVG-KM 56 (384) Q Consensus 1 M~~---------------f~~~~~~~~~~~~~~~~---~~~~~-----~~~~~~~~~~~~~~~~v~~~i~~ia~~ia-~~ 56 (384) |.. |...++.+..-...+-+ +.-.+ ...-.....+.+.+. .-.|++.+|..+- .+ T Consensus 1 ~~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst-~~~a~~~Laa~l~~~l 79 (535) T protein:vir:94 1 MASSQKREGFAENGAKAVYDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAV-GARGLNNLASKLMLAL 79 (535) T ss_pred CCchhhhhhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCccccccCCccccc-HHHHHHHHHHHHHhhh Confidence 322 22122222111111111 11111 000011112233333 3455555555552 22 Q ss_pred ----ceEEEEecCCc-ceec--------cchHHHH----HHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCC-CCce Q lcl|NC_019422. 57 ----TAKHIRSNETE-FKTN--------PEIYIKF----LLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDD-YNMP 118 (384) Q Consensus 57 ----~~~~~~~~~~~-~~~~--------~~~~~~~----l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~-~g~~ 118 (384) ||-=..-.+.. .... ...++.. +...- ..-+++.-+..++.++..+|++++++..+. .+.. T Consensus 80 tP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~-~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~ 158 (535) T protein:vir:94 80 FPMQTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYI-ESNSYRVTLFETLKQLVVAGNALLYIPEPEGTYNP 158 (535) T ss_pred cCCCCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhhCcEeEeeccCcCcccc Confidence 33211111111 0000 1111111 11111 123355556667788999999999987653 3334 Q ss_pred eeEEEEcCceEEEEEcCCCEE------------------------------------EE-------------EEEcCceE Q lcl|NC_019422. 119 TQIYPLNALNVEAIYENEVLF------------------------------------LK-------------FLLRNGKI 149 (384) Q Consensus 119 ~~l~~l~~~~v~~~~~~~~~~------------------------------------~~-------------~~~~~g~~ 149 (384) ...|++.. +-+..+..|.+ |. |....|.. T Consensus 159 f~~~pl~~--y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~g~~ 236 (535) T protein:vir:94 159 MKLYRLSS--YVVQRDAFGTVLQIVTLDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYTHIYLDEESGEYLKYEEIDGVE 236 (535) T ss_pred eEEEEcCe--EEEeeCCCCCeEEEEeeeeccHHHhhHHHHHHHHhccccCCCceeEEEEEEEeeCCCCcEEEEEEecCee Confidence 44555543 33333333321 00 10111111 Q ss_pred EE-------EehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHHHHH Q lcl|NC_019422. 150 VS-------YPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKKEVK 222 (384) Q Consensus 150 ~~-------~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~ 222 (384) +. +..--.+..|+....+..||.||...++..+..++...+...........|.+.+.-++...+.. T Consensus 237 ~~~~~~~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~------ 310 (535) T protein:vir:94 237 VEGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQVRR------ 310 (535) T ss_pred eccccccCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccccchhh------ Confidence 10 00012344454445566899999999999999999988888887777666665554333333211 Q ss_pred HHHHHhccccccCCcceecC--CCceeeecccchhHHHH--HHHHHHHHHHHHHhCCCHHH-hcc---ccHHHH------ Q lcl|NC_019422. 223 SFEKNYLQIDSEAGGAAATD--SKYDAEQVKAESYVPNA--AQMDKAIQRLYSFFNTNEKI-IQS---KYSEDE------ 288 (384) Q Consensus 223 ~~~~~~~~~~~~~~~~~v~~--~g~~~~~l~~~~~~~~~--~~~~~~~~~I~~~fgvp~~~-l~~---~~~e~~------ 288 (384) +.. ...+.++.+ +++...++... .+.+. ..++.....|-.+|-+.... .++ |..|-. T Consensus 311 -----~~~---~~~g~~v~g~~~~v~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~~~~~d~~rvTAtEV~~r~~E~ 381 (535) T protein:vir:94 311 -----LTK---AQTGDFVSGRPEDISFLQLEKA-ADFSVARAVSEQIEGRLSYAFMLNSAVQRTGERVTAEEIRYVASEL 381 (535) T ss_pred -----ccc---CCCceeecCCcccceeeecccc-cchhHHHHHHHHHHHHHHHHHhHhhhccCCCCCccHHHHHHHHHHH Confidence 111 111222222 33334444432 33332 23456677888888433211 122 222211 Q ss_pred -------HHHHHHHHHHHHHHHHHHHHhh-cccCcccccCcceEEe--echhhhcc-CHHHHHHHHHHHhCCCCCH---- Q lcl|NC_019422. 289 -------WNAYYESEIEPVGLQLSNQYTE-KLFTRKARSFGNEIVF--EASNLQYA-SMSTKLNLVQMVDRGSLTP---- 353 (384) Q Consensus 289 -------~~~~~~~~i~P~~~~i~~~l~~-~l~~~~~~~~~~~i~f--d~~~~~~~-d~~~~~~~~~~~~~g~~t~---- 353 (384) ...+...-+.|++.+.-..+.+ .++++-... ...+++ -+..+.+. +......++..+.. +-| T Consensus 382 ~~~LGpv~~rl~~ElL~Pli~r~~~il~r~g~lP~~p~~-~v~~~~vs~la~l~r~~~~~~l~~~~~~laq--~~P~~ld 458 (535) T protein:vir:94 382 EDTLGGVYSILSQELQLPMVRVLLKQLQATNQIPELPKE-AVEPTISTGMEALGRGQDLDKLERCIAAWSA--LAPMQGD 458 (535) T ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChh-hccceEeehHHHHHHHHHHHHHHHHHHHHHh--hChHHhh Confidence 1124566678887776555543 355542211 122233 11222221 12222222221111 122 Q ss_pred -----HH----HHHHhCCCCCCCCCeeeecCceeecCCCC Q lcl|NC_019422. 354 -----NE----WRKIMNLSPIENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 354 -----NE----~R~~lG~~p~~~gd~~~~~~n~~~~~~ge 384 (384) ++ +-+.+|.|+. ++. ...| T Consensus 459 ~~id~d~~~~~~a~~~Gvp~~----------~i~--rs~e 486 (535) T protein:vir:94 459 PDINIATIKLRIANAIGIDTS----------GIL--KTPE 486 (535) T ss_pred hcCCHHHHHHHHHHHhCCChh----------hhc--CCHH Confidence 11 1122333310 010 1111 No 256 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=22.08 E-value=2.5 Score=18.40 Aligned_cols=342 Identities=14% Similarity=0.049 Sum_probs=137.1 Q ss_pred Cc-------------chhhhcccCCCcchhH---HHhhccc-----cCcceechhhhhhcHHHHHHHHHHHHhhc-cC-- Q lcl|NC_019422. 1 MN-------------IFKSKKKNKEAPGKVM---MELISDS-----GNGFYSWHGNLYKSDIVRSIIRPKAKAVG-KM-- 56 (384) Q Consensus 1 M~-------------~f~~~~~~~~~~~~~~---~~~~~~~-----~~~~~~~~~~~~~~~~v~~~i~~ia~~ia-~~-- 56 (384) |. .|.+.++.+..-...+ ..+.-.+ ...-.....+.+.+. .-.|++.+|..+- .+ T Consensus 1 m~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst-~~~a~~~Laa~l~~~ltP 79 (536) T protein:vir:10 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAV-GARGLNNLASKLMLALFP 79 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccccccccccccc-HHHHHHHHHHHHHhhhcC Confidence 21 1222222222111111 1111111 011111112234443 3455555555552 22 Q ss_pred --ceEEEEecCCccee----------------ccchHHHHHHhhccccCCHHHHHHHHHHHHHHhCCeeEEEeeCCCCce Q lcl|NC_019422. 57 --TAKHIRSNETEFKT----------------NPEIYIKFLLENPNPFMSGQILQEKMVTQLELNSNAFAVIIKDDYNMP 118 (384) Q Consensus 57 --~~~~~~~~~~~~~~----------------~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~l~~G~~~~~~~~~~~g~~ 118 (384) ||-=....+.+-.. ...+..+..+.+- +++.-+..++.+++.+||+++++..+..+.+ T Consensus 80 ~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~s----nf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~ 155 (536) T protein:vir:10 80 MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESN----SYRVTLFEALKQLVVAGNVLLYLPEPEGSNY 155 (536) T ss_pred CCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhc----CcHHHHHHHHHHHHhHCcEeEEEeeCCCCce Confidence 33211111111000 0011222222232 3555566778889999999999876654433 Q ss_pred --eeEEEEcCceEEEEEcCCCEEEE--------------------------------------------------EEEcC Q lcl|NC_019422. 119 --TQIYPLNALNVEAIYENEVLFLK--------------------------------------------------FLLRN 146 (384) Q Consensus 119 --~~l~~l~~~~v~~~~~~~~~~~~--------------------------------------------------~~~~~ 146 (384) ...||+ ..+-+..+..|.+.. |...+ T Consensus 156 ~~~~~~pl--~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~~~e~~ 233 (536) T protein:vir:10 156 NPMKLYRL--SSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEASGEYLRYEEVE 233 (536) T ss_pred eeEEEEEc--CeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccCcccceEEEEEEEEecCCCcEEEEEeec Confidence 344555 444444444432110 00011 Q ss_pred ceEEE-------EehhheEEEeccCCCCCccCccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEeeCCCCChHHHHH Q lcl|NC_019422. 147 GKIVS-------YPYSDIIHLRKDFNENDLFGTSPAKVLEPIMEVVNTTDQGVVKAIKNSNTIKWLLKFKTALRPDDIKK 219 (384) Q Consensus 147 g~~~~-------~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~ 219 (384) |+.+. |..--.+..|+....+..||.||...++..+..++...+...........|...+.-++...+.. T Consensus 234 g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~--- 310 (536) T protein:vir:10 234 GMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRR--- 310 (536) T ss_pred CccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhh--- Confidence 11110 00011244455545566899999999999999999988888887666666655554333333211 Q ss_pred HHHHHHHHhccccccCCcceecC--CCceeeecccchhHHHH--HHHHHHHHHHHHHhCCCHHH-hcc---ccHHHH--- Q lcl|NC_019422. 220 EVKSFEKNYLQIDSEAGGAAATD--SKYDAEQVKAESYVPNA--AQMDKAIQRLYSFFNTNEKI-IQS---KYSEDE--- 288 (384) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~v~~--~g~~~~~l~~~~~~~~~--~~~~~~~~~I~~~fgvp~~~-l~~---~~~e~~--- 288 (384) + ..+. .| .++.+ +.....++.. ..+.+. ..+......|-.+|-+.... .++ |..|-. T Consensus 311 ----~---~~~~---~g-~~v~g~~~~v~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~l~~~~~~r~TAtEV~~r~ 378 (536) T protein:vir:10 311 ----L---TKAQ---TG-DFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVA 378 (536) T ss_pred ----h---ccCC---Cc-ceecCCcccceeeeccc-cccchHHHHHHHHHHHHHHHHHhhhhcccCCCCCccHHHHHHHH Confidence 1 1111 12 12222 2233344443 233332 23456677888888654321 122 222211 Q ss_pred ----------HHHHHHHHHHHHHHHHHHHHhh-cccCcccccCcceEEeechhhhccCHHHHHHHHHHHhCCCCCHHHHH Q lcl|NC_019422. 289 ----------WNAYYESEIEPVGLQLSNQYTE-KLFTRKARSFGNEIVFEASNLQYASMSTKLNLVQMVDRGSLTPNEWR 357 (384) Q Consensus 289 ----------~~~~~~~~i~P~~~~i~~~l~~-~l~~~~~~~~~~~i~fd~~~~~~~d~~~~~~~~~~~~~g~~t~NE~R 357 (384) ...+...-+.|++.+....+.+ .++++.... . ++.++... +...... +.+. .++..-. T Consensus 379 ~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~-~--v~~~~vs~----l~~l~r~-~~~~-~l~~~~~-- 447 (536) T protein:vir:10 379 SELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKE-A--VEPTISTG----LEAIGRG-QDLD-KLERCVT-- 447 (536) T ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChh-h--ccceEEec----HHHHHHH-HHHH-HHHHHHH-- Confidence 1124556677777665555533 345432211 1 12111111 1111111 1111 1111111 Q ss_pred HHhCCCCCCCCCeeeecCceeecCCCC Q lcl|NC_019422. 358 KIMNLSPIENGDKPVRRLDTAVVEGGE 384 (384) Q Consensus 358 ~~lG~~p~~~gd~~~~~~n~~~~~~ge 384 (384) ..-++.|. -.|+. ++..+ T Consensus 448 ~la~~~P~-~ld~~--------id~d~ 465 (536) T protein:vir:10 448 AWAALAPM-RDDPD--------INLAM 465 (536) T ss_pred HHHhhchh-hhccc--------CCHHH Confidence 11222221 01221 22222 Done!