Query lcl|NC_011801.1_cdsid_YP_002455792.1 [gene=Lv-1_gp03] [protein=portal protein] [protein_id=YP_002455792.1] [location=2545..3705] Match_columns 386 No_of_seqs 122 out of 1069 Neff 9.7 Searched_HMMs 1612 Date Thu Nov 7 12:47:46 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_3 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_3_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:4454 Length: 414 # 100.0 9.1E-89 5.7E-92 503.2 41.6 373 1-386 1-405 (414) 2 protein:vir:4598 Length: 416 # 100.0 9.6E-89 6E-92 503.1 41.5 380 1-386 1-414 (416) 3 protein:vir:81095 Length: 416 100.0 9.6E-89 6E-92 503.1 41.5 380 1-386 1-414 (416) 4 protein:vir:9408 Length: 441 # 100.0 1E-88 6.4E-92 502.9 41.6 380 1-386 26-439 (441) 5 protein:vir:79984 Length: 441 100.0 1E-88 6.4E-92 502.9 41.6 380 1-386 26-439 (441) 6 protein:vir:98396 Length: 441 100.0 1.3E-88 7.9E-92 502.5 41.5 380 1-386 26-439 (441) 7 protein:vir:100249 Length: 431 100.0 5.1E-88 3.1E-91 499.2 41.2 375 1-386 1-424 (431) 8 protein:vir:1884 Length: 424 # 100.0 5.7E-88 3.6E-91 498.9 40.5 375 1-386 14-421 (424) 9 protein:vir:1380 Length: 422 # 100.0 9.5E-88 5.9E-91 497.7 40.8 378 1-386 1-419 (422) 10 protein:vir:4509 Length: 424 # 100.0 1.3E-87 8.1E-91 496.9 39.8 372 1-386 16-423 (424) 11 protein:vir:189 Length: 424 # 100.0 2.5E-87 1.6E-90 495.4 40.0 375 1-386 14-422 (424) 12 protein:vir:81152 Length: 411 100.0 3E-87 1.9E-90 494.9 39.3 372 1-386 1-411 (411) 13 protein:vir:5737 Length: 419 # 100.0 9.1E-87 5.6E-90 492.3 39.6 368 1-386 1-393 (419) 14 protein:vir:102080 Length: 429 100.0 1.9E-86 1.2E-89 490.5 41.3 377 1-386 1-417 (429) 15 protein:vir:105064 Length: 421 100.0 1.2E-86 7.2E-90 491.7 40.1 367 1-386 1-395 (421) 16 protein:vir:107605 Length: 432 100.0 2.1E-86 1.3E-89 490.3 40.5 377 1-386 1-430 (432) 17 protein:vir:102855 Length: 432 100.0 2.1E-86 1.3E-89 490.3 40.5 377 1-386 1-430 (432) 18 protein:vir:105002 Length: 432 100.0 2.1E-86 1.3E-89 490.3 40.5 377 1-386 1-430 (432) 19 protein:vir:6240 Length: 457 # 100.0 2.9E-86 1.8E-89 489.6 41.1 380 1-386 1-425 (457) 20 protein:vir:81072 Length: 432 100.0 3.1E-86 1.9E-89 489.4 40.1 373 1-386 7-430 (432) 21 protein:vir:3843 Length: 397 # 100.0 1E-85 6.4E-89 486.5 42.1 378 1-386 1-384 (397) 22 protein:vir:93610 Length: 454 100.0 5.3E-86 3.3E-89 488.1 40.2 378 3-386 1-431 (454) 23 protein:vir:483 Length: 413 # 100.0 1.4E-85 9E-89 485.7 40.8 368 2-386 1-391 (413) 24 protein:vir:8418 Length: 409 # 100.0 1E-85 6.3E-89 486.5 40.0 375 1-386 1-408 (409) 25 protein:vir:97060 Length: 432 100.0 1.3E-85 8.2E-89 485.9 40.0 373 1-386 7-424 (432) 26 protein:vir:100150 Length: 437 100.0 1.9E-85 1.2E-88 485.0 40.8 376 1-386 1-419 (437) 27 protein:vir:1326 Length: 457 # 100.0 1.9E-85 1.2E-88 485.1 40.7 375 1-386 1-410 (457) 28 protein:vir:10362 Length: 432 100.0 1.6E-85 1E-88 485.4 40.0 373 1-386 7-430 (432) 29 protein:vir:102118 Length: 409 100.0 3.9E-85 2.4E-88 483.4 39.8 374 1-386 1-408 (409) 30 protein:vir:1431 Length: 419 # 100.0 3.9E-85 2.4E-88 483.4 39.4 370 2-386 1-412 (419) 31 protein:vir:1266 Length: 416 # 100.0 1.4E-84 8.9E-88 480.3 40.5 372 2-386 1-412 (416) 32 protein:vir:4337 Length: 434 # 100.0 2.2E-84 1.4E-87 479.2 40.7 368 1-386 1-406 (434) 33 protein:vir:80333 Length: 419 100.0 2.3E-84 1.4E-87 479.2 39.6 367 1-386 1-392 (419) 34 protein:vir:8317 Length: 409 # 100.0 2.4E-84 1.5E-87 479.0 37.9 360 1-376 1-409 (409) 35 protein:vir:7853 Length: 518 # 100.0 1.7E-83 1E-86 474.4 40.3 378 2-386 1-410 (518) 36 protein:vir:4854 Length: 386 # 100.0 3.2E-83 2E-86 472.8 40.9 378 1-386 1-382 (386) 37 protein:vir:101648 Length: 518 100.0 5.1E-83 3.2E-86 471.7 41.0 378 2-386 1-410 (518) 38 protein:vir:3868 Length: 417 # 100.0 7.1E-83 4.4E-86 470.9 40.1 369 1-386 1-412 (417) 39 protein:vir:100187 Length: 385 100.0 9.2E-83 5.7E-86 470.3 39.8 374 1-386 1-382 (385) 40 protein:vir:81218 Length: 423 100.0 9.5E-83 5.9E-86 470.3 39.8 379 1-386 1-415 (423) 41 protein:vir:2683 Length: 412 # 100.0 2.3E-82 1.4E-85 468.1 39.4 374 1-386 1-407 (412) 42 protein:vir:94666 Length: 723 100.0 3.3E-82 2.1E-85 467.3 39.7 363 1-386 1-397 (723) 43 protein:vir:9702 Length: 406 # 100.0 5.4E-82 3.3E-85 466.1 39.5 367 1-386 1-399 (406) 44 protein:vir:4952 Length: 386 # 100.0 2.2E-81 1.4E-84 462.7 41.8 378 1-386 1-382 (386) 45 protein:vir:1082 Length: 359 # 100.0 8.6E-82 5.3E-85 465.0 39.2 358 1-366 1-359 (359) 46 protein:vir:4828 Length: 382 # 100.0 1.9E-81 1.2E-84 463.1 40.7 378 1-386 1-378 (382) 47 protein:vir:96980 Length: 409 100.0 1.2E-81 7.7E-85 464.1 39.4 372 1-386 4-404 (409) 48 protein:vir:7407 Length: 392 # 100.0 2E-81 1.2E-84 463.0 40.5 377 1-386 3-388 (392) 49 protein:vir:1023 Length: 392 # 100.0 4.8E-81 3E-84 460.9 41.0 377 1-386 3-388 (392) 50 protein:vir:3989 Length: 392 # 100.0 4.8E-81 3E-84 460.9 41.0 377 1-386 3-388 (392) 51 protein:vir:100882 Length: 383 100.0 4.3E-81 2.7E-84 461.2 39.9 374 1-386 1-382 (383) 52 protein:vir:93943 Length: 409 100.0 9.9E-81 6.1E-84 459.2 39.4 373 1-386 4-405 (409) 53 protein:vir:94426 Length: 409 100.0 4.8E-80 3E-83 455.4 39.4 371 1-386 4-404 (409) 54 protein:vir:8100 Length: 466 # 100.0 1.6E-79 1E-82 452.5 38.8 382 1-386 1-463 (466) 55 protein:vir:101647 Length: 460 100.0 2.2E-79 1.4E-82 451.8 39.1 380 1-386 2-459 (460) 56 protein:vir:104259 Length: 403 100.0 1E-78 6.5E-82 448.1 38.8 366 1-386 1-392 (403) 57 protein:vir:95378 Length: 406 100.0 2.2E-78 1.4E-81 446.3 39.1 369 1-386 1-398 (406) 58 protein:vir:80134 Length: 403 100.0 1.1E-77 6.5E-81 442.6 37.7 366 1-386 1-395 (403) 59 protein:vir:4995 Length: 384 # 100.0 2.1E-77 1.3E-80 441.0 38.0 366 1-378 1-384 (384) 60 protein:vir:102727 Length: 945 100.0 3.8E-77 2.3E-80 439.6 39.3 382 1-386 62-517 (945) 61 protein:vir:960 Length: 413 # 100.0 5E-77 3.1E-80 438.9 37.8 363 1-386 13-412 (413) 62 protein:vir:6210 Length: 394 # 100.0 2.9E-75 1.8E-78 429.2 36.9 360 1-386 1-391 (394) 63 protein:vir:100650 Length: 395 100.0 1.9E-73 1.2E-76 419.3 36.8 354 1-386 1-390 (395) 64 protein:vir:101289 Length: 395 100.0 1.9E-73 1.2E-76 419.3 36.8 354 1-386 1-390 (395) 65 protein:vir:9507 Length: 395 # 100.0 1.9E-73 1.2E-76 419.3 36.8 354 1-386 1-390 (395) 66 protein:vir:95965 Length: 385 100.0 1.4E-73 8.7E-77 420.0 34.5 356 1-386 1-382 (385) 67 protein:vir:80644 Length: 551 100.0 1.3E-72 8.2E-76 414.7 37.1 379 1-386 5-487 (551) 68 protein:vir:9359 Length: 348 # 100.0 1.6E-72 9.7E-76 414.2 36.4 317 58-386 1-343 (348) 69 protein:vir:78310 Length: 376 100.0 1.2E-72 7.4E-76 414.9 34.7 355 1-386 1-375 (376) 70 protein:vir:80796 Length: 574 100.0 1E-71 6.4E-75 409.8 36.2 379 1-386 27-508 (574) 71 protein:vir:63755 Length: 547 100.0 4E-71 2.5E-74 406.6 38.4 379 1-386 1-483 (547) 72 protein:vir:100691 Length: 535 100.0 4E-71 2.5E-74 406.5 37.4 378 1-386 34-493 (535) 73 protein:vir:4156 Length: 542 # 100.0 2.2E-70 1.4E-73 402.4 36.1 376 3-386 1-432 (542) 74 protein:vir:3153 Length: 467 # 100.0 1.9E-69 1.2E-72 397.3 37.8 345 37-386 1-432 (467) 75 protein:vir:96579 Length: 576 100.0 9.4E-69 5.8E-72 393.5 38.8 382 1-386 32-491 (576) 76 protein:vir:4194 Length: 540 # 100.0 3.9E-69 2.4E-72 395.6 35.2 369 1-386 6-434 (540) 77 protein:vir:4089 Length: 395 # 100.0 3E-69 1.9E-72 396.2 34.3 360 1-386 1-388 (395) 78 protein:vir:99312 Length: 563 100.0 9.7E-68 6E-71 388.0 39.3 378 1-386 43-493 (563) 79 protein:vir:95599 Length: 563 100.0 9.7E-68 6E-71 388.0 39.3 378 1-386 43-493 (563) 80 protein:vir:93867 Length: 378 100.0 1.4E-68 8.8E-72 392.6 32.6 326 1-386 1-375 (378) 81 protein:vir:94002 Length: 378 100.0 1.5E-68 9.3E-72 392.4 32.6 324 1-386 1-373 (378) 82 protein:vir:1661 Length: 378 # 100.0 3.7E-68 2.3E-71 390.2 32.7 326 1-386 1-375 (378) 83 protein:vir:98643 Length: 395 100.0 4.3E-68 2.7E-71 389.9 32.8 361 1-386 1-390 (395) 84 protein:vir:9641 Length: 395 # 100.0 3.8E-68 2.3E-71 390.2 32.5 353 1-386 1-389 (395) 85 protein:vir:858 Length: 378 # 100.0 1.1E-65 6.9E-69 376.7 32.6 324 1-386 1-373 (378) 86 protein:vir:94869 Length: 378 100.0 1.1E-65 6.7E-69 376.8 32.2 320 1-386 1-375 (378) 87 protein:vir:79772 Length: 648 100.0 5.2E-64 3.3E-67 367.5 38.8 376 1-386 34-486 (648) 88 protein:vir:99452 Length: 651 100.0 4.3E-64 2.7E-67 368.0 31.8 380 1-386 1-523 (651) 89 protein:vir:78641 Length: 278 100.0 2.7E-58 1.6E-61 336.2 31.0 262 58-330 1-278 (278) 90 protein:vir:79150 Length: 368 100.0 5.5E-56 3.4E-59 323.5 26.5 333 1-346 1-368 (368) 91 protein:vir:103971 Length: 376 100.0 1.1E-54 6.7E-58 316.5 31.9 308 1-337 26-376 (376) 92 protein:vir:79207 Length: 351 100.0 3.7E-54 2.3E-57 313.5 30.7 308 4-337 1-351 (351) 93 protein:vir:100328 Length: 346 100.0 4E-54 2.5E-57 313.3 28.8 324 1-335 1-346 (346) 94 protein:vir:78191 Length: 351 100.0 1.5E-53 9.2E-57 310.2 31.1 308 4-337 1-351 (351) 95 protein:vir:98567 Length: 340 100.0 1.5E-53 9.4E-57 310.2 29.1 317 1-334 1-340 (340) 96 protein:vir:267 Length: 348 # 100.0 3.1E-53 1.9E-56 308.4 30.5 310 1-338 17-348 (348) 97 protein:vir:3780 Length: 345 # 100.0 1.4E-52 8.6E-56 304.9 30.2 324 1-332 1-345 (345) 98 protein:vir:78749 Length: 337 100.0 5.4E-53 3.4E-56 307.1 27.7 307 4-331 1-337 (337) 99 protein:vir:3743 Length: 345 # 100.0 9.8E-53 6.1E-56 305.7 28.0 309 1-332 1-345 (345) 100 protein:vir:6058 Length: 344 # 100.0 7.4E-52 4.6E-55 300.9 30.6 317 4-335 1-344 (344) 101 protein:vir:4698 Length: 251 # 100.0 1.9E-52 1.2E-55 304.1 27.2 242 1-246 1-251 (251) 102 protein:vir:1150 Length: 350 # 100.0 1.2E-51 7.3E-55 299.8 30.1 312 1-330 1-350 (350) 103 protein:vir:2013 Length: 344 # 100.0 2.5E-51 1.5E-54 298.0 29.3 317 4-335 1-344 (344) 104 protein:vir:5691 Length: 344 # 100.0 4.8E-51 3E-54 296.4 29.4 317 4-335 1-344 (344) 105 protein:vir:98853 Length: 219 100.0 4.5E-42 2.8E-45 247.3 21.6 205 124-334 1-219 (219) 106 protein:vir:5249 Length: 437 # 100.0 3.7E-29 2.3E-32 176.4 26.8 376 1-386 1-424 (437) 107 protein:vir:79647 Length: 435 99.9 2E-23 1.3E-26 145.0 26.8 374 1-386 1-434 (435) 108 protein:vir:107742 Length: 537 99.9 1.7E-22 1.1E-25 139.9 27.3 372 1-386 35-514 (537) 109 protein:vir:94049 Length: 532 99.9 1.9E-22 1.2E-25 139.7 25.8 374 1-386 33-510 (532) 110 protein:vir:80040 Length: 461 99.8 2E-21 1.2E-24 134.1 26.2 375 1-386 1-460 (461) 111 protein:vir:104338 Length: 422 99.8 3.8E-21 2.4E-24 132.5 27.2 372 1-386 1-422 (422) 112 protein:vir:99563 Length: 862 99.8 7E-21 4.3E-24 131.1 26.8 370 1-386 66-540 (862) 113 protein:vir:107662 Length: 427 99.8 1.3E-19 8E-23 124.1 26.5 367 1-386 1-425 (427) 114 protein:vir:96068 Length: 765 99.8 1.7E-19 1.1E-22 123.5 26.3 371 1-386 44-512 (765) 115 protein:vir:108215 Length: 469 99.7 1.1E-16 6.7E-20 108.1 34.4 370 7-386 1-445 (469) 116 protein:vir:79538 Length: 502 99.7 7E-17 4.4E-20 109.1 27.6 380 1-386 1-502 (502) 117 protein:vir:99853 Length: 488 99.7 2.8E-15 1.7E-18 100.4 34.3 361 8-386 1-407 (488) 118 protein:vir:99232 Length: 526 99.7 4.1E-15 2.6E-18 99.4 34.7 369 1-386 1-434 (526) 119 protein:vir:103860 Length: 528 99.7 5.1E-15 3.2E-18 98.9 34.8 368 1-386 1-435 (528) 120 protein:vir:96738 Length: 505 99.6 1.3E-15 8.1E-19 102.2 28.3 379 1-386 8-505 (505) 121 protein:vir:79233 Length: 526 99.6 5.1E-14 3.1E-17 93.5 35.0 365 1-386 1-426 (526) 122 protein:vir:389 Length: 530 # 99.6 1.5E-14 9.5E-18 96.3 30.3 381 1-386 1-525 (530) 123 protein:vir:95542 Length: 548 99.6 9.6E-15 6E-18 97.4 27.4 380 1-386 1-491 (548) 124 protein:vir:1986 Length: 512 # 99.6 3.4E-13 2.1E-16 89.0 34.6 368 1-386 1-425 (512) 125 protein:vir:79063 Length: 491 99.6 5.7E-13 3.5E-16 87.7 34.9 366 1-386 3-418 (491) 126 protein:vir:106716 Length: 698 99.5 2.7E-15 1.7E-18 100.4 21.0 370 1-386 67-525 (698) 127 protein:vir:107880 Length: 491 99.5 1.3E-12 8.3E-16 85.7 34.9 362 1-386 3-418 (491) 128 protein:vir:6382 Length: 553 # 99.5 1.3E-13 8.2E-17 91.2 28.9 381 1-386 2-542 (553) 129 protein:vir:79511 Length: 448 99.5 1.6E-13 9.8E-17 90.8 29.1 373 1-386 1-438 (448) 130 protein:vir:10321 Length: 495 99.5 2.7E-13 1.6E-16 89.5 29.6 379 1-386 1-494 (495) 131 protein:vir:98816 Length: 446 99.5 1.1E-13 6.7E-17 91.7 27.4 360 1-369 3-446 (446) 132 protein:vir:3420 Length: 533 # 99.5 3.6E-13 2.3E-16 88.8 30.2 381 1-386 3-529 (533) 133 protein:vir:101541 Length: 694 99.5 3.7E-14 2.3E-17 94.2 22.8 370 1-386 65-524 (694) 134 protein:vir:77981 Length: 448 99.5 1.2E-12 7.3E-16 86.0 30.7 364 1-386 1-426 (448) 135 protein:vir:78589 Length: 695 99.5 5.2E-14 3.2E-17 93.4 23.0 369 1-386 67-525 (695) 136 protein:vir:105782 Length: 449 99.5 1.8E-13 1.1E-16 90.5 25.4 359 1-386 23-446 (449) 137 protein:vir:3648 Length: 695 # 99.5 6.5E-14 4E-17 92.9 22.5 369 1-386 67-525 (695) 138 protein:vir:95254 Length: 488 99.3 4.8E-11 3E-14 77.2 31.1 373 1-386 1-463 (488) 139 protein:vir:78161 Length: 355 99.3 5.6E-11 3.5E-14 76.8 27.5 275 102-386 1-309 (355) 140 protein:vir:106491 Length: 646 98.9 1.1E-09 6.9E-13 69.7 20.0 371 1-386 1-480 (646) 141 protein:vir:94742 Length: 409 98.9 2E-08 1.2E-11 62.8 28.3 342 1-366 3-409 (409) 142 protein:vir:99916 Length: 504 98.8 3.3E-08 2E-11 61.6 29.9 374 1-386 23-497 (504) 143 protein:vir:98444 Length: 434 98.8 5.1E-08 3.2E-11 60.5 27.8 335 31-386 1-433 (434) 144 protein:vir:5839 Length: 533 # 98.7 2.8E-08 1.7E-11 62.0 21.4 382 1-386 4-492 (533) 145 protein:vir:8654 Length: 629 # 98.7 5.9E-09 3.6E-12 65.7 17.3 372 1-386 1-503 (629) 146 protein:vir:99088 Length: 629 98.7 6.7E-09 4.2E-12 65.4 17.0 372 1-386 1-503 (629) 147 protein:vir:103219 Length: 201 98.6 1E-09 6.5E-13 69.8 11.8 172 208-384 1-201 (201) 148 protein:vir:102602 Length: 456 98.6 2.2E-07 1.4E-10 57.1 23.6 371 1-386 7-455 (456) 149 protein:vir:105819 Length: 456 98.6 2.2E-07 1.4E-10 57.1 23.6 371 1-386 7-455 (456) 150 protein:vir:1236 Length: 483 # 98.6 2.6E-07 1.6E-10 56.7 30.6 368 1-386 34-481 (483) 151 protein:vir:5961 Length: 503 # 98.6 2.7E-07 1.7E-10 56.6 32.1 371 1-386 13-492 (503) 152 protein:vir:7987 Length: 456 # 98.6 3E-07 1.8E-10 56.4 24.5 370 1-386 7-453 (456) 153 protein:vir:80680 Length: 441 98.5 3.1E-07 1.9E-10 56.3 25.6 363 1-386 6-439 (441) 154 protein:vir:1634 Length: 409 # 98.5 3.5E-07 2.1E-10 56.0 27.3 341 1-366 3-409 (409) 155 protein:vir:3028 Length: 500 # 98.5 2.8E-07 1.7E-10 56.5 21.9 373 1-386 1-492 (500) 156 protein:vir:9815 Length: 500 # 98.5 2.8E-07 1.7E-10 56.5 21.9 373 1-386 1-492 (500) 157 protein:vir:102426 Length: 631 98.5 5.9E-08 3.6E-11 60.2 18.2 371 1-386 1-505 (631) 158 protein:vir:1587 Length: 508 # 98.5 4.3E-07 2.7E-10 55.5 23.4 370 1-386 1-506 (508) 159 protein:vir:94805 Length: 492 98.5 4.5E-07 2.8E-10 55.4 30.9 368 1-386 29-478 (492) 160 protein:vir:4073 Length: 279 # 98.5 9E-10 5.6E-13 70.2 7.7 270 42-368 1-279 (279) 161 protein:vir:9751 Length: 422 # 98.5 4.6E-07 2.9E-10 55.3 26.8 357 1-384 4-422 (422) 162 protein:vir:7768 Length: 484 # 98.4 6.2E-07 3.8E-10 54.6 26.9 365 1-386 1-480 (484) 163 protein:vir:93747 Length: 472 98.4 6.5E-07 4E-10 54.5 30.2 368 1-386 5-468 (472) 164 protein:vir:104082 Length: 485 98.4 7.8E-07 4.9E-10 54.1 28.1 370 1-386 8-483 (485) 165 protein:vir:79703 Length: 505 98.4 8.9E-07 5.5E-10 53.7 26.5 372 1-386 1-505 (505) 166 protein:vir:2427 Length: 485 # 98.4 9.5E-07 5.9E-10 53.6 29.1 367 1-386 6-483 (485) 167 protein:vir:97900 Length: 639 98.4 4.6E-07 2.8E-10 55.3 20.0 371 1-386 1-472 (639) 168 protein:vir:107517 Length: 639 98.4 4.6E-07 2.8E-10 55.3 20.0 371 1-386 1-472 (639) 169 protein:vir:105292 Length: 478 98.4 1.1E-06 6.8E-10 53.3 30.9 367 1-386 1-477 (478) 170 protein:vir:106027 Length: 629 98.4 3.6E-07 2.3E-10 55.9 18.9 370 1-386 1-463 (629) 171 protein:vir:97336 Length: 492 98.3 1.2E-06 7.4E-10 53.1 30.4 367 1-386 29-478 (492) 172 protein:vir:9568 Length: 410 # 98.3 1.5E-06 9.4E-10 52.5 25.9 347 1-385 1-410 (410) 173 protein:vir:9871 Length: 429 # 98.3 1.6E-06 9.6E-10 52.4 26.6 351 1-386 7-427 (429) 174 protein:vir:38 Length: 496 # N 98.3 1.6E-06 9.9E-10 52.4 23.2 374 1-386 16-494 (496) 175 protein:vir:94101 Length: 474 98.3 1.7E-06 1E-09 52.3 30.1 369 1-386 1-474 (474) 176 protein:vir:105889 Length: 474 98.3 1.7E-06 1E-09 52.3 30.1 369 1-386 1-474 (474) 177 protein:vir:78907 Length: 518 98.3 1.9E-06 1.2E-09 51.9 22.6 370 1-384 1-518 (518) 178 protein:vir:733 Length: 453 # 98.3 2E-06 1.2E-09 51.8 26.1 379 1-386 1-445 (453) 179 protein:vir:2500 Length: 501 # 98.3 2E-06 1.3E-09 51.8 25.9 355 1-386 33-496 (501) 180 protein:vir:4223 Length: 486 # 98.2 2.3E-06 1.4E-09 51.5 27.7 370 1-386 6-485 (486) 181 protein:vir:80959 Length: 499 98.2 2.3E-06 1.4E-09 51.5 22.1 373 1-386 16-498 (499) 182 protein:vir:79043 Length: 479 98.2 2.5E-06 1.6E-09 51.3 30.9 367 1-386 20-477 (479) 183 protein:vir:95113 Length: 474 98.1 3.7E-06 2.3E-09 50.3 30.3 365 1-386 2-470 (474) 184 protein:vir:96839 Length: 474 98.1 4.4E-06 2.7E-09 49.9 30.4 366 1-385 1-474 (474) 185 protein:vir:99072 Length: 479 98.1 4.4E-06 2.8E-09 49.9 27.9 365 1-386 15-469 (479) 186 protein:vir:96266 Length: 474 98.1 5.3E-06 3.3E-09 49.5 31.2 361 1-386 46-470 (474) 187 protein:vir:95899 Length: 474 98.1 5.3E-06 3.3E-09 49.5 31.2 361 1-386 46-470 (474) 188 protein:vir:8184 Length: 474 # 98.0 6.1E-06 3.8E-09 49.2 29.1 374 1-386 17-470 (474) 189 protein:vir:2341 Length: 488 # 98.0 7.6E-06 4.7E-09 48.6 25.2 368 1-386 10-481 (488) 190 protein:vir:9306 Length: 511 # 98.0 7.7E-06 4.8E-09 48.6 28.7 375 1-386 46-508 (511) 191 protein:vir:4898 Length: 502 # 98.0 9.1E-06 5.7E-09 48.2 29.7 379 1-386 31-499 (502) 192 protein:vir:95806 Length: 440 97.9 1.1E-05 6.6E-09 47.8 28.9 371 3-386 1-438 (440) 193 protein:vir:4782 Length: 522 # 97.9 1.1E-05 6.8E-09 47.8 25.8 374 1-386 1-519 (522) 194 protein:vir:106639 Length: 481 97.9 1.4E-05 8.5E-09 47.3 27.7 373 1-386 6-477 (481) 195 protein:vir:94498 Length: 474 97.9 1.4E-05 8.5E-09 47.2 31.2 365 1-386 2-460 (474) 196 protein:vir:97447 Length: 474 97.9 1.4E-05 8.5E-09 47.2 31.2 365 1-386 2-460 (474) 197 protein:vir:99522 Length: 470 97.8 1.6E-05 1E-08 46.9 28.8 363 1-386 19-469 (470) 198 protein:vir:3964 Length: 453 # 97.8 2.2E-05 1.3E-08 46.2 26.4 367 1-386 1-444 (453) 199 protein:vir:96240 Length: 511 97.7 2.5E-05 1.6E-08 45.8 29.4 375 1-386 46-508 (511) 200 protein:vir:97376 Length: 320 97.7 1E-06 6.3E-10 53.4 10.2 312 1-376 1-320 (320) 201 protein:vir:97171 Length: 512 97.7 2.7E-05 1.7E-08 45.6 30.4 380 1-386 42-509 (512) 202 protein:vir:103951 Length: 511 97.7 2.7E-05 1.7E-08 45.6 29.9 381 1-386 39-508 (511) 203 protein:vir:102950 Length: 471 97.7 2.8E-05 1.8E-08 45.5 25.7 369 1-386 1-467 (471) 204 protein:vir:107112 Length: 478 97.6 3.3E-05 2.1E-08 45.1 31.7 368 1-386 23-466 (478) 205 protein:vir:99781 Length: 511 97.6 4E-05 2.5E-08 44.7 29.0 374 1-386 46-508 (511) 206 protein:vir:2732 Length: 501 # 97.6 4.3E-05 2.7E-08 44.5 29.5 379 1-386 37-486 (501) 207 protein:vir:96366 Length: 511 97.6 4.4E-05 2.7E-08 44.5 28.8 375 1-386 46-508 (511) 208 protein:vir:78805 Length: 511 97.6 4.4E-05 2.7E-08 44.5 28.8 375 1-386 46-508 (511) 209 protein:vir:105461 Length: 470 97.5 5.5E-05 3.4E-08 43.9 26.4 365 1-386 7-467 (470) 210 protein:vir:78227 Length: 480 97.5 5.9E-05 3.6E-08 43.8 29.1 367 1-386 1-475 (480) 211 protein:vir:106571 Length: 499 97.4 7E-05 4.3E-08 43.4 29.2 370 1-386 1-490 (499) 212 protein:vir:98883 Length: 517 97.4 7.1E-05 4.4E-08 43.3 26.6 369 1-386 1-513 (517) 213 protein:vir:96494 Length: 501 97.4 7.1E-05 4.4E-08 43.3 28.9 379 1-386 38-494 (501) 214 protein:vir:3609 Length: 452 # 97.3 8.8E-05 5.5E-08 42.8 28.5 361 1-386 1-450 (452) 215 protein:vir:78537 Length: 480 97.3 0.0001 6.3E-08 42.5 27.5 366 1-386 1-462 (480) 216 protein:vir:96179 Length: 468 97.3 0.00011 6.7E-08 42.3 30.0 367 1-386 1-465 (468) 217 protein:vir:78083 Length: 537 97.2 0.00013 8.2E-08 41.9 31.9 372 1-386 8-522 (537) 218 protein:vir:94546 Length: 506 97.1 0.00019 1.2E-07 41.0 27.2 369 1-386 28-490 (506) 219 protein:vir:102330 Length: 451 96.3 0.00079 4.9E-07 37.6 24.0 368 1-386 7-451 (451) 220 protein:vir:9922 Length: 489 # 93.8 0.0063 3.9E-06 32.7 26.6 377 1-386 21-480 (489) 221 protein:vir:95149 Length: 501 91.3 0.016 1E-05 30.4 24.8 372 1-386 1-496 (501) 222 protein:vir:105154 Length: 525 87.3 0.04 2.5E-05 28.3 16.6 373 1-386 47-515 (525) 223 protein:vir:94956 Length: 452 80.6 0.093 5.8E-05 26.2 27.9 364 1-386 1-445 (452) 224 protein:vir:97265 Length: 513 77.6 0.12 7.7E-05 25.6 29.7 364 3-386 1-492 (513) 225 protein:vir:78393 Length: 489 76.7 0.13 8.2E-05 25.4 24.7 371 3-386 1-487 (489) 226 protein:vir:95014 Length: 491 76.2 0.14 8.5E-05 25.3 23.9 368 3-383 1-491 (491) 227 protein:vir:96783 Length: 488 62.3 0.34 0.00021 23.2 24.3 366 1-385 14-488 (488) 228 protein:vir:94709 Length: 522 62.1 0.34 0.00021 23.1 22.4 349 1-386 40-482 (522) 229 protein:vir:80453 Length: 535 61.7 0.35 0.00022 23.1 26.3 372 1-386 32-531 (535) 230 protein:vir:3361 Length: 535 # 57.4 0.43 0.00027 22.6 21.7 337 1-386 40-477 (535) 231 protein:vir:5665 Length: 511 # 55.8 0.47 0.00029 22.4 21.7 359 1-386 1-479 (511) 232 protein:vir:104892 Length: 558 54.7 0.5 0.00031 22.2 24.5 377 1-386 5-546 (558) 233 protein:vir:103177 Length: 533 54.4 0.5 0.00031 22.2 23.9 376 1-386 1-527 (533) 234 protein:vir:1538 Length: 535 # 44.1 0.82 0.00051 21.1 21.1 343 1-386 40-477 (535) 235 protein:vir:102668 Length: 547 43.4 0.84 0.00052 21.0 24.8 351 1-386 21-483 (547) 236 protein:vir:98265 Length: 524 39.9 0.99 0.00062 20.6 22.1 365 1-374 13-524 (524) 237 protein:vir:101806 Length: 516 37.3 1.1 0.0007 20.3 22.7 374 1-386 1-515 (516) 238 protein:vir:101189 Length: 516 37.3 1.1 0.0007 20.3 22.7 374 1-386 1-515 (516) 239 protein:vir:104500 Length: 537 34.6 1.3 0.0008 20.0 23.4 377 1-386 1-537 (537) 240 protein:vir:6596 Length: 521 # 25.9 2 0.0012 18.9 23.7 368 1-376 8-521 (521) 241 protein:vir:101418 Length: 569 24.9 2.1 0.0013 18.8 17.4 375 1-386 53-538 (569) 242 protein:vir:81017 Length: 521 23.7 2.3 0.0014 18.6 23.7 366 1-374 8-521 (521) 243 protein:vir:100598 Length: 516 23.7 2.3 0.0014 18.6 23.4 374 1-386 1-515 (516) 244 protein:vir:103765 Length: 549 21.2 2.6 0.0016 18.3 22.8 346 1-386 38-484 (549) No 1 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=100.00 E-value=9.1e-89 Score=503.25 Aligned_cols=373 Identities=19% Similarity=0.278 Sum_probs=319.9 Q ss_pred CchhhhhccccccCCccchhhh--hhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec-----------c Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWI--LNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN-----------A 67 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~--~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~-----------~ 67 (386) ||||+++|+|+.......+... .........++..|+.+.|+++++|++||++||++||++|++++ + T Consensus 1 Mg~f~~lf~r~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~~~~~~~ 80 (414) T protein:vir:44 1 MVFFSGLFQRKSDAPVTTPAELADAIGLSYDTYTGKQISSQRAMRLTAVFSCVRVLAESVGMLPCNLYHLNGSLKQRATG 80 (414) T ss_pred CchhhhhhccCccCcccchhhHhHhhccCccccCCceechhhhhccHHHHHHHHHHHHHhccCceEEEEecCCceeeccc Confidence 9999999998766655554322 33344556788899999999999999999999999999999875 3 Q ss_pred hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccc Q lcl|NC_011801. 68 QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRS 147 (386) Q Consensus 68 ~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~ 147 (386) |+++++|+.+||++||+++||+.++.+++++||||++++++ .|++.+||||+|+.|++..+.++.. .|.+...+ + T Consensus 81 ~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~~~-~y~~~~~~---g 155 (414) T protein:vir:44 81 ERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA-FGEVAELLPVDPGCVVPKLNSSWEP-VYQVTFPD---G 155 (414) T ss_pred chHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCceEEEEECCCCcE-EEEEEecC---c Confidence 78999999999999999999999999999999999999887 6999999999999999998876654 44444432 3 Q ss_pred eeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHH Q lcl|NC_011801. 148 GDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQS 227 (386) Q Consensus 148 ~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~ 227 (386) ....+++++|+|+++. +.++++|+||+..+..+++...++++++.++|+||++|++++++++ .+++|+.++++++ T Consensus 156 ~~~~~~~~evih~~~~----~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~l~~e~~~~~~~~ 230 (414) T protein:vir:44 156 STDVLSQEDIWHVRTL----TLDGLVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQ-TLSDQAYERLKKD 230 (414) T ss_pred eEEEEccccEEEecCC----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC-CCCHHHHHHHHHH Confidence 4567999999999853 3577899999999999999999999999999999999999999875 6899999999999 Q ss_pred HHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC--cccHHHHHHHHHHHHH Q lcl|NC_011801. 228 FEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD--AQSNITMIRAFYQSSL 304 (386) Q Consensus 228 ~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~--~~~~~~~~~~~~~~~l 304 (386) |++.++| +|+|+++++++|++|+++++++.|+||+|.+++++++||++|||||.+|+..+. +++.+++..+|+++|| T Consensus 231 ~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~~~~~~~~~l 310 (414) T protein:vir:44 231 FEERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSL 310 (414) T ss_pred HHHHhcCccccCcceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHH Confidence 9999987 688999999999999999999999999999999999999999999999987654 4566889999999999 Q ss_pred HHHHHHHHHHHHHhhh-------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCccccc Q lcl|NC_011801. 305 SIYIKPIESELSQKLG-------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTNL 377 (386) Q Consensus 305 ~P~~~~ie~~l~~~l~-------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~~ 377 (386) +|++++||++|+++|+ .+++||++.+++.|.+++++++++++++||||+||+|+++|++|+ |++|.. -.+ T Consensus 311 ~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~-~ggD~~--~~~ 387 (414) T protein:vir:44 311 VPYLTRIEQRINTGLVRKSKQGVFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPR-PGGDVY--LTP 387 (414) T ss_pred HHHHHHHHHHHHhhcCCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCccee--ccc Confidence 9999999999999985 357999999999999999999999999999999999999998775 333321 111 Q ss_pred cccCCCC---------CC Q lcl|NC_011801. 378 LDNTKNI---------ND 386 (386) Q Consensus 378 ~~~~~~~---------~~ 386 (386) ...+... .| T Consensus 388 ~n~~~~~~~~~~~~~~~~ 405 (414) T protein:vir:44 388 MNMTTKPSDGSKAGKQKD 405 (414) T ss_pred ccccccCCccccCCCCCC Confidence 1111000 00 No 2 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=100.00 E-value=9.6e-89 Score=503.12 Aligned_cols=380 Identities=18% Similarity=0.243 Sum_probs=322.5 Q ss_pred CchhhhhccccccCCccc-hhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeecc-------hhHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSS-PVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTNA-------QPITD 72 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~-------~~~~~ 72 (386) ||||++..+|....+... ..+......+...++..++...|+++++|++||++||+++|++|+++++ |++++ T Consensus 1 Mg~f~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~~~~~~~~~~ 80 (416) T protein:vir:45 1 MGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVN 80 (416) T ss_pred CCcccccccccccCCCcchhHHHHHhccccccCccccchhhhhcchHHHHHHHHHHHhhccCceEEecCccccccchHHH Confidence 999998766655444333 1222334445566777899999999999999999999999999998754 78999 Q ss_pred HHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEE Q lcl|NC_011801. 73 VLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLY 152 (386) Q Consensus 73 ~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (386) +|+.+||++||+++||+.++.+++++||||++++|+..|++++||||+|+.|++..+.++...++.......+.+....+ T Consensus 81 lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~~ 160 (416) T protein:vir:45 81 LLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIERNV 160 (416) T ss_pred HHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEEEecCCCceeEEEE Confidence 99999999999999999999999999999999999999999999999999999999988877666666655555666789 Q ss_pred cccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_011801. 153 DSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQT 232 (386) Q Consensus 153 ~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~ 232 (386) ++++|||+|+. +.++++|+||+..+.++++...++++++.++|+||++|+++|++++...++++.+++++.|++.+ T Consensus 161 ~~~evihir~~----~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~ 236 (416) T protein:vir:45 161 KFEDMLDIKFY----SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSF 236 (416) T ss_pred ccccEEEeccC----CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHh Confidence 99999999863 45789999999999999999999999999999999999999999886668889999999999999 Q ss_pred cc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 233 TG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYIKPI 311 (386) Q Consensus 233 ~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~~~i 311 (386) .| .|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.++|....+++.+ +...+|.+||+|++++| T Consensus 237 ~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~-~~~~~~~~~l~P~~~~i 315 (416) T protein:vir:45 237 SGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSIT-DANLDYLSTLKPYITCV 315 (416) T ss_pred cCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccHH-HHHHHHHHHHHHHHHHH Confidence 88 68899999999999999999999999999999999999999999999998755444444 44556677999999999 Q ss_pred HHHHHHhhh-----hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCcc-----------c Q lcl|NC_011801. 312 ESELSQKLG-----TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEG-----------T 375 (386) Q Consensus 312 e~~l~~~l~-----~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~-----------~ 375 (386) |++|+++|+ .+++||++.+++.|.+++++++++++++|+||+||+|+++|++|+ |+++.+.. . T Consensus 316 e~~ln~~l~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~-~~gd~~~~~~~~n~~~~~~~ 394 (416) T protein:vir:45 316 CAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPI-PGGNGSIHRVDLNHVNIELV 394 (416) T ss_pred HHHHhhhccccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCCCcceEeecccccccccc Confidence 999999994 578999999999999999999999999999999999999998775 44443221 0 Q ss_pred cccccCCCC---------CC Q lcl|NC_011801. 376 NLLDNTKNI---------ND 386 (386) Q Consensus 376 ~~~~~~~~~---------~~ 386 (386) +.++.++.+ .+ T Consensus 395 ~~~~~~~~~~~~~~~kgGe~ 414 (416) T protein:vir:45 395 DEYQMNKSRATDKKLKGGEE 414 (416) T ss_pred cccCcccccccccccCCCCC Confidence 111122211 11 No 3 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=100.00 E-value=9.6e-89 Score=503.12 Aligned_cols=380 Identities=18% Similarity=0.243 Sum_probs=322.5 Q ss_pred CchhhhhccccccCCccc-hhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeecc-------hhHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSS-PVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTNA-------QPITD 72 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~-------~~~~~ 72 (386) ||||++..+|....+... ..+......+...++..++...|+++++|++||++||+++|++|+++++ |++++ T Consensus 1 Mg~f~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~~~~~~~~~~ 80 (416) T protein:vir:81 1 MGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVN 80 (416) T ss_pred CCcccccccccccCCCcchhHHHHHhccccccCccccchhhhhcchHHHHHHHHHHHhhccCceEEecCccccccchHHH Confidence 999998766655444333 1222334445566777899999999999999999999999999998754 78999 Q ss_pred HHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEE Q lcl|NC_011801. 73 VLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLY 152 (386) Q Consensus 73 ~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (386) +|+.+||++||+++||+.++.+++++||||++++|+..|++++||||+|+.|++..+.++...++.......+.+....+ T Consensus 81 lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~~ 160 (416) T protein:vir:81 81 LLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIERNV 160 (416) T ss_pred HHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEEEecCCCceeEEEE Confidence 99999999999999999999999999999999999999999999999999999999988877666666655555666789 Q ss_pred cccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_011801. 153 DSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQT 232 (386) Q Consensus 153 ~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~ 232 (386) ++++|||+|+. +.++++|+||+..+.++++...++++++.++|+||++|+++|++++...++++.+++++.|++.+ T Consensus 161 ~~~evihir~~----~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~ 236 (416) T protein:vir:81 161 KFEDMLDIKFY----SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSF 236 (416) T ss_pred ccccEEEeccC----CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHh Confidence 99999999863 45789999999999999999999999999999999999999999886668889999999999999 Q ss_pred cc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 233 TG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYIKPI 311 (386) Q Consensus 233 ~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~~~i 311 (386) .| .|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.++|....+++.+ +...+|.+||+|++++| T Consensus 237 ~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~-~~~~~~~~~l~P~~~~i 315 (416) T protein:vir:81 237 SGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSIT-DANLDYLSTLKPYITCV 315 (416) T ss_pred cCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccHH-HHHHHHHHHHHHHHHHH Confidence 88 68899999999999999999999999999999999999999999999998755444444 44556677999999999 Q ss_pred HHHHHHhhh-----hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCcc-----------c Q lcl|NC_011801. 312 ESELSQKLG-----TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEG-----------T 375 (386) Q Consensus 312 e~~l~~~l~-----~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~-----------~ 375 (386) |++|+++|+ .+++||++.+++.|.+++++++++++++|+||+||+|+++|++|+ |+++.+.. . T Consensus 316 e~~ln~~l~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~-~~gd~~~~~~~~n~~~~~~~ 394 (416) T protein:vir:81 316 CAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPI-PGGNGSIHRVDLNHVNIELV 394 (416) T ss_pred HHHHhhhccccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCCCcceEeecccccccccc Confidence 999999994 578999999999999999999999999999999999999998775 44443221 0 Q ss_pred cccccCCCC---------CC Q lcl|NC_011801. 376 NLLDNTKNI---------ND 386 (386) Q Consensus 376 ~~~~~~~~~---------~~ 386 (386) +.++.++.+ .+ T Consensus 395 ~~~~~~~~~~~~~~~kgGe~ 414 (416) T protein:vir:81 395 DEYQMNKSRATDKKLKGGEE 414 (416) T ss_pred cccCcccccccccccCCCCC Confidence 111122211 11 No 4 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=100.00 E-value=1e-88 Score=502.95 Aligned_cols=380 Identities=18% Similarity=0.241 Sum_probs=318.3 Q ss_pred CchhhhhccccccCCccch-hhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec-------chhHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSP-VWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN-------AQPITD 72 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~-------~~~~~~ 72 (386) ||+|.+..+|....+.... .+..........++..++...||++++|++||++||++||++|++++ .|++++ T Consensus 26 ~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~ 105 (441) T protein:vir:94 26 VGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVN 105 (441) T ss_pred cccccccccccccCCCcchHHHHHHhcccCcccccccchhhhhccHHHHHHHHHHHHhhccCceeeecCccccccchHHH Confidence 7777666555433332221 22233334445566778999999999999999999999999999875 478999 Q ss_pred HHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEE Q lcl|NC_011801. 73 VLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLY 152 (386) Q Consensus 73 ~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (386) +|+.+||++||+++||+.++.+++++||||++++|+..|++++||||+|+.|++..+.++...++.+.....+.+....+ T Consensus 106 lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~g~~~~~~~~~~~~~~~~~~~~ 185 (441) T protein:vir:94 106 LLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIERNV 185 (441) T ss_pred HHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEEEeccCCceeEEEE Confidence 99999999999999999999999999999999999999999999999999999999988877666665555555666789 Q ss_pred cccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_011801. 153 DSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQT 232 (386) Q Consensus 153 ~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~ 232 (386) ++++|||+|+. +.++++|+||+..+..+++...+++++..++|+||++|+++|++++...++++.+++++.|++.+ T Consensus 186 ~~~dvih~k~~----~~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~~ 261 (441) T protein:vir:94 186 KFEDMLDIKFY----SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSF 261 (441) T ss_pred ccccEEEeccC----CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHHh Confidence 99999999863 46789999999999999999999999999999999999999999876668899999999999999 Q ss_pred cc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 233 TG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYIKPI 311 (386) Q Consensus 233 ~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~~~i 311 (386) +| .|+|+++|+++|++|++++++++|+||+|.+++++++||++|||||.+||....+++.+++ ..++.+||+|++.+| T Consensus 262 ~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~-~~~~~~tl~P~~~~i 340 (441) T protein:vir:94 262 SGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDA-NLDYLSTLKPYITCV 340 (441) T ss_pred cCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHH-HHHHHHHHHHHHHHH Confidence 88 6899999999999999999999999999999999999999999999999876554444544 445567999999999 Q ss_pred HHHHHHhhh-----hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCcc---ccc------ Q lcl|NC_011801. 312 ESELSQKLG-----TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEG---TNL------ 377 (386) Q Consensus 312 e~~l~~~l~-----~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~---~~~------ 377 (386) |++|+++|+ .+++||++.+++.|.+++++++++++++||||+||+|+++|++|+ |+++.+.. .+. T Consensus 341 e~eln~kl~~~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi-~ggd~~~~~~~~n~~~~~~~ 419 (441) T protein:vir:94 341 CAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPI-PGGNGSIHRVDLNHVNIELV 419 (441) T ss_pred HHHHhhhccccccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCCCcceEeecccccccccc Confidence 999999984 578999999999999999999999999999999999999998775 44443221 010 Q ss_pred --cccCC---------CCCC Q lcl|NC_011801. 378 --LDNTK---------NIND 386 (386) Q Consensus 378 --~~~~~---------~~~~ 386 (386) ++.++ ++++ T Consensus 420 ~~~~~~~~~~~~~~~kgGe~ 439 (441) T protein:vir:94 420 DEYQMNKSRATDKKLKGGEE 439 (441) T ss_pred cccccccccccccccCCCCC Confidence 11111 1111 No 5 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=100.00 E-value=1e-88 Score=502.95 Aligned_cols=380 Identities=18% Similarity=0.241 Sum_probs=318.3 Q ss_pred CchhhhhccccccCCccch-hhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec-------chhHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSP-VWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN-------AQPITD 72 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~-------~~~~~~ 72 (386) ||+|.+..+|....+.... .+..........++..++...||++++|++||++||++||++|++++ .|++++ T Consensus 26 ~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~ 105 (441) T protein:vir:79 26 VGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVN 105 (441) T ss_pred cccccccccccccCCCcchHHHHHHhcccCcccccccchhhhhccHHHHHHHHHHHHhhccCceeeecCccccccchHHH Confidence 7777666555433332221 22233334445566778999999999999999999999999999875 478999 Q ss_pred HHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEE Q lcl|NC_011801. 73 VLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLY 152 (386) Q Consensus 73 ~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (386) +|+.+||++||+++||+.++.+++++||||++++|+..|++++||||+|+.|++..+.++...++.+.....+.+....+ T Consensus 106 lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~g~~~~~~~~~~~~~~~~~~~~ 185 (441) T protein:vir:79 106 LLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIERNV 185 (441) T ss_pred HHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEEEeccCCceeEEEE Confidence 99999999999999999999999999999999999999999999999999999999988877666665555555666789 Q ss_pred cccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_011801. 153 DSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQT 232 (386) Q Consensus 153 ~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~ 232 (386) ++++|||+|+. +.++++|+||+..+..+++...+++++..++|+||++|+++|++++...++++.+++++.|++.+ T Consensus 186 ~~~dvih~k~~----~~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~~ 261 (441) T protein:vir:79 186 KFEDMLDIKFY----SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSF 261 (441) T ss_pred ccccEEEeccC----CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHHh Confidence 99999999863 46789999999999999999999999999999999999999999876668899999999999999 Q ss_pred cc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 233 TG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYIKPI 311 (386) Q Consensus 233 ~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~~~i 311 (386) +| .|+|+++|+++|++|++++++++|+||+|.+++++++||++|||||.+||....+++.+++ ..++.+||+|++.+| T Consensus 262 ~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~-~~~~~~tl~P~~~~i 340 (441) T protein:vir:79 262 SGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDA-NLDYLSTLKPYITCV 340 (441) T ss_pred cCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHH-HHHHHHHHHHHHHHH Confidence 88 6899999999999999999999999999999999999999999999999876554444544 445567999999999 Q ss_pred HHHHHHhhh-----hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCcc---ccc------ Q lcl|NC_011801. 312 ESELSQKLG-----TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEG---TNL------ 377 (386) Q Consensus 312 e~~l~~~l~-----~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~---~~~------ 377 (386) |++|+++|+ .+++||++.+++.|.+++++++++++++||||+||+|+++|++|+ |+++.+.. .+. T Consensus 341 e~eln~kl~~~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi-~ggd~~~~~~~~n~~~~~~~ 419 (441) T protein:vir:79 341 CAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPI-PGGNGSIHRVDLNHVNIELV 419 (441) T ss_pred HHHHhhhccccccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCCCcceEeecccccccccc Confidence 999999984 578999999999999999999999999999999999999998775 44443221 010 Q ss_pred --cccCC---------CCCC Q lcl|NC_011801. 378 --LDNTK---------NIND 386 (386) Q Consensus 378 --~~~~~---------~~~~ 386 (386) ++.++ ++++ T Consensus 420 ~~~~~~~~~~~~~~~kgGe~ 439 (441) T protein:vir:79 420 DEYQMNKSRATDKKLKGGEE 439 (441) T ss_pred cccccccccccccccCCCCC Confidence 11111 1111 No 6 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=100.00 E-value=1.3e-88 Score=502.45 Aligned_cols=380 Identities=18% Similarity=0.232 Sum_probs=318.8 Q ss_pred CchhhhhccccccCCccch-hhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeecc-------hhHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSP-VWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTNA-------QPITD 72 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~-------~~~~~ 72 (386) ||+|.+..+|....+.... .+..........++..++...||++++|++||++||++||++|+++++ |++++ T Consensus 26 ~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~acv~~Ia~~iA~lpl~~~~~~~~~~~~~~~~ 105 (441) T protein:vir:98 26 VGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVN 105 (441) T ss_pred cccccccccccccCCCcchHHHHHHhhcccccCccccchhhhhccHHHHHHHHHHHHhhccCceEEecCCcccccchHHH Confidence 7777666555433332222 222233344455667799999999999999999999999999998864 68999 Q ss_pred HHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEE Q lcl|NC_011801. 73 VLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLY 152 (386) Q Consensus 73 ~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (386) +|+.+||++||+++||+.++.+++++||||++++|+..|++++||||+|+.|++..+.++...++.+.....+.+....+ T Consensus 106 lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~~ 185 (441) T protein:vir:98 106 LLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKLDARGRLYYFHQRIDSNGNNIERNV 185 (441) T ss_pred HHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCceeEEEECCCCcEEEEEEEeccCcceeeEEE Confidence 99999999999999999999999999999999999999999999999999999999888877766666555556667789 Q ss_pred cccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_011801. 153 DSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQT 232 (386) Q Consensus 153 ~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~ 232 (386) ++++|||+++. +.++++|+||+..+..++....++++++.++|+||++|+++|++++...++++.+++++.|++.+ T Consensus 186 ~~~dviHir~~----~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~ 261 (441) T protein:vir:98 186 KFEDMLDIKFY----SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSF 261 (441) T ss_pred ccccEEEeccC----CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHh Confidence 99999999863 46788999999999999999999999999999999999999999876667899999999999999 Q ss_pred cc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 233 TG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYIKPI 311 (386) Q Consensus 233 ~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~~~i 311 (386) +| .|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||....+++.+++...| .+||+|++.+| T Consensus 262 ~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~~y-~~tl~P~~~~i 340 (441) T protein:vir:98 262 SGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANLDY-LSTLKPYITCV 340 (441) T ss_pred cCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHHHH-HHHHHHHHHHH Confidence 88 78999999999999999999999999999999999999999999999998765555555555444 56999999999 Q ss_pred HHHHHHhhh-----hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCcc-----------c Q lcl|NC_011801. 312 ESELSQKLG-----TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEG-----------T 375 (386) Q Consensus 312 e~~l~~~l~-----~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~-----------~ 375 (386) |++|+++|+ .+++||++.+++.|.+++++++++++++|+||+||+|+++|++|++ +++.+.. . T Consensus 341 e~~ln~~L~~~~~~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~-gGd~~~~~~~~n~~~~~~~ 419 (441) T protein:vir:98 341 CAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIP-GGNGSIHRVDLNHVNIELV 419 (441) T ss_pred HHHHHhhccccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC-CCCcceEeecccccccccc Confidence 999999995 4689999999999999999999999999999999999999987654 4443210 0 Q ss_pred cccccCC---------CCCC Q lcl|NC_011801. 376 NLLDNTK---------NIND 386 (386) Q Consensus 376 ~~~~~~~---------~~~~ 386 (386) +.++.++ ++.+ T Consensus 420 ~~~q~~~~~~~~~~~kgGe~ 439 (441) T protein:vir:98 420 DEYQMNKSRATDKKLKGGEE 439 (441) T ss_pred cccccccccccccccCCCCC Confidence 1111222 1111 No 7 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=100.00 E-value=5.1e-88 Score=499.16 Aligned_cols=375 Identities=17% Similarity=0.209 Sum_probs=315.3 Q ss_pred CchhhhhccccccCCc-----------cchhh--------------hhhcccccccCcccccHHHHhccHHHHHHHHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSG-----------SSPVW--------------ILNQGQPVSIKPKAITSAIALKNSDVYAVISRVS 55 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~-----------~~~~~--------------~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia 55 (386) ||+|+++++++..... ..+.. ........+.++..++.+.|+++++|++||++|| T Consensus 1 Mgl~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~ci~~Ia 80 (431) T protein:vir:10 1 MGLFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRGELNGGTGRETRALRNMAVLRCVTLIS 80 (431) T ss_pred CcchhhhhcCcccccccccccccccccccccccccccccccccchHHHHhhccCccCcceechhhhhccHHHHHHHHHHH Confidence 9999988765432110 00100 0111223345677889999999999999999999 Q ss_pred HhhccCceeec----------chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceE Q lcl|NC_011801. 56 SDIAGCRFVTN----------AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVT 125 (386) Q Consensus 56 ~~ia~~p~~~~----------~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~ 125 (386) +++|++|++++ +|+++++|+.+||++||+++||+.++.+++++||||+++.|+. |.+++|||++|..|+ T Consensus 81 ~~iA~lp~~v~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~-g~~~~L~pl~~~~v~ 159 (431) T protein:vir:10 81 GTIGMLPMNLISSDDSKQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWSG-NRPIRLIPMDRGSAK 159 (431) T ss_pred HhhccCceEEEEecCceeeeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC-CceEEEEEEcCceeE Confidence 99999999864 4789999999999999999999999999999999999999984 899999999999999 Q ss_pred EeecCCCceeEEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCC Q lcl|NC_011801. 126 VALDDYGKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKP 205 (386) Q Consensus 126 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~ 205 (386) +..+.++.. .|.+... .+..+.+++++|+|+|+. +.++++|+||+..+..++.+..+++++..++|+||++| T Consensus 160 ~~~~~~~~~-~y~~~~~---~g~~~~~~~~dViHir~~----~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p 231 (431) T protein:vir:10 160 GRLTSTWQI-VYDYTTP---TGDKIELPAREVFHLRDL----SIDGVSGVSRVKLSGNALELAEQAERAASRTFRTGVMA 231 (431) T ss_pred EEEcCCCeE-EEEEEeC---CceEEEEchhhEEEecCc----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCc Confidence 988776554 4444432 345677999999999854 25678999999999999999999999999999999999 Q ss_pred ceEEeeCCCCCCHHHHHHHHHHHHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhc Q lcl|NC_011801. 206 SIFIKVPNATLGKEAKENTRQSFEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLS 284 (386) Q Consensus 206 ~~~l~~~~~~~~~~~~~~~k~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~ 284 (386) +++|++++ .+++++.+++++.|++.++| +|+|+++++++|++|++++++++|+||+|++++++++||++|||||.+|+ T Consensus 232 ~gil~~~~-~ls~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg 310 (431) T protein:vir:10 232 GGAIEVPK-ELSDNAYGRMKASVQENHTGSENAGSWMLLEEGATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLM 310 (431) T ss_pred cEEEecCC-CCCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhC Confidence 99999976 78999999999999999988 68999999999999999999999999999999999999999999999999 Q ss_pred CCcC--cccHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-------hhhhhcchhhhccCHHHHHHHHHHHHhCC----Cc Q lcl|NC_011801. 285 GKQD--AQSNITMIRAFYQSSLSIYIKPIESELSQKLG-------TDVKLDIASAIDSDNSELINNVQKLASAG----VL 351 (386) Q Consensus 285 ~~~~--~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~-------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g----~~ 351 (386) ..+. +++.+++..+|+++||.|++++||++|+++|+ .+++||++.+++.|.+++++.++++++.| || T Consensus 311 ~~~~~t~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~~g~l 390 (431) T protein:vir:10 311 MDDTSWGSGIEQLAIFFIQYGLSHWFVSWEQAAARAFLPEKMLGQRQFKFNEGALLRGTLNDQAAFFSKALGAGGQSPWM 390 (431) T ss_pred CCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhcCCceEEEechhhhccCHHHHHHHHHHHHhcccccCcc Confidence 7644 55678899999999999999999999999985 45899999999999999999999998655 59 Q ss_pred CHHHHHHHhccCCcCCCCCCCccccccccCCCCCC Q lcl|NC_011801. 352 APIQAQKLLKNRGVFPELDLDEGTNLLDNTKNIND 386 (386) Q Consensus 352 t~nE~R~~lg~~p~~p~~~~~~~~~~~~~~~~~~~ 386 (386) |+||+|+++|++|+ +++++|+.-.++..+..... T Consensus 391 T~NE~R~~~gl~p~-~~~~gD~~~~p~n~~~~~~~ 424 (431) T protein:vir:10 391 KQNEVREMLDLPRA-DDPVADQLRNPMTQKQKGSG 424 (431) T ss_pred CHHHHHHHhCCCCC-CCccccceecccccccCCCC Confidence 99999999998775 45555554444432222211 No 8 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=100.00 E-value=5.7e-88 Score=498.88 Aligned_cols=375 Identities=16% Similarity=0.241 Sum_probs=312.9 Q ss_pred Cchhhhhc---cccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec----------- Q lcl|NC_011801. 1 MAFLSNLF---KRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN----------- 66 (386) Q Consensus 1 Mg~~~~l~---~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~----------- 66 (386) =|||.++. .+.....+...............++..|+.+.|+++++|++||++||++||++|++++ T Consensus 14 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~ 93 (424) T protein:vir:18 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) T ss_pred CchHHHHHhhhcccccccccccccccccccccccccccccHHHhhccHHHHHHHHHHHHhhccCceEEEEeecCCceeee Confidence 57777653 2222222221111122223334567789999999999999999999999999999764 Q ss_pred --chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCc Q lcl|NC_011801. 67 --AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDS 144 (386) Q Consensus 67 --~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~ 144 (386) +|+++++|+.+||++||+++||+.++.+++++||||++++|+..|++++|||++|+.|++..+.+ ...|.+..+ T Consensus 94 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~pl~~~~V~v~~~~~--~~~y~~~~~-- 169 (424) T protein:vir:18 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK--KVVYRYQRD-- 169 (424) T ss_pred ccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcceEEEEcCC--eEEEEEEeC-- Confidence 47899999999999999999999999999999999999999999999999999999999987654 445555432 Q ss_pred ccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHH Q lcl|NC_011801. 145 KRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENT 224 (386) Q Consensus 145 ~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~ 224 (386) +..+.++++||+|+|+.. .++++|+||+..+..+++.+.+++++..++|+||++|++++++++..+++++++++ T Consensus 170 --g~~~~~~~~eIih~r~~~----~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~ 243 (424) T protein:vir:18 170 --SEYADFSQKEIFHLKGFG----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQV 243 (424) T ss_pred --CeEEEeccccEEEecCcC----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCcCCCHHHHHHH Confidence 455679999999998642 46789999999999999999999999999999999999999998877899999999 Q ss_pred HHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCc----ccHHHHHHHHH Q lcl|NC_011801. 225 RQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDA----QSNITMIRAFY 300 (386) Q Consensus 225 k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~----~~~~~~~~~~~ 300 (386) ++.|++.+++.|+|++++|++|++|++++++++|+||+|++++++++||++|||||.+||..+.+ ++.+++..+|+ T Consensus 244 ~~~~~~~~~g~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~ 323 (424) T protein:vir:18 244 EENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) T ss_pred HHHHHHHhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHH Confidence 99999999888999999999999999999999999999999999999999999999999876543 46688999999 Q ss_pred HHHHHHHHHHHHHHHHHhhhh-------hhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCc Q lcl|NC_011801. 301 QSSLSIYIKPIESELSQKLGT-------DVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDE 373 (386) Q Consensus 301 ~~~l~P~~~~ie~~l~~~l~~-------~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~ 373 (386) ++||.|+++.||++|+++|++ +++||++++++.|.+++++++.+++++|+||+||+|+++|++|+ |++|... T Consensus 324 ~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi-~gGD~~~ 402 (424) T protein:vir:18 324 QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPL-PGGDVAM 402 (424) T ss_pred HHHHHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCcCeee Confidence 999999999999999999953 48899999999999999999999999999999999999998876 3333211 Q ss_pred -ccc--cccc---CCCCCC Q lcl|NC_011801. 374 -GTN--LLDN---TKNIND 386 (386) Q Consensus 374 -~~~--~~~~---~~~~~~ 386 (386) ..+ ++.. ++...| T Consensus 403 ~~~n~~~l~~~~~~~~p~~ 421 (424) T protein:vir:18 403 RQSQYVPITDLGTNKEPRN 421 (424) T ss_pred eccCccchHhhhccCCCcc Confidence 111 1111 111111 No 9 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=100.00 E-value=9.5e-88 Score=497.68 Aligned_cols=378 Identities=20% Similarity=0.255 Sum_probs=316.3 Q ss_pred CchhhhhccccccCCccchhhh----------hhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec---- Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWI----------LNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN---- 66 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~---- 66 (386) ||||+++|++++.......... .............++.+.|+++++|++||++||++||++|++++ T Consensus 1 MG~f~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~al~~~~v~~ci~~ia~~iA~lp~~~~~~~~ 80 (422) T protein:vir:13 1 MGFLRGLFNKKNNNDEKRSNYDEDIGIDISDSNFWEKFGIKLNFSVRGKRALKENTVYVCTKIRAESIGKLSLKIYKDKE 80 (422) T ss_pred CchhhhhhhccCCccchhhhhhhccccccCcchhhhhccccCCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEecCc Confidence 9999999987654432211100 00011111234468999999999999999999999999999874 Q ss_pred ---chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCce-----eEEE Q lcl|NC_011801. 67 ---AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKD-----LTYT 138 (386) Q Consensus 67 ---~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~-----~~~~ 138 (386) +|+++++|+.+||++||+++||+.++.+++++||||++++|+..|++++||||+|+.|++..++++.. .+|. T Consensus 81 ~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~~~~~~~~~~~~~~~~~y~ 160 (422) T protein:vir:13 81 EYKEHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDRKGKIIGLYPINSDNVTKIIDDDNFLSSLSKVWYV 160 (422) T ss_pred ccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEcCCcceeccceEEEE Confidence 47899999999999999999999999999999999999999999999999999999999999887643 4454 Q ss_pred EeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCH Q lcl|NC_011801. 139 VHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGK 218 (386) Q Consensus 139 ~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~ 218 (386) +... .+....+++++|+|+++. .+.++++|+||+..+..++....++++++.++|+||++|+++|++++ .+++ T Consensus 161 ~~~~---~g~~~~~~~~eiih~~~~---~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~l~~ 233 (422) T protein:vir:13 161 VTDK---NGKEHKLLPDEMLHFIGD---ITLDGLIGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYVG-DLDE 233 (422) T ss_pred EEeC---CCeEEEEcccceEEEcCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC-CCCH Confidence 4433 345677999999999854 35678999999999999999999999999999999999999999975 6899 Q ss_pred HHHHHHHHHHHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC--cccHHHH Q lcl|NC_011801. 219 EAKENTRQSFEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD--AQSNITM 295 (386) Q Consensus 219 ~~~~~~k~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~--~~~~~~~ 295 (386) ++.+++++.|++.+.| .|+|+++++++|++|+++++++.|+||+|.+++++++||++|||||.+|+..+. +++.+++ T Consensus 234 e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~e~~ 313 (422) T protein:vir:13 234 KAKKIFKKEFESMSNGLENAHSISLLPFGYQFQPISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERATFNNLTEQ 313 (422) T ss_pred HHHHHHHHHHHHHhcCccccCCceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHH Confidence 9999999999999988 678999999999999999999999999999999999999999999999987654 4567889 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhh--------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCC Q lcl|NC_011801. 296 IRAFYQSSLSIYIKPIESELSQKLG--------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFP 367 (386) Q Consensus 296 ~~~~~~~~l~P~~~~ie~~l~~~l~--------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p 367 (386) ..+|+++||+|++++||++|+++|+ .+++||++.+++.|.+++++++++++++|+||+||+|+++|++|+ | T Consensus 314 ~~~f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~-~ 392 (422) T protein:vir:13 314 QKDFYVTTLQSSLTVYEQEIQDKLFSQYETLQDVKAEFNVDTILRSDIKTRYEAYRIGIQGGFIEANEARRRENLPPV-E 392 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhCChhhhcCCceEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-C Confidence 9999999999999999999999984 467899999999999999999999999999999999999998776 3 Q ss_pred CCCCCc-ccc--cc----cc-CCCCCC Q lcl|NC_011801. 368 ELDLDE-GTN--LL----DN-TKNIND 386 (386) Q Consensus 368 ~~~~~~-~~~--~~----~~-~~~~~~ 386 (386) ++|... ..+ ++ +. .+++.+ T Consensus 393 ggD~~~~~~n~~~l~~~~~~~~~~g~~ 419 (422) T protein:vir:13 393 GGDRLLVNGNMIPIEMAGEQYKKGGEK 419 (422) T ss_pred CcCeeeeccCccchhhcccccccCCCc Confidence 333111 011 11 01 111111 No 10 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=100.00 E-value=1.3e-87 Score=496.92 Aligned_cols=372 Identities=17% Similarity=0.229 Sum_probs=315.7 Q ss_pred CchhhhhccccccCCccchhhhhhc-ccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec-----------ch Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQ-GQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN-----------AQ 68 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~-~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~-----------~~ 68 (386) +.||++||+++....++.+...... ......++..|+.+.||++++|++||++||++||++|++++ +| T Consensus 16 ~~~~~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~al~~~~v~~cv~~Ia~~iA~lp~~v~~~~~~~~~~~~~~ 95 (424) T protein:vir:45 16 RVLLDALFRSKSLENPSTPITGDAVDTDGLFRADVYVSPETAMKLAAVYSCIYVLSSSLAQMPLHVMRRHKGKVEPARDH 95 (424) T ss_pred hHHHHhhccccCCCCCccccchhhhhhhccccCCceechHHhhccHHHHHHHHHHHHHHhhCceEEEEecCCceeecccc Confidence 8899999988765555444332221 12233456789999999999999999999999999999874 47 Q ss_pred hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccce Q lcl|NC_011801. 69 PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSG 148 (386) Q Consensus 69 ~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~ 148 (386) +++++|+.+||++||+++||+.++.+++++||||+++.|+..|.+++|||++|+.|++..+.+ ...|.+... ++ T Consensus 96 ~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~i~~~~~--~~~y~~~~~----~~ 169 (424) T protein:vir:45 96 PAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRRGEVISLDCCMPWETTLMNTGG--RYTYGLYNE----YG 169 (424) T ss_pred hHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEEcCC--eEEEEEEec----Cc Confidence 899999999999999999999999999999999999999999999999999999999986654 344444322 33 Q ss_pred eEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHH Q lcl|NC_011801. 149 DFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSF 228 (386) Q Consensus 149 ~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~ 228 (386) ...+++++|+|+|+. +.++++|+||+..+..++....++++++.++|+||++|+++|++++ .+++++.+++++.| T Consensus 170 ~~~~~~~eVih~r~~----~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~l~~e~~~~~~~~~ 244 (424) T protein:vir:45 170 AFAISPDDMIHIRAL----GNNQKMGLSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKS-GLNKESWGWLKDQW 244 (424) T ss_pred eEEECcccEEEecCc----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC-CCCHHHHHHHHHHH Confidence 467999999999853 2468899999999999999999999999999999999999999976 68999999999999 Q ss_pred HHHhcc--cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC--cccHHHHHHHHHHHHH Q lcl|NC_011801. 229 EEQTTG--ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD--AQSNITMIRAFYQSSL 304 (386) Q Consensus 229 ~~~~~~--~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~--~~~~~~~~~~~~~~~l 304 (386) ++.+.| .|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||..+. +++.+++.++|+++|| T Consensus 245 ~~~~~g~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL 324 (424) T protein:vir:45 245 QKASQALRRQENKTMLLPADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNISAQAIQFVRYTM 324 (424) T ss_pred HHHhccccccCCceeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHH Confidence 998877 578999999999999999999999999999999999999999999999987654 4566899999999999 Q ss_pred HHHHHHHHHHHHHhhh--------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCcccc Q lcl|NC_011801. 305 SIYIKPIESELSQKLG--------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTN 376 (386) Q Consensus 305 ~P~~~~ie~~l~~~l~--------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~ 376 (386) .|++++||++||++|+ .+++||++.+++.|.+++++.+++++++|+||+||+|+++|++|++ ++ |+.-. T Consensus 325 ~P~~~~ie~~ln~kLl~~~e~~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g~~T~NE~R~~~gl~pi~-gg--D~~~~ 401 (424) T protein:vir:45 325 MPWVTNWEQELNRRLFTRAELAAGYYVRFNLTGLLRGTPQERAQFYHFAITDGWMSRNEARAFEDMNPVE-GL--DEMLV 401 (424) T ss_pred HHHHHHHHHHHHHhcCChhhhcCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC-Cc--ceeee Confidence 9999999999999985 4589999999999999999999999999999999999999987763 32 22111 Q ss_pred cc------------ccCCCCCC Q lcl|NC_011801. 377 LL------------DNTKNIND 386 (386) Q Consensus 377 ~~------------~~~~~~~~ 386 (386) ++ ...++.+| T Consensus 402 ~~n~~~~~~~~~~~~~~~~~~~ 423 (424) T protein:vir:45 402 SVNAANPAGDFKPPKNDEGKTN 423 (424) T ss_pred cccccccccccCCCCCCCCCCC Confidence 11 11111111 No 11 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=100.00 E-value=2.5e-87 Score=495.36 Aligned_cols=375 Identities=16% Similarity=0.248 Sum_probs=313.0 Q ss_pred Cchhhhh---ccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec----------- Q lcl|NC_011801. 1 MAFLSNL---FKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN----------- 66 (386) Q Consensus 1 Mg~~~~l---~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~----------- 66 (386) =|||+++ |++.....+...............++..|+.+.|+++++|++||++||++||++|++++ T Consensus 14 ~g~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~~iA~lp~~vy~~~~~~~~~~~ 93 (424) T protein:vir:18 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) T ss_pred CchHHHHHhhccccccccccchhhccccccccccccccccHHHhhccHHHHHHHHHHHHhhccCceEEEEeccCCceeee Confidence 5677664 33332222222222222223334566779999999999999999999999999999874 Q ss_pred --chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCc Q lcl|NC_011801. 67 --AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDS 144 (386) Q Consensus 67 --~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~ 144 (386) +|+++++|+.+||++||+++||+.++.+++++||||++++|+..|++++|||++|+.|++..+.+ ...|.+..+ T Consensus 94 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~--~~~y~~~~~-- 169 (424) T protein:vir:18 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK--KVVYRYQRD-- 169 (424) T ss_pred ccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcceEEEEcCC--eEEEEEEeC-- Confidence 37899999999999999999999999999999999999999999999999999999999987654 445555432 Q ss_pred ccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHH Q lcl|NC_011801. 145 KRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENT 224 (386) Q Consensus 145 ~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~ 224 (386) +....++++||+|+|+. +.++++|+||+..+..++..+.+++++..++|+||++|++++++++..+++++.+++ T Consensus 170 --g~~~~~~~~eVihir~~----~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~ 243 (424) T protein:vir:18 170 --SEYADFSQKEIFHLKGF----GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQV 243 (424) T ss_pred --CeEEEeccccEEEecCc----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCcCCCHHHHHHH Confidence 45567999999999854 246789999999999999999999999999999999999999998877899999999 Q ss_pred HHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCc----ccHHHHHHHHH Q lcl|NC_011801. 225 RQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDA----QSNITMIRAFY 300 (386) Q Consensus 225 k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~----~~~~~~~~~~~ 300 (386) ++.|++.+++.++|++++|++|++|+++++++.|+||+|++++++++||++|||||.+||..+.+ ++.+++..+|+ T Consensus 244 ~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~ 323 (424) T protein:vir:18 244 EENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) T ss_pred HHHHHHHhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHH Confidence 99999989888999999999999999999999999999999999999999999999999876543 45688999999 Q ss_pred HHHHHHHHHHHHHHHHHhhh-------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCc Q lcl|NC_011801. 301 QSSLSIYIKPIESELSQKLG-------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDE 373 (386) Q Consensus 301 ~~~l~P~~~~ie~~l~~~l~-------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~ 373 (386) ++||.|++++||++|+++|+ .+++||++++++.|.+++++.+++++++|+||+||+|+++|++|+ |++|... T Consensus 324 ~~tl~P~~~~ie~~ln~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi-~ggD~~~ 402 (424) T protein:vir:18 324 QYTLQPYISRWENSIQRWLIPSKDVGRLHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNMPPL-PGGDVAM 402 (424) T ss_pred HHHHHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCcCeee Confidence 99999999999999999995 358999999999999999999999999999999999999998776 3433221 Q ss_pred -ccc--cccc----CCCCCC Q lcl|NC_011801. 374 -GTN--LLDN----TKNIND 386 (386) Q Consensus 374 -~~~--~~~~----~~~~~~ 386 (386) ..+ ++.. .+..++ T Consensus 403 ~~~n~~~l~~~~~~~~~~~n 422 (424) T protein:vir:18 403 RQAQYVPITDLGTNKEPRNN 422 (424) T ss_pred eccCccchhhhhccCCcccc Confidence 111 1110 011111 No 12 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=100.00 E-value=3e-87 Score=494.92 Aligned_cols=372 Identities=18% Similarity=0.255 Sum_probs=312.1 Q ss_pred Cchhhhhccc---cccCCc-cchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec---------- Q lcl|NC_011801. 1 MAFLSNLFKR---QKMLSG-SSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN---------- 66 (386) Q Consensus 1 Mg~~~~l~~~---~~~~~~-~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~---------- 66 (386) ||||+|+++. ...... +.+.+. .+..+..++.+.|+++++|++||++||++||++|++++ T Consensus 1 MG~~~~~~~~~~~~~~~~~~~~~~~~------~~~g~~~~~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~ 74 (411) T protein:vir:81 1 MGWWSRLTRFFRPRNETVDMTNPLLL------QWLGVDPDTPRNQLSEATYFACLKILSESLGKLPLKMYQKTERGIVKS 74 (411) T ss_pred CchHHHHHhhccCcccccccchHHHH------HHhcCcccChhhhhccHHHHHHHHHHHHhHhhCceeEEEecCCceeee Confidence 9999987542 222211 222221 22345667889999999999999999999999999774 Q ss_pred -chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCce-----eEEEEe Q lcl|NC_011801. 67 -AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKD-----LTYTVH 140 (386) Q Consensus 67 -~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~-----~~~~~~ 140 (386) +|+++++|+.+||++||+++||+.++.+++++||||++++++ .|++.+||||+|+.|++..++.+.. .+|.+. T Consensus 75 ~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~-~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~ 153 (411) T protein:vir:81 75 DREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYS-GPQLQALWILPSQYVTIVVDDRGLLGEKNAIWYRYN 153 (411) T ss_pred cccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCceEEEEEECCceEEEEEcCcccccccceEEEEEE Confidence 478999999999999999999999999999999999999998 6899999999999999998876532 233333 Q ss_pred ccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHH Q lcl|NC_011801. 141 FDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEA 220 (386) Q Consensus 141 ~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~ 220 (386) ....+..+.+++++|+|+|+. .+.++++|+||+..+..++....+++++..++|+||+.|+++|++++ .+++++ T Consensus 154 --~~~~g~~~~~~~~eiih~k~~---~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~l~~e~ 227 (411) T protein:vir:81 154 --DPYDGKMYVFRNDEILHFKTS---VTFDGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTG-DLNQEA 227 (411) T ss_pred --ecCCceEEEEccccEEEEcCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC-CCCHHH Confidence 233456678999999999864 34678999999999999999999999999999999999999999875 689999 Q ss_pred HHHHHHHHHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC--cccHHHHHH Q lcl|NC_011801. 221 KENTRQSFEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD--AQSNITMIR 297 (386) Q Consensus 221 ~~~~k~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~--~~~~~~~~~ 297 (386) .++++++|++.+.| +|+|+++++++|++|++++++++|+||+|++++++++||++|||||.+||..+. +++.+++.. T Consensus 228 ~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~ 307 (411) T protein:vir:81 228 RDRLVKGFEQFANGSKNAGKIIPVPLGMKLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKSSYASAEAQNL 307 (411) T ss_pred HHHHHHHHHHHhcCccccCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHHHHH Confidence 99999999999888 688999999999999999999999999999999999999999999999987654 456688899 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhh--------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCC Q lcl|NC_011801. 298 AFYQSSLSIYIKPIESELSQKLG--------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPEL 369 (386) Q Consensus 298 ~~~~~~l~P~~~~ie~~l~~~l~--------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~ 369 (386) +|+++||.|++++||++|+++|+ .+++||++.+++.|.+++++++++++++|+||+||+|+++|++|+ |++ T Consensus 308 ~f~~~~l~P~~~~ie~~l~~~ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R~~~gl~p~-~gg 386 (411) T protein:vir:81 308 AFYVDTLLYVLKQYEEEITYKILSNDLISQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEARDYLDMPAD-DYG 386 (411) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCChhhcCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCC Confidence 99999999999999999999885 457999999999999999999999999999999999999998775 443 Q ss_pred CCCc-cc--ccc----ccC-CCCCC Q lcl|NC_011801. 370 DLDE-GT--NLL----DNT-KNIND 386 (386) Q Consensus 370 ~~~~-~~--~~~----~~~-~~~~~ 386 (386) |... .. .++ .+. +++++ T Consensus 387 D~~~~~~n~~pl~~~~~~~~kgGd~ 411 (411) T protein:vir:81 387 NNLMANGNYIPLSMLGANYGKGGDS 411 (411) T ss_pred CeeeeccCccchhhhhhhhccCCCC Confidence 3221 11 112 112 22333 No 13 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=100.00 E-value=9.1e-87 Score=492.29 Aligned_cols=368 Identities=17% Similarity=0.167 Sum_probs=315.5 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec------------ch Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN------------AQ 68 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~------------~~ 68 (386) |+|++.+++++..................+.++..++.+.|+++++|++||++||++||++|++++ +| T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~g~~~~~~~~ 80 (419) T protein:vir:57 1 MFIPQFWKGRPSENRVNWQVVPGGMRSSSSQAGVIITPETALALSAVRACVTLLAESVAQLPCVLYRRTENGGREIAFDH 80 (419) T ss_pred CcchhhhccCCccccccccccccccccccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEEcCCCceeccccc Confidence 999997776655443333322233334556677889999999999999999999999999999873 47 Q ss_pred hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccce Q lcl|NC_011801. 69 PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSG 148 (386) Q Consensus 69 ~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~ 148 (386) ++.++|+.+||++||+++||+.++.+++++||||++++|+..|.+++||||+|+.|++..+.++.. +|.+.. . T Consensus 81 ~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~pl~~~~v~v~~~~~g~~-~y~~~~------~ 153 (419) T protein:vir:57 81 PLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGDITELIPINPHKVIVLKGPDGMP-YYDIPS------I 153 (419) T ss_pred hHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcceEEEECCCceE-EEEEcC------C Confidence 799999999999999999999999999999999999999999999999999999999998877654 344321 1 Q ss_pred eEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCC---CCCCHHHHHHHH Q lcl|NC_011801. 149 DFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPN---ATLGKEAKENTR 225 (386) Q Consensus 149 ~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~---~~~~~~~~~~~k 225 (386) ...++.++|+|+++. +.++++|+||+..+..++....++++++.++|+||++|+++|+++. ..+++++.++++ T Consensus 154 ~~~~~~~~vih~r~~----~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~e~~~~~~ 229 (419) T protein:vir:57 154 GEILPMRMVHHIKSF----SLDGYIGTSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFEAKAIASQAAVDAIL 229 (419) T ss_pred ceEEchhhEEEecCc----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEecCcCCcccCHHHHHHHH Confidence 245899999999854 3567899999999999999999999999999999999999999863 457899999999 Q ss_pred HHHHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC--cccHHHHHHHHHHH Q lcl|NC_011801. 226 QSFEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD--AQSNITMIRAFYQS 302 (386) Q Consensus 226 ~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~--~~~~~~~~~~~~~~ 302 (386) +.|.+.++| .|+|+++++++|++|++++++++|+||+|++++++++||++|||||.+|+..+. +++.+++.++|+++ T Consensus 230 ~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~ 309 (419) T protein:vir:57 230 AKWTERYGGVRNAFSVGMLQEGMTYKQLSQDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKSTNNNIEHQGLQYVIY 309 (419) T ss_pred HHHHHHhccccccccceecCCCceEEEcCCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCccccHHHHHHHHHHH Confidence 999999988 688999999999999999999999999999999999999999999999987654 45668999999999 Q ss_pred HHHHHHHHHHHHHHHhhh-------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCccc Q lcl|NC_011801. 303 SLSIYIKPIESELSQKLG-------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGT 375 (386) Q Consensus 303 ~l~P~~~~ie~~l~~~l~-------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~ 375 (386) ||+|++++||++|+++|+ .+++||++.+++.|.+++++++++++++|+||+||+|+++|++|+ |++| + T Consensus 310 ~l~P~~~~ie~~l~~~ll~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~-~ggD--~-- 384 (419) T protein:vir:57 310 TMLAILKRHESAMMRDLLLPSERRDFYIEFNVSSLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLTPI-PGGD--K-- 384 (419) T ss_pred HHHHHHHHHHHHHHhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCcC--e-- Confidence 999999999999999985 368999999999999999999999999999999999999998775 3332 2 Q ss_pred cccccCCCCCC Q lcl|NC_011801. 376 NLLDNTKNIND 386 (386) Q Consensus 376 ~~~~~~~~~~~ 386 (386) +..+.|-.+ T Consensus 385 --~~~~~n~~~ 393 (419) T protein:vir:57 385 --YLTPLNMVD 393 (419) T ss_pred --eeecccccc Confidence 112222111 No 14 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=100.00 E-value=1.9e-86 Score=490.54 Aligned_cols=377 Identities=20% Similarity=0.248 Sum_probs=312.7 Q ss_pred Cchhhhhcc---ccccCCc----cchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec------- Q lcl|NC_011801. 1 MAFLSNLFK---RQKMLSG----SSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN------- 66 (386) Q Consensus 1 Mg~~~~l~~---~~~~~~~----~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~------- 66 (386) ||||.++|+ |+..... ..+...... ....++..++.+.|+++++|++||++||++||++|++++ T Consensus 1 M~~~~~~f~~~~r~~~~~~~~~~~~~~~~~~~--g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~l~~~~~~~~~~~~ 78 (429) T protein:vir:10 1 MDSVKKFFNFEKRQTSQVIELNKDDEKLLEWL--GISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYGI 78 (429) T ss_pred CchhhhhhcccccCcccccccCCChHHHHHHh--cCCCCcceechhhhhccHHHHHHHHHHHHhhccCceEEEEecCCce Confidence 999999885 3221111 111122222 223356678999999999999999999999999999864 Q ss_pred ----chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeE---EEE Q lcl|NC_011801. 67 ----AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLT---YTV 139 (386) Q Consensus 67 ----~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~---~~~ 139 (386) +|+++++|+.+||++||+++||+.++.+++++||||+++.|+..|++++|||++|+.|++..++.+.... ..+ T Consensus 79 ~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~~~~~~~~~ 158 (429) T protein:vir:10 79 QRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMWY 158 (429) T ss_pred eeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccceEEE Confidence 4679999999999999999999999999999999999999999999999999999999999886543321 111 Q ss_pred eccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHH Q lcl|NC_011801. 140 HFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKE 219 (386) Q Consensus 140 ~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~ 219 (386) .+. ..+....+++++|||+++. .+.++++|+||+..+..++....+++++..++|+||++|++++++++ .++++ T Consensus 159 ~~~--~~g~~~~~~~~evih~~~~---~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~-~l~~e 232 (429) T protein:vir:10 159 VVN--TGGQQRVLKPEEILHFKNG---ITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVG-DLNED 232 (429) T ss_pred EEc--cCCeEEEEccccEEEecCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC-CCCHH Confidence 222 2345678999999999864 35678999999999999999999999999999999999999999876 68999 Q ss_pred HHHHHHHHHHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCc--CcccHHHHH Q lcl|NC_011801. 220 AKENTRQSFEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQ--DAQSNITMI 296 (386) Q Consensus 220 ~~~~~k~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~--~~~~~~~~~ 296 (386) +.+++++.|++.++| .|+|+++++++|++|+++++++.|+|++|.+++++++||++|||||.+||..+ ++++.+++. T Consensus 233 ~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn~e~~~ 312 (429) T protein:vir:10 233 AKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQ 312 (429) T ss_pred HHHHHHHHHHHHhccccccCceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH Confidence 999999999999888 68899999999999999999999999999999999999999999999998654 445678999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhh--------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCC Q lcl|NC_011801. 297 RAFYQSSLSIYIKPIESELSQKLG--------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPE 368 (386) Q Consensus 297 ~~~~~~~l~P~~~~ie~~l~~~l~--------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~ 368 (386) .+|++.||+|++++||++||++|+ .+++||++.+++.|.+++++++++++++||||+||+|+++|++|+ |+ T Consensus 313 ~~f~~~~l~P~~~~ie~~ln~kl~~~~~~~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~-~g 391 (429) T protein:vir:10 313 QQFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPE-AG 391 (429) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CC Confidence 999999999999999999999984 467999999999999999999999999999999999999998775 34 Q ss_pred CCCCc-ccc--cc-----ccCCCCCC Q lcl|NC_011801. 369 LDLDE-GTN--LL-----DNTKNIND 386 (386) Q Consensus 369 ~~~~~-~~~--~~-----~~~~~~~~ 386 (386) +|... ..+ ++ ...+++++ T Consensus 392 gD~~~~~~n~~~~d~~~~~~~k~g~~ 417 (429) T protein:vir:10 392 GDRLLVNGNMLPIDMAGQAYLKGGDT 417 (429) T ss_pred cCeeeecccccchhhccccccCCCCC Confidence 33111 000 00 01122222 No 15 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=100.00 E-value=1.2e-86 Score=491.70 Aligned_cols=367 Identities=17% Similarity=0.220 Sum_probs=312.6 Q ss_pred CchhhhhccccccCCccchhhhhh---cccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec----------- Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILN---QGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN----------- 66 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~---~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~----------- 66 (386) |.|++.+++++.+.+ ..+.|... .....+.++..|+.+.|+++++||+||++||++||++|++++ T Consensus 1 m~~~~~~~~~~~~~s-~~~~w~~~~~~~~~~~~~~g~~vt~~~al~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~g~~~~~ 79 (421) T protein:vir:10 1 MFIPQMFEGKKRSVS-GGGFWEAMLGGVRSSHSKAGVMITPETALALSAVRACVTLLAESVAQLPVELYRRDKNGGRQRA 79 (421) T ss_pred CCCcchhcccccccC-cchhhHHHhhhhccCcccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEEcCCCceeec Confidence 997776555544433 33333322 223455567889999999999999999999999999999764 Q ss_pred -chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcc Q lcl|NC_011801. 67 -AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSK 145 (386) Q Consensus 67 -~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~ 145 (386) +|+++++|+.+||++||+++||+.++.+++++||||++++|+..|++.+||||+|+.|++..+.++... |.+.. . T Consensus 80 ~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~g~~~-y~~~~--~- 155 (421) T protein:vir:10 80 TDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKGYPKELIPINPKKVIVLKGPDGMPY-YEIPE--I- 155 (421) T ss_pred ccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEECCCceEE-EEEcC--C- Confidence 378999999999999999999999999999999999999999999999999999999999988776543 33321 1 Q ss_pred cceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCC---CCCHHHHH Q lcl|NC_011801. 146 RSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNA---TLGKEAKE 222 (386) Q Consensus 146 ~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~---~~~~~~~~ 222 (386) + ..+++++|+|+++. +.++++|+||+..+..++....++++++.++|+||++|+++|+++.. ..++++.+ T Consensus 156 -g--~~~~~~eiih~~~~----~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~e~~~ 228 (421) T protein:vir:10 156 -G--ETLPMRMMHHVKVF----SLDGYIGSSPIQTNADVLGLNLAVEEHASAVFRRGATMSGVIERPKEAPAIKSQEKID 228 (421) T ss_pred -C--cEEchhhEEEecCc----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCccCccCCHHHHH Confidence 1 25899999999864 35789999999999999999999999999999999999999998752 35899999 Q ss_pred HHHHHHHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC--cccHHHHHHHH Q lcl|NC_011801. 223 NTRQSFEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD--AQSNITMIRAF 299 (386) Q Consensus 223 ~~k~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~--~~~~~~~~~~~ 299 (386) ++++.|++.++| .|+|+++++++|++|+++++++.|+||+|.+++++++||++|||||.+|+..+. +++.+++.++| T Consensus 229 ~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f 308 (421) T protein:vir:10 229 QLLAKWTDRYSGINNMFSVALLQEGMSYKQMSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKATNNNIEHQGLQF 308 (421) T ss_pred HHHHHHHHHhcCccccCcceecCCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCccccHHHHHHHH Confidence 999999999988 688999999999999999999999999999999999999999999999987654 45668899999 Q ss_pred HHHHHHHHHHHHHHHHHHhhh-------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCC Q lcl|NC_011801. 300 YQSSLSIYIKPIESELSQKLG-------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLD 372 (386) Q Consensus 300 ~~~~l~P~~~~ie~~l~~~l~-------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~ 372 (386) +++||+|++.+||++|+++|+ .+++||++.+++.|.+++++++++++++|+||+||+|+++|++|+ |++| T Consensus 309 ~~~tl~P~~~~ie~~ln~kL~~~~~~~~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~-~ggD-- 385 (421) T protein:vir:10 309 VMYTLLAWLKRHEGALQRDLLLPSERRDLYIEFNVSGLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLPPI-AGGD-- 385 (421) T ss_pred HHHHHHHHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCcc-- Confidence 999999999999999999985 368999999999999999999999999999999999999998776 3333 Q ss_pred ccccccccCCCCCC Q lcl|NC_011801. 373 EGTNLLDNTKNIND 386 (386) Q Consensus 373 ~~~~~~~~~~~~~~ 386 (386) + +..+.|-.+ T Consensus 386 ~----~~~~~n~~~ 395 (421) T protein:vir:10 386 K----YLTPLNMVD 395 (421) T ss_pred e----eeecccccc Confidence 2 112222111 No 16 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=100.00 E-value=2.1e-86 Score=490.27 Aligned_cols=377 Identities=20% Similarity=0.267 Sum_probs=311.7 Q ss_pred Cchhhhh-----ccccccCCcc---chhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec------ Q lcl|NC_011801. 1 MAFLSNL-----FKRQKMLSGS---SPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN------ 66 (386) Q Consensus 1 Mg~~~~l-----~~~~~~~~~~---~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~------ 66 (386) ||||+|+ |++++..... .+............++..++.+.|+++++|++||++||++||++|++++ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~ 80 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYG 80 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 9999987 2222221111 1111111111233356678999999999999999999999999999763 Q ss_pred -----chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCce-----eE Q lcl|NC_011801. 67 -----AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKD-----LT 136 (386) Q Consensus 67 -----~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~-----~~ 136 (386) +|+++++|+.+||++||+++||+.++.+++++||||++++|+..|++++||||+|+.|++..++.+.. .+ T Consensus 81 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~~~ 160 (432) T protein:vir:10 81 IQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMW 160 (432) T ss_pred eeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccceEE Confidence 47899999999999999999999999999999999999999999999999999999999988765432 22 Q ss_pred EEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCC Q lcl|NC_011801. 137 YTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATL 216 (386) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~ 216 (386) |.+. ..+....+++++|||+++. .+.++++|+||+..+..++....++++++.++|+||++|+++|++++ .+ T Consensus 161 y~~~----~~g~~~~~~~~eiih~r~~---~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~-~l 232 (432) T protein:vir:10 161 YVVN----TGGQQRVLKPEEILHFKNG---ITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVG-DL 232 (432) T ss_pred EEEe----cCCeEEEEccccEEEecCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC-CC Confidence 3222 2345677999999999854 35678999999999999999999999999999999999999999875 68 Q ss_pred CHHHHHHHHHHHHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCc--CcccHH Q lcl|NC_011801. 217 GKEAKENTRQSFEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQ--DAQSNI 293 (386) Q Consensus 217 ~~~~~~~~k~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~--~~~~~~ 293 (386) ++++.++++++|++.++| .|+|+++++++|++|+++++++.|+||++.+++++++||++|||||.+||..+ ++++.+ T Consensus 233 ~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e 312 (432) T protein:vir:10 233 NEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIE 312 (432) T ss_pred CHHHHHHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHH Confidence 999999999999999988 68899999999999999999999999999999999999999999999998654 455678 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhh--------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCc Q lcl|NC_011801. 294 TMIRAFYQSSLSIYIKPIESELSQKLG--------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGV 365 (386) Q Consensus 294 ~~~~~~~~~~l~P~~~~ie~~l~~~l~--------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~ 365 (386) ++.++|+++||+|++++||++|+++|+ .+++||++.+++.|.+++++++++++++||||+||+|+++|++|+ T Consensus 313 ~~~~~~~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi 392 (432) T protein:vir:10 313 QQQQQFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPE 392 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 999999999999999999999999984 368999999999999999999999999999999999999998876 Q ss_pred CCCCCCCc------------------cccccccCCCCCC Q lcl|NC_011801. 366 FPELDLDE------------------GTNLLDNTKNIND 386 (386) Q Consensus 366 ~p~~~~~~------------------~~~~~~~~~~~~~ 386 (386) |++|... +.+.-+.++++++ T Consensus 393 -~ggD~~~~~~n~~~~~~~~~~~~k~~~~~~~~~~~~~~ 430 (432) T protein:vir:10 393 -AGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKEGNE 430 (432) T ss_pred -CCCCeEeecccccchhhccccccCCCCCCCCCCCCCCC Confidence 4443211 1111111111111 No 17 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=100.00 E-value=2.1e-86 Score=490.27 Aligned_cols=377 Identities=20% Similarity=0.267 Sum_probs=311.7 Q ss_pred Cchhhhh-----ccccccCCcc---chhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec------ Q lcl|NC_011801. 1 MAFLSNL-----FKRQKMLSGS---SPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN------ 66 (386) Q Consensus 1 Mg~~~~l-----~~~~~~~~~~---~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~------ 66 (386) ||||+|+ |++++..... .+............++..++.+.|+++++|++||++||++||++|++++ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~ 80 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYG 80 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 9999987 2222221111 1111111111233356678999999999999999999999999999763 Q ss_pred -----chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCce-----eE Q lcl|NC_011801. 67 -----AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKD-----LT 136 (386) Q Consensus 67 -----~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~-----~~ 136 (386) +|+++++|+.+||++||+++||+.++.+++++||||++++|+..|++++||||+|+.|++..++.+.. .+ T Consensus 81 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~~~ 160 (432) T protein:vir:10 81 IQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMW 160 (432) T ss_pred eeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccceEE Confidence 47899999999999999999999999999999999999999999999999999999999988765432 22 Q ss_pred EEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCC Q lcl|NC_011801. 137 YTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATL 216 (386) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~ 216 (386) |.+. ..+....+++++|||+++. .+.++++|+||+..+..++....++++++.++|+||++|+++|++++ .+ T Consensus 161 y~~~----~~g~~~~~~~~eiih~r~~---~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~-~l 232 (432) T protein:vir:10 161 YVVN----TGGQQRVLKPEEILHFKNG---ITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVG-DL 232 (432) T ss_pred EEEe----cCCeEEEEccccEEEecCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC-CC Confidence 3222 2345677999999999854 35678999999999999999999999999999999999999999875 68 Q ss_pred CHHHHHHHHHHHHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCc--CcccHH Q lcl|NC_011801. 217 GKEAKENTRQSFEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQ--DAQSNI 293 (386) Q Consensus 217 ~~~~~~~~k~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~--~~~~~~ 293 (386) ++++.++++++|++.++| .|+|+++++++|++|+++++++.|+||++.+++++++||++|||||.+||..+ ++++.+ T Consensus 233 ~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e 312 (432) T protein:vir:10 233 NEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIE 312 (432) T ss_pred CHHHHHHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHH Confidence 999999999999999988 68899999999999999999999999999999999999999999999998654 455678 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhh--------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCc Q lcl|NC_011801. 294 TMIRAFYQSSLSIYIKPIESELSQKLG--------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGV 365 (386) Q Consensus 294 ~~~~~~~~~~l~P~~~~ie~~l~~~l~--------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~ 365 (386) ++.++|+++||+|++++||++|+++|+ .+++||++.+++.|.+++++++++++++||||+||+|+++|++|+ T Consensus 313 ~~~~~~~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi 392 (432) T protein:vir:10 313 QQQQQFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPE 392 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 999999999999999999999999984 368999999999999999999999999999999999999998876 Q ss_pred CCCCCCCc------------------cccccccCCCCCC Q lcl|NC_011801. 366 FPELDLDE------------------GTNLLDNTKNIND 386 (386) Q Consensus 366 ~p~~~~~~------------------~~~~~~~~~~~~~ 386 (386) |++|... +.+.-+.++++++ T Consensus 393 -~ggD~~~~~~n~~~~~~~~~~~~k~~~~~~~~~~~~~~ 430 (432) T protein:vir:10 393 -AGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKEGNE 430 (432) T ss_pred -CCCCeEeecccccchhhccccccCCCCCCCCCCCCCCC Confidence 4443211 1111111111111 No 18 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=100.00 E-value=2.1e-86 Score=490.27 Aligned_cols=377 Identities=20% Similarity=0.267 Sum_probs=311.7 Q ss_pred Cchhhhh-----ccccccCCcc---chhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec------ Q lcl|NC_011801. 1 MAFLSNL-----FKRQKMLSGS---SPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN------ 66 (386) Q Consensus 1 Mg~~~~l-----~~~~~~~~~~---~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~------ 66 (386) ||||+|+ |++++..... .+............++..++.+.|+++++|++||++||++||++|++++ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~ 80 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYG 80 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 9999987 2222221111 1111111111233356678999999999999999999999999999763 Q ss_pred -----chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCce-----eE Q lcl|NC_011801. 67 -----AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKD-----LT 136 (386) Q Consensus 67 -----~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~-----~~ 136 (386) +|+++++|+.+||++||+++||+.++.+++++||||++++|+..|++++||||+|+.|++..++.+.. .+ T Consensus 81 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~~~ 160 (432) T protein:vir:10 81 IQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMW 160 (432) T ss_pred eeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccceEE Confidence 47899999999999999999999999999999999999999999999999999999999988765432 22 Q ss_pred EEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCC Q lcl|NC_011801. 137 YTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATL 216 (386) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~ 216 (386) |.+. ..+....+++++|||+++. .+.++++|+||+..+..++....++++++.++|+||++|+++|++++ .+ T Consensus 161 y~~~----~~g~~~~~~~~eiih~r~~---~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~-~l 232 (432) T protein:vir:10 161 YVVN----TGGQQRVLKPEEILHFKNG---ITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVG-DL 232 (432) T ss_pred EEEe----cCCeEEEEccccEEEecCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC-CC Confidence 3222 2345677999999999854 35678999999999999999999999999999999999999999875 68 Q ss_pred CHHHHHHHHHHHHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCc--CcccHH Q lcl|NC_011801. 217 GKEAKENTRQSFEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQ--DAQSNI 293 (386) Q Consensus 217 ~~~~~~~~k~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~--~~~~~~ 293 (386) ++++.++++++|++.++| .|+|+++++++|++|+++++++.|+||++.+++++++||++|||||.+||..+ ++++.+ T Consensus 233 ~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e 312 (432) T protein:vir:10 233 NEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIE 312 (432) T ss_pred CHHHHHHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHH Confidence 999999999999999988 68899999999999999999999999999999999999999999999998654 455678 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhh--------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCc Q lcl|NC_011801. 294 TMIRAFYQSSLSIYIKPIESELSQKLG--------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGV 365 (386) Q Consensus 294 ~~~~~~~~~~l~P~~~~ie~~l~~~l~--------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~ 365 (386) ++.++|+++||+|++++||++|+++|+ .+++||++.+++.|.+++++++++++++||||+||+|+++|++|+ T Consensus 313 ~~~~~~~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi 392 (432) T protein:vir:10 313 QQQQQFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPE 392 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 999999999999999999999999984 368999999999999999999999999999999999999998876 Q ss_pred CCCCCCCc------------------cccccccCCCCCC Q lcl|NC_011801. 366 FPELDLDE------------------GTNLLDNTKNIND 386 (386) Q Consensus 366 ~p~~~~~~------------------~~~~~~~~~~~~~ 386 (386) |++|... +.+.-+.++++++ T Consensus 393 -~ggD~~~~~~n~~~~~~~~~~~~k~~~~~~~~~~~~~~ 430 (432) T protein:vir:10 393 -AGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKEGNE 430 (432) T ss_pred -CCCCeEeecccccchhhccccccCCCCCCCCCCCCCCC Confidence 4443211 1111111111111 No 19 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=100.00 E-value=2.9e-86 Score=489.56 Aligned_cols=380 Identities=19% Similarity=0.223 Sum_probs=309.1 Q ss_pred CchhhhhccccccCCc---c----chh-hhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeecc----- Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSG---S----SPV-WILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTNA----- 67 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~---~----~~~-~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~----- 67 (386) ||||+++|+++..... . .+. .........+.++..|+.+.||++++|++||++||++||++|+++++ T Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~~~ 80 (457) T protein:vir:62 1 MGFWSALFGRGHSPALDAAEGRAWEPYDPSIYNLGATASSGERVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRGGT 80 (457) T ss_pred CchhhhhhccccccccccccccccccchhhhhhccccccCCceechHHhhccHHHHHHHHHHHHhHhhCceEEEEecCCc Confidence 9999999987544211 1 111 11122233456778999999999999999999999999999998753 Q ss_pred -----hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCc---eeEEEE Q lcl|NC_011801. 68 -----QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGK---DLTYTV 139 (386) Q Consensus 68 -----~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~---~~~~~~ 139 (386) ++.+..|..+||++||+++||+.++.+++++||||+++.++ .|.+.+||||+|+.|++..+.... ..+|.+ T Consensus 81 ~~~~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~y 159 (457) T protein:vir:62 81 RKEIDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWA-GPNIAGLDVLDPTKIHVHMVMVDGLRRKVFEAY 159 (457) T ss_pred cccccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEeccCCccceeEEEE Confidence 44555555779999999999999999999999999998654 789999999999999987654332 333444 Q ss_pred eccCcc-cceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCH Q lcl|NC_011801. 140 HFDDSK-RSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGK 218 (386) Q Consensus 140 ~~~~~~-~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~ 218 (386) .+...+ ......+++++|||+++.. +.++++|+||+.++..++....++++++.++|+||++|+++|++++ .+++ T Consensus 160 ~~~~~g~~~~~~~~~~~eiih~r~~~---~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~ls~ 235 (457) T protein:vir:62 160 DIDADGNEVLLGWFTPRDVLHIPGMM---LPGDFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVPG-TMSE 235 (457) T ss_pred EEccCCceeEEEeeCccceEEecCCC---CCCceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcCC-CCCH Confidence 443222 2233568999999999643 3455899999999999999999999999999999999999999975 7899 Q ss_pred HHHHHHHHHHHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCc----ccHH Q lcl|NC_011801. 219 EAKENTRQSFEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDA----QSNI 293 (386) Q Consensus 219 ~~~~~~k~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~----~~~~ 293 (386) ++.+++++.|++.++| .|+|++++|++|++|++++++++|+||+|++++++++||++|||||.+||..+.+ ++.+ T Consensus 236 e~~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e 315 (457) T protein:vir:62 236 EGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLA 315 (457) T ss_pred HHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHH Confidence 9999999999999988 6889999999999999999999999999999999999999999999999876543 4568 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhh-------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcC Q lcl|NC_011801. 294 TMIRAFYQSSLSIYIKPIESELSQKLG-------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVF 366 (386) Q Consensus 294 ~~~~~~~~~~l~P~~~~ie~~l~~~l~-------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~ 366 (386) ++.++|+++||+|++++||++|+++|+ .+++||++.+++.|.+++++++++++++||||+||+|+++|++|++ T Consensus 316 q~~~~f~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~l~~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ 395 (457) T protein:vir:62 316 EQNIAFTMFSLRPWLERIEAGFNRLLFAETADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLP 395 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 888999999999999999999999985 3578999999999999999999999999999999999999988763 Q ss_pred CCCCCCcccccc-----------ccCCCCCC Q lcl|NC_011801. 367 PELDLDEGTNLL-----------DNTKNIND 386 (386) Q Consensus 367 p~~~~~~~~~~~-----------~~~~~~~~ 386 (386) ++.+++.-.++ .+.+...+ T Consensus 396 -~g~~D~~~~~~n~~~~~~~~~~~~~~~~~~ 425 (457) T protein:vir:62 396 -DGLGEKYRVPLNLGEIGEEPEPEPAPAPPA 425 (457) T ss_pred -CCCcceeeeccccccccccccccccCCCcc Confidence 43323211111 00000000 No 20 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=100.00 E-value=3.1e-86 Score=489.37 Aligned_cols=373 Identities=16% Similarity=0.240 Sum_probs=308.3 Q ss_pred Cchhhhh---ccccccCCcc-----ch--hhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec---- Q lcl|NC_011801. 1 MAFLSNL---FKRQKMLSGS-----SP--VWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN---- 66 (386) Q Consensus 1 Mg~~~~l---~~~~~~~~~~-----~~--~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~---- 66 (386) ||||+|+ |++....... .+ ......+...+.++..|+.+.|+++++|++||++||++||++|++++ T Consensus 7 mg~f~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~lp~~~y~~~~ 86 (432) T protein:vir:81 7 LGLFGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAMPLTMYMRTP 86 (432) T ss_pred cchhhhhhhhcccccccccccccccccCccchhhhcccccccCcccchHhhhccHHHHHHHHHHHHhhhhCceeeEEecC Confidence 9999984 4443221110 11 11223344556678889999999999999999999999999999873 Q ss_pred -------chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEE Q lcl|NC_011801. 67 -------AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTV 139 (386) Q Consensus 67 -------~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~ 139 (386) +|+++++|+.+||++||+++||+.++.+++++||||+++.++ +|++++||||+|+.|++..+.++.. .|.+ T Consensus 87 ~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~-~g~~~~L~~l~~~~v~v~~~~~g~~-~y~~ 164 (432) T protein:vir:81 87 DGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDPKGNT-AYRY 164 (432) T ss_pred CcceecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEECCCCcE-EEEE Confidence 488999999999999999999999999999999999999986 5999999999999999999877654 4444 Q ss_pred eccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHH Q lcl|NC_011801. 140 HFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKE 219 (386) Q Consensus 140 ~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~ 219 (386) ... .+..+.+++++|+|+|+. +.++++|+||+..+..++....+++++..++|+||++|++++++++ .++++ T Consensus 165 ~~~---~g~~~~~~~~~iih~r~~----~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-~l~~e 236 (432) T protein:vir:81 165 RRT---DGQMIDIPKQQIWKIMGY----SLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDR-FLTDD 236 (432) T ss_pred Eec---CceEEEEccccEEEecCC----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCC-CCCHH Confidence 432 245577999999999854 4578899999999999999999999999999999999999999875 68999 Q ss_pred HHHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCc-----ccHHH Q lcl|NC_011801. 220 AKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDA-----QSNIT 294 (386) Q Consensus 220 ~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~-----~~~~~ 294 (386) +++++++.|+. ..|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..+.+ ++.++ T Consensus 237 ~~~~~~~~~~~---~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq 313 (432) T protein:vir:81 237 QYDSFAKKVSG---SVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIES 313 (432) T ss_pred HHHHHHHHHhh---hhcCCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHH Confidence 99998887753 46788999999999999999999999999999999999999999999999876532 45688 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhh-------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCC Q lcl|NC_011801. 295 MIRAFYQSSLSIYIKPIESELSQKLG-------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFP 367 (386) Q Consensus 295 ~~~~~~~~~l~P~~~~ie~~l~~~l~-------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p 367 (386) +..+|+++||.|+++.||++|+++|+ .+++||++.+++.|.+++++++++++++||||+||+|+++|++|+++ T Consensus 314 ~~~~f~~~tl~P~~~~ie~~l~~kLl~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~NE~R~~~glpp~~g 393 (432) T protein:vir:81 314 QQLGFLTMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGG 393 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Confidence 89999999999999999999999995 36899999999999999999999999999999999999999888743 Q ss_pred CCCCCc-ccc--ccc-------c--CCCC------CC Q lcl|NC_011801. 368 ELDLDE-GTN--LLD-------N--TKNI------ND 386 (386) Q Consensus 368 ~~~~~~-~~~--~~~-------~--~~~~------~~ 386 (386) +++... ..+ ++. + .++. ++ T Consensus 394 ~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~n~~~~~~ 430 (432) T protein:vir:81 394 NAAVLTVQSAMVPLDSIGLQASPEPASGLGNQQQDKV 430 (432) T ss_pred CcceEeecCcccchhhhccCCCCCCCCCCCCcccccc Confidence 222110 000 110 0 0000 00 No 21 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=100.00 E-value=1e-85 Score=486.51 Aligned_cols=378 Identities=27% Similarity=0.430 Sum_probs=329.8 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeecchhHHHHHhccCcc Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTNAQPITDVLNAPLGN 80 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~~~~~~~l~~~PN~ 80 (386) ||||++.+++++..+...+.+..... .+..+..|+.+.||++++|++||++||++||++|+++..+ ..++|+.+||+ T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~v~~~~al~~~~V~~~v~~ia~~ia~~p~~~~~~-~~~~l~~~PN~ 77 (397) T protein:vir:38 1 MPLLKLNKSHSQGFSLNDPDWVNFLT--GGEAQKYVSADTALKNSDIFSLIMQLSGDLAMVRYTSESD-RSQSIISNPSV 77 (397) T ss_pred CcchhhhhcccCcccCCchhhhhhhc--CCcCCceechHHhhccHHHHHHHHHHHHHHhhCccccccc-HHHHHHhcCCC Confidence 99999987776666666655543333 2345677999999999999999999999999999987655 45567788999 Q ss_pred cCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEEcccceeee Q lcl|NC_011801. 81 LMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYDSSEVIHF 160 (386) Q Consensus 81 ~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vih~ 160 (386) +||+++||+.++.+++++||||++++|+..|.+++||||+|++|++..+.++..+.|.+.......+..+.+++++|+|+ T Consensus 78 ~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~~~~~y~~~~~~~~~~~~~~~~~~eiih~ 157 (397) T protein:vir:38 78 TANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLRPSQVQPMLLQDGSGLIYNINFDEPAIGYMENVPAADVIHI 157 (397) T ss_pred CCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEeccccccceeEecCccEEEe Confidence 99999999999999999999999999999999999999999999999999998889988887777777788999999999 Q ss_pred ccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcc Q lcl|NC_011801. 161 RCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRA 240 (386) Q Consensus 161 ~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~ 240 (386) ++.. +.+.++|+||+.++..++....++.+++.++|+||+.|+++++++. .+++++.+++++.|+..+++.|+|++ T Consensus 158 ~~~~---~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~-~~~~e~~~~~~~~~~~~~~~~n~~~~ 233 (397) T protein:vir:38 158 RLLS---KNGGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQK-GGLLDAETRIARSKEISKQIHNSDGP 233 (397) T ss_pred cCCC---CCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC-CCCHHHHHHHHHHHHHHhcccccCCc Confidence 9653 4556799999999999999999999999999999999999999876 57899999999999998888999999 Q ss_pred eecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_011801. 241 VVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYIKPIESELSQKLG 320 (386) Q Consensus 241 ~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~ 320 (386) +++++|++|++++.++.|+||+|.+++.+++||++|||||.+|+..+.++.+.++...||++||+|++..||++|+++|+ T Consensus 234 ~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~ie~~ln~~l~ 313 (397) T protein:vir:38 234 VVIDALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQQSSITQISGQYAKSLNRYVQAIVGELNDKLH 313 (397) T ss_pred eecCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 99999999999999999999999999999999999999999999877666666777889999999999999999999999 Q ss_pred hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCccccc------cccCCCCCC Q lcl|NC_011801. 321 TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTNL------LDNTKNIND 386 (386) Q Consensus 321 ~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~~------~~~~~~~~~ 386 (386) ..++|++..+++.|.+++++.+++++++|+||+||+|+++|++|++++ +....... ....+++.| T Consensus 314 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~-d~~~~~~~~~~~~~~~~~~~g~~ 384 (397) T protein:vir:38 314 ANISANIRFAIDAMGDQYASTISSSVKGGTIAGNQARFILQNSGYLAK-DLPDPEKEPQQAIQLIQQEGGEN 384 (397) T ss_pred ChhcccccccccCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC-ccccccccccccccccccccCCC Confidence 999999999999999999999999999999999999999998886543 32222211 111111111 No 22 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=100.00 E-value=5.3e-86 Score=488.09 Aligned_cols=378 Identities=19% Similarity=0.226 Sum_probs=310.2 Q ss_pred hhhhhccccccCC----ccchhhhhh------cccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeecc----- Q lcl|NC_011801. 3 FLSNLFKRQKMLS----GSSPVWILN------QGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTNA----- 67 (386) Q Consensus 3 ~~~~l~~~~~~~~----~~~~~~~~~------~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~----- 67 (386) +|+.+++.++... .+...+... .....+.++..|+.+.||++++|++||++||++||++|+++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~g~~v~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~g 80 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSLFQAVAEPFAGAWQQGVKADPEAVLSFHAVFACISLISQDIAKMRLRLMQTDAQG 80 (454) T ss_pred CCCccccCcccccccccccchhhhhhhhhhhhhhcchhhcCcccChHHhhccHHHHHHHHHHHHhhccCceEEEEeccCC Confidence 6664443222111 111111111 1223445677899999999999999999999999999998763 Q ss_pred ------hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEec Q lcl|NC_011801. 68 ------QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHF 141 (386) Q Consensus 68 ------~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~ 141 (386) ++.+++|+.+||++||+++||+.++.+++++||||++++++..|++.+||||+|++|++..+.++.. .|.+.. T Consensus 81 ~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~-~y~~~~ 159 (454) T protein:vir:93 81 IRRETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKELRILDWNRVEPLVADDGEV-FYRITP 159 (454) T ss_pred ccchhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCcceEEEEcCCCcE-EEEEEe Confidence 3455677789999999999999999999999999999999999999999999999999998877654 444443 Q ss_pred c-CcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHH Q lcl|NC_011801. 142 D-DSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEA 220 (386) Q Consensus 142 ~-~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~ 220 (386) . ..+.+..+.+++++|+|+++. .+.++++|+||+..+..++....+++++..++|+||++|+++|++++ .+++++ T Consensus 160 ~~~~~~~~~~~~~~~eViH~k~~---~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~l~~e~ 235 (454) T protein:vir:93 160 DRNCGITEAVTVPAREVIHDRFN---CFFHPLIGLPPVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIPG-SITEEN 235 (454) T ss_pred ccccccceeEEecCcceEEeccC---CCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCC-CCCHHH Confidence 3 233345678999999999864 34678899999999999999999999999999999999999999975 689999 Q ss_pred HHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC--cccHHHHHHH Q lcl|NC_011801. 221 KENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD--AQSNITMIRA 298 (386) Q Consensus 221 ~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~--~~~~~~~~~~ 298 (386) .+++++.|++.++|.|+|+++|+++|++|++++++++|+||+|++++++++||++|||||.+||..+. +++.+++.++ T Consensus 236 ~~~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~ 315 (454) T protein:vir:93 236 AKKLKSNWDSGYTGENAGKTAILSNGAKYNPTTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEALEQQ 315 (454) T ss_pred HHHHHHHHHHHhcccccCCceeccCCceEEEcccChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999987654 4556888899 Q ss_pred HHHHHHHHHHHHHHHHHHHhhh----hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCc- Q lcl|NC_011801. 299 FYQSSLSIYIKPIESELSQKLG----TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDE- 373 (386) Q Consensus 299 ~~~~~l~P~~~~ie~~l~~~l~----~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~- 373 (386) |+++||.|++..||++|+++|+ .+++||++.+++.|.+++++.+.+++++|+||+||+|+++|++|++ ++|... T Consensus 316 f~~~~l~P~~~~ie~~ln~~L~~~~~~~~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~-ggD~~~~ 394 (454) T protein:vir:93 316 YYSQCLQTLIESIELLLDEALETGENESTEFDVTTLLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLA-GGDALYL 394 (454) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC-CCCeeee Confidence 9999999999999999999994 5789999999999999999999999999999999999999988763 332210 Q ss_pred ccc--cc------ccCC----------------CCCC Q lcl|NC_011801. 374 GTN--LL------DNTK----------------NIND 386 (386) Q Consensus 374 ~~~--~~------~~~~----------------~~~~ 386 (386) ..+ ++ +... ..+| T Consensus 395 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 431 (454) T protein:vir:93 395 QQQNYSLEALSRRDAREDPFASSGKTASVPQAVAASD 431 (454) T ss_pred ccCccchHhhhccCcccCCCCCCccCCCCCCCCCCCC Confidence 000 00 0000 0000 No 23 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=100.00 E-value=1.4e-85 Score=485.71 Aligned_cols=368 Identities=20% Similarity=0.280 Sum_probs=316.4 Q ss_pred chhhhhccccccCCccchhhh--hhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec-----------ch Q lcl|NC_011801. 2 AFLSNLFKRQKMLSGSSPVWI--LNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN-----------AQ 68 (386) Q Consensus 2 g~~~~l~~~~~~~~~~~~~~~--~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~-----------~~ 68 (386) =||+++|+|++......+... .......++++..|+.+.|+++++|++||++||+++|++|++++ +| T Consensus 1 ~~f~~~f~r~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~~l~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~~ 80 (413) T protein:vir:48 1 MFFSGLFQRKSDAPVTTPAELAEAIGLSYDTYTGKRISSQRAMRLTAVYSCVRVLAESVGMLPCSLYKISGTLKTRVVDE 80 (413) T ss_pred CccchhhccCccCCccchHHHHHhhhcCcccccCceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCcceeeccc Confidence 255678887766655544322 23334556788899999999999999999999999999999765 47 Q ss_pred hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccce Q lcl|NC_011801. 69 PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSG 148 (386) Q Consensus 69 ~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~ 148 (386) +++++|+.+||++||+++||+.++.+++++||||+++.++ .|++++||||+|++|++..+.++.. .|.+.... +. T Consensus 81 ~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~~~-~y~~~~~~---g~ 155 (413) T protein:vir:48 81 RLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA-LGEVVELLPIDPGCVEPKLNSQWQP-VYQVTFPD---GS 155 (413) T ss_pred HHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC-CCcEEEEEEEcCceEEEEEcCCceE-EEEEEecC---ce Confidence 8999999999999999999999999999999999999987 6899999999999999998877654 44444432 34 Q ss_pred eEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHH Q lcl|NC_011801. 149 DFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSF 228 (386) Q Consensus 149 ~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~ 228 (386) ...+++++|+|+++. +.++++|+||+..+..+++...++++++.++|+||++|+++|++++ .+++++.+++++.| T Consensus 156 ~~~~~~~evih~~~~----~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~-~~~~e~~~~~~~~~ 230 (413) T protein:vir:48 156 VDVLTQDEIWHVRTL----TLDGLVGLNPIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTEQ-KLTPDAYERLKKDF 230 (413) T ss_pred EEEEccccEEEecCc----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-CCCHHHHHHHHHHH Confidence 567999999999864 2467899999999999999999999999999999999999999976 67999999999999 Q ss_pred HHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC--cccHHHHHHHHHHHHHH Q lcl|NC_011801. 229 EEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD--AQSNITMIRAFYQSSLS 305 (386) Q Consensus 229 ~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~--~~~~~~~~~~~~~~~l~ 305 (386) ++.++| .|+|+++++++|++|+++++++.|+||+|.+++++++||++|||||.+|+..+. +++.+++..+|+++||+ T Consensus 231 ~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~i~ 310 (413) T protein:vir:48 231 EERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLV 310 (413) T ss_pred HHHhcCccccCcceecCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHHHHHHHHHHH Confidence 999988 789999999999999999999999999999999999999999999999997644 45668899999999999 Q ss_pred HHHHHHHHHHHHhhh-------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCcccccc Q lcl|NC_011801. 306 IYIKPIESELSQKLG-------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTNLL 378 (386) Q Consensus 306 P~~~~ie~~l~~~l~-------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~~~ 378 (386) |++++||++|+++|+ .+++||++.+++.|.+++++++++++++|+||+||+|+++|++|+ |++|. + T Consensus 311 P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~-~ggD~------~ 383 (413) T protein:vir:48 311 PYLTRIEQRINTGLVRESKQGKFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPR-PGGDV------Y 383 (413) T ss_pred HHHHHHHHHHHhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCcce------e Confidence 999999999999985 368999999999999999999999999999999999999998776 33332 1 Q ss_pred ccCCCCCC Q lcl|NC_011801. 379 DNTKNIND 386 (386) Q Consensus 379 ~~~~~~~~ 386 (386) .+++|... T Consensus 384 ~~~~n~~~ 391 (413) T protein:vir:48 384 LTPMNMTT 391 (413) T ss_pred eccccccc Confidence 12222111 No 24 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=100.00 E-value=1e-85 Score=486.54 Aligned_cols=375 Identities=17% Similarity=0.234 Sum_probs=309.4 Q ss_pred CchhhhhccccccCCccchh-hhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec---------chhH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPV-WILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN---------AQPI 70 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~---------~~~~ 70 (386) ||||+|+|+++......... ............+..++.+.|+++++|++||++||+++|++|++++ .|++ T Consensus 1 Mgl~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~~l 80 (409) T protein:vir:84 1 MSLFTRIFSGPSEERTLTKISGIPSPAEDWAMHGDRPGANSAMTLGAFYACVTLLADTVASLSIDAYRKKDNVRIPVSPA 80 (409) T ss_pred CchhhhhhcCCCcccccccccccccccchhhccCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCcccccchH Confidence 99999999886433222211 1112222334456788999999999999999999999999999875 4789 Q ss_pred HHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEe-ecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCccccee Q lcl|NC_011801. 71 TDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIID-RDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGD 149 (386) Q Consensus 71 ~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~-~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~ 149 (386) +++|+.+||++||+++||+.++.+++++||+|+++. ++..|++++||||+|+.|++....+.....+.+.+... . T Consensus 81 ~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~p~~v~v~~~~~~~~~~~~~~~~~~----g 156 (409) T protein:vir:84 81 PKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMPIHPDCIHVTDAKDEDGDWIEPVYRID----G 156 (409) T ss_pred HHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEEEcCceeEEEEcCCCcceEEEEEecCC----c Confidence 999999999999999999999999999999999885 78899999999999999999876655544444333221 1 Q ss_pred EEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHH Q lcl|NC_011801. 150 FLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFE 229 (386) Q Consensus 150 ~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~ 229 (386) ..+++++|||+++.. +.+.++|+||+..+..++....++++++.++|+||++|+++|+.++ .+++++.+++++.|. T Consensus 157 ~~~~~~dvih~~~~~---~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~l~~e~~~~~~~~~~ 232 (409) T protein:vir:84 157 KVVPNHRIMHIKRYP---VAGCALGMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSDA-DLTPDQVKQTQKQWI 232 (409) T ss_pred eEEchhhEEEecCCC---CCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCC-CCCHHHHHHHHHHHH Confidence 358999999999643 3455799999999999999999999999999999999999999875 689999999999998 Q ss_pred HHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCc----ccHHHHHHHHHHHHHH Q lcl|NC_011801. 230 EQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDA----QSNITMIRAFYQSSLS 305 (386) Q Consensus 230 ~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~----~~~~~~~~~~~~~~l~ 305 (386) +.+ .|+|+++++++|++|+++++++.|+||+|.+++++++||++|||||.+||..+.+ ++.+++..+|+++||. T Consensus 233 ~~~--~n~g~~~vl~~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~f~~~~l~ 310 (409) T protein:vir:84 233 QSH--HNRRLPAVMSAGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQGINFVRHTLL 310 (409) T ss_pred HHh--ccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHHH Confidence 765 4578899999999999999999999999999999999999999999999875533 4568888999999999 Q ss_pred HHHHHHHHHHHHhh--hhhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCc---ccccccc Q lcl|NC_011801. 306 IYIKPIESELSQKL--GTDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDE---GTNLLDN 380 (386) Q Consensus 306 P~~~~ie~~l~~~l--~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~---~~~~~~~ 380 (386) |+++.||++|+++| +.+++||++.+++.|.+++++++.+++++||||+||+|+++|++|+ |++|... .-.++.. T Consensus 311 P~~~~ie~~l~~~L~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~-~ggD~~~~~~n~~~~~~ 389 (409) T protein:vir:84 311 PWLRCIEQALDTFLPRGQFVKFNVDGLMRGDVTARFTAYQMGLQNGIWSVNEVRAWEDAPPI-PEGDIHLQPMNFVPLGY 389 (409) T ss_pred HHHHHHHHHHHHhccCCCeEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCcceeeeccccccccc Confidence 99999999999998 6788999999999999999999999999999999999999998776 4433211 0000000 Q ss_pred -------------CCCCCC Q lcl|NC_011801. 381 -------------TKNIND 386 (386) Q Consensus 381 -------------~~~~~~ 386 (386) +....| T Consensus 390 ~~~~~~~~~~~~~~~~~gn 408 (409) T protein:vir:84 390 VPPEEPAQEPQPNSATEGN 408 (409) T ss_pred CCccccCcCCCCCCccCCC Confidence 000000 No 25 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=100.00 E-value=1.3e-85 Score=485.93 Aligned_cols=373 Identities=16% Similarity=0.236 Sum_probs=306.6 Q ss_pred Cchhhhh---ccccccCCc-------cchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec---- Q lcl|NC_011801. 1 MAFLSNL---FKRQKMLSG-------SSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN---- 66 (386) Q Consensus 1 Mg~~~~l---~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~---- 66 (386) ||||+|+ |.++..... ..+......+...+.++..|+.+.|+++++|++||++||++||++|++++ T Consensus 7 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~a~~~~aV~~~v~~Ia~~ia~lp~~~y~~~~ 86 (432) T protein:vir:97 7 LGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAVAAMPLMMYMRTP 86 (432) T ss_pred CchhhhhHhhcCCccccccccccccccCchhhhhhcccccccCcccchHhhhcchHHHHHHHHHHHhhccCceEEEEecC Confidence 9999885 222211100 01112223345566778899999999999999999999999999999874 Q ss_pred -------chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEE Q lcl|NC_011801. 67 -------AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTV 139 (386) Q Consensus 67 -------~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~ 139 (386) +||++++|+.+||++||+++||+.++.+++++||||++++++ +|++.+||||+|+.|++..+.++.. .|.+ T Consensus 87 ~g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~p~~v~v~~~~~g~~-~y~~ 164 (432) T protein:vir:97 87 DGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDTKGNT-AYRY 164 (432) T ss_pred CCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCcceEEEEcCCCcE-EEEE Confidence 378999999999999999999999999999999999999997 5999999999999999998877654 4444 Q ss_pred eccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHH Q lcl|NC_011801. 140 HFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKE 219 (386) Q Consensus 140 ~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~ 219 (386) ... .+..+.+++++|+|+|+. +.++++|+||+..+..++....+++++..++|+||++|++++++++ .++++ T Consensus 165 ~~~---~g~~~~~~~~~iih~r~~----~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~-~l~~e 236 (432) T protein:vir:97 165 RRT---DGQMIDIPRQQIWKIMGY----SLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDR-FLTDD 236 (432) T ss_pred Eec---CceEEEEccccEEEecCc----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecCC-CCCHH Confidence 432 245577999999999853 4578999999999999999999999999999999999999999875 68999 Q ss_pred HHHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCc-----ccHHH Q lcl|NC_011801. 220 AKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDA-----QSNIT 294 (386) Q Consensus 220 ~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~-----~~~~~ 294 (386) +++++++.|.. ..|+|++++|++|++|+++++++.|+||+|++++++++||++|||||.+||..+.+ ++.++ T Consensus 237 ~~~~~~~~~~~---~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e~ 313 (432) T protein:vir:97 237 QYDSFSKKVSG---SVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIES 313 (432) T ss_pred HHHHHHHHHhh---hhcCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHHH Confidence 99888877753 35788999999999999999999999999999999999999999999999875432 45688 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhh-------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCC Q lcl|NC_011801. 295 MIRAFYQSSLSIYIKPIESELSQKLG-------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFP 367 (386) Q Consensus 295 ~~~~~~~~~l~P~~~~ie~~l~~~l~-------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p 367 (386) +..+|+++||.|+++.||++|+++|+ .+++||++.+++.|.+++++++.+++++||||+||+|+++|++|+++ T Consensus 314 ~~~~f~~~tl~P~~~~ie~~ln~kLl~~~e~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~glpp~~g 393 (432) T protein:vir:97 314 QQLGFLTMTLSPWLRRIEQSIALNLLTPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGG 393 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Confidence 89999999999999999999999985 35899999999999999999999999999999999999999877642 Q ss_pred CCCCCc-ccc--cc---------ccCCCCCC Q lcl|NC_011801. 368 ELDLDE-GTN--LL---------DNTKNIND 386 (386) Q Consensus 368 ~~~~~~-~~~--~~---------~~~~~~~~ 386 (386) +++.-. ..+ ++ ++..+..| T Consensus 394 ~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~~ 424 (432) T protein:vir:97 394 NAAVLTVQSAMVPLDSIGLQASPEPASGLGN 424 (432) T ss_pred CcceEeecccccchhhhcccCCCCCCCCCCC Confidence 222100 000 11 00000000 No 26 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=100.00 E-value=1.9e-85 Score=485.03 Aligned_cols=376 Identities=18% Similarity=0.233 Sum_probs=310.0 Q ss_pred Cch-hhhhccccc---------cCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec---- Q lcl|NC_011801. 1 MAF-LSNLFKRQK---------MLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN---- 66 (386) Q Consensus 1 Mg~-~~~l~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~---- 66 (386) |.- +.|++.+.+ ..+...+.+....+.....++..|+.+.|+++++|++||++||++||++|++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~g~~~s~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~ci~~Ia~~ia~lp~~~~~~~~ 80 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLKWLGVPISLTDGSFWSAWGGMGSSSGETVTADSALQLSAVWSCVRLIAETIATLPLNLYQTKP 80 (437) T ss_pred CCcchhhhhhhhHHhhhhhcCCcccCCchhHHHhhcccccCCCceechHhhhccHHHHHHHHHHHHHHhhCceeEEEEcC Confidence 662 122222211 111112222233455666778899999999999999999999999999999763 Q ss_pred --------chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEE Q lcl|NC_011801. 67 --------AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYT 138 (386) Q Consensus 67 --------~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~ 138 (386) +|+++++|+.+||++||+++||+.++.+++++||||++++|+ .|++++||||+|+.|++..+.++.. .|. T Consensus 81 ~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~-~g~~~~L~~l~p~~v~i~~~~~g~~-~y~ 158 (437) T protein:vir:10 81 DGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRS-AGVLIGLELMLPQRTTVKRLTSGAL-QYT 158 (437) T ss_pred CCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCcceEEEECCCCeE-EEE Confidence 478999999999999999999999999999999999999998 5999999999999999998876654 344 Q ss_pred EeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCH Q lcl|NC_011801. 139 VHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGK 218 (386) Q Consensus 139 ~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~ 218 (386) +... .+....+++++|+|+|+. +.++++|+||+..+..++....+++++..++|+||++|+++|++++ .+++ T Consensus 159 ~~~~---~g~~~~~~~~dIih~r~~----~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~l~~ 230 (437) T protein:vir:10 159 YRNV---DGTVSTLAEDDVFHVRGF----SLDGLMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTDQ-ILQK 230 (437) T ss_pred EEec---CceEEEEccccEEEecCc----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC-CCCH Confidence 4332 244567999999999854 2567899999999999999999999999999999999999999875 6899 Q ss_pred HHHHHHHHHHHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCc----ccHH Q lcl|NC_011801. 219 EAKENTRQSFEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDA----QSNI 293 (386) Q Consensus 219 ~~~~~~k~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~----~~~~ 293 (386) ++.+++++.|++.++| .|+|+++|+++|++|+++++++.|+||+|++++++++||++|||||.+||..+.+ ++.+ T Consensus 231 e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e 310 (437) T protein:vir:10 231 EKRAEIRTDLAEQFGGAMQAGKTMVLEAGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIE 310 (437) T ss_pred HHHHHHHHHHHHHhcCccccCcceeccCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHH Confidence 9999999999999887 6889999999999999999999999999999999999999999999999876543 4568 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhh-------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcC Q lcl|NC_011801. 294 TMIRAFYQSSLSIYIKPIESELSQKLG-------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVF 366 (386) Q Consensus 294 ~~~~~~~~~~l~P~~~~ie~~l~~~l~-------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~ 366 (386) ++..+|+++||+|++.+||++|+++|+ .+++||++.+++.|.+++++++++++++|+||+||+|+++|++|++ T Consensus 311 ~~~~~f~~~tl~P~~~~ie~~l~~kll~~~e~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ 390 (437) T protein:vir:10 311 QQTLGFLTFTLRPWLTRIEQAARRSLLRPGERDQFYAEFSVEGLLRADSAGRAAFYSTMTQNGLMTRDECRAKENLPPMG 390 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 899999999999999999999999985 3588999999999999999999999999999999999999998875 Q ss_pred CCCCCCc-cccccc--------cCCCCCC Q lcl|NC_011801. 367 PELDLDE-GTNLLD--------NTKNIND 386 (386) Q Consensus 367 p~~~~~~-~~~~~~--------~~~~~~~ 386 (386) ++++.-. ..+.+. ++++..+ T Consensus 391 gg~~~~~~~~~~~~~~~~~~~~~~~~~~~ 419 (437) T protein:vir:10 391 GNAAVLTVQSALLPIDKLGEHTTATAAQD 419 (437) T ss_pred CCcceEeecCcccchhhccCcCCCcchhc Confidence 3322100 111110 0100000 No 27 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=100.00 E-value=1.9e-85 Score=485.09 Aligned_cols=375 Identities=20% Similarity=0.251 Sum_probs=310.1 Q ss_pred CchhhhhccccccCCcc-------chh-hhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeecc----- Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGS-------SPV-WILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTNA----- 67 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~-------~~~-~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~----- 67 (386) ||||+++|++....... .+. ...........++..|+.+.||++++|++||++||++||++|+++++ T Consensus 1 Mg~~~~l~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~V~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~ 80 (457) T protein:vir:13 1 MGFWSALFGRGHSPALDGIEARAWEPYDPSIYNLGAVAASGETVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRGGS 80 (457) T ss_pred CchhhhhhcccccccccccccccccccchHHHhhcccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 99999999875443110 111 12222334566788999999999999999999999999999998764 Q ss_pred ------hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCc---eeEEE Q lcl|NC_011801. 68 ------QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGK---DLTYT 138 (386) Q Consensus 68 ------~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~---~~~~~ 138 (386) |++..+|+..|| .||+++||+.++.+++++||||+++.++ .|.+++||||+|+.|++..+.... ..++. T Consensus 81 ~~~~~~~~l~~~ln~~~n-~~t~~~f~~~~~~~lll~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~ 158 (457) T protein:vir:13 81 RKEIVTPEWLDYPNAEPG-GMGRIDILSQTVLSLLLQGNAFLAVRWQ-GPNIVGLDVLDPTKIHVHMVMVDGLRRKVFEA 158 (457) T ss_pred ccccccchHHHhccccCC-CCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEccCceEEEEecCCCccceeEEE Confidence 467788877666 7999999999999999999999999765 699999999999999997665443 33344 Q ss_pred EeccCccc-ceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCC Q lcl|NC_011801. 139 VHFDDSKR-SGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLG 217 (386) Q Consensus 139 ~~~~~~~~-~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~ 217 (386) |.+...+. .....+++++|||+++.. +.++++|+||+..+..++....++++++.++|+||++|+++|++++ .++ T Consensus 159 y~~~~~~~~~~~~~~~~~diih~~~~~---~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~ls 234 (457) T protein:vir:13 159 YDIDADGNEVLLGWFTPRDVLHIPGMM---LPGDFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVPG-TMS 234 (457) T ss_pred EEEecCCceeeEEeeCccceEEecCCC---CCCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCC-CCC Confidence 44433222 233468999999999653 3455899999999999999999999999999999999999999975 789 Q ss_pred HHHHHHHHHHHHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCc----ccH Q lcl|NC_011801. 218 KEAKENTRQSFEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDA----QSN 292 (386) Q Consensus 218 ~~~~~~~k~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~----~~~ 292 (386) +++.+++++.|++.++| .|+|++++|++|++|+++++++.|+||+|++++++++||++|||||.+||..+.+ ++. T Consensus 235 ~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~ 314 (457) T protein:vir:13 235 EEGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGL 314 (457) T ss_pred HHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchH Confidence 99999999999999988 6889999999999999999999999999999999999999999999999876543 456 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhh-------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCc Q lcl|NC_011801. 293 ITMIRAFYQSSLSIYIKPIESELSQKLG-------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGV 365 (386) Q Consensus 293 ~~~~~~~~~~~l~P~~~~ie~~l~~~l~-------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~ 365 (386) +++..+|+++||.|++++||++|+++|+ .+++||++.+++.|.+++++++.+++++|+||+||+|+++|++|+ T Consensus 315 eq~~~~f~~~tl~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~l~~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~Pi 394 (457) T protein:vir:13 315 AEQNIAFTMFSLRPWLERIEAGFNRLLFAETADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPL 394 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccCceeEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 8888999999999999999999999995 358999999999999999999999999999999999999998886 Q ss_pred CCCCCCCccccccccCCCCCC Q lcl|NC_011801. 366 FPELDLDEGTNLLDNTKNIND 386 (386) Q Consensus 366 ~p~~~~~~~~~~~~~~~~~~~ 386 (386) + ++.+++.- .+.|... T Consensus 395 ~-~g~~d~~~----~~~n~~~ 410 (457) T protein:vir:13 395 P-DGLGEKYR----VPLNLGE 410 (457) T ss_pred C-CCccccee----ecccccc Confidence 4 33222211 1111110 No 28 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=100.00 E-value=1.6e-85 Score=485.40 Aligned_cols=373 Identities=17% Similarity=0.230 Sum_probs=307.1 Q ss_pred Cchhhhh---ccccccCCc-------cchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec---- Q lcl|NC_011801. 1 MAFLSNL---FKRQKMLSG-------SSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN---- 66 (386) Q Consensus 1 Mg~~~~l---~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~---- 66 (386) ||+|+|+ |.+...... ..+......+...+.++..|+.+.|+++++||+||++||++||++|++++ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~lp~~~y~~~~ 86 (432) T protein:vir:10 7 LGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAMPLTMYMRTP 86 (432) T ss_pred cchhhhhHhhcCCccccccccccccccCcchhhhhcccccccCcccchhhhhcchHHHHHHHHHHHhhhhCceeEEEecC Confidence 9999875 322211100 01111222344556678889999999999999999999999999999864 Q ss_pred -------chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEE Q lcl|NC_011801. 67 -------AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTV 139 (386) Q Consensus 67 -------~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~ 139 (386) +||++++|+.+||++||+++||+.++.+++++||||++++++ +|++.+||||+|+.|++..+.++.. .|.+ T Consensus 87 ~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~~~~v~v~~~~~g~~-~y~~ 164 (432) T protein:vir:10 87 DGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDTKGNT-AYRY 164 (432) T ss_pred CCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEEcCCCcE-EEEE Confidence 478999999999999999999999999999999999999997 6999999999999999999877654 4444 Q ss_pred eccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHH Q lcl|NC_011801. 140 HFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKE 219 (386) Q Consensus 140 ~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~ 219 (386) ... .+..+.+++++|+|+++. +.++++|+||+..+..++....+++++..++|+||++|++++++++ .++++ T Consensus 165 ~~~---~g~~~~~~~~~iih~~~~----~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-~l~~e 236 (432) T protein:vir:10 165 RRT---DGQMIDIPKQQIWKIMGY----SLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDR-FLTDD 236 (432) T ss_pred Eec---CceEEEEcCccEEEecCC----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCC-CCCHH Confidence 332 245678999999999854 4578999999999999999999999999999999999999999875 68999 Q ss_pred HHHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCc-----ccHHH Q lcl|NC_011801. 220 AKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDA-----QSNIT 294 (386) Q Consensus 220 ~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~-----~~~~~ 294 (386) +++++++.|.. ..|+|++++|++|++|++++++++|+||+|++++++++||++|||||.+||..+.+ ++.++ T Consensus 237 ~~~~~~~~~~~---~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~ 313 (432) T protein:vir:10 237 QYDSFAKKVSG---SVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIES 313 (432) T ss_pred HHHHHHHHHhh---hhhCCCceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHH Confidence 99998888753 35788999999999999999999999999999999999999999999999876532 45678 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhh-------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCC Q lcl|NC_011801. 295 MIRAFYQSSLSIYIKPIESELSQKLG-------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFP 367 (386) Q Consensus 295 ~~~~~~~~~l~P~~~~ie~~l~~~l~-------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p 367 (386) +..+|+++||.|++++||++|+++|+ .+++||++++++.|.+++++++++++++||||+||+|+++|++|++. T Consensus 314 ~~~~f~~~tl~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~glppi~g 393 (432) T protein:vir:10 314 QQLGFLSMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGG 393 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Confidence 89999999999999999999999985 35899999999999999999999999999999999999999888742 Q ss_pred CCCCC---ccccccc---------------cCCCCCC Q lcl|NC_011801. 368 ELDLD---EGTNLLD---------------NTKNIND 386 (386) Q Consensus 368 ~~~~~---~~~~~~~---------------~~~~~~~ 386 (386) +++.- ..-.++. ...++++ T Consensus 394 ~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~~~~~~~~ 430 (432) T protein:vir:10 394 NAAVLTVQSAMVPLDSIGLQASPEPASGLGNQQQDKV 430 (432) T ss_pred CcceEeecCcccchhhhcccCCCCCCCCCCCcccccc Confidence 22210 0000110 0000111 No 29 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=100.00 E-value=3.9e-85 Score=483.36 Aligned_cols=374 Identities=20% Similarity=0.245 Sum_probs=312.2 Q ss_pred CchhhhhccccccCCccc-hhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec----------chh Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSS-PVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN----------AQP 69 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~----------~~~ 69 (386) |- |+++|+++....... +.+.... ....++.+++.+.|+++++|++||++||++||++|++++ +|+ T Consensus 1 m~-f~~~~~~~~~~~~~~~~~~~~~~--g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~~~~~~~~ 77 (409) T protein:vir:10 1 ML-FRKGFKNQSQEISIDDKKILEWL--GINPSETYVNGKSCLKQATVFGCIRILSDNISKLPIKIYQKKDGIKRVPDHY 77 (409) T ss_pred Cc-ccccccCcCCCCCCChHHHHHHh--cCCcCcceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCeeeccCch Confidence 87 556776665432222 2222221 234456788999999999999999999999999999774 378 Q ss_pred HHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCce-----eEEEEeccCc Q lcl|NC_011801. 70 ITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKD-----LTYTVHFDDS 144 (386) Q Consensus 70 ~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~-----~~~~~~~~~~ 144 (386) ++++|+.+||++||+++||+.++.+++++||||++++|+..|.+++||||+|++|++..+.++.. ..|.+... T Consensus 78 l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~V~v~~~~~~~~~~~~~~~y~~~~~-- 155 (409) T protein:vir:10 78 LEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIKGLYPLKSDGMKIFVDDTGLLNSENNVWYLYTDD-- 155 (409) T ss_pred HHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCCceEEEEcCCccccccceEEEEEEeC-- Confidence 99999999999999999999999999999999999999999999999999999999998765432 33444322 Q ss_pred ccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHH Q lcl|NC_011801. 145 KRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENT 224 (386) Q Consensus 145 ~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~ 224 (386) .+....+++++|||+|+.. .++++|+||+..+..++....++++++.++|+||++|+++|++++ .+++++.+++ T Consensus 156 -~g~~~~~~~~evih~r~~~----~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-~l~~e~~~~~ 229 (409) T protein:vir:10 156 -LGQRHKFMSDEILHFKGLT----ADGLAGLSVIELLNHLIENGKSSETYLNNFFKNGLQVKGLVQYAG-DLNPEAEEVF 229 (409) T ss_pred -CceeEEeccccEEEecCcC----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCC-CCCHHHHHHH Confidence 3456789999999998542 467899999999999999999999999999999999999999875 6899999999 Q ss_pred HHHHHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCc--CcccHHHHHHHHHH Q lcl|NC_011801. 225 RQSFEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQ--DAQSNITMIRAFYQ 301 (386) Q Consensus 225 k~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~--~~~~~~~~~~~~~~ 301 (386) ++.|++.++| .|+|+++++++|++|++++.++.|+||+|++++++++||++|||||.+|+..+ ++++.+++.++|++ T Consensus 230 ~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~e~~~~~f~~ 309 (409) T protein:vir:10 230 KENFERMSSGLKNAHRIAMLPIGYKFEPISQKLVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRATHSNITEQNREFYI 309 (409) T ss_pred HHHHHHHhccccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccccHHHHHHHHHH Confidence 9999999988 67899999999999999999999999999999999999999999999998754 45567899999999 Q ss_pred HHHHHHHHHHHHHHHHhhh--------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCc Q lcl|NC_011801. 302 SSLSIYIKPIESELSQKLG--------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDE 373 (386) Q Consensus 302 ~~l~P~~~~ie~~l~~~l~--------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~ 373 (386) +||+|++++||++|+++|+ .+++||++.+++.|.+++++++++++++|+||+||+|+++|++|+ |++|... T Consensus 310 ~~l~P~~~~ie~~ln~kL~~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~-~ggD~~~ 388 (409) T protein:vir:10 310 DTLQSILNMYELEINYKLFLISEIKNGFYSKFNVDTILRADIKTRYESYKEAIQNGFKTPNEIRELEEDEPL-EGGDVLL 388 (409) T ss_pred HHHHHHHHHHHHHHHHhhcCchhccCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCcCeee Confidence 9999999999999999885 468999999999999999999999999999999999999998776 3332211 Q ss_pred -ccc--cccc----CCCCCC Q lcl|NC_011801. 374 -GTN--LLDN----TKNIND 386 (386) Q Consensus 374 -~~~--~~~~----~~~~~~ 386 (386) ..+ ++.. ...+.+ T Consensus 389 ~~~n~~~~~~~~~~~~kgGe 408 (409) T protein:vir:10 389 INGNMIPVKMAGEQYSKGGE 408 (409) T ss_pred eccCccchhhccccccccCC Confidence 111 1110 011111 No 30 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=100.00 E-value=3.9e-85 Score=483.37 Aligned_cols=370 Identities=19% Similarity=0.182 Sum_probs=312.1 Q ss_pred chhhhhccccc--cCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec-----------ch Q lcl|NC_011801. 2 AFLSNLFKRQK--MLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN-----------AQ 68 (386) Q Consensus 2 g~~~~l~~~~~--~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~-----------~~ 68 (386) =||+|.+.+.. ........+....+...+.++..||.+.||++++|++||++||++||++|++++ +| T Consensus 1 ~~~~r~~~~~~~~~~~~~~~~~~~~~g~~~s~~~~~vt~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~~~ 80 (419) T protein:vir:14 1 MFFSRQLLSNLGQTQMSAGGWVSALLGSSRSDSGQVVTPASALALTVLQNCVTLLAESIAQLPIELYERSGEDRKPATDH 80 (419) T ss_pred CcccccccccccccccCcchhhHHhhcCCCccCCcccchHHhhccHHHHHHHHHHHHhhccCceEEEEecCCcccccccc Confidence 35555543332 223333445556666667788999999999999999999999999999999875 47 Q ss_pred hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccce Q lcl|NC_011801. 69 PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSG 148 (386) Q Consensus 69 ~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~ 148 (386) +++++|+.+||++||+++||+.++.+++++||||+++.|+..|.+++||||+|+.|++..+.++.. .|.+... T Consensus 81 ~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~l~pl~~~~v~v~~~~~~~~-~y~~~~~------ 153 (419) T protein:vir:14 81 PLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGVIQGLYPLDNEAVTVMRGSDLKP-VYRVRGS------ 153 (419) T ss_pred HHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCceE-EEEEccC------ Confidence 899999999999999999999999999999999999999999999999999999999998877654 3433321 Q ss_pred eEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCC---CCCHHHHHHHH Q lcl|NC_011801. 149 DFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNA---TLGKEAKENTR 225 (386) Q Consensus 149 ~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~---~~~~~~~~~~k 225 (386) ..++.++|+|+++. +.++++|+||+..+..++....+++++..++|+||++|+++|++++. ..++++.++++ T Consensus 154 -~~~~~~~i~h~~~~----~~dg~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~ 228 (419) T protein:vir:14 154 -DPMPQRLVHHVRWM----SINGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQASVDRIT 228 (419) T ss_pred -cccchhheeEecCc----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHH Confidence 23788999999864 35789999999999999999999999999999999999999998753 34789999999 Q ss_pred HHHHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC--cccHHHHHHHHHHH Q lcl|NC_011801. 226 QSFEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD--AQSNITMIRAFYQS 302 (386) Q Consensus 226 ~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~--~~~~~~~~~~~~~~ 302 (386) +.|++.++| .|+|+++++++|++|+++++++.|+||+|++++++++||++|||||.+|+..+. +++.|++.++|+++ T Consensus 229 ~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~~E~~~~~f~~~ 308 (419) T protein:vir:14 229 DGWNAKFGGSGNAKKVALLQEGMTFRPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIY 308 (419) T ss_pred HHHHHHhcCccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHH Confidence 999999988 688999999999999999999999999999999999999999999999987543 45668999999999 Q ss_pred HHHHHHHHHHHHHHHhhh-------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCccc Q lcl|NC_011801. 303 SLSIYIKPIESELSQKLG-------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGT 375 (386) Q Consensus 303 ~l~P~~~~ie~~l~~~l~-------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~ 375 (386) ||.|++++||++|+++|+ .+++||++.+++.|.+++++++++++++|+||+||+|+++|++|+ |++|.. - T Consensus 309 ~L~P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~-~gGD~~--~ 385 (419) T protein:vir:14 309 TLLPWVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPV-KGGDIY--L 385 (419) T ss_pred HHHHHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCcCee--e Confidence 999999999999999985 458999999999999999999999999999999999999998776 343321 1 Q ss_pred cccc----------------cCCCCCC Q lcl|NC_011801. 376 NLLD----------------NTKNIND 386 (386) Q Consensus 376 ~~~~----------------~~~~~~~ 386 (386) .++. ++.+..+ T Consensus 386 ~~~n~~~~~~~~~~~~~~~~~~~~~~~ 412 (419) T protein:vir:14 386 SPMNMVDASKPQQLPVGKSEPTKAAID 412 (419) T ss_pred eccccccccccccccCCCCCCcccccc Confidence 1110 0000000 No 31 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=100.00 E-value=1.4e-84 Score=480.25 Aligned_cols=372 Identities=20% Similarity=0.300 Sum_probs=317.4 Q ss_pred chhhhhccccccCCcc----chhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec----------- Q lcl|NC_011801. 2 AFLSNLFKRQKMLSGS----SPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN----------- 66 (386) Q Consensus 2 g~~~~l~~~~~~~~~~----~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~----------- 66 (386) =||+|+|+++...... .+.....++.....++..|+...++++++|++||++||++||++|++++ T Consensus 1 m~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~Ia~~ia~l~~~~~~~~~~~~~~~~ 80 (416) T protein:vir:12 1 MLLERMFEKRSGSSDHEDGFNNILLNMFGGRKTASGERVSESNSLVQPDIFACVNVLSDDIAKLPIHTYKRTDGGIERKP 80 (416) T ss_pred CccchhcccccCccccCccchhHHHHhhcCcccccCceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCcccccc Confidence 3788899776544322 2233344555566778889999999999999999999999999999764 Q ss_pred chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCccc Q lcl|NC_011801. 67 AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKR 146 (386) Q Consensus 67 ~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~ 146 (386) .|+++++|+.+||++||+++||+.++.+++++||||+++.|+..|.+.+||||+|++|++..+.++..++|.+... T Consensus 81 ~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~---- 156 (416) T protein:vir:12 81 EHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYPEALFPLRPDYTNAYVHPTTGMLWYQTVLN---- 156 (416) T ss_pred ccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEeCCCcEEEEEEecC---- Confidence 3789999999999999999999999999999999999999999999999999999999999988888888777543 Q ss_pred ceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHH Q lcl|NC_011801. 147 SGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQ 226 (386) Q Consensus 147 ~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~ 226 (386) +..+.+++++|+|+++. +.++++|+||+.++..++....+++++..++|+||+.|+++|+++. .+++++.+++++ T Consensus 157 g~~~~~~~~eiih~~~~----~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-~~~~e~~~~~~~ 231 (416) T protein:vir:12 157 GKAIELYDYEVLHFKGL----STDGIHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVPA-FLDEKPKENVRK 231 (416) T ss_pred CeEEEecCccEEEecCc----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCC-CCCHHHHHHHHH Confidence 45578999999999854 3467899999999999999999999999999999999999999975 689999999999 Q ss_pred HHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCc--CcccHHHHHHHHHHHHH Q lcl|NC_011801. 227 SFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQ--DAQSNITMIRAFYQSSL 304 (386) Q Consensus 227 ~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~--~~~~~~~~~~~~~~~~l 304 (386) .|+... ++++++++++|++|+++++++.|+||+|.+++++++||++|||||.+|+..+ ++++.+++.++|+++|| T Consensus 232 ~~~~~~---~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l 308 (416) T protein:vir:12 232 EWKRVN---KVENIAIIDYGLEYQSISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKATFSNIEHQSIEYVRNTL 308 (416) T ss_pred HHHHHh---cCCCeeecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHHHHHHHHHHHH Confidence 998754 4577999999999999999999999999999999999999999999998654 45566889999999999 Q ss_pred HHHHHHHHHHHHHhhh--------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCc-cc Q lcl|NC_011801. 305 SIYIKPIESELSQKLG--------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDE-GT 375 (386) Q Consensus 305 ~P~~~~ie~~l~~~l~--------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~-~~ 375 (386) .|++++||++|+++|+ .+++||++.+++.|.+++++++++++++|+||+||+|+++|++|+ |++|... .. T Consensus 309 ~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~Pi-~ggd~~~~~~ 387 (416) T protein:vir:12 309 QPWIVNFEQELNVKLFLDHDQKSGHYVKFNIDSELRGDSKTQAEYLKTLHETGVLNKDEIRELLERNPI-ENGDKYISSL 387 (416) T ss_pred HHHHHHHHHHHHHhhcCchhhcCCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCcceeeecc Confidence 9999999999999984 468999999999999999999999999999999999999998886 3333211 00 Q ss_pred c--------ccccC------CCCCC Q lcl|NC_011801. 376 N--------LLDNT------KNIND 386 (386) Q Consensus 376 ~--------~~~~~------~~~~~ 386 (386) + .++.+ +++.+ T Consensus 388 n~~~~~~~~~~~~~~~~~~~~gge~ 412 (416) T protein:vir:12 388 NYVFLDFLEEYQRLKAGGAMKGGDN 412 (416) T ss_pred ccccccccchhhccccccccCCCCC Confidence 0 11111 11111 No 32 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=100.00 E-value=2.2e-84 Score=479.20 Aligned_cols=368 Identities=20% Similarity=0.295 Sum_probs=306.7 Q ss_pred Cch-hhhhcccccc-------------CCcc-chhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceee Q lcl|NC_011801. 1 MAF-LSNLFKRQKM-------------LSGS-SPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVT 65 (386) Q Consensus 1 Mg~-~~~l~~~~~~-------------~~~~-~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~ 65 (386) |.= |.++..+... ...+ ..++... ......++..|+.+.||++++|++||++||++||++|+++ T Consensus 1 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~~~g~~v~~~~al~~~~V~~~i~~ia~~ia~lp~~~ 79 (434) T protein:vir:43 1 MSKSLGKVLSSATSAPRSSLFGWGGKTIRLTDGAFWSQF-LGRESSSGKKVTVDKAMKLSAVWACVRLISTSVAGLPLGV 79 (434) T ss_pred CccchhhhhhhcccccchhhhcccccccccCchHHHHHH-hcCCccCCceechhhhhccHHHHHHHHHHHHhhhhCceEE Confidence 321 1112111111 1111 2222222 3335557788999999999999999999999999999987 Q ss_pred c------------chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCc Q lcl|NC_011801. 66 N------------AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGK 133 (386) Q Consensus 66 ~------------~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~ 133 (386) + +|+++++|+.+||++||+++||+.++.+++++||||+++.++ .|++++||||+|+.|++..+.++. T Consensus 80 ~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~-~G~~~~L~~l~p~~v~~~~~~~g~ 158 (434) T protein:vir:43 80 YERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRA-AGRPAALDFLLPSRVDLECDENGR 158 (434) T ss_pred EEEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEcCCCe Confidence 4 468999999999999999999999999999999999998876 799999999999999999888766 Q ss_pred eeEEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCC Q lcl|NC_011801. 134 DLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPN 213 (386) Q Consensus 134 ~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~ 213 (386) ..++ +... .+..+.+++++|+|+++. +.++++|+||+..+..++....+++++..++|+||++|++++++++ T Consensus 159 ~~y~-~~~~---~g~~~~~~~~eVih~~~~----~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~ 230 (434) T protein:vir:43 159 LKYF-YTTK---KGARREIERTNMLHIPAF----TLDGRIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDR 230 (434) T ss_pred EEEE-EEec---CceEEEEccccEEEecCc----CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCC Confidence 5443 3322 345678999999999853 3578899999999999999999999999999999999999999975 Q ss_pred CCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC----c Q lcl|NC_011801. 214 ATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD----A 289 (386) Q Consensus 214 ~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~----~ 289 (386) .+++++.+++++.|++..++.|+|+++++++|++|+++++++.|+||+|++++++++||++|||||.+||..+. + T Consensus 231 -~l~~e~~~~~r~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~ 309 (434) T protein:vir:43 231 -ILQPAQREEFREYVKSVSGAMNSGRSPVLEQGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWG 309 (434) T ss_pred -CCCHHHHHHHHHHHHHhcCccccCCccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCcccc Confidence 68999999999999987777899999999999999999999999999999999999999999999999987653 3 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhcc Q lcl|NC_011801. 290 QSNITMIRAFYQSSLSIYIKPIESELSQKLG-------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKN 362 (386) Q Consensus 290 ~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~-------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~ 362 (386) ++.+++..+|+++||.|++.+||++|+++|+ .+++||++.+++.|.+++++++.+++++||||+||+|+++|+ T Consensus 310 s~~e~~~~~f~~~~L~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl 389 (434) T protein:vir:43 310 TGLEQQMLAFLTFSISSITNQIQQCVNKRLLTAPERIRYYAEFSLEGFLKADSAGRAAWYSTMAQNGFMTRNEGRRKENL 389 (434) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhhcCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Confidence 5568889999999999999999999999985 468999999999999999999999999999999999999998 Q ss_pred CCcCCCCCCCccccccccCCCCCC Q lcl|NC_011801. 363 RGVFPELDLDEGTNLLDNTKNIND 386 (386) Q Consensus 363 ~p~~p~~~~~~~~~~~~~~~~~~~ 386 (386) +|+ |++|. +..++|... T Consensus 390 ~p~-~ggD~------~~~~~n~~~ 406 (434) T protein:vir:43 390 PEL-PGGDI------LTVQSNLVP 406 (434) T ss_pred CCC-CCCCe------EeeccCccc Confidence 775 33321 222222111 No 33 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=100.00 E-value=2.3e-84 Score=479.15 Aligned_cols=367 Identities=20% Similarity=0.197 Sum_probs=311.7 Q ss_pred Cchhhhhccccc-cCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec-----------ch Q lcl|NC_011801. 1 MAFLSNLFKRQK-MLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN-----------AQ 68 (386) Q Consensus 1 Mg~~~~l~~~~~-~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~-----------~~ 68 (386) |-|-++..+... ..+.....+...++...+.++..||.+.||++++|++||++||++||++|++++ +| T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~g~~~s~~~~~v~~~~al~~~~v~~cv~~ia~~ia~lp~~~~~~~~~~~~~~~~~ 80 (419) T protein:vir:80 1 MFFSRQLLSNLGQTQPGSGGWVSALLGSARSEAGQVVTPASALSLTVLQNCVTLLAESIAQLPVELYERSGDDRKPATDH 80 (419) T ss_pred CCcccccccccCcCCCCcchhhHHhhcccccccCcccChHHhhccHHHHHHHHHHHHhhccCceEEEEecCCCccccccc Confidence 755543322222 223334444555666667788999999999999999999999999999999874 47 Q ss_pred hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccce Q lcl|NC_011801. 69 PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSG 148 (386) Q Consensus 69 ~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~ 148 (386) +++++|+.+||++||+++||+.++.+++++||||++++|+..|++.+||||+|+.|++..+.++... |.+. + T Consensus 81 ~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~i~~~~v~i~~~~~~~~~-y~~~-------~ 152 (419) T protein:vir:80 81 PLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDGVIQGLYPLDNEAVTVMKGPDLKPM-YRVA-------G 152 (419) T ss_pred HHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCceEE-EEEc-------C Confidence 8999999999999999999999999999999999999999999999999999999999988776543 3322 1 Q ss_pred eEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCC---CCCCHHHHHHHH Q lcl|NC_011801. 149 DFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPN---ATLGKEAKENTR 225 (386) Q Consensus 149 ~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~---~~~~~~~~~~~k 225 (386) ...++.++|+|+++. +.++++|+||+..+..++....+++++..++|+||++|+++|++++ ...++++.++++ T Consensus 153 ~~~~~~~~i~h~~~~----~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~ 228 (419) T protein:vir:80 153 ADPLPQRLVHHVRWM----SINGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPTDAPALKDQASVDRIT 228 (419) T ss_pred ccccchhheEEecCC----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCCCCcccCHHHHHHHH Confidence 124889999999863 4578999999999999999999999999999999999999999864 345899999999 Q ss_pred HHHHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC--cccHHHHHHHHHHH Q lcl|NC_011801. 226 QSFEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD--AQSNITMIRAFYQS 302 (386) Q Consensus 226 ~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~--~~~~~~~~~~~~~~ 302 (386) +.|++.++| .|+|+++++++|++|+++++++.|+||+|.+++++++||++|||||.+||..+. +++.+++.++|+++ T Consensus 229 ~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~~~~f~~~ 308 (419) T protein:vir:80 229 DGWNAKFGGSGNAKKVALLQEGMKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIY 308 (419) T ss_pred HHHHHHhcCccccCCceecCCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHH Confidence 999999988 588999999999999999999999999999999999999999999999987544 45668999999999 Q ss_pred HHHHHHHHHHHHHHHhhh-------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCccc Q lcl|NC_011801. 303 SLSIYIKPIESELSQKLG-------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGT 375 (386) Q Consensus 303 ~l~P~~~~ie~~l~~~l~-------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~ 375 (386) ||.|++++||++|+++|+ .+++||++.+++.|.+++++.+++++++|+||+||+|+++|++|+ |++| +. T Consensus 309 ~l~P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~-~gGD--~~- 384 (419) T protein:vir:80 309 TLLPWVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPV-KGGD--IY- 384 (419) T ss_pred HHHHHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCcc--ee- Confidence 999999999999999985 358999999999999999999999999999999999999998776 3332 21 Q ss_pred cccccCCCCCC Q lcl|NC_011801. 376 NLLDNTKNIND 386 (386) Q Consensus 376 ~~~~~~~~~~~ 386 (386) ..+.|-.+ T Consensus 385 ---~~~~n~~~ 392 (419) T protein:vir:80 385 ---LSPMNMVD 392 (419) T ss_pred ---eecccccc Confidence 11111111 No 34 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=100.00 E-value=2.4e-84 Score=479.01 Aligned_cols=360 Identities=17% Similarity=0.183 Sum_probs=302.3 Q ss_pred Cchhhhhccccc---------------------------cCCcc--chhhh--hhc---ccccccCcccccHHHHhccHH Q lcl|NC_011801. 1 MAFLSNLFKRQK---------------------------MLSGS--SPVWI--LNQ---GQPVSIKPKAITSAIALKNSD 46 (386) Q Consensus 1 Mg~~~~l~~~~~---------------------------~~~~~--~~~~~--~~~---~~~~~~~~~~i~~~~a~~~~~ 46 (386) ||||+++|+-.. ..... .+... ... ..+...++..++.+.++++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~t~~~~~~~~~ 80 (409) T protein:vir:83 1 MGFWSNLFGIPSIPDLPNDNGPVDYNPGDPDMVEFRGPEEEPEARALPWIRPTAWSGYPESWATPSWGSAQDKLRTLIDV 80 (409) T ss_pred CchhhhhcccccCCCcccccccccccCCCCceeeccCCCcchhhhhcccccccccccccccccccCccccchhhHhhhHH Confidence 999999998411 00000 00000 000 112233566789999999999 Q ss_pred HHHHHHHHHHhhccCceeecc-----hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEE-eecCCCceEEEEEEc Q lcl|NC_011801. 47 VYAVISRVSSDIAGCRFVTNA-----QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAII-DRDTNGYPVRIEPVP 120 (386) Q Consensus 47 v~~~v~~ia~~ia~~p~~~~~-----~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~-~~~~~g~~~~l~~l~ 120 (386) |++||++||++||++|+++++ +.+..+|+.+||++||+++|++.++.++++ ||+|+++ .++.+|.+++||||+ T Consensus 81 v~acV~~Ia~~iA~lpl~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~l~~~lll-Gnay~~~i~r~~~G~~~~L~pl~ 159 (409) T protein:vir:83 81 AWACIDLNASVLSSMPIYRMRNGRIIDSVAWMSNPDPEVYTSWQEFAKQLFWDFQL-GEAFVLPMAHGSDGYPIRFRVVP 159 (409) T ss_pred HHHHHHHHHHhhccCceEEeeCCccccchhhhcccCCCCCCCHHHHHHHHHHHHhh-CCcEEEEEEECCCCcEEEEEEEC Confidence 999999999999999998875 356778999999999999999999999987 9999875 589999999999999 Q ss_pred CcceEEeecCCCceeEEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011801. 121 NEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLR 200 (386) Q Consensus 121 ~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ 200 (386) |+.|++..+.++. ..|.+.. ...+++|+|+|+.. +.++++|+||+..+..++....++++++.++|+ T Consensus 160 p~~v~v~~~~~g~-~~y~~~~---------~~~~~eiiHir~~~---~~~~~~G~spi~~~~~~i~~~~a~~~~~~~~f~ 226 (409) T protein:vir:83 160 PWLVNVELKKGAR-REYRIGG---------LNVTDEILHIRYQG---NTADAHGHGPLESAAPRQVVIGLLQKYVQNLAE 226 (409) T ss_pred CcceEEEEcCCce-EEEEEcc---------ccCccceEEeCCCC---CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHh Confidence 9999998887754 3343321 13358999998653 456789999999999999999999999999999 Q ss_pred ccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceee-eccCChhhHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_011801. 201 HAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVE-TTNISPNVTEFLQNVSFSQDQIAKAFGIP 279 (386) Q Consensus 201 ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~-~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp 279 (386) ||++|++++++++ .+++++.+++++.|++.+.+ |+|+++++.+|+++. +++++++|+||+|++++++++||++|||| T Consensus 227 nga~p~gil~~~~-~ls~e~~~~~~~~~~~~~~~-nag~~~il~~g~~~~~~~~~s~~d~q~le~r~~~~~eIa~~fgVP 304 (409) T protein:vir:83 227 TGGVPLYWLGVER-RLSETEAVDLMDRWIESRSK-YAGHPALVTGGATLNQAKSMSAQDLSLMELTQFNEARIAILLGVP 304 (409) T ss_pred cCCCcceEeecCC-CCCHHHHHHHHHHHHHhhCC-ccCccceecCCcccccccCCCHHHHHHHHHHHhhHHHHHHHhCCC Confidence 9999999999875 78999999999999987765 788899999999874 68999999999999999999999999999 Q ss_pred HHHhcCCcC-----cccHHHHHHHHHHHHHHHHHHHHHHHHHHhh---hhhhhhcchhhhccCHHHHHHHHHHHHhCCCc Q lcl|NC_011801. 280 ADYLSGKQD-----AQSNITMIRAFYQSSLSIYIKPIESELSQKL---GTDVKLDIASAIDSDNSELINNVQKLASAGVL 351 (386) Q Consensus 280 ~~~l~~~~~-----~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l---~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~ 351 (386) |.+||..+. ++|.|++..+|+++||.|++++||++|+++| +.+++||++.+++.|.+++++++++++++||| T Consensus 305 p~llg~~~~~~~~tysn~eq~~~~f~~~tL~P~~~~ie~~l~~~Ll~~~~~~~f~~~~llr~d~~~r~~~~~~~~~~G~l 384 (409) T protein:vir:83 305 PFLVGLPGATGSLTYSNIEQLFSFHDRSSLRPKATAVMAALDRWALPSPQHLELNRDDYTRPSLVERATAYKIMIEAGVM 384 (409) T ss_pred HHHccCCCCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEeehhhhhccCHHHHHHHHHHHHhCCCc Confidence 999986432 4667999999999999999999999999998 46789999999999999999999999999999 Q ss_pred CHHHHHHHhccCCcCCCCCCCcccc Q lcl|NC_011801. 352 APIQAQKLLKNRGVFPELDLDEGTN 376 (386) Q Consensus 352 t~nE~R~~lg~~p~~p~~~~~~~~~ 376 (386) |+||+|+++|++|...+++..++|. T Consensus 385 T~NE~R~~~glpp~~ggd~l~~~gv 409 (409) T protein:vir:83 385 EPNEARAMERLHSEAAAVRLSGGGV 409 (409) T ss_pred CHHHHHHHhCCCCCCCCcccCCCCC Confidence 9999999999887655555555554 No 35 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=100.00 E-value=1.7e-83 Score=474.37 Aligned_cols=378 Identities=17% Similarity=0.260 Sum_probs=307.5 Q ss_pred chhhhhccccccCCccc--hhhhhhcccccccCcc------cccHHHHhccHHHHHHHHHHHHhhccCceeecc------ Q lcl|NC_011801. 2 AFLSNLFKRQKMLSGSS--PVWILNQGQPVSIKPK------AITSAIALKNSDVYAVISRVSSDIAGCRFVTNA------ 67 (386) Q Consensus 2 g~~~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~~------~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~------ 67 (386) -+|.+ +..-+.+... ..+........+..+. .+....|+++++|++||++||++||++|+++++ T Consensus 1 ~~~~~--~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~V~acV~~IA~~iA~lp~~l~~~~~~~~ 78 (518) T protein:vir:78 1 MLLAN--GQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE 78 (518) T ss_pred CcccC--ceeeccchhhhhhhhhhhcccccceeceecccccchhhHHhhhhHHHHHHHHHHHHhhccCceEEEEEcCCcc Confidence 12222 1111112111 1111222222222332 234456889999999999999999999998853 Q ss_pred ----hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccC Q lcl|NC_011801. 68 ----QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDD 143 (386) Q Consensus 68 ----~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~ 143 (386) ++.+.+|+.+||++||+++||+.++.+++++||||+++.|+..|.+++||||+|++|++..+.+.....|.+.... T Consensus 79 ~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~Vtv~~~~~~~~~~y~~~~~~ 158 (518) T protein:vir:78 79 TEEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGA 158 (518) T ss_pred ccccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEECCCceEEEEcCCCCEEEEEEEecC Confidence 3456667789999999999999999999999999999999999999999999999999999988888888887776 Q ss_pred cccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHH Q lcl|NC_011801. 144 SKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKEN 223 (386) Q Consensus 144 ~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~ 223 (386) ...+..+.+++++|||+++.. +.+..+|+||+..+..++....++++++.++|+||++|+++|++++ .+++++.++ T Consensus 159 ~~~~~~~~~~~~eIiHir~~~---~dg~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~-~ls~e~~~~ 234 (518) T protein:vir:78 159 GVGTQLVSFADDEVVPIRFFN---PDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEK-RLSPEAQQR 234 (518) T ss_pred CccceeEEecCCcEEEecCCC---CCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCC-CCCHHHHHH Confidence 666667789999999999653 2233589999999999999999999999999999999999999875 689999999 Q ss_pred HHHHHHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC--cccHHHHHHHHH Q lcl|NC_011801. 224 TRQSFEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD--AQSNITMIRAFY 300 (386) Q Consensus 224 ~k~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~--~~~~~~~~~~~~ 300 (386) +++.|++.++| .|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..+. +++.+++...|+ T Consensus 235 ~k~~~~~~~~G~~nag~~~vL~~G~~~~~l~~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~st~sn~e~~~~~f~ 314 (518) T protein:vir:78 235 LREQFDRAHAGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFY 314 (518) T ss_pred HHHHHHHHhcCcccCCceeEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHHHH Confidence 99999999988 689999999999999999999999999999999999999999999999987654 456688999999 Q ss_pred HHHHHHHHHHHHHHHHHhhh------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCcc Q lcl|NC_011801. 301 QSSLSIYIKPIESELSQKLG------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEG 374 (386) Q Consensus 301 ~~~l~P~~~~ie~~l~~~l~------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~ 374 (386) ++||+|++.+||++|+++|+ .+++||++.+++.|.+++++++++++++||||+||+|+++|++|++ ++++++. T Consensus 315 ~~tL~P~~~~ie~eln~~L~~~~~~~~~~~fd~~~Llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~gl~pie-~~~gD~~ 393 (518) T protein:vir:78 315 RDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSD-DPKADEL 393 (518) T ss_pred HHHHHHHHHHHHHHHHHhhcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC-CCCCcee Confidence 99999999999999999985 3679999999999999999999999999999999999999988764 2222221 Q ss_pred -----ccccccCCCCCC Q lcl|NC_011801. 375 -----TNLLDNTKNIND 386 (386) Q Consensus 375 -----~~~~~~~~~~~~ 386 (386) -.++..+.++.. T Consensus 394 ~v~~n~~pl~~~~~~~~ 410 (518) T protein:vir:78 394 YANSALQPLGATPDGAV 410 (518) T ss_pred eecccceeccccccccc Confidence 111111111100 No 36 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=100.00 E-value=3.2e-83 Score=472.82 Aligned_cols=378 Identities=25% Similarity=0.393 Sum_probs=327.8 Q ss_pred CchhhhhccccccCCccchhhhh---hcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeecchhHHHHHhcc Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWIL---NQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTNAQPITDVLNAP 77 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~---~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~~~~~~~l~~~ 77 (386) ||||+++++.+.....+...+.. .........+..++.+.|+++|+|++||++||+++|++|++++++....++ .+ T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l~-~~ 79 (386) T protein:vir:48 1 MPIFNITNLATESPPISQGGFFDITDPDFLSTLNGSEWVSAESALRNSDLFSIINQLSNDLATVKLTASRKQLQGII-DN 79 (386) T ss_pred CcccccccccccccccccccccccccchhcccccCCceechhhhhcchHHHHHHHHHHHhhccCceeeccchhHHHh-hc Confidence 99999877655443332221111 111233456677999999999999999999999999999999988776555 77 Q ss_pred CcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEEcccce Q lcl|NC_011801. 78 LGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYDSSEV 157 (386) Q Consensus 78 PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 157 (386) ||++||+++||+.++.+++++||||++++|+..|++++||||+|+.|++..+.++..++|.+.......+..+.+++++| T Consensus 80 pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~~~~ev 159 (386) T protein:vir:48 80 PSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDGIYYNITFDDPRIPPKQHVPQGDV 159 (386) T ss_pred CCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEecCceeEEEEcCCCceEEEEEEecCccccceeEecCccE Confidence 99999999999999999999999999999999999999999999999999999998889988887777777788999999 Q ss_pred eeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhccccc Q lcl|NC_011801. 158 IHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENA 237 (386) Q Consensus 158 ih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~ 237 (386) +|+++. .+.++++|+||+..+..++....++++++.++|+||++|+++|+.++ .+++++.+++++.|.+.. .++ T Consensus 160 ih~~~~---~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~-~~~~e~~~~~~~~~~~~~--~n~ 233 (386) T protein:vir:48 160 LHFKLL---SVDGGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKG-GGLLDFKTKLSRSRQAMK--QMQ 233 (386) T ss_pred EEecCC---CCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-CCCHHHHHHHHHHHHHhh--cCC Confidence 999864 34456899999999999999999999999999999999999999876 578999999999887643 567 Q ss_pred CcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 238 GRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYIKPIESELSQ 317 (386) Q Consensus 238 g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~~~ie~~l~~ 317 (386) |+++++++|++|++++++++|+||+|++++++++||++|||||.+||..+++++++++.++|++.||.|+++.||++|++ T Consensus 234 g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~e~~~~~~~~~~l~P~~~~ie~~l~~ 313 (386) T protein:vir:48 234 GGPLVLDDLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSLDLYNKAVSRYLRPFLSELSQ 313 (386) T ss_pred CCceecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 89999999999999999999999999999999999999999999999888888899999999999999999999999999 Q ss_pred hhhhhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCccccccccCC-CCCC Q lcl|NC_011801. 318 KLGTDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTNLLDNTK-NIND 386 (386) Q Consensus 318 ~l~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~~~~~~~-~~~~ 386 (386) +|+.++++|+...++.|...++..+++++++|++|+||+|+++|+.|++| ++.+++.....++. ++.+ T Consensus 314 ~l~~~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~-~~~~~~~~~~~~~~~gGd~ 382 (386) T protein:vir:48 314 KLSCDVDADILPAVDPTGSNSVSRINSMVKSGTLAQNQGLYILQQAEILP-KELPEGENPNKTTLKGGEI 382 (386) T ss_pred hhcchhhcchhhhhccChHHHHHHHHHHHhCCCcCHHHHHHHhhcCCCCC-ccchhhcCCCCCccCCCCC Confidence 99999999999999999999999999999999999999999999988876 45555444333232 2222 No 37 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=100.00 E-value=5.1e-83 Score=471.74 Aligned_cols=378 Identities=17% Similarity=0.260 Sum_probs=306.6 Q ss_pred chhhhhccccccCCcc---chhhhhhcccccccCcc------cccHHHHhccHHHHHHHHHHHHhhccCceeecc----- Q lcl|NC_011801. 2 AFLSNLFKRQKMLSGS---SPVWILNQGQPVSIKPK------AITSAIALKNSDVYAVISRVSSDIAGCRFVTNA----- 67 (386) Q Consensus 2 g~~~~l~~~~~~~~~~---~~~~~~~~~~~~~~~~~------~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~----- 67 (386) -+|. +......++ ...+......+.+..+. .+....|+++++|++||++||++||++|+++++ T Consensus 1 ~~~~---~~~~~~~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~V~acV~~IA~~iA~lpl~l~~~~~~~ 77 (518) T protein:vir:10 1 MLLA---NGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDT 77 (518) T ss_pred Cccc---CceeecCchhhhhhhhhhcccccccccceecccccchhhHHHhhhHHHHHHHHHHHHhhccCceEEEEEcCCC Confidence 1111 122211111 11111121122222222 234456889999999999999999999998753 Q ss_pred -----hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEecc Q lcl|NC_011801. 68 -----QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFD 142 (386) Q Consensus 68 -----~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~ 142 (386) ++.+.+|+.+||++||+++||+.++.+++++||||++++|+..|++++||||+|+.|++..+.+...+.|.+... T Consensus 78 ~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~v~v~~~~~~~~~~y~~~~~ 157 (518) T protein:vir:10 78 ETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAG 157 (518) T ss_pred ceeccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCCceEEEEcCCCCEEEEEEEec Confidence 344566778999999999999999999999999999999999999999999999999999998888888888777 Q ss_pred CcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHH Q lcl|NC_011801. 143 DSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKE 222 (386) Q Consensus 143 ~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~ 222 (386) ....+..+.+++++|||+|+.. +.+..+|+||+..+..++....++++++.++|+||++|+++|+.++ .+++++.+ T Consensus 158 ~~~~~~~~~~~~~eViHir~~s---~dg~~~G~spi~~a~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~-~ls~e~~~ 233 (518) T protein:vir:10 158 AGVGTQLVSFADDEVVPIRFFN---PDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEK-RLSEAAQQ 233 (518) T ss_pred CCccceEEEecCCcEEEecCCC---CCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCC-CCCHHHHH Confidence 6656666789999999998653 2333589999999999999999999999999999999999999875 68999999 Q ss_pred HHHHHHHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC--cccHHHHHHHH Q lcl|NC_011801. 223 NTRQSFEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD--AQSNITMIRAF 299 (386) Q Consensus 223 ~~k~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~--~~~~~~~~~~~ 299 (386) ++++.|++.++| .|+|+++||++|++|++++++++|+||+|.+++++++||++|||||.+||..+. +++.+++.+.| T Consensus 234 ~~k~~~~~~~~G~~nag~v~vL~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t~sn~eq~~~~f 313 (518) T protein:vir:10 234 RLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAF 313 (518) T ss_pred HHHHHHHHHhcCccccCcceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHHH Confidence 999999999988 689999999999999999999999999999999999999999999999987654 45668899999 Q ss_pred HHHHHHHHHHHHHHHHHHhhh------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcC-CCCCCC Q lcl|NC_011801. 300 YQSSLSIYIKPIESELSQKLG------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVF-PELDLD 372 (386) Q Consensus 300 ~~~~l~P~~~~ie~~l~~~l~------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~-p~~~~~ 372 (386) +++||.|++.+||++|+++|+ .+++||++.+++.|.+++++++++++++||||+||+|+++|++|++ |++|.. T Consensus 314 ~~~tL~P~l~~ie~~ln~~L~~~~~~~~~~~fd~~~llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~Gl~pie~~~gD~~ 393 (518) T protein:vir:10 314 YRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADEL 393 (518) T ss_pred HHHHHHHHHHHHHHHHHHhhcccccCCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCee Confidence 999999999999999999985 4689999999999999999999999999999999999999988764 233211 Q ss_pred c---cccccccCCCCCC Q lcl|NC_011801. 373 E---GTNLLDNTKNIND 386 (386) Q Consensus 373 ~---~~~~~~~~~~~~~ 386 (386) . .-.++..+.++.. T Consensus 394 ~~~~n~~pl~~~~~~~~ 410 (518) T protein:vir:10 394 YANSALQPLGATPDGAV 410 (518) T ss_pred eecccceeccccccccc Confidence 0 0011111111110 No 38 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=100.00 E-value=7.1e-83 Score=470.94 Aligned_cols=369 Identities=25% Similarity=0.339 Sum_probs=297.7 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec---------chhHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN---------AQPIT 71 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~---------~~~~~ 71 (386) |+||+++.. .............+......+. ++...||++++||+||++||++||++|++++ .|+++ T Consensus 1 m~~~~~~~~---~~~~~~~~~~~~~~~~~~~~g~-~~~~~Al~~~~V~~cv~~ia~~iA~lp~~~~~~~~~~~~~~~~~~ 76 (417) T protein:vir:38 1 MKLFRGLAT---EVDPHWADHLLDSGVIPSFRGG-YLGISALRNSDVLTAVSIVSGDVSRFPLVITDSSTDEVIDLANIE 76 (417) T ss_pred Ccccccccc---CCCccchhhhcccccccccCCc-eechhhcccHHHHHHHHHHHHhhccCeeEEEEcCCcceeccchHH Confidence 999965321 1111111111122222233333 3445799999999999999999999999875 46889 Q ss_pred HHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecC-CCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeE Q lcl|NC_011801. 72 DVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDT-NGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDF 150 (386) Q Consensus 72 ~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~-~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 150 (386) ++|+.+||++||+++||+.++.+++++||||++++|+. .|.+..|||++|+.|++..+..+.. .|.+...+ .+... T Consensus 77 ~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~l~~l~p~~v~v~~~~~~~~-~y~~~~~~--~~~~~ 153 (417) T protein:vir:38 77 YLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEPAMFEFYAPSQTQVDTSDPDNI-IYRFTPYN--SSMQK 153 (417) T ss_pred HHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEEeCCceEEEEEcCCCeE-EEEEEEcC--CcEEE Confidence 99999999999999999999999999999999999986 4778999999999999987665543 45444332 34456 Q ss_pred EEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHH Q lcl|NC_011801. 151 LYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEE 230 (386) Q Consensus 151 ~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~ 230 (386) .+++++|+|+|+. +.++++|+||+.++..++.+..++++++.++|+||++|+++++.++ .+++++.++++++|++ T Consensus 154 ~~~~~dviH~r~~----~~d~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~-~l~~e~~~~~~~~~~~ 228 (417) T protein:vir:38 154 VCGFEDVIHWKFF----SYDTIMGRSPLLSLGDEIGLQESGVSTLQKFFKSGLKGSIIKAKES-RLSAEARQKIREDFER 228 (417) T ss_pred EecCcceEEecCC----CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCC-CCCHHHHHHHHHHHHH Confidence 7999999999853 3577899999999999999999999999999999999999998865 6899999999999999 Q ss_pred HhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 231 QTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYIKP 310 (386) Q Consensus 231 ~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~~~ 310 (386) .++|.|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||....+++.+++..+|+++||+|++++ T Consensus 229 ~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~s~~e~~~~~~~~~tl~P~~~~ 308 (417) T protein:vir:38 229 AQAGADAGSPIIVDATMDYQPLEVDTNVLNLINSNNYSTAQIAKALRVPAYRLAQNSPNQSVKQLADDYIRNDLPFYFEP 308 (417) T ss_pred HhcccccCCceeccCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHhCCCCcchhHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999977778888999999999999999999 Q ss_pred HHHHHHHhhhh-------hhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCc---------- Q lcl|NC_011801. 311 IESELSQKLGT-------DVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDE---------- 373 (386) Q Consensus 311 ie~~l~~~l~~-------~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~---------- 373 (386) ||++|+++|+. +++||.+.+.+.+ .+.+++++++|+||+||+|+++|++|+ |++++|. T Consensus 309 ie~~l~~~Ll~~~~~~~~~~~fd~~~l~~~~----~~~~~~~~~~G~~T~NE~R~~~gl~pi-~~g~~d~~~~~~n~~~~ 383 (417) T protein:vir:38 309 ITSEFELKLLDDAQRHQYCIGFDTKSVNGLP----IADVNTAVNGGLWTGNEGRAELGKKPL-KDPNMDRIQSTLNTVFL 383 (417) T ss_pred HHHHHHhhhcChhhcccceEEechhhhhHHH----HHHHHHHHhCCCcCHHHHHHHhCCCCC-CCCCCCeeeeccccccc Confidence 99999999964 4677776654333 334678899999999999999998876 4444432 Q ss_pred ----------------cccccccCCCCCC Q lcl|NC_011801. 374 ----------------GTNLLDNTKNIND 386 (386) Q Consensus 374 ----------------~~~~~~~~~~~~~ 386 (386) |++..+...++++ T Consensus 384 d~~~~~~~~~~~~~kgg~~~~~~~~~~~~ 412 (417) T protein:vir:38 384 DQKEAYQAEHAAELKGGDTNAKGNQNGSG 412 (417) T ss_pred ccccccccccccccCCCCCCCCCCCcCCC Confidence 2222222222222 No 39 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=100.00 E-value=9.2e-83 Score=470.32 Aligned_cols=374 Identities=29% Similarity=0.464 Sum_probs=310.4 Q ss_pred Cchhhhhc-cccccCCccch-hhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeecchhHHHHHhccC Q lcl|NC_011801. 1 MAFLSNLF-KRQKMLSGSSP-VWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTNAQPITDVLNAPL 78 (386) Q Consensus 1 Mg~~~~l~-~~~~~~~~~~~-~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~~~~~~~l~~~P 78 (386) ||||+++. .+.+......+ ................++.+.|+++++|++||++||+++|++|+++++|+...+|+ +| T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~v~~~~~~~ll~-~P 79 (385) T protein:vir:10 1 MGLLTPRNFNKRKAKNMVYPSNPAFFTTTVGGMQLSYVSALSALQNTNVYSVINRIASDVASAHFKTENTATLNRLE-SP 79 (385) T ss_pred CccccchhcccccccccccccchhhhhhhccccCccccCHHHhhccHHHHHHHHHHHHHHhhCceeeeccchhhhhh-cC Confidence 99998653 22222211111 11111222233455678999999999999999999999999999999999999885 69 Q ss_pred cccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEEccccee Q lcl|NC_011801. 79 GNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYDSSEVI 158 (386) Q Consensus 79 N~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vi 158 (386) |++||+++||+.++.+++++||||++++++ ..+++|+++.+|++..+..+. .|.+... ..+....+++++|| T Consensus 80 N~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~----~~~~~p~~~~~v~~~~~~~~~--~~~~~~~--~~~~~~~~~~~eii 151 (385) T protein:vir:10 80 SSLIGRFSFWQGALMQLCLSGNDYIPLVGQ----NLEHIPNSDVQINYLPGNMGI--VYTVLES--NDRPQMVLRQDQML 151 (385) T ss_pred CCCCCHHHHHHHHHHHhhhcCCeEEEEEcC----ceeEeecCCceEEEEEcCCce--EEEEEEc--CCceEEEEccccEE Confidence 999999999999999999999999999875 456788888887777665543 3444333 23456789999999 Q ss_pred eeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccC Q lcl|NC_011801. 159 HFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAG 238 (386) Q Consensus 159 h~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g 238 (386) |+|+.. .+..++++|+||+..+..++....++++++.++|+||++|++++++++...++++.+++++.|++.+++.|+| T Consensus 152 hik~~~-~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~~~n~~ 230 (385) T protein:vir:10 152 HFRLMP-DPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSG 230 (385) T ss_pred EeccCC-CCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccC Confidence 999754 3456788999999999999999999999999999999999999999877778999999999999999999999 Q ss_pred cceecCCCceeeeccCChhhHHHH-HHHHHHHHHHHHHhCCCHHHhcCCcC----cccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 239 RAVVLDQSADVETTNISPNVTEFL-QNVSFSQDQIAKAFGIPADYLSGKQD----AQSNITMIRAFYQSSLSIYIKPIES 313 (386) Q Consensus 239 ~~~vl~~g~~~~~~~~~~~d~~~~-e~~~~~~~~Ia~~~gvp~~~l~~~~~----~~~~~~~~~~~~~~~l~P~~~~ie~ 313 (386) +++++++|++|+++++++.|+|++ |.+++++++||++|||||.+|+.... +++. ++.+.++.+||.|+++.||+ T Consensus 231 ~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~-eq~~~~~~~~l~P~~~~ie~ 309 (385) T protein:vir:10 231 RLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNI-DQIKATYLANLNSYVNPIVD 309 (385) T ss_pred CccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccH-HHHHHHHHHHHHHHHHHHHH Confidence 999999999999999999999975 99999999999999999999986432 2333 45667777899999999999 Q ss_pred HHHHhhh-hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCccccccccCCCCCC Q lcl|NC_011801. 314 ELSQKLG-TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTNLLDNTKNIND 386 (386) Q Consensus 314 ~l~~~l~-~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~~~~~~~~~~~ 386 (386) +|+++|+ .+++||++.+++.|.+++++++++++++|+||+||+|+++|++|++| +++++...+....+++.+ T Consensus 310 ~l~~~l~~~~~~f~~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~p~-~~~~~~~~~~~~~~~g~~ 382 (385) T protein:vir:10 310 ELRLKMNAPDLELDIKDMLDVDDSALINQVSNLAKSGVLGAEQAQFILTRSGFLP-DNLPEFKPLTTQVKGGDE 382 (385) T ss_pred HHHHhhCCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCccCC-CCCccccCcccccCCCCC Confidence 9999986 67999999999999999999999999999999999999999998765 456665556665565555 No 40 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=100.00 E-value=9.5e-83 Score=470.25 Aligned_cols=379 Identities=14% Similarity=0.182 Sum_probs=305.7 Q ss_pred CchhhhhccccccCCccchhhh-hhcccccccCccc-ccHHHHhccHHHHHHHHHHHHhhccCceeec------------ Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWI-LNQGQPVSIKPKA-ITSAIALKNSDVYAVISRVSSDIAGCRFVTN------------ 66 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~-~~~~~~~~~~~~~-i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~------------ 66 (386) ||||++++.++.........+. .........+... .+...++++|+|++||++||++||++|++++ T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~lp~~~~~~~~dg~~~~~~ 80 (423) T protein:vir:81 1 MGFLQKLGLAPSVVATPEPIELVGPIFESLKLSTKNMTVEQIWEDQPHLRTVTTFIARNVASLQLQAFERVEDGGRERVR 80 (423) T ss_pred CchhHhhccccccccCccccccccccccccccccchhhHHHHHHhhhHHHHHHHHHHHhHhhCceEEEEEecCCceeeec Confidence 9999999766654433333322 1122222233333 3455567899999999999999999999864 Q ss_pred chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCC--CceEEEEEEcCcceEEeecCCC-ceeEEEEeccC Q lcl|NC_011801. 67 AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTN--GYPVRIEPVPNEKVTVALDDYG-KDLTYTVHFDD 143 (386) Q Consensus 67 ~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~--g~~~~l~~l~~~~v~~~~~~~~-~~~~~~~~~~~ 143 (386) +|+++++|. +||++||+++||+.++.+++++||||+++.|+.. +....|+|+++..+++....++ ..+.|.+.... T Consensus 81 ~~~~~~ll~-~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~~~~~~l~p~~~~~v~~~~~~~~~~~~~Y~~~~~~ 159 (423) T protein:vir:81 81 EGHLARVCK-LANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVDTPTLDIRPIPVSWVQRRAYKDGWGSLDYIIIESG 159 (423) T ss_pred cchHHHHhh-cCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcCcceEEEeecccceeeeeeccCCCcceEEEEEEec Confidence 356787775 7999999999999999999999999999998753 5667899999999988766543 45566665554 Q ss_pred cccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCC----CCCCHH Q lcl|NC_011801. 144 SKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPN----ATLGKE 219 (386) Q Consensus 144 ~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~----~~~~~~ 219 (386) ...+..+.+++++|||+|.. .+.+..+|+||+..+..++....++++++.++|+||++|+++|+++. ..++++ T Consensus 160 ~~~g~~~~~~~~evih~r~~---~~~~~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e 236 (423) T protein:vir:81 160 DNDGRSVKVPGERVIHRHGY---NPKTMKRGKSPVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAE 236 (423) T ss_pred CCCceEEEEcccceEEecCC---CCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHH Confidence 45566788999999999854 33455689999999999999999999999999999999999998753 247999 Q ss_pred HHHHHHHHHHHHhcc--cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCc--ccHHHH Q lcl|NC_011801. 220 AKENTRQSFEEQTTG--ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDA--QSNITM 295 (386) Q Consensus 220 ~~~~~k~~~~~~~~~--~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~--~~~~~~ 295 (386) +.+++++.|++.+.+ +|+|++++|++|++|++++++++|+||+|++++++++||++|||||.+||..+.+ ++.+++ T Consensus 237 ~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t~sn~e~~ 316 (423) T protein:vir:81 237 SRTRFMANLRASFSPKSSDVGGTLLLEDGMKAENFHTTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNANYSNVREF 316 (423) T ss_pred HHHHHHHHHHHHhccccccCCcceecCCCceEEeccCChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCCCcccHHHH Confidence 999999999998743 6789999999999999999999999999999999999999999999999876544 466889 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhh---------hhhhcchhhhccCHHHHHHHHHHHHh-CCCcCHHHHHHHhccCCc Q lcl|NC_011801. 296 IRAFYQSSLSIYIKPIESELSQKLGT---------DVKLDIASAIDSDNSELINNVQKLAS-AGVLAPIQAQKLLKNRGV 365 (386) Q Consensus 296 ~~~~~~~~l~P~~~~ie~~l~~~l~~---------~~~fd~~~~l~~d~~~~~~~~~~~~~-~g~~t~nE~R~~lg~~p~ 365 (386) .++|+++||+|++.+||++|+++|+. +++||++.+++.|.+++++++++++. .||||+||+|+++|++|+ T Consensus 317 ~~~f~~~~L~P~~~~ie~~l~~~L~~~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~~T~NE~R~~~gl~p~ 396 (423) T protein:vir:81 317 RKALYGDNLGSWIRIIQDVMNLFLLPRVGIDNEKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAWMTINEVRAMDNLPSI 396 (423) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhcCccccccCccEEEecchhhhccCHHHHHHHHHHHHhCCCCcCHHHHHHHhCCCCC Confidence 99999999999999999999999853 57899999999999999999999885 699999999999998775 Q ss_pred CCCCCCCccccccccCCCC-CC Q lcl|NC_011801. 366 FPELDLDEGTNLLDNTKNI-ND 386 (386) Q Consensus 366 ~p~~~~~~~~~~~~~~~~~-~~ 386 (386) + + +|+.-.++...... .| T Consensus 397 ~-g--GD~~~~p~n~~~~~~~~ 415 (423) T protein:vir:81 397 D-G--GDDLARPLNTEFGDSED 415 (423) T ss_pred C-C--cceeecccccccCccCC Confidence 3 3 23322222111111 11 No 41 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=100.00 E-value=2.3e-82 Score=468.14 Aligned_cols=374 Identities=17% Similarity=0.276 Sum_probs=305.8 Q ss_pred Cchh--hhhccccccCC-----ccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec------c Q lcl|NC_011801. 1 MAFL--SNLFKRQKMLS-----GSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN------A 67 (386) Q Consensus 1 Mg~~--~~l~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~------~ 67 (386) |+|| .+++++.+..- ............+...+...++...|+++|+|++||++||++||++|++++ + T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~ia~~iA~lp~~~~~~~~~~~ 80 (412) T protein:vir:26 1 MNVIAKENIVTRIKKKLIDNWIDQSTSKLYDFSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKVVN 80 (412) T ss_pred CccchhhhhhhhhhhhHhhhhhcccccccccccccCCccccccchhhhhccHHHHHHHHHHHHhHhhCceeEeecccccc Confidence 9999 45655432110 000011112223333455668899999999999999999999999999874 5 Q ss_pred hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccc Q lcl|NC_011801. 68 QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRS 147 (386) Q Consensus 68 ~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~ 147 (386) |+++++|+.+||++||+++||+.++.+++++||||++++|+..|++++||||+|+.|++..+.++..++|.+...+ + T Consensus 81 ~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~---g 157 (412) T protein:vir:26 81 TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHAAT---G 157 (412) T ss_pred chHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEcCC---c Confidence 8899999999999999999999999999999999999999999999999999999999999998888888776543 4 Q ss_pred eeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHH Q lcl|NC_011801. 148 GDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQS 227 (386) Q Consensus 148 ~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~ 227 (386) ..+.+++++|+|+++. ++.++++|+||+.++..++.+..+++++. ++.++..++++++. +..+++++.+++++. T Consensus 158 ~~~~~~~~evih~~~~---~~~~~~~G~s~i~~~~~~i~~~~a~~~~~--~~~~~~~~~~i~~~-~~~l~~e~~~~~~~~ 231 (412) T protein:vir:26 158 NKLIVHNMDMLHFKHI---VASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKY-GSNVGKEKRQQVLED 231 (412) T ss_pred eEEEEccccEEEeCCC---CCCCCcccccHHHHHHHHHHHHHHHHHHH--HHhcCCCCceEEec-CCCCCHHHHHHHHHH Confidence 5678999999999864 35678999999999999999999998884 44555555556555 457899999999999 Q ss_pred HHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCc--CcccHHHHHHHHHHHHHH Q lcl|NC_011801. 228 FEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQ--DAQSNITMIRAFYQSSLS 305 (386) Q Consensus 228 ~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~--~~~~~~~~~~~~~~~~l~ 305 (386) |++.+. ++|+++++++|++|++++++++|+||+|.+++++++||++|||||.+|+..+ ++++.+++.++|+++||+ T Consensus 232 ~~~~~~--~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~ 309 (412) T protein:vir:26 232 FKQYYE--ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLL 309 (412) T ss_pred HHHHhh--cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHH Confidence 998764 5678999999999999999999999999999999999999999999998754 345778999999999999 Q ss_pred HHHHHHHHHHHHhhh--------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCc-ccc Q lcl|NC_011801. 306 IYIKPIESELSQKLG--------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDE-GTN 376 (386) Q Consensus 306 P~~~~ie~~l~~~l~--------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~-~~~ 376 (386) |++++||++|+++|+ .+++||++++++.|.+++++++++++++|+||+||+|+++|++|+ |++|... ..+ T Consensus 310 P~~~~ie~~ln~kLl~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~-~ggD~~~~~~n 388 (412) T protein:vir:26 310 PIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPV-EGGDKPLISGD 388 (412) T ss_pred HHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCcCeeeeccc Confidence 999999999999984 458999999999999999999999999999999999999998776 3333211 111 Q ss_pred --ccc-------cCCCCCC Q lcl|NC_011801. 377 --LLD-------NTKNIND 386 (386) Q Consensus 377 --~~~-------~~~~~~~ 386 (386) ++. ..+++.+ T Consensus 389 ~~~~~~~~~~~~~~~gG~~ 407 (412) T protein:vir:26 389 LYPIDTPLELRKSLKGGDK 407 (412) T ss_pred ccccccchhhcccccCCCC Confidence 110 1111111 No 42 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=100.00 E-value=3.3e-82 Score=467.29 Aligned_cols=363 Identities=14% Similarity=0.162 Sum_probs=298.8 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeecc--------hhHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTNA--------QPITD 72 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~--------~~~~~ 72 (386) |.=+ |........+.......++.+.++++++||+||++||++||++|+++++ |++++ T Consensus 1 ~~~~--------------~~~~g~~~~~~~~~~~~~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~~~~~~~~~l~~ 66 (723) T protein:vir:94 1 MTTF--------------PSGAGGWNAWSADSVFGNGAKGWSNSAVAYRCISMLANNAASVDLVVRGPDGELDELHPLSQ 66 (723) T ss_pred Cccc--------------ccCCCccccccccccccccHHHHhhhHHHHHHHHHHHHhhccceeEEEcCCCccchhhHHHH Confidence 1111 1111111112222334467788999999999999999999999998763 68999 Q ss_pred HHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeec---CCCceEEEEEEcCcceEEeecCCCc------eeEEEEeccC Q lcl|NC_011801. 73 VLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRD---TNGYPVRIEPVPNEKVTVALDDYGK------DLTYTVHFDD 143 (386) Q Consensus 73 ~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~---~~g~~~~l~~l~~~~v~~~~~~~~~------~~~~~~~~~~ 143 (386) +|+.+||++||+++||+.++.+|+++||+|++++++ ..|.+.+||++++..+.+....+.. ...|.+... T Consensus 67 lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~~~~~~~~~~~y~~~~~- 145 (723) T protein:vir:94 67 LWNVMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAADAVPQAQIIGYVIERT- 145 (723) T ss_pred HHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCCccceeeeeeEEEEEec- Confidence 999999999999999999999999999999999864 4589999999999888876554433 223333332 Q ss_pred cccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHH Q lcl|NC_011801. 144 SKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKEN 223 (386) Q Consensus 144 ~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~ 223 (386) .|..+.+++++|||+|+. ++.++++|+||+..+..++....++++++.++|+||++|+++|+.+ .+++++.++ T Consensus 146 --~G~~~~~~~~dIiHir~~---~~~dg~~G~Spi~~a~~~i~~~~aa~~~~~~~f~NG~~p~giL~~~--~l~~e~~~~ 218 (723) T protein:vir:94 146 --DGVRVPVLADEMLWLRFS---DPYDPLAVMAPWKAARAAVDADFYAATWQRQSFKNGARPGGVVNLG--DMDEQTFTK 218 (723) T ss_pred --CceeEEecccceEEecCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcC--CCCHHHHHH Confidence 345678999999999864 3568899999999999999999999999999999999999999975 479999999 Q ss_pred HHHHHHHHhcc-cccCcceecC----------CCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccH Q lcl|NC_011801. 224 TRQSFEEQTTG-ENAGRAVVLD----------QSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSN 292 (386) Q Consensus 224 ~k~~~~~~~~~-~~~g~~~vl~----------~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~ 292 (386) +++.|++.++| .|+|++++|+ .|++|++++++++|+||+|++++++++||++|||||.+|+....++|. T Consensus 219 ~~~~~~~~~~G~~Nagk~~vL~g~~~~~~vl~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~i~~~st~sN~ 298 (723) T protein:vir:94 219 TVAAFRSQVEGVQNAGRHLLIAGQGSDGGAAGKGATFTSLSMSPAEMDYINSRMHSAEEVMLAFGIRKDALLGGSTYENQ 298 (723) T ss_pred HHHHHHHHhhchhhcCcceeecccccccccccCCceEEEccCCHHHHHHHHHHHHhHHHHHHHhCCChhHcCCCCCcccH Confidence 99999999888 7899999985 589999999999999999999999999999999999999877777788 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhh------hhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcC Q lcl|NC_011801. 293 ITMIRAFYQSSLSIYIKPIESELSQKLGT------DVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVF 366 (386) Q Consensus 293 ~~~~~~~~~~~l~P~~~~ie~~l~~~l~~------~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~ 366 (386) +++.+.|+++||+|+++.||++|+++|+. +++||...+++.|.+++++++.+++++||||+||+|+++|++|+ T Consensus 299 e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~g~~~~~~f~~~~lLr~D~~~r~~~~~~~v~~G~~T~NE~R~~lglpPi- 377 (723) T protein:vir:94 299 AEAKAAVWTETLIPQMEVMASITDLQLLPDIGWTVEWDFNSVPALQEDLEAQAGRNQGYLVNDVLMVDEVRATIGLDPL- 377 (723) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhHhhcccccCceEEeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC- Confidence 99999999999999999999999999863 35667778899999999999999999999999999999998776 Q ss_pred CCCCCCccccccccCCCCCC Q lcl|NC_011801. 367 PELDLDEGTNLLDNTKNIND 386 (386) Q Consensus 367 p~~~~~~~~~~~~~~~~~~~ 386 (386) |++++...-.++..+-...| T Consensus 378 ~gGd~~~~~~p~~~~~a~~~ 397 (723) T protein:vir:94 378 PGGIGQMTLTPYRAQFAPAP 397 (723) T ss_pred CCCcccceeccccccccCCC Confidence 55554433333322222222 No 43 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=100.00 E-value=5.4e-82 Score=466.13 Aligned_cols=367 Identities=25% Similarity=0.311 Sum_probs=300.0 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec--------chhHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN--------AQPITD 72 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~--------~~~~~~ 72 (386) ||||.... .......+.+....+.. ....++...||++++|++||++||++||++|++++ +|++++ T Consensus 1 m~~f~~~~---~~~~~~~~~~~~~~~~~---~~~~~~~~~Al~~~~V~~~i~~Ia~~iA~lp~~~~~~~g~~~~~~~~~~ 74 (406) T protein:vir:97 1 MSFFQPLG---TSKVSYDDYISSVLAGD---VSQKYLGVSALKNSDILTATSIIAGDIARFPLVKKDVNGDIIHDEDINY 74 (406) T ss_pred CccccccC---CCCCCcchHHHHHhcCC---CCcccccchhhccHHHHHHHHHHHHhhhhCeeEEEecCccccccchHHH Confidence 99997532 22222233333333222 22455666799999999999999999999999765 478999 Q ss_pred HHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecC-CCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEE Q lcl|NC_011801. 73 VLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDT-NGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFL 151 (386) Q Consensus 73 ~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~-~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~ 151 (386) +|+.+||++||+++||+.++.+++++||||+++.|+. .|.+.+|||++|+.|++..++++. +.|.+.. ...+..+. T Consensus 75 lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~L~~i~p~~v~v~~~~~~~-~~y~~~~--~~~~~~~~ 151 (406) T protein:vir:97 75 LLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQALQFQFYRPSETTVEETDNHE-IVYTFTD--MLTAKQVK 151 (406) T ss_pred HhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEEEEEECCCeeEEEEcCCce-EEEEEEe--cCCceEEE Confidence 9999999999999999999999999999999999984 789999999999999998776553 4454433 23455678 Q ss_pred EcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHH Q lcl|NC_011801. 152 YDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQ 231 (386) Q Consensus 152 ~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~ 231 (386) ++++||||+|+. +.++++|+||+.++..++..+.+++++..++|+||+.|++++.. +..+++++.+++++.|++. T Consensus 152 ~~~~evih~r~~----~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~~i~~~-~~~l~~e~~~~~~~~~~~~ 226 (406) T protein:vir:97 152 CFAHDVIHWKFF----SHDTILGRSPLLSLGDEIDLQTGGINTLIKFFKDGFSSGILTMK-GAQLSGDARQRARQEFEKM 226 (406) T ss_pred EccccEEEecCC----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEec-CCCCCHHHHHHHHHHHHHH Confidence 999999999853 46788999999999999999999999999999999998766554 5678999999999999999 Q ss_pred hcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 232 TTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYIKPI 311 (386) Q Consensus 232 ~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~~~i 311 (386) ++|.|+|+++|+++|++|++++++++|+||+|.+++++++||++|||||.+||....+++.+++.++|+++||+|++++| T Consensus 227 ~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~~~~e~~~~~f~~~~l~P~~~~i 306 (406) T protein:vir:97 227 REGSVGGSPLVFDSTMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKLGVNSPNQSVAQLMEDYVTNDLPFYFDAI 306 (406) T ss_pred hcccccCceeecCCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCCCCcchHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999877788889999999999999999999 Q ss_pred HHHHHHhhhh-------hhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCccc---c--ccc Q lcl|NC_011801. 312 ESELSQKLGT-------DVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGT---N--LLD 379 (386) Q Consensus 312 e~~l~~~l~~-------~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~---~--~~~ 379 (386) |++|+++|+. +++||++ .+.+++++.+.+++++|+||+||+|+++|++|++ ++++|..- + ++. T Consensus 307 e~~l~~kll~~~~~~~~~i~fd~~----~~~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~-~~~gD~~~~~~n~~~~~ 381 (406) T protein:vir:97 307 TSELGLKTLNDKDRRLYHIEFDTR----SVTGRNVDEIVKLVNNQILTPNQGLVELGKQKST-DPNMDRYQSSLNYVFLD 381 (406) T ss_pred HHHHhhhhcChhhccceeEEEecC----ccchhhHHHHHHHHhCCCcCHHHHHHHhCCCCCC-CCCCCeEeeccCccchh Confidence 9999999863 3677754 4566777888899999999999999999987763 33333211 1 110 Q ss_pred c----------C-CCCCC Q lcl|NC_011801. 380 N----------T-KNIND 386 (386) Q Consensus 380 ~----------~-~~~~~ 386 (386) . + +++.+ T Consensus 382 ~~~~~~~~~~~~~~gg~~ 399 (406) T protein:vir:97 382 KKEEYQDKVGIKGKGGEV 399 (406) T ss_pred cccccccccccccCCCCC Confidence 0 0 11111 No 44 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=100.00 E-value=2.2e-81 Score=462.73 Aligned_cols=378 Identities=26% Similarity=0.392 Sum_probs=326.6 Q ss_pred CchhhhhccccccCCccchhhhhhc---ccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeecchhHHHHHhcc Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQ---GQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTNAQPITDVLNAP 77 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~---~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~~~~~~~l~~~ 77 (386) ||||+++++++.......+.+.... ....+..+..|+.+.|+++|+|++||++||+++|++|++++++.... |+.+ T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~-l~~~ 79 (386) T protein:vir:49 1 MPIFNITNLATESPPINQESFFDIADSDFLASLNSSEWVSAENALKNSDLFSIISQLSNDLATAKITTSRKQLQG-IVDN 79 (386) T ss_pred CchhhhhccCCCCcccchhhhhhhhhccccccccCCceechhhhhccHHHHHHHHHHHHHhhhCceeeccchhhh-hhhc Confidence 9999998866555444433332222 22334456789999999999999999999999999999999887655 4577 Q ss_pred CcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEEcccce Q lcl|NC_011801. 78 LGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYDSSEV 157 (386) Q Consensus 78 PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 157 (386) ||++||+++||+.++.+++++||||++++|+..|++++||||+|+.|++..+.++..++|.+.......+..+.+++++| T Consensus 80 PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i~~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~~~~ev 159 (386) T protein:vir:49 80 PSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNITFDDPHIAPKQHVPQNDI 159 (386) T ss_pred cCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEecCceeEEEEcCCCceEEEEEEEcCccccceeEEccccE Confidence 99999999999999999999999999999999999999999999999999999988889888877666667788999999 Q ss_pred eeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhccccc Q lcl|NC_011801. 158 IHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENA 237 (386) Q Consensus 158 ih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~ 237 (386) ||+++.. +.++++|+||+.++..++....++.+++.++|+||+.|+++|++++ ..++++.+++++.|+.. ..++ T Consensus 160 ih~~~~~---~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~-~~~~~~~~~~~~~~~~~--~~n~ 233 (386) T protein:vir:49 160 LHFRLLS---VDGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKG-GGLLDFKTKVSRSRQAM--KQMQ 233 (386) T ss_pred EEecCCC---CCCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCC-CCChHHHHHHHHHHHHh--ccCC Confidence 9998643 3456899999999999999999999999999999999999999976 56788888888888753 3678 Q ss_pred CcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 238 GRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYIKPIESELSQ 317 (386) Q Consensus 238 g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~~~ie~~l~~ 317 (386) |+++++++|++|++++.+++|+||+|++++++++||++|||||.+|+..+.++.+.++.++|+..+|.|+++.|+++|++ T Consensus 234 g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~~~i~~~l~~i~~~~~~ 313 (386) T protein:vir:49 234 GGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDGDQQSSLEMIYNIYFKSVSRYLRPFVSEMSK 313 (386) T ss_pred CCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999876666666777899999999999999999999 Q ss_pred hhhhhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCccccccccC-CCCCC Q lcl|NC_011801. 318 KLGTDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTNLLDNT-KNIND 386 (386) Q Consensus 318 ~l~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~~~~~~-~~~~~ 386 (386) +|+.+++||++.+++.|.+++++.+++++++|++|+||+|++++..|+.| .+.+++.++..++ +++.+ T Consensus 314 ~l~~~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~~~~~-~~~~~~~~~~~~~~~gGd~ 382 (386) T protein:vir:49 314 KLSCEVDVDISPAVDPTGSNYISLINSMVKSGTLAQNQGLYILQQAEILP-KELPDGKNPNRTSLKGGEI 382 (386) T ss_pred HhcchhcccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHhhCCCCC-CcCcchhccCCCCCCCCCC Confidence 99999999999999999999999999999999999999999998888766 4455555544333 33333 No 45 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=100.00 E-value=8.6e-82 Score=465.01 Aligned_cols=358 Identities=33% Similarity=0.562 Sum_probs=303.4 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeecchhHHHHHhccCcc Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTNAQPITDVLNAPLGN 80 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~~~~~~~l~~~PN~ 80 (386) ||||..+ +++...... ..+..........++..|+.+.|+++++|++||++||++||++|++ .++++++|+.+||+ T Consensus 1 M~~~~~f-~~r~~~~~~-~~~~~~~~~~~~~~~~~v~~~~al~~~av~~cv~~ia~~ia~~p~~--~~~~~~~L~~~PN~ 76 (359) T protein:vir:10 1 MSILNPF-ERRSSITPN-NYYPFMVQNGSIVPNSLVDATEALKNSDLYAVTSLISSDIAGTRFI--GNQVFTSVLNNPSH 76 (359) T ss_pred Ccccchh-hccccCCCC-cchhhhhccccccCCcccCHHHhhcchHHHHHHHHHHHhhhcCccc--cchHHHHHhhcccc Confidence 9999754 444333222 2233333344566778899999999999999999999999999985 77888888899999 Q ss_pred cCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEEcccceeee Q lcl|NC_011801. 81 LMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYDSSEVIHF 160 (386) Q Consensus 81 ~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vih~ 160 (386) +||+++||+.++.+++++||||++++|+..|.+.+||||+|+.|++..+++. +.|.+... ..+....++++||+|+ T Consensus 77 ~~t~~~f~~~~~~~lll~Gnay~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~--~~y~~~~~--~~~~~~~~~~~evih~ 152 (359) T protein:vir:10 77 LTNAFSFWQTAILNLLLNGNVFLAILKGDNSLMKELRLIPSNAITIDLTDDT--LTYEVNQF--DDYPSAKYNASEMIHV 152 (359) T ss_pred cCCHHHHHHHHHHhccccCceEEEEEECCCCeEEEEEEeCCceEEEEEcCCe--EEEEEEec--CCceEEEEcccceEEe Confidence 9999999999999999999999999999999999999999999999877654 34444432 2345677999999999 Q ss_pred cccc-ccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCc Q lcl|NC_011801. 161 RCTV-SGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGR 239 (386) Q Consensus 161 ~~~~-~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~ 239 (386) ++.. ..++.++++|+||+.++..++....+++++..++|+||++|+++++++...+++++.++++++|++.+++.|+|+ T Consensus 153 ~~~~~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~~~~~~~~~n~g~ 232 (359) T protein:vir:10 153 KIMAYGVDTLHNLVGHSPLESLTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQGTLSSEAKDSIRKEFEKANGGNNSGR 232 (359) T ss_pred ccCCCCCCccCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCC Confidence 9754 234568899999999999999999999999999999999999999998767899999999999999998899999 Q ss_pred ceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011801. 240 AVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYIKPIESELSQKL 319 (386) Q Consensus 240 ~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l 319 (386) ++|+++|++|++++++++|+||+|.+++++++||++|||||.+||..++.+.+.++.++++..+|.|.+..++++|+.+| T Consensus 233 ~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~e~~~~~~l~~~l~p~~~~l~~~l 312 (359) T protein:vir:10 233 VMVLDQSADFSTVSINADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQQSSLDQIKDLYVNALNRFIEPLISELRIKC 312 (359) T ss_pred ceecCCCcceeeecCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 99999999999999999999999999999999999999999999876654444444555556666666666666777777 Q ss_pred hhhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcC Q lcl|NC_011801. 320 GTDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVF 366 (386) Q Consensus 320 ~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~ 366 (386) ...++++...+++.|...+...+.+++++|+||+||+|+++|++|+. T Consensus 313 ~~~~~~~~~~~~~~d~~~~~~~~~~~~~~G~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 313 DSSIGVDMSPITDYSNSVFKADILNWVKEGIIEPTEAKTLLESKGII 359 (359) T ss_pred hhhhcccchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 77778888888888888777888899999999999999999999987 No 46 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=100.00 E-value=1.9e-81 Score=463.07 Aligned_cols=378 Identities=26% Similarity=0.401 Sum_probs=327.4 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeecchhHHHHHhccCcc Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTNAQPITDVLNAPLGN 80 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~~~~~~~l~~~PN~ 80 (386) ||||+++++++..................+.++..++.+.|+++++|++||++||++||++|++++++... .|+.+||+ T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~~~-~L~~~PN~ 79 (382) T protein:vir:48 1 MPIFNLATESPPDNQGGFFDVVDSDFLASLKGNEWVSAETALRNSDLFSIINQLSNDLATVKLITSRKKLQ-GIVDNPSN 79 (382) T ss_pred CccccccccCCcccccccccchhhhccccccCCcccchHhhhccHHHHHHHHHHHHhhccCceeeecchhh-hhhhhcCC Confidence 99999998765443322211112222334456778999999999999999999999999999999987654 56688999 Q ss_pred cCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEEcccceeee Q lcl|NC_011801. 81 LMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYDSSEVIHF 160 (386) Q Consensus 81 ~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vih~ 160 (386) +||+++|++.++.+++++||||++++|+..|++++||||+|+.|++..+.++..++|.+...+...+..+.+++++|+|+ T Consensus 80 ~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evih~ 159 (382) T protein:vir:48 80 NANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDGIYYNITFDDPRIPPKQHVPQNDVLHF 159 (382) T ss_pred CCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCCeEEEEEEecCccccceeEEcCccEEEe Confidence 99999999999999999999999999999999999999999999999999998889988887777677888999999999 Q ss_pred ccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcc Q lcl|NC_011801. 161 RCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRA 240 (386) Q Consensus 161 ~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~ 240 (386) ++.. +.+.++|+||+.++..++....+++++..++|+||+.|++++++++ .+++++.+++++.|.+. ..++|++ T Consensus 160 ~~~~---~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-~~~~e~~~~~~~~~~~~--~~n~g~~ 233 (382) T protein:vir:48 160 RLLS---VDGGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKG-GGLLDFKTKLSRSRQAM--KQMQGGP 233 (382) T ss_pred cCCC---CCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC-CCChHHHHHHHHHHHhh--ccCCCCe Confidence 9643 3456899999999999999999999999999999999999999976 67888888888888764 3567899 Q ss_pred eecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_011801. 241 VVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYIKPIESELSQKLG 320 (386) Q Consensus 241 ~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~ 320 (386) +|+++|++|++++.++.|+||+|.+++++++||++|||||.+||..+++++++++.++|++.||+|+++.|+++|+++|+ T Consensus 234 ~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~~~~~~~~~~~l~p~~~~i~~~l~~~l~ 313 (382) T protein:vir:48 234 LVLDDLEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGDQQSSLEMSSDLYSKAVSRYLRPFLSELSQKLS 313 (382) T ss_pred eEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 99999999999999999999999999999999999999999999888888888999999999999999999999999999 Q ss_pred hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCccccccccCCCCCC Q lcl|NC_011801. 321 TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTNLLDNTKNIND 386 (386) Q Consensus 321 ~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~~~~~~~~~~~ 386 (386) ..++++....++.+.......+.+++++|++|+||+|++++..++.| .+.+++.++..+.+++.+ T Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~t~~e~r~~l~~~g~~~-~~~~~~~~~~~~~~GGd~ 378 (382) T protein:vir:48 314 CDVDADIFPAVDPTGSNYISRINSLVKTGTLAQNQGLYILQQAEILP-KELPNGENPNSTLKGGEE 378 (382) T ss_pred ChhhhhhhhhhccchhHHHHHHHHHhhcCccCHHHHHHHHhhCCCCC-cchhhhhcCCCCCCCCCC Confidence 99888887777888777777888999999999999999998777665 445566666555555555 No 47 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=100.00 E-value=1.2e-81 Score=464.13 Aligned_cols=372 Identities=16% Similarity=0.255 Sum_probs=301.5 Q ss_pred Cchhhhhcccc---ccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec------chhHH Q lcl|NC_011801. 1 MAFLSNLFKRQ---KMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN------AQPIT 71 (386) Q Consensus 1 Mg~~~~l~~~~---~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~------~~~~~ 71 (386) |++|+|+..+- ......... .....+...+...++.+.|+++++|++||++||++||++|++++ +|+++ T Consensus 4 ~~~~~~~k~~~~~~~~~~~~~~~--~~~~~~~~~~~~~v~~~~a~~~~~V~~ci~~ia~~ia~lp~~~~~~~~~~~~~l~ 81 (409) T protein:vir:96 4 ENIVTRIKKKLIDNWIDQSASKL--YDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKVVNTEVS 81 (409) T ss_pred ccchhhhhhHHhhhhhccccccc--cccccccCccccccchhhHhhhHHHHHHHHHHHHhhhhCceEEeecccccchhHH Confidence 88888764321 011111111 11112222333567888999999999999999999999999874 58899 Q ss_pred HHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEE Q lcl|NC_011801. 72 DVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFL 151 (386) Q Consensus 72 ~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~ 151 (386) ++|+.+||++||+++||+.++.+++++||||++++|+..|.+++||||+|+.|++..+.+...++|.+...+ +..+. T Consensus 82 ~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~---g~~~~ 158 (409) T protein:vir:96 82 DLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHAAT---GNKLI 158 (409) T ss_pred HHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEcCC---ceEEE Confidence 999999999999999999999999999999999999999999999999999999999988888887766433 45678 Q ss_pred EcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHH Q lcl|NC_011801. 152 YDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQ 231 (386) Q Consensus 152 ~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~ 231 (386) +++++|+|+++. ++.++++|+||+..+..+++...+++++. ++.++..++++++ .+..+++++++++++.|++. T Consensus 159 ~~~~evih~r~~---~~~~~~~G~s~l~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~-~~~~l~~e~~~~~~~~~~~~ 232 (409) T protein:vir:96 159 VHNMDMLHFKHI---VASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLK-YGSNVSTEKRQQVLEDFKQY 232 (409) T ss_pred EccccEEEeCCC---CCCCccccccHHHHHHHHHHHHHHHHHHH--HHhcCCCceeEEe-cCCCCCHHHHHHHHHHHHHH Confidence 999999999854 35678999999999999999999888774 3334444444444 45689999999999999987 Q ss_pred hcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC--cccHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 232 TTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD--AQSNITMIRAFYQSSLSIYIK 309 (386) Q Consensus 232 ~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~--~~~~~~~~~~~~~~~l~P~~~ 309 (386) +. ++|+++++++|++|+++++++.|+|++|.+++++++||++|||||.+||..++ +++.+++.+.|+++||.|+++ T Consensus 233 ~~--n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e~~~~~f~~~~l~P~~~ 310 (409) T protein:vir:96 233 YE--ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRFYLQHTLLPIVK 310 (409) T ss_pred hh--cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHH Confidence 74 56789999999999999999999999999999999999999999999987554 556789999999999999999 Q ss_pred HHHHHHHHhhh--------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCc-ccc--cc Q lcl|NC_011801. 310 PIESELSQKLG--------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDE-GTN--LL 378 (386) Q Consensus 310 ~ie~~l~~~l~--------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~-~~~--~~ 378 (386) +||++|+++|+ .+++||++.+++.|.+++++++++++++|+||+||+|+++|++|+ |++|... ..+ ++ T Consensus 311 ~ie~~l~~~Ll~~~~~~~g~~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~NE~R~~~g~~pi-~ggD~~~~~~n~~~~ 389 (409) T protein:vir:96 311 QYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPV-EGGDKPLISGDLYPI 389 (409) T ss_pred HHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC-CCcceeeeccccccc Confidence 99999999985 468999999999999999999999999999999999999998776 4443221 011 11 Q ss_pred -------ccCCCCCC Q lcl|NC_011801. 379 -------DNTKNIND 386 (386) Q Consensus 379 -------~~~~~~~~ 386 (386) ...+++.+ T Consensus 390 ~~~~~~~~~~~gG~~ 404 (409) T protein:vir:96 390 DTPLELRKSLKGGDK 404 (409) T ss_pred ccchhhcccccCCCC Confidence 01122211 No 48 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=100.00 E-value=2e-81 Score=462.99 Aligned_cols=377 Identities=25% Similarity=0.376 Sum_probs=312.0 Q ss_pred CchhhhhccccccCCccchh-------hhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeecchhHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPV-------WILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTNAQPITDV 73 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~~~~~~~ 73 (386) ||||+.+++.........+. ...........++..|+.+.|+++++|++||++||++||++|++++++....+ T Consensus 3 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~ia~lp~~~~~~~~~~l 82 (392) T protein:vir:74 3 LPILNFINQTNDPPEAGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKNQGI 82 (392) T ss_pred chhhhhhhcccCcccccccccccccCchhhhhhhccCCCCcccchhhhhcchHHHHHHHHHHHhhccCceeeccchhhhh Confidence 99998665443322111111 11122223334567899999999999999999999999999999998876665 Q ss_pred HhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEEc Q lcl|NC_011801. 74 LNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYD 153 (386) Q Consensus 74 l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (386) + .+||++||+++||+.++.+++++||||++++|+..|++++||||+|++|++..+.++..+.|.+.......+....++ T Consensus 83 ~-~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~~ 161 (392) T protein:vir:74 83 I-DNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGMYYNITFDDPKIEPILQAP 161 (392) T ss_pred h-hhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEecCCccceeEEEc Confidence 5 679999999999999999999999999999999999999999999999999999999888999888777667778899 Q ss_pred ccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCC-CHHHHHHHHHHHHHHh Q lcl|NC_011801. 154 SSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATL-GKEAKENTRQSFEEQT 232 (386) Q Consensus 154 ~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~-~~~~~~~~k~~~~~~~ 232 (386) +++|+|+++.. +.+.++|+||+.++..++....++++++.++|+||++|++++++++... ++++ ++.|.+.+ T Consensus 162 ~~evih~~~~~---~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~~~----~~~~~~~~ 234 (392) T protein:vir:74 162 QSDLIHMKLLS---IDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKD----KASRSRSF 234 (392) T ss_pred CccEEEecCCC---CCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHH----HHHHHHHH Confidence 99999998643 3345899999999999999999999999999999999999999986433 3333 34444444 Q ss_pred cc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 233 TG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYIKPI 311 (386) Q Consensus 233 ~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~~~i 311 (386) .+ .|+|+++|+++|++|++++++++|+||+|++++++++||++|||||.+||..+.+++++++.++|+++||.|+++.| T Consensus 235 ~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~e~~~~~~~~~l~p~~~~i 314 (392) T protein:vir:74 235 MKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQISGMYASALNRYLRPA 314 (392) T ss_pred hccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHH Confidence 44 68899999999999999999999999999999999999999999999999887777888889999999999999999 Q ss_pred HHHHHHhhhhhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCccccccccCCCCCC Q lcl|NC_011801. 312 ESELSQKLGTDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTNLLDNTKNIND 386 (386) Q Consensus 312 e~~l~~~l~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~~~~~~~~~~~ 386 (386) |++|+++|+.+++||+..+++.|.+++++.+++++++|++|+||+|+++...|..|.+. ....+.-....+..+ T Consensus 315 e~~l~~~l~~~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~pne~-r~~enl~~~~~Gd~~ 388 (392) T protein:vir:74 315 ISELEYKLSDHISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDL-PAPENTNKKTTGQSN 388 (392) T ss_pred HHHHHHhccchhcccchhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCCcccc-chhcCCCCCCCCCCC Confidence 99999999999999999999999999999999999999999999999985555544221 111111111111111 No 49 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=100.00 E-value=4.8e-81 Score=460.92 Aligned_cols=377 Identities=25% Similarity=0.380 Sum_probs=314.0 Q ss_pred CchhhhhccccccCCccchh-------hhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeecchhHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPV-------WILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTNAQPITDV 73 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~~~~~~~ 73 (386) ||||+++++..+......+. ...........++..|+.+.|+++++|++||++||++||++|++++++....+ T Consensus 3 m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~l 82 (392) T protein:vir:10 3 LPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKNQGI 82 (392) T ss_pred chhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeeccchhhhH Confidence 99999887654432222111 11222334445677899999999999999999999999999999998876655 Q ss_pred HhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEEc Q lcl|NC_011801. 74 LNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYD 153 (386) Q Consensus 74 l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (386) + .+||++||+++||+.++.+++++||||++++|+..|++++||||+|++|++..+.++..+.|.+...+...+....++ T Consensus 83 ~-~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~ 161 (392) T protein:vir:10 83 I-DNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGMYYNITFDDPKIEPILQAP 161 (392) T ss_pred h-hcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEecCcccceeEEEc Confidence 4 789999999999999999999999999999999999999999999999999999999999999888877777778899 Q ss_pred ccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCC-CHHHHHHHHHHHHHHh Q lcl|NC_011801. 154 SSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATL-GKEAKENTRQSFEEQT 232 (386) Q Consensus 154 ~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~-~~~~~~~~k~~~~~~~ 232 (386) ++||||+++.. +.+.++|+||+.++..++....++++++.++|+||++|+++|++++... ++++ ++.|.+.+ T Consensus 162 ~~eiih~~~~~---~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~----~~~~~~~~ 234 (392) T protein:vir:10 162 QSDLIHMKLLS---IDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKD----KASRSRSF 234 (392) T ss_pred cccEEEecCCC---CCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHH----HHHHHHHH Confidence 99999998643 3455899999999999999999999999999999999999999986433 3333 34444444 Q ss_pred cc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 233 TG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYIKPI 311 (386) Q Consensus 233 ~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~~~i 311 (386) .+ .++|+++++++|++|++++++++|+||++.+++++++||++|||||.+||..+.+++++++.++|+++||.|+++.| T Consensus 235 ~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~~~~~f~~~~l~P~~~~i 314 (392) T protein:vir:10 235 MKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQISGMYASALNRYLRPA 314 (392) T ss_pred hccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHH Confidence 44 67889999999999999999999999999999999999999999999999887777788889999999999999999 Q ss_pred HHHHHHhhhhhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCccccccccCCCCCC Q lcl|NC_011801. 312 ESELSQKLGTDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTNLLDNTKNIND 386 (386) Q Consensus 312 e~~l~~~l~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~~~~~~~~~~~ 386 (386) |++|+++|+.+++||...+++.|.+++++.+++++++|++|+||+|+++...+..|.+ ..+..+.-....+..+ T Consensus 315 e~~l~~~L~~~~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e-~r~~e~l~~~~~Gd~~ 388 (392) T protein:vir:10 315 ISELEYKLSDHISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKD-LPAPENTNKKTTGQSN 388 (392) T ss_pred HHHHHHhccccccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccc-cchhcCCCCCCCCCCC Confidence 9999999999999999999999999999999999999999999999998544444422 2211111111111111 No 50 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=100.00 E-value=4.8e-81 Score=460.92 Aligned_cols=377 Identities=25% Similarity=0.380 Sum_probs=314.0 Q ss_pred CchhhhhccccccCCccchh-------hhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeecchhHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPV-------WILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTNAQPITDV 73 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~~~~~~~ 73 (386) ||||+++++..+......+. ...........++..|+.+.|+++++|++||++||++||++|++++++....+ T Consensus 3 m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~l 82 (392) T protein:vir:39 3 LPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKNQGI 82 (392) T ss_pred chhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeeccchhhhH Confidence 99999887654432222111 11222334445677899999999999999999999999999999998876655 Q ss_pred HhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEEc Q lcl|NC_011801. 74 LNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYD 153 (386) Q Consensus 74 l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (386) + .+||++||+++||+.++.+++++||||++++|+..|++++||||+|++|++..+.++..+.|.+...+...+....++ T Consensus 83 ~-~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~ 161 (392) T protein:vir:39 83 I-DNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGMYYNITFDDPKIEPILQAP 161 (392) T ss_pred h-hcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEecCcccceeEEEc Confidence 4 789999999999999999999999999999999999999999999999999999999999999888877777778899 Q ss_pred ccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCC-CHHHHHHHHHHHHHHh Q lcl|NC_011801. 154 SSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATL-GKEAKENTRQSFEEQT 232 (386) Q Consensus 154 ~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~-~~~~~~~~k~~~~~~~ 232 (386) ++||||+++.. +.+.++|+||+.++..++....++++++.++|+||++|+++|++++... ++++ ++.|.+.+ T Consensus 162 ~~eiih~~~~~---~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~----~~~~~~~~ 234 (392) T protein:vir:39 162 QSDLIHMKLLS---IDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKD----KASRSRSF 234 (392) T ss_pred cccEEEecCCC---CCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHH----HHHHHHHH Confidence 99999998643 3455899999999999999999999999999999999999999986433 3333 34444444 Q ss_pred cc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 233 TG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYIKPI 311 (386) Q Consensus 233 ~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~~~i 311 (386) .+ .++|+++++++|++|++++++++|+||++.+++++++||++|||||.+||..+.+++++++.++|+++||.|+++.| T Consensus 235 ~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~~~~~f~~~~l~P~~~~i 314 (392) T protein:vir:39 235 MKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQISGMYASALNRYLRPA 314 (392) T ss_pred hccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHH Confidence 44 67889999999999999999999999999999999999999999999999887777788889999999999999999 Q ss_pred HHHHHHhhhhhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCccccccccCCCCCC Q lcl|NC_011801. 312 ESELSQKLGTDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTNLLDNTKNIND 386 (386) Q Consensus 312 e~~l~~~l~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~~~~~~~~~~~ 386 (386) |++|+++|+.+++||...+++.|.+++++.+++++++|++|+||+|+++...+..|.+ ..+..+.-....+..+ T Consensus 315 e~~l~~~L~~~~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e-~r~~e~l~~~~~Gd~~ 388 (392) T protein:vir:39 315 ISELEYKLSDHISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKD-LPAPENTNKKTTGQSN 388 (392) T ss_pred HHHHHHhccccccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccc-cchhcCCCCCCCCCCC Confidence 9999999999999999999999999999999999999999999999998544444422 2211111111111111 No 51 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=100.00 E-value=4.3e-81 Score=461.17 Aligned_cols=374 Identities=29% Similarity=0.465 Sum_probs=312.2 Q ss_pred Cchhhhh-ccccccCCccchhh-hhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeecchhHHHHHhccC Q lcl|NC_011801. 1 MAFLSNL-FKRQKMLSGSSPVW-ILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTNAQPITDVLNAPL 78 (386) Q Consensus 1 Mg~~~~l-~~~~~~~~~~~~~~-~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~~~~~~~l~~~P 78 (386) ||||++. |++.+......+.. ...........+..++.+.|+++++|++||++||+++|++|+++++|+...+|+ +| T Consensus 1 Mg~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ll~-~P 79 (383) T protein:vir:10 1 MGLLTPKNFSKRNAKNMVYPSNPAFFTTTVGGMQLSYVSALSALQNTNVYSVINRIASDVSSAHFKTENTATLNRLE-SP 79 (383) T ss_pred CCcccccccccccccccccccchhhhhhhccCccccccchhHhhcchHHHHHHHHHHHhhccCceeecccchhhhhh-CC Confidence 9999875 44433332222211 111222233456678999999999999999999999999999999999999885 69 Q ss_pred cccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEEccccee Q lcl|NC_011801. 79 GNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYDSSEVI 158 (386) Q Consensus 79 N~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vi 158 (386) |++||+++||+.++.+++++||||++++++ ..+++|+++.+|++..+.++ ..|.+.... .+..+.+++++|+ T Consensus 80 N~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~----~~~~~p~~~~~v~~~~~~~~--~~~~~~~~~--~~~~~~~~~~evi 151 (383) T protein:vir:10 80 SSLIGRFSFWQGALMQLCLSGNDYIPLVGQ----NLEHIPNSDVQINYLPGNMG--IVYTVLESN--DRPKMVLRQDQML 151 (383) T ss_pred CCCCCHHHHHHHHHHHhhhcCCeEEEEEcC----ceeEeecCcceEEEEEcCCc--eEEEEEEcC--CceEEEEcccceE Confidence 999999999999999999999999999875 45677777777777666554 334443332 3456789999999 Q ss_pred eeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccC Q lcl|NC_011801. 159 HFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAG 238 (386) Q Consensus 159 h~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g 238 (386) |+|+.+ .+..++.+|+||+.++..++....++++++.++|+||++|++++++++...++++.+++++.|++.+++.|+| T Consensus 152 h~r~~~-~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~~e~~~~~~~~~~~~~~~~n~~ 230 (383) T protein:vir:10 152 HFRLMP-DPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSG 230 (383) T ss_pred EeccCC-CCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccC Confidence 998654 3456778999999999999999999999999999999999999999887778999999999999999999999 Q ss_pred cceecCCCceeeeccCChhhHHH-HHHHHHHHHHHHHHhCCCHHHhcCCcC----cccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 239 RAVVLDQSADVETTNISPNVTEF-LQNVSFSQDQIAKAFGIPADYLSGKQD----AQSNITMIRAFYQSSLSIYIKPIES 313 (386) Q Consensus 239 ~~~vl~~g~~~~~~~~~~~d~~~-~e~~~~~~~~Ia~~~gvp~~~l~~~~~----~~~~~~~~~~~~~~~l~P~~~~ie~ 313 (386) +++++++|++|++++.++.|+|+ .|++++++++||++|||||.+||.... +++. ++...++..||.|+++.||+ T Consensus 231 ~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~-eq~~~~~~~~l~P~~~~ie~ 309 (383) T protein:vir:10 231 RLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNI-DQIKATYLANLNSYVNPIVD 309 (383) T ss_pred CccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccH-HHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999997 489999999999999999999986432 2333 44455667899999999999 Q ss_pred HHHHhhh-hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCccccccccCCCCCC Q lcl|NC_011801. 314 ELSQKLG-TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTNLLDNTKNIND 386 (386) Q Consensus 314 ~l~~~l~-~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~~~~~~~~~~~ 386 (386) +|+++|+ .+++||++.+++.|.+++++++++++++|+||+||+|+++|++|+ |+++.++...+..+.+++.| T Consensus 310 ~l~~~l~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~-~~~d~~~~~~~~~~~~gGd~ 382 (383) T protein:vir:10 310 ELRLKMNAPDLELDIKDMLDVDDSILINQVSNLAKSGVLGAEQAQFILTRSGF-LPDNLPEFKPLTNETKGGDD 382 (383) T ss_pred HHHHhhCCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCcc-cCCcccccCCCcccCCCCCC Confidence 9999985 689999999999999999999999999999999999999998775 56666777777777888888 No 52 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=100.00 E-value=9.9e-81 Score=459.20 Aligned_cols=373 Identities=16% Similarity=0.268 Sum_probs=299.2 Q ss_pred Cchhhhhcccc-cc-CCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec------chhHHH Q lcl|NC_011801. 1 MAFLSNLFKRQ-KM-LSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN------AQPITD 72 (386) Q Consensus 1 Mg~~~~l~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~------~~~~~~ 72 (386) =+++.|+.... +. ...+.. .......+...+...++.+.|+++++|++||++||++||++|++++ +|++++ T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~v~~~~~~~~~~V~~ci~~Ia~~ia~lp~~~~~~~~~~~~~~~~ 82 (409) T protein:vir:93 4 ENIVTRIKKKLIDNWIDQSTS-KLYDFSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKVVNTEVSD 82 (409) T ss_pred cchhhhhhhhhhhhhhccccc-cccccccccCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEeeccccccchHHH Confidence 23444431110 00 000000 0011112222344567888999999999999999999999999874 588999 Q ss_pred HHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEE Q lcl|NC_011801. 73 VLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLY 152 (386) Q Consensus 73 ~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (386) +|+.+||++||+++||+.++.+++++||||+++.|+..|++.+||||+|+.|++..+.++..++|.+...+ +..+.+ T Consensus 83 lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~~~~~~~~~~~~y~~~~~~---g~~~~~ 159 (409) T protein:vir:93 83 LLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHAAT---GNKLIV 159 (409) T ss_pred HHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEcCC---ceEEEE Confidence 99999999999999999999999999999999999999999999999999999999988888888776543 456789 Q ss_pred cccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_011801. 153 DSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQT 232 (386) Q Consensus 153 ~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~ 232 (386) ++++|+|+++. ++.++++|+||+.++..++....+++++. ++.++..++++++. +..+++++.+++++.|++.+ T Consensus 160 ~~~eVih~r~~---~~~~~~~G~s~i~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~-~~~l~~e~~~~~~~~~~~~~ 233 (409) T protein:vir:93 160 HNMDMLHFKHI---VASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKY-GSNVGKEKRQQVLEDFKQYY 233 (409) T ss_pred ccccEEEeCCC---CCCCccccccHHHHHHHHHHHHHHHHHHH--HHhcCCCCceEEec-CCCCCHHHHHHHHHHHHHHh Confidence 99999999854 35678999999999999999999998874 44555555555554 56789999999999999876 Q ss_pred cccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC--cccHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 233 TGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD--AQSNITMIRAFYQSSLSIYIKP 310 (386) Q Consensus 233 ~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~--~~~~~~~~~~~~~~~l~P~~~~ 310 (386) . ++|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||..++ +++.+++.++|+++||.|++++ T Consensus 234 ~--~~g~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ 311 (409) T protein:vir:93 234 E--ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQ 311 (409) T ss_pred h--cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHH Confidence 4 56789999999999999999999999999999999999999999999987554 4567899999999999999999 Q ss_pred HHHHHHHhhh--------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCc-ccc--cc- Q lcl|NC_011801. 311 IESELSQKLG--------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDE-GTN--LL- 378 (386) Q Consensus 311 ie~~l~~~l~--------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~-~~~--~~- 378 (386) ||++|+++|+ .+|+||++++++.|.+++++++++++++|+||+||+|+++|++|+ |++|... ..+ ++ T Consensus 312 ie~~l~~~Ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~-~ggD~~~~~~n~~~~~ 390 (409) T protein:vir:93 312 YEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPV-EGGDKPLISGDLYPID 390 (409) T ss_pred HHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCcCeeeecccccccc Confidence 9999999985 458999999999999999999999999999999999999998876 4433221 011 11 Q ss_pred -----cc-CCCC-CC Q lcl|NC_011801. 379 -----DN-TKNI-ND 386 (386) Q Consensus 379 -----~~-~~~~-~~ 386 (386) +. .+++ +| T Consensus 391 ~~~~~~~~~~gG~~n 405 (409) T protein:vir:93 391 TPLELRKSLKGGDKN 405 (409) T ss_pred cchhhcccccCCCCC Confidence 01 1111 11 No 53 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=100.00 E-value=4.8e-80 Score=455.44 Aligned_cols=371 Identities=16% Similarity=0.259 Sum_probs=299.8 Q ss_pred Cchhhhhcc----ccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec------chhH Q lcl|NC_011801. 1 MAFLSNLFK----RQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN------AQPI 70 (386) Q Consensus 1 Mg~~~~l~~----~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~------~~~~ 70 (386) =++++|+.. +....+.+ . ......+...+...++.+.|+++++|++||++||++||++|++++ +|++ T Consensus 4 ~~~~~~~k~~~~~~~~~~~~~-~--~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~Ia~~ia~lp~~~~~~~~~~~~~~ 80 (409) T protein:vir:94 4 ENIVTRIKKKLIDNWIDQSAS-K--LYDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKVVNTEV 80 (409) T ss_pred cccchhhhhHHhhhhhcCCcc-c--ccccccccCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEeecccccchhH Confidence 133433321 11111111 0 111111222234457888999999999999999999999999874 5889 Q ss_pred HHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeE Q lcl|NC_011801. 71 TDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDF 150 (386) Q Consensus 71 ~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 150 (386) +++|+.+||++||+++||+.++.+++++||||++++|+..|.+++||||+|+.|++..+.++..++|.+...+ +..+ T Consensus 81 ~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~---g~~~ 157 (409) T protein:vir:94 81 SDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHAAT---GNKL 157 (409) T ss_pred HHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEcCC---ceEE Confidence 9999999999999999999999999999999999999999999999999999999999988888888776543 4567 Q ss_pred EEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHH Q lcl|NC_011801. 151 LYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEE 230 (386) Q Consensus 151 ~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~ 230 (386) .+++++|+|+|+. ++.++++|+||+..+..+++...+++++. ++.++..++++++. +..+++++.+++++.|++ T Consensus 158 ~~~~~dvih~r~~---~~~~~~~G~s~l~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~-~~~l~~e~~~~~~~~~~~ 231 (409) T protein:vir:94 158 IVHNMDMLHFKHI---VASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKY-GSNVGKEKRQQVLEDFKQ 231 (409) T ss_pred EEccccEEEecCC---CCCCccccccHHHHHHHHHHHHHHHHHHH--HHhcCCCCeeEEec-CCCCCHHHHHHHHHHHHH Confidence 8999999999854 35678999999999999999999998874 44455555555555 557899999999999998 Q ss_pred HhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC--cccHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 231 QTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD--AQSNITMIRAFYQSSLSIYI 308 (386) Q Consensus 231 ~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~--~~~~~~~~~~~~~~~l~P~~ 308 (386) .++ ++|+++++++|++|++++++++|+||+|.+++++++||++|||||.+|+..++ +++.+++.++|+++||.|++ T Consensus 232 ~~~--~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~ 309 (409) T protein:vir:94 232 YYE--ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIV 309 (409) T ss_pred Hhh--cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHH Confidence 774 56789999999999999999999999999999999999999999999987553 45678999999999999999 Q ss_pred HHHHHHHHHhhh--------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCcc-cc--c Q lcl|NC_011801. 309 KPIESELSQKLG--------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEG-TN--L 377 (386) Q Consensus 309 ~~ie~~l~~~l~--------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~-~~--~ 377 (386) ++||++|+++|+ .+++||++++++.|.+++++++++++++|+||+||+|+++|++|+ |++|...- .+ + T Consensus 310 ~~ie~~ln~~Ll~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~-~ggD~~~~~~n~~~ 388 (409) T protein:vir:94 310 KQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPV-EGGDKPLISGDLYP 388 (409) T ss_pred HHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCcCeEeecccccc Confidence 999999999985 468999999999999999999999999999999999999998776 44433210 01 1 Q ss_pred c-------ccCCCCCC Q lcl|NC_011801. 378 L-------DNTKNIND 386 (386) Q Consensus 378 ~-------~~~~~~~~ 386 (386) + ...+++.+ T Consensus 389 ~~~~~~~~~~~kGG~~ 404 (409) T protein:vir:94 389 IDTPLELRKSLKGGDK 404 (409) T ss_pred cccchhhcccccCCCC Confidence 1 11222222 No 54 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=100.00 E-value=1.6e-79 Score=452.55 Aligned_cols=382 Identities=14% Similarity=0.114 Sum_probs=300.9 Q ss_pred CchhhhhccccccCCcc------------------------chh-hhhhccc---ccccCcccccHHHHhccHHHHHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGS------------------------SPV-WILNQGQ---PVSIKPKAITSAIALKNSDVYAVIS 52 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~------------------------~~~-~~~~~~~---~~~~~~~~i~~~~a~~~~~v~~~v~ 52 (386) ||||+|++++....+.. .|. .....+. +...++..|+.+.|+++++|++||+ T Consensus 1 M~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~g~~v~~~~a~~~~~v~~~i~ 80 (466) T protein:vir:81 1 MRLIDRLLSTRGAAPRMSIDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAGPSTELAPDTFVGLATQAYQANGPVFACML 80 (466) T ss_pred CchhHHHhhccCcccccchhhhhhhhhhhhccccccccccccHHHHHhhccccccccCccccccchhhhhccHHHHHHHH Confidence 99999998765432111 000 0111111 1222456689999999999999999 Q ss_pred HHHHhhccCceeecc----------hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCC--------CceE Q lcl|NC_011801. 53 RVSSDIAGCRFVTNA----------QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTN--------GYPV 114 (386) Q Consensus 53 ~ia~~ia~~p~~~~~----------~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~--------g~~~ 114 (386) +||++||++|+++++ ++.+..|+.+||++||+++||+.++.+++++||||++++|+.. |.++ T Consensus 81 ~Ia~~ia~lp~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~g~l~~~~~g~~~ 160 (466) T protein:vir:81 81 VRQLVFSSVRFRWQRLRDGKPSDTFGSRDLQILETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGEFVRMRPDWVDVVV 160 (466) T ss_pred HHHHhhccCceEEEEecCCceeeccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCccccccccCccee Confidence 999999999998763 2234445578999999999999999999999999999999764 5589 Q ss_pred EEEEEcCcceEEeecCCCce-eEEEEeccCc-ccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHH Q lcl|NC_011801. 115 RIEPVPNEKVTVALDDYGKD-LTYTVHFDDS-KRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSS 192 (386) Q Consensus 115 ~l~~l~~~~v~~~~~~~~~~-~~~~~~~~~~-~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~ 192 (386) +|+||+|+.|++..+.++.. ..|.++.... .......+++++|+|+|+. +++.++++|+||+..+.+++....+++ T Consensus 161 ~l~~l~~~~v~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~dviHir~~--~~~~d~~~G~s~i~~~~~~i~~~~a~~ 238 (466) T protein:vir:81 161 EERMVRGGRGELGGGQLGWRKVGYLYTEGGRQSGNESVGFLAEDVVHFAPI--PDPLASYRGMSWLTPILREIRADQAMS 238 (466) T ss_pred EEEEecCcceEEEEcCCCceEEEEEEEecCcccccceeeeccccEEEEcCC--CCcccccccccHHHHHHHHHHHHHHHH Confidence 99999999999998877654 4455544332 2234567999999999854 245688999999999999999999999 Q ss_pred HHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHH Q lcl|NC_011801. 193 KLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQ 271 (386) Q Consensus 193 ~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~ 271 (386) ++..++|+||++|+++++++. .+++++.+++++.|++.++| .|+|+++|+++|++|++++++++|+||+|++++++++ T Consensus 239 ~~~~~~f~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~ 317 (466) T protein:vir:81 239 KHQAKFFDNGATVNLVIKHNP-MADPAAVKKWADEVNSKHAGVDNAWKNLNLYPGADADVVGSNLQEIDFKNVRGGGETR 317 (466) T ss_pred HHHHHHHhcCCCcceEEecCC-CCCHHHHHHHHHHHHHHhcCccccccceEcCCCceEEEccCChhHHHHHHHHHHHHHH Confidence 999999999999999999865 68999999999999999887 6889999999999999999999999999999999999 Q ss_pred HHHHhCCCHHHhcCCc-----CcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-------hhhhcchhhhccCHHHHH Q lcl|NC_011801. 272 IAKAFGIPADYLSGKQ-----DAQSNITMIRAFYQSSLSIYIKPIESELSQKLGT-------DVKLDIASAIDSDNSELI 339 (386) Q Consensus 272 Ia~~~gvp~~~l~~~~-----~~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~~-------~~~fd~~~~l~~d~~~~~ 339 (386) ||++|||||.+||..+ .+++.+++.++|+++||.|++++||++|+++|+. +++||.+++++.|.++++ T Consensus 318 Ia~~fgVPp~~lG~~~~~~~st~sn~eq~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~llr~d~~~r~ 397 (466) T protein:vir:81 318 IAAAAGVPPVIVGLSEGLAAATYSNYGQARRRLADGTAHPLWQNLSGCIGHVMPDMGPDVRLWYDADDVPFLREDEKDAA 397 (466) T ss_pred HHHHhCCCHHHcccccCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCcceEEEecchhhhccCHHHHH Confidence 9999999999998643 3456789999999999999999999999999853 678999999999999988 Q ss_pred HH-------HHHHHhCCCcCHHHHHHHhccCC--cCCCCCCCc--c---ccc------cccCCCCCC Q lcl|NC_011801. 340 NN-------VQKLASAGVLAPIQAQKLLKNRG--VFPELDLDE--G---TNL------LDNTKNIND 386 (386) Q Consensus 340 ~~-------~~~~~~~g~~t~nE~R~~lg~~p--~~p~~~~~~--~---~~~------~~~~~~~~~ 386 (386) ++ +..++++|+ |+||+|+.++... +++..+..- . .++ -...++.++ T Consensus 398 ~~~~~~~~~~~~~~~~g~-t~nE~r~~~~~gd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Gg~~ 463 (466) T protein:vir:81 398 DIQKVRAETINTLITAGY-EPESVVAAVNSGDLRLLKHTGLTSVQLLPPGVSASASSDTPTSGGADD 463 (466) T ss_pred HHHHHHHHHHHHHHHcCC-ChhhccccccCCccccccCCCcchhhhcccccccccCCCCcccCCCCc Confidence 76 567888995 9999998764211 111111100 0 000 001111111 No 55 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=100.00 E-value=2.2e-79 Score=451.76 Aligned_cols=380 Identities=12% Similarity=0.143 Sum_probs=302.5 Q ss_pred CchhhhhccccccCC-ccchhhhhhcccc---cccCcccccHHHHhccHHHHHHHHHHHHhhccCceeecch-------- Q lcl|NC_011801. 1 MAFLSNLFKRQKMLS-GSSPVWILNQGQP---VSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTNAQ-------- 68 (386) Q Consensus 1 Mg~~~~l~~~~~~~~-~~~~~~~~~~~~~---~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~~-------- 68 (386) =.++.|++++..... ...+.|...++.. .+..+..++...|+++|+|++||++||+++|++|+++++. T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~a~~~~~v~~~v~~ia~~iA~lp~~v~~~~~~g~~~~ 81 (460) T protein:vir:10 2 ANRIIRALRELTGLDNKFNDAFIKYIGQTFTKYDNNGKTYLEQGYNINPDVYSCISQMAAKTVAVPYTIKVVKDTKAYQQ 81 (460) T ss_pred chhHHHHHhhhhccCCCchHHHHHhhccccCCCccchhhhhHHHHhcchHHHHHHHHHHHhhhhCceEEEeccCCccchh Confidence 356788876544333 2344454444432 2335566888899999999999999999999999987531 Q ss_pred ------------------------------hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecC----CCceE Q lcl|NC_011801. 69 ------------------------------PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDT----NGYPV 114 (386) Q Consensus 69 ------------------------------~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~----~g~~~ 114 (386) +...+|+.+||++||+++||+.++.+++++||||++++|+. .|.+. T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~~~~~G~~~ 161 (460) T protein:vir:10 82 LNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLMSPDDGINAGVPS 161 (460) T ss_pred hhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCccCceeE Confidence 23456788999999999999999999999999999999964 47899 Q ss_pred EEEEEcCcceEEeecCCCceeEEEEe---ccCcccceeEEEcccceeeeccccccCcc--cccccccHHHHHHHHHHHHH Q lcl|NC_011801. 115 RIEPVPNEKVTVALDDYGKDLTYTVH---FDDSKRSGDFLYDSSEVIHFRCTVSGESD--TQYMGIPPIDSLLNEIEVQD 189 (386) Q Consensus 115 ~l~~l~~~~v~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~vih~~~~~~~~~~--~~~~G~s~~~~~~~~i~~~~ 189 (386) +||||+|+.|++..+.++....|.+. +.....+....+++++|||+|+....... ++++|+||+..+..++.... T Consensus 162 ~L~~l~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~evih~r~~~~~~~~~~~~~~G~sp~~~~~~~i~~~~ 241 (460) T protein:vir:10 162 QMYVLPAHLIKIVLKDDINLLSTDSPIKSYMLIQGDQFIEFNEDEVIHTKYANPNFDLQGSHLYGMSPIRAILRNINSQN 241 (460) T ss_pred EEEEEcCceEEEEEcCCCceeeeeeeeeEEEEecCceeEEecccceEEEecCCCCcccccCccccccHHHHHHHHHHHHH Confidence 99999999999999988876655322 11123455678999999999987655443 45899999999999999999 Q ss_pred HHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHH Q lcl|NC_011801. 190 LSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFS 268 (386) Q Consensus 190 ~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~ 268 (386) ++++++.++|+||+.|+++++.+ ..+++++.+++++.|++.++| .|+|+++++++|++|+++++++.|+||+|.++++ T Consensus 242 ~~~~~~~~~f~ng~~~~~i~~~~-~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~ 320 (460) T protein:vir:10 242 STIDNNVKTMQNGGVFGFIHGGS-TGLTQPQADSLKQRLTEMDKSPDRLSQIAGASGEIAFTKISLNTDELKPFDYLKYD 320 (460) T ss_pred HHHHHHHHHHhcCCCcceeeecC-CCCCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHH Confidence 99999999999999998887764 578999999999999999988 6889999999999999999999999999999999 Q ss_pred HHHHHHHhCCCHHHhcCCc----CcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh--------hhhhcchhh--hccC Q lcl|NC_011801. 269 QDQIAKAFGIPADYLSGKQ----DAQSNITMIRAFYQSSLSIYIKPIESELSQKLGT--------DVKLDIASA--IDSD 334 (386) Q Consensus 269 ~~~Ia~~~gvp~~~l~~~~----~~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~~--------~~~fd~~~~--l~~d 334 (386) +++||++|||||.+||..+ ++++.+++.++|+++||.|++.+||++|+++|+. +++||++.+ ++.| T Consensus 321 ~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl~~~~~~~~~~~i~~d~~~l~~l~~d 400 (460) T protein:vir:10 321 QKAICNALGWSDKLLNNNEGGGLNTGNLEEERKRVVTDNIQPDLVILKQAFDKKFIKRFKGYENAVIEWDISELPEMQTD 400 (460) T ss_pred HHHHHHHhCCCHHHhCCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccCCceEEeecchhhhHHHH Confidence 9999999999999998754 3457789999999999999999999999999853 467887776 4555 Q ss_pred HHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCcccc-----cc-------ccCCCCCC Q lcl|NC_011801. 335 NSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTN-----LL-------DNTKNIND 386 (386) Q Consensus 335 ~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~-----~~-------~~~~~~~~ 386 (386) .++++ .++++|+||+||+|+++|++|++ ++++|+.-. ++ ...+.|-+ T Consensus 401 ~~~~~----~~~~~g~~T~NE~R~~~g~~pi~-~~~gD~~~~~~n~~~~~~~~~~~~~~~~nq~ 459 (460) T protein:vir:10 401 MVAMA----SWLNTIPVTPNEIRIAMKYETLN-QDGMDIVFMPSNKVRIDDVSNNLIDSAFNQN 459 (460) T ss_pred HHHHH----HHHhCCCCCHHHHHHHhCCCCCC-CCCCCeeeecccccchhhcccccCCCcccCC Confidence 55444 46789999999999999988763 222222110 11 11111111 No 56 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=100.00 E-value=1e-78 Score=448.09 Aligned_cols=366 Identities=14% Similarity=0.191 Sum_probs=295.9 Q ss_pred CchhhhhccccccCCccchhhhhh--cccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec------------ Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILN--QGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN------------ 66 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~--~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~------------ 66 (386) |||++++..+.+... ...... ...... .....+.+.++++++|++||++||++||++|++++ T Consensus 1 mg~~~~~~~~~~~~~---~~~~~~~~~~~~~~-~~~~~t~~~~~~~~~v~~cv~~Ia~~ia~~p~~v~~~~~~~~~~~~~ 76 (403) T protein:vir:10 1 MGFKSWITEKLNPGQ---RIIRDMEPVSHRTN-RKPFTTGQAYSKIEILNRTANMVIDSAAECSYTVGDKYNIVTYANGV 76 (403) T ss_pred Ccchhhhhhccchhh---hhhhcccccccccC-CcccccHHHHHHHHHHHHHHHHHHHHHhhCceeEeeccccccccccc Confidence 999998854332111 110000 011111 11123567788999999999999999999999764 Q ss_pred -chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcc Q lcl|NC_011801. 67 -AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSK 145 (386) Q Consensus 67 -~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~ 145 (386) .|+++++|+.+||++||+++||+.++.+++++||||+++.+ ..||+++++.|++..+.+.....|.+. T Consensus 77 ~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~~------~~l~~l~~~~~~v~~~~~~~~~~~~~~----- 145 (403) T protein:vir:10 77 KTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYWDG------TSLYHVPAALMQVEADANKFIKKFIFN----- 145 (403) T ss_pred ccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEeC------ceeEeecCcceEEEEcCCceEEEEEec----- Confidence 36789999999999999999999999999999999998753 358999999999988876655444321 Q ss_pred cceeEEEcccceeeecccc-ccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHH Q lcl|NC_011801. 146 RSGDFLYDSSEVIHFRCTV-SGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENT 224 (386) Q Consensus 146 ~~~~~~~~~~~vih~~~~~-~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~ 224 (386) ....+++++|+|++... ...+.++++|+||+.++..++....+++++..++|+||++|+++++.++ .+++++.+++ T Consensus 146 --~~~~~~~~eiih~~~~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-~l~~e~~~~~ 222 (403) T protein:vir:10 146 --NQINYRVDEIIFIKDNSYVCGTNSQISGQSRVATVIDSLEKRSKMLNFKEKFLDNGTVIGLILETDE-ILNKKLRERK 222 (403) T ss_pred --CceeecccceEEecccccccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-CCCHHHHHHH Confidence 12458889999998443 2345678999999999999999999999999999999999999999865 7899999999 Q ss_pred HHHHHHHhcc-cccCcceecCCCceeeeccC--ChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHH Q lcl|NC_011801. 225 RQSFEEQTTG-ENAGRAVVLDQSADVETTNI--SPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQ 301 (386) Q Consensus 225 k~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~--~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~ 301 (386) +++|++.++| .|+|+++++++|++|+++++ ++.|+||+|.+++++++||++|||||.+|+.. .+++.+++.++|++ T Consensus 223 ~~~~~~~~~g~~n~g~~~vl~~g~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~-~~sn~e~~~~~f~~ 301 (403) T protein:vir:10 223 QEELQLDYNPSTGQSSVLILDGGMKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDGG-NNANIRPNIELFYY 301 (403) T ss_pred HHHHHHHhCCcccCcceeecCCCceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCC-CCcCHHHHHHHHHH Confidence 9999999987 68999999999999999975 57899999999999999999999999999753 45678899999999 Q ss_pred HHHHHHHHHHHHHHHHhhhhhhhhcchhh--hccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCccccccc Q lcl|NC_011801. 302 SSLSIYIKPIESELSQKLGTDVKLDIASA--IDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTNLLD 379 (386) Q Consensus 302 ~~l~P~~~~ie~~l~~~l~~~~~fd~~~~--l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~~~~ 379 (386) +||.|++.+||++|+++|+.+++||++.+ ++.|.+++++++++++++|+||+||+|+++|++|++ ++++++.-.+.. T Consensus 302 ~tl~P~~~~ie~~l~~~L~~~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G~lT~NE~R~~~gl~pi~-~~~~d~~~~p~n 380 (403) T protein:vir:10 302 MTIIPMLNKLTSSLTFFFGYKITPNTKEVAALTPDKEAEAKHLTSLVNNGIITGNEARSELNLEPLD-DEQMNKIRIPAN 380 (403) T ss_pred HHHHHHHHHHHHHHHHhcCceeeeccchhhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC-cccccccccccc Confidence 99999999999999999999999998855 899999999999999999999999999999988753 333332211111 Q ss_pred c-----CCCCCC Q lcl|NC_011801. 380 N-----TKNIND 386 (386) Q Consensus 380 ~-----~~~~~~ 386 (386) . ...+.| T Consensus 381 ~~~~~~~~~~~e 392 (403) T protein:vir:10 381 VAGSATGVSGQE 392 (403) T ss_pred cccccccCCCCc Confidence 0 111111 No 57 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=100.00 E-value=2.2e-78 Score=446.32 Aligned_cols=369 Identities=17% Similarity=0.171 Sum_probs=302.6 Q ss_pred CchhhhhccccccCCc-cchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec----------chh Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSG-SSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN----------AQP 69 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~----------~~~ 69 (386) ||||+++++.+..... ..+............+...++...++++++|++||++||+++|++|++++ +|+ T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~ 80 (406) T protein:vir:95 1 MGLFDRWRRTKRKSKIRADTGYVGLFMSGEDVSFLVPGYVRLSDNPEVRMAVHKIADLISSMTIYLMQNTEDGDIRIRNE 80 (406) T ss_pred CcchhhhccccccccccccchhhhhhccCcccCccccCHHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeecch Confidence 9999987654433222 22222233333445556667888899999999999999999999999874 467 Q ss_pred HHHHHhccCcccCCHHHHHHHHHHHHHHhCCe--EEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccc Q lcl|NC_011801. 70 ITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNA--FAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRS 147 (386) Q Consensus 70 ~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a--~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~ 147 (386) +.++|+.+||++||+++||+.++.+++++|++ |+++.++..|.+++||||+|++|++..+.++.. +.++ T Consensus 81 ~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~g~~~~l~~i~~~~v~~~~~~~~~~----~~~~----- 151 (406) T protein:vir:95 81 LSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTADGLIDELVPLTPSKVNFLDTPDGYQ----VLYG----- 151 (406) T ss_pred HHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCCCcEEEEEEEcCceeEEEEcCCeEE----EEec----- Confidence 99999999999999999999999999999764 667789999999999999999999998887533 2221 Q ss_pred eeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHH Q lcl|NC_011801. 148 GDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQS 227 (386) Q Consensus 148 ~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~ 227 (386) ...+++++|+|+++. +++.++++|+||+..+..++....++++++.++|+||++|++++++++ .+++++.++++++ T Consensus 152 -~~~~~~~evih~~~~--~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~-~l~~e~~~~~~~~ 227 (406) T protein:vir:95 152 -GQTFNYDEVLHFIYN--PDPERPYIGRGYRVVLKDIADNLKQATATKKSFMSGKYMPSLIVKVDA-ATAELSSEEGRNA 227 (406) T ss_pred -cEEEchhHEEEeecc--CCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-CCCHHHHHHHHHH Confidence 135899999999964 345678899999999999999999999999999999999999999875 6899999999999 Q ss_pred HHHHhcc-cccCcceecC-CCceeeecc-CChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHH Q lcl|NC_011801. 228 FEEQTTG-ENAGRAVVLD-QSADVETTN-ISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSL 304 (386) Q Consensus 228 ~~~~~~~-~~~g~~~vl~-~g~~~~~~~-~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l 304 (386) |.+.+.| .|+|++++++ +|.+++++. ++++|+||+|.+++++++||++|||||.+||.. ++.+++..+|+++|| T Consensus 228 ~~~~~~g~~n~~~~~v~~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~---~~~~~~~~~~~~~~l 304 (406) T protein:vir:95 228 VFKKYLQATEAGQPWIIPAELLEVEQVKPLSLKDIAINEAVELDKRTVAGMFGVPAFLLGIG---EFNRDEYNNFINSTI 304 (406) T ss_pred HHHHhccccccCCceeecCCCccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCC---CchHHHHHHHHHHHH Confidence 9999887 6888888875 456777764 689999999999999999999999999999743 456788899999999 Q ss_pred HHHHHHHHHHHHHhhh----hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCc-ccc--c Q lcl|NC_011801. 305 SIYIKPIESELSQKLG----TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDE-GTN--L 377 (386) Q Consensus 305 ~P~~~~ie~~l~~~l~----~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~-~~~--~ 377 (386) +|++++||++|+++|+ .+++||++.+++.|.+++++.+.+++++||||+||+|+++|++|+ |++|... ..+ + T Consensus 305 ~P~~~~ie~~l~~~l~~~~~~~~~fd~~~l~~~d~~~~~~~~~~l~~~G~~t~NE~R~~~gl~p~-~~gd~~~~~~n~~~ 383 (406) T protein:vir:95 305 LPIAKGIEQELTRKLLISPDLYFKFNPRSLYAYDLKELAEVGSNMYVRGIMEGNEVRDWLGLSPK-EGLSELVILENYIP 383 (406) T ss_pred HHHHHHHHHHHHHhcCCCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCcceeeeccCccc Confidence 9999999999999985 468999999999999999999999999999999999999998775 3333221 001 0 Q ss_pred c------ccCCCCCC Q lcl|NC_011801. 378 L------DNTKNIND 386 (386) Q Consensus 378 ~------~~~~~~~~ 386 (386) + ...+++++ T Consensus 384 ~~~~~~~~~~k~g~~ 398 (406) T protein:vir:95 384 LDKIGDQSKLKGGDN 398 (406) T ss_pred hhhcccccccCCCCC Confidence 0 11122222 No 58 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=100.00 E-value=1.1e-77 Score=442.61 Aligned_cols=366 Identities=19% Similarity=0.228 Sum_probs=293.5 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHH-HhccHHHHHHHHHHHHhhccCceeec----------chh Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAI-ALKNSDVYAVISRVSSDIAGCRFVTN----------AQP 69 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-a~~~~~v~~~v~~ia~~ia~~p~~~~----------~~~ 69 (386) ||||+ +|+++.......+........... ...++... +..+|+|++||++||++||++|++++ +|+ T Consensus 1 Mg~~~-~f~~k~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~V~~~I~~ia~~iA~~p~~~~~~~~~g~~~~~~~ 77 (403) T protein:vir:80 1 MGLFN-FFRRKTRSEPTNAISWFLTQEAYD--TLAIPGYTRLSDNPEVRMAVHKIAELISSMTIHLMQNTDNGDIRIKNE 77 (403) T ss_pred Ccccc-cccccccccccchhhhhccccccc--ccccchhhhhhhhHHHHHHHHHHHHhhhhCceEEEEecCCceeecCCh Confidence 99996 677766554443332222222211 12222222 34689999999999999999999875 478 Q ss_pred HHHHHhccCcccCCHHHHHHHHHHHHHHh--CCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccc Q lcl|NC_011801. 70 ITDVLNAPLGNLMSGFSVWQAMIVQMMLT--GNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRS 147 (386) Q Consensus 70 ~~~~l~~~PN~~~s~~~f~~~~~~~~~l~--G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~ 147 (386) ++++|+.+||++||+++||+.+++++++. ||||+++.++..|++.+||||+|+.|++..+.++..++|. T Consensus 78 ~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~g~~~~L~~l~p~~v~~~~~~~g~~~~y~--------- 148 (403) T protein:vir:80 78 LSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTSGLIDELIPLAPSKVSFVDTDTGYQIWYQ--------- 148 (403) T ss_pred HHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCCCcEEEEEEEcCCeeEEEEcCCceEEEEe--------- Confidence 99999999999999999999999999985 8899999999999999999999999999988887655442 Q ss_pred eeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHH Q lcl|NC_011801. 148 GDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQS 227 (386) Q Consensus 148 ~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~ 227 (386) ...++++||+|++.. +.+.++++|+||+..+..++....++++++.++|+||++|++++++++ .+++++.++++++ T Consensus 149 -~~~~~~~eiih~~~~--~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-~~~~~~~~~~~~~ 224 (403) T protein:vir:80 149 -GKAYNYDEVLHFIVN--PDPEKPYMGRGYRVVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDA-ATAELSSEEGRNA 224 (403) T ss_pred -ecccchhhEEEEecc--CCCcCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-CCChHHHHHHHHH Confidence 134889999999853 356678899999999999999999999999999999999999999876 5788888999999 Q ss_pred HHHHhcc-cccCcceecCCC-ceeeecc-CChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHH Q lcl|NC_011801. 228 FEEQTTG-ENAGRAVVLDQS-ADVETTN-ISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSL 304 (386) Q Consensus 228 ~~~~~~~-~~~g~~~vl~~g-~~~~~~~-~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l 304 (386) |.+.+.+ .++|++++++.+ .++.++. ++++|+|++|.+++++++||++|||||.+||.. +..+++..+|+++|| T Consensus 225 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~l~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~---~~~~~~~~~f~~~~l 301 (403) T protein:vir:80 225 VFKKYLEASEAGQPWIIPAELLDVEQVKPLSLKDLAIHETVELDKRTVAGIFGVPAFLLGVG---KYDKDEYNNFINSTI 301 (403) T ss_pred HHHHHhhhhhcCCeeeecccccccceeccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHcCCC---CccHHHHHHHHHHHH Confidence 9877765 688888888655 4555554 588999999999999999999999999999743 344556778999999 Q ss_pred HHHHHHHHHHHHHhhh----hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCC---Cccccc Q lcl|NC_011801. 305 SIYIKPIESELSQKLG----TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDL---DEGTNL 377 (386) Q Consensus 305 ~P~~~~ie~~l~~~l~----~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~---~~~~~~ 377 (386) +|++++||++|+++|+ .+++||++.+++.|.+++++++.+++++||||+||+|+++|++|+ |++|. ...-.+ T Consensus 302 ~P~~~~ie~~l~~kll~~~~~~~~f~~~~ll~~d~~~~~~~~~~~~~~Gi~t~NE~R~~~gl~p~-~ggd~~~~~~n~~p 380 (403) T protein:vir:80 302 LPIAKGIEQELTRKLLISPDLYFKFNPRSLYAYDLKELAEVGSNMYVRGLMEGNEVRDWLGLSPK-EGLSELVILENYIP 380 (403) T ss_pred HHHHHHHHHHHHHhccCCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCCCeEeecccccc Confidence 9999999999999986 458999999999999999999999999999999999999998775 33331 111111 Q ss_pred c------ccCCCCCC Q lcl|NC_011801. 378 L------DNTKNIND 386 (386) Q Consensus 378 ~------~~~~~~~~ 386 (386) + +..|++++ T Consensus 381 l~~~~~~~~~k~ge~ 395 (403) T protein:vir:80 381 LDKIGDQNKLKGGEK 395 (403) T ss_pred hhhccchhhccCCCC Confidence 1 11222222 No 59 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=100.00 E-value=2.1e-77 Score=440.98 Aligned_cols=366 Identities=22% Similarity=0.310 Sum_probs=305.5 Q ss_pred CchhhhhccccccCCccchhhhhhcc---cccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeecchhHHHHHhcc Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQG---QPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTNAQPITDVLNAP 77 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~---~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~~~~~~~l~~~ 77 (386) ||||+++++.+.........+..... ...+.++..++.+.|+++++|++||++||+++|++|++++++....+ +.+ T Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~V~~~i~~Ia~~ia~l~~~~~~~~~~~l-~~~ 79 (384) T protein:vir:49 1 MPIFNITNLATESPPSNQDSFFDITDPEFLDALNGSEWVSAETALKNSDLFSIISQLSNDLATAKITTSRKQLQGI-VDN 79 (384) T ss_pred CccccccccCcccccccchhhccccchhhcccccCCceechhhhhccHHHHHHHHHHHHHHhhCceeeecchhhhh-hhc Confidence 99999876554433322222221111 12234567799999999999999999999999999999998776654 577 Q ss_pred CcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEEcccce Q lcl|NC_011801. 78 LGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYDSSEV 157 (386) Q Consensus 78 PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 157 (386) ||++||+++|++.++.+++++||||++++++..|++++||||+|++|++..+.++..++|.+...+...+..+.+++++| T Consensus 80 PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~~~~eV 159 (384) T protein:vir:49 80 PSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNQNGLYYNITFDDPRIPPKQHVPQGDI 159 (384) T ss_pred cCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEecCccccceeEecCccE Confidence 99999999999999999999999999999999999999999999999999988888889998888777777889999999 Q ss_pred eeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhccccc Q lcl|NC_011801. 158 IHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENA 237 (386) Q Consensus 158 ih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~ 237 (386) ||+++.. +.++++|+||+.++..++....++++++.++|+||++|++++++++... +++.+ ++..+...++.|+ T Consensus 160 ih~~~~~---~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~-~~~~~--~~~~~~~~~~~n~ 233 (384) T protein:vir:49 160 LHFRLLS---VDGGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIKGGGL-LDFKT--KQSRSRQAMKQMQ 233 (384) T ss_pred EEecCCC---CCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCC-hHHHH--HHHHHHHhcccCC Confidence 9999653 3456899999999999999999999999999999999999999987544 44333 2333334455788 Q ss_pred CcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCc----ccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 238 GRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDA----QSNITMIRAFYQSSLSIYIKPIES 313 (386) Q Consensus 238 g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~----~~~~~~~~~~~~~~l~P~~~~ie~ 313 (386) |+++++++|++|+++++++.|+|++|.+++++++||++|||||.+||..... ++.+++...|++.+|.|+++.|++ T Consensus 234 ~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~~~~i~~~l~pi~~~i~~ 313 (384) T protein:vir:49 234 GGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDKQSSLEMIYNIYFKAVSRFLRPFVSELSK 313 (384) T ss_pred ccceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999999999999875432 234667788999999999999999 Q ss_pred HHHHhhh-----------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCcccccc Q lcl|NC_011801. 314 ELSQKLG-----------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTNLL 378 (386) Q Consensus 314 ~l~~~l~-----------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~~~ 378 (386) +|+++++ .+++|+++.+++.|..++.++++++.+.|+++ ||+|+++++.|++ ++|.++ .+ T Consensus 314 ~l~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~-ne~r~~~~~~p~~-gGd~~~---~~ 384 (384) T protein:vir:49 314 KLSCEVDADILPAVDPTGSNYIGLINSMVKTGTLAQNQGLYVLQQAEILP-KDLPEGETDSTLK-GGETNE---QY 384 (384) T ss_pred HhchhhhhhhhhhhhccchHHHHHHHHHhhcCcccHHHHHHHHhhCCCCC-hhHHHHcCCCCCC-CCCCCC---CC Confidence 9998873 46789999999999999999999999999987 9999999987754 333222 23 No 60 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=100.00 E-value=3.8e-77 Score=439.58 Aligned_cols=382 Identities=14% Similarity=0.143 Sum_probs=293.5 Q ss_pred Cchhh--hh---------ccccccC--------CccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccC Q lcl|NC_011801. 1 MAFLS--NL---------FKRQKML--------SGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGC 61 (386) Q Consensus 1 Mg~~~--~l---------~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~ 61 (386) .=+|+ +. |+++... .+...........+..+.+.+++.+.++++++|++||++||+++|++ T Consensus 62 ~~~~~~~~~~kk~~i~~pfkkk~~~~~~d~f~~s~es~s~vtsls~pdaf~~vnVs~~~AlknsaV~scI~~IA~sIAsL 141 (945) T protein:vir:10 62 IIIFRKNQVLKKEKIIVPYNHQEPPFKFNLFEYSPESLMYLPSISDPDAFFLINLFRKYRFNNDSKLIKVSEIPKKLTSK 141 (945) T ss_pred eeeehhhhHHHhhcccccccccccchhhhhhhccCccceecccccCccceeeehhhhhhhhccHHHHHHHHHHHhhhccC Confidence 11221 11 1111110 00000001111122334455688889999999999999999999999 Q ss_pred ceeec-----------------chhHHHHHhccCcccCCHHH----HHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEc Q lcl|NC_011801. 62 RFVTN-----------------AQPITDVLNAPLGNLMSGFS----VWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVP 120 (386) Q Consensus 62 p~~~~-----------------~~~~~~~l~~~PN~~~s~~~----f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~ 120 (386) |++++ .|+++++|+ +||++||+++ |++.++.+++++||+|++++|+..|.+++|||++ T Consensus 142 PlklYrr~edG~~~~~~kk~~~~hpL~~LL~-rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiRd~~G~ii~L~pLd 220 (945) T protein:vir:10 142 ELEIYKHIEDKHVNYYLKRIRDARNILEFLE-RPDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIRDEQGNLVAITPVD 220 (945) T ss_pred ceEEEEecccCcccccccccccchHHHHHHh-CCCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEC Confidence 99874 256778885 7999999998 5667889999999999999999999999999999 Q ss_pred CcceEEeecCCCceeEEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011801. 121 NEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLR 200 (386) Q Consensus 121 ~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ 200 (386) |++|++..+.++....++....+ ......++++++||+.+....+.....+|+||+.++.+++....++++++.++|. T Consensus 221 Ps~Vti~~ddDG~~~y~Yv~~id--G~~~~~v~a~DvIlhirn~s~DG~~~GyGlSPIeaa~~aI~~alAaek~aar~Fs 298 (945) T protein:vir:10 221 GTTIKPILSEDTGIVVGYVQEVD--GAIVAHFDKRDVVLFRQNLTPDVYMYGYSLPPIEILYKVILSDIFIDKGNLDYYR 298 (945) T ss_pred CcceEEEEcCCCcEEEEEEEecC--CceEEEecCCceEEEeccCCCCcccccCCchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999998887766544333322 2233467888877654433333333447999999999999999999999999995 Q ss_pred -ccCCCceEEeeCC---------CCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHH Q lcl|NC_011801. 201 -HAIKPSIFIKVPN---------ATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQD 270 (386) Q Consensus 201 -ng~~~~~~l~~~~---------~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~ 270 (386) ||++|+|+|++++ +.+++++.+++++.|++.++|.++|+++++++|++|+++++++.|+||+|+++++++ T Consensus 299 kNGa~PsGILsvkg~~~~d~k~~~~LseEq~erlKe~wee~~sG~NnG~piVLdeGmef~pLs~s~~DaQfLEsrkfs~e 378 (945) T protein:vir:10 299 KGGSIPEGILAIEPPSYKEGDIYPQLSREQLESIQRQLQAIMMGDYTQVPILSGGKFTWIDFKGKRRDMQFKELAEFVAR 378 (945) T ss_pred hCCCccceEEEecCccccccccccccCHHHHHHHHHHHHHHhCCcccccceecCCCceEEEccCChhHHHHHHHHHHHHH Confidence 7889999998752 457999999999999999999888889999999999999999999999999999999 Q ss_pred HHHHHhCCCHHHhcCCc--CcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-----hhhhhcchhhhccCHHHHHHHHH Q lcl|NC_011801. 271 QIAKAFGIPADYLSGKQ--DAQSNITMIRAFYQSSLSIYIKPIESELSQKLG-----TDVKLDIASAIDSDNSELINNVQ 343 (386) Q Consensus 271 ~Ia~~~gvp~~~l~~~~--~~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~-----~~~~fd~~~~l~~d~~~~~~~~~ 343 (386) +||++|||||.+||..+ ++++.+++..+|+++||+|++.+||++||++|. .+++|+++.....|.++++++++ T Consensus 379 eIArAFGVPP~lLG~~e~st~SNiEqq~~~Fv~~tL~Pil~~IEqeLNrkLl~~~eg~~i~fdFd~ldl~D~ksraEal~ 458 (945) T protein:vir:10 379 KICAVYQVSPQDVGILEGSNKATAEVMASLTKAKGLEPLMATISKGFDEVVSEFRNEKDIKLWFKEDDLEKERDWWNIIQ 458 (945) T ss_pred HHHHHhCCCHHHcccCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccCceeEEEecchhccCHHHHHHHHH Confidence 99999999999998654 356778899999999999999999999999984 34567777777789999999999 Q ss_pred HHHhCCCcCHHHHHHHhccCCcCCCCCCCccc-c---ccccCCCC-------------CC Q lcl|NC_011801. 344 KLASAGVLAPIQAQKLLKNRGVFPELDLDEGT-N---LLDNTKNI-------------ND 386 (386) Q Consensus 344 ~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~-~---~~~~~~~~-------------~~ 386 (386) +++++|+||+||+|+++|++|+ |++|...-. + +.....+. .| T Consensus 459 kli~sGiLTiNEvRe~lGLpPI-eGGD~lli~~nn~~P~d~~~ka~~ga~p~q~aq~~~d 517 (945) T protein:vir:10 459 GQLNTGFRSINEARMEKGLEPV-PWGDVPFSGLRNWKPEDEQAKAQQGAMPPQLAQAMAD 517 (945) T ss_pred HHHhCCCcCHHHHHHHhCCCCC-CCcceeeeccccccccccccccccCCCCcccccCCCC Confidence 9999999999999999998776 344332100 0 00000000 00 No 61 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=100.00 E-value=5e-77 Score=438.90 Aligned_cols=363 Identities=17% Similarity=0.247 Sum_probs=289.6 Q ss_pred Cchhhhhcccccc--C---CccchhhhhhcccccccCcccccH-HHHhccHHHHHHHHHHHHhhccCceeec-------- Q lcl|NC_011801. 1 MAFLSNLFKRQKM--L---SGSSPVWILNQGQPVSIKPKAITS-AIALKNSDVYAVISRVSSDIAGCRFVTN-------- 66 (386) Q Consensus 1 Mg~~~~l~~~~~~--~---~~~~~~~~~~~~~~~~~~~~~i~~-~~a~~~~~v~~~v~~ia~~ia~~p~~~~-------- 66 (386) |+||++-...... . ....+..... ..........+. ..++++++|++||++||+++|++|++++ T Consensus 13 m~~F~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~v~~cI~~ia~~ia~~~~~~~~~~~~~~~ 90 (413) T protein:vir:96 13 LKFFNNKRSPTEESKAKDEIPKAPQVVMT--LPNFFKELISDGYTKLSDSPEVRMAVDCIADLVSNMTIQLMQNGETGDK 90 (413) T ss_pred CCccccCCCcchhhhhhcccccccccccc--chhhHhhhccchhHHHhhchHHHHHHHHHHHhhccCceEEEEecCCCcc Confidence 7777542110000 0 0000000000 001111111222 2367899999999999999999999874 Q ss_pred --chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCC-ceEEEEEEcCcceEEeecCCCceeEEEEeccC Q lcl|NC_011801. 67 --AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNG-YPVRIEPVPNEKVTVALDDYGKDLTYTVHFDD 143 (386) Q Consensus 67 --~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g-~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~ 143 (386) +|+++++|+.+||++||+++||+.++.+++++||||++++++..| .+++|||++|+.|++..+.+. +.|.+...+ T Consensus 91 ~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~~~~~L~~l~~~~v~~~~~~~~--~~y~~~~~~ 168 (413) T protein:vir:96 91 RIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGDKIIGLTPISPYKVTFNVSDDD--LDYSITFDN 168 (413) T ss_pred ccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCceEEEEEecCceeEEEEcCCe--EEEEEeecC Confidence 478999999999999999999999999999999999999999887 578999999999999887654 445554432 Q ss_pred cccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHH Q lcl|NC_011801. 144 SKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKEN 223 (386) Q Consensus 144 ~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~ 223 (386) ..++++||+|+++. +++.++++|+||+.++..++....+++++..++|+||++|+++|++++ .+++++.++ T Consensus 169 ------~~~~~~evih~k~~--~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~-~l~~e~~~~ 239 (413) T protein:vir:96 169 ------KEYDPSTLLHFVLN--PSIERPFIGTGYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVDS-DSDELSDEE 239 (413) T ss_pred ------cEEchhhEEEEecc--CCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC-CCCHHHHHH Confidence 35789999999864 345578899999999999999999999999999999999999999875 689999999 Q ss_pred HHHHHHHHhcc-cccCcceecCCCc-eeeec-cCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHH Q lcl|NC_011801. 224 TRQSFEEQTTG-ENAGRAVVLDQSA-DVETT-NISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFY 300 (386) Q Consensus 224 ~k~~~~~~~~~-~~~g~~~vl~~g~-~~~~~-~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~ 300 (386) ++++|++.++| .++|+++++++|. ++.++ .++++|+||+|.+++++++||++|||||.+||.. ++++++..+|+ T Consensus 240 ~~~~~~~~~~g~~n~g~~~vl~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~---~~~~~~~~~~~ 316 (413) T protein:vir:96 240 GRENFEEMYLKRKEAGKPWIIPEGMVNVQQIKPLTLNDLAINDAVTLDKKTVAGIFGVPAFLLGVG---TYNKDEFNNFI 316 (413) T ss_pred HHHHHHHHhcCccccCceeeecCCcccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCC---cchHHHHHHHH Confidence 99999999888 6889999986665 55555 4689999999999999999999999999999743 35678889999 Q ss_pred HHHHHHHHHHHHHHHHHhh---hhhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCccccc Q lcl|NC_011801. 301 QSSLSIYIKPIESELSQKL---GTDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTNL 377 (386) Q Consensus 301 ~~~l~P~~~~ie~~l~~~l---~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~~ 377 (386) ++||+|+++.||++|+++| +.+++||++.+++.|.+++++++++++++|+||+||+|+++|++|+ |++|. T Consensus 317 ~~~l~P~~~~ie~~ln~~ll~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~-~~gd~------ 389 (413) T protein:vir:96 317 NTKIMSIAQVIQQTYNKLIVEEDMYFSLNPRSLYNYSLTEMVSAGAQMTQLNALRRNEFRNWVGMPPD-AEMDD------ 389 (413) T ss_pred HHHHHHHHHHHHHHHHHhhCCCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCcce------ Confidence 9999999999999999999 5688999999999999999999999999999999999999998875 33322 Q ss_pred cccCCCC--------------CC Q lcl|NC_011801. 378 LDNTKNI--------------ND 386 (386) Q Consensus 378 ~~~~~~~--------------~~ 386 (386) +..++|. .| T Consensus 390 ~~~~~n~~~~~~~~~~~~~~~~d 412 (413) T protein:vir:96 390 LLVLENYLQQKDLVNQKKLIQDE 412 (413) T ss_pred eeecccccchhhcccccCCCCCC Confidence 2222221 11 No 62 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=100.00 E-value=2.9e-75 Score=429.24 Aligned_cols=360 Identities=13% Similarity=0.114 Sum_probs=283.2 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeecc--------hhHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTNA--------QPITD 72 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~--------~~~~~ 72 (386) ||||++++++...............+...+.++..++.+.|+++++|++||++||++||++|+++++ |+++. T Consensus 1 MGl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vt~~~al~~~~v~~~i~~Ia~~iA~lp~~v~~~~g~~~~~~~~~~ 80 (394) T protein:vir:62 1 MGLRDRFSNYLFKKAEKRGYLDNVLGKSIRYSGVYVTDSNILQSSDVYELLQDISNQMVLADIVVEDEFGNEIKDDIALQ 80 (394) T ss_pred CchhhhhhhhccCCCCchhhhhhhhhcccccCccccChhhhhccHHHHHHHHHHHHhhcccceEEEcCCCcccchhhHHH Confidence 9999998755433333333344455566677788899999999999999999999999999998864 44554 Q ss_pred HHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEE Q lcl|NC_011801. 73 VLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLY 152 (386) Q Consensus 73 ~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (386) + +.+||++||+++||+.++.+++++||||+++.++..+. ++.+.+..++.+. +.+.. ....+ T Consensus 81 L-l~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~~~~--------~~~~~~~~~~~~~---~~~~~------~~~~~ 142 (394) T protein:vir:62 81 I-LRNPNNYLTQSEFIKLMTNTYLLEGETFPILNGAQIHL--------ASNVFTELDDNLV---EHFNI------GGHEI 142 (394) T ss_pred H-hccCCCCCCHHHHHHHHHHHHHhcCCeEEEEecceeec--------cccceEEECCceE---EEEee------CCEEe Confidence 4 56899999999999999999999999999997654332 2345555555432 22222 23569 Q ss_pred cccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCC-CCHHHHHHHHHHHHHH Q lcl|NC_011801. 153 DSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNAT-LGKEAKENTRQSFEEQ 231 (386) Q Consensus 153 ~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~-~~~~~~~~~k~~~~~~ 231 (386) ++++|+|+|+. +.++++|+||+..+..++....++++++.++|+||++|++++++++.. .++++.+++++.|++. T Consensus 143 ~~~eiih~r~~----~~d~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~ 218 (394) T protein:vir:62 143 PPCMIRHVKNI----GADHLRGKGILDLGRDTLEGVMSAEKTLTDKYKKGGLLTFLLNLDAHINPQNGAQSKLINAILDQ 218 (394) T ss_pred chhheEEecCc----CCCCccccChHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCCCCcCHHHHHHHHHHHHHH Confidence 99999999854 246789999999999999999999999999999999999999997642 3566789999999999 Q ss_pred hcc-cccCcceecCCCc--eeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 232 TTG-ENAGRAVVLDQSA--DVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYI 308 (386) Q Consensus 232 ~~~-~~~g~~~vl~~g~--~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~ 308 (386) ++| .++|+++|++.|. ++++++.++.|+|++|.+++++++||++|||||.+|+.. .+++.+++.++|+++||+|++ T Consensus 219 ~~g~~n~g~~~vl~~g~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~-~~sn~e~~~~~~~~~~l~P~~ 297 (394) T protein:vir:62 219 LESIDEARSVKMIPLGKGYSIDTLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTEL-IKEDIEKAMMYIHNKAVRPIM 297 (394) T ss_pred hccccccCceeEeeCCCceeEEecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCC-CCcCHHHHHHHHHHHHHHHHH Confidence 988 6889999997776 556788899999999999999999999999999999854 356788999999999999999 Q ss_pred HHHHHHHHHhhhh-------hhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCcc--cc--- Q lcl|NC_011801. 309 KPIESELSQKLGT-------DVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEG--TN--- 376 (386) Q Consensus 309 ~~ie~~l~~~l~~-------~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~--~~--- 376 (386) ++||++|+++|+. +++||...+++ .+++++++.+++++|+||+||+|+++|++|++ +++++.. .. T Consensus 298 ~~ie~~l~~kll~~~~~~~~~~~fd~~~~~~--~~~~~~~~~~~~~~g~~T~NE~R~~~gl~p~~-~~~gd~~~~~~n~~ 374 (394) T protein:vir:62 298 KNFEDHLSLLFYAQNSGKRIKFKINILDFVT--YSNKTNIGYNLVRTAITSPDNVADMLGFPKQN-TKESQAIYISNDVT 374 (394) T ss_pred HHHHHHHhhhhcCccccCceEEEechhhhcC--HHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC-CCCCCeeecccccc Confidence 9999999999853 35666666654 45788999999999999999999999988763 2222221 00 Q ss_pred cc-------ccCCCCCC Q lcl|NC_011801. 377 LL-------DNTKNIND 386 (386) Q Consensus 377 ~~-------~~~~~~~~ 386 (386) ++ .+.+++++ T Consensus 375 ~~~~~~~~~~~~kgge~ 391 (394) T protein:vir:62 375 EIGKKEATDGSLGGGEE 391 (394) T ss_pred cccccccccccCCCCCC Confidence 11 11122221 No 63 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=100.00 E-value=1.9e-73 Score=419.32 Aligned_cols=354 Identities=15% Similarity=0.116 Sum_probs=280.0 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec------chhHHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN------AQPITDVL 74 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~------~~~~~~~l 74 (386) ||||+++|++++...... ....+..++...++++++|++||++||+++|++|++++ +|+++++| T Consensus 1 Mg~f~~lf~~~~~~~~~~----------~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~ll 70 (395) T protein:vir:10 1 MSILEKIFKTRKDITYML----------DLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDVYYKL 70 (395) T ss_pred CchhhhhhccCccccccc----------cchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchHHHHH Confidence 999999998865432111 12234667888899999999999999999999999765 57899999 Q ss_pred hccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEEcc Q lcl|NC_011801. 75 NAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYDS 154 (386) Q Consensus 75 ~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (386) +.+||++||+++||+.++.++++.|++|+++.++. .++++++..+++....+.... .+... ..+....+++ T Consensus 71 ~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~--~~~~~--~~~~~~~~~~ 141 (395) T protein:vir:10 71 NIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK-----ELLIADSFYREEYALYDDIFK--DVTVK--DYTYQRTFTM 141 (395) T ss_pred HhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC-----CeEecCCccceeEeecCccee--EEEEc--Cceeeeeecc Confidence 99999999999999999999999999998775543 256666666665444333222 22222 2234567999 Q ss_pred cceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcc Q lcl|NC_011801. 155 SEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTG 234 (386) Q Consensus 155 ~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~ 234 (386) ++|||+++.. +.+..+|.||+.++..+++... +.|++|+.++++|++++..+++++.+++++.|++.+++ T Consensus 142 ~evih~~~~~---~~~~~~G~spi~~~~~~~~~~~-------~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~ 211 (395) T protein:vir:10 142 QEVIYLKYNN---NKVTHFVESLFEDYGKIFGRMI-------GAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNT 211 (395) T ss_pred ccEEEEccCC---CCcccccchHHHHHHHHHHHHH-------HHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhcc Confidence 9999999753 3456789999999988876544 34677888899999988788999999999999999988 Q ss_pred cccCc--ceecCCCceeeeccCChhhH-----HHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHH Q lcl|NC_011801. 235 ENAGR--AVVLDQSADVETTNISPNVT-----EFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIY 307 (386) Q Consensus 235 ~~~g~--~~vl~~g~~~~~~~~~~~d~-----~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~ 307 (386) .++++ ++++++|++|+++++++.++ ||+|++++++++||++|||||.+|+ +++++.+++.++|+++||.|+ T Consensus 212 ~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~--~~~sn~e~~~~~~~~~~l~P~ 289 (395) T protein:vir:10 212 FNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIY--GETADLEKNTLVFEKFCLTPL 289 (395) T ss_pred ccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhc--CcccCHHHHHHHHHHHHHHHH Confidence 66555 45579999999999988765 8999999999999999999999997 456778999999999999999 Q ss_pred HHHHHHHHHHhhhh------hhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCcccc----- Q lcl|NC_011801. 308 IKPIESELSQKLGT------DVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTN----- 376 (386) Q Consensus 308 ~~~ie~~l~~~l~~------~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~----- 376 (386) +.+||++|+++|+. +++||++.+++.|.+++++++++++++||||+||+|+++|++|++ ++.+++.-- T Consensus 290 ~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~-~g~~d~~~~~~n~~ 368 (395) T protein:vir:10 290 LKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSD-NPELDEYLITKNYE 368 (395) T ss_pred HHHHHHHHHHhhcChhhhcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC-CCCCceeeeccccc Confidence 99999999999853 368999999999999999999999999999999999999987763 333332110 Q ss_pred ccc------------cCCCCCC Q lcl|NC_011801. 377 LLD------------NTKNIND 386 (386) Q Consensus 377 ~~~------------~~~~~~~ 386 (386) ++. ..+++.+ T Consensus 369 ~~~~~~~~~~~~~~~~~kgg~~ 390 (395) T protein:vir:10 369 KANSGENDEKEKDENTLKGGDE 390 (395) T ss_pred cccccccccCcccccccCCCCC Confidence 000 0011111 No 64 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=100.00 E-value=1.9e-73 Score=419.32 Aligned_cols=354 Identities=15% Similarity=0.116 Sum_probs=280.0 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec------chhHHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN------AQPITDVL 74 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~------~~~~~~~l 74 (386) ||||+++|++++...... ....+..++...++++++|++||++||+++|++|++++ +|+++++| T Consensus 1 Mg~f~~lf~~~~~~~~~~----------~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~ll 70 (395) T protein:vir:10 1 MSILEKIFKTRKDITYML----------DLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDVYYKL 70 (395) T ss_pred CchhhhhhccCccccccc----------cchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchHHHHH Confidence 999999998865432111 12234667888899999999999999999999999765 57899999 Q ss_pred hccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEEcc Q lcl|NC_011801. 75 NAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYDS 154 (386) Q Consensus 75 ~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (386) +.+||++||+++||+.++.++++.|++|+++.++. .++++++..+++....+.... .+... ..+....+++ T Consensus 71 ~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~--~~~~~--~~~~~~~~~~ 141 (395) T protein:vir:10 71 NIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK-----ELLIADSFYREEYALYDDIFK--DVTVK--DYTYQRTFTM 141 (395) T ss_pred HhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC-----CeEecCCccceeEeecCccee--EEEEc--Cceeeeeecc Confidence 99999999999999999999999999998775543 256666666665444333222 22222 2234567999 Q ss_pred cceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcc Q lcl|NC_011801. 155 SEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTG 234 (386) Q Consensus 155 ~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~ 234 (386) ++|||+++.. +.+..+|.||+.++..+++... +.|++|+.++++|++++..+++++.+++++.|++.+++ T Consensus 142 ~evih~~~~~---~~~~~~G~spi~~~~~~~~~~~-------~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~ 211 (395) T protein:vir:10 142 QEVIYLKYNN---NKVTHFVESLFEDYGKIFGRMI-------GAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNT 211 (395) T ss_pred ccEEEEccCC---CCcccccchHHHHHHHHHHHHH-------HHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhcc Confidence 9999999753 3456789999999988876544 34677888899999988788999999999999999988 Q ss_pred cccCc--ceecCCCceeeeccCChhhH-----HHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHH Q lcl|NC_011801. 235 ENAGR--AVVLDQSADVETTNISPNVT-----EFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIY 307 (386) Q Consensus 235 ~~~g~--~~vl~~g~~~~~~~~~~~d~-----~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~ 307 (386) .++++ ++++++|++|+++++++.++ ||+|++++++++||++|||||.+|+ +++++.+++.++|+++||.|+ T Consensus 212 ~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~--~~~sn~e~~~~~~~~~~l~P~ 289 (395) T protein:vir:10 212 FNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIY--GETADLEKNTLVFEKFCLTPL 289 (395) T ss_pred ccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhc--CcccCHHHHHHHHHHHHHHHH Confidence 66555 45579999999999988765 8999999999999999999999997 456778999999999999999 Q ss_pred HHHHHHHHHHhhhh------hhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCcccc----- Q lcl|NC_011801. 308 IKPIESELSQKLGT------DVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTN----- 376 (386) Q Consensus 308 ~~~ie~~l~~~l~~------~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~----- 376 (386) +.+||++|+++|+. +++||++.+++.|.+++++++++++++||||+||+|+++|++|++ ++.+++.-- T Consensus 290 ~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~-~g~~d~~~~~~n~~ 368 (395) T protein:vir:10 290 LKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSD-NPELDEYLITKNYE 368 (395) T ss_pred HHHHHHHHHHhhcChhhhcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC-CCCCceeeeccccc Confidence 99999999999853 368999999999999999999999999999999999999987763 333332110 Q ss_pred ccc------------cCCCCCC Q lcl|NC_011801. 377 LLD------------NTKNIND 386 (386) Q Consensus 377 ~~~------------~~~~~~~ 386 (386) ++. ..+++.+ T Consensus 369 ~~~~~~~~~~~~~~~~~kgg~~ 390 (395) T protein:vir:10 369 KANSGENDEKEKDENTLKGGDE 390 (395) T ss_pred cccccccccCcccccccCCCCC Confidence 000 0011111 No 65 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=100.00 E-value=1.9e-73 Score=419.32 Aligned_cols=354 Identities=15% Similarity=0.116 Sum_probs=280.0 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec------chhHHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN------AQPITDVL 74 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~------~~~~~~~l 74 (386) ||||+++|++++...... ....+..++...++++++|++||++||+++|++|++++ +|+++++| T Consensus 1 Mg~f~~lf~~~~~~~~~~----------~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~ll 70 (395) T protein:vir:95 1 MSILEKIFKTRKDITYML----------DLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDVYYKL 70 (395) T ss_pred CchhhhhhccCccccccc----------cchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchHHHHH Confidence 999999998865432111 12234667888899999999999999999999999765 57899999 Q ss_pred hccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEEcc Q lcl|NC_011801. 75 NAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYDS 154 (386) Q Consensus 75 ~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (386) +.+||++||+++||+.++.++++.|++|+++.++. .++++++..+++....+.... .+... ..+....+++ T Consensus 71 ~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~--~~~~~--~~~~~~~~~~ 141 (395) T protein:vir:95 71 NIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK-----ELLIADSFYREEYALYDDIFK--DVTVK--DYTYQRTFTM 141 (395) T ss_pred HhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC-----CeEecCCccceeEeecCccee--EEEEc--Cceeeeeecc Confidence 99999999999999999999999999998775543 256666666665444333222 22222 2234567999 Q ss_pred cceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcc Q lcl|NC_011801. 155 SEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTG 234 (386) Q Consensus 155 ~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~ 234 (386) ++|||+++.. +.+..+|.||+.++..+++... +.|++|+.++++|++++..+++++.+++++.|++.+++ T Consensus 142 ~evih~~~~~---~~~~~~G~spi~~~~~~~~~~~-------~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~ 211 (395) T protein:vir:95 142 QEVIYLKYNN---NKVTHFVESLFEDYGKIFGRMI-------GAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNT 211 (395) T ss_pred ccEEEEccCC---CCcccccchHHHHHHHHHHHHH-------HHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhcc Confidence 9999999753 3456789999999988876544 34677888899999988788999999999999999988 Q ss_pred cccCc--ceecCCCceeeeccCChhhH-----HHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHH Q lcl|NC_011801. 235 ENAGR--AVVLDQSADVETTNISPNVT-----EFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIY 307 (386) Q Consensus 235 ~~~g~--~~vl~~g~~~~~~~~~~~d~-----~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~ 307 (386) .++++ ++++++|++|+++++++.++ ||+|++++++++||++|||||.+|+ +++++.+++.++|+++||.|+ T Consensus 212 ~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~--~~~sn~e~~~~~~~~~~l~P~ 289 (395) T protein:vir:95 212 FNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIY--GETADLEKNTLVFEKFCLTPL 289 (395) T ss_pred ccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhc--CcccCHHHHHHHHHHHHHHHH Confidence 66555 45579999999999988765 8999999999999999999999997 456778999999999999999 Q ss_pred HHHHHHHHHHhhhh------hhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCcccc----- Q lcl|NC_011801. 308 IKPIESELSQKLGT------DVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTN----- 376 (386) Q Consensus 308 ~~~ie~~l~~~l~~------~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~----- 376 (386) +.+||++|+++|+. +++||++.+++.|.+++++++++++++||||+||+|+++|++|++ ++.+++.-- T Consensus 290 ~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~-~g~~d~~~~~~n~~ 368 (395) T protein:vir:95 290 LKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSD-NPELDEYLITKNYE 368 (395) T ss_pred HHHHHHHHHHhhcChhhhcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC-CCCCceeeeccccc Confidence 99999999999853 368999999999999999999999999999999999999987763 333332110 Q ss_pred ccc------------cCCCCCC Q lcl|NC_011801. 377 LLD------------NTKNIND 386 (386) Q Consensus 377 ~~~------------~~~~~~~ 386 (386) ++. ..+++.+ T Consensus 369 ~~~~~~~~~~~~~~~~~kgg~~ 390 (395) T protein:vir:95 369 KANSGENDEKEKDENTLKGGDE 390 (395) T ss_pred cccccccccCcccccccCCCCC Confidence 000 0011111 No 66 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=100.00 E-value=1.4e-73 Score=419.99 Aligned_cols=356 Identities=15% Similarity=0.164 Sum_probs=279.7 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec------chhHHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN------AQPITDVL 74 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~------~~~~~~~l 74 (386) ||||+++|+|+......... -....++...|+++++|++||++||+++|++|++++ .|+++++| T Consensus 1 Mg~f~~~f~~~~~~~~~~~~----------~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~~~~l~~lL 70 (385) T protein:vir:95 1 MGLFDSVFKRHSELSWMYDL----------EFLQDKSKKAYLKQIALNTVVEMVARTISQSEFRVMKNNTKEKGTLYYLL 70 (385) T ss_pred CchhhhhhccCcccccccch----------hhhhccchhhhhhhHHHHHHHHHHHHHHcccceeeeecCccccchHHHHH Confidence 99999999886543222111 122356778899999999999999999999999875 47899999 Q ss_pred hccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEEcc Q lcl|NC_011801. 75 NAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYDS 154 (386) Q Consensus 75 ~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (386) +.+||++||+++||+.++.+++++||||+++.++. +.+..++++.+..+.+.... ++...... .+....+++ T Consensus 71 ~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~~~~~-~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~--~~~~~~~~~ 142 (385) T protein:vir:95 71 NVRPNRNQNAVDFWQKFIFKLIMDNEVLVVKNDEG-HFFVADDFEKEDELGLYSHR-----FTNVLVND--FEFKRVFTM 142 (385) T ss_pred hcccCcCCCHHHHHHHHHHHHhhcCceEEEEecCC-Ceeecccccccccccccccc-----ceeeeecc--cceeeeecc Confidence 99999999999999999999999999999887654 44555666666665543322 22222222 234467999 Q ss_pred cceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCC-CCCCHHHHHHHHHHHHHHhc Q lcl|NC_011801. 155 SEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPN-ATLGKEAKENTRQSFEEQTT 233 (386) Q Consensus 155 ~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~-~~~~~~~~~~~k~~~~~~~~ 233 (386) ++|+|+++.. +.+..+|.||+..+..++....++. .+++.|+++++++. ..+++++.++++++|++.++ T Consensus 143 ~eiih~~~~~---~~~~~~G~s~~~~~~~~i~~~~~~~-------~~~~~~~g~l~~~~~~~~~~e~~~~~~~~~~~~~~ 212 (385) T protein:vir:95 143 DDVIYLKYNN---QKLDAFSLGLFEDYGEIFGRMIDLQ-------MLNNQIRGILKVDATKFYNKEKQKELQAYIDTLFD 212 (385) T ss_pred ccEEEecCCC---CCcccccchHHHHHHHHHHHHHHHH-------HhcCCCceEEEeCCccCCCHHHHHHHHHHHHHHhh Confidence 9999999653 3455789999999999887655543 22344788888864 46799999999999999988 Q ss_pred cc--ccCcceecCCCceeeeccC------ChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHH Q lcl|NC_011801. 234 GE--NAGRAVVLDQSADVETTNI------SPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLS 305 (386) Q Consensus 234 ~~--~~g~~~vl~~g~~~~~~~~------~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~ 305 (386) |. ++++++++++|++|+++++ ++.|+||+|.+++++++||++|||||.+|+ +++++.+++..+|+++||. T Consensus 213 g~~~~~~~i~~l~~g~~~~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~--~~~sn~e~~~~~~~~~~l~ 290 (385) T protein:vir:95 213 AFQNNTIAVVPLTEGLAYEEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLVL--GEMADLEKTIESYLQFCIN 290 (385) T ss_pred hhhhcCCceEEcCCCceeEeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhc--CCCcCHHHHHHHHHHHHHH Confidence 73 4456889999999999875 667999999999999999999999999996 4577889999999999999 Q ss_pred HHHHHHHHHHHHhhhh-------hhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcC-CCCCCCc---c Q lcl|NC_011801. 306 IYIKPIESELSQKLGT-------DVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVF-PELDLDE---G 374 (386) Q Consensus 306 P~~~~ie~~l~~~l~~-------~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~-p~~~~~~---~ 374 (386) |++.+||++|+++|++ +++||++.+++.|.+++++++++++++|+||+||+|+++|++|++ |++|... . T Consensus 291 P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~NE~R~~~g~~p~~~~~gd~~~~~~n 370 (385) T protein:vir:95 291 PLLRKIEAELNSKFFYQDEYLNDDMHIKVVGIDKRDPLKLSEAIDKLVASGTFTRNQVRIMTGEEPADDPELDKFIITKN 370 (385) T ss_pred HHHHHHHHHHHhhcCChhhcccceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeeccc Confidence 9999999999999853 579999999999999999999999999999999999999988764 3333221 1 Q ss_pred ccccccCCCCCC Q lcl|NC_011801. 375 TNLLDNTKNIND 386 (386) Q Consensus 375 ~~~~~~~~~~~~ 386 (386) -.++...+++++ T Consensus 371 ~~~~~~~kgge~ 382 (385) T protein:vir:95 371 LQSADAFKGGES 382 (385) T ss_pred ceecccccCCCC Confidence 122233333333 No 67 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=100.00 E-value=1.3e-72 Score=414.65 Aligned_cols=379 Identities=12% Similarity=0.068 Sum_probs=286.1 Q ss_pred Cchhhhhc--cccccCCc--------------------------------cchhh-hhhccc--ccccCcccc-----cH Q lcl|NC_011801. 1 MAFLSNLF--KRQKMLSG--------------------------------SSPVW-ILNQGQ--PVSIKPKAI-----TS 38 (386) Q Consensus 1 Mg~~~~l~--~~~~~~~~--------------------------------~~~~~-~~~~~~--~~~~~~~~i-----~~ 38 (386) ||+|+++. ++...... ..+.. ...... ....+..+. .. T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~~~~~~~~~~~~r~~~~~~~~l~~~~ 84 (551) T protein:vir:80 5 LGLFESIRLVGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLHGVL 84 (551) T ss_pred hhhHHHhhhccCChhhcccccccccceeeecccccHHHHHHhhccCcceeecccccceecCcccccCccccChhHHHHHH Confidence 99998764 11111000 00000 000000 001111111 11 Q ss_pred HHHhccHHHHHHHHHHHHhhccC-----------ceeec--c-------------hhHHHHHhccCccc-----CCHHHH Q lcl|NC_011801. 39 AIALKNSDVYAVISRVSSDIAGC-----------RFVTN--A-------------QPITDVLNAPLGNL-----MSGFSV 87 (386) Q Consensus 39 ~~a~~~~~v~~~v~~ia~~ia~~-----------p~~~~--~-------------~~~~~~l~~~PN~~-----~s~~~f 87 (386) +.+..+|+|++||+.||++||++ ++.+. + +.+..+| .+||++ +|+.+| T Consensus 85 ~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l-~~pn~~~~p~~~s~~~f 163 (551) T protein:vir:80 85 KKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFI-EKTGVDNDINRDSFSSF 163 (551) T ss_pred HHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHHHHH-HhcCCCCCCccchHHHH Confidence 23446799999999999999974 34332 1 1123344 568887 488999 Q ss_pred HHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCce----eEEEEeccCcccceeEEEcccceeeeccc Q lcl|NC_011801. 88 WQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKD----LTYTVHFDDSKRSGDFLYDSSEVIHFRCT 163 (386) Q Consensus 88 ~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~vih~~~~ 163 (386) ++.++.+++++||||++++|+..|++.+||||+|++|++..+.++.. ..|.+... .+....+++++|+|++++ T Consensus 164 ~~~lv~dlll~Gnay~~i~rd~~G~~~~L~~l~p~~V~v~~~~~g~~~~~~~~y~~~~~---g~~~~~~~~~eiiH~~~n 240 (551) T protein:vir:80 164 VKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVID---QKIVATFNAREMAFAVRN 240 (551) T ss_pred HHHHHHHHHhcCCEEEEEEECCCCcEEEEEEeCCceeEEEECCccccccCceEEEEEeC---CcEEEEEcccceEEeccc Confidence 99999999999999999999999999999999999999998887753 23333222 233457999999999987 Q ss_pred cccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCC-CCCCHHHHHHHHHHHHHHhcc-cccCcce Q lcl|NC_011801. 164 VSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPN-ATLGKEAKENTRQSFEEQTTG-ENAGRAV 241 (386) Q Consensus 164 ~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~-~~~~~~~~~~~k~~~~~~~~~-~~~g~~~ 241 (386) +..++.++.+|+||+.++..++..+.++++++.++|+||++|+++|++++ ..+++++++++++.|++.++| .|+|+++ T Consensus 241 ~~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G~~nag~~~ 320 (551) T protein:vir:80 241 PRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIP 320 (551) T ss_pred CCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCccc Confidence 66667778899999999999999999999999999999999999999864 458999999999999999988 6889976 Q ss_pred ec-CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC------------cccHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 242 VL-DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD------------AQSNITMIRAFYQSSLSIYI 308 (386) Q Consensus 242 vl-~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~------------~~~~~~~~~~~~~~~l~P~~ 308 (386) ++ ++|++|+++++++.|+||+|++++++++||++|||||.+||..+. +++.+++..+|+++||.|++ T Consensus 321 vl~~~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~tL~P~~ 400 (551) T protein:vir:80 321 VVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLL 400 (551) T ss_pred cccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHH Confidence 65 689999999999999999999999999999999999999986432 35668888999999999999 Q ss_pred HHHHHHHHHhhhh----hhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCcccccccc---- Q lcl|NC_011801. 309 KPIESELSQKLGT----DVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTNLLDN---- 380 (386) Q Consensus 309 ~~ie~~l~~~l~~----~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~~~~~---- 380 (386) .+||++|+++|+. .++|+++.+...+..++++++ +++.+|+||+||+|+++|++|..|++|... .++.. T Consensus 401 ~~ie~~ln~~L~~~~~~~~~f~f~~~~~~~~~~~~~~~-~~~~~g~lT~NE~R~~~gl~P~~egGD~~~--~~~~~~~~~ 477 (551) T protein:vir:80 401 GFIEDFINKHIVAEFGDKYTFQFVGGDIKSELESVKIL-AEKAKVAMTVNEVRKELNLPGDVIGGDIPL--NGVIVQRIG 477 (551) T ss_pred HHHHHHHHhhhccccCCceEEEeeccChhhHHHHHHHH-HHHhcCCcCHHHHHHHhCCCCCCCCCceee--ccccccccc Confidence 9999999999864 456777777778888887765 467789999999999999877555555332 11100 Q ss_pred ----CCCCCC Q lcl|NC_011801. 381 ----TKNIND 386 (386) Q Consensus 381 ----~~~~~~ 386 (386) ..++++ T Consensus 478 ~~~~~~~~~~ 487 (551) T protein:vir:80 478 QLMQQEQFEH 487 (551) T ss_pred ccccccCcch Confidence 011111 No 68 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=100.00 E-value=1.6e-72 Score=414.24 Aligned_cols=317 Identities=18% Similarity=0.300 Sum_probs=269.0 Q ss_pred hccCceeec------chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCC Q lcl|NC_011801. 58 IAGCRFVTN------AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDY 131 (386) Q Consensus 58 ia~~p~~~~------~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~ 131 (386) ||++|++++ +|+++++|+.+||++||+++||+.++.+++++||||++++|+..|++++||||+|+.|++..+.+ T Consensus 1 ia~lp~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~l~~~~v~~~~~~~ 80 (348) T protein:vir:93 1 MASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQ 80 (348) T ss_pred CcccceEeEecCcCcccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCCceEEEEeCC Confidence 999999875 47899999999999999999999999999999999999999999999999999999999999998 Q ss_pred CceeEEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEee Q lcl|NC_011801. 132 GKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKV 211 (386) Q Consensus 132 ~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~ 211 (386) +..++|.+...+ +..+.+++++|+|+|+.. +.++++|+||+..+..+++...+++++... .++..++++++. T Consensus 81 ~~~~~y~~~~~~---g~~~~~~~~eiih~r~~~---~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~--~~~~~~~~i~~~ 152 (348) T protein:vir:93 81 SRELYYSIHAAT---GNKLIVHNMDMLHFKHIV---ASNMVQGISPIDVLKNTTDFDNAVRTFNLT--EMQKPDSFMLKY 152 (348) T ss_pred CcEEEEEEEcCC---CeEEEEccccEEEecCCC---CCCceeeccHHHHHHHHHHHHHHHHHHHHH--hcCCCceeEEec Confidence 888888776543 345679999999998643 467889999999999999999999888633 333334455544 Q ss_pred CCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC--c Q lcl|NC_011801. 212 PNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD--A 289 (386) Q Consensus 212 ~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~--~ 289 (386) +..+++++.++++++|++.+. ++|+++++++|++|++++++++|+||+|++++++++||++|||||.+|+..++ + T Consensus 153 -~~~l~~e~~~~~~~~~~~~~~--n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~ 229 (348) T protein:vir:93 153 -GSNVSTEKRQQVLEDFKQYYE--ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNF 229 (348) T ss_pred -CCCCCHHHHHHHHHHHHHHhh--cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCc Confidence 567899999999999998774 56789999999999999999999999999999999999999999999987654 5 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHhhh--------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhc Q lcl|NC_011801. 290 QSNITMIRAFYQSSLSIYIKPIESELSQKLG--------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLK 361 (386) Q Consensus 290 ~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~--------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg 361 (386) ++.+++.++|+++||+|+++.||++|+++|+ .+++||++.+++.|.+++++++++++++|+||+||+|+++| T Consensus 230 ~~~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~R~~~g 309 (348) T protein:vir:93 230 AKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWED 309 (348) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhC Confidence 6778999999999999999999999999984 45889999999999999999999999999999999999999 Q ss_pred cCCcCCCCCCCc-ccccc--c------c-CCCCCC Q lcl|NC_011801. 362 NRGVFPELDLDE-GTNLL--D------N-TKNIND 386 (386) Q Consensus 362 ~~p~~p~~~~~~-~~~~~--~------~-~~~~~~ 386 (386) ++|++ ++|... ..+.+ . . .+++.+ T Consensus 310 ~~p~~-ggD~~~~~~n~~~~~~~~~~~~~~~gg~~ 343 (348) T protein:vir:93 310 LPPVE-GGDKPLISGDLYPIDTPLELRKSLKGGDK 343 (348) T ss_pred CCCCC-CcCeEeecccccccccchhhcccccCCCC Confidence 88863 333211 11111 0 0 111111 No 69 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=100.00 E-value=1.2e-72 Score=414.88 Aligned_cols=355 Identities=18% Similarity=0.149 Sum_probs=269.3 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec------chhHHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN------AQPITDVL 74 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~------~~~~~~~l 74 (386) ||||+++|+|.+....... ......++.+.|+++++|++||++||+++|++|++++ +|+++++| T Consensus 1 Mg~f~~l~~~~~~~~~~~~----------~~~~~~~~~~~~l~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~~~~l~~ll 70 (376) T protein:vir:78 1 MGFFSELFKRNKEIEWMWD----------LDFLEDKTTKVYLKKMALNTCVKHIARTIAKSDFRLKNGETSVRDKLYYKL 70 (376) T ss_pred CchhhhhhccCCccccccc----------hhhccccchhhhhhhHHHHHHHHHHHHhhcccceeeccccccccchHHHHH Confidence 9999999987654322211 1123457888999999999999999999999999886 58899999 Q ss_pred hccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEEcc Q lcl|NC_011801. 75 NAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYDS 154 (386) Q Consensus 75 ~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (386) +.+||++||+++||+.++.+++++||||+++.++..|.+.+++|+.+..+..... +.+.... ......+++ T Consensus 71 ~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~--~~~~~~~~~ 141 (376) T protein:vir:78 71 NIRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTDDFLIADSYVRKEFAFFPDVF-------EGVTVKD--YRYNRNFSM 141 (376) T ss_pred hhccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCCCeeeccceeecccceeeeee-------eeeeeec--ceeeeeecc Confidence 9999999999999999999999999999999999999999999998877643321 1122211 112346899 Q ss_pred cceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcc Q lcl|NC_011801. 155 SEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTG 234 (386) Q Consensus 155 ~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~ 234 (386) ++|+|+++...+ |.+....+...... ........++.+++.+.+++......+++++.+++++.|++.+++ T Consensus 142 ~evih~~~~~~~-------~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~g 212 (376) T protein:vir:78 142 DDVIFLEYGNER-------LSAFTDGMFEDYGE--LFGKMIRAQMRNFQIRGAVNFKMAGVADKDKQTKLQEYIDKVYAS 212 (376) T ss_pred ccEEEeccCCCC-------chhhhhHHHHHHHH--HHHHHHHHHHhcCCCceeEEEccCCCCCHHHHHHHHHHHHHHhcc Confidence 999999965421 12222222221111 122222333344443333333334578999999999999999987 Q ss_pred c--ccCcceecCCCceeeeccCChhhH-----HHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHH Q lcl|NC_011801. 235 E--NAGRAVVLDQSADVETTNISPNVT-----EFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIY 307 (386) Q Consensus 235 ~--~~g~~~vl~~g~~~~~~~~~~~d~-----~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~ 307 (386) . ++++++++++|++|+++++++.|+ ||+|++++++++||++|||||.+|+ +++++.+++..+|+++||.|+ T Consensus 213 ~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~--~~~s~~e~~~~~f~~~~l~P~ 290 (376) T protein:vir:78 213 FNNNEIAIVPQLEGFNYEEFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLLH--GDMADLSNNMKAYMEYCIDPL 290 (376) T ss_pred ccccCcceEEcCCCceEEeeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhC--CCCCCHHHHHHHHHHHHHHHH Confidence 3 445578899999999999888664 9999999999999999999999997 356778999999999999999 Q ss_pred HHHHHHHHHHhhhh----hhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCcc---cccccc Q lcl|NC_011801. 308 IKPIESELSQKLGT----DVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEG---TNLLDN 380 (386) Q Consensus 308 ~~~ie~~l~~~l~~----~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~---~~~~~~ 380 (386) +.+||++|+++|+. +++|+++.+++.|.+++++++++++++||||+||+|+++|++|+ |++++|+. .+.... T Consensus 291 ~~~ie~~l~~kll~~~~~~~~~~~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~lg~~p~-~~g~~d~~~~~~n~~~~ 369 (376) T protein:vir:78 291 TKKLEDELNAKLFTFSEFLAGEHIKIIHKKDIIENAEAVDKLVASGSFNRNEVRELLGAERV-DNPELDKYLITKNYQSA 369 (376) T ss_pred HHHHHHHHHhhhCCcccceecccchhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCCCCceeeeccCceeh Confidence 99999999999963 46889999999999999999999999999999999999998775 34433321 111122 Q ss_pred CCCCCC Q lcl|NC_011801. 381 TKNIND 386 (386) Q Consensus 381 ~~~~~~ 386 (386) .++++| T Consensus 370 ~~~~e~ 375 (376) T protein:vir:78 370 DEGGED 375 (376) T ss_pred hccccC Confidence 233333 No 70 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=100.00 E-value=1e-71 Score=409.75 Aligned_cols=379 Identities=10% Similarity=0.083 Sum_probs=276.1 Q ss_pred Cchhh--------------hhccccccCCcc--chh---hhhhccccc---------------cc-CcccccHHHHhccH Q lcl|NC_011801. 1 MAFLS--------------NLFKRQKMLSGS--SPV---WILNQGQPV---------------SI-KPKAITSAIALKNS 45 (386) Q Consensus 1 Mg~~~--------------~l~~~~~~~~~~--~~~---~~~~~~~~~---------------~~-~~~~i~~~~a~~~~ 45 (386) |++=+ .+-+........ .+. ....-+... .. ....+.....++.. T Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~iv~~~i~~~~~ 106 (574) T protein:vir:80 27 MHLREIDTNVVNNEPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSIRNSQDLHKTLKKFGNNIILNAIINTRSN 106 (574) T ss_pred cccchhhhhhhhccCCCHHHHHHhHhhhcccccchhhhhccccccccCcCccCCcccHHHHHHhhccChhHHHHHHHHHH Confidence 44321 011000000000 000 000000000 00 01113344456677 Q ss_pred HHHHHHHHHHHhhccCceeecc---------------hhHHHHHhc---cCcccC-CHHHHHHHHHHHHHHhCCeEEEEe Q lcl|NC_011801. 46 DVYAVISRVSSDIAGCRFVTNA---------------QPITDVLNA---PLGNLM-SGFSVWQAMIVQMMLTGNAFAIID 106 (386) Q Consensus 46 ~v~~~v~~ia~~ia~~p~~~~~---------------~~~~~~l~~---~PN~~~-s~~~f~~~~~~~~~l~G~a~~~~~ 106 (386) +|++|+.+|++++|++||+++. |++..+|.. .|||.+ |+.+|++.++.+++++||+|++++ T Consensus 107 ~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv~~lll~Gnayi~i~ 186 (574) T protein:vir:80 107 QVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQFRDPNRDNFTTFCKKLVRATYMYDQVNFEKV 186 (574) T ss_pred HHHHHHHHHHhhhccCceEEEEeccCCCccchhhhhhhHHHHHHhccCCCCCCccccHHHHHHHHHHHHHhcCCeEEEEE Confidence 8999999999999999998752 456677754 356664 788999999999999999999999 Q ss_pred ecCCCceEEEEEEcCcceEEeecCCCce----eEEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHH Q lcl|NC_011801. 107 RDTNGYPVRIEPVPNEKVTVALDDYGKD----LTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLL 182 (386) Q Consensus 107 ~~~~g~~~~l~~l~~~~v~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~ 182 (386) |+..|++++||||+|.+|++..+.++.. ..|++.. ..+....+++++|+|+++....+..++.+|+||+.++. T Consensus 187 r~~~G~~~~L~pl~p~~V~v~~d~~~~~~~~~~~y~~~~---~g~~~~~~~~~eiih~~~~~~~~~~~~~~G~spi~~a~ 263 (574) T protein:vir:80 187 FDKDGNFIKFDTVDPTTIFLATNGEGKLIKNGERFVQVI---DNRIVAKFNERELAFAVRNPRADIEVGQYGYPELEIAL 263 (574) T ss_pred ECCCCcEEEEEEEcCceeEEEEcCccccccCceEEEEEe---CCceEEEEccccEEEEeccCCCCcccccccccHHHHHH Confidence 9999999999999999999998876643 1222222 23445679999999999877777777889999999999 Q ss_pred HHHHHHHHHHHHHHHHHhccCCCceEEeeCC-CCCCHHHHHHHHHHHHHHhcc-cccCcce-ecCCCceeeeccCChhhH Q lcl|NC_011801. 183 NEIEVQDLSSKLAISTLRHAIKPSIFIKVPN-ATLGKEAKENTRQSFEEQTTG-ENAGRAV-VLDQSADVETTNISPNVT 259 (386) Q Consensus 183 ~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~-~~~~~~~~~~~k~~~~~~~~~-~~~g~~~-vl~~g~~~~~~~~~~~d~ 259 (386) .++..+.++++++.++|+||++|+++|++++ ..+++++++++++.|++.++| .|+|+++ ++++|++|+++++++.|+ T Consensus 264 ~~i~~~~~a~~~~~~~f~ng~~p~gil~~~~~~~ls~e~~~~lk~~~~~~~~G~~n~g~~~vl~~~G~~~~~l~~s~~D~ 343 (574) T protein:vir:80 264 KQFIAHENTEVFNDRFFSHGGTTRGILHVKTGQQQSQQALDIFRREWRSSLAGINGSWQIPVVSAEDVKFVNMTPSANDM 343 (574) T ss_pred HHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEEccCChhHH Confidence 9999999999999999999999999999865 458999999999999999988 6888864 557899999999999999 Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHhcCCcC------------cccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh------ Q lcl|NC_011801. 260 EFLQNVSFSQDQIAKAFGIPADYLSGKQD------------AQSNITMIRAFYQSSLSIYIKPIESELSQKLGT------ 321 (386) Q Consensus 260 ~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~------------~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~~------ 321 (386) ||+|++++++++||++|||||.+||..+. +++.+++...|+++||+|++.+||++|+++|+. T Consensus 344 qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~~~ 423 (574) T protein:vir:80 344 QFEKWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSKEKMQASQNKGLQPLLRFIEDTVNTYIVAEFGEKY 423 (574) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHhcccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCce Confidence 99999999999999999999999987543 356788999999999999999999999999864 Q ss_pred hhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCccc-------cccccC------------- Q lcl|NC_011801. 322 DVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGT-------NLLDNT------------- 381 (386) Q Consensus 322 ~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~-------~~~~~~------------- 381 (386) +++|+..++.+.+ +..+ +.+++++||||+||+|+++|++|+ |++|...-. ...... T Consensus 424 ~~~f~~~d~~~~~--~~~~-~~~~~~~G~lT~NE~R~~lgl~Pi-~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~ 499 (574) T protein:vir:80 424 QFQFRGGDLSAQL--DKLK-IIEQEGKVFRTVNEIRHDKGLEPI-KGGDVILNGVHIQAIGQALQEEQLEYQRSQDRLNR 499 (574) T ss_pred EEEecccchhhHH--HHHH-HHHHHhCCccCHHHHHHHhCCCCC-CCCCEeeeccceeecccccccccCCccchhccccc Confidence 3445544433222 2222 245788999999999999998776 343322100 000000 Q ss_pred ----CCCCC Q lcl|NC_011801. 382 ----KNIND 386 (386) Q Consensus 382 ----~~~~~ 386 (386) .+..| T Consensus 500 ~~~~~~~~~ 508 (574) T protein:vir:80 500 LLELSGGDV 508 (574) T ss_pred cccccCCCC Confidence 00000 No 71 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=100.00 E-value=4e-71 Score=406.55 Aligned_cols=379 Identities=12% Similarity=0.065 Sum_probs=284.8 Q ss_pred Cchhhhhcccc--cc-CCc-------------------------------cchhhh-hhccc--ccccCcccc-----cH Q lcl|NC_011801. 1 MAFLSNLFKRQ--KM-LSG-------------------------------SSPVWI-LNQGQ--PVSIKPKAI-----TS 38 (386) Q Consensus 1 Mg~~~~l~~~~--~~-~~~-------------------------------~~~~~~-~~~~~--~~~~~~~~i-----~~ 38 (386) ||||+++.+.. +. .-. +.+.+. ..+.. ....+..+. .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~~l~ 80 (547) T protein:vir:63 1 MGLFESIRLAGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLHGVL 80 (547) T ss_pred CchhhhhhhhcCCccccccccccccccchhhhhhhHHHHHHhhcccchhhhchhhheeecccccccCCccCChhHHHHHH Confidence 99999874311 00 000 000000 00000 000111111 12 Q ss_pred HHHhccHHHHHHHHHHHHhhccC-------------ceeecc-------------hhHHHHHhccCccc-----CCHHHH Q lcl|NC_011801. 39 AIALKNSDVYAVISRVSSDIAGC-------------RFVTNA-------------QPITDVLNAPLGNL-----MSGFSV 87 (386) Q Consensus 39 ~~a~~~~~v~~~v~~ia~~ia~~-------------p~~~~~-------------~~~~~~l~~~PN~~-----~s~~~f 87 (386) +.+..+|.|++||+.||+.||++ ++++++ +.+..+| .+||++ +|+.+| T Consensus 81 ~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l-~~pn~~~~p~~~s~~~f 159 (547) T protein:vir:63 81 KKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFI-EKTGVDNDINRDSFSSF 159 (547) T ss_pred HHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHH-HhhCCCCCCccchHHHH Confidence 23456899999999999999964 222221 1233444 458776 488999 Q ss_pred HHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCce----eEEEEeccCcccceeEEEcccceeeeccc Q lcl|NC_011801. 88 WQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKD----LTYTVHFDDSKRSGDFLYDSSEVIHFRCT 163 (386) Q Consensus 88 ~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~vih~~~~ 163 (386) ++.++.+++++||+|++++|+..|++.+||||+|++|++..+.++.. ..|.+... .+....+++++|||++++ T Consensus 160 ~~~lv~d~ll~Gn~~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~---~~~~~~~~~~eiih~r~n 236 (547) T protein:vir:63 160 VKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVID---QKIVATFNAREMAFAVRN 236 (547) T ss_pred HHHHHHHHHhhCCEEEEEEECCCCcEEEEEEecCceeEEEECCccccccCceEEEEEcC---CcEEEEeccccEEEeccc Confidence 99999999999999999999999999999999999999998777643 23332222 233457999999999987 Q ss_pred cccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCC-CCCCHHHHHHHHHHHHHHhcc-cccCcce Q lcl|NC_011801. 164 VSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPN-ATLGKEAKENTRQSFEEQTTG-ENAGRAV 241 (386) Q Consensus 164 ~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~-~~~~~~~~~~~k~~~~~~~~~-~~~g~~~ 241 (386) +..++..+.+|+||+.++..++....++++++.++|+||++|+++|++++ ..+++++++++++.|++.++| .|+|+++ T Consensus 237 ~~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lk~~~~~~~~G~~nagk~~ 316 (547) T protein:vir:63 237 PRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIP 316 (547) T ss_pred CCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCCCCHHHHHHHHHHHHHHhcCcccccccc Confidence 66666778899999999999999999999999999999999999999874 458999999999999999988 6889876 Q ss_pred ec-CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCc------------CcccHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 242 VL-DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQ------------DAQSNITMIRAFYQSSLSIYI 308 (386) Q Consensus 242 vl-~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~------------~~~~~~~~~~~~~~~~l~P~~ 308 (386) ++ ++|++|+++++++.|+||+|++++++++||++|||||.+||... ++++.+++..+|+++||+|++ T Consensus 317 vl~~~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~~~tL~P~~ 396 (547) T protein:vir:63 317 VVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLL 396 (547) T ss_pred cccCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHH Confidence 65 68899999999999999999999999999999999999998643 235678889999999999999 Q ss_pred HHHHHHHHHhhhh----hhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCccccccc----- Q lcl|NC_011801. 309 KPIESELSQKLGT----DVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTNLLD----- 379 (386) Q Consensus 309 ~~ie~~l~~~l~~----~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~~~~----- 379 (386) ..||++||++|+. .++|+++.+...+..++++++ +++.+|+||+||+|+++|++|..|++|... +++. T Consensus 397 ~~ie~~ln~~L~~~~~~~~~~~f~~~~~~~~~~~~~~~-~~~~~g~lT~NE~R~~~gl~P~~egGD~~~--~~~~~~~~~ 473 (547) T protein:vir:63 397 GFIEDFINKHIVAEFGDKYTFQFVGGDIKSELESVKIL-AEKAKVAMTVNEVRKELNLPGDVIGGDIPL--NGVIVQRIG 473 (547) T ss_pred HHHHHHHHhhcccccCCceEEEeeccccccHHHHHHHH-HHHhCCCcCHHHHHHHhCCCCCCCCCceee--ccccccccc Confidence 9999999999864 356777777777887777765 577889999999999999877555555332 1110 Q ss_pred ---cCCCCCC Q lcl|NC_011801. 380 ---NTKNIND 386 (386) Q Consensus 380 ---~~~~~~~ 386 (386) +..+.++ T Consensus 474 ~~~~~~~~~~ 483 (547) T protein:vir:63 474 QLMQQEQFEH 483 (547) T ss_pred ccccccCCcc Confidence 0011111 No 72 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=100.00 E-value=4e-71 Score=406.53 Aligned_cols=378 Identities=12% Similarity=0.079 Sum_probs=282.8 Q ss_pred Cchhhh-----------------hccccccCCc-cchhhh--hhcccccccCcccccHHHHhccHHHHHHHHHHHHhhcc Q lcl|NC_011801. 1 MAFLSN-----------------LFKRQKMLSG-SSPVWI--LNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAG 60 (386) Q Consensus 1 Mg~~~~-----------------l~~~~~~~~~-~~~~~~--~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~ 60 (386) |.++.. ..+.+..... ...... .............| +.+..+.++++|+.++++.+++ T Consensus 34 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~~~~~~~~~~~~i--~t~~~~va~~~~i~~~s~~~~~ 111 (535) T protein:vir:10 34 KAIRPGRASARDTVDGIDIADGNVAGQYSVASISDVLSTKKLLKAYADNDIVQAII--RTRTNQVLTYSNPSRYNRNGVG 111 (535) T ss_pred hhhhhhhhhhhccccccccccCCcccccccCccccccCHHHHHHHhccChhHHHHH--HHHHHHHHHHHHHHHHhcccCc Confidence 333221 1000000000 000000 00000111111112 2344678899999999999999 Q ss_pred Cceeecc-------------hhHHHHHhccCcccCCHHH----HHHHHHHHHHHh-CCeEEEEeecCCCceEEEEEEcCc Q lcl|NC_011801. 61 CRFVTNA-------------QPITDVLNAPLGNLMSGFS----VWQAMIVQMMLT-GNAFAIIDRDTNGYPVRIEPVPNE 122 (386) Q Consensus 61 ~p~~~~~-------------~~~~~~l~~~PN~~~s~~~----f~~~~~~~~~l~-G~a~~~~~~~~~g~~~~l~~l~~~ 122 (386) +|+++++ |++.++|+.+||++|++++ |++.++.+++++ |++|++++++..|++.+||||+|+ T Consensus 112 ~~i~l~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~~~~~~~~~~~~~lv~d~l~~~g~ay~~i~r~~~G~~~~L~~l~p~ 191 (535) T protein:vir:10 112 FKVELKDATKVMSKAQIKRAHEIEDFIYNTGSEYYEWRDTFPRLLTKIINDMYVQDQINIERIFKNDSNELDHFNAVDAS 191 (535) T ss_pred ceeEEEeccCCCcchhhhhhhHHHHHHHhCCCCCCChhHHHHHHHHHHHHHHHhhCCceEEEEEECCCCcEEEEEEeCCc Confidence 9987652 5678899999999999876 556667776655 589999999999999999999999 Q ss_pred ceEEeecCCCc---eeEEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 123 KVTVALDDYGK---DLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTL 199 (386) Q Consensus 123 ~v~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~ 199 (386) +|++..+.++. ..+|.+. ..+....+++++|+|+++++.....++.+|+||+.++..++....++++++.++| T Consensus 192 ~V~v~~d~~~~~~~~~~~~~~----~~~~~~~~~~~eiih~~~~~~~~~~~~~~G~Spi~~~~~~i~~~~aa~~~~~~~f 267 (535) T protein:vir:10 192 KVVISYSPRSKDQPRKFEQFV----SETKSVKFSERNLTFINYWNLSDTDRRGYGYSPVEASIPLIRAIYDTEQFNARFF 267 (535) T ss_pred eeEEEEcCccccCceEEEEEe----cCceeEEECcccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 99998875543 2333222 1234567999999999987766667788999999999999999999999999999 Q ss_pred hccCCCceEEeeCC---CCCCHHHHHHHHHHHHHHhcc-cccCcceecC-CCceeeeccCChhhHHHHHHHHHHHHHHHH Q lcl|NC_011801. 200 RHAIKPSIFIKVPN---ATLGKEAKENTRQSFEEQTTG-ENAGRAVVLD-QSADVETTNISPNVTEFLQNVSFSQDQIAK 274 (386) Q Consensus 200 ~ng~~~~~~l~~~~---~~~~~~~~~~~k~~~~~~~~~-~~~g~~~vl~-~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~ 274 (386) +||++|+++|++++ ..+++++++++++.|++.++| .|+|+++++. +|++|++++++++|+||+|++++++++||+ T Consensus 268 ~ng~~p~giL~~~~~~~~~ls~e~~e~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~eIa~ 347 (535) T protein:vir:10 268 SQGGTTRGILVIDQDGDAQANQMMLAGIRRQWTSQGSGLGGAWKIPILAAKDAKFVNMTQNSRDMEFDKFLNFMIYDTAA 347 (535) T ss_pred hccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCcccccccccccCCCceEEecCCChhHHHHHHHHHHHHHHHHH Confidence 99999999999874 358999999999999999988 6888876665 799999999999999999999999999999 Q ss_pred HhCCCHHHhcCCcCcc--------------cHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh----hhhhcchhhhccCHH Q lcl|NC_011801. 275 AFGIPADYLSGKQDAQ--------------SNITMIRAFYQSSLSIYIKPIESELSQKLGT----DVKLDIASAIDSDNS 336 (386) Q Consensus 275 ~~gvp~~~l~~~~~~~--------------~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~~----~~~fd~~~~l~~d~~ 336 (386) +|||||.+||..++++ +.+++...|++.||.|++..||++||++|+. +++|+++.+++.|.+ T Consensus 348 afgVPp~~lG~~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~~L~P~l~~ie~~ln~~Ll~~~~~~~~f~f~~l~~~d~~ 427 (535) T protein:vir:10 348 IFQMQPEEINFPNNGGSTGKSGTKSVNEGSTAKAKLESSKDKGLTPLLSFIEQVINDKIMRYVDTDYRFSFTLGDAQDKL 427 (535) T ss_pred HhCCCHHHhccccCcccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccCCeEEEEeccccccCHH Confidence 9999999999765432 2366778899999999999999999999864 467888999999999 Q ss_pred HHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCcc----ccccccC--------------CCCCC Q lcl|NC_011801. 337 ELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEG----TNLLDNT--------------KNIND 386 (386) Q Consensus 337 ~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~----~~~~~~~--------------~~~~~ 386 (386) +++++++.+. +|+||+||+|+++|++|+ |++|.+.. .+.+..+ .+..+ T Consensus 428 ~r~~~~~~~~-~g~lT~NE~R~~~gl~pi-egGD~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~ 493 (535) T protein:vir:10 428 QEEQVWKLKL-ANGYFINEYRKDHGLKTV-DGLDVPGFIGSAENFINATGFGQPNVPDSSDDSGSTLG 493 (535) T ss_pred HHHHHHHHHH-cCCCCHHHHHHHhCCCCC-CCccccccccchhhcccccccccccCCCCCCCccccCC Confidence 9999887655 678999999999998776 45553211 0111000 00000 No 73 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=100.00 E-value=2.2e-70 Score=402.45 Aligned_cols=376 Identities=16% Similarity=0.135 Sum_probs=282.8 Q ss_pred hhhhhcccc-----ccC-CccchhhhhhcccccccCcccccHHH----HhccHHHHHHHHHHHHhhccCceeecchhHHH Q lcl|NC_011801. 3 FLSNLFKRQ-----KML-SGSSPVWILNQGQPVSIKPKAITSAI----ALKNSDVYAVISRVSSDIAGCRFVTNAQPITD 72 (386) Q Consensus 3 ~~~~l~~~~-----~~~-~~~~~~~~~~~~~~~~~~~~~i~~~~----a~~~~~v~~~v~~ia~~ia~~p~~~~~~~~~~ 72 (386) +|+..|.-+ ++. ................+...+++... +..+++|++||++||++||++|++++...... T Consensus 1 ~~~~~~~i~s~~~~~~i~~~~~~s~~~~~~~~~~~~~pp~~~~~la~l~~~n~~v~scI~~ia~~IA~l~~~~~~~~~~~ 80 (542) T protein:vir:41 1 MFNYHLSIRSLEKYKAIKREEVESQALGETRFEEYVEPKVNPLVLLSLLQVNPYHASACSIKANDIIRTGYILEGDDEGV 80 (542) T ss_pred CccccccccccccchhhhhccccccccccccCCccccCCCCHHHHHHHHhhcHHHHHHHHHHHHHHhhCceeeecccchh Confidence 454332211 111 11111111111111222223444432 34579999999999999999999998877777 Q ss_pred HHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeE-------EEEeccC-- Q lcl|NC_011801. 73 VLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLT-------YTVHFDD-- 143 (386) Q Consensus 73 ~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~-------~~~~~~~-- 143 (386) +++..||+.||+++|++.++.+++++||||++++|+..|++.+||||+|..|++..+.+..... |...+.. T Consensus 81 l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~~~~L~~l~~~~v~v~~d~~~~~~~~~~~~~~~~~~y~~~~ 160 (542) T protein:vir:41 81 VDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGDPIRFEYIPSHTIRVHKDGSRYRQTWDGVNITHFKDYRYEG 160 (542) T ss_pred hhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCcceEEEEcCCeeEeeecCCcceeEEeecccc Confidence 7777799999999999999999999999999999999999999999999999998876543321 1111111 Q ss_pred ----cccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCC------ Q lcl|NC_011801. 144 ----SKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPN------ 213 (386) Q Consensus 144 ----~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~------ 213 (386) ........+++++|||+|+. .+.++++|+||+..+..++....++++++.++|+||++|+++|++++ T Consensus 161 ~~~~~~g~~~~~~~~~eIiHir~~---~~~~~~~Glspi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~ 237 (542) T protein:vir:41 161 EINPETGEDQDSVGANELVFIHIP---SPVCSYYGVPRYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEL 237 (542) T ss_pred cccccccccccccCcccEEEecCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCcccccc Confidence 01112345889999999854 35678899999999999999999999999999999999999998864 Q ss_pred ---CCCCHHHHHHHHHHHHHHhcc--cccCcceecC------CCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHH Q lcl|NC_011801. 214 ---ATLGKEAKENTRQSFEEQTTG--ENAGRAVVLD------QSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADY 282 (386) Q Consensus 214 ---~~~~~~~~~~~k~~~~~~~~~--~~~g~~~vl~------~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~ 282 (386) ..+++++.+++++.|++.+.| .|+|+++|++ +|++|++++++++|++|++.+++++++||++|||||.+ T Consensus 238 ~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL~~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~~Ia~afgVPp~~ 317 (542) T protein:vir:41 238 EEDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVFSIPGGDTVKVTFTPLNTSQKELSFREYAAEKKYDIAAAHMIDPYR 317 (542) T ss_pred ccccccCHHHHHHHHHHHHHHHhhhhcccCceeEeeccCCcccceeEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHH Confidence 346889999999999998876 4778899984 79999999999999999999999999999999999999 Q ss_pred hcCCcC----cccHHHHHHHHHHHHHHHHHHHHHHHHHHhhh------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcC Q lcl|NC_011801. 283 LSGKQD----AQSNITMIRAFYQSSLSIYIKPIESELSQKLG------TDVKLDIASAIDSDNSELINNVQKLASAGVLA 352 (386) Q Consensus 283 l~~~~~----~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t 352 (386) ||..+. +++.+++.+.|+++||+|+++.||++||++|+ .+++|+.+.+++.|..+ .+++++++|+|| T Consensus 318 lG~~~~~t~n~sn~Eq~~~~f~~~tL~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~~ll~~d~~~---~~~~~v~~GilT 394 (542) T protein:vir:41 318 LGIADTGPLGGNFAEVTRRTYYESVVRPQQNIISSILTDFFQVKFNPKTRFKFNDETLLESDSVR---NCALLVQSGVLT 394 (542) T ss_pred hCcCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCceEEEecchhhcchHHHH---HHHHHHhCCCCC Confidence 987643 35678899999999999999999999999885 45688888888877544 456799999999 Q ss_pred HHHHHHHhccCCcCCCCCCC-----ccccccc-cCCCCCC Q lcl|NC_011801. 353 PIQAQKLLKNRGVFPELDLD-----EGTNLLD-NTKNIND 386 (386) Q Consensus 353 ~nE~R~~lg~~p~~p~~~~~-----~~~~~~~-~~~~~~~ 386 (386) +||+|+.+ .|++|+++.. .....+. ..+|..+ T Consensus 395 ~NE~Re~L--~g~~pgdd~~l~p~~~~~~~~~~~~~n~~~ 432 (542) T protein:vir:41 395 PAEARERL--FGLDGGPDIFMVPSKGAAKSVKRQERNYEK 432 (542) T ss_pred HHHHHHhh--CCCCCCCccccccccccccccccCCcCCCC Confidence 99999854 2333332211 0011111 1111221 No 74 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=100.00 E-value=1.9e-69 Score=397.32 Aligned_cols=345 Identities=17% Similarity=0.139 Sum_probs=276.5 Q ss_pred cHHHHhccHHHHHHHHHHHHhhccCceeecch---------------hHHHHHhccCcccC--------CHHHHHHHHHH Q lcl|NC_011801. 37 TSAIALKNSDVYAVISRVSSDIAGCRFVTNAQ---------------PITDVLNAPLGNLM--------SGFSVWQAMIV 93 (386) Q Consensus 37 ~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~~---------------~~~~~l~~~PN~~~--------s~~~f~~~~~~ 93 (386) -.+-+..+++|++||++||++||++|++++.+ ....++..+||+.| ++.+||+.++. T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~~ 80 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTAWT 80 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHHHH Confidence 22334457999999999999999999987532 12335667788765 66789999999 Q ss_pred HHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCce-------eEEE------------------EeccCcccce Q lcl|NC_011801. 94 QMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKD-------LTYT------------------VHFDDSKRSG 148 (386) Q Consensus 94 ~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~-------~~~~------------------~~~~~~~~~~ 148 (386) +++++||||++++|+..|.+++||||+|++|++..+..... .++. ........+. T Consensus 81 ~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (467) T protein:vir:31 81 DYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDERGFVQLLEEKEKYFGVAGDRYQTNGNGDLDPVFVDADDGSTGT 160 (467) T ss_pred HHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeecceeEeecCCceeeEEeccccceeecccceeeeeeeeccccccc Confidence 99999999999999999999999999999999987654321 1111 1111223455 Q ss_pred eEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHH Q lcl|NC_011801. 149 DFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSF 228 (386) Q Consensus 149 ~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~ 228 (386) .+.+++++|||+|.. ++.++++|+||+.++..++....++++++.++|+||+.|+++|++++..+++++.+++++.| T Consensus 161 ~~~~~~~diih~r~~---~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~ 237 (467) T protein:vir:31 161 SVSNPANELIFKRNH---SPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFENDGVPRIAIIVKGAELTEKGREEMRNLI 237 (467) T ss_pred eeEeccccEEEecCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHHH Confidence 678999999999854 45678899999999999999999999999999999999999999888789999999999999 Q ss_pred HHHhcc------------cccCcceecCCCceeeeccC--------ChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC Q lcl|NC_011801. 229 EEQTTG------------ENAGRAVVLDQSADVETTNI--------SPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD 288 (386) Q Consensus 229 ~~~~~~------------~~~g~~~vl~~g~~~~~~~~--------~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~ 288 (386) ++.+++ .+++++++++.|+++.++++ +++|+||++++++++++||++|||||.+||..+. T Consensus 238 ~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~~Ia~~fgVpp~~lG~~~~ 317 (467) T protein:vir:31 238 EDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLEFRGRNEHDILKVHDVPPVIAGVVES 317 (467) T ss_pred HhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhhHHHHHHHHHHHHHHHHHhCCCHHHcccCCC Confidence 876642 57788999988776655532 5789999999999999999999999999987543 Q ss_pred c---ccHHHHHHHHHHHHHHHHHHHHHHHHHHhhh--------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHH Q lcl|NC_011801. 289 A---QSNITMIRAFYQSSLSIYIKPIESELSQKLG--------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQ 357 (386) Q Consensus 289 ~---~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~--------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R 357 (386) + ++.+++.++|+++||+|+++.||++||++|+ .+++|+++.+++.|.+++++++++++++|++|+||+| T Consensus 318 ~~~~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~~~~~~~~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R 397 (467) T protein:vir:31 318 GAFSTDAEEQRKEFAEETIQPKQHDFGELLYELVHKQGLDAPDWTIEFELAKPDTKLQDVEIASQRVQAMQGLLTVNELR 397 (467) T ss_pred CCcccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhccCCceEEEecchhhccCHHHHHHHHHHHHhCCCcCHHHHH Confidence 3 4567888999999999999999999999984 3578999999999999999999999999999999999 Q ss_pred HHhccCCcCCCCCCCcccccccc--CC------CCCC Q lcl|NC_011801. 358 KLLKNRGVFPELDLDEGTNLLDN--TK------NIND 386 (386) Q Consensus 358 ~~lg~~p~~p~~~~~~~~~~~~~--~~------~~~~ 386 (386) +++|++|++ ++..... ..+.. +. ...+ T Consensus 398 ~~~Gl~pi~-d~~~~~~-~~~~~~~~~~~~~~~~~~~ 432 (467) T protein:vir:31 398 DEFGFEPFP-EEHVYGG-ETLVAEVTGGSGPGGGIGD 432 (467) T ss_pred HHhCCCCCC-cccccCC-cccccccccccCCCCcccC Confidence 999998863 3222111 11100 00 0000 No 75 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=100.00 E-value=9.4e-69 Score=393.52 Aligned_cols=382 Identities=11% Similarity=0.043 Sum_probs=275.2 Q ss_pred CchhhhhccccccCCc-----cchhhhhh---cccccccCcccc--c--------HHHHhccHHHHHHHHHHHHhhccC- Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSG-----SSPVWILN---QGQPVSIKPKAI--T--------SAIALKNSDVYAVISRVSSDIAGC- 61 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~-----~~~~~~~~---~~~~~~~~~~~i--~--------~~~a~~~~~v~~~v~~ia~~ia~~- 61 (386) =++|.++..+.+.... ........ ..........++ . ...+-.++.|++||++||++||++ T Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~a~~~p~~~~~~~~~~~~~~p~~~~~~~~~~~~l~~~~~npiv~~~I~~ia~~vA~~~ 111 (576) T protein:vir:96 32 QANIRNIEEKSKELNKSLYGKQQAYAEPFLEVMDTNPEFRTKRSYMKNSDNLHDVLKQFGNNPILNAIILTRSNQVAMYC 111 (576) T ss_pred hHHHHHhhhhhhhhccccCCccchhhcceeeeeecCCCccccCcchhhhhhhHHHHHHhhcCHHHHHHHHHHHHHHHhhh Confidence 2333333221111100 00001110 001001111111 1 112335788999999999999863 Q ss_pred ------------ceeecc-------hh------HHHHH---hccCccc-CCHHHHHHHHHHHHHHhCCeEEEEeec--CC Q lcl|NC_011801. 62 ------------RFVTNA-------QP------ITDVL---NAPLGNL-MSGFSVWQAMIVQMMLTGNAFAIIDRD--TN 110 (386) Q Consensus 62 ------------p~~~~~-------~~------~~~~l---~~~PN~~-~s~~~f~~~~~~~~~l~G~a~~~~~~~--~~ 110 (386) +++.++ ++ +...| ...|||. +|+.+|++.++.+++++||||++++++ .. T Consensus 112 ~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~dlll~Gna~~~i~~~rd~~ 191 (576) T protein:vir:96 112 QPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKDIDRDSFQSFCRKIVRDTYTYDQVNFEKVFNKKNA 191 (576) T ss_pred hhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEEecCCC Confidence 232221 11 12222 2335554 589999999999999999999998854 45 Q ss_pred CceEEEEEEcCcceEEeecCCCceeEEEEec-cCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHH Q lcl|NC_011801. 111 GYPVRIEPVPNEKVTVALDDYGKDLTYTVHF-DDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQD 189 (386) Q Consensus 111 g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~ 189 (386) |++++||||+|++|++..+.++....+...+ .....+....+++++|+|+++....+...+.+|+||+.++..++.... T Consensus 192 g~~~~L~pl~p~~V~v~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~dii~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~ 271 (576) T protein:vir:96 192 TTMDKFIAVDPSTIFYATDKNGKIIKGGKRFVQVINKKVVASFTSREMAMGIRNPRTELSSSGYGLSEVEIAMKQFIAYN 271 (576) T ss_pred CceEEEEEeCCceeEEEECCCCceeeeeeEEEEecCCceEEEecccceEEEeecCCCCcccCcccccHHHHHHHHHHHHH Confidence 7899999999999999999888765443221 112334556789999998876655555667899999999999999999 Q ss_pred HHHHHHHHHHhccCCCceEEeeCC-CCCCHHHHHHHHHHHHHHhcc-cccCc-ceecCCCceeeeccCChhhHHHHHHHH Q lcl|NC_011801. 190 LSSKLAISTLRHAIKPSIFIKVPN-ATLGKEAKENTRQSFEEQTTG-ENAGR-AVVLDQSADVETTNISPNVTEFLQNVS 266 (386) Q Consensus 190 ~~~~~~~~~~~ng~~~~~~l~~~~-~~~~~~~~~~~k~~~~~~~~~-~~~g~-~~vl~~g~~~~~~~~~~~d~~~~e~~~ 266 (386) ++++++.++|+||++|+++|++++ ..+++++++++++.|++.++| .|+|+ ++|+++|++|+++++++.|+||+|+++ T Consensus 272 ~~~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~G~~nag~~p~vl~~G~~~~~ls~~~~d~qfle~~~ 351 (576) T protein:vir:96 272 NTETFNDRFFSHGGTTRGILQIKSEQQQSQRALENFKREWKSSFSGINGSWQVPVVMADDIKFVNMTPTANDMQFEKWLT 351 (576) T ss_pred HHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEeccCChhhHHHHHHHH Confidence 999999999999999999999875 468999999999999999988 67888 588999999999999999999999999 Q ss_pred HHHHHHHHHhCCCHHHhcCCc-------------CcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhh--hcchhhh Q lcl|NC_011801. 267 FSQDQIAKAFGIPADYLSGKQ-------------DAQSNITMIRAFYQSSLSIYIKPIESELSQKLGTDVK--LDIASAI 331 (386) Q Consensus 267 ~~~~~Ia~~~gvp~~~l~~~~-------------~~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~--fd~~~~l 331 (386) +++++||++|||||.+||..+ ++++.+++.++|+++||+|++..||++|+++|+..++ +.+ .++ T Consensus 352 ~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~~~~~-~f~ 430 (576) T protein:vir:96 352 YLINIISALYGIDPAEIGFPNRGGATGGKGGNTLNEADPGKKQQQSQNKGLQPLLRFIEDLINTHIISEYSDKYVF-QFV 430 (576) T ss_pred HhHHHHHHHhCCCHHHccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhccCceEE-Eec Confidence 999999999999999998754 2356789999999999999999999999999975432 222 257 Q ss_pred ccCHHHHHHHHHHH--HhCCCcCHHHHHHHhccCCcCCCCCCCccccccccCCCC-------CC Q lcl|NC_011801. 332 DSDNSELINNVQKL--ASAGVLAPIQAQKLLKNRGVFPELDLDEGTNLLDNTKNI-------ND 386 (386) Q Consensus 332 ~~d~~~~~~~~~~~--~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~~~~~~~~~-------~~ 386 (386) +.|.+++++.++.+ +.+||||+||+|+++|++|+ |++|.. -.++..+..+ .+ T Consensus 431 r~d~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~pi-egGD~~--~~~~~~~~~~~~~~~~~~e 491 (576) T protein:vir:96 431 GGDTKSELDKIKILQEEVKTYKTVNEARKEKGLKPI-EGGDVL--LDGSFIQSMSLNTQKEQYE 491 (576) T ss_pred cCCHHHHHHHHHHHHHHhcCccCHHHHHHHhCCCCC-CCccee--ccccccccccccccCCCCC Confidence 88999888877654 56799999999999998776 343321 1111111110 00 No 76 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=100.00 E-value=3.9e-69 Score=395.62 Aligned_cols=369 Identities=15% Similarity=0.103 Sum_probs=270.6 Q ss_pred Cchhh--hhccccccCCccchhhhhhcccccccCccccc----HHHHhccHHHHHHHHHHHHhhccCceeecc--hhHHH Q lcl|NC_011801. 1 MAFLS--NLFKRQKMLSGSSPVWILNQGQPVSIKPKAIT----SAIALKNSDVYAVISRVSSDIAGCRFVTNA--QPITD 72 (386) Q Consensus 1 Mg~~~--~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~----~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~--~~~~~ 72 (386) |++-+ |....++. ..+.......+ ..+...+++ .+.+..+++|++||++||++||++|++++. +.+.. T Consensus 6 ~~~~~~~~~~~~~~~-~~~~~~~~~~~---~~~~~pp~~~~~La~~~~~n~~v~scI~~ia~~ia~~~~~i~~~~~~~~~ 81 (540) T protein:vir:41 6 LSIKSLEKYRAIKGD-TDSQALKEDRF---EEYVEPKVHPLVLLSLLQVNPYHASACSIKANDILRTGYLIDGDDGGVEE 81 (540) T ss_pred cChhhccchhhhhcc-ccccccccCCC---CccccCCCCHHHHHHHHHhcHHHHHHHHHHHHHHhcCCceEecCccchhh Confidence 55543 11101111 11111111111 122222333 234456889999999999999999998764 33444 Q ss_pred HHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCce-------eEEEEeccC-- Q lcl|NC_011801. 73 VLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKD-------LTYTVHFDD-- 143 (386) Q Consensus 73 ~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~-------~~~~~~~~~-- 143 (386) + .||+.||+++||+.++.+++++||||++++|+..|.+.+||||+|.+|++..+..... ..|...+.. T Consensus 82 ~---lpN~~~t~~~f~~~~v~dlll~Gnayv~i~r~~~G~~~~L~~i~~~~V~v~~~~~~~~~~~d~~~~~~~~~~~~~~ 158 (540) T protein:vir:41 82 L---LRACRPSFEFILLQALEDLQVFNYCTLEVVRDDQGEPVRLDYIPAHTVRVHRDGSRYMQTWDGIHVTYFKDYRYEG 158 (540) T ss_pred h---ccCCCCCHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCcceEEeEcCceeEeeecCceeeeeecccccc Confidence 3 3999999999999999999999999999999999999999999999999987654322 111111111 Q ss_pred ----cccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCH- Q lcl|NC_011801. 144 ----SKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGK- 218 (386) Q Consensus 144 ----~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~- 218 (386) ........+++++|||+|.. ++.++++|+||+.++..++....++++++.++|+||++|+++|++++...++ T Consensus 159 ~~~~~~g~~~~~~~~~eViHir~~---~~~~~~~G~Spi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~ 235 (540) T protein:vir:41 159 EVNPDNGEDQDGVGANEIIFIHLP---SPICSYYGVPRYLSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEM 235 (540) T ss_pred eeeccccccceeecccceEEecCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchh Confidence 11223457899999999854 4567899999999999999999999999999999999999999988643322 Q ss_pred --------HHHHHHHHHHHHHhcc--cccCcceecC------CCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHH Q lcl|NC_011801. 219 --------EAKENTRQSFEEQTTG--ENAGRAVVLD------QSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADY 282 (386) Q Consensus 219 --------~~~~~~k~~~~~~~~~--~~~g~~~vl~------~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~ 282 (386) +..+++++.|++.+.+ .|+|+++|++ +|++|++++++++|+||+|++++++++||++|||||.+ T Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~~ 315 (540) T protein:vir:41 236 ELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIPGGDTVEVTFTPLNTSQKELSFREYAAEKKHDIAAAHMIDPYR 315 (540) T ss_pred ccchHHHHHHHHHHHHHHHHHhccccccccceEEEecCCCcccceeEEecccchhHHHHHHHHHHHHHHHHHHhCCCHHH Confidence 2235677777776665 5788999984 79999999999999999999999999999999999999 Q ss_pred hcCCcC----cccHHHHHHHHHHHHHHHHHHHHHHHHHHhhh------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcC Q lcl|NC_011801. 283 LSGKQD----AQSNITMIRAFYQSSLSIYIKPIESELSQKLG------TDVKLDIASAIDSDNSELINNVQKLASAGVLA 352 (386) Q Consensus 283 l~~~~~----~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t 352 (386) ||..+. +++.+++.+.|+++||.|++++||++||++|+ .+++||.+.+++.|.+++ +++++++|+|| T Consensus 316 lG~~~~~~~n~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~L~~~~~~~~~i~f~~~~ll~~D~~~~---~~~lv~~G~lT 392 (540) T protein:vir:41 316 LGITDVGPLGGNFAEVARRTYYESVVRPQQEIVSSVLTDFIQLKLDPGARFVFNEEILMESEFVHN---YALLVQCGVLT 392 (540) T ss_pred cCcccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecchhhcchHHHHH---HHHHHhCCCCC Confidence 986542 35678899999999999999999999999884 357899999999876555 55689999999 Q ss_pred HHHHHHHhccCCcCCCCCCCcccccccc-------CCCC-----CC Q lcl|NC_011801. 353 PIQAQKLLKNRGVFPELDLDEGTNLLDN-------TKNI-----ND 386 (386) Q Consensus 353 ~nE~R~~lg~~p~~p~~~~~~~~~~~~~-------~~~~-----~~ 386 (386) +||+|+.+ .|++|+++.. -.+... +.++ .+ T Consensus 393 ~NE~Re~L--~g~e~gdd~~--l~p~n~~~~~~~~~~~~~~~~~~~ 434 (540) T protein:vir:41 393 PSEVREKL--FGLDGGPDMF--MVPSSIGKSAMKRQKRNYEKNQIN 434 (540) T ss_pred HHHHHHHh--CcCcCCCccc--ccccccccccccccccccCCCCcc Confidence 99999864 2333332211 111100 0000 00 No 77 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=100.00 E-value=3e-69 Score=396.23 Aligned_cols=360 Identities=13% Similarity=0.097 Sum_probs=257.8 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeecc------hhHHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTNA------QPITDVL 74 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~------~~~~~~l 74 (386) |||++++++.............. ..+.....++.+.|+++++|++||++||+++|++|+++++ |+++++| T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~l~~~~v~~~v~~Ia~~ia~~p~~~~~~~~~~~~~~~~lL 76 (395) T protein:vir:40 1 MGFKSWVSGFFNEEQRTLNLTDT----VWCSIPSEKLKELSIKKWAIDSCANKIANTLSCAEVLTYEKGEEVRKKNWYMF 76 (395) T ss_pred CchHHHHHhhhcccccccccccc----hhhccccccchhhhhhhHHHHHHHHHHHHHHhhCceeeccCCccccchHHHHH Confidence 99999987665433222221111 1122344577788999999999999999999999998863 6899999 Q ss_pred hccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEEcc Q lcl|NC_011801. 75 NAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYDS 154 (386) Q Consensus 75 ~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (386) +.+||++||+++||+.++.+++++||||+++.++.. ++.+ + +.+.........++.+.... .+....+++ T Consensus 77 ~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~~~~~------~~~~-~-~~~~~~~~~~~~~~~v~~~~--~~~~~~~~~ 146 (395) T protein:vir:40 77 NVEANQNQNATEFWKKAIYKLVYDNEALIFMQDEYI------YVAD-S-FTKNDKSLYENTYTEVTLKD--LTLKKEFKE 146 (395) T ss_pred HhcCCCCCCHHHHHHHHHHHHhhcCceEEEEecCce------eecC-C-ccccccccccceeeeeeecC--ceeeeeecc Confidence 999999999999999999999999999999987642 2222 2 22211111111122222221 223456899 Q ss_pred cceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcc Q lcl|NC_011801. 155 SEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTG 234 (386) Q Consensus 155 ~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~ 234 (386) ++|+|+++..... ...+.+.+..+...+... .....+.|+.++.++++.+ ..+++++.+++++.|++.+.+ T Consensus 147 ~evih~r~~~~~~---~~~~~~l~~~~~~~~~~~-----~~~~~~~~~~~~~l~~~~~-~~~~~~~~~~~~~~~~~~~~~ 217 (395) T protein:vir:40 147 SEVLHLTLNNESI---KSIIDGFYLLYGDLLTAA-----VNKYKKLNSRKIIVKLKAM-FGQTPEAEEKLRLMLSERMKK 217 (395) T ss_pred ccEEEeecCCCCc---cccchhHHHHHHHHHHHH-----HHHHHhcCCCCceEEEecc-cCCCHHHHHHHHHHHHHHHHH Confidence 9999999653211 112222333333222211 1222334555554444443 468999999999999998876 Q ss_pred --cccCcceecCCCceeeeccCChhhHHHHHHHHHHH---HHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 235 --ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQ---DQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYIK 309 (386) Q Consensus 235 --~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~---~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~~ 309 (386) .++++++++++|++|+++++++.|+|++|.+++.. ++||++|||||.+|+ +++++.+++..+|+++||.|+++ T Consensus 218 ~~~~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~Ia~~fgVPp~~l~--~~~sn~e~~~~~f~~~~L~P~~~ 295 (395) T protein:vir:40 218 FLAEGDSALPVEDGMEIDELAGDSKIAESRDIKKMIDDVFEMVANSFNIPLGLAK--GDTVGLSEQVNSFLMFSINPIAE 295 (395) T ss_pred hhccCCceeecCCCceEEeccCChhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhc--CCCcCHHHHHHHHHHHHHHHHHH Confidence 56788999999999999999999999999998874 799999999999997 44677899999999999999999 Q ss_pred HHHHHHHHhhh--------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCcccc----- Q lcl|NC_011801. 310 PIESELSQKLG--------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTN----- 376 (386) Q Consensus 310 ~ie~~l~~~l~--------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~----- 376 (386) +||++|+++|+ .+++||++.+++.|.+++++++.+++++||||+||+|+++|++|++ +++++..-- T Consensus 296 ~ie~~l~~kLl~~~~~~~g~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~-~~~gD~~~~~~n~~ 374 (395) T protein:vir:40 296 MFTDEGNRKFYGRDSVLERTYMKLDTTRIKVQDIQEIASSMDVLFHIGVNTIDDNLRMIGREPVM-SPETQERFVTKNYA 374 (395) T ss_pred HHHHHHHHhcCChhhhcCCceEEEechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC-CCCCceeeeccccc Confidence 99999999984 4689999999999999999999999999999999999999988763 333332111 Q ss_pred cc----ccCCCCCC Q lcl|NC_011801. 377 LL----DNTKNIND 386 (386) Q Consensus 377 ~~----~~~~~~~~ 386 (386) ++ ..++++.+ T Consensus 375 ~~~~~~~~~kgge~ 388 (395) T protein:vir:40 375 PLGENEEDLKGGDI 388 (395) T ss_pred cccccccccCCCCC Confidence 11 11111111 No 78 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=100.00 E-value=9.7e-68 Score=387.98 Aligned_cols=378 Identities=12% Similarity=0.081 Sum_probs=276.0 Q ss_pred CchhhhhccccccCCccchhhhh-hccc----ccccCccccc----HHHHhccHHHHHHHHHHHHhhcc----------- Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWIL-NQGQ----PVSIKPKAIT----SAIALKNSDVYAVISRVSSDIAG----------- 60 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~-~~~~----~~~~~~~~i~----~~~a~~~~~v~~~v~~ia~~ia~----------- 60 (386) +.++.+-..++..+ ...|+... .... .........+ .+.+-.++.|.+||+.+++.+|. T Consensus 43 ~~~~~~~~~~~~~a-~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~ 121 (563) T protein:vir:99 43 YQDLTKSLYGQQQA-YAEPFIEMMDTNPEFRDKRSYMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKG 121 (563) T ss_pred HHHHHhhhccCCCc-chhhhHhhhcccccccccccCCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhccc Confidence 45554433233322 22333211 1111 1111111112 22223467788999999988884 Q ss_pred --Cceeecc-------------hhHHHHHh---ccCccc-CCHHHHHHHHHHHHHHhCCeEEEEe--ecCCCceEEEEEE Q lcl|NC_011801. 61 --CRFVTNA-------------QPITDVLN---APLGNL-MSGFSVWQAMIVQMMLTGNAFAIID--RDTNGYPVRIEPV 119 (386) Q Consensus 61 --~p~~~~~-------------~~~~~~l~---~~PN~~-~s~~~f~~~~~~~~~l~G~a~~~~~--~~~~g~~~~l~~l 119 (386) +|+++++ |++...|. ..|||. +|+.+|++.++.+++++||+|++++ |+..|++.+|||| T Consensus 122 ~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl 201 (563) T protein:vir:99 122 LGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAV 201 (563) T ss_pred ccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEe Confidence 4555442 23333332 123333 5889999999999999999999875 7788999999999 Q ss_pred cCcceEEeecCCCceeE----EEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 120 PNEKVTVALDDYGKDLT----YTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLA 195 (386) Q Consensus 120 ~~~~v~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~ 195 (386) +|++|++..+.++.... |.+... .+....+++++++|+++....+...+.+|+||+.++..++....++++++ T Consensus 202 ~p~~V~v~~~~~g~~~~~~~~y~~~~~---g~~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~ 278 (563) T protein:vir:99 202 DPSTIFYATDKKGKIIKGGKRFVQVVD---KRVVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFN 278 (563) T ss_pred CCceeEEEECCCCceeccceeEEEEeC---CceeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHH Confidence 99999999888876543 222222 23345688999887766555555668899999999999999999999999 Q ss_pred HHHHhccCCCceEEeeCCC-CCCHHHHHHHHHHHHHHhcc-cccCcc-eecCCCceeeeccCChhhHHHHHHHHHHHHHH Q lcl|NC_011801. 196 ISTLRHAIKPSIFIKVPNA-TLGKEAKENTRQSFEEQTTG-ENAGRA-VVLDQSADVETTNISPNVTEFLQNVSFSQDQI 272 (386) Q Consensus 196 ~~~~~ng~~~~~~l~~~~~-~~~~~~~~~~k~~~~~~~~~-~~~g~~-~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~I 272 (386) .++|+||++|+++|++++. .+++++++++++.|++.++| .|+|++ +|+++|++|+++++++.|+||+|++++++++| T Consensus 279 ~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~I 358 (563) T protein:vir:99 279 DRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINII 358 (563) T ss_pred HHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHHHHHHHHHHHHHH Confidence 9999999999999998753 58999999999999999988 688885 78999999999999999999999999999999 Q ss_pred HHHhCCCHHHhcCCcC-------------cccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhh--hcchhhhccCHHH Q lcl|NC_011801. 273 AKAFGIPADYLSGKQD-------------AQSNITMIRAFYQSSLSIYIKPIESELSQKLGTDVK--LDIASAIDSDNSE 337 (386) Q Consensus 273 a~~~gvp~~~l~~~~~-------------~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~--fd~~~~l~~d~~~ 337 (386) |++|||||.+||..+. +++.+++.+.|+++||.|++..||++|+++|+..++ +.+ .+++.|.++ T Consensus 359 a~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~~~~~~-~f~r~D~~~ 437 (563) T protein:vir:99 359 SALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISEYGDKYTF-QFVGGDTKS 437 (563) T ss_pred HHHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhcccccEE-EeccCCHHH Confidence 9999999999987542 245678889999999999999999999999975432 222 257889999 Q ss_pred HHHHHH--HHHhCCCcCHHHHHHHhccCCcCCCCCCCccccccc--------cCCCCCC Q lcl|NC_011801. 338 LINNVQ--KLASAGVLAPIQAQKLLKNRGVFPELDLDEGTNLLD--------NTKNIND 386 (386) Q Consensus 338 ~~~~~~--~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~~~~--------~~~~~~~ 386 (386) +++.+. +++++||||+||+|+++|++|+ |++|... .++. +..+.++ T Consensus 438 ~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi-~gGD~~~--~~~~~~~~~~~~~~~~~~~ 493 (563) T protein:vir:99 438 ATDKLNILKLETQIFKTVNEAREEQGKKPI-EGGDIIL--DASFLQGTAQLQQDKQYND 493 (563) T ss_pred HHHHHHHHHHhcCCccCHHHHHHHhCCCCC-CCcceee--cccccccccccccccCCCc Confidence 988765 4688999999999999998876 3443221 1110 0000000 No 79 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=100.00 E-value=9.7e-68 Score=387.98 Aligned_cols=378 Identities=12% Similarity=0.081 Sum_probs=276.0 Q ss_pred CchhhhhccccccCCccchhhhh-hccc----ccccCccccc----HHHHhccHHHHHHHHHHHHhhcc----------- Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWIL-NQGQ----PVSIKPKAIT----SAIALKNSDVYAVISRVSSDIAG----------- 60 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~-~~~~----~~~~~~~~i~----~~~a~~~~~v~~~v~~ia~~ia~----------- 60 (386) +.++.+-..++..+ ...|+... .... .........+ .+.+-.++.|.+||+.+++.+|. T Consensus 43 ~~~~~~~~~~~~~a-~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~ 121 (563) T protein:vir:95 43 YQDLTKSLYGQQQA-YAEPFIEMMDTNPEFRDKRSYMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKG 121 (563) T ss_pred HHHHHhhhccCCCc-chhhhHhhhcccccccccccCCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhccc Confidence 45554433233322 22333211 1111 1111111112 22223467788999999988884 Q ss_pred --Cceeecc-------------hhHHHHHh---ccCccc-CCHHHHHHHHHHHHHHhCCeEEEEe--ecCCCceEEEEEE Q lcl|NC_011801. 61 --CRFVTNA-------------QPITDVLN---APLGNL-MSGFSVWQAMIVQMMLTGNAFAIID--RDTNGYPVRIEPV 119 (386) Q Consensus 61 --~p~~~~~-------------~~~~~~l~---~~PN~~-~s~~~f~~~~~~~~~l~G~a~~~~~--~~~~g~~~~l~~l 119 (386) +|+++++ |++...|. ..|||. +|+.+|++.++.+++++||+|++++ |+..|++.+|||| T Consensus 122 ~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl 201 (563) T protein:vir:95 122 LGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAV 201 (563) T ss_pred ccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEe Confidence 4555442 23333332 123333 5889999999999999999999875 7788999999999 Q ss_pred cCcceEEeecCCCceeE----EEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 120 PNEKVTVALDDYGKDLT----YTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLA 195 (386) Q Consensus 120 ~~~~v~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~ 195 (386) +|++|++..+.++.... |.+... .+....+++++++|+++....+...+.+|+||+.++..++....++++++ T Consensus 202 ~p~~V~v~~~~~g~~~~~~~~y~~~~~---g~~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~ 278 (563) T protein:vir:95 202 DPSTIFYATDKKGKIIKGGKRFVQVVD---KRVVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFN 278 (563) T ss_pred CCceeEEEECCCCceeccceeEEEEeC---CceeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHH Confidence 99999999888876543 222222 23345688999887766555555668899999999999999999999999 Q ss_pred HHHHhccCCCceEEeeCCC-CCCHHHHHHHHHHHHHHhcc-cccCcc-eecCCCceeeeccCChhhHHHHHHHHHHHHHH Q lcl|NC_011801. 196 ISTLRHAIKPSIFIKVPNA-TLGKEAKENTRQSFEEQTTG-ENAGRA-VVLDQSADVETTNISPNVTEFLQNVSFSQDQI 272 (386) Q Consensus 196 ~~~~~ng~~~~~~l~~~~~-~~~~~~~~~~k~~~~~~~~~-~~~g~~-~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~I 272 (386) .++|+||++|+++|++++. .+++++++++++.|++.++| .|+|++ +|+++|++|+++++++.|+||+|++++++++| T Consensus 279 ~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~I 358 (563) T protein:vir:95 279 DRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINII 358 (563) T ss_pred HHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHHHHHHHHHHHHHH Confidence 9999999999999998753 58999999999999999988 688885 78999999999999999999999999999999 Q ss_pred HHHhCCCHHHhcCCcC-------------cccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhh--hcchhhhccCHHH Q lcl|NC_011801. 273 AKAFGIPADYLSGKQD-------------AQSNITMIRAFYQSSLSIYIKPIESELSQKLGTDVK--LDIASAIDSDNSE 337 (386) Q Consensus 273 a~~~gvp~~~l~~~~~-------------~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~--fd~~~~l~~d~~~ 337 (386) |++|||||.+||..+. +++.+++.+.|+++||.|++..||++|+++|+..++ +.+ .+++.|.++ T Consensus 359 a~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~~~~~~-~f~r~D~~~ 437 (563) T protein:vir:95 359 SALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISEYGDKYTF-QFVGGDTKS 437 (563) T ss_pred HHHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhcccccEE-EeccCCHHH Confidence 9999999999987542 245678889999999999999999999999975432 222 257889999 Q ss_pred HHHHHH--HHHhCCCcCHHHHHHHhccCCcCCCCCCCccccccc--------cCCCCCC Q lcl|NC_011801. 338 LINNVQ--KLASAGVLAPIQAQKLLKNRGVFPELDLDEGTNLLD--------NTKNIND 386 (386) Q Consensus 338 ~~~~~~--~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~~~~--------~~~~~~~ 386 (386) +++.+. +++++||||+||+|+++|++|+ |++|... .++. +..+.++ T Consensus 438 ~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi-~gGD~~~--~~~~~~~~~~~~~~~~~~~ 493 (563) T protein:vir:95 438 ATDKLNILKLETQIFKTVNEAREEQGKKPI-EGGDIIL--DASFLQGTAQLQQDKQYND 493 (563) T ss_pred HHHHHHHHHHhcCCccCHHHHHHHhCCCCC-CCcceee--cccccccccccccccCCCc Confidence 988765 4688999999999999998876 3443221 1110 0000000 No 80 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=100.00 E-value=1.4e-68 Score=392.56 Aligned_cols=326 Identities=13% Similarity=0.149 Sum_probs=247.2 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec-------------- Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN-------------- 66 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~-------------- 66 (386) ||||.++............ .....+ .+...+++.++|++||++||++||++|++++ T Consensus 1 Mg~f~~~~~f~~~~~~~~~------~~~~~~----~~~~~~~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~ 70 (378) T protein:vir:93 1 MNLFGKVVSFSRGKLNNDT------QRVTAW----QNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLIS 70 (378) T ss_pred CccchhhhhhhccccCCCc------ceeeec----ccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccccccccccc Confidence 9999987422111111110 111111 1233466888999999999999999999653 Q ss_pred --chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecC-CCceEEEEEEcCcceEEeecCCCceeEEEEeccC Q lcl|NC_011801. 67 --AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDT-NGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDD 143 (386) Q Consensus 67 --~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~-~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~ 143 (386) +|+++++|+.+||++||+++||+.++.+++++||||++++++. .|++..++|. T Consensus 71 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~~g~~~~l~~~------------------------ 126 (378) T protein:vir:93 71 MAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDLLFA------------------------ 126 (378) T ss_pred cccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEEEec------------------------ Confidence 3789999999999999999999999999999999999987764 3555554431 Q ss_pred cccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHH Q lcl|NC_011801. 144 SKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKEN 223 (386) Q Consensus 144 ~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~ 223 (386) .....+++++|+|+|. +.++..|.|++..+...+. +++++| .++|++++++ .+++++.++ T Consensus 127 ---~~~~~~~~~diih~r~-----~~~~~~~~s~l~~~~~~i~----------~~~~~~-~~~g~l~~~~-~l~~~~~~~ 186 (378) T protein:vir:93 127 ---DDKKEYKTEELVRLTS-----PFYINEDTSILDNALASIQ----------TKLEQG-KLRGLLKINA-FLDIDNTQE 186 (378) T ss_pred ---CCeeEeccceeEEecC-----ccccchhhHHHHHHHHHHH----------HHHhcC-cccceeeeCC-cCCHHHHHH Confidence 1124588999999983 3456679999887776543 345555 5889999875 567776666 Q ss_pred HHHHHHH----HhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHH Q lcl|NC_011801. 224 TRQSFEE----QTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAF 299 (386) Q Consensus 224 ~k~~~~~----~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~ 299 (386) ++++|++ ..+++++|+++++++|++|++++++++|+|+ +.+++++++||++|||||.+|+ +++++++..+| T Consensus 187 ~~~~~~~~~~~~~~~~~~~~~~~l~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~----g~~~e~~~~~f 261 (378) T protein:vir:93 187 YREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILL----GTATQEQQIYF 261 (378) T ss_pred HHHHHHHHHHHhhcccccccceEcCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhc----CCcHHHHHHHH Confidence 6655554 4566788899999999999999999999996 6778999999999999999995 45678899999 Q ss_pred HHHHHHHHHHHHHHHHHHhhhh--------------hhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCc Q lcl|NC_011801. 300 YQSSLSIYIKPIESELSQKLGT--------------DVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGV 365 (386) Q Consensus 300 ~~~~l~P~~~~ie~~l~~~l~~--------------~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~ 365 (386) +++||.|++.+||++|+++|+. .++||++.+++.|.+++++++.+++++|+||+||+|+++|++|+ T Consensus 262 ~~~tl~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~ 341 (378) T protein:vir:93 262 YNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPI 341 (378) T ss_pred HHHHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 9999999999999999999863 26799999999999999999999999999999999999998876 Q ss_pred CCCCCCCc-ccc--cc------ccCCCC-----CC Q lcl|NC_011801. 366 FPELDLDE-GTN--LL------DNTKNI-----ND 386 (386) Q Consensus 366 ~p~~~~~~-~~~--~~------~~~~~~-----~~ 386 (386) |++|... ..+ ++ ++.+++ ++ T Consensus 342 -~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~ 375 (378) T protein:vir:93 342 -EGGDVYIANLNAVAVKNLSDLQGSRKDVTSTDET 375 (378) T ss_pred -CCCCeeeeccccccccchhhhcCccCCCCCCCCC Confidence 3433211 001 00 111111 11 No 81 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=100.00 E-value=1.5e-68 Score=392.43 Aligned_cols=324 Identities=14% Similarity=0.160 Sum_probs=247.7 Q ss_pred Cchhhhhc--cccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec------------ Q lcl|NC_011801. 1 MAFLSNLF--KRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN------------ 66 (386) Q Consensus 1 Mg~~~~l~--~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~------------ 66 (386) ||||+++. .+..... .. .....+ .+...+++.++|++||++||++||++|++++ T Consensus 1 Mg~f~~~~~~~~~~~~~--~~------~~~~~~----~~~~~~~~~~~v~~~v~~IA~~iA~lp~~~~~~~~~~~~~~~~ 68 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNN--DT------QRVTAW----QNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTL 68 (378) T ss_pred CCccccchhcccccccC--Cc------ceeeee----ccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccCcccccc Confidence 99999884 3222111 10 011111 2233466788999999999999999998753 Q ss_pred ----chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeec-CCCceEEEEEEcCcceEEeecCCCceeEEEEec Q lcl|NC_011801. 67 ----AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRD-TNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHF 141 (386) Q Consensus 67 ----~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~-~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~ 141 (386) +|+++++|+.+||++||+++||+.++.+++++||||++++++ ..|++..++|.. T Consensus 69 ~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~g~~~~l~p~~--------------------- 127 (378) T protein:vir:94 69 ISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNTGELLDLLFAD--------------------- 127 (378) T ss_pred cccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCCceEEEEEecC--------------------- Confidence 488999999999999999999999999999999999987654 456666655411 Q ss_pred cCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHH Q lcl|NC_011801. 142 DDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAK 221 (386) Q Consensus 142 ~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~ 221 (386) ....+++++|||++. +.++..|+||+..+.+++.. .+++| .++|++++++ .+++++. T Consensus 128 ------~~~~~~~~diiH~~~-----~~~~~~g~s~l~~~~~~i~~----------~~~~~-~~~gil~~~~-~l~~~~~ 184 (378) T protein:vir:94 128 ------DKKEYKPEELVRLTS-----PFYINEDTSILDNALASIQT----------KLEQG-KLRGLLKINA-FLDIDNT 184 (378) T ss_pred ------CeeEeeeeeeEEecC-----cCCccchhHHHHHHHHHHHH----------HHhcc-cccceeeeCC-cCCHHHH Confidence 123478899999984 24566899999988876642 34444 5789999875 5677766 Q ss_pred HHHHHHHHH----HhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHH Q lcl|NC_011801. 222 ENTRQSFEE----QTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIR 297 (386) Q Consensus 222 ~~~k~~~~~----~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~ 297 (386) ++++++|.+ ..+++++|+++++++|++|+++++++.++|+ +.+++++++||++|||||.+|+ +++++++.. T Consensus 185 ~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVP~~~l~----~~~se~~~~ 259 (378) T protein:vir:94 185 QEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILL----GTASQEQQI 259 (378) T ss_pred HHHHHHHHHHHHHhhcccccccceecCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhc----CChHHHHHH Confidence 666655554 4566788899999999999999999999996 6778999999999999999995 456788999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhh--------------hhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccC Q lcl|NC_011801. 298 AFYQSSLSIYIKPIESELSQKLGTD--------------VKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNR 363 (386) Q Consensus 298 ~~~~~~l~P~~~~ie~~l~~~l~~~--------------~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~ 363 (386) +|+++||.|++.+||++|+++|++. ++||++.+++.|.+++++++++++++||||+||+|+++|++ T Consensus 260 ~f~~~tL~P~~~~ie~~l~~~Ll~~~er~~g~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~ 339 (378) T protein:vir:94 260 YFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQ 339 (378) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC Confidence 9999999999999999999999642 67999999999999999999999999999999999999987 Q ss_pred CcCCCCCCCc-ccc--cc------ccCCCCC---C Q lcl|NC_011801. 364 GVFPELDLDE-GTN--LL------DNTKNIN---D 386 (386) Q Consensus 364 p~~p~~~~~~-~~~--~~------~~~~~~~---~ 386 (386) |++ ++|... ..+ ++ ++.+++. | T Consensus 340 p~~-gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~ 373 (378) T protein:vir:94 340 PIE-GGDVYIANLNAVAVKNLSDLQGSRKDVTSTD 373 (378) T ss_pred CCC-CCCeeeecccccccccchhhcCCcCCCCCCC Confidence 763 433211 000 00 1111110 0 No 82 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=100.00 E-value=3.7e-68 Score=390.24 Aligned_cols=326 Identities=13% Similarity=0.153 Sum_probs=245.9 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec-------------- Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN-------------- 66 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~-------------- 66 (386) ||||.++........... ......+. +...+++.++|++||++||++||++|++++ T Consensus 1 Mg~f~~~~~~~~~~~~~~------~~~~~~~~----~~~~~~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~~~~~~~~~ 70 (378) T protein:vir:16 1 MNLFGKVVSFSRGKLNND------TQRVTAWQ----NEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLIS 70 (378) T ss_pred CccchhhhhhhcccccCC------cceeeecc----cchhhHHHHHHHHHHHHHHhhhhhCceeEEEEcccccccccccc Confidence 999998743221111111 01111121 233467888999999999999999999653 Q ss_pred --chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCC-CceEEEEEEcCcceEEeecCCCceeEEEEeccC Q lcl|NC_011801. 67 --AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTN-GYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDD 143 (386) Q Consensus 67 --~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~-g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~ 143 (386) +|+++++|+.+||++||+++||+.++.+++++||||++++++.. |++..++|. . T Consensus 71 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~g~~~~l~~~---~-------------------- 127 (378) T protein:vir:16 71 MAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDLLFA---D-------------------- 127 (378) T ss_pred cccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEEEec---C-------------------- Confidence 48899999999999999999999999999999999999988653 555444432 1 Q ss_pred cccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHH Q lcl|NC_011801. 144 SKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKEN 223 (386) Q Consensus 144 ~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~ 223 (386) ....+++++|||+|. +.++..|.|++..+...+. ..+++| .++++++.++ .+++++.++ T Consensus 128 ----~~~~~~~~diih~r~-----~~~~~~~~s~l~~~~~~i~----------~~~~~~-~~~g~l~~~~-~l~~~~~~~ 186 (378) T protein:vir:16 128 ----DKKEYKPEELVRLTS-----PFYINEDTSILDNALASIQ----------TKLEQG-KLRGLLKINA-FLDIDNTQE 186 (378) T ss_pred ----CeeEecccceEEecC-----ccCccchhHHHHHHHHHHH----------HHHhcC-ccceeeEeCC-cCCHHHHHH Confidence 123478899999984 3456678999888776553 334444 5789999875 567766555 Q ss_pred HHHHHHH----HhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHH Q lcl|NC_011801. 224 TRQSFEE----QTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAF 299 (386) Q Consensus 224 ~k~~~~~----~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~ 299 (386) .+++|++ ..+++++|+++++++|++|++++++++|+|+ +.+++++++||++|||||.+|+ +++++++..+| T Consensus 187 ~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~----g~~~e~~~~~f 261 (378) T protein:vir:16 187 YREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILL----GTASQEQQIYF 261 (378) T ss_pred HHHHHHHHHHHhhcccccccceEcCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhc----CCchHHHHHHH Confidence 5555554 4566788899999999999999999999997 5568999999999999999995 45678899999 Q ss_pred HHHHHHHHHHHHHHHHHHhhhh--------------hhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCc Q lcl|NC_011801. 300 YQSSLSIYIKPIESELSQKLGT--------------DVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGV 365 (386) Q Consensus 300 ~~~~l~P~~~~ie~~l~~~l~~--------------~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~ 365 (386) +++||.|++++||++|+++|+. .++||++.+++.|.+++++++.+++++|+||+||+|+++|++|+ T Consensus 262 ~~~tl~P~~~~ie~~l~~kLl~~~e~~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~ 341 (378) T protein:vir:16 262 YNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPI 341 (378) T ss_pred HHHHHHHHHHHHHHHHHhhcCChhhhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 9999999999999999999863 26799999999999999999999999999999999999998876 Q ss_pred CCCCCCCc-ccc--cc------cc-----CCCCCC Q lcl|NC_011801. 366 FPELDLDE-GTN--LL------DN-----TKNIND 386 (386) Q Consensus 366 ~p~~~~~~-~~~--~~------~~-----~~~~~~ 386 (386) |++|... ..+ ++ ++ ++.+++ T Consensus 342 -~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~ 375 (378) T protein:vir:16 342 -EGGDVYIANLNAVAVKNLSDLQGSRKDVTSTDET 375 (378) T ss_pred -CCCCeEeeccccccccchhhhcCccCCCCCCCCC Confidence 3333211 000 00 00 001111 No 83 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=100.00 E-value=4.3e-68 Score=389.90 Aligned_cols=361 Identities=11% Similarity=0.070 Sum_probs=264.1 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec--------chhHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN--------AQPITD 72 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~--------~~~~~~ 72 (386) ||||++++++++....... .......++.+.++++++|++||++||++||++|++++ +|++++ T Consensus 1 MGlf~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~v~~~I~~ia~~iA~lp~~~~~~~~~~~~~~~~~~ 71 (395) T protein:vir:98 1 MGILDFFSFKKSGTLSDDD---------SGSTTSEKLTNVVLKEDALYKCVNYLARIISKSTFRLKTPEKLTENQKDWLY 71 (395) T ss_pred CcchhhhcCCCcccccccc---------cchhhhhhcchhhhhhHHHHHHHHHHHHHHhhCceeEEecCCcccccchHHH Confidence 9999998766533221110 01112245777889999999999999999999999875 478999 Q ss_pred HHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEE Q lcl|NC_011801. 73 VLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLY 152 (386) Q Consensus 73 ~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (386) +|+.+||++||+++||+.++.+++++||||++++++..+. +++..+...... ....+.+.... ......+ T Consensus 72 lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~~~~~------~~~~~~~~~~~~--~~~~~~~~~~~--~~~~~~~ 141 (395) T protein:vir:98 72 WINTKANPNQSASQFWVEVIQKLLVDGETLIFVIPGKGIY------VADSFTQDKKIS--GSQFKVSRVQG--QTYEKTF 141 (395) T ss_pred HHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeCCcee------cCCccccccccc--CcccceeeecC--ceeeeEe Confidence 9999999999999999999999999999999999875432 222222221111 11122222221 1234678 Q ss_pred cccceeeeccccccCcccccccccHHHHHHHHHHHHH--HHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHH Q lcl|NC_011801. 153 DSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQD--LSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEE 230 (386) Q Consensus 153 ~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~--~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~ 230 (386) ++++|+|+|+... + ...++.+++......+.... .......+++.++..+.+.+..+....++++.++.++.+++ T Consensus 142 ~~~evih~k~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 218 (395) T protein:vir:98 142 TFDQVIYLKNDNS-D--LMSKVESLWEEYGELLGHVINNQKIANQIRFTMIPPKDKVRERAQENSDGGRQSKSDKDFFKR 218 (395) T ss_pred cCccEEEecCCCC-C--ccccccchhhhHHHHHHHHHHHHHHHHHHHHhhccccccccccccccCCcHHHHHHHHHHHHH Confidence 9999999996542 1 22234444454444444333 33445567888888888888777666678888888888988 Q ss_pred Hhccc--ccCcceecCCCceeeeccC------ChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHH Q lcl|NC_011801. 231 QTTGE--NAGRAVVLDQSADVETTNI------SPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQS 302 (386) Q Consensus 231 ~~~~~--~~g~~~vl~~g~~~~~~~~------~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~ 302 (386) .+++. +++.++++++|++|+++++ ++.++|+.+.+++++++||++|||||.+|+ +++++.+++..+|+++ T Consensus 219 ~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l~--~~~sn~e~~~~~f~~~ 296 (395) T protein:vir:98 219 TVEKIRTESVVGIPVTANTNYEEYGSKNTGAVKSYVDDIKKLKDQYMAEFAEMLGIPISLLH--GDIADNQKNYELLLEG 296 (395) T ss_pred HHhhhhcCCcceeecCCCceeEecccccccccChhHHHHHHHHHHHHHHHHHHhCCCHHHhc--CCcccHHHHHHHHHHH Confidence 87763 4455788999999999985 467889999999999999999999999997 4577889999999999 Q ss_pred HHHHHHHHHHHHHHHhhhh------hhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCcc-- Q lcl|NC_011801. 303 SLSIYIKPIESELSQKLGT------DVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEG-- 374 (386) Q Consensus 303 ~l~P~~~~ie~~l~~~l~~------~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~-- 374 (386) ||.|++.+||++|+++|++ +++||++.+++.|.+++++++++++++||||+||+|+++|++|++ ++.+|+. T Consensus 297 tl~P~~~~ie~~l~~kll~~~~~~~g~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~Pi~-~~~gD~~~~ 375 (395) T protein:vir:98 297 PIESLITNIVDGLEYAIFDKSETLQGSFIKVTGLKNYDLFSISNQADKLISSGFVFIDEVREEIGLPELP-DGLGKVLYM 375 (395) T ss_pred HHHHHHHHHHHHHHHhcCChhhhcCcceeeehhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC-CCCCceeee Confidence 9999999999999999864 346899999999999999999999999999999999999987753 3323321 Q ss_pred -ccccccC-CC-CCC Q lcl|NC_011801. 375 -TNLLDNT-KN-IND 386 (386) Q Consensus 375 -~~~~~~~-~~-~~~ 386 (386) .+..... ++ -+| T Consensus 376 ~~n~~~~~~~gge~~ 390 (395) T protein:vir:98 376 TKNYESVLERGGEVD 390 (395) T ss_pred cccceecccccCCCC Confidence 1111110 11 111 No 84 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=100.00 E-value=3.8e-68 Score=390.22 Aligned_cols=353 Identities=11% Similarity=0.095 Sum_probs=253.9 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec--------chhHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN--------AQPITD 72 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~--------~~~~~~ 72 (386) ||||+++++++........ .+. ....++.+.|+++++|++||++||+++|++|++++ +|++.+ T Consensus 1 Mgl~d~~~~~~~~~~~~~~-----~~~----~~~~~~~~~~l~~~~v~~~i~~Ia~~ia~lp~~v~~~~~~~~~~~~~~~ 71 (395) T protein:vir:96 1 MGILDFFSFKKSGTLSDDD-----SGS----TTSEKLTNVVLKEDALYKCVNYLARIISKSTFRIKAPEKLTENQKDWLY 71 (395) T ss_pred CcchhhhcCCCCccccccc-----ccc----chhhhcchhhhhhHHHHHHHHHHHHhhccceeEEEeCCccccccchHHH Confidence 9999998766543221111 000 11246778899999999999999999999999875 478999 Q ss_pred HHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEE Q lcl|NC_011801. 73 VLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLY 152 (386) Q Consensus 73 ~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (386) +|+.+||++||+++||+.++.+++++||||+++.++..+.+...++.. ..-.. ...+.+.... ......+ T Consensus 72 lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~~~~~~~~~~~~~~~~~-------~~~~~-~~~~~v~~~~--~~~~~~~ 141 (395) T protein:vir:96 72 WINTKANPNQSASQFWVEVVQKLLVDGETLIFVIPGKGIYVADAFTQD-------KKLSG-NKFKVSRVQG--QTYEKIF 141 (395) T ss_pred HHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEcCCceecCCccccc-------ccccc-ceeeeeeecc--ceeeeEe Confidence 999999999999999999999999999999999987654332222211 11001 1112222221 1234568 Q ss_pred cccceeeeccccccCcccccccccHHHH------HHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHH Q lcl|NC_011801. 153 DSSEVIHFRCTVSGESDTQYMGIPPIDS------LLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQ 226 (386) Q Consensus 153 ~~~~vih~~~~~~~~~~~~~~G~s~~~~------~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~ 226 (386) ++++|+|+|+..... ..++.+++.. +...+.....+.++..+++++|+.+.+++...+ ++..+..++ T Consensus 142 ~~~dvih~k~~~~~~---~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~ 214 (395) T protein:vir:96 142 TFDQVIYLKNDNSDL---MLKVESLWEEYGELLGHVINNQKIANQIRFTMTPPKDKVRERAQENSDG----GRQPKSDKD 214 (395) T ss_pred ccCceEEecccCCcc---ccccccccchHHHHHHHHHHHHHHHHHHHHHhhhcccccccceeeccCc----hhhHHHHHH Confidence 999999999654321 1222222222 222222233455788889999999999887654 333344555 Q ss_pred HHHHHhccc--ccCcceecCCCceeeeccCChhhHHHHHHHHHH------HHHHHHHhCCCHHHhcCCcCcccHHHHHHH Q lcl|NC_011801. 227 SFEEQTTGE--NAGRAVVLDQSADVETTNISPNVTEFLQNVSFS------QDQIAKAFGIPADYLSGKQDAQSNITMIRA 298 (386) Q Consensus 227 ~~~~~~~~~--~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~------~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~ 298 (386) .|++.+++. +++.++++++|++|+++++++.|+|+++.+++. +++||++|||||.+|+ +++++.+++.++ T Consensus 215 ~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp~~l~--~~~sn~e~~~~~ 292 (395) T protein:vir:96 215 FFKRTIEKIRTESVVGIPVTANTNYEEYGSKNTGSVKSYVDDIKKLKDQYMAEFAEMLGIPISLLH--GDIADNQKNYEL 292 (395) T ss_pred HHHHHHHHhhcCCcceEEccCCceeEecccChhhhhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhc--CCCccHHHHHHH Confidence 555554442 344578899999999999999999988887765 5899999999999997 457788999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhh------hhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCC Q lcl|NC_011801. 299 FYQSSLSIYIKPIESELSQKLGT------DVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLD 372 (386) Q Consensus 299 ~~~~~l~P~~~~ie~~l~~~l~~------~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~ 372 (386) |+++||.|++.+||++|+++|++ +++|+++.+++.|.+++++++++++++||||+||+|+++|++|+ |++.++ T Consensus 293 f~~~~L~P~~~~ie~~l~~~Ll~~~e~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi-~~~~gD 371 (395) T protein:vir:96 293 LLEGPIESLITNIVDGLEYAIFDKSETLEGSFIKVTGLKNYDLFSISSQADKLISSGFVFIDEVREEIGLPEL-PDGLGK 371 (395) T ss_pred HHHHHHHHHHHHHHHHHHhhcCChhhhcCceeEeecchhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCCCCc Confidence 99999999999999999999964 24589999999999999999999999999999999999998775 333333 Q ss_pred ccccccccCCC--------CCC Q lcl|NC_011801. 373 EGTNLLDNTKN--------IND 386 (386) Q Consensus 373 ~~~~~~~~~~~--------~~~ 386 (386) + +..++| +++ T Consensus 372 ~----~~~~~N~~~~~~~gge~ 389 (395) T protein:vir:96 372 V----LYMTKNYESVLERGGEV 389 (395) T ss_pred e----eeecccceechhccCCC Confidence 2 222221 111 No 85 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=100.00 E-value=1.1e-65 Score=376.69 Aligned_cols=324 Identities=13% Similarity=0.146 Sum_probs=239.4 Q ss_pred Cchhhhhcccc--ccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec------------ Q lcl|NC_011801. 1 MAFLSNLFKRQ--KMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN------------ 66 (386) Q Consensus 1 Mg~~~~l~~~~--~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~------------ 66 (386) ||||.|++... .......+ ....++...++++++|++||++||++||++|++++ T Consensus 1 M~~f~k~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~~ 68 (378) T protein:vir:85 1 MNLFGKVVSFSRGKLNNDTQR------------VTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTL 68 (378) T ss_pred CchhhhhhhhhhcccccCCcc------------eeeeeccchhhhhHHHHHHHHHHHHhHhhCceeEEEEeccccccccc Confidence 99999875322 22221111 11223444567889999999999999999999653 Q ss_pred ----chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEe-ecCCCceEEEEEEcCcceEEeecCCCceeEEEEec Q lcl|NC_011801. 67 ----AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIID-RDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHF 141 (386) Q Consensus 67 ----~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~-~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~ 141 (386) +|+++++|+.+||++||+++||+.++.+++++||||++++ ++..|.+..+++ . T Consensus 69 ~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~~----------------------~ 126 (378) T protein:vir:85 69 ISMAGSDLDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDSETGELLDLLF----------------------A 126 (378) T ss_pred cccccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecCCCceEEEEEe----------------------c Confidence 4789999999999999999999999999999999999864 444554433322 1 Q ss_pred cCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHH Q lcl|NC_011801. 142 DDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAK 221 (386) Q Consensus 142 ~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~ 221 (386) . ....+.+++|+|++... ..+ .+.+.+..+... ....+++| .++++++.++ .+++++. T Consensus 127 ~-----~~~~~~~~dvih~~~~~---~~~--~~~~~~~~a~~~----------~~~~~~~~-~~~g~l~~~~-~l~~~~~ 184 (378) T protein:vir:85 127 N-----DKKEYKPEELVRLVSPF---YIN--EDTSILDNALAS----------IQTKLEQG-KLRGLLKINA-FLDIDNT 184 (378) T ss_pred C-----CCEEEcccceEEEecCc---Ccc--chhhHHHHHHHH----------HHHHHhcC-CcceEEEeCC-cCCHHHH Confidence 1 12347789999997321 122 233444333322 23344554 6789999875 5788777 Q ss_pred HHHHHHHHHH----hcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHH Q lcl|NC_011801. 222 ENTRQSFEEQ----TTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIR 297 (386) Q Consensus 222 ~~~k~~~~~~----~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~ 297 (386) ++++++|++. .++.++|+++++++|++|+++++++.++|+ +.+++++++||++|||||.+|+ +++++++.. T Consensus 185 ~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~----~s~~e~~~~ 259 (378) T protein:vir:85 185 QEYREKALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIELIKSELLTGYFMNENILL----GTATQEQQI 259 (378) T ss_pred HHHHHHHHHHHHHhhcccccccceecCCCceEEeccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhc----CCchHHHHH Confidence 7776666543 456788899999999999999999999996 6789999999999999999995 456788889 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhh--------------hhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccC Q lcl|NC_011801. 298 AFYQSSLSIYIKPIESELSQKLGT--------------DVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNR 363 (386) Q Consensus 298 ~~~~~~l~P~~~~ie~~l~~~l~~--------------~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~ 363 (386) +|+++||.|++.+||++|+++|+. .++||++.+++.|.+++++++++++++|+||+||+|+++|++ T Consensus 260 ~f~~~tL~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~ 339 (378) T protein:vir:85 260 YFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQ 339 (378) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCChhhhhhhhhccccceeeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC Confidence 999999999999999999999863 256899999999999999999999999999999999999987 Q ss_pred CcCCCCCCCc-ccc--cc------cc---CCCCCC Q lcl|NC_011801. 364 GVFPELDLDE-GTN--LL------DN---TKNIND 386 (386) Q Consensus 364 p~~p~~~~~~-~~~--~~------~~---~~~~~~ 386 (386) |++ ++|... ..+ ++ +. +.+..| T Consensus 340 p~~-gGD~~~~~~N~~~~~~~~~~~~~~~~~~~~~ 373 (378) T protein:vir:85 340 PIE-GGDIYIANLNAVAVKNLSDLQGSRKDVASTD 373 (378) T ss_pred CCC-CCCeEeecccccccccchhhcCccCCCCCCC Confidence 763 433211 000 00 00 111111 No 86 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=100.00 E-value=1.1e-65 Score=376.76 Aligned_cols=320 Identities=13% Similarity=0.143 Sum_probs=240.3 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec-------------- Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN-------------- 66 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~-------------- 66 (386) ||||.|++..+........ ..+...+....+++.++|++||++||++||++|++++ T Consensus 1 M~if~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~v~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~ 70 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNNDT----------QRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLIS 70 (378) T ss_pred CchhHHhHhhhhcccccCc----------ceeeeeecchhhhhhHHHHHHHHHHHHhHhhCceeeeeecccccccccccc Confidence 9999988643211111110 1111233444567888999999999999999999652 Q ss_pred --chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEe-ecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccC Q lcl|NC_011801. 67 --AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIID-RDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDD 143 (386) Q Consensus 67 --~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~-~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~ 143 (386) +|+++++|+.+||++||+++||+.++.+++++||||++++ ++..|.+..+++. . T Consensus 71 ~~~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~~~----------------------~- 127 (378) T protein:vir:94 71 MAGSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSETGELLDLLFA----------------------N- 127 (378) T ss_pred cccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCCCCcEEEEEEe----------------------c- Confidence 4789999999999999999999999999999999999854 4555655443321 1 Q ss_pred cccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHH- Q lcl|NC_011801. 144 SKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKE- 222 (386) Q Consensus 144 ~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~- 222 (386) ....+++++|+|++.... .+ .+.+++..+...+ ...+++| .++|+++.+. .+++++.+ T Consensus 128 ----~~~~~~~~dvih~~~~~~---~~--~~~~~~~~~~~~~----------~~~~~~~-~~~g~l~~~~-~l~~~~~~~ 186 (378) T protein:vir:94 128 ----DKKEYKPEELVRLTSPFY---IN--EDTSILDNALASI----------QTKLEQG-KLRGLLKINA-FLDIDNTQE 186 (378) T ss_pred ----CcEEechhceeeecCcCC---cc--cchhHHHHHHHHH----------HHHHhhC-CcccceeeCC-cCCHHHHHH Confidence 123588999999984321 12 2455655554433 2334444 5689999875 56766554 Q ss_pred ---HHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHH Q lcl|NC_011801. 223 ---NTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAF 299 (386) Q Consensus 223 ---~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~ 299 (386) ++++.|++.+++.++|+++++++|++|+++++++.++|+ +.+++++++||++|||||.+|+ ++.++++..+| T Consensus 187 ~~e~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgvPp~~l~----g~~~e~~~~~f 261 (378) T protein:vir:94 187 YREKALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILL----GTATQEQQIYF 261 (378) T ss_pred HHHHHHHHHHHhhcccccccceeccCCceEEEccCChHHhhH-HHHHHHHHHHHHHhCCCHHHhc----CCchHHHHHHH Confidence 455555555677888899999999999999999999996 7789999999999999999995 45567888999 Q ss_pred HHHHHHHHHHHHHHHHHHhhhh--------------hhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCc Q lcl|NC_011801. 300 YQSSLSIYIKPIESELSQKLGT--------------DVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGV 365 (386) Q Consensus 300 ~~~~l~P~~~~ie~~l~~~l~~--------------~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~ 365 (386) +++||.|++.+||++|+++|++ .++||++.+++.|.+++++++.+++++||||+||+|+++|++|+ T Consensus 262 ~~~tl~P~~~~ie~~l~~~Ll~~~e~~~g~~~~~~~~~~f~~~~l~~~d~~~~~e~~~~~~~~G~~t~NE~R~~~g~~p~ 341 (378) T protein:vir:94 262 YNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPI 341 (378) T ss_pred HHHHHHHHHHHHHHHHHhhcCChhHhhhhhhhcccceeEeecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 9999999999999999999853 25689999999999999999999999999999999999998886 Q ss_pred CCCCCCCccccccccCCCCC--------------------C Q lcl|NC_011801. 366 FPELDLDEGTNLLDNTKNIN--------------------D 386 (386) Q Consensus 366 ~p~~~~~~~~~~~~~~~~~~--------------------~ 386 (386) |++|. +..++|.. | T Consensus 342 -~ggd~------~~~~~n~~~~~~~~~~~~~~~~~~~~~e~ 375 (378) T protein:vir:94 342 -EGGDV------YIANLNAVAVKNLSDLQGNRKDVTSTDET 375 (378) T ss_pred -CCCCe------eeecccccchhcchhcccccCCCCCCCCC Confidence 33331 11111111 1 No 87 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=100.00 E-value=5.2e-64 Score=367.51 Aligned_cols=376 Identities=9% Similarity=0.053 Sum_probs=267.9 Q ss_pred Cchhhhhcccccc--------CCccch-------hhhhhcccccccCcccccH----HHHhccHHHHHHHHHHHHhhccC Q lcl|NC_011801. 1 MAFLSNLFKRQKM--------LSGSSP-------VWILNQGQPVSIKPKAITS----AIALKNSDVYAVISRVSSDIAGC 61 (386) Q Consensus 1 Mg~~~~l~~~~~~--------~~~~~~-------~~~~~~~~~~~~~~~~i~~----~~a~~~~~v~~~v~~ia~~ia~~ 61 (386) |++=+.+...+.. ..+... ......+....+....++. +.+..+|.|++||++||++||++ T Consensus 34 ~~~~~~p~~~~~~~~~~~~~~~d~~~~~~~r~g~~~~~~~~g~~~~~epp~d~~~l~~l~~~np~V~~aI~iia~~ia~l 113 (648) T protein:vir:79 34 MQLGEAPGAMPKGGGGGGSAKRDPKMSLVKRIGLAIMDGGGGGRDFEEPEFDFNEITSAYNTEGYVRQAVDKYIEMMFKA 113 (648) T ss_pred cccCCCccccCCCCcccccccccchhHHHHHhHHHHHhhcCCccccccCCcCHHHHHHHHhcChHHHHHHHHHHHHHhhC Confidence 6664433221100 000000 0011111122222223332 22346899999999999999999 Q ss_pred ceeecchh-------HHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCc---------------eEEEEEE Q lcl|NC_011801. 62 RFVTNAQP-------ITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGY---------------PVRIEPV 119 (386) Q Consensus 62 p~~~~~~~-------~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~---------------~~~l~~l 119 (386) ||++.... ...++..+||+.+++++|++.++.+++++||||++++|+.+|. +..+||| T Consensus 114 ~~~i~~~~~~~~~~~~~~~ll~rPn~~~t~~~f~~~l~~~lll~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl 193 (648) T protein:vir:79 114 DWDFVSKNPNAVEYIRMRFTLMAEATQIPTNQLFIEIAEDLVKYCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGYFPL 193 (648) T ss_pred cceEEecCCccchhhHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCCCccchhhhhhhhccccceeeeEee Confidence 99865321 2334557899999999999999999999999999999998884 4789999 Q ss_pred cCcceEEeecCCCceeEEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 120 PNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTL 199 (386) Q Consensus 120 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~ 199 (386) +|++|++..+.++....|.+... +.+..+.+++++|+|+++.. +.++++|+||+.++..++....+++++..++| T Consensus 194 ~p~~v~v~~d~~g~~~~Y~y~~~--g~~~~~~~~~~dIIHik~~~---~~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF 268 (648) T protein:vir:79 194 NLASMKVKRDKFGMIKGWQQEQE--GQDKPQKFKPEDIVHIYYKR---EKGRAFGTPWLLPALDDIRALRQVEENVLRLV 268 (648) T ss_pred cCceeEEEEcCCCceeeeEEEec--CCceeEEecCccEEEEccCC---CCCCceeccHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999888877776543 33455779999999999643 46778999999999999999999999999999 Q ss_pred hccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeecc----CChhhHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 200 RHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTN----ISPNVTEFLQNVSFSQDQIAKA 275 (386) Q Consensus 200 ~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~----~~~~d~~~~e~~~~~~~~Ia~~ 275 (386) +||++|+++++++......+..+++++.|.+.+.+. .+.+++.+++.++ .+++|+||++++++++++||++ T Consensus 269 ~NGa~P~gil~~~~~~~~~e~~k~~~e~~~~~~~~~-----~i~gg~v~~~~~~i~~~~s~~dlqfle~rk~~~~eIa~a 343 (648) T protein:vir:79 269 YRNLHPLWHVKVGLEQEGFGAEEGEVDLVRGEVENM-----DVEGGMVTTERVNISSIASNQIIDAKEYLKHFEQRAFTV 343 (648) T ss_pred hccCCccEEEEeCCCccchHHHHHHHHHHHHhcccc-----cccccccccceeeccccCCHHHHHHHHHHHHHHHHHHHH Confidence 999999999987644445566667777777665443 2333333433333 2568999999999999999999 Q ss_pred hCCCHHHhcCCcCcccH-HHHHHHHHHHHHHHHHHHHHHHHHHhh----h------------hhhhhcchhhhccCHHHH Q lcl|NC_011801. 276 FGIPADYLSGKQDAQSN-ITMIRAFYQSSLSIYIKPIESELSQKL----G------------TDVKLDIASAIDSDNSEL 338 (386) Q Consensus 276 ~gvp~~~l~~~~~~~~~-~~~~~~~~~~~l~P~~~~ie~~l~~~l----~------------~~~~fd~~~~l~~d~~~~ 338 (386) |||||.+||..++++.+ .++...++..++.|++..++..++..+ . .+++|+++++++.|.+++ T Consensus 344 FgVPP~lLG~~~~ss~stae~~~~~~~~~i~~l~~~i~~~le~~~~~~ll~e~~l~~~l~~d~~ieF~~~~Llr~D~~~~ 423 (648) T protein:vir:79 344 LGVSELMMGRGGTASRSTGDNLSSDFKDRIKALQKVMATFINEFMVKEILMEGGFDPVLNPDDKVEFRFNEIDMDSKIKL 423 (648) T ss_pred hCCCHhHcccCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccceEEEeecccchhhHHHH Confidence 99999999976655433 345556677788888777765554322 1 246899999999999999 Q ss_pred HHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCc-cccccc-------------cCCCCC-C Q lcl|NC_011801. 339 INNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDE-GTNLLD-------------NTKNIN-D 386 (386) Q Consensus 339 ~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~-~~~~~~-------------~~~~~~-~ 386 (386) ++.+.+++++||||+||+|+++|++|++.+.+... ..+++. ++..++ + T Consensus 424 a~~~~~l~~~GilT~NEaR~~lGlpPi~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (648) T protein:vir:79 424 ENQAVFLYEHNAISEDEMRELIGRDPVDDGEGRAKMHLQMVTIAQATALAALAPTPAGGSSAS 486 (648) T ss_pred HHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCccccccccccchhccccccCCCCCCCCCCCC Confidence 99999999999999999999999888754433221 111110 000000 0 No 88 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=100.00 E-value=4.3e-64 Score=367.96 Aligned_cols=380 Identities=15% Similarity=0.062 Sum_probs=279.2 Q ss_pred Cchhhhhccc-c-cc-----------CCccchhhhhhcccccccCcccccHHH----HhccHHHHHHHHHHHHhhccCce Q lcl|NC_011801. 1 MAFLSNLFKR-Q-KM-----------LSGSSPVWILNQGQPVSIKPKAITSAI----ALKNSDVYAVISRVSSDIAGCRF 63 (386) Q Consensus 1 Mg~~~~l~~~-~-~~-----------~~~~~~~~~~~~~~~~~~~~~~i~~~~----a~~~~~v~~~v~~ia~~ia~~p~ 63 (386) |.=-.+-.+. + ++ ...+.......+......-..+++... +..++++++||+.++++||+++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~L~~~~e~~~~~~~~i~~~~~~iag~g~ 80 (651) T protein:vir:99 1 MTDTTGETQETKVHVEGLGGEADLAKSPNSTQIPDHRIQSHNVGVNPPYNPDRLAAFLELNETLATGIRKKSRYEVGFGF 80 (651) T ss_pred CCCccceeeeeEEEeecccccccccccccccccchhhhcccCCCCCCCCCHHHHHHHHhcChHHHHHHHHHhhhhhccCc Confidence 3221110000 0 00 000011111111111222223334333 33478999999999999999998 Q ss_pred eecch------------------------hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEE Q lcl|NC_011801. 64 VTNAQ------------------------PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPV 119 (386) Q Consensus 64 ~~~~~------------------------~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l 119 (386) .+..+ +.+..+...+|+.+|..++++.++.|++.+|++|+.++++..|.+..|+.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiIrn~~g~pv~L~~l 160 (651) T protein:vir:99 81 DLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEMLTDIEGRPVGLAYV 160 (651) T ss_pred eeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhhhcCccchhhhhhc Confidence 76421 112233445799999999999999999999999999999999999999999 Q ss_pred cCcceEEeecCCCce--------------------------------eEEEEe--------------------------- Q lcl|NC_011801. 120 PNEKVTVALDDYGKD--------------------------------LTYTVH--------------------------- 140 (386) Q Consensus 120 ~~~~v~~~~~~~~~~--------------------------------~~~~~~--------------------------- 140 (386) ++..+++..+..... .++.+. T Consensus 161 p~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~~~~~~d~~~ 240 (651) T protein:vir:99 161 PARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVIDESGDEPTIRYREDEES 240 (651) T ss_pred ChhheeeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccceeeeeccCCcceeEEeccCcce Confidence 999887654321100 000000 Q ss_pred -------------ccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCce Q lcl|NC_011801. 141 -------------FDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSI 207 (386) Q Consensus 141 -------------~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ 207 (386) +.....+....+++++|||+|+. ++.++++|+||+..+..++..+.++++++.++|+||++|++ T Consensus 241 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~eViHir~~---~~~~g~~G~spl~~a~~~i~~a~~a~~~~~~~f~NG~~p~g 317 (651) T protein:vir:99 241 EREPIFVDRETGDVTTGDANGLENRPANELIFIPNP---SILEDDYGVPDWVSAIRTISADEAAKDYNRDFFDNDTIPRM 317 (651) T ss_pred eeeeecccceeeeEEEcCCCceeEecccceEEecCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCce Confidence 00011223456889999999854 34688999999999999999999999999999999999999 Q ss_pred EEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCC-----------CceeeeccCCh-hhHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 208 FIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQ-----------SADVETTNISP-NVTEFLQNVSFSQDQIAKA 275 (386) Q Consensus 208 ~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~-----------g~~~~~~~~~~-~d~~~~e~~~~~~~~Ia~~ 275 (386) +|++++..+++++.+++++.|++.++ |+|++++|+. |++|+++++++ +|+||+|++++++++||++ T Consensus 318 il~~~~~~ls~e~~~~lr~~~~~~~~--nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~qfle~r~~~~~eIa~a 395 (651) T protein:vir:99 318 VIKVTGGELSEESKRDLRQMLNGLRE--ESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMDFRQFREKNEHEIAKV 395 (651) T ss_pred EEEecCCCCCHHHHHHHHHHHHHHhc--cCCceEEeecccccccccccCCceEEEcCcCchhhHHHHHHHHHHHHHHHHH Confidence 99998878999999999999998664 5778888865 99999999876 5999999999999999999 Q ss_pred hCCCHHHhcCCcC--cccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh----------hhhhcchhhhccCHHHHHHHHH Q lcl|NC_011801. 276 FGIPADYLSGKQD--AQSNITMIRAFYQSSLSIYIKPIESELSQKLGT----------DVKLDIASAIDSDNSELINNVQ 343 (386) Q Consensus 276 ~gvp~~~l~~~~~--~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~~----------~~~fd~~~~l~~d~~~~~~~~~ 343 (386) |||||.+||..+. +++.+++.+.|+++||+|++.+||++||++|+. +++|+...+++.|.+++++.+. T Consensus 396 fgVPp~~lG~~~~~~~sn~E~~~~~f~~~tL~P~~~~ie~eln~kLl~~~e~~~~~~i~~ef~~~~llr~D~~~~~e~~~ 475 (651) T protein:vir:99 396 LEVPPVKIGVTDSANRSNSDQQDKDFALEVIQPEQHTFAEWLYQIIHQQALGVTDWTIEYELRGADQPKQEAQLAEQRVR 475 (651) T ss_pred hCCCHHHhccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCceEEEEeccchhhhccHHHHHHHHH Confidence 9999999987654 556789999999999999999999999999853 4578888999999999999999 Q ss_pred HHHhCCCcCHHHHHHHhccCCcCCCCCCCccccccccCCCCC------C Q lcl|NC_011801. 344 KLASAGVLAPIQAQKLLKNRGVFPELDLDEGTNLLDNTKNIN------D 386 (386) Q Consensus 344 ~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~~~~~~~~~~------~ 386 (386) +++++||||+||+|+++|++|++ ++.++..-.+++.+..+. + T Consensus 476 ~~i~~G~~T~NE~R~~lglppi~-~~~gd~~l~~~~~~~~g~~~~gge~ 523 (651) T protein:vir:99 476 AMRLAGVGLVDEAREELGLDPLG-EPYGEMTLSEFEAEVAGDVAGGGET 523 (651) T ss_pred HHHhCCCcCHHHHHHHhCCCCCC-CccccccccccccccccccccCCCC Confidence 99999999999999999987764 222222111222111111 0 No 89 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=100.00 E-value=2.7e-58 Score=336.23 Aligned_cols=262 Identities=18% Similarity=0.361 Sum_probs=234.3 Q ss_pred hccCceeec------chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCC Q lcl|NC_011801. 58 IAGCRFVTN------AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDY 131 (386) Q Consensus 58 ia~~p~~~~------~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~ 131 (386) ||++|++++ +|+++++|+.+||++||+++||+.++.+++++||||++++++..|++++||||+|+.|++..+.+ T Consensus 1 ia~l~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~~~~l~~l~~~~v~v~~~~~ 80 (278) T protein:vir:78 1 MASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQ 80 (278) T ss_pred CccceeEEEecCcccccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCcEEEEEEECCceeEEEEcCC Confidence 999999884 58899999999999999999999999999999999999999999999999999999999999999 Q ss_pred CceeEEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEee Q lcl|NC_011801. 132 GKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKV 211 (386) Q Consensus 132 ~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~ 211 (386) +...+|.+... .+..+.+++++|+|+++. ++.++++|+||+.++..++....++++++...+.+ .|+++++. T Consensus 81 ~~~~~y~~~~~---~g~~~~~~~~evih~~~~---~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~~--~~~~i~~~ 152 (278) T protein:vir:78 81 SRELYYSIHAA---TGNKLIVHNMDMLHFKHI---VASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQK--PDSFMLKY 152 (278) T ss_pred CceEEEEEEcC---CceEEEEccccEEEECCC---CCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHhcC--CCcEEEEe Confidence 98888877654 345678999999999864 34677899999999999999999999887665555 46788876 Q ss_pred CCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC--c Q lcl|NC_011801. 212 PNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD--A 289 (386) Q Consensus 212 ~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~--~ 289 (386) + ..+++|+.+++++.|++.+. ++|+++++++|++|+++++++.|+|+.|.+++++++||++|||||.++|..++ + T Consensus 153 ~-~~l~~e~~~~~~~~~~~~~~--~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~ 229 (278) T protein:vir:78 153 G-SNVGKEKRQQVLEDFKQYYE--ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNF 229 (278) T ss_pred C-CCCCHHHHHHHHHHHHHHhc--cCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCc Confidence 5 47899999999999998764 56889999999999999999999999999999999999999999999987654 5 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHhhh--------hhhhhcchhh Q lcl|NC_011801. 290 QSNITMIRAFYQSSLSIYIKPIESELSQKLG--------TDVKLDIASA 330 (386) Q Consensus 290 ~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~--------~~~~fd~~~~ 330 (386) ++.+++.++|+++||+|+++.|+++||++|+ .+++||++.+ T Consensus 230 sn~~~~~~~~~~~~l~P~~~~i~~~ln~~L~~~~e~~~g~~~~f~~~~l 278 (278) T protein:vir:78 230 AKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKIGILNLTLNLI 278 (278) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhcCCceEEEecccC Confidence 5678899999999999999999999999985 4678888887 No 90 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=100.00 E-value=5.5e-56 Score=323.54 Aligned_cols=333 Identities=12% Similarity=0.127 Sum_probs=250.2 Q ss_pred Cchhhhh-cccccc---------------CCccchhhhhhcccccccC-cccccH-------HHHhccHHHHHHHHHHHH Q lcl|NC_011801. 1 MAFLSNL-FKRQKM---------------LSGSSPVWILNQGQPVSIK-PKAITS-------AIALKNSDVYAVISRVSS 56 (386) Q Consensus 1 Mg~~~~l-~~~~~~---------------~~~~~~~~~~~~~~~~~~~-~~~i~~-------~~a~~~~~v~~~v~~ia~ 56 (386) |+=-.+- +++... +....+.....++.+.++. +..++. ..+.+.|.-+.|+..+.+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~fg~p~~~~~~~~~~~~~~~~~~~~~~~~pi~~~~la~~~~ 80 (368) T protein:vir:79 1 MSRNKTRRAARAASAHVRTANTDAPTEHHTDRAAQAEVFSFGDPVEVLDRRELLDYVECMRMGQWYEPPMPWDGLARSFR 80 (368) T ss_pred CCccccccchhccCcccccccccCcchhhccccCceEEEEcCCceeecchhhHHHHHHHHhccchhccCcCHHHHHHHHh Confidence 5533211 111100 0111222223334333222 111111 112233444455544443 Q ss_pred hhccC-ceeecchhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCcee Q lcl|NC_011801. 57 DIAGC-RFVTNAQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDL 135 (386) Q Consensus 57 ~ia~~-p~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~ 135 (386) .-+.- .....+|+++.++ .+||++||+++|++ ++.+++++||||++++++..|++++|+|++|..|++..+.+. . T Consensus 81 ~~~~h~~~~~~~~n~l~l~-~~Pn~~~t~~~f~~-l~~d~ll~Gnay~~~~r~~~G~~~~L~~l~~~~v~~~~~~~~--~ 156 (368) T protein:vir:79 81 AAAHHSSAVYVKRNILVST-FIPHPLLSRATFER-LVLDWQVFGNAYLERRENVLGGTIRLDTPLAKYVRRGLDLNT--Y 156 (368) T ss_pred hccccchhhhhhcchhhhh-cCCCcCCCHHHHHH-HHHHHhhcCCeEEEEEEcCCCCEEEEEEeCcccceeeccCCE--E Confidence 32211 1123457777665 57999999999975 788999999999999999999999999999999998766542 2 Q ss_pred EEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCC Q lcl|NC_011801. 136 TYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNAT 215 (386) Q Consensus 136 ~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~ 215 (386) ++. . ..+..+.+++++|+|+|.. ++.++++|+||+.++..++....++..+..++|+||++|+++|++++.. T Consensus 157 ~~~--~---~~~~~~~~~~~dIihir~~---~~~~~~yGlsp~~~a~~si~l~~aa~~~~~~~~~NGa~~~gil~~~~~~ 228 (368) T protein:vir:79 157 FFV--Q---NWQQPYTFAAGSVFHLQEP---DINQEVYGLPEYLSALNATWLNESATLFRRRYYKNGSHAGFILYMTDAA 228 (368) T ss_pred EEE--e---cCCeEEEEccccEEEecCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCC Confidence 221 1 2345678999999999843 4568899999999999999999999999999999999999999998888 Q ss_pred CCHHHHHHHHHHHHHHhcccccCcceec-----CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC-- Q lcl|NC_011801. 216 LGKEAKENTRQSFEEQTTGENAGRAVVL-----DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD-- 288 (386) Q Consensus 216 ~~~~~~~~~k~~~~~~~~~~~~g~~~vl-----~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~-- 288 (386) +++++.+++++.|++..+..|+|+++++ ++|++|++++.+++|+||+|.+++++++||++|||||.++|..++ T Consensus 229 l~~e~~~~lk~~~~~~~G~~N~g~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t 308 (368) T protein:vir:79 229 QKQEDVDTLREAMKSAKGPGNFRNLFMYAPNGKKDGIQLLPVSEVAAKDEFWNIKNVTRDDQLAAHRVPPQLMGIIPNNT 308 (368) T ss_pred CCHHHHHHHHHHHHHhcCCcccCceeEecCCCCccceeEEEcCCCHHHHHHHHHHHHhHHHHHHHhCCCHHHccccCCCC Confidence 9999999999999987766899999998 678999999999999999999999999999999999999987543 Q ss_pred --cccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhhhcchhhhccCHHHHHHHHHHHH Q lcl|NC_011801. 289 --AQSNITMIRAFYQSSLSIYIKPIESELSQKLGT-DVKLDIASAIDSDNSELINNVQKLA 346 (386) Q Consensus 289 --~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~~-~~~fd~~~~l~~d~~~~~~~~~~~~ 346 (386) +++.+++.+.|+++||.|+++.|| +++.+|+. .++|+...+++.|.++++....+-- T Consensus 309 ~~~sn~e~~~~~f~~~~l~Pl~~~ie-~ln~~l~~e~~rF~~~~l~~~D~~a~a~~~~rsa 368 (368) T protein:vir:79 309 GGFGDVEKAAMVFARNEVKPLQDRLL-AINDWIGDEVVRFAPYALGGHDQPAAAPGGQRSA 368 (368) T ss_pred CccccHHHHHHHHHHHHHHHHHHHHH-HHHhccCcceeeechhHhhcccccccCCcccccC Confidence 457789999999999999999998 68888874 5689999999999988886322211 No 91 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=100.00 E-value=1.1e-54 Score=316.45 Aligned_cols=308 Identities=14% Similarity=0.145 Sum_probs=235.5 Q ss_pred CchhhhhccccccCCccch-----------hhhhhcccccc-cCc-----------------ccccHHH----HhccHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSP-----------VWILNQGQPVS-IKP-----------------KAITSAI----ALKNSDV 47 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~-----------~~~~~~~~~~~-~~~-----------------~~i~~~~----a~~~~~v 47 (386) |+=-+ ++........+ .....++.+.. +++ .+|+... .-.++.+ T Consensus 26 ~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~f~fg~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~La~~~~~~~~h 102 (376) T protein:vir:10 26 MSKRR---SRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHH 102 (376) T ss_pred chhcc---CCCcccchhhhhHhhhccCcceeEEEEcCCceeccCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhHHh Confidence 43221 11111100000 00111111111 010 1122211 1123334 Q ss_pred HHHHHHHHHhhccCceeecchhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEe Q lcl|NC_011801. 48 YAVISRVSSDIAGCRFVTNAQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVA 127 (386) Q Consensus 48 ~~~v~~ia~~ia~~p~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~ 127 (386) .+||...++.+++ ..+||++||+.+|++ ++.+++++||||++++|+..|++++|+||+|.+|++. T Consensus 103 ~s~l~~k~n~l~~--------------~~~Pnp~lT~~~f~~-~v~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~ 167 (376) T protein:vir:10 103 SSALFFKANVLAS--------------TFRPHRWLSRHAFER-WALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRK 167 (376) T ss_pred hhhHHHHhHHHHh--------------ccCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCCEEEEEEeCCcceEEE Confidence 4444444443322 246999999999985 5679999999999999999999999999999999998 Q ss_pred ecCCCceeEEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCce Q lcl|NC_011801. 128 LDDYGKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSI 207 (386) Q Consensus 128 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ 207 (386) .+.+... +.. ..+..+.+++++|+|++.. ++.++++|+||+.++..++....+++.|..++|+||++|++ T Consensus 168 ~d~~~~~----~~~---~~~~~~~~~~~eViHir~~---~~~~~~yGls~~~~a~~si~l~~aa~~f~~~~f~NGa~pgg 237 (376) T protein:vir:10 168 ADFNGFV----YVN---GWQERHEFEPDSVFQLVRP---DINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGF 237 (376) T ss_pred eeCCeEE----EEE---cCCeEEEEccccEEEecCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCce Confidence 8766422 111 2345678999999999853 56688999999999999999999999999999999999999 Q ss_pred EEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceec-----CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHH Q lcl|NC_011801. 208 FIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVL-----DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADY 282 (386) Q Consensus 208 ~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl-----~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~ 282 (386) +|+.++..+++++.+++++.|++..+..|+++++++ ++|++|++++.+++|+||+|.+++++++||++|||||.+ T Consensus 238 Il~~~d~~l~~e~~~~lr~~~~~~~G~~N~~~~~vl~~~g~~~Gi~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~l 317 (376) T protein:vir:10 238 ILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQL 317 (376) T ss_pred EEEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEccCCHHHHHHHHHHHHhHHHHHHHhCCCHHH Confidence 999988889999999999999986666888888888 568999999999999999999999999999999999999 Q ss_pred hcCCcC----cccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhhhcchhhhccCHHH Q lcl|NC_011801. 283 LSGKQD----AQSNITMIRAFYQSSLSIYIKPIESELSQKLGT-DVKLDIASAIDSDNSE 337 (386) Q Consensus 283 l~~~~~----~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~~-~~~fd~~~~l~~d~~~ 337 (386) +|..++ +++.+++.+.|+++||.|+++.|| +++.+|+. .++|+...+++.|.++ T Consensus 318 lGi~~~~t~~~sn~eq~~~~f~~~~L~Pl~~~ie-eln~~L~~~~~~F~~~~Llr~d~ka 376 (376) T protein:vir:10 318 LGIVPSNSGGFGTPDTAARVFGRNEIRPLQARFA-ELNDWLGEEVVRFDDYEIPPAPVAA 376 (376) T ss_pred hcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHH-HHHhhccccccccChhHhhcccccC Confidence 987543 467789999999999999999998 58888875 5899999999999988 No 92 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=100.00 E-value=3.7e-54 Score=313.54 Aligned_cols=308 Identities=13% Similarity=0.148 Sum_probs=234.9 Q ss_pred hhhhccccccCCccch-----------hhhhhcccccc-cCc-------------c----cccHHH----HhccHHHHHH Q lcl|NC_011801. 4 LSNLFKRQKMLSGSSP-----------VWILNQGQPVS-IKP-------------K----AITSAI----ALKNSDVYAV 50 (386) Q Consensus 4 ~~~l~~~~~~~~~~~~-----------~~~~~~~~~~~-~~~-------------~----~i~~~~----a~~~~~v~~~ 50 (386) +++-.+++.......+ .....++.+.. +++ . +|+... .-.++.+.+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~h~~~ 80 (351) T protein:vir:79 1 MSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSSA 80 (351) T ss_pred CCCCCCCCCCCCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhHhhhhh Confidence 3322211111100000 01111111111 111 0 122211 1112333344 Q ss_pred HHHHHHhhccCceeecchhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecC Q lcl|NC_011801. 51 ISRVSSDIAGCRFVTNAQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDD 130 (386) Q Consensus 51 v~~ia~~ia~~p~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~ 130 (386) |...++.+++ .-+||+.||+++|+ .++.+++++||||++++|+..|++++|+||+|.+|++..+. T Consensus 81 l~~k~n~l~~--------------~~~Pnp~~t~~~f~-~~v~d~ll~Gnay~~~~r~~~G~~~~L~~l~~~~v~~~~~~ 145 (351) T protein:vir:79 81 LFFKANVLAS--------------TFRPHRWLSRHAFE-RWALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADF 145 (351) T ss_pred hhhhhhHHhh--------------cccCCCCCCHHHHH-HHHHHHHhcCCeEEEEEECCCCCEEEEEEeCCcceeeeecC Confidence 4333333221 24699999999996 57789999999999999999999999999999999998777 Q ss_pred CCceeEEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEe Q lcl|NC_011801. 131 YGKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIK 210 (386) Q Consensus 131 ~~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~ 210 (386) ++. .+.. ..+..+.+++++|||+|.. ++.++++|+||+.++..++....+++.+..++|+||++|+++|+ T Consensus 146 ~~~----~~~~---~~g~~~~~~~~eIihir~~---~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~il~ 215 (351) T protein:vir:79 146 SGF----VYVN---GWQERHEFEPDSVFQLVRP---DINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILY 215 (351) T ss_pred CeE----EEEe---cCceEEEEcCccEEEeCCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEE Confidence 642 2221 2345678999999999853 56688999999999999999999999999999999999999999 Q ss_pred eCCCCCCHHHHHHHHHHHHHHhcccccCcceec-----CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcC Q lcl|NC_011801. 211 VPNATLGKEAKENTRQSFEEQTTGENAGRAVVL-----DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSG 285 (386) Q Consensus 211 ~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl-----~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~ 285 (386) .++..+++++.++++++|++..+..|+++++++ ++|++|++++.+++|+||++.+++++++||++|||||.++|. T Consensus 216 ~~~~~ls~e~~~~lk~~~~~~~G~~N~~~~~v~~~~g~~~gi~~~pl~~~~~d~ef~e~k~~s~~eI~~a~~VPp~llGi 295 (351) T protein:vir:79 216 MTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGI 295 (351) T ss_pred ecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcc Confidence 988889999999999999987666888988887 578999999999999999999999999999999999999987 Q ss_pred CcC----cccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh-hhhcchhhhccCHHH Q lcl|NC_011801. 286 KQD----AQSNITMIRAFYQSSLSIYIKPIESELSQKLGTD-VKLDIASAIDSDNSE 337 (386) Q Consensus 286 ~~~----~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~~~-~~fd~~~~l~~d~~~ 337 (386) .+. +++.+++.+.|+++||.|+++.||+ +|.+|+.+ ++|+...+++.|.++ T Consensus 296 ~~~~t~~~~n~e~~~~~f~~~~l~Pl~~~ie~-ln~~lg~~~~~F~~~~llr~d~~a 351 (351) T protein:vir:79 296 VPSNSGGFGTPDTAARVFGRNEIRPLQARFAE-LNDWLGDEVVTFDDYEIPPAPVAA 351 (351) T ss_pred cCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhcCcceeeeChhhhccccccC Confidence 433 4577999999999999999999985 88888865 699999999999988 No 93 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=100.00 E-value=4e-54 Score=313.35 Aligned_cols=324 Identities=14% Similarity=0.162 Sum_probs=242.2 Q ss_pred Cchhhhhc-cccccCCccchhhhhhcccccccCc-ccccHH--H------HhccHHHHHHHHHHHHhhc--cCceeecch Q lcl|NC_011801. 1 MAFLSNLF-KRQKMLSGSSPVWILNQGQPVSIKP-KAITSA--I------ALKNSDVYAVISRVSSDIA--GCRFVTNAQ 68 (386) Q Consensus 1 Mg~~~~l~-~~~~~~~~~~~~~~~~~~~~~~~~~-~~i~~~--~------a~~~~~v~~~v~~ia~~ia--~~p~~~~~~ 68 (386) |+=..+-. .+..++..........++.+-+... ..++.. . +..-|.-+..+-.+.+..+ +-+++++.| T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~~~~~~h~~~i~~k~n 80 (346) T protein:vir:10 1 MKKQLRKNLTQNDRLQPQAQTEIFSFGDPIPVLDRADILNYLECSAMYEKWYNPPMSFDGLAKSLRSSTHHESAIITKAN 80 (346) T ss_pred CCcccCCCCCcccccccccCeEEEecCCcceecCchhHHHHHHHhhcCCceEecCCCHHHHHHHHHhhhhcchhhhhhhh Confidence 44332110 0011111111112222222211111 111100 0 0011111222222222222 335567778 Q ss_pred hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccce Q lcl|NC_011801. 69 PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSG 148 (386) Q Consensus 69 ~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~ 148 (386) .+..++. +||++||+.+|++ ++.+++++||||++++|+..|++++|+|++|..|++..+.++.. |.+.. ..+. T Consensus 81 ~l~~l~~-~Pn~~~t~~~f~~-~~~d~ll~Gnay~~i~r~~~G~~~~L~pl~~~~v~~~~~~~~~~--~~~~~---~~g~ 153 (346) T protein:vir:10 81 ILLSTCE-VDSRYLSRRDLSS-FVKDYLVFGNAYFEVVRNRLGQVQRIESPLAKYVRKGLEAGQFY--YVPQR---FDHQ 153 (346) T ss_pred hHHHHHh-CCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCcEEEEEEecCCceEEEEcCCeEE--EEEEc---cCCe Confidence 8888774 6999999999986 56899999999999999999999999999999999987766532 22222 2245 Q ss_pred eEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHH Q lcl|NC_011801. 149 DFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSF 228 (386) Q Consensus 149 ~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~ 228 (386) .+.+++++|+|+|.. ++.++++|+||+..+..++....+++.+..++|+||++|+++|+.++..+++++.+++++.| T Consensus 154 ~~~~~~~dIih~r~~---~~~~~~~G~~~~~~a~~si~l~~~a~~~~~~~~~NG~~~~~il~~~d~~l~~e~~~~i~~~~ 230 (346) T protein:vir:10 154 EHEFAKGSIYHLLEP---DINQDIYGLPQYLSALQSAWLNESATLFRRKYFLNGAHAGFVFYMSDASQKQEDVENIRQQL 230 (346) T ss_pred EEEEecccEEEecCC---CCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHHHHHHHHHHH Confidence 678999999999854 45678999999999999999999999999999999999999999888889999999999999 Q ss_pred HHHhcccccCcceecC-----CCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCc----CcccHHHHHHHH Q lcl|NC_011801. 229 EEQTTGENAGRAVVLD-----QSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQ----DAQSNITMIRAF 299 (386) Q Consensus 229 ~~~~~~~~~g~~~vl~-----~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~----~~~~~~~~~~~~ 299 (386) ++.+++.|+|+++++. .|+++++++.+++|+||++.+++++++||++|||||.++|..+ ++++.+++.+.| T Consensus 231 ~~~~g~~n~~~~~vl~~~~~~~gi~~~pis~~~~d~qf~e~k~~~~~~I~~af~VPp~llG~~~~~~~~~s~~e~~~~~f 310 (346) T protein:vir:10 231 KQSKGVGNFKNLFVHAPNGKKDGIQIIPIADVSAKDEFFNIKNVSRDDVLAAHRVPPQLMGIIPNNTGGFGNVADAAEVF 310 (346) T ss_pred HHhcCccccCceeEecCCCCccceeEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHH Confidence 9988778999999885 4789999999999999999999999999999999999998643 245778999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhh-hhhcchhhhccCH Q lcl|NC_011801. 300 YQSSLSIYIKPIESELSQKLGTD-VKLDIASAIDSDN 335 (386) Q Consensus 300 ~~~~l~P~~~~ie~~l~~~l~~~-~~fd~~~~l~~d~ 335 (386) ++++|.|+++.||+ ++.+|+.. ++|+...+++.|. T Consensus 311 ~~~~l~P~~~~iee-~n~~L~~e~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 311 FITEIEPLQERLKE-FNQWLGQEVIKFKPSKLLQRTQ 346 (346) T ss_pred HHHHHHHHHHHHHH-HHhhcccceeeechhhhcccCC Confidence 99999999999985 77777754 7999999999988 No 94 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=100.00 E-value=1.5e-53 Score=310.22 Aligned_cols=308 Identities=13% Similarity=0.128 Sum_probs=234.5 Q ss_pred hhhhccccccCCccc-----------hhhhhhcccccc-cC-------------cc----cccHHH----HhccHHHHHH Q lcl|NC_011801. 4 LSNLFKRQKMLSGSS-----------PVWILNQGQPVS-IK-------------PK----AITSAI----ALKNSDVYAV 50 (386) Q Consensus 4 ~~~l~~~~~~~~~~~-----------~~~~~~~~~~~~-~~-------------~~----~i~~~~----a~~~~~v~~~ 50 (386) +++-.+++....... ......++.+.. ++ +. +|+... .-.++.+.+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~h~~~ 80 (351) T protein:vir:78 1 MSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSSA 80 (351) T ss_pred CCCCCCCCCCCCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhccCceecCCCCHHHHHHHHhhhHhhhhh Confidence 332211111110000 001111111110 01 01 122211 1112333344 Q ss_pred HHHHHHhhccCceeecchhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecC Q lcl|NC_011801. 51 ISRVSSDIAGCRFVTNAQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDD 130 (386) Q Consensus 51 v~~ia~~ia~~p~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~ 130 (386) |...++.+++ ..+||++||+++|+ .++.+++++||||++++|+..|++++|+||+|.+|++..+. T Consensus 81 l~~k~n~l~~--------------~~~Pn~~~t~~~f~-~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~v~~~~~~ 145 (351) T protein:vir:78 81 LFFKANVLAS--------------TFRPHRWLSRHAFE-RWALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADF 145 (351) T ss_pred hhhhhhHHhh--------------cccCCCCCCHHHHH-HHHHHHHhcCCeEEEEEECCCCCEEEEEEecCcceEEeeeC Confidence 4443333322 24699999999997 56679999999999999999999999999999999998877 Q ss_pred CCceeEEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEe Q lcl|NC_011801. 131 YGKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIK 210 (386) Q Consensus 131 ~~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~ 210 (386) ++.. +.. ..+..+.+++++|+|++.. ++.++++|+|++..+..++....++..+..++|+||++|+++|+ T Consensus 146 ~~~~----~~~---~~~~~~~~~~~eVihir~~---~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pggIl~ 215 (351) T protein:vir:78 146 SGFV----YVN---GWQERHEFAPDSVFQLVRP---DINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILY 215 (351) T ss_pred CeEE----EEe---cCCeEEEEccccEEEEcCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEE Confidence 6422 222 2345678999999999853 56788999999999999999999999999999999999999999 Q ss_pred eCCCCCCHHHHHHHHHHHHHHhcccccCcceec-----CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcC Q lcl|NC_011801. 211 VPNATLGKEAKENTRQSFEEQTTGENAGRAVVL-----DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSG 285 (386) Q Consensus 211 ~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl-----~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~ 285 (386) .++..+++++.++++++|++..+..|+|+++++ ++|+++++++.+++|+||+|.+++++++||++|||||.++|. T Consensus 216 ~~~~~ls~e~~~~lr~~~~~~~G~~N~~~~~v~~~~g~~~g~k~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi 295 (351) T protein:vir:78 216 MTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGI 295 (351) T ss_pred ecCCCCCHHHHHHHHHHHHHhcCcccccceeeecCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcc Confidence 988889999999999999987666899998887 578999999999999999999999999999999999999987 Q ss_pred CcC----cccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhhhcchhhhccCHHH Q lcl|NC_011801. 286 KQD----AQSNITMIRAFYQSSLSIYIKPIESELSQKLGT-DVKLDIASAIDSDNSE 337 (386) Q Consensus 286 ~~~----~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~~-~~~fd~~~~l~~d~~~ 337 (386) .+. +++.+++.+.|+++||.|+++.||+ ++.+|+. .++|+...+++.|.++ T Consensus 296 ~~~~t~~~sn~e~~~~~f~~~~l~P~~~~iee-~n~~l~~~~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 296 VPSNSGGFGTPDTAARVFGRNEIRPLQARFAE-LNDWLGDEVVRFDDYEIPPAPVAA 351 (351) T ss_pred cCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhcCccceecChhhhccccccC Confidence 432 4677899999999999999999995 6777764 5899999999999988 No 95 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=100.00 E-value=1.5e-53 Score=310.15 Aligned_cols=317 Identities=13% Similarity=0.185 Sum_probs=234.0 Q ss_pred CchhhhhccccccC---CccchhhhhhcccccccC-cccc-------cHHHHhccHHHHHHHHHHHHhhc--cCceeecc Q lcl|NC_011801. 1 MAFLSNLFKRQKML---SGSSPVWILNQGQPVSIK-PKAI-------TSAIALKNSDVYAVISRVSSDIA--GCRFVTNA 67 (386) Q Consensus 1 Mg~~~~l~~~~~~~---~~~~~~~~~~~~~~~~~~-~~~i-------~~~~a~~~~~v~~~v~~ia~~ia--~~p~~~~~ 67 (386) |+ .| .+++... ..........++.+.... +..+ ....+.+-|.-+..+-.+.+.-+ +-+++.+. T Consensus 1 m~--~~-~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~~a~~~h~s~i~~k~ 77 (340) T protein:vir:98 1 MS--KR-KPRKAVAMTASAPQKMEAFTFGEPVPVLDKRDILDYVECISNGKWYEPPVSFSGLAKSLRSAVHHSSPIYVKR 77 (340) T ss_pred CC--CC-CCCccccccccCccceeEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHHhccccchhhhhhh Confidence 55 21 1122111 111112222233322211 1100 00111112222222222222111 11233333 Q ss_pred hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccc Q lcl|NC_011801. 68 QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRS 147 (386) Q Consensus 68 ~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~ 147 (386) +-+... -+||++||+.+|+ .++.+++++||||++++|+..|++++|+|+++..|++..+.+ .+|.+. ..+ T Consensus 78 n~l~~~--~~Pn~~lt~~~f~-~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~~~~---~~~~~~----~~~ 147 (340) T protein:vir:98 78 NVLAST--YIPHPLLSRQDFS-RFALDYLVFGNAFLEQRHSVTGQLIKLLTSPAKYTRRGVDDS---VFWFVE----NFT 147 (340) T ss_pred hHHhhc--cCCCCCCCHHHHH-HHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceEEEcccCc---EEEEEe----cCC Confidence 322222 3699999999996 566899999999999999999999999999999999876554 233332 234 Q ss_pred eeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHH Q lcl|NC_011801. 148 GDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQS 227 (386) Q Consensus 148 ~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~ 227 (386) ..+.+++++|+|++. +++.++++|+|++..+.+++....++..+..++|+||++|+++|.+++..+++++.++++++ T Consensus 148 ~~~~~~~~eViHir~---~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~pg~il~~~~~~ls~e~~~~lk~~ 224 (340) T protein:vir:98 148 QPHEFAPDTVFHLLE---PDINQEIYGLPEYLSALNSAWLNESATLFRRKYYQNGAHAGYIMYVTDPAQSATDVESLRDA 224 (340) T ss_pred eEEEEccccEEEEcC---CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHH Confidence 567799999999985 34668899999999999999999999999999999999999999998888999999999999 Q ss_pred HHHHhcccccCcceec-----CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCc----CcccHHHHHHH Q lcl|NC_011801. 228 FEEQTTGENAGRAVVL-----DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQ----DAQSNITMIRA 298 (386) Q Consensus 228 ~~~~~~~~~~g~~~vl-----~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~----~~~~~~~~~~~ 298 (386) |++..+..|+++++++ ++|++|++++.+++|+||++.+++++++||++|||||.++|..+ ++++.+++.+. T Consensus 225 ~~~~~G~~n~~~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~ 304 (340) T protein:vir:98 225 MRNSKGLGNFKNLFFYSPNGKPDGIKIVPLSEVATKDDFFNIKKASAADLMDAHRVPFQLMGGKPENIGSLGDVEKVAKV 304 (340) T ss_pred HHHhcCccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHHHH Confidence 9987656788888888 57899999999999999999999999999999999999998743 24677999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhh-hhhcchhhhccC Q lcl|NC_011801. 299 FYQSSLSIYIKPIESELSQKLGTD-VKLDIASAIDSD 334 (386) Q Consensus 299 ~~~~~l~P~~~~ie~~l~~~l~~~-~~fd~~~~l~~d 334 (386) |+++||.|+++.||+ +|.+|+.. ++|+...+++.| T Consensus 305 f~~~~l~Pl~~~iee-~n~~L~~e~~rF~~~~l~~~d 340 (340) T protein:vir:98 305 FVRNELSPLQDRFRE-VNDWLGMEVIRFKEYTLDNPE 340 (340) T ss_pred HHHHHHHHHHHHHHH-HHhcccccccccCccccccCC Confidence 999999999999995 88888765 689988999999 No 96 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=100.00 E-value=3.1e-53 Score=308.45 Aligned_cols=310 Identities=10% Similarity=0.060 Sum_probs=232.9 Q ss_pred Cchhhhhcc-cccc-CCccchhhhhhcc--cccccCcccccHHH----HhccHHHHHHHHHHHHhhccCceeecchhHHH Q lcl|NC_011801. 1 MAFLSNLFK-RQKM-LSGSSPVWILNQG--QPVSIKPKAITSAI----ALKNSDVYAVISRVSSDIAGCRFVTNAQPITD 72 (386) Q Consensus 1 Mg~~~~l~~-~~~~-~~~~~~~~~~~~~--~~~~~~~~~i~~~~----a~~~~~v~~~v~~ia~~ia~~p~~~~~~~~~~ 72 (386) +.-|+ |+ .+.. .....-....... ....+...+++... ...++.+.+||....+.+++ T Consensus 17 ~~~~~--~~~~p~~~~~~~~~~~~~~~~~~~~~~~~epp~~~~~La~l~~~n~~h~~~i~~k~N~l~~------------ 82 (348) T protein:vir:26 17 KSVYS--FDPNPEPVDTNSWMTRYCELFYNDFDDYWEPPISLKGLAEIANANGYHGSLLKARANYVAG------------ 82 (348) T ss_pred ceEEE--ecCCCeeecCcchHHHHHHHHhcCCCccccCCCCHHHHHHHHhhhhhhhhhHhhhhhHHhh------------ Confidence 22221 22 1110 0000000000000 01112222343322 12345566676666655443 Q ss_pred HHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEE Q lcl|NC_011801. 73 VLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLY 152 (386) Q Consensus 73 ~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (386) .-.||++||+.+|++. +.+++++||||++++|+..|++++|+||+|..|++..+.. +|.+.. .+..+.| T Consensus 83 --~~~Pn~~~t~~~f~~~-~~d~ll~Gnay~~~~rn~~G~~~~L~~l~~~~v~~~~d~~----~~~~~~----~g~~~~f 151 (348) T protein:vir:26 83 --RFMNGGGLPMYKMNSA-CWDYFGLGMSAFVKIRSYLKNVIALEPLPMVHMRKRKNGD----FVQLLR----NNEQKVF 151 (348) T ss_pred --cccCCCCCCHHHHHHH-HHHHHhcCCeEEEEEEcCCCcEEEEEEecCceeEeeecCc----EEEEEe----cCeEEEE Confidence 2369999999999764 6799999999999999999999999999999999876643 222222 2456789 Q ss_pred cccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_011801. 153 DSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQT 232 (386) Q Consensus 153 ~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~ 232 (386) ++++|+|++. +++.++++|+||+.++.+++....++..+..++|+||++|+++|..++..+++++.+++++.|++.. T Consensus 152 ~~~dIiHir~---~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~~~~ls~e~~~~lk~~~~~~~ 228 (348) T protein:vir:26 152 KAKDVIFIPQ---YDPQQQIYGLPDYLGSIQSSLLNRDATLFRRRYYLNGAHMGFIFYATDPNLSEADEKALKEKIASSK 228 (348) T ss_pred cCccEEEEcC---CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhc Confidence 9999999985 3556889999999999999999999999999999999999999998888899999999999999987 Q ss_pred cccccCcceec-----CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCC----cCcccHHHHHHHHHHHH Q lcl|NC_011801. 233 TGENAGRAVVL-----DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGK----QDAQSNITMIRAFYQSS 303 (386) Q Consensus 233 ~~~~~g~~~vl-----~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~----~~~~~~~~~~~~~~~~~ 303 (386) ++.|+++++++ ++|+++++++.+++|+||++.+++++++||++|||||.++|.. .++++.+++.+.|+++| T Consensus 229 G~~n~~~~~vl~~~g~~~Gi~~~pis~~~~d~qf~e~k~~t~~dIa~af~VPp~llGi~~~~~~~~sn~e~~~~~f~~~~ 308 (348) T protein:vir:26 229 GIGNFRSMFVNIPNGKEKGIQLIPVGDIATKDEFERIKNITAQDIFVGHRFPAGMGGMLPQQGANVPDPLKVSQVYDFYE 308 (348) T ss_pred CcccccceeEEcCCCCccceeEEEccCChhHHHHHHHHHhhHHHHHHHhCCCHHHccccCCCCCccccHHHHHHHHHHHH Confidence 77888999988 7899999999999999999999999999999999999999863 23467799999999999 Q ss_pred HHHHHHHHHHHHHHhhhh----hhhhcchhhhc-cCHHHH Q lcl|NC_011801. 304 LSIYIKPIESELSQKLGT----DVKLDIASAID-SDNSEL 338 (386) Q Consensus 304 l~P~~~~ie~~l~~~l~~----~~~fd~~~~l~-~d~~~~ 338 (386) |.|+++.||++||++|+. +++||++.... .|..+. T Consensus 309 l~P~~~~ie~~ln~~l~~~~~~~~~fdl~~~~e~~~~~a~ 348 (348) T protein:vir:26 309 VIPVCKRFMDAVNNDPEIPDNLKLKFNLNPGVESANGSAV 348 (348) T ss_pred HHHHHHHHHHHHhhhhCCCCccEEEEecCcccccchhhcC Confidence 999999999999999852 45677664322 222221 No 97 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=100.00 E-value=1.4e-52 Score=304.90 Aligned_cols=324 Identities=12% Similarity=0.095 Sum_probs=231.9 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCccccc------HHHHhccHHHHHHHHHHHH--hhccCceeecchhHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAIT------SAIALKNSDVYAVISRVSS--DIAGCRFVTNAQPITD 72 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~------~~~a~~~~~v~~~v~~ia~--~ia~~p~~~~~~~~~~ 72 (386) |.=....-.....+..........++.+.....-.+. ...+..-|.-+..+-.+.+ .-.+-.++++.+-+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~y~~~~~~~~~~~~epp~~~~~la~l~~~~~~h~~~i~~k~n~l~~ 80 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRTFSLNEISASPALDYVGIGFDENYNCYLPPVNRHALAKLPHQNAQHGGILHSRANMVSS 80 (345) T ss_pred CCCCccccchhhcccCcceeEEeecCCcccccchhhhhhhhcCCccccCCCCCHHHHHHHhhcccccccceeeechHHHh Confidence 3322211001111111111122333333222111000 0001111221222222211 1112233343333332 Q ss_pred HHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEE Q lcl|NC_011801. 73 VLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLY 152 (386) Q Consensus 73 ~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (386) + .+||+.||+++|++ ++.+++++||||++++|+..|++++|+||+|..|++..+.+.....+.+... ..+..+.+ T Consensus 81 -~-~~Pn~~lt~~~f~~-~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~d~~~~~~~~~~~~~--~~g~~~~~ 155 (345) T protein:vir:37 81 -L-YEGGKALSRMDMRA-LCLNLIQFGDVGLLKVRNGFGQVVRLVPLSSLYLRVRKDGGYSYLMKKSLYD--TAQEIYRY 155 (345) T ss_pred -h-ccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCcEEEEEEEcCceeEEEEeCCeeEEEEEeEec--CCceEEEE Confidence 2 36999999999975 5679999999999999999999999999999999998877665433332222 23456789 Q ss_pred cccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_011801. 153 DSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQT 232 (386) Q Consensus 153 ~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~ 232 (386) ++++|+|++.. ++.++++|+||+..+..++....++..++.++|+||++|+++|..++..+++++.++++++|++.. T Consensus 156 ~~~dVihir~~---~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NG~~p~~Il~~~d~~l~~e~~~~lk~~~~~~~ 232 (345) T protein:vir:37 156 DAKDIIFIKLY---DPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTDPDLTEEMEEEIARKISESK 232 (345) T ss_pred ccccEEEecCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEecCCCCCHHHHHHHHHHHHHhc Confidence 99999999853 456789999999999999999999999999999999999999999888899999999999999877 Q ss_pred cccccCcceec-----CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCc----CcccHHHHHHHHHHHH Q lcl|NC_011801. 233 TGENAGRAVVL-----DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQ----DAQSNITMIRAFYQSS 303 (386) Q Consensus 233 ~~~~~g~~~vl-----~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~----~~~~~~~~~~~~~~~~ 303 (386) +..|.++++++ ++|+++++++.+++|+||.|.+++++++||++|||||.++|..+ .+++.+++.+.|+++| T Consensus 233 g~~n~~~~~i~~p~g~~~G~~~~pls~~~~d~qf~e~k~~~~~dIa~a~~VPp~llGi~~~~~~~~~~~e~~~~~f~~~~ 312 (345) T protein:vir:37 233 GVGNFRSMFVNIANGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYREVYHYDE 312 (345) T ss_pred CcccccceEEEcCCCcccceEEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCccCCCCCCcccHHHHHHHHHHHH Confidence 66888888877 57999999999999999999999999999999999999998643 3456789999999999 Q ss_pred HHHHHHHHHHHHHHhhh----hhhhhcchhhhc Q lcl|NC_011801. 304 LSIYIKPIESELSQKLG----TDVKLDIASAID 332 (386) Q Consensus 304 l~P~~~~ie~~l~~~l~----~~~~fd~~~~l~ 332 (386) |.|+++.||+++|+.+. ..++|+-.++.+ T Consensus 313 l~P~~~~ie~~ln~~~~~~~~~~i~F~~~~L~~ 345 (345) T protein:vir:37 313 VMPLQEIIAETINQDPEIKNLLKIKFREQNFAK 345 (345) T ss_pred HHHHHHHHHHHhhhhccCCCcceEEecchhhcC Confidence 99999999999998641 345566555544 No 98 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=100.00 E-value=5.4e-53 Score=307.13 Aligned_cols=307 Identities=10% Similarity=0.084 Sum_probs=230.2 Q ss_pred hhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhcc---Cceeec------c-h-hHHH Q lcl|NC_011801. 4 LSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAG---CRFVTN------A-Q-PITD 72 (386) Q Consensus 4 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~---~p~~~~------~-~-~~~~ 72 (386) +++-.+++.......+.....++.+.+. +....+..|+...-+..+. -|+... + + .... T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~p~~~----------~~~~~~~~~~~~~~~~~~~~~~pP~~~~~La~l~~~~~~h~~ 70 (337) T protein:vir:78 1 MTKRQQQPAQAAASSPRPSVVFSMPEAI----------DPTAWMTDYTGVFYNPYGEYYQPPIDRKGLAKVARANAHHGA 70 (337) T ss_pred CCCcccCcccccccCceeEEEecCcccc----------cCcchhHhhhhhhhccCcceecCCCCHHHHHHHhhcchhhhh Confidence 3322222222222222222333322222 1222233344433332221 222110 0 1 1234 Q ss_pred HHhccCcccCCH----HHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccce Q lcl|NC_011801. 73 VLNAPLGNLMSG----FSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSG 148 (386) Q Consensus 73 ~l~~~PN~~~s~----~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~ 148 (386) +|..+||+.++. .++++.++.+++++||||++++|+..|++++|+||+|.+|++..+.. .+|... .+. T Consensus 71 ~L~~k~N~~~~~f~~~~~~~~~~~~d~ll~GNay~~~~rn~~G~~~~L~pl~~~~v~~~~d~~---~~~~~~-----~~~ 142 (337) T protein:vir:78 71 ILMARRNMVAGRFTNQRATITAFVHNYLQFGDGGLLKLRNSFGQVVGLHPLSSVYLRRREDGC---FVYLQQ-----GKP 142 (337) T ss_pred HHHhhhccccccCcCcHHHHHHHHHHHHhhCCeEEEEEECCCCcEEEEEEeCCceeEeeeCCe---EEEEEc-----CCc Confidence 677789987765 46899999999999999999999999999999999999999876543 222221 235 Q ss_pred eEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHH Q lcl|NC_011801. 149 DFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSF 228 (386) Q Consensus 149 ~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~ 228 (386) .+.+++++|+|+|.. ++.++++|+||+..+.+++.+..+++.+..++|+||++|+++|..++..+++++.+++++.| T Consensus 143 ~~~~~~~eIiHik~~---~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~p~~il~~~~~~l~~e~~~~lk~~~ 219 (337) T protein:vir:78 143 NLIYRPDDVIWLAQY---DPEQQVYGMPDYLGGLQSALLNQDATLFRRRYFLNGAHMGFIFYATDPNMDDDTEEEMKEMI 219 (337) T ss_pred eEEECCccEEEECCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHH Confidence 578999999999853 45688999999999999999999999999999999999999999888789999999999999 Q ss_pred HHHhcccccCcceec-----CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCc-----CcccHHHHHHH Q lcl|NC_011801. 229 EEQTTGENAGRAVVL-----DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQ-----DAQSNITMIRA 298 (386) Q Consensus 229 ~~~~~~~~~g~~~vl-----~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~-----~~~~~~~~~~~ 298 (386) ++..++.|.++++++ ++|++|++++.+++|+||+|.+++++++||++|||||.++|... ++++.+++... T Consensus 220 ~~~~G~~n~~~~~v~~~~g~~~Gi~~~pis~~~~d~qfle~k~~s~~eIa~a~~VPp~llGi~~~~~~~~~~n~e~~~~~ 299 (337) T protein:vir:78 220 ANSKGVGNFRSMFVNIPDGKPDGIKLIPVGDIATKDEFAAIKGITAQDVLTAHRYPPALAGIIPTNGGGGLGDPEKYDAT 299 (337) T ss_pred HHhcCcccccceEEEcCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcccccCCCcCccccHHHHHHH Confidence 987666788888887 57899999999999999999999999999999999999998633 23467889999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhh-----hhhhcchhhh Q lcl|NC_011801. 299 FYQSSLSIYIKPIESELSQKLGT-----DVKLDIASAI 331 (386) Q Consensus 299 ~~~~~l~P~~~~ie~~l~~~l~~-----~~~fd~~~~l 331 (386) |+++||.|+++.||++++++|.. .|+++...++ T Consensus 300 f~~~~L~P~~~~ie~~~n~~ll~~~~~~~f~~~~~~~~ 337 (337) T protein:vir:78 300 YARNEVLPLCELVQDAINSAGLPRALWVTFRETIGAAV 337 (337) T ss_pred HHHHHHHHHHHHHHHHHhhhcCChhhceeccccccccC Confidence 99999999999999999988753 3445555556 No 99 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=100.00 E-value=9.8e-53 Score=305.70 Aligned_cols=309 Identities=12% Similarity=0.106 Sum_probs=230.2 Q ss_pred Cchhhhhccccc---cCCccchhhhhhcccccc----------------cCcccccHHH----HhccHHHHHHHHHHHHh Q lcl|NC_011801. 1 MAFLSNLFKRQK---MLSGSSPVWILNQGQPVS----------------IKPKAITSAI----ALKNSDVYAVISRVSSD 57 (386) Q Consensus 1 Mg~~~~l~~~~~---~~~~~~~~~~~~~~~~~~----------------~~~~~i~~~~----a~~~~~v~~~v~~ia~~ 57 (386) |.=- .++.. .+..........++.+.. +...+++... ...++.+.+|+...++. T Consensus 1 ~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~epp~~~~~la~~~~~~~~h~~~i~~k~n~ 77 (345) T protein:vir:37 1 MKTN---VKTDNKKGIVIAPINDRTFSLSEITASPALDYVGIGFDENYNCYLPPVNRHALAKLPHQNAQHGGILHSRANM 77 (345) T ss_pred CCcc---ccccchhhhcCCCceEEEeecCCcccchhhcccceeeecCCccccCCCCHHHHHHHhhcchhhcchhhhhhhH Confidence 2221 11111 000000001111111111 1111222211 11233333444333333 Q ss_pred hccCceeecchhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEE Q lcl|NC_011801. 58 IAGCRFVTNAQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTY 137 (386) Q Consensus 58 ia~~p~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~ 137 (386) ++ ...+||+.||+.+|+ .++.+++++||||++++|+..|++++|+|++|.+|++..+.+...... T Consensus 78 l~--------------~~~~Pn~~~t~~~f~-~~v~d~ll~Gnay~~i~rn~~G~~~~L~pl~~~~vr~~~d~~~~~~~~ 142 (345) T protein:vir:37 78 VS--------------ATYEGGKALSKMEMR-ALCLNLIQFGDVGLLKVRNGFGQVVRLVPLSSLYLRVHKDGGYSYLMK 142 (345) T ss_pred Hh--------------hccCCCCCCCHHHHH-HHHHHHHhcCCeEEEEEECCCCCEEEEEEecCceeEEeecCCeeEEEe Confidence 22 234699999999997 556799999999999999999999999999999999887766544332 Q ss_pred EEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCC Q lcl|NC_011801. 138 TVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLG 217 (386) Q Consensus 138 ~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~ 217 (386) .+.+. ..+....+++++|+|++. +++.++++|+||+..+.+++....+++.+..++|+||++|++++..++..++ T Consensus 143 ~~~~~--~~g~~~~~~~~eViHir~---~~~~~~~~Gl~~~~~a~~si~l~~~a~~~~~~~f~NGa~~~~Il~~t~~~l~ 217 (345) T protein:vir:37 143 KSLYD--TAQEIYRYDAKDIIFIKL---YDPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTDPDLT 217 (345) T ss_pred eeeec--cCceEEEEccccEEEEcC---CCCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCC Confidence 22222 235667899999999984 3456789999999999999999999999999999999999999998888899 Q ss_pred HHHHHHHHHHHHHHhcccccCcceec-----CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCc----C Q lcl|NC_011801. 218 KEAKENTRQSFEEQTTGENAGRAVVL-----DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQ----D 288 (386) Q Consensus 218 ~~~~~~~k~~~~~~~~~~~~g~~~vl-----~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~----~ 288 (386) +++.++++++|++.+++.|.++++++ ++|+++++++.+++|+||++.+++++++||++|||||.++|..+ + T Consensus 218 ~e~~~~lk~~~~~~~g~~n~~~~~i~~~~g~~~G~~~~pl~~~~~d~qf~e~k~~~~~dI~~a~~VPp~liGi~~~~t~~ 297 (345) T protein:vir:37 218 EEMEEEIARKISESKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGG 297 (345) T ss_pred HHHHHHHHHHHHHhcCccccCceeEecCCCCccceeEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhccccCCCCC Confidence 99999999999998877777666665 56799999999999999999999999999999999999998643 2 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHhh----hhhhhhcchhhhc Q lcl|NC_011801. 289 AQSNITMIRAFYQSSLSIYIKPIESELSQKL----GTDVKLDIASAID 332 (386) Q Consensus 289 ~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l----~~~~~fd~~~~l~ 332 (386) +++.+++.+.|+++||.|+++.||+++|+.+ ...++||..++++ T Consensus 298 ~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~~e~~~~~~i~F~~~~l~k 345 (345) T protein:vir:37 298 LGDPLKYREVYHYDEVMPLQEIIAETINQDPEIKNLLKIKFREQNFAK 345 (345) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceEEECchhhcC Confidence 4577899999999999999999999999864 2467888888888 No 100 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=100.00 E-value=7.4e-52 Score=300.91 Aligned_cols=317 Identities=14% Similarity=0.179 Sum_probs=227.0 Q ss_pred hhhhcccccc------CCccchhhhhhcccccccC-ccccc-------HHHHhccHHHHHHHHHH--HHhhccCceeecc Q lcl|NC_011801. 4 LSNLFKRQKM------LSGSSPVWILNQGQPVSIK-PKAIT-------SAIALKNSDVYAVISRV--SSDIAGCRFVTNA 67 (386) Q Consensus 4 ~~~l~~~~~~------~~~~~~~~~~~~~~~~~~~-~~~i~-------~~~a~~~~~v~~~v~~i--a~~ia~~p~~~~~ 67 (386) +++-.++... ...........++.+.... +..+. ...+..-|.-+..+-.+ |+.-.+-+++.+. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~f~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~~~i~~k~ 80 (344) T protein:vir:60 1 MSKKKGKTLQPAAKKMTASAPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPISFTGLAKSLRAAVHHSSPIYVKR 80 (344) T ss_pred CCcccCCCCCchHHhhcCCcCcEEEEEcCCceeecCCcchhHHHHhhhcCccccCCCCHHHHHHHHHhhhhhccchhhhh Confidence 2221111100 0001111122222221110 10000 00011111111111111 1111223444444 Q ss_pred hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccc Q lcl|NC_011801. 68 QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRS 147 (386) Q Consensus 68 ~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~ 147 (386) +-+... .+||+.||+.+| +.++.+++++||||++++|+..|++++|+||+|.+|++..+.+. +|.+. ..+ T Consensus 81 n~l~~~--~~Pn~~~t~~~f-~~~~~d~ll~Gnay~~i~rn~~G~~~~L~~l~~~~vr~~~~~~~---~~~v~----~~~ 150 (344) T protein:vir:60 81 NILAST--FIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGVEEDV---YWWVP----SFN 150 (344) T ss_pred hHHHhh--ccCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcEEEEEEcCcceEEEeecCCe---EEEEc----cCC Confidence 433332 369999999999 57889999999999999999999999999999999999877653 22222 234 Q ss_pred eeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHH Q lcl|NC_011801. 148 GDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQS 227 (386) Q Consensus 148 ~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~ 227 (386) ..+.+++++|+|++. +++.++++|+||+..+..++....+++.+..++|+||++|+++|+.++..+++++.++++++ T Consensus 151 ~~~~~~~~eIiHir~---~~~~~~~yGlsp~~~a~~si~l~~~a~~~~~~~f~NG~~pg~il~~~~~~ls~e~~~~ik~~ 227 (344) T protein:vir:60 151 EPTAFAPGSVFHLLE---PDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEMLREN 227 (344) T ss_pred eEEEEcCccEEEEcC---CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHH Confidence 567899999999984 34568899999999999999999999999999999999999999998888999999999999 Q ss_pred HHHHhcccccCcceec------CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCc----CcccHHHHHH Q lcl|NC_011801. 228 FEEQTTGENAGRAVVL------DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQ----DAQSNITMIR 297 (386) Q Consensus 228 ~~~~~~~~~~g~~~vl------~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~----~~~~~~~~~~ 297 (386) |++.++ .++++.+++ ++|++|++++.+++|+||+|.+++++++||++|||||.++|..+ ++++.+++.+ T Consensus 228 ~~~~~g-~~~~r~~~l~~p~g~~~g~~~~pis~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~~n~e~~~~ 306 (344) T protein:vir:60 228 MVKSKG-RNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAK 306 (344) T ss_pred HHHhcC-CCCCcceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHHH Confidence 998764 456677776 47899999999999999999999999999999999999998643 2466789999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhh-hhhcchhhhccCH Q lcl|NC_011801. 298 AFYQSSLSIYIKPIESELSQKLGTD-VKLDIASAIDSDN 335 (386) Q Consensus 298 ~~~~~~l~P~~~~ie~~l~~~l~~~-~~fd~~~~l~~d~ 335 (386) .|+++||.|+++.|| +++.+||.. ++|+.-.+...|- T Consensus 307 ~f~~~~L~Pl~~~~e-~ln~~lg~~~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 307 VFVRNELIPLQDRIR-EINGWLGQEVIRFKNYSLDTDNG 344 (344) T ss_pred HHHHHHHHHHHHHHH-HHHHhcCCcccccCccccCCCCC Confidence 999999999999998 599999854 5665554444444 No 101 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=100.00 E-value=1.9e-52 Score=304.11 Aligned_cols=242 Identities=16% Similarity=0.240 Sum_probs=205.0 Q ss_pred CchhhhhccccccCCccch-hhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec-------chhHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSP-VWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN-------AQPITD 72 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~-------~~~~~~ 72 (386) ||||.+.++|....+.... ..........+..+..|+.+.|+++|+|++||++||++||++|++++ +|++++ T Consensus 1 MglF~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~~~~~~~~~~ 80 (251) T protein:vir:46 1 MGIFYKNEKRDLQYNEDDLQMMVQTLPSFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVN 80 (251) T ss_pred CCccccccccccCCCccchhhhhhhhccccCcCcceechhhhhccHHHHHHHHHHHHhHhhCceEEeeCccccccchHHH Confidence 9999887766543332221 12233344555667779999999999999999999999999999886 478999 Q ss_pred HHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEE Q lcl|NC_011801. 73 VLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLY 152 (386) Q Consensus 73 ~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (386) +|+.+||+.||+++||+.++.+++++||||++++|+..|++++|+||+|++|++..+.++...++.+.......+....+ T Consensus 81 ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~~g~~~~~ 160 (251) T protein:vir:46 81 LLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIERNV 160 (251) T ss_pred HHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCceEEEEECCCCcEEEEEEEeccCCcceeEEE Confidence 99999999999999999999999999999999999999999999999999999999888777665555554455667889 Q ss_pred cccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_011801. 153 DSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQT 232 (386) Q Consensus 153 ~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~ 232 (386) +++||+|+|+. +.++++|+||+.++..++....+++++..++|+||++|+++|++++...++++.+++++.|++.+ T Consensus 161 ~~~diiH~r~~----~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~~e~~~~~~~~~~~~~ 236 (251) T protein:vir:46 161 KFEDMLDIKFY----SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFPKVL 236 (251) T ss_pred CCccEEEecCc----CCCCeeecCHHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHh Confidence 99999999864 35789999999999999999999999999999999999999999876667888999999999999 Q ss_pred cc-cccCcceecCCC Q lcl|NC_011801. 233 TG-ENAGRAVVLDQS 246 (386) Q Consensus 233 ~~-~~~g~~~vl~~g 246 (386) +| .|+|++++..+- T Consensus 237 ~g~~n~g~~~~gm~~ 251 (251) T protein:vir:46 237 VELNKLGKLSYSMNQ 251 (251) T ss_pred cCcccccccccccCC Confidence 98 688876663332 No 102 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=100.00 E-value=1.2e-51 Score=299.79 Aligned_cols=312 Identities=11% Similarity=0.129 Sum_probs=225.8 Q ss_pred CchhhhhccccccCCc-----------cchhhhhhcccccccCcccccHHHHhccHHHHHHHHH---------HHHhh-- Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSG-----------SSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISR---------VSSDI-- 58 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~---------ia~~i-- 58 (386) |+=-++..++..++.. ........++.+-.. .+....+....++.|-+. ||+.+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v----~~~~~~~~y~~~~~~~~~~~pp~~~~~la~~~~~ 76 (350) T protein:vir:11 1 MSKRRSHRRQQPVTVQSAQEGEFIPRQGGRAEAFTFGDPMPV----LDGRGILDYLECWPNGRWYEPPLSMEGLAKSVGS 76 (350) T ss_pred CCccccCCCcCccccCCcchhhhccccccceEEEEeCCceee----cCcchhhHHHHHhhcCccccCCCCHHHHHHHHhh Confidence 4433221111110000 011111222222111 111111111112222111 11111 Q ss_pred ---ccCceeecchhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCcee Q lcl|NC_011801. 59 ---AGCRFVTNAQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDL 135 (386) Q Consensus 59 ---a~~p~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~ 135 (386) .+-+++.+.+-+.. ..+||++||+++|++ ++.+++++||||++++|+..|++++|+||+|.+|++..+.+. T Consensus 77 ~~~h~~~l~~k~n~l~~--~~~Pn~~~t~~~f~~-~v~d~ll~Gnay~~~~rn~~G~~~~L~~l~~~~vr~~~~~~~--- 150 (350) T protein:vir:11 77 SVYLQSGLKFKRNMLAK--TFIPHRLLSRATFEQ-FSLDWLTFGSAYLEQPRSRLGTRMPLQAPLAKYMRRGTDLET--- 150 (350) T ss_pred hhhhccchhhhhhhhhh--cccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCCEEEEEEeCCceeEeeecCCe--- Confidence 11122222222222 236999999999975 678999999999999999999999999999999998876653 Q ss_pred EEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCC Q lcl|NC_011801. 136 TYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNAT 215 (386) Q Consensus 136 ~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~ 215 (386) +|.+. ..+..+.+++++|+|++.. ++.++++|+||+.++..++....++..+..++|+||++|+++|+.++.. T Consensus 151 ~~~~~----~~~~~~~~~~~eVihir~~---~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~~~gil~~~~~~ 223 (350) T protein:vir:11 151 FYQVR----SWKDEHEFEKGSVIQLREA---DINQEIYGVPEWFCALQSALLNESATLFRRKYYNNGSHAGFILYMTDAA 223 (350) T ss_pred EEEEe----eCCeEEEECcccEEEeCCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCC Confidence 23222 2345678999999999853 4567899999999999999999999999999999999999999998888 Q ss_pred CCHHHHHHHHHHHHHHhcccccCcceec-----CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC-- Q lcl|NC_011801. 216 LGKEAKENTRQSFEEQTTGENAGRAVVL-----DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD-- 288 (386) Q Consensus 216 ~~~~~~~~~k~~~~~~~~~~~~g~~~vl-----~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~-- 288 (386) +++++.+++++.|++..++.|+|+++++ ++|+++++++.+++|+||+|.+++++++||++|||||.++|..+. T Consensus 224 ls~e~~~~l~~~~~~~~G~~N~~~~~v~~~~g~~~g~~~~pl~~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t 303 (350) T protein:vir:11 224 QNEEDIDALRTALKTAKGPGNFRNLFVYAPNGKKEGIQLIPVSEVAAKDEFGSIKNISRDDQLAGLRVYPQLMGVVPQNA 303 (350) T ss_pred CCHHHHHHHHHHHHHhcCccccCceeeecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCC Confidence 9999999999999998777899998887 468999999999999999999999999999999999999986432 Q ss_pred --cccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh----hhhcchhh Q lcl|NC_011801. 289 --AQSNITMIRAFYQSSLSIYIKPIESELSQKLGTD----VKLDIASA 330 (386) Q Consensus 289 --~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~~~----~~fd~~~~ 330 (386) +++.+++.+.|+++||.|+++.||+ ++.+|+.. .+|++.++ T Consensus 304 ~~~sn~e~~~~~f~~~~L~P~~~~ie~-ln~~l~~~~~~F~~~~~~~l 350 (350) T protein:vir:11 304 GGFGSISDAAAVWASLELAPMQTRLQQ-VNEMIGEEVVRFAQFDAPGL 350 (350) T ss_pred CCcCCHHHHHHHHHHHHHHHHHHHHHH-HHhhcCccccccCcccccCC Confidence 4667899999999999999999985 88888753 34566666 No 103 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=100.00 E-value=2.5e-51 Score=298.03 Aligned_cols=317 Identities=14% Similarity=0.184 Sum_probs=227.6 Q ss_pred hhhhcccccc------CCccchhhhhhccccccc-CcccccHH-------HHhccHHHHHHHHHH--HHhhccCceeecc Q lcl|NC_011801. 4 LSNLFKRQKM------LSGSSPVWILNQGQPVSI-KPKAITSA-------IALKNSDVYAVISRV--SSDIAGCRFVTNA 67 (386) Q Consensus 4 ~~~l~~~~~~------~~~~~~~~~~~~~~~~~~-~~~~i~~~-------~a~~~~~v~~~v~~i--a~~ia~~p~~~~~ 67 (386) ++|-.++... ...........++.+... ++..+... .+..=|.-+..+-.+ |+.-.+-+++.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~~~i~~k~ 80 (344) T protein:vir:20 1 MSKKKGKTPQPAAKTMTASGPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPVSFTGLAKSLRAAVHHSSPIYVKR 80 (344) T ss_pred CCcccCCCCcchhhhhhccCCceEEEEcCCceEecCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhhhhCccceehh Confidence 3321111100 000011112222322221 11111100 000001111221111 1112233445544 Q ss_pred hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccc Q lcl|NC_011801. 68 QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRS 147 (386) Q Consensus 68 ~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~ 147 (386) +-+... -+||+.||+.+| +.++.+++++||||++++|+..|++++|+|+++.+|++..+.+. +|.+. ..+ T Consensus 81 n~l~~~--~~Pn~~lt~~~f-~~~~~d~ll~Gnay~~i~rn~~G~~~~L~pl~~~~vr~~~~~~~---~~~~~----~~~ 150 (344) T protein:vir:20 81 NILAST--FIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGVEEDV---YWWVP----SFN 150 (344) T ss_pred hhHHHh--ccCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcEEEEEEcCCceeEeeecCCE---EEEEc----cCC Confidence 444332 369999999999 67889999999999999999999999999999999999776653 22221 234 Q ss_pred eeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHH Q lcl|NC_011801. 148 GDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQS 227 (386) Q Consensus 148 ~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~ 227 (386) ..+.+++++|+|++.. ++.++++|+||+..+..++.+..+++.+..++|+||++|+++|.+++..+++++.++++++ T Consensus 151 ~~~~~~~~eIiHir~~---~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~p~~Il~~~d~~l~~e~~~~ik~~ 227 (344) T protein:vir:20 151 EPTAFAPGSVFHLLEP---DINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEMLREN 227 (344) T ss_pred eEEEEcCccEEEeCCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHH Confidence 5678999999999843 4567899999999999999999999999999999999999999988888999999999999 Q ss_pred HHHHhcccccCcceec------CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCc----CcccHHHHHH Q lcl|NC_011801. 228 FEEQTTGENAGRAVVL------DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQ----DAQSNITMIR 297 (386) Q Consensus 228 ~~~~~~~~~~g~~~vl------~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~----~~~~~~~~~~ 297 (386) |++.++ .++++.+++ ++|++|++++.++.|+||+|.+++++++||++|||||.++|..+ ++++.+++.+ T Consensus 228 ~~~~~g-~~n~r~l~l~~p~g~~~gi~~~pis~~~~d~qf~e~k~~s~~eIa~af~VPp~llGi~~~~t~~~~n~e~~~~ 306 (344) T protein:vir:20 228 MVKSKG-RNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAK 306 (344) T ss_pred HHHhcC-CCCccceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHHH Confidence 998764 456777776 46999999999999999999999999999999999999998643 2456799999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhh-hhhcchhhhccCH Q lcl|NC_011801. 298 AFYQSSLSIYIKPIESELSQKLGTD-VKLDIASAIDSDN 335 (386) Q Consensus 298 ~~~~~~l~P~~~~ie~~l~~~l~~~-~~fd~~~~l~~d~ 335 (386) .|+++||.|+++.|| +++.+||.. ++|+.-.+...|. T Consensus 307 ~f~~~~l~P~~~~~e-~in~~lg~~~i~F~~~~l~~~d~ 344 (344) T protein:vir:20 307 VFVRNELIPLQDRIR-EINGWLGQEVIRFKNYSLDTDND 344 (344) T ss_pred HHHHHHHHHHHHHHH-HHHHhcCCcccccCccccccCCC Confidence 999999999999998 588888754 4676555544444 No 104 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=100.00 E-value=4.8e-51 Score=296.43 Aligned_cols=317 Identities=14% Similarity=0.194 Sum_probs=225.5 Q ss_pred hhhhccccccC------Cccchhhhhhccccccc-CcccccHH-HHhcc------HHHHHHHHHHHH--hhccCceeecc Q lcl|NC_011801. 4 LSNLFKRQKML------SGSSPVWILNQGQPVSI-KPKAITSA-IALKN------SDVYAVISRVSS--DIAGCRFVTNA 67 (386) Q Consensus 4 ~~~l~~~~~~~------~~~~~~~~~~~~~~~~~-~~~~i~~~-~a~~~------~~v~~~v~~ia~--~ia~~p~~~~~ 67 (386) +++-.++.... ..........++.+... ++..+... ....+ |.-+..+-.+.+ .--+-+++.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~s~i~~k~ 80 (344) T protein:vir:56 1 MSKKKGKTPQPAAKTMTASAPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPVSFTGLAKSLRAAVHHSSPIYVKR 80 (344) T ss_pred CCCCCCCCCchhhHHhhcCCCceEEEEcCCceeecCcchhhhHHHhhhcCccccCCCCHHHHHHHHhhhhhhCccceehh Confidence 33221111000 00011112222222221 11111100 00011 111222222211 11123445544 Q ss_pred hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcccc Q lcl|NC_011801. 68 QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRS 147 (386) Q Consensus 68 ~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~ 147 (386) +-+.. .-+||++||+.+| +.++.+++++||||++++|+..|++++|+|+++..|++..+.+. +|.+. ..+ T Consensus 81 n~l~~--~~~Pnp~~t~~~f-~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~v~~~~~~~~---~~~~~----~~g 150 (344) T protein:vir:56 81 NILAS--TFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGVEEDV---YWWVP----SFN 150 (344) T ss_pred hhHHh--hcCCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceeEEeecCCE---EEEEe----cCC Confidence 44433 2369999999999 67889999999999999999999999999999999999876653 22222 235 Q ss_pred eeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHH Q lcl|NC_011801. 148 GDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQS 227 (386) Q Consensus 148 ~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~ 227 (386) ..+.+++++|+|++.. ++.++++|+||+.++..++....+++.+..++|+||++|+++|+.++..+++++.++++++ T Consensus 151 ~~~~~~~~dIiHir~~---~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~d~~ls~e~~~~lk~~ 227 (344) T protein:vir:56 151 EPTAFAPGSVFHLLEP---DINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEMLREN 227 (344) T ss_pred eEEEEcCccEEEECCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHH Confidence 6678999999999853 4568899999999999999999999999999999999999999998888999999999999 Q ss_pred HHHHhcccccCcceec------CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC----cccHHHHHH Q lcl|NC_011801. 228 FEEQTTGENAGRAVVL------DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD----AQSNITMIR 297 (386) Q Consensus 228 ~~~~~~~~~~g~~~vl------~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~----~~~~~~~~~ 297 (386) |++.. |.++++++++ ++|++|++++.+++|+||+|.+++++++||++|||||.++|..+. +++.+++.+ T Consensus 228 ~~~~~-g~~~~r~l~l~~p~g~~~G~~~~pis~~~~d~qf~e~k~~s~~eIa~afrVPp~llGi~~~~t~~~~n~eq~~~ 306 (344) T protein:vir:56 228 MVKSK-GRNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAK 306 (344) T ss_pred HHHhc-CCCCccceEEecCCCCccceeEEEcCCChHHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHHH Confidence 99876 4567888887 479999999999999999999999999999999999999986432 456789999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhh-hhhcchhhhccCH Q lcl|NC_011801. 298 AFYQSSLSIYIKPIESELSQKLGTD-VKLDIASAIDSDN 335 (386) Q Consensus 298 ~~~~~~l~P~~~~ie~~l~~~l~~~-~~fd~~~~l~~d~ 335 (386) .|+++||.|+++.||+ ++.+|+.. ++|+--.+...|- T Consensus 307 ~f~~~tL~Pl~~~ie~-~n~~l~~~~~~F~~y~l~~~~~ 344 (344) T protein:vir:56 307 VFVRNELIPLQDRIRE-INGWIGQEVIRFKNYSLDTDNG 344 (344) T ss_pred HHHHHHHHHHHHHHHH-HHhhhccccccCCCccccccCC Confidence 9999999999999985 78888753 3443222222222 No 105 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=100.00 E-value=4.5e-42 Score=247.27 Aligned_cols=205 Identities=11% Similarity=0.127 Sum_probs=166.3 Q ss_pred eEEeecCCCceeEEEEeccCc-ccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_011801. 124 VTVALDDYGKDLTYTVHFDDS-KRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHA 202 (386) Q Consensus 124 v~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng 202 (386) |++..+. .++|.+..... ..+....++++||+|+|.. .+.++++|+||+.++..++....++++|+.++|+|| T Consensus 1 ~r~~~dg---~~~y~~~~~~~~~~g~~~~~~~~eilH~r~~---~~~~~~~Glspi~~a~~~i~~~~aa~~~~~~~f~Ng 74 (219) T protein:vir:98 1 MRVCKDG---NYKYLMKKSLYDTKSEIYEYNKNDVIFIKLY---DPMQQVYGSPDYVGGITSALLNSDATIFRRRYYSNG 74 (219) T ss_pred CceeecC---eEEEEEecceecCCceeEEeccccEEEecCC---CCCCCcceecHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 4443332 23444333221 2345678999999999854 456889999999999999999999999999999999 Q ss_pred CCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceec-----CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhC Q lcl|NC_011801. 203 IKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVL-----DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFG 277 (386) Q Consensus 203 ~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl-----~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~g 277 (386) ++|+++|++++..+++++.+++++.|++..++.|+++++++ ++|++|++++++++|+||+|++++++++||++|| T Consensus 75 ~~p~gil~~~~~~l~~e~~~~~~~~~~~~~g~~n~~~~~l~~~gg~~~G~~~~~~~~~~~d~qfle~rk~~~~eIa~~fg 154 (219) T protein:vir:98 75 AHMGFILYSTDPDMTEEMEDEIAERIRDSKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGQKDEFANIKNISAQDVLTSHR 154 (219) T ss_pred CCCceEEEeCCCCCCHHHHHHHHHHHHHhcCcccccceeEecCCCCccceeEEEccCCHHHHHHHHHHHhhHHHHHHHhC Confidence 99999999888789999999999999987766777666665 5689999999999999999999999999999999 Q ss_pred CCHHHhcCCc----CcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhh----hhhhhcchhhhccC Q lcl|NC_011801. 278 IPADYLSGKQ----DAQSNITMIRAFYQSSLSIYIKPIESELSQKLG----TDVKLDIASAIDSD 334 (386) Q Consensus 278 vp~~~l~~~~----~~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~----~~~~fd~~~~l~~d 334 (386) |||.+||..+ ++++.+++...|+++||.|+++.||++||++++ .+++|+.+.....+ T Consensus 155 VPp~~lG~~~~~~~~~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~~~~~~~~~~~F~~~~~~d~~ 219 (219) T protein:vir:98 155 FPPGLSGIIPVNTAGLGDPLKIREAYQADEVLPLQEIIAESINSDYEIKSALKVNFKQPEKRDKN 219 (219) T ss_pred CCHHHcccccCCCCCccCHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCccEEeecCcccccCC Confidence 9999998643 356789999999999999999999999998753 23445443333333 No 106 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.95 E-value=3.7e-29 Score=176.45 Aligned_cols=376 Identities=13% Similarity=0.131 Sum_probs=233.9 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeecch----hHHHHHhc Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTNAQ----PITDVLNA 76 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~~----~~~~~l~~ 76 (386) |.+.|.+..-........+......+.+.......+ ...+.+++.++++|+.+|++..+.++++... ...+.+.. T Consensus 1 ~~~~D~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l-~a~Y~~~~l~~~~vd~~a~d~~r~~~~i~~~d~~~~~~~~~~~ 79 (437) T protein:vir:52 1 MKFFDGIKSLALKLGSKQEQTYYSPSLSLTDDLVQL-EALWRDNWIANKVCIKRPEDMVRNWREIYSNDLNSKQLDLFTK 79 (437) T ss_pred CchhhhhHhHHhcCCCccccceeecCccccccHHHH-HHHHHhCchhhHHhhcchHHhhcCCceEecCCCCHHHHHHHHH Confidence 888877632111111111111111111111111111 2235678999999999999999988887432 12222322 Q ss_pred cCcccCCHHHHHHHHHHHHHHhCCeEEEEeecC---------CCceEEEEEEcCcceEEeecC--------CCceeEEEE Q lcl|NC_011801. 77 PLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDT---------NGYPVRIEPVPNEKVTVALDD--------YGKDLTYTV 139 (386) Q Consensus 77 ~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~---------~g~~~~l~~l~~~~v~~~~~~--------~~~~~~~~~ 139 (386) .-+..- ..+-+..++.+.-++|.|+++++.+. .|.++.+.++++..+++.... .+.+..|.+ T Consensus 80 ~~~~l~-~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~~~~~~~dp~s~~fg~p~~y~v 158 (437) T protein:vir:52 80 FERSLK-LRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPKWKISPTGTKDDDVLSPNFGRYSEYSI 158 (437) T ss_pred HHHhhc-HHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEechhhccccccccccccccccCcceEEEE Confidence 222222 23444555666668999999998865 367888999999888742211 234445554 Q ss_pred eccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCC--CCCC Q lcl|NC_011801. 140 HFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPN--ATLG 217 (386) Q Consensus 140 ~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~--~~~~ 217 (386) . +.+..+.+.++.|+||.....+.+.+...|.|+++.+...+.....+.......+.+...+ ++++++ ..++ T Consensus 159 ~----~~~~~~~iH~SRii~~~~~~~~~~~~~~~G~s~le~~~~~i~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~l~ 232 (437) T protein:vir:52 159 L----GGSQSITVHHSRLIILNANDAPLSDNDIWGVSDLEKIIDVLKRFDSASVNVGDLIFESKID--IFKIAGLSDKIA 232 (437) T ss_pred e----cCCcceeEccceeEEecCccCCCccccccCCchHHHHHHHHHHHHHHHHHHHHHHHHcCCC--ceecchHHHHhc Confidence 3 2234467899999999765555566778899999999999999999999999988776554 344432 2233 Q ss_pred HHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCc--ccHHHH Q lcl|NC_011801. 218 KEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDA--QSNITM 295 (386) Q Consensus 218 ~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~--~~~~~~ 295 (386) ....+.+.++++......+.+++++++.+.+|+.++.+..++ .+..+....+||++.+||..+|.....+ +..++. T Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~e~~~~~~sgl--~~~l~~~~~~iaaa~~iP~t~L~G~s~~Glasge~D 310 (437) T protein:vir:52 233 AGMENEVASVISAVQEIKSATNSLLLDAENEYDRKELTFTGL--KDLLTEFRNAVAGAADMPVTILFGQSVSGLASGDED 310 (437) T ss_pred CCcHHHHHHHHHHHHHhcCCCceEEEcCCcceEEEecCcCCH--HHHHHHHHHHHHHHhcCchhhhcCcCcccccccHHH Confidence 333344555555444455567799999999999998877765 5778899999999999999888543322 233555 Q ss_pred HHHHHH-------HHHHHHHHHHHHHHHHhhh----hhhhhcchhhhccCHHHHH-------HHHHHHHhCCCcCHHHHH Q lcl|NC_011801. 296 IRAFYQ-------SSLSIYIKPIESELSQKLG----TDVKLDIASAIDSDNSELI-------NNVQKLASAGVLAPIQAQ 357 (386) Q Consensus 296 ~~~~~~-------~~l~P~~~~ie~~l~~~l~----~~~~fd~~~~l~~d~~~~~-------~~~~~~~~~g~~t~nE~R 357 (386) .+.||. .-+.|.++.+-+.+-+..+ ..+.|.+.++...+.++++ +++.+++++|+++++|+| T Consensus 311 ~~~yyd~i~~~Qe~~l~p~le~l~~~i~~~~~g~~~~~~~~~f~pL~~~s~kekae~~~~~a~a~~~~~~~g~i~~~e~r 390 (437) T protein:vir:52 311 IQNYHEAIRRLQETRLRPIFEIIDPLICNELFGGLPADWWFEFVPLTTVKQEQQINMLNTFATAANTLIQNGVLNEYQIA 390 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCcCCcCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHH Confidence 566665 4577777777776655443 3567778888888866655 457788999999999999 Q ss_pred HHhccCCc---CCCCCCCcc--ccccccCCCCCC Q lcl|NC_011801. 358 KLLKNRGV---FPELDLDEG--TNLLDNTKNIND 386 (386) Q Consensus 358 ~~lg~~p~---~p~~~~~~~--~~~~~~~~~~~~ 386 (386) +.|...+. .+.++..+. .++...+....+ T Consensus 391 ~~L~~~g~~~~i~~~~~~~~~~~~~~~~~~~~~~ 424 (437) T protein:vir:52 391 NELRESGLFANISAEHIEELKNADEFAGNFEEPE 424 (437) T ss_pred HHHHhcCCCCCCCccccccccCCCCCCCccCCCC Confidence 99853332 222222211 111101000001 No 107 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.88 E-value=2e-23 Score=144.98 Aligned_cols=374 Identities=11% Similarity=0.090 Sum_probs=216.4 Q ss_pred CchhhhhccccccCCccchhhhhhccccccc------Cccccc----HHHHhccHHHHHHHHHHHHhhccCceeecchhH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSI------KPKAIT----SAIALKNSDVYAVISRVSSDIAGCRFVTNAQPI 70 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~------~~~~i~----~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~~~~ 70 (386) ||+|-+- +++.....+ .....+...... .....+ ...+.+++.++.+|+.+|++..+..+++...+- T Consensus 1 ~~~~m~~--~~~~~~~~D-~~~~~~~~~~g~~~~~~~~~~~~~~~~l~~~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~~~ 77 (435) T protein:vir:79 1 MGVFMSD--KVKAITKED-GYNEIFGSKDGTFRPNAFYMQRAAFKALSQFYEEDGMARRIVDVIPEEMVTPGFKVDGVKN 77 (435) T ss_pred CCccccc--ccccchhhc-chhhhhcccccccccCcccCCcCCHHHHHHHHhcCchhhhhhccchHHhhcCCceecCCCh Confidence 9999542 222211111 111212221111 112222 123457888999999999999888888765433 Q ss_pred HHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEee-c---------CCCceEEEEEEcCcceEEee-------cCCCc Q lcl|NC_011801. 71 TDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDR-D---------TNGYPVRIEPVPNEKVTVAL-------DDYGK 133 (386) Q Consensus 71 ~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~-~---------~~g~~~~l~~l~~~~v~~~~-------~~~~~ 133 (386) ...+...-....-+ +-+..++.+..++|.|++++.. + ..|....|.++++..+++.. ...+. T Consensus 78 ~~~~~~~~~~l~~~-~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~~~g~i~~i~v~d~~~i~~~~~~~dp~sp~fg~ 156 (435) T protein:vir:79 78 EKSFKSRWDELRLN-AKIIDALSWSRLFGGSAILAVVADNKMLKSPVKPGAQLEDIRVYDRYQITIHERETNARSVRYGE 156 (435) T ss_pred HHHHHHHHHHhhHH-HHHHHHHHhhhccccEEEEEEecCCCCcccccccCCceeeEEeechhhccchhhccCCcccccCc Confidence 33333222333223 3444555556688988887753 2 33455678888887776432 12233 Q ss_pred eeEEEEeccCcccceeEEEcccceeeecccccc---CcccccccccHH-HHHHHHHHHHHHHHHHHHHHHhccCCCceEE Q lcl|NC_011801. 134 DLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSG---ESDTQYMGIPPI-DSLLNEIEVQDLSSKLAISTLRHAIKPSIFI 209 (386) Q Consensus 134 ~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~---~~~~~~~G~s~~-~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l 209 (386) +..|.+... .....+.+.++.++||.....+ .+....+|.|++ +.+...+.....+.......+...... ++ T Consensus 157 P~~y~v~~~--~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~~~--v~ 232 (435) T protein:vir:79 157 PKLYKISPG--GDIPEFFVHYSRICIIDGERVSNEKRRQNDGWGASILNKRLIEAIVDYNYCQELATQLLRRKQQA--VW 232 (435) T ss_pred ceEEEEecC--CCCCceEEcceeEEEecCCcchhhhccccCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCc--cc Confidence 445554322 2223466889999998643222 123557899998 578899999999988888877655443 23 Q ss_pred eeCC--CCC-CHHHHHHHHHHHHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcC Q lcl|NC_011801. 210 KVPN--ATL-GKEAKENTRQSFEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSG 285 (386) Q Consensus 210 ~~~~--~~~-~~~~~~~~k~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~ 285 (386) ++++ ..+ +.+....+.+++...... ++.+.+++.+++.+|+.++.+..++ .+..+....+||++.+||..+|.. T Consensus 233 ~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~i~~~~e~~e~~~~~lsgl--~~~~~~~~~~iaaa~~IP~t~L~G 310 (435) T protein:vir:79 233 KARDLALMCDDEEGRYAARLRLAQVDDESGVGKAIGIDATDEEYEVLNSDVSGV--PEFLQEKIDRIVALTGIHEIIIKN 310 (435) T ss_pred cchhHHHhhcCccchHHHHHHHHHHHHhcCCCCceeEecCCcceEEEecccCCH--HHHHHHHHHHHHhhhCCCeeeecc Confidence 3332 111 222333444445443333 2334466666667899988877654 677899999999999999977643 Q ss_pred -CcCc--ccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhhhhhhhcchhhhccCHHHHH-------HHHHHHHhC Q lcl|NC_011801. 286 -KQDA--QSNITMIRAFYQS-------SLSIYIKPIESELSQKLGTDVKLDIASAIDSDNSELI-------NNVQKLASA 348 (386) Q Consensus 286 -~~~~--~~~~~~~~~~~~~-------~l~P~~~~ie~~l~~~l~~~~~fd~~~~l~~d~~~~~-------~~~~~~~~~ 348 (386) +..+ +..++..+.||.. .+.|.++.+-+.+-.. ..+.|.+.++...|.++++ +++++++++ T Consensus 311 ~s~~glnstgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~li~~s--~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~ 388 (435) T protein:vir:79 311 KNTGGVSASQNTALETFYKLIDRKRVEDYKPILEFLLPFMISE--TEWSIEFEPLSVPSDKDKAEIMAKNVESVVKLKAE 388 (435) T ss_pred CCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC--CCCeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhc Confidence 3332 2224455555543 3555555444333221 4667778888888876655 456778899 Q ss_pred CCcCHHHHHHHhc----cCCcCCCC----CCCccccccccCCCCCC Q lcl|NC_011801. 349 GVLAPIQAQKLLK----NRGVFPEL----DLDEGTNLLDNTKNIND 386 (386) Q Consensus 349 g~~t~nE~R~~lg----~~p~~p~~----~~~~~~~~~~~~~~~~~ 386 (386) |+++++|+|+.|- ..+..++. +-.+.+++=..+.++.| T Consensus 389 g~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~~~d~~~~~~~e~g~~ 434 (435) T protein:vir:79 389 QAINLKETRDTLRSICPDLKIMDNDNIELPEPEDLDPEPGQEGGLN 434 (435) T ss_pred CCCCHHHHHHHHHHhccccCCCCcccccCCccccCCCCCCCCCCCC Confidence 9999999998771 22222211 11122333445556666 No 108 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.87 E-value=1.7e-22 Score=139.89 Aligned_cols=372 Identities=9% Similarity=0.028 Sum_probs=210.2 Q ss_pred Cchhhhh----------------cccccc-----CCccchhhhhhcccccc----------cCccccc----HHHHhccH Q lcl|NC_011801. 1 MAFLSNL----------------FKRQKM-----LSGSSPVWILNQGQPVS----------IKPKAIT----SAIALKNS 45 (386) Q Consensus 1 Mg~~~~l----------------~~~~~~-----~~~~~~~~~~~~~~~~~----------~~~~~i~----~~~a~~~~ 45 (386) |-+.... ..+... +..+.+.....++.... .....+. ...+.+++ T Consensus 35 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~ 114 (537) T protein:vir:10 35 KPFTRAQLVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTFSAYANPNLSEGLVLWYAQQAFIGHQMCALIATHW 114 (537) T ss_pred hHHHHHHhhhhccCCCCCccCcccccccccccchhccccccchhhhhhhccccccchhhhhccccCCccHHHHHHHHhCc Confidence 1111100 000000 00000000001110000 0001111 12345789 Q ss_pred HHHHHHHHHHHhhccCceeecc-------hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeec-CC------- Q lcl|NC_011801. 46 DVYAVISRVSSDIAGCRFVTNA-------QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRD-TN------- 110 (386) Q Consensus 46 ~v~~~v~~ia~~ia~~p~~~~~-------~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~-~~------- 110 (386) .++.+|+.+|+++.+-++++.- ....+.|....+....+..|.+.+. +..++|.+++++.-. .+ T Consensus 115 l~r~iVd~~A~d~~r~~~~i~~~~~~~~~~~~~~~l~~~~~~l~~~~~l~~a~~-~~rlyG~~~i~i~v~~~D~~~~~~P 193 (537) T protein:vir:10 115 LVNKACSQMPRDAMRKGYKIISDDGNELDPKDAKFIDRYDRAFNIKKHAIQFVR-KGRIFGIRIALFKVDSPDPYYYEKP 193 (537) T ss_pred hhhhhhhhhhHHhhcCCceeecCCcccccHHHHHHHHHHHHHhhHHHHHHHHHH-hcccccceEEEEeecCcCCcccccc Confidence 9999999999999888776532 1233445444455554555555444 444689988877432 22 Q ss_pred --------CceEEEEEEcCcceEEee----cCC------CceeEEEEeccCcccceeEEEcccceeeeccccccC---cc Q lcl|NC_011801. 111 --------GYPVRIEPVPNEKVTVAL----DDY------GKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGE---SD 169 (386) Q Consensus 111 --------g~~~~l~~l~~~~v~~~~----~~~------~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~---~~ 169 (386) |....|.+++|..+.+.. ..+ +.+..|.+. ...+.++.|+||.....++ +. T Consensus 194 l~~~~i~kg~~k~l~vidp~~~~~~~~~~~~~dp~sp~fg~P~~y~v~--------g~~iH~SRli~f~g~~~p~~~~~~ 265 (537) T protein:vir:10 194 FNIDGVMPGAYKGIVQIDPYWCAPLLDAQASSNPVSMHFYEPTYWLIN--------GKKYHRSHLAIYINDEVVDFLKPS 265 (537) T ss_pred cccccccccceeEEEEechhhcccccchhhhccCCccccCCceeeeec--------CeEecceeEEEecCCCCchhhhcc Confidence 234567788887776521 111 222333321 1357889999986443222 23 Q ss_pred cccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCC-Cce Q lcl|NC_011801. 170 TQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQ-SAD 248 (386) Q Consensus 170 ~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~-g~~ 248 (386) .++.|.|.++.+...+.....+.......+.+.......+.......++++ +.++++.........++++++. +.+ T Consensus 266 ~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~~l~~~~~---~~~r~~~~~~~r~n~g~~~id~e~e~ 342 (537) T protein:vir:10 266 YIYGGVPLPQQIMERVYAAERTANEGPMLAMTKRQTVLKVDAAQVLANKQQ---FDETMSWWTATRDNYQVRVVDKDNED 342 (537) T ss_pred cCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcCHHH---HHHHHHHHHhhcCCcceeEecCCCce Confidence 446899999999999999999988888888776654332222222234443 3344443333433344677765 589 Q ss_pred eeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHh-cCCcCccc--HHHHHHHHHH------HHHHHHHHHHHHHHHHhh Q lcl|NC_011801. 249 VETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYL-SGKQDAQS--NITMIRAFYQ------SSLSIYIKPIESELSQKL 319 (386) Q Consensus 249 ~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l-~~~~~~~~--~~~~~~~~~~------~~l~P~~~~ie~~l~~~l 319 (386) |+.++.+..++ .+........||.+.|||...| |.+..+.+ .+...+.||. ..|.|.++.+.+.+-+.. T Consensus 343 ~e~~~~~lsgl--~~~l~~~~~~iAa~~~IP~t~L~G~sp~GlnatGe~D~~~yyd~I~~~Qe~l~p~l~~l~~ll~~~~ 420 (537) T protein:vir:10 343 VVQIDTTLNDL--DKVIMNQYQLVCAIARTPAPKMLGTVPTGFNSTGDYEEASYHEECESTQDDMRPLIDRHHQLVCRSH 420 (537) T ss_pred eEEEeccCCCH--HHHHHHHHHHHHhhhCCCceeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 99888776664 5678889999999999999966 43323322 3444455543 247888888887776654 Q ss_pred h---hhhhhcchhhhccCHHHHHHH-------HHHHHhCCCcCHHHHHHHhccCCcCCCCCCC-----cccccc------ Q lcl|NC_011801. 320 G---TDVKLDIASAIDSDNSELINN-------VQKLASAGVLAPIQAQKLLKNRGVFPELDLD-----EGTNLL------ 378 (386) Q Consensus 320 ~---~~~~fd~~~~l~~d~~~~~~~-------~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~-----~~~~~~------ 378 (386) + ..+.|.+.++...|.++++++ +++++.+|++++||+|+.|+..|-.+..+.. +..+.+ T Consensus 421 ~~~~~~~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~G~i~~~Evr~~L~~~~~~g~~~l~~~~~~ed~e~~~~~~~~ 500 (537) T protein:vir:10 421 LRKRIRVKVEFPPMDAPKESERADTFLKKMQAAKLAFEMGAVDGVDVNEYLRMDPTLGFTSITPAMRPTDAEDIDVDDEG 500 (537) T ss_pred CCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhccCccccccccCCCChhhhhcccCCccC Confidence 3 346677888989998887764 8889999999999999999865422111110 000000 Q ss_pred -----cc-CCCCCC Q lcl|NC_011801. 379 -----DN-TKNIND 386 (386) Q Consensus 379 -----~~-~~~~~~ 386 (386) .. +.+..+ T Consensus 501 ~~~~~~~~~~~~~~ 514 (537) T protein:vir:10 501 KPVRIIEDQPAPSE 514 (537) T ss_pred CcCCCCCCCCCccc Confidence 00 000000 No 109 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.86 E-value=1.9e-22 Score=139.68 Aligned_cols=374 Identities=11% Similarity=0.066 Sum_probs=215.0 Q ss_pred Cchhhhhcccc-ccCC----ccchhhhhhccc---cc-----c-cCccccc----HHHHhccHHHHHHHHHHHHhhccCc Q lcl|NC_011801. 1 MAFLSNLFKRQ-KMLS----GSSPVWILNQGQ---PV-----S-IKPKAIT----SAIALKNSDVYAVISRVSSDIAGCR 62 (386) Q Consensus 1 Mg~~~~l~~~~-~~~~----~~~~~~~~~~~~---~~-----~-~~~~~i~----~~~a~~~~~v~~~v~~ia~~ia~~p 62 (386) ++.-...--.. ...+ ..........+. .. . .....+. ...+.+++.++.+|+.+|+++.+-. T Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~r~~Vd~~aed~~r~~ 112 (532) T protein:vir:94 33 LGLATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRNALSFVEATSWPGFPTLALLAQLPEYRTMHETPADECVRAW 112 (532) T ss_pred hhhhhhhhhcccccccccccccccccccccccCcccccccccccccccccchHHHHHHHHcCchhhhhhccchHHHhhCC Confidence 11111000000 0000 000000000000 00 0 0011111 2234578899999999999998877 Q ss_pred eeecc-------hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecC-------------------CCceEEE Q lcl|NC_011801. 63 FVTNA-------QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDT-------------------NGYPVRI 116 (386) Q Consensus 63 ~~~~~-------~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~-------------------~g~~~~l 116 (386) +++.. ....+.+...-..+.-+ +-+..++.+..++|.+++++.... .|..+.| T Consensus 113 ~~i~~~~~~~~~~~~~~~i~~~~~~l~v~-~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p~~l~~~~I~~g~~~~l 191 (532) T protein:vir:94 113 GKITCSSKDELAADKATRITQKLEQYNVR-TLVRTVVIHDQAYGGAHVFPHLKMDGDSVPADAPLLLSPSFVQRGCLIGF 191 (532) T ss_pred ceEeeCCccccchHHHHHHHHHHHhhhHH-HHHHHHHHhhhcccceEEEEEeccCCccccccccccccccccccceeeEE Confidence 76632 23333443333333323 344455555568999988775432 2234678 Q ss_pred EEEcCcceEEeecC--------CCceeEEEEeccCcccceeEEEcccceeeeccccccC---cccccccccHHHHHHHHH Q lcl|NC_011801. 117 EPVPNEKVTVALDD--------YGKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGE---SDTQYMGIPPIDSLLNEI 185 (386) Q Consensus 117 ~~l~~~~v~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~---~~~~~~G~s~~~~~~~~i 185 (386) .+++|..|.+.... .+.+..|.+. ....+.++.++||.....++ +..++.|.|.++.+...+ T Consensus 192 ~vld~~~v~p~~~~~~dp~sp~fg~P~~y~v~-------~g~~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l 264 (532) T protein:vir:94 192 ATIEPMWLSPNAYNATDPTLPSFYKPDSWIAT-------SGKKIHSSRIHTVVGRPVGDMLKAAYSFRGVSISQLAMPYV 264 (532) T ss_pred EeechheecccccccccccccccCCceeEEEc-------cCeeeccceEEEecCCCchhhhccccccccccHHHHHHHHH Confidence 88888887764221 1122233221 11358899999986443322 224457999999999999 Q ss_pred HHHHHHHHHHHHHHhccCCCceEEeeC-CCCCCHHHHHHHHHHHHHHhcccccCcceecCC-CceeeeccCChhhHHHHH Q lcl|NC_011801. 186 EVQDLSSKLAISTLRHAIKPSIFIKVP-NATLGKEAKENTRQSFEEQTTGENAGRAVVLDQ-SADVETTNISPNVTEFLQ 263 (386) Q Consensus 186 ~~~~~~~~~~~~~~~ng~~~~~~l~~~-~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~-g~~~~~~~~~~~d~~~~e 263 (386) .....+......+........ +++. ...++.+..+.+.++++....+....++++++. ..+|+.++.+..++ .+ T Consensus 265 ~~~~~t~~~~~~l~~~~~~~v--~k~~~a~~ls~~~~~~~~~r~~~~~~~~~n~g~~~id~~~e~~e~~~~~lsgl--~~ 340 (532) T protein:vir:94 265 DNWLRTRQSVSDTVKQFSMTN--LATDMAQLLAPGGAQSLDARLQLFNLYRDNRNIGALDKGTEEIQQTNTPLSGL--DS 340 (532) T ss_pred HHHHHHHHHHHHHHHhcCCce--eeechHHhhcchhHHHHHHHHHHHHhhcCCccceEEcCCCceeEEEecccCCH--HH Confidence 999999888888776655432 2332 234566677788888876655543445777764 57899888777664 66 Q ss_pred HHHHHHHHHHHHhCCCHHHh-cCCcCccc--HHHHHHHHHH-------HHHHHHHHHHHHHHHHhhh----hhhhhcchh Q lcl|NC_011801. 264 NVSFSQDQIAKAFGIPADYL-SGKQDAQS--NITMIRAFYQ-------SSLSIYIKPIESELSQKLG----TDVKLDIAS 329 (386) Q Consensus 264 ~~~~~~~~Ia~~~gvp~~~l-~~~~~~~~--~~~~~~~~~~-------~~l~P~~~~ie~~l~~~l~----~~~~fd~~~ 329 (386) ..+....+||++.+||..+| |.+..+-+ .+...+.||. .-+.|+++.+-+.|-+..+ ..+.|.+.+ T Consensus 341 ~l~~~~~~iAaa~~IP~t~LfG~sp~GlnstGe~D~~~yyd~I~s~Qe~~l~p~le~l~~~l~~s~~g~~~~d~~~~f~p 420 (532) T protein:vir:94 341 LQAQSQEQMAAVSHIPLVKLLGITPNGLNASSDGEIRVWYDFIAGYQATNLTPLMEWIIDLIQLSEYGQIDPGLAWEWSP 420 (532) T ss_pred HHHHHHHHHHhHhCCCeeeeecCCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCceEEeCC Confidence 78888999999999999976 43322222 2334444544 4478888888888876543 356777888 Q ss_pred hhccCHHHHHH-------HHHHHHhCCCcCHHHHHHHhccCCcCCCCCCC----cc-cccc----------ccCC----- Q lcl|NC_011801. 330 AIDSDNSELIN-------NVQKLASAGVLAPIQAQKLLKNRGVFPELDLD----EG-TNLL----------DNTK----- 382 (386) Q Consensus 330 ~l~~d~~~~~~-------~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~----~~-~~~~----------~~~~----- 382 (386) +...+.+++++ ++++++.+|++++||+|+.++..|..+..+.. +. ..+. .+.. T Consensus 421 L~~~s~kEkAei~~~~a~a~~~~~~~Gvi~~~Evr~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 500 (532) T protein:vir:94 421 LMELDDKELAEVRQLNASTDSTLMELGVIDAKMVQQRLAADPTSGYAGALGERDELDDVEEIAKQLMAAALNPPATAPQT 500 (532) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhcCCccccccccccccccccccchhhhhcccccCCCCCCCCC Confidence 88888877654 46788999999999999999876542111110 00 0000 0000 Q ss_pred ------CCCC Q lcl|NC_011801. 383 ------NIND 386 (386) Q Consensus 383 ------~~~~ 386 (386) ...| T Consensus 501 ~~~~~~~~~d 510 (532) T protein:vir:94 501 PNPQPDSEDD 510 (532) T ss_pred CCCCCCCCCC Confidence 0001 No 110 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.84 E-value=2e-21 Score=134.07 Aligned_cols=375 Identities=15% Similarity=0.145 Sum_probs=210.9 Q ss_pred Cchhhhhc-cccccCCccchhhhhhcccccc---------cCcccccH----HHHhccHHHHHHHHHHHHhhccCceeec Q lcl|NC_011801. 1 MAFLSNLF-KRQKMLSGSSPVWILNQGQPVS---------IKPKAITS----AIALKNSDVYAVISRVSSDIAGCRFVTN 66 (386) Q Consensus 1 Mg~~~~l~-~~~~~~~~~~~~~~~~~~~~~~---------~~~~~i~~----~~a~~~~~v~~~v~~ia~~ia~~p~~~~ 66 (386) |+=.+.-. ++...............+.... .....++. ..+..++.++.+|+.+|+.+-+-++++. T Consensus 1 ~~~~~~a~~~~~~~~a~~~~~~~~~~g~~~~~d~~~~~~~~~~~~~~~~~l~~lY~~~~l~r~iVd~~a~d~~r~g~~i~ 80 (461) T protein:vir:80 1 MYSIDKAKQAKIDSKIVNRNDFMVGHGKANSRDKLTRQTPGNGQKLDLKACENLYASNSIAMNIVDIISEDMVRAGWSLK 80 (461) T ss_pred CccchhhhhhhhhhhhhhhhHHHhhcCCcchhhhhhccccCcccccCHHHHHHHHHhCCccchhhccchHHhhcCCeeee Confidence 43322110 0000000000000111111000 01111222 2345678889999999999887776553 Q ss_pred --chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEee-cCC------------CceEEEEEEc---CcceEE-- Q lcl|NC_011801. 67 --AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDR-DTN------------GYPVRIEPVP---NEKVTV-- 126 (386) Q Consensus 67 --~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~-~~~------------g~~~~l~~l~---~~~v~~-- 126 (386) +.+..+.+...-+....+ +-+..++.+..++|.|++++.. +.. +.+..+..|. +..+.. T Consensus 81 ~~~~~~~~~~~~~~~~l~~~-~~l~~~~~~~rl~G~a~i~i~v~d~~~~~~~~~~pl~~~~~~~~~~l~~~~~~~i~~~~ 159 (461) T protein:vir:80 81 TDNKEMKKNIESKWRKLKTK-DRFQKLYADKRLYGDGFLSIGVVSSNREQADLSTAIDPKTIKSIPYINTFNTQKVTQLY 159 (461) T ss_pred cCCHHHHHHHHHHHHHhhHH-HHHHHHHHhhcccccEEEEEEeecCCccccCccCCcccccccceeEEEeccccccchhh Confidence 334444443333333333 3455556666689999988853 211 1112232222 222211 Q ss_pred -ee----cCCCceeEEEEecc---------CcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHH Q lcl|NC_011801. 127 -AL----DDYGKDLTYTVHFD---------DSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSS 192 (386) Q Consensus 127 -~~----~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~ 192 (386) .. ...+.+..|.+... +........+.++.++|+..... .+..+|.|.++.+...+.....+. T Consensus 160 ~~~dp~sp~fg~P~~y~i~~~~~~~~~~~~~~~~~~~~~iH~SRii~~~~~~~---~~~~~G~S~le~~~~~l~~~~~~~ 236 (461) T protein:vir:80 160 LNQDMFSEHFGEVEFFEVNRVSQLGEEILSGTTASTSEQIHRSRIIHEQGLRF---EGETKGRSIFESLYDIITVMDTSL 236 (461) T ss_pred hcccCcCcccccceEEEEeccccccccccccccCccceEEccccEEEecCCCC---CccccCcchHHHHHHHHHHHHHHH Confidence 11 12334444444321 11223346789999999975432 245689999999999999999999 Q ss_pred HHHHHHHhccCCCceEEeeCC-CCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHH Q lcl|NC_011801. 193 KLAISTLRHAIKPSIFIKVPN-ATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQ 271 (386) Q Consensus 193 ~~~~~~~~ng~~~~~~l~~~~-~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~ 271 (386) .....++.+...+ ++++++ ..+..+....+.++++.... ..++++++.+.+++.++.+..++ .+..+..... T Consensus 237 ~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~~~~~~---~~g~~~~d~~e~~e~~~~~lsgl--~~~l~~~~~~ 309 (461) T protein:vir:80 237 WSVGQILYDFAFK--VYKTDDIDALNKDDKANLTAMLDFMFR---TEALAIIKGDEQLTKESTNVSGM--KDLLDYGWDY 309 (461) T ss_pred HHHHHHHHHhCCC--ceecchHHhhhchHHHHHHHHHHHhcC---CceEEEEcCCcceEEEecCcCCH--HHHHHHHHHH Confidence 9999888776654 344443 12333444455666664432 33488889989999988877765 5788999999 Q ss_pred HHHHhCCCHHHhcCCcCcccH--HHHHHHHHH-------HHHHHHHHHHHHHHHHhhh----------hhhhhcchhhhc Q lcl|NC_011801. 272 IAKAFGIPADYLSGKQDAQSN--ITMIRAFYQ-------SSLSIYIKPIESELSQKLG----------TDVKLDIASAID 332 (386) Q Consensus 272 Ia~~~gvp~~~l~~~~~~~~~--~~~~~~~~~-------~~l~P~~~~ie~~l~~~l~----------~~~~fd~~~~l~ 332 (386) ||++-+||...|.....+.++ ++..+.||. .-+.|+++.+-+.+-+..+ ..++|.+.++.. T Consensus 310 iaa~s~iP~t~L~G~s~g~~asge~D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~~~~~i~f~~L~~ 389 (461) T protein:vir:80 310 LAGAVRMPKTVLKGQEAGTLTGAQYDVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPDSFEWAIEFNPLWN 389 (461) T ss_pred HhhhhcCCeeeeecccCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCccccceEEEeCCCCC Confidence 999999999877443334333 444555544 3467777777776655432 246678888988 Q ss_pred cCHHHHHH-------HHHHHHhCCCcCHHHHHHHh-ccCCcCCCCCCCccc---ccc------ccCCCCCC Q lcl|NC_011801. 333 SDNSELIN-------NVQKLASAGVLAPIQAQKLL-KNRGVFPELDLDEGT---NLL------DNTKNIND 386 (386) Q Consensus 333 ~d~~~~~~-------~~~~~~~~g~~t~nE~R~~l-g~~p~~p~~~~~~~~---~~~------~~~~~~~~ 386 (386) .|.+++++ ++++++++|+++++|+|+.+ +..+..|....++-+ +.+ .......| T Consensus 390 ~s~kekAe~~~~~a~a~~~~~~~g~is~~e~r~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 460 (461) T protein:vir:80 390 LDSKTDAEVRKLTAEADQIYIVNGVLDPDEVKETRFGRFGLENSSKFSGDSAEIDKLAKLVYDAYAKKNAD 460 (461) T ss_pred CCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhcCCCCCccCCCCCchhhhhhhhccccccccCCC Confidence 88888765 47789999999999999977 333333322222110 111 01111111 No 111 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.84 E-value=3.8e-21 Score=132.51 Aligned_cols=372 Identities=14% Similarity=0.090 Sum_probs=210.4 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeecchhHHHHHhccCcc Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTNAQPITDVLNAPLGN 80 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~~~~~~~l~~~PN~ 80 (386) |--.|.+..- ...+..+. ...+.+....... -...+.+++.++++|+.+|+++.+..+++...+....+..+-.. T Consensus 1 ~~~~D~~~n~--~~gg~~~~--~~~~~~~~~~~~~-l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~~~~~~~~~~~~~~~ 75 (422) T protein:vir:10 1 MVKTDSYANI--FLGGSDGS--EIYGSLQNQAPTI-LASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWDD 75 (422) T ss_pred CccchhhHHH--HcCCCCCc--cccCcccccCHHH-HHHHHHhChhhHHHHhhhhHHHhcCCccccCCCHHHHHHHHHHH Confidence 4443332100 00000000 0001111111000 11235578899999999999998888877654433333322232 Q ss_pred cCCHHHHHHHHHHHHHHhCCeEEEEee-c---------CCCceEEEEEEcCcceEEee-------cCCCceeEEEEeccC Q lcl|NC_011801. 81 LMSGFSVWQAMIVQMMLTGNAFAIIDR-D---------TNGYPVRIEPVPNEKVTVAL-------DDYGKDLTYTVHFDD 143 (386) Q Consensus 81 ~~s~~~f~~~~~~~~~l~G~a~~~~~~-~---------~~g~~~~l~~l~~~~v~~~~-------~~~~~~~~~~~~~~~ 143 (386) .- ..+-+..++.+..++|.|++++.. + ..|....|.++++..+++.. ...+.+..|.+... T Consensus 76 l~-~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~~~~~dp~s~~fg~P~~y~v~~~- 153 (422) T protein:vir:10 76 LE-MTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQTREENPRNARFGEPLTYRITTN- 153 (422) T ss_pred hh-HHHHHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEeeccccccchhcccCccccccCcceEEEEecC- Confidence 22 234455555666689999888854 2 34566788889888876532 12344455555432 Q ss_pred cccceeEEEcccceeeecccccc---CcccccccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEeeCC--CCC- Q lcl|NC_011801. 144 SKRSGDFLYDSSEVIHFRCTVSG---ESDTQYMGIPPIDS-LLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPN--ATL- 216 (386) Q Consensus 144 ~~~~~~~~~~~~~vih~~~~~~~---~~~~~~~G~s~~~~-~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~--~~~- 216 (386) ..+..+.+.++.++|+.....+ .+....+|.|++.. +...+.....+.......+.+.... ++++++ +.+ T Consensus 154 -~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~~~~~~--v~~~~~l~~~~~ 230 (422) T protein:vir:10 154 -ESDMFYDVHYSRIHIIDGERIPNVMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLKRKQQA--VWKAKGLAELCD 230 (422) T ss_pred -CCCcceeeccceeEEeCCCCchhhhcccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhccc--cccchhHHHhcC Confidence 2233467888999998543322 13445689999986 6788999999988888877665543 334432 111 Q ss_pred CHHHHHHHHHHHHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCC-cCc--ccH Q lcl|NC_011801. 217 GKEAKENTRQSFEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGK-QDA--QSN 292 (386) Q Consensus 217 ~~~~~~~~k~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~-~~~--~~~ 292 (386) +.+....+.++++..... ++.+.+++.+++.+|++++.+..++ .+..+....+||++.+||...|... ..+ +.. T Consensus 231 ~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~lsgl--~~~~~~~~~~iaaa~~IP~t~L~G~s~~Glnatg 308 (422) T protein:vir:10 231 DSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGI--DAFLDKKFDRIVALSGIHEIILKNKNVGGVSSSQ 308 (422) T ss_pred CccchHHHHHHHHHHHHhcCCccceeEecCCcceEEEecccCCh--HHHHHHHHHHHHhhhCCCeeeeccCCcccccccc Confidence 233344455555544433 3344466666678999998887764 6779999999999999999877433 222 223 Q ss_pred HHHHHHHHHH-------HHHHHHHHHHHHHHHhhhhhhhhcchhhhccCHHHHH-------HHHHHHHhCCCcCHHHHHH Q lcl|NC_011801. 293 ITMIRAFYQS-------SLSIYIKPIESELSQKLGTDVKLDIASAIDSDNSELI-------NNVQKLASAGVLAPIQAQK 358 (386) Q Consensus 293 ~~~~~~~~~~-------~l~P~~~~ie~~l~~~l~~~~~fd~~~~l~~d~~~~~-------~~~~~~~~~g~~t~nE~R~ 358 (386) ++..+.||.. -+.|.++.+-+.+-+ -..+.|.+.++...+.++++ +++++++++|+++++|+|+ T Consensus 309 d~d~~~yyd~i~~~Qe~~l~p~l~~l~~~i~~--s~~~~~~f~pL~~~sekekaei~~~~a~a~~~~~~~g~i~~~e~r~ 386 (422) T protein:vir:10 309 NTALETFHKLVDRKRNAELLPILEFLIPFIVN--AEEWSVEFNPLAQESSKDKAEILEKNVNSIAALIAAGAMDIDEARD 386 (422) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--cCCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHH Confidence 4445555542 355555444433322 13455667788888876654 5577889999999999999 Q ss_pred Hhc----cCCcCCCC---CCCc-cccccccCCCCCC Q lcl|NC_011801. 359 LLK----NRGVFPEL---DLDE-GTNLLDNTKNIND 386 (386) Q Consensus 359 ~lg----~~p~~p~~---~~~~-~~~~~~~~~~~~~ 386 (386) .|- ..++.++. +.++ ....-.......| T Consensus 387 ~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 422 (422) T protein:vir:10 387 TLRTIAPEVKINDGSVETEVTISETSNDPLEVPTDD 422 (422) T ss_pred HhhhhcccccCCCCCCccccchhhcCCCCCCCCCCC Confidence 883 22221111 0000 1011111222222 No 112 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.83 E-value=7e-21 Score=131.08 Aligned_cols=370 Identities=11% Similarity=0.019 Sum_probs=207.0 Q ss_pred Cchh-------------------------------hhhccccccCCcc---chh----hhhhcccccccCcccccHHHHh Q lcl|NC_011801. 1 MAFL-------------------------------SNLFKRQKMLSGS---SPV----WILNQGQPVSIKPKAITSAIAL 42 (386) Q Consensus 1 Mg~~-------------------------------~~l~~~~~~~~~~---~~~----~~~~~~~~~~~~~~~i~~~~a~ 42 (386) |... +.+-.-....... .+. ............+.. ....+. T Consensus 66 ~~~~~~~~~~~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~~~~~f~gyq-l~alY~ 144 (862) T protein:vir:99 66 VEISDSVNAKSVSGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIGAEGKQSSYAVPEALQDWYLSQGFIGHQ-ACALIA 144 (862) T ss_pred ccccccccchhhhhhhhcchhhcchhhhhhhhhhhhcchhhhhhccccccccccccchhccccccccCcccHH-HHHHHH Confidence 1111 1110000000000 000 000000000000011 123456 Q ss_pred ccHHHHHHHHHHHHhhccCceeecc--------hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEee-cCC--- Q lcl|NC_011801. 43 KNSDVYAVISRVSSDIAGCRFVTNA--------QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDR-DTN--- 110 (386) Q Consensus 43 ~~~~v~~~v~~ia~~ia~~p~~~~~--------~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~-~~~--- 110 (386) +++.++.+|+.+|+++.+..+++.- .+..+.+...-....-+. -+..++.+.-|+|.+++++.. ..+ T Consensus 145 ~~~larkiVd~pAeDatR~g~~I~~~~d~~e~~~e~~~~ie~~~~rL~v~~-~l~eair~~RLyGga~ililv~~~D~~~ 223 (862) T protein:vir:99 145 QHWLVDKACSLAGEDAIRNGWHLKSLGEGEEIDEESLEKFKAIDVEFKVKE-NLIEFNRFKNVFGIRVAIFVVDSEDPDY 223 (862) T ss_pred hCchhhhhhhhhhHHHhhCCceEeecCcccccCHHHHHHHHHHHHHhhHHH-HHHHHHHhcccccceEEEEEecCcCchh Confidence 7889999999999999988877652 122333332222222233 344455555578877776542 112 Q ss_pred ------------CceEEEEEEcCcceEEe----ecCC------CceeEEEEeccCcccceeEEEcccceeeeccccccC- Q lcl|NC_011801. 111 ------------GYPVRIEPVPNEKVTVA----LDDY------GKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGE- 167 (386) Q Consensus 111 ------------g~~~~l~~l~~~~v~~~----~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~- 167 (386) |.++.|.+|+|..+.+. ...+ +.+..|.+. + ..+.++.++|+.....++ T Consensus 224 LsqPLn~e~I~kG~lkgl~vlDp~w~~p~~v~~~~~Dp~sp~yGkP~~y~I~------g--~~IH~SRliif~g~~vpd~ 295 (862) T protein:vir:99 224 YEKPFNPDGITPGSYRGISQIDPYWMMPMLTAESTADPSSQFFYEPEFWIIS------G--QKYHRSHLIIARGPQPADI 295 (862) T ss_pred hhcCcCcccccccceeEEEEechhhhcccccccccccccccccCCceeeeec------C--eeeccceeEEecCCCchhh Confidence 34567888888776642 1222 222333221 1 247778888875322211 Q ss_pred --cccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCC-CCCHHHHHHHHHHHHHHhcccccCcceecC Q lcl|NC_011801. 168 --SDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNA-TLGKEAKENTRQSFEEQTTGENAGRAVVLD 244 (386) Q Consensus 168 --~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~-~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~ 244 (386) +...+.|+|.++.+...|.....+......++.+.... +++++.. .+.. .+.+.++++......+..++++++ T Consensus 296 lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka~l~--v~ktd~l~~l~~--ed~l~~r~~~~~~~rdN~Gi~liD 371 (862) T protein:vir:99 296 LKPTYIFGGIPLVQRIYERVYAAERTANEAPLLAMNKRTT--AIHTDTAKAIAN--EDKFIQRLMFWVRYRDNHAVKVLG 371 (862) T ss_pred hhccCCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccc--eeechhHhhhcc--HHHHHHHHHHHHhccCcceeEEec Confidence 23335799999999999999999999999988876653 3344321 1222 133455555444443344589999 Q ss_pred CCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHh-cCCcCcc--cHHHHHHHHHH-------HHHHHHHHHHHHH Q lcl|NC_011801. 245 QSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYL-SGKQDAQ--SNITMIRAFYQ-------SSLSIYIKPIESE 314 (386) Q Consensus 245 ~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l-~~~~~~~--~~~~~~~~~~~-------~~l~P~~~~ie~~ 314 (386) .+.+|+.++.+..++ .+.......+||++.+||...| |.+..+. ..++..+.||. .-|.|.++.+... T Consensus 372 ~eEe~e~ls~slSGL--~dll~~~~q~IAaas~IP~tiLfGqspaGlnATGE~D~~nYyD~I~s~QE~~L~P~LerL~~l 449 (862) T protein:vir:99 372 TDETMEQFDTSLADF--DAVIMGQYQLVASIAKTPATKLLGTAPKGFNSTGEFETISYHEELESIQEHVYMPFLQRHYLI 449 (862) T ss_pred CCCceeEEecccCCh--HHHHHHHHHHHHhhhCCCceeecccCcccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 999999988877755 5678888889999999999965 4332333 33445555655 4588999999888 Q ss_pred HHHhhh--hhhhhcchhhhccCHHHHHHH-------HHHHHhCCCcCHHHHHHHhccCCcC-----CCCCCC-----ccc Q lcl|NC_011801. 315 LSQKLG--TDVKLDIASAIDSDNSELINN-------VQKLASAGVLAPIQAQKLLKNRGVF-----PELDLD-----EGT 375 (386) Q Consensus 315 l~~~l~--~~~~fd~~~~l~~d~~~~~~~-------~~~~~~~g~~t~nE~R~~lg~~p~~-----p~~~~~-----~~~ 375 (386) +...++ ..+.|.+.+|...+.++++++ +++++.+|+++++|+|+.|-..+.. +.++.. ..+ T Consensus 450 i~~~lg~~~d~~ieFnpL~~~sekEkAEi~kk~Aea~~~lv~sGvispdEvR~~L~~~~~~g~~~l~ded~E~d~~~~~e 529 (862) T protein:vir:99 450 SRLSLGIQHEIDVVMEPVASMTAQQQADLNKTKAEGGKVLIDGGVISPDEERNRIRDDKRSGYNRLTKEDAEETPGASPE 529 (862) T ss_pred HHHhcCCCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCcCCCCCCcccccccCCCCcc Confidence 777665 356777888988888877654 6789999999999999987322211 111100 000 Q ss_pred cccccCCCCCC Q lcl|NC_011801. 376 NLLDNTKNIND 386 (386) Q Consensus 376 ~~~~~~~~~~~ 386 (386) +.-...+.+.+ T Consensus 530 ~~~~~e~~g~a 540 (862) T protein:vir:99 530 NLAAYQKAGAA 540 (862) T ss_pred cccccccCCcc Confidence 00000000000 No 113 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.79 E-value=1.3e-19 Score=124.14 Aligned_cols=367 Identities=14% Similarity=0.129 Sum_probs=210.4 Q ss_pred Cchhhh-----hccccccCCccchhhhhhcccccccCcccc-cHHHHhccHHHHHHHHHHHHhhccCceeecchhHHHHH Q lcl|NC_011801. 1 MAFLSN-----LFKRQKMLSGSSPVWILNQGQPVSIKPKAI-TSAIALKNSDVYAVISRVSSDIAGCRFVTNAQPITDVL 74 (386) Q Consensus 1 Mg~~~~-----l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i-~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~~~~~~~l 74 (386) |..|.. +++...... . .+...+...+ -...+.+++.++.+|+.+|+++.+..+++...+-...+ T Consensus 1 ~~~~~~d~~~~~~~~~~~~~-~---------~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~~~~~~~ 70 (427) T protein:vir:10 1 MKIVKHDGYNDIFNGGADGS-P---------KPFFMSDASYHVGSFYNDNATAKRIVDVIPEEMVTAGFKMSGVKDEKEF 70 (427) T ss_pred CCccccchHHHHhhcCCCCc-c---------cCccccCchHHHHHHHHcCchhhhhhccchHHhhcCCccccCccHHHHH Confidence 655532 111100000 0 0001111111 12345578889999999999999888877654322233 Q ss_pred hccCcccCCHHHHHHHHHHHHHHhCCeEEEEee----------cCCCceEEEEEEcCcceEEeec-------CCCceeEE Q lcl|NC_011801. 75 NAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDR----------DTNGYPVRIEPVPNEKVTVALD-------DYGKDLTY 137 (386) Q Consensus 75 ~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~----------~~~g~~~~l~~l~~~~v~~~~~-------~~~~~~~~ 137 (386) ...-.... ..+-+..++.+..++|.+++++.- ...|.+..|.++++..+++... ..+.+..| T Consensus 71 ~~~~~~l~-~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~d~~~~~~~~~~~dp~s~~fg~P~~y 149 (427) T protein:vir:10 71 KSLWDSYK-LDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVYDRFAITVEKRVTNARSPRYGEPEIY 149 (427) T ss_pred HHHHHHhh-HHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEechhcccccccccCccccccCcceEE Confidence 22222222 234455566666689999988742 3456778899999888765321 22344555 Q ss_pred EEeccCcccceeEEEcccceeeecccccc---CcccccccccHHH-HHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCC Q lcl|NC_011801. 138 TVHFDDSKRSGDFLYDSSEVIHFRCTVSG---ESDTQYMGIPPID-SLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPN 213 (386) Q Consensus 138 ~~~~~~~~~~~~~~~~~~~vih~~~~~~~---~~~~~~~G~s~~~-~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~ 213 (386) .+... .....+.+.++.++|+.....+ .+.+..+|.|++. .+...+.....+.......+...... ++++++ T Consensus 150 ~v~~~--~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~k~~~~--v~k~~~ 225 (427) T protein:vir:10 150 KVSPG--DNMQPYLIHHSRVFIADGERVAQQARKQNQGWGASVLNKSLIDAICDYDYCESLATQILRRKQQA--VWKVKG 225 (427) T ss_pred EEecC--CCCcceEEccccEEEecCCCchhhhcccCCcccchhhhHHHHHHHHHHHHHHHHHHHHHHHhccc--cccchh Confidence 54422 2223467889999998643322 2245578999986 57788888888888888877665543 334432 Q ss_pred C--CC-CHHHHHHHHHHHHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCC-cC Q lcl|NC_011801. 214 A--TL-GKEAKENTRQSFEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGK-QD 288 (386) Q Consensus 214 ~--~~-~~~~~~~~k~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~-~~ 288 (386) - .+ +.+....+++++...... ++.+.+++.+++.+|+.++.+..++ .+.......+||++.+||...|... .. T Consensus 226 l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~lsgl--~~~~~~~~~~iaaa~~IP~t~L~G~sp~ 303 (427) T protein:vir:10 226 LAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNSDISGV--PEFLSSKMDRIVSLSGIHEIIIKNKNVG 303 (427) T ss_pred HHHHhcCccchHHHHHHHHHHHHhcCcccceeeecCCCceeEEecccCCh--HHHHHHHHHHHHhhhCCCeeeeccCCcc Confidence 1 11 122222334444433322 3344566666778999988877764 6678999999999999999877433 22 Q ss_pred cc--cHHHHHHHHHH-------HHHHHHHHHHHHHHHHhhhhhhhhcchhhhccCHHHHH-------HHHHHHHhCCCcC Q lcl|NC_011801. 289 AQ--SNITMIRAFYQ-------SSLSIYIKPIESELSQKLGTDVKLDIASAIDSDNSELI-------NNVQKLASAGVLA 352 (386) Q Consensus 289 ~~--~~~~~~~~~~~-------~~l~P~~~~ie~~l~~~l~~~~~fd~~~~l~~d~~~~~-------~~~~~~~~~g~~t 352 (386) +- ..++..+.||. ..+.|.++.+-+.+-.. ..+.+.+.++...+.++++ +++++++++|+++ T Consensus 304 Glnstgd~D~~nyyd~i~~~Qe~~l~p~l~~l~~~i~~s--~~~~~~f~pL~~~s~kEkaei~~~~a~a~~~~~~~gvi~ 381 (427) T protein:vir:10 304 GVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVDE--EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIID 381 (427) T ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC--CCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCC Confidence 22 22445555554 34666655554443322 3456666788888776654 5677889999999 Q ss_pred HHHHHHHh----ccCCcCCCCCCC--cccc----ccccCCCCCC Q lcl|NC_011801. 353 PIQAQKLL----KNRGVFPELDLD--EGTN----LLDNTKNIND 386 (386) Q Consensus 353 ~nE~R~~l----g~~p~~p~~~~~--~~~~----~~~~~~~~~~ 386 (386) ++|+|+.| +..++.++.+.+ +-.+ +-.....++| T Consensus 382 ~~e~r~~L~~~~~~~~~~~~~~~~~e~~~~~~e~~p~~~e~~~d 425 (427) T protein:vir:10 382 LEEARDTLRSIAPEFKLKDGNNINIREPEETTEPEPGLGEKLED 425 (427) T ss_pred HHHHHHHHHhhhccccCCCCccccccccchhcCCCCCCCCCCCC Confidence 99999987 334433322211 1001 1111112222 No 114 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.79 E-value=1.7e-19 Score=123.49 Aligned_cols=371 Identities=11% Similarity=0.034 Sum_probs=205.5 Q ss_pred Cchhhh-----------hccccc------cCC--ccchhhhhhccccc----------ccCcccc----cHHHHhccHHH Q lcl|NC_011801. 1 MAFLSN-----------LFKRQK------MLS--GSSPVWILNQGQPV----------SIKPKAI----TSAIALKNSDV 47 (386) Q Consensus 1 Mg~~~~-----------l~~~~~------~~~--~~~~~~~~~~~~~~----------~~~~~~i----~~~~a~~~~~v 47 (386) -++... .|+.+. ++. ...+......+... ......+ ....+.+++.+ T Consensus 44 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~gyql~alY~~~~l~ 123 (765) T protein:vir:96 44 RGWNVEPEKAPVIRSVKDFLEPGLSVAMDSAYGDGPTPAAKAAAGGQNPYVVPTMLQDWYNSQGFIGYQACAIISQHWLV 123 (765) T ss_pred hhcccccccCCCCCCCCcccCcccceeccccccccccchHHHhhhccCccchhhHHHhhhcccCCccHHHHHHHHhCchh Confidence 111111 111000 000 00000000000000 0000011 12235578899 Q ss_pred HHHHHHHHHhhccCceeecch------hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeec-CC---------- Q lcl|NC_011801. 48 YAVISRVSSDIAGCRFVTNAQ------PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRD-TN---------- 110 (386) Q Consensus 48 ~~~v~~ia~~ia~~p~~~~~~------~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~-~~---------- 110 (386) +.+|+.+|++..+-.+++.-. +..+.|...-.... ..+-+..++.+.-++|.+|+++.-+ .+ T Consensus 124 rkiVd~pAeDa~R~g~~I~~~~~e~~~~~~~~l~~~~~rl~-v~~~l~ea~~~~RlyGga~i~i~i~~~D~~~l~~PL~~ 202 (765) T protein:vir:96 124 DKACSMSGEDAARNGWELKSDGRKLSDEQSALIARRDMEFR-VKDNLVELNRFKNVFGVRIALFVVESDDPDYYEKPFNP 202 (765) T ss_pred hhhhhcchHHhhcCCceeecCccccCHHHHHHHHHHHHHhh-HHHHHHHHHHHhhhceeeEEEEEecccCcchhhccccc Confidence 999999999998877766321 12223322222222 2444555666666889888776432 11 Q ss_pred -----CceEEEEEEcCcceEEee----cCC------CceeEEEEeccCcccceeEEEcccceeeeccccccC---ccccc Q lcl|NC_011801. 111 -----GYPVRIEPVPNEKVTVAL----DDY------GKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGE---SDTQY 172 (386) Q Consensus 111 -----g~~~~l~~l~~~~v~~~~----~~~------~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~---~~~~~ 172 (386) |....|..++|..+.... ..+ +.+..|.+. + ..+.++.|+|+.....++ +...+ T Consensus 203 ~~I~kg~~kgl~vldp~~~~~~~v~e~~~Dp~sp~fg~P~~y~i~------g--~~IH~SRli~~~g~~lpd~lk~~~~~ 274 (765) T protein:vir:96 203 DGIAPGSYKGISQIDPYWAMPQLTAESTADPSAEHFYEPDFWIIS------G--KKYHRSHLVVVRGPQPPDILKPTYIF 274 (765) T ss_pred cccccceeeEEEEechhhcccccchhccccccccccCcceeeeec------C--ceeccceEEEecCCCchhhhccccCc Confidence 234566777766655421 111 122222221 1 246788898885332211 23346 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCC-CCCHHHHHHHHHHHHHHhcccccCcceecCCCceeee Q lcl|NC_011801. 173 MGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNA-TLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVET 251 (386) Q Consensus 173 ~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~-~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~ 251 (386) +|.|.++.+...|.....+......++.+.... +++++.. .+.. .+.+.++++......+..++++++.+.+|+. T Consensus 275 ~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~~--v~k~~~~~~l~~--~~~l~~r~~~~~~~r~n~g~~~id~ee~~e~ 350 (765) T protein:vir:96 275 GGIPLTQRIYERVYAAERTANEAPLLAMSKRTS--TIHVDVEKAIAN--EDAFNARLAFWIANRDNHGVKVIGIDETMEQ 350 (765) T ss_pred cCccHHHHHHHHHHHHHHHHHHHHHHHHHhccc--eeeechHhhhcc--HHHHHHHHHHHHHhcCCceeEEecCCcceeE Confidence 799999999999999999998999888776653 3333321 1111 2345555555544443456889999999999 Q ss_pred ccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCC-cCcc--cHHHHHHHHHH-------HHHHHHHHHHHHHHHHhh-- Q lcl|NC_011801. 252 TNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGK-QDAQ--SNITMIRAFYQ-------SSLSIYIKPIESELSQKL-- 319 (386) Q Consensus 252 ~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~-~~~~--~~~~~~~~~~~-------~~l~P~~~~ie~~l~~~l-- 319 (386) ++.+..++ .+.......+||++.+||...|-.. ..+. ..+...+.||. ..|.|.++.+-+.|-+.- T Consensus 351 ~s~~lsgl--~d~l~~~~~~iAaas~IP~t~LfGqsp~GlnATGe~D~~nYyD~I~s~Qe~~l~p~le~L~~li~~s~~i 428 (765) T protein:vir:96 351 FDTNLSDF--DSVIMNQYQLVAAIAKTPATKLLGTSPKGFNATGEHETISYHEELESIQEHIFDPLLERHYLLLAKSESI 428 (765) T ss_pred EecccCCH--HHHHHHHHHHHHhhhCCCeeeeccCCcccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 98877764 6778888999999999999776443 2333 23445555655 346666666655554331 Q ss_pred hhhhhhcchhhhccCHHHHHH-------HHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCcc---cccccc-------CC Q lcl|NC_011801. 320 GTDVKLDIASAIDSDNSELIN-------NVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEG---TNLLDN-------TK 382 (386) Q Consensus 320 ~~~~~fd~~~~l~~d~~~~~~-------~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~---~~~~~~-------~~ 382 (386) -..+.|.+.++...+.+++++ ++++++++|++++||+|+.|...|.-+..+.+.. .++... .+ T Consensus 429 ~~d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~Gvis~dEvR~~L~~~~~~g~~~l~d~~~e~~~~~~pe~~~~~~~ 508 (765) T protein:vir:96 429 DVQLEIVWNPVDSTTSQQQAELNNKKAATDEIYINSGVVSPDEVRERLRDDPRSGYNRLTDDQAETEPGMSPENLAELEK 508 (765) T ss_pred CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhccccCCCCCCCccccccccCCCccccccccC Confidence 135677888898888887765 4778899999999999999875543221111100 001111 11 Q ss_pred CCCC Q lcl|NC_011801. 383 NIND 386 (386) Q Consensus 383 ~~~~ 386 (386) .+.| T Consensus 509 ~~~~ 512 (765) T protein:vir:96 509 AGAQ 512 (765) T ss_pred CCcc Confidence 1111 No 115 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=99.74 E-value=1.1e-16 Score=108.11 Aligned_cols=370 Identities=13% Similarity=0.074 Sum_probs=222.6 Q ss_pred hccccccCCccchhhhhhccc-------cc--------cc-Cccc--ccHHHHhccHHHHHHHHHHHHhhccCceeecch Q lcl|NC_011801. 7 LFKRQKMLSGSSPVWILNQGQ-------PV--------SI-KPKA--ITSAIALKNSDVYAVISRVSSDIAGCRFVTNAQ 68 (386) Q Consensus 7 l~~~~~~~~~~~~~~~~~~~~-------~~--------~~-~~~~--i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~~ 68 (386) ...+. +++.........++ .. .. .+.. +..+...+.+.|.+|++.+...|.+++|.|... T Consensus 1 ~~~~~--~~~~p~~~~g~~~~~~~~~~~~~~~~~e~~~~lr~~~~~~ly~~m~e~D~~i~s~l~~rk~av~~~~w~v~p~ 78 (469) T protein:vir:10 1 MTERV--KTAAPVSEAGYVFGSGVVDGWTVWDPFEQTPELQWPQSVAVYSRMDNEDSRVTSLLEAISLPIRSTPWRIRAN 78 (469) T ss_pred CCCcc--cCCCCccchhhhhhcccccchhhccccccccccccccchHHHHHHHhhChHHHHHHHHHHHHHhcCCceEecC Confidence 11111 11111000000000 00 00 0111 223233368999999999999999999998642 Q ss_pred h--------HHHHHhccC------------cccCCHHHHHHHHHHHHHHhCCeEEEEeecCC-----Cc--eEEEEEEcC Q lcl|NC_011801. 69 P--------ITDVLNAPL------------GNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTN-----GY--PVRIEPVPN 121 (386) Q Consensus 69 ~--------~~~~l~~~P------------N~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~-----g~--~~~l~~l~~ 121 (386) . +...|.... +-..++.+++..++.+.+.+|-++.++++... |. +..|.+.|+ T Consensus 79 ~~~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~dG~~~~~~l~~rp~ 158 (469) T protein:vir:10 79 GASDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSPDGRFWLRKLAPRPQ 158 (469) T ss_pred CCCHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccCCCceeeeeeeecCc Confidence 1 222222111 11236777888888888889999999987643 32 456667777 Q ss_pred cce-EEeecCCCceeEEEEeccCc--------ccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHH Q lcl|NC_011801. 122 EKV-TVALDDYGKDLTYTVHFDDS--------KRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSS 192 (386) Q Consensus 122 ~~v-~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~ 192 (386) .++ +...+.+.....+....... .......+|+...++.++.. ..+..+|.|.+..+......-.... T Consensus 159 ~~i~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lp~~k~i~~~~~~---~~g~p~g~gLlr~~~~~~~fK~~~~ 235 (469) T protein:vir:10 159 WTISKFNVAPDGGLESIEQIAPPARTRGSLYVANIAPPEIPVNRLVVYTRNK---RPGQWQGKSILRSAYKHWLLKDKLL 235 (469) T ss_pred ccceeeeeccCCceeeeeecCcccccccccccCCCCccccccCcEEEEEecC---CCCCcccchhHHHHHHHHHHHHHHH Confidence 655 33444444444333211110 11123456666655555432 2344689999999999999999999 Q ss_pred HHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHH Q lcl|NC_011801. 193 KLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQI 272 (386) Q Consensus 193 ~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~I 272 (386) ++...|.+.-|.|--+.+.+. ..++++++.+.+.+.+...|.++ .++++.|++++-+..+-....|.+..++.-++| T Consensus 236 ~~w~~f~EryG~P~~vgky~~-~a~~~ek~~l~~a~~~~~~g~~a--~~iip~~~~ie~~ea~g~~~~~~~li~~~d~~I 312 (469) T protein:vir:10 236 RIEAATAERNGMGIPVGTASS-ATDEDEVRKMAALARSVRGGINA--GVGLAQGQILELLGVSGNLPDIRRAIEGHDRSI 312 (469) T ss_pred HHHHHHHHHcCCcceEEecCC-CCCHHHHHHHHHHHHHHhcCCce--EEEccCCceEEEeecCCCchHHHHHHHHHHHHH Confidence 999999999888877766654 45788888888888877666655 467888888888877666667889999998998 Q ss_pred HHHhCCCHHHhcCCcCcccH-HHHHHHHHHHHHHHHHHHHHHHHHHhhhhh-h-----------hhcchhhhccCHHHHH Q lcl|NC_011801. 273 AKAFGIPADYLSGKQDAQSN-ITMIRAFYQSSLSIYIKPIESELSQKLGTD-V-----------KLDIASAIDSDNSELI 339 (386) Q Consensus 273 a~~~gvp~~~l~~~~~~~~~-~~~~~~~~~~~l~P~~~~ie~~l~~~l~~~-~-----------~fd~~~~l~~d~~~~~ 339 (386) +.+.-=.- +-.....++++ .+.......+.+.-.++.+++.||+.|... + +|.++. ...+.+..+ T Consensus 313 sk~iLG~t-lTs~~~gGS~a~~~vh~ev~~d~~~sDa~~i~~tln~~li~~l~~lN~g~~~~~P~~~~~~-~e~~~~~~a 390 (469) T protein:vir:10 313 ALSGLAHF-LNLDGKGGSYALASVLEDPFTQAVHAYATSICRIANQHIIEDLVDINFGVDTPAPVLTFDP-IGSRQDLTA 390 (469) T ss_pred HHHHhccc-ccccCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccEEEecC-CCCcHHHHH Confidence 87763222 11111223444 334456667788889999999999876421 1 222222 235667788 Q ss_pred HHHHHHHhCCCc-----CHHHHHHHhccCCcCCCCCCCcccccc---ccCCCCCC Q lcl|NC_011801. 340 NNVQKLASAGVL-----APIQAQKLLKNRGVFPELDLDEGTNLL---DNTKNIND 386 (386) Q Consensus 340 ~~~~~~~~~g~~-----t~nE~R~~lg~~p~~p~~~~~~~~~~~---~~~~~~~~ 386 (386) +.+++++..|++ +.+.+|+.+|..+..++.+..+...+. .++..... T Consensus 391 ~~i~~l~~~G~~~~~~~~~~~~~e~~gip~~~~~~~~~~~~~~~~~~~~~~~~~~ 445 (469) T protein:vir:10 391 AAVKLLYDAGVFDDDPAVKRAIRQRFNLPSELNDTPSAEPEEPAAVPNQSAAPAR 445 (469) T ss_pred HHHHHHHhcCCccCccccHHHHHHHhCCCCCCCCcccccchhcccCCCCCccccc Confidence 999999999984 567899999964433333322211111 11111111 No 116 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.70 E-value=7e-17 Score=109.14 Aligned_cols=380 Identities=9% Similarity=0.080 Sum_probs=212.0 Q ss_pred Cchhhhhcccccc---C---Cccch---hhhhh-----cccccccCcc-----------cccHHHHhccHHHHHHHHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKM---L---SGSSP---VWILN-----QGQPVSIKPK-----------AITSAIALKNSDVYAVISRVS 55 (386) Q Consensus 1 Mg~~~~l~~~~~~---~---~~~~~---~~~~~-----~~~~~~~~~~-----------~i~~~~a~~~~~v~~~v~~ia 55 (386) |+|+||...--+. . ..... ..... ...+...+.. .-+.+.+..++.+..+|+.+. T Consensus 1 mn~~dr~i~~~sP~~~~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 80 (502) T protein:vir:79 1 MAILDDVIGVFSPGWKAARLRSRAVIQAYEAVKTTRTHKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKLE 80 (502) T ss_pred CchHhhHHhhcChHHHHHHHhhHHHHhhccccCcccccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 9999975321110 0 00000 00000 0001111111 011233456788999999777 Q ss_pred HhhccC-ceeecc----------hhHHH----HH---hcc--CcccCCHHHHHHHHHHHHHHhCCeEEEEeecCC----- Q lcl|NC_011801. 56 SDIAGC-RFVTNA----------QPITD----VL---NAP--LGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTN----- 110 (386) Q Consensus 56 ~~ia~~-p~~~~~----------~~~~~----~l---~~~--PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~----- 110 (386) ..+=+. -+.+.. ..+.+ ++ -.. .+-.++.+.+...++..++..|++|+.+.+... T Consensus 81 ~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~~~~ 160 (502) T protein:vir:79 81 ERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSLTP 160 (502) T ss_pred HhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCccCC Confidence 765432 222211 11221 22 122 234577888999999999999999999876543 Q ss_pred --CceEEEEEEcCcceE------------EeecCCCceeEEEEeccCcc---cceeEEEcccceeeeccccccCcccccc Q lcl|NC_011801. 111 --GYPVRIEPVPNEKVT------------VALDDYGKDLTYTVHFDDSK---RSGDFLYDSSEVIHFRCTVSGESDTQYM 173 (386) Q Consensus 111 --g~~~~l~~l~~~~v~------------~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~vih~~~~~~~~~~~~~~ 173 (386) +.+..|..|+|+.+. |..|..+..+.|.+.....+ ......+|+++|+|+... ...+..+ T Consensus 161 g~~~~l~lq~iepd~l~~~~~~~~~i~~GVe~d~~Gr~~aY~i~~~hPgd~~~~~~~rvpA~~vlH~f~~---~r~gQ~R 237 (502) T protein:vir:79 161 SAGVHFWLEALEPDFIPMTSDESNRLNQGVFVDDWGRPEKYLVYKSRPVSGRQMETKEVDAERMLHLKFV---RRLHQMR 237 (502) T ss_pred CcccceEEEEecchhcCCCCCCCCeeEeeeEECCCCceEEEEEeecCCCCCcccceeEechhheEEeecc---cCCcccc Confidence 235688999998875 55677788888876543322 223367999999999643 3456789 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCC-CHHHHHHHHHHHHHHhcccccCcce-ecCCCceeee Q lcl|NC_011801. 174 GIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATL-GKEAKENTRQSFEEQTTGENAGRAV-VLDQSADVET 251 (386) Q Consensus 174 G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~-~~~~~~~~k~~~~~~~~~~~~g~~~-vl~~g~~~~~ 251 (386) |+|.+..+...+..............+=.+...++|+.+.... ..+... ..-......-..|.++ .|..|.+++. T Consensus 238 Gis~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~---~~~~~~~~~l~pG~i~~~L~pGe~i~~ 314 (502) T protein:vir:79 238 GTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEPDGNG---SKENERELTIQPGIIYDDLKPGEEIGM 314 (502) T ss_pred CCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCcccccccCC---CCCccccccccCCccccccCCCceeee Confidence 9999999999998887777777666666666677777543211 100000 0000000111234444 5889999998 Q ss_pred ccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccH--HHHH-----------HHHHHHHHHHHHHH-HHHHHHH Q lcl|NC_011801. 252 TNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSN--ITMI-----------RAFYQSSLSIYIKP-IESELSQ 317 (386) Q Consensus 252 ~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~--~~~~-----------~~~~~~~l~P~~~~-ie~~l~~ 317 (386) .+.+.....|.+..+...+.||+.+|||.+.|...-+.+++ .... ..|...-++|+.+. ++.++-. T Consensus 315 ~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~ 394 (502) T protein:vir:79 315 VKSDRPNPNLETFRNGQLRAVAAGSRLSFSSTARNYNGTYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVAS 394 (502) T ss_pred eCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc Confidence 88765666789999999999999999999999765433322 1111 12334445554443 2333332 Q ss_pred hhhh---hh--------hhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCC------------------c--C Q lcl|NC_011801. 318 KLGT---DV--------KLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRG------------------V--F 366 (386) Q Consensus 318 ~l~~---~~--------~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p------------------~--~ 366 (386) .... +. ++---.....|+..-+++..+++++|+.|.-|+-+..|..| + + T Consensus 395 G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~Gl~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~ 474 (502) T protein:vir:79 395 GVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGAATESDWVRAGGRNPDDVKRRRKAEIDENRKLDLVFD 474 (502) T ss_pred CCCCCCCCCCchhhcceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCC Confidence 2110 11 11111123358888888889999999999887766666431 1 1 Q ss_pred --CCCCCC------ccccccccCCCCCC Q lcl|NC_011801. 367 --PELDLD------EGTNLLDNTKNIND 386 (386) Q Consensus 367 --p~~~~~------~~~~~~~~~~~~~~ 386 (386) |..... ...++-.....+.| T Consensus 475 ~~~~~~~~~~~~~~~~~e~~~~~~~~e~ 502 (502) T protein:vir:79 475 TDPASDKGGSSAATKRQEPQHTDDQSEE 502 (502) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 000000 00011111111112 No 117 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=99.68 E-value=2.8e-15 Score=100.37 Aligned_cols=361 Identities=9% Similarity=-0.015 Sum_probs=210.1 Q ss_pred ccccccCCc------cchhhhhhcccccccCcccc----------cHHHHhccHHHHHHHHHHHHhhccCceeecch--- Q lcl|NC_011801. 8 FKRQKMLSG------SSPVWILNQGQPVSIKPKAI----------TSAIALKNSDVYAVISRVSSDIAGCRFVTNAQ--- 68 (386) Q Consensus 8 ~~~~~~~~~------~~~~~~~~~~~~~~~~~~~i----------~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~~--- 68 (386) .+|+..... ....+........ .....+ .-+..++.+.|.+|++.+...|.+++|.+... T Consensus 1 v~~~~l~~e~at~~~~~d~~~~~~~~l~-~~~~~il~~a~~g~~~~y~~l~~D~~i~s~l~~rk~av~~~~w~i~p~~~~ 79 (488) T protein:vir:99 1 MEKPALGREIATSGDGRDITRPFISGLQ-VPNDSILQRRGGNDLRVYEEILSDAQVKTVWGQRQLAVVSREWKVEAGGDR 79 (488) T ss_pred CCccchhHHHHHHHhhhhhhccccCCCC-CCChHHHHhhccCCHHHHHHHhhChHHHHHHHHHHHHHhcCCceEEcCCCC Confidence 111110000 0000000000000 000111 11334678999999999999999999998632 Q ss_pred ----hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCC---ceEEEEEEcCcceEEeecCCCceeEEEEec Q lcl|NC_011801. 69 ----PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNG---YPVRIEPVPNEKVTVALDDYGKDLTYTVHF 141 (386) Q Consensus 69 ----~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g---~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~ 141 (386) .....+...-+ ...+.++++.+. +.+++|-++.++++...| .+..+.+.|+..+.+..+. .. .+.. . T Consensus 80 ~~~~~~ae~v~~~l~-~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~d~~~--~l-~~~~-~ 153 (488) T protein:vir:99 80 PIDQAAAEHLEQQLQ-RVGWDRVTSKML-FGVFYGYAVSELIYGRDDRYITLEAIKVRNRRRFRYDQDG--GL-RLLT-P 153 (488) T ss_pred hHHHHHHHHHHHHHh-CCCHHHHHHHHH-hhhhhcceeEEEEEeecCCeeeEeeeeeecccceeecCCC--ce-EEec-c Confidence 12233322111 135667777765 567899999999886543 3457888889887764332 22 1111 1 Q ss_pred cCcccceeEEEcccc--eeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHH Q lcl|NC_011801. 142 DDSKRSGDFLYDSSE--VIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKE 219 (386) Q Consensus 142 ~~~~~~~~~~~~~~~--vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~ 219 (386) ++...+ ..++... ++|.... ..+..+|.|.+..+......-....++...|.+.-|.|-.+.+.+....+++ T Consensus 154 ~~~~~g--~~lp~~~~~i~~~~~~----~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~~a~~~ 227 (488) T protein:vir:99 154 NNMFEG--EPCPAPYFWHFSTGAD----NDDEPYGLGLAHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDKTATPE 227 (488) T ss_pred CCCCCc--cccccCceEEEEeecC----CCCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeeeecCCCCCCHH Confidence 112222 3344332 3443222 2344689999999999999999999999999999899877777664456788 Q ss_pred HHHHHHHHHHHHhcccccCcceecCCCceeeeccCC-hhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHH-HHHH Q lcl|NC_011801. 220 AKENTRQSFEEQTTGENAGRAVVLDQSADVETTNIS-PNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNI-TMIR 297 (386) Q Consensus 220 ~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~-~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~-~~~~ 297 (386) +++++.+.+.+... + ..++++.|++++-+..+ .....|.+..++.-++|+.+.-= ..+-+..++++++. +... T Consensus 228 ek~~l~~av~~~~~--~--~~~viP~~~~ie~~ea~~~~~~~~~~li~~~d~~Isk~iLG-qtlts~~~~Gs~a~~~vh~ 302 (488) T protein:vir:99 228 DKAKLLAALHAIQT--D--SAIIMPAGMQAELLEAGRSGTADYKTLHDTMDATIAKVGLG-QVASTQGTPGRLGNDDLQA 302 (488) T ss_pred HHHHHHHHHHHHhc--C--cEEEecCCceeEEeecCCCChHHHHHHHHHHHHHHHHHHhh-hhhcccccccchhhHHHHH Confidence 88888887776532 2 35777777777666542 22234788888888999887411 12212222234432 2334 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhh-hhhhc----------chhhhccCHHHHHHHHHHHHhC-CC-cCHHHHHHHhccCC Q lcl|NC_011801. 298 AFYQSSLSIYIKPIESELSQKLGT-DVKLD----------IASAIDSDNSELINNVQKLASA-GV-LAPIQAQKLLKNRG 364 (386) Q Consensus 298 ~~~~~~l~P~~~~ie~~l~~~l~~-~~~fd----------~~~~l~~d~~~~~~~~~~~~~~-g~-~t~nE~R~~lg~~p 364 (386) ....+.++-.++.|++.||+.|.. .+.++ ++.....|.+++++++.++++. |+ ++..++|+.+|..+ T Consensus 303 ~v~~d~~~aDa~~i~~tln~~li~~l~~~N~~~~~~p~~~~~~~e~edl~~~a~~~~~l~~~~G~~i~~~~i~e~~Gip~ 382 (488) T protein:vir:99 303 DVRLDLVKADADLICESFNLGPARWLTEWNFPGAQPPRVYRVIEEPEDITAKAERDEKVFRMSGFRPTRGYVQETYGVEV 382 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcCCcCCceeEecCCCcccHHHHHHHHHHHHhhcCCCCCHHHHHHHcCCCC Confidence 566778888999999999987642 22222 2222356888999999999985 75 78889999999654 Q ss_pred cCCCCCCCccc---cccccCCCCCC Q lcl|NC_011801. 365 VFPELDLDEGT---NLLDNTKNIND 386 (386) Q Consensus 365 ~~p~~~~~~~~---~~~~~~~~~~~ 386 (386) ..++++..... ......++... T Consensus 383 ~~~~~~~~~~~~~~~~~~~~~~~~~ 407 (488) T protein:vir:99 383 ESTQAEATAPTPSTEFAEGDQPSDP 407 (488) T ss_pred cccccccccCCCcccCCCCCCCCCc Confidence 33222211110 00011111000 No 118 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=99.68 E-value=4.1e-15 Score=99.45 Aligned_cols=369 Identities=12% Similarity=0.099 Sum_probs=213.6 Q ss_pred CchhhhhccccccC----Cccchhhhhhccccc-----ccCcc------------c------ccHHHHhccHHHHHHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKML----SGSSPVWILNQGQPV-----SIKPK------------A------ITSAIALKNSDVYAVISR 53 (386) Q Consensus 1 Mg~~~~l~~~~~~~----~~~~~~~~~~~~~~~-----~~~~~------------~------i~~~~a~~~~~v~~~v~~ 53 (386) |+-+=-..+++-.. ....+.......... ..... . +..+...+.+.|.+|++. T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iLr~a~~gd~~~~~~L~e~m~e~D~~i~s~l~~ 80 (526) T protein:vir:99 1 MAQIVDVYGNPIRTQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAEMSK 80 (526) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCcCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHHHHH Confidence 65431122332111 111111111000000 00000 0 111111258899999999 Q ss_pred HHHhhccCceeecch--------hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCC---ceEEEEEEcCc Q lcl|NC_011801. 54 VSSDIAGCRFVTNAQ--------PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNG---YPVRIEPVPNE 122 (386) Q Consensus 54 ia~~ia~~p~~~~~~--------~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g---~~~~l~~l~~~ 122 (386) +...|.+++|.|... .....+...-+....+.+++..+. +.+++|-+..++++...| .+..+.+.++. T Consensus 81 Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~ 159 (526) T protein:vir:99 81 RKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDAL-DGIGHGYSCIELEWALQGREWMPLAFHHRPQS 159 (526) T ss_pred HHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcccCHHHHHHHHH-HhhhhcceeEEEEEeecCCceeEEEeeeeccc Confidence 999999999988632 122233222222223555665555 577899999999876543 35678999998 Q ss_pred ceEEeecCCCceeEEEEeccCcccceeEEEccc-ceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_011801. 123 KVTVALDDYGKDLTYTVHFDDSKRSGDFLYDSS-EVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRH 201 (386) Q Consensus 123 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n 201 (386) .+.+..+.... +.+. ++...+ ..+++. -|+|.... ..+..+|.|.+..+......-....++...|.+. T Consensus 160 ~f~~~~~~~~~-l~~~---~~~~~g--~~l~~~k~i~~~~~~----~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~ 229 (526) T protein:vir:99 160 WFQLNPEDQNE-LRLR---DNSPAG--EALQPFGWIIHRPRA----RSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEI 229 (526) T ss_pred ceeeccCCCcE-EEec---CCCCCc--eeecCCCeEEEeecC----CcCCccccchHHHHHHHHHHHHhhHHHHHHHHHH Confidence 88875544332 2221 111222 335554 44554322 2344689999999999999999999999999999 Q ss_pred cCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCC-hhhHHHHHHHHHHHHHHHHHhCCCH Q lcl|NC_011801. 202 AIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNIS-PNVTEFLQNVSFSQDQIAKAFGIPA 280 (386) Q Consensus 202 g~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~-~~d~~~~e~~~~~~~~Ia~~~gvp~ 280 (386) -|.|-.+.+.+. ..++++++.+.+.+.+... + ..++++.|++++-+..+ .....|.+..++.-++|+.+. +-. T Consensus 230 yG~P~~igky~~-~a~~~ek~~L~~av~~i~~--d--~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i-LGq 303 (526) T protein:vir:99 230 YGLPIRLGKYPP-GTADEEKATLLRAVTGLGH--A--AAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAV-LGG 303 (526) T ss_pred cCCceEEEecCC-CCCHHHHHHHHHHHHHHhh--C--cEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHH-hhh Confidence 999877777764 3478888888888876532 2 25777777777766543 222347888889999998875 112 Q ss_pred HHhcC---CcCcccHH-HHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhhhc--------------chhhhccCHHHHHHH Q lcl|NC_011801. 281 DYLSG---KQDAQSNI-TMIRAFYQSSLSIYIKPIESELSQKLGT-DVKLD--------------IASAIDSDNSELINN 341 (386) Q Consensus 281 ~~l~~---~~~~~~~~-~~~~~~~~~~l~P~~~~ie~~l~~~l~~-~~~fd--------------~~~~l~~d~~~~~~~ 341 (386) .+-.. ...+++.. +.......+.+.-.++.+++.||+.|.. .+.+| ++..-..|.+++++. T Consensus 304 tlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~ 383 (526) T protein:vir:99 304 TLTSTTSQSGGGAFALGQVHNEVRHDLLASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSMAQS 383 (526) T ss_pred hhccccccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHHHHH Confidence 22111 11233332 2334556677788889999999986632 22222 222345678889999 Q ss_pred HHHHHhCCC-cCHHHHHHHhccCCcCCCCCCCccc-cccccCCC----CCC Q lcl|NC_011801. 342 VQKLASAGV-LAPIQAQKLLKNRGVFPELDLDEGT-NLLDNTKN----IND 386 (386) Q Consensus 342 ~~~~~~~g~-~t~nE~R~~lg~~p~~p~~~~~~~~-~~~~~~~~----~~~ 386 (386) +.+++..|+ ++..++|+.+|.....++++..+.. .+-.++.. ..+ T Consensus 384 ~~~L~~~G~~i~~~~i~e~~Gip~~~~~e~~l~~~~~~~~~~~~~~~~~~~ 434 (526) T protein:vir:99 384 IPALVNVGLEIPSAWVYDKLGIPQPAKNEPVLRSAAQPAILSRQHGQRVAA 434 (526) T ss_pred HHHHHhCCCccCHHHHHHHhCCCCCCCcccccCCCCCCccccccccccccc Confidence 999999997 8999999999953222222111100 00000000 000 No 119 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=99.67 E-value=5.1e-15 Score=98.92 Aligned_cols=368 Identities=13% Similarity=0.088 Sum_probs=211.2 Q ss_pred CchhhhhccccccCCc----cchhhhhhcccccccCccc-----------------c------cHHHHhccHHHHHHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSG----SSPVWILNQGQPVSIKPKA-----------------I------TSAIALKNSDVYAVISR 53 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~-----------------i------~~~~a~~~~~v~~~v~~ 53 (386) |+-+=-.++|+-..+. ..+................ + ..+...+.+.|.+|++. T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~~~m~e~D~~i~s~l~~ 80 (528) T protein:vir:10 1 MAAIVDIYGNPLRTQQLRKQQTAHLAGLAKEFANHPAKGLTPAKLAHILIEAEQGHLQAQAELFMDMEERDAHLFAEMSK 80 (528) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCCCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHHHHH Confidence 6654222344322111 1111111100000000000 1 11111268899999999 Q ss_pred HHHhhccCceeecch----h----HHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCC---ceEEEEEEcCc Q lcl|NC_011801. 54 VSSDIAGCRFVTNAQ----P----ITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNG---YPVRIEPVPNE 122 (386) Q Consensus 54 ia~~ia~~p~~~~~~----~----~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g---~~~~l~~l~~~ 122 (386) +...|.+++|.+... + ....+...-+..-.+.+++.. +.+.+++|-++.++++...| .+..+.+.++. T Consensus 81 Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~i~~-~lda~~~G~s~~Ei~w~~~~g~~~~~~~~~r~~~ 159 (528) T protein:vir:10 81 RKRAVLGLDWTIEPPRNASAAEKADAEYLHELLLDLEGIEDLMLD-CMDGVGHGYSAIELDWSLQGREWLPQAFDHRPQS 159 (528) T ss_pred HHHHHhcCCceEecCCCCCHHHHHHHHHHHHHHhCCccHHHHHHH-HHhhhhhcceeEEEEEeecCCceeEEEeeeeccc Confidence 999999999998631 1 222222211111123333433 34466799999999876543 35678889998 Q ss_pred ceEEeecCCCceeEEEEeccCcccceeEEEcccc-eeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_011801. 123 KVTVALDDYGKDLTYTVHFDDSKRSGDFLYDSSE-VIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRH 201 (386) Q Consensus 123 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n 201 (386) .+.+..+... .+. .. ++... ...+++.. ++|. +. ...+..+|.+.+..+......-....++...|.+. T Consensus 160 ~f~~~~~~~~-~l~--~~-~~~~~--g~~l~~~k~iv~~-~~---~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~ 229 (528) T protein:vir:10 160 WFQLNPDDQD-ELR--LR-DNSIA--GEVLQPFGWIMHK-PR---SRSGYVARSGLFRVLAWPYLFKHYSTADLAEMLEI 229 (528) T ss_pred ceeeccCCCc-EEe--cc-CCCCC--ceeecCCCeEEEe-ec---CCCCCccccchHHHHHHHHHHHHhhHHHHHHHHHH Confidence 8777544332 111 11 11112 23455554 4444 32 12334579999999999999999999999999999 Q ss_pred cCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCC-hhhHHHHHHHHHHHHHHHHHhCCCH Q lcl|NC_011801. 202 AIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNIS-PNVTEFLQNVSFSQDQIAKAFGIPA 280 (386) Q Consensus 202 g~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~-~~d~~~~e~~~~~~~~Ia~~~gvp~ 280 (386) -|.|-.+.+.+. ..++++++.+.+.+.+... ++ .++++.|++++-+..+ ..-..|.+..++.-++|+.+.- -. T Consensus 230 yG~P~~igky~~-~a~~~ek~~L~~al~~i~~--~~--~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iL-Gq 303 (528) T protein:vir:10 230 YGLPIRLGKYPP-GTPDEEKVTLLRAVTGLGH--AA--AGIIPESMSIDFQEASKGSAEPFMAMMRWCDDSMSKAIL-GG 303 (528) T ss_pred cCCCeEEEecCC-CCCHHHHHHHHHHHHHHhh--Cc--EEEecCCceeEEeecCCCChhHHHHHHHHHHHHHHHHHh-hh Confidence 999877777764 4578888888888876532 22 5677777777766543 2223478888888888887762 12 Q ss_pred HHhcCC---cCcccHH-HHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhhhc--------------chhhhccCHHHHHHH Q lcl|NC_011801. 281 DYLSGK---QDAQSNI-TMIRAFYQSSLSIYIKPIESELSQKLGT-DVKLD--------------IASAIDSDNSELINN 341 (386) Q Consensus 281 ~~l~~~---~~~~~~~-~~~~~~~~~~l~P~~~~ie~~l~~~l~~-~~~fd--------------~~~~l~~d~~~~~~~ 341 (386) .+-... ..++++- +-......+.+.-.++.+++.||+.|.. .+.++ ++..-..|.++++++ T Consensus 304 tlTs~~~~g~~gS~Alg~vh~~v~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~ 383 (528) T protein:vir:10 304 TLTSQTSESGGGAYALGQVHNEVRHDLLAADARQLAATLSRDLLWPLLVLNRSGNLDARRAPRLVFDLKDRADLAAMATS 383 (528) T ss_pred hhhccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCCCcccHHHHHHH Confidence 221111 1233432 2335566777888999999999987632 22222 222335678889999 Q ss_pred HHHHHhCCC-cCHHHHHHHhccCCcCCCCCCCcccccccc----CC---CCCC Q lcl|NC_011801. 342 VQKLASAGV-LAPIQAQKLLKNRGVFPELDLDEGTNLLDN----TK---NIND 386 (386) Q Consensus 342 ~~~~~~~g~-~t~nE~R~~lg~~p~~p~~~~~~~~~~~~~----~~---~~~~ 386 (386) +.+++..|+ ++..++|+.+|.....++++.. ..+.... +. ...+ T Consensus 384 ~~~L~~~G~~i~~~~i~e~~gip~p~~~e~~~-~~~~~~~~~~~~~~~~~~~~ 435 (528) T protein:vir:10 384 LPPLVKLGVQVPVNWVQEQLGIPLPANGEAVL-GDQAGAGIAQLSRRPGPRIA 435 (528) T ss_pred HHHHHhCCCCCCHHHHHHHhCCCCCCCCcccc-cCCCcccccccCcccccccc Confidence 999999998 8999999999953222222111 1110000 00 0000 No 120 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.64 E-value=1.3e-15 Score=102.17 Aligned_cols=379 Identities=10% Similarity=0.048 Sum_probs=213.2 Q ss_pred Cchhhhhcccccc------CCccchhhhhhcc--------cccccCcc-----------cccHHHHhccHHHHHHHHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKM------LSGSSPVWILNQG--------QPVSIKPK-----------AITSAIALKNSDVYAVISRVS 55 (386) Q Consensus 1 Mg~~~~l~~~~~~------~~~~~~~~~~~~~--------~~~~~~~~-----------~i~~~~a~~~~~v~~~v~~ia 55 (386) |++++|....... ............+ .+...+.. .-+.+.+..++.+..+|+.+. T Consensus 8 ~~~~dr~i~~~~~~~~~~~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~ 87 (505) T protein:vir:96 8 PSLAQRMVNWAWYRYVEPQKNAARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSINNPYAKRFYQLLK 87 (505) T ss_pred cchhhcccchhhhhhHHHHHHhhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 8888876431100 0000000000000 01111111 112223456788899999776 Q ss_pred Hhhcc-Cceeecc----------hhHH-------HHHhccCc----ccCCHHHHHHHHHHHHHHhCCeEEEEeecCCC-c Q lcl|NC_011801. 56 SDIAG-CRFVTNA----------QPIT-------DVLNAPLG----NLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNG-Y 112 (386) Q Consensus 56 ~~ia~-~p~~~~~----------~~~~-------~~l~~~PN----~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g-~ 112 (386) ..+=+ ..++... ..+. +.+-..+| -.++.+++...++..++..|+||+.+.+...+ . T Consensus 88 ~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~ 167 (505) T protein:vir:96 88 NNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDGEVLVREHRGYPNKW 167 (505) T ss_pred HHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCCceEEEEeecCCCCc Confidence 66543 3443321 1122 22333344 44668888999999999999999988765433 3 Q ss_pred eEEEEEEcCcceE----------------EeecCCCceeEEEEeccCcc---------cceeEEEcccceeeeccccccC Q lcl|NC_011801. 113 PVRIEPVPNEKVT----------------VALDDYGKDLTYTVHFDDSK---------RSGDFLYDSSEVIHFRCTVSGE 167 (386) Q Consensus 113 ~~~l~~l~~~~v~----------------~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~vih~~~~~~~~ 167 (386) +..|.+|+|+.+. |..|..+..+.|.+.....+ ......+|+++|+|+.. +. T Consensus 168 ~~~lqliepd~l~~~~n~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~hPgd~~~~~~~~~~~~~rvpa~~vlH~f~---~~ 244 (505) T protein:vir:96 168 GYALQILECDRLDLNYNADLQNGNRIRMSIELDAWERPVAYHLLVNHPGDNSYCYHYAGQTYERVPADEIIHTFV---PW 244 (505) T ss_pred ceEEEEechhhcCCCCCcccCCcCeEEeceEECCCCceEEEEEeecCCCccccccccccccccccCHhHhhhhhc---cc Confidence 5678888888873 34566677777776533221 12234589999999963 33 Q ss_pred cccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCc Q lcl|NC_011801. 168 SDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSA 247 (386) Q Consensus 168 ~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~ 247 (386) ..+..+|+|.+..+...+..............+=.+...++|+.......+...+. .......-..|.+..|..|. T Consensus 245 r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~----~~~~~~~l~pG~i~~L~pGe 320 (505) T protein:vir:96 245 RPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEAYDQPPEDD----QGEIVEEVEAGTYQLLPYGI 320 (505) T ss_pred CCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCccCCCccccc----cCccccccCCceeeecCCCC Confidence 45678999999999999888877777766666655666677776433322211111 01111112245688899999 Q ss_pred eeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCc-ccH--HHH-----------HHHHHHHHHHHHHHH-HH Q lcl|NC_011801. 248 DVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDA-QSN--ITM-----------IRAFYQSSLSIYIKP-IE 312 (386) Q Consensus 248 ~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~-~~~--~~~-----------~~~~~~~~l~P~~~~-ie 312 (386) +++.++.+--..+|.+..+...+.||+.+|||.+.|...-+. +++ ... ...|....++|+.+. ++ T Consensus 321 ~i~~~~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~ 400 (505) T protein:vir:96 321 RFKEHKIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEGVNFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLIS 400 (505) T ss_pred eeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 999988776567789999999999999999999998654322 222 111 122444556665543 34 Q ss_pred HHHHHhhhh--h------hhhcch--hhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccC------------------C Q lcl|NC_011801. 313 SELSQKLGT--D------VKLDIA--SAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNR------------------G 364 (386) Q Consensus 313 ~~l~~~l~~--~------~~fd~~--~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~------------------p 364 (386) .++-..... . ....+. .....|+..-+++...++++|+.|.-|+-+..|.. + T Consensus 401 ~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~G 480 (505) T protein:vir:96 401 MSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAHSESIKNRTRSRSSIIRAAGDDPEDVFDEIAWEEQLMRDKG 480 (505) T ss_pred HHHHcCCcCCCCccchhhceeeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcC Confidence 443332211 1 111111 12234888888999999999999987765555532 2 Q ss_pred cCCCCCCC---ccccccccCCCCCC Q lcl|NC_011801. 365 VFPELDLD---EGTNLLDNTKNIND 386 (386) Q Consensus 365 ~~p~~~~~---~~~~~~~~~~~~~~ 386 (386) +.+..+.. .....-..+..++| T Consensus 481 l~~~~~~~~~~~~~~~~~~~~~~d~ 505 (505) T protein:vir:96 481 VNPTPPEQESKDATTDEEDDSASDD 505 (505) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCC Confidence 22111111 11111111111112 No 121 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=99.62 E-value=5.1e-14 Score=93.47 Aligned_cols=365 Identities=13% Similarity=0.094 Sum_probs=212.5 Q ss_pred Cchh-hhhccccccC----Cccchhhhhhccccc-----ccCc-------------c-----cccHHHHhccHHHHHHHH Q lcl|NC_011801. 1 MAFL-SNLFKRQKML----SGSSPVWILNQGQPV-----SIKP-------------K-----AITSAIALKNSDVYAVIS 52 (386) Q Consensus 1 Mg~~-~~l~~~~~~~----~~~~~~~~~~~~~~~-----~~~~-------------~-----~i~~~~a~~~~~v~~~v~ 52 (386) |+-+ +. .+++-.. ....+.......... .+.. + .+..+...+.+.|.+|++ T Consensus 1 ~~~~~d~-~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~edm~e~D~~i~s~l~ 79 (526) T protein:vir:79 1 MAQIVDV-YGNPIRPQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAEMS 79 (526) T ss_pred CCeeeCC-CCCccCccccchhhhhhhhhhhhhcccCCCCCcCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHHHHH Confidence 5543 32 2332111 111111111000000 0000 0 011111226889999999 Q ss_pred HHHHhhccCceeecch--------hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCC---ceEEEEEEcC Q lcl|NC_011801. 53 RVSSDIAGCRFVTNAQ--------PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNG---YPVRIEPVPN 121 (386) Q Consensus 53 ~ia~~ia~~p~~~~~~--------~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g---~~~~l~~l~~ 121 (386) .+...|.+++|.+... ..+..+...-+......+++..+.. .+.+|-+..++++...| .+..+.+.++ T Consensus 80 ~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~ld-A~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~ 158 (526) T protein:vir:79 80 KRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDALD-GIGHGYSCIELEWALQGREWMPLAFHHRPQ 158 (526) T ss_pred HHHHHHhCCCceEecCCCCChHHHHHHHHHHHHHhcccCHHHHHHHHHh-hhhhcceeEEEEEeecCCceeEEEeeeecc Confidence 9999999999988631 1222332222222235555555444 66799999999876643 3567888899 Q ss_pred cceEEeecCCCceeEEEEeccCcccceeEEEcccc-eeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011801. 122 EKVTVALDDYGKDLTYTVHFDDSKRSGDFLYDSSE-VIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLR 200 (386) Q Consensus 122 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ 200 (386) ..+++..+.... +.+. ++...+ ..+++.. |+|. +. ...+..+|.+.+..+......-....++...|.+ T Consensus 159 ~~F~~~~~~~~~-l~~~---~~~~~g--~~l~~~k~iv~~-~~---~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E 228 (526) T protein:vir:79 159 SWFQLNPEDQNE-LRLR---DNSPAG--EALQPFGWIIHR-PR---ARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLE 228 (526) T ss_pred cceEeccCCCcE-EEec---CCCCCc--eeecCCceEEEe-ec---CCcCCccccchHHHHHHHHHHHHhhHHHHHHHHH Confidence 888765544322 2211 111222 3455554 4444 22 1234468999999999999999999999999999 Q ss_pred ccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCC-hhhHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_011801. 201 HAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNIS-PNVTEFLQNVSFSQDQIAKAFGIP 279 (386) Q Consensus 201 ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~-~~d~~~~e~~~~~~~~Ia~~~gvp 279 (386) .-|.|--+.+.+. ..++++++++.+.+.+... + ..++++.|++++-+..+ .....|.+..++.-++|+.+. +- T Consensus 229 ~yG~P~~igky~~-~a~~~ek~~L~~av~~i~~--d--a~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i-LG 302 (526) T protein:vir:79 229 IYGLPIRLGKYPP-GTADEEKATLLRAVTGLGH--A--AAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAV-LG 302 (526) T ss_pred HcCCceEEEecCC-CCCHHHHHHHHHHHHHHhc--C--cEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHH-hh Confidence 9898877777754 4578888888888877632 2 36778888777766643 233347888899999998874 11 Q ss_pred HHHhcC---CcCcccHH-HHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhhhc--------------chhhhccCHHHHHH Q lcl|NC_011801. 280 ADYLSG---KQDAQSNI-TMIRAFYQSSLSIYIKPIESELSQKLGT-DVKLD--------------IASAIDSDNSELIN 340 (386) Q Consensus 280 ~~~l~~---~~~~~~~~-~~~~~~~~~~l~P~~~~ie~~l~~~l~~-~~~fd--------------~~~~l~~d~~~~~~ 340 (386) ..+-.. ...+++.. +.......+.+.-.++++++.||+.|.. .+.+| ++..-..|.+++++ T Consensus 303 qtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~ 382 (526) T protein:vir:79 303 GTLTSTTSQSGGGAFALGQVHNEVRHDILASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSMAQ 382 (526) T ss_pred hhhccccccCcchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHHHH Confidence 222211 11233332 2334556677888999999999987632 22222 22234567888999 Q ss_pred HHHHHHhCCC-cCHHHHHHHhccCCcCCCCCCCccccccccCCCCCC Q lcl|NC_011801. 341 NVQKLASAGV-LAPIQAQKLLKNRGVFPELDLDEGTNLLDNTKNIND 386 (386) Q Consensus 341 ~~~~~~~~g~-~t~nE~R~~lg~~p~~p~~~~~~~~~~~~~~~~~~~ 386 (386) .+.+++..|+ ++..++|+.+|.....++++.. .+...+...+. T Consensus 383 ~~~~L~~~G~~i~~~~i~e~~gip~~~~~e~~l---~~~~~~~~~~~ 426 (526) T protein:vir:79 383 SIPALVNVGLEIPSAWVYDKLGIPQPAKNEPVL---RPAAQPAILSR 426 (526) T ss_pred HHHHHHhCCCcCCHHHHHHHhCCCCCCCchhhc---cccCCcccccc Confidence 9999999998 7999999999952211111110 00101100000 No 122 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.60 E-value=1.5e-14 Score=96.33 Aligned_cols=381 Identities=13% Similarity=0.087 Sum_probs=208.0 Q ss_pred Cchhh--hhcccccc---------CCccchhhhhhcccccccCcc-----------cccHHHHhccHHHHHHHHHHHHhh Q lcl|NC_011801. 1 MAFLS--NLFKRQKM---------LSGSSPVWILNQGQPVSIKPK-----------AITSAIALKNSDVYAVISRVSSDI 58 (386) Q Consensus 1 Mg~~~--~l~~~~~~---------~~~~~~~~~~~~~~~~~~~~~-----------~i~~~~a~~~~~v~~~v~~ia~~i 58 (386) |..=+ .+-.+... ...... ......+...+.. .-..+.+..++.+..+|+.+...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~--~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nv 78 (530) T protein:vir:38 1 MKIPSLVGPDGKTSLREYAGYHGGGGGFGG--QLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQLHQDHI 78 (530) T ss_pred CccceeecCccccchHHHhhhhcccCCCCC--cccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHh Confidence 33211 00000000 000000 0000001111111 112333456788999999888877 Q ss_pred ccCceeecch--------------h----HHHHHh---ccCc------ccCCHHHHHHHHHHHHHHhCCeEEEEeecCC- Q lcl|NC_011801. 59 AGCRFVTNAQ--------------P----ITDVLN---APLG------NLMSGFSVWQAMIVQMMLTGNAFAIIDRDTN- 110 (386) Q Consensus 59 a~~p~~~~~~--------------~----~~~~l~---~~PN------~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~- 110 (386) =+-.+++... . +..++. ..|+ -.++..++.+.++..++..|+||+.+.+... T Consensus 79 VG~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~~~~ 158 (530) T protein:vir:38 79 VGSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQATWDSDS 158 (530) T ss_pred hCCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEeeeccCC Confidence 6556544321 1 122232 2333 3568889999999999999999999886543 Q ss_pred C--ceEEEEEEcCcceE--------------EeecCCCceeEEEEeccCc-cc--c------eeEEEcccceeeeccccc Q lcl|NC_011801. 111 G--YPVRIEPVPNEKVT--------------VALDDYGKDLTYTVHFDDS-KR--S------GDFLYDSSEVIHFRCTVS 165 (386) Q Consensus 111 g--~~~~l~~l~~~~v~--------------~~~~~~~~~~~~~~~~~~~-~~--~------~~~~~~~~~vih~~~~~~ 165 (386) | .+..|..|+|+.+. |..|..+..+.|++..... +. . ....++.++|+|+... T Consensus 159 g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~vlH~f~~-- 236 (530) T protein:vir:38 159 TRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGVKINDSGAALGYYVSDDGYPGWMAQNWTYIPRELPGGRPSFIHVFEP-- 236 (530) T ss_pred CCccceEEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEEeeccCCCccccccceeeeeeccChhHeEeeccc-- Confidence 3 24678888888774 4456777777777653321 00 0 1234667799999633 Q ss_pred cCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCC----------CHHHHHHHHHHHHHHh--- Q lcl|NC_011801. 166 GESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATL----------GKEAKENTRQSFEEQT--- 232 (386) Q Consensus 166 ~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~----------~~~~~~~~k~~~~~~~--- 232 (386) ...+..+|+|.+..+...+.....-........+=.+...++|+.+.... ..++...+........ T Consensus 237 -~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (530) T protein:vir:38 237 -MEDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGADNKEQQSKLTGWLGEMAAYY 315 (530) T ss_pred -cCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCccccccccccCCcccccccccccchhhhhcc Confidence 34567899999999999988887777777666665666667776533211 1111111111111110 Q ss_pred c----ccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCc-ccH--HHH---------- Q lcl|NC_011801. 233 T----GENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDA-QSN--ITM---------- 295 (386) Q Consensus 233 ~----~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~-~~~--~~~---------- 295 (386) . .-..|.+..|..|.+++.++.+-...+|.+..+...+.||+.+|||.+.|...-+. +++ ... T Consensus 316 ~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~ 395 (530) T protein:vir:38 316 SAAPVRLGGARVPHLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRNYSQMSYSTARASANESWAYFMG 395 (530) T ss_pred cccceeccCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHH Confidence 0 01245677889999999888775566788999999999999999999998654322 222 111 Q ss_pred -HHHHHHHHHHHHHHH-HHHHHHHhhh-----hhhhh--cchh----------hhccCHHHHHHHHHHHHhCCCcCHHHH Q lcl|NC_011801. 296 -IRAFYQSSLSIYIKP-IESELSQKLG-----TDVKL--DIAS----------AIDSDNSELINNVQKLASAGVLAPIQA 356 (386) Q Consensus 296 -~~~~~~~~l~P~~~~-ie~~l~~~l~-----~~~~f--d~~~----------~l~~d~~~~~~~~~~~~~~g~~t~nE~ 356 (386) ...|...-+.|+.+. +++++..... ..++| +... ....|+..-+++...++++|+.|.-|+ T Consensus 396 ~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~ 475 (530) T protein:vir:38 396 RRKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKE 475 (530) T ss_pred HHHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHH Confidence 122333344554433 3444443221 11111 1111 223488888899999999999998876 Q ss_pred HHHhccC------------------CcCCCCCC--CccccccccCCCCCC Q lcl|NC_011801. 357 QKLLKNR------------------GVFPELDL--DEGTNLLDNTKNIND 386 (386) Q Consensus 357 R~~lg~~------------------p~~p~~~~--~~~~~~~~~~~~~~~ 386 (386) -+..|.. ++.+..+. .-.......+.+..| T Consensus 476 ~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~d 525 (530) T protein:vir:38 476 CAKRGDDYQEIFAQQVRESMERRAAGLNPPAWAAAAFEAGVKKSNEEEQD 525 (530) T ss_pred HHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCcccccCCCCCCCCCCCCC Confidence 6655533 22111110 000111111222222 No 123 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.58 E-value=9.6e-15 Score=97.43 Aligned_cols=380 Identities=11% Similarity=0.036 Sum_probs=208.1 Q ss_pred CchhhhhccccccCCc------c---chhhhh-----hcccccccCcc-----------cccHHHHhccHHHHHHHHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSG------S---SPVWIL-----NQGQPVSIKPK-----------AITSAIALKNSDVYAVISRVS 55 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~------~---~~~~~~-----~~~~~~~~~~~-----------~i~~~~a~~~~~v~~~v~~ia 55 (386) |+||||..+-...... . ...... ....+...+.. .-..+.+..++.+..+|+.+. T Consensus 1 Mn~iDr~i~~~sP~~a~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~ 80 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARRLAAREAIQAYEAARPGRTHKAKRQPLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRLE 80 (548) T ss_pred CchHHhHhhhcchHHHHHHHHhHHHhccccccCccccccccCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 9999986432211000 0 000000 00001111111 011222346788889999886 Q ss_pred Hhhcc---Cceeec----c----hhHHH-------HHhccC--cccCCHHHHHHHHHHHHHHhCCeEEEEeecCCC---- Q lcl|NC_011801. 56 SDIAG---CRFVTN----A----QPITD-------VLNAPL--GNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNG---- 111 (386) Q Consensus 56 ~~ia~---~p~~~~----~----~~~~~-------~l~~~P--N~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g---- 111 (386) +.+-+ +-+... + ..+.+ .+-..+ .-.++.+++...++..++..|++|+.+.+...+ T Consensus 81 ~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~~~ 160 (548) T protein:vir:95 81 ERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPNYTF 160 (548) T ss_pred HhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeecccccccC Confidence 65543 222211 0 11111 122222 345778899999999999999999988764322 Q ss_pred ---ceEEEEEEcCcceE-------------EeecCCCceeEEEEeccCcc-------cceeEEEcccceeeeccccccCc Q lcl|NC_011801. 112 ---YPVRIEPVPNEKVT-------------VALDDYGKDLTYTVHFDDSK-------RSGDFLYDSSEVIHFRCTVSGES 168 (386) Q Consensus 112 ---~~~~l~~l~~~~v~-------------~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~vih~~~~~~~~~ 168 (386) .+..|..|+|+.+. |..|..+..+.|.+.....+ ......+|+++|+|+... .. T Consensus 161 g~~~~~~lqliepd~l~~~~~~~~~~i~~GIE~D~~Grp~aY~i~~~hPgd~~~~~~~~~~~rvpA~~VlHif~~---~r 237 (548) T protein:vir:95 161 ATSVPFALELLEPDYLPFSYNNLSKGIVQGIERDTWRRKRAYHLLKDHPGNLQTLGGSLAVKRVEAERIIHIAYR---KR 237 (548) T ss_pred CcccceEEEEechhhcCCCCCCCCCceeeeeEECCCCceEEEEEeecCCCcccccccccceeeechhHheecccc---cC Confidence 35678889998774 34566677777876544322 223467999999998632 34 Q ss_pred ccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcce-ecCCCc Q lcl|NC_011801. 169 DTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAV-VLDQSA 247 (386) Q Consensus 169 ~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~-vl~~g~ 247 (386) .+..+|+|.+..+...+..............+=.+...++|+.+.... ...+.....-.. ...-..|.++ .|..|. T Consensus 238 ~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~~--~~~~~~~~~~~~-~~~~~pG~iv~~L~pGe 314 (548) T protein:vir:95 238 IGQNRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPDS--YTVEPGKDRKNR-TIPIAPGMVFDDLEPGE 314 (548) T ss_pred CccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCcc--ccCCCCcccccc-cccccCCccccccCCCc Confidence 567899999999999998887777776666665566667777543211 111000000000 0011124333 588899 Q ss_pred eeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccH--HHHH-----------HHHHHHHHHHHHHH-HHH Q lcl|NC_011801. 248 DVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSN--ITMI-----------RAFYQSSLSIYIKP-IES 313 (386) Q Consensus 248 ~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~--~~~~-----------~~~~~~~l~P~~~~-ie~ 313 (386) +++.++.+.....|.+..+...+.||+.+|||.+.|....+.+++ .... ..|...-+.|+.+. ++. T Consensus 315 ~i~~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~s~nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~ 394 (548) T protein:vir:95 315 DVGMIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAYDGTYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQM 394 (548) T ss_pred eeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 999888765566789999999999999999999998765433322 1111 12334445554433 333 Q ss_pred HHHHhhh---hh------hhhcch--hhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCC------------CC Q lcl|NC_011801. 314 ELSQKLG---TD------VKLDIA--SAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPE------------LD 370 (386) Q Consensus 314 ~l~~~l~---~~------~~fd~~--~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~------------~~ 370 (386) ++-.... .+ +..++. .....|+..-+++...++++|+.|.-|+-+..|..|-.-. .+ T Consensus 395 a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl~T~~~~~a~~G~D~~ev~~q~a~E~~~~~~~G 474 (548) T protein:vir:95 395 YLLARKERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAGFADEAEVARARGRDPRELKKSRETEIKANRAAG 474 (548) T ss_pred HHHcCCcCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcC Confidence 3332211 01 111111 1223588888899999999999998877666664321000 00 Q ss_pred CCccccc-cccCCCCCC Q lcl|NC_011801. 371 LDEGTNL-LDNTKNIND 386 (386) Q Consensus 371 ~~~~~~~-~~~~~~~~~ 386 (386) ..-.+.+ ..+.+...| T Consensus 475 L~~~~~~~~~~~~~~~~ 491 (548) T protein:vir:95 475 LVFSSDAYHQLVKSGMD 491 (548) T ss_pred CCCCCcccccccccccC Confidence 0000000 011111111 No 124 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=99.57 E-value=3.4e-13 Score=88.95 Aligned_cols=368 Identities=12% Similarity=0.075 Sum_probs=208.1 Q ss_pred CchhhhhccccccC----Cccchhhhhh----cccc-cccCc------------ccc------cHHHHhccHHHHHHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKML----SGSSPVWILN----QGQP-VSIKP------------KAI------TSAIALKNSDVYAVISR 53 (386) Q Consensus 1 Mg~~~~l~~~~~~~----~~~~~~~~~~----~~~~-~~~~~------------~~i------~~~~a~~~~~v~~~v~~ 53 (386) |+=+=-..+++-.. ....+..... .+.+ ..... -.+ ..+..++.+.|.+|++. T Consensus 1 m~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iL~~a~~gd~~~~~~L~~dm~~~D~hi~s~l~~ 80 (512) T protein:vir:19 1 MGRILDISGQPFDFDDEMQSRSDELAMVMKRTQEHPSSGVTPNRAAQMLRDAERGDLTAQADLAFDMEEKDTHLFSELSK 80 (512) T ss_pred CcceeCCCCCccccccccccccchhcccchhhccccccCCCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHHHHHH Confidence 55432123332111 1111110000 0000 00000 001 11222468899999999 Q ss_pred HHHhhccCceeecc----hh----HHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecC---CCceEEEEEEcCc Q lcl|NC_011801. 54 VSSDIAGCRFVTNA----QP----ITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDT---NGYPVRIEPVPNE 122 (386) Q Consensus 54 ia~~ia~~p~~~~~----~~----~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~---~g~~~~l~~l~~~ 122 (386) +...|.+++|.+.. ++ ++..+...-+......+++..+. +.+++|-+..++++.. ...+..+.+.++. T Consensus 81 Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~~~~ll-dA~~~G~s~~Ei~w~~~~g~~~~~~~~~r~~~ 159 (512) T protein:vir:19 81 RRLAIQALEWRIAPARDASAQEKKDADMLNEYLHDAAWFEDALFDAG-DAILKGYSMQEIEWGWLGKMRVPVALHHRDPA 159 (512) T ss_pred HHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcCCCHHHHHHHHH-hhhhhcceeeeeEeeeeCCceeeeeeeeeccc Confidence 99999999998863 22 22222221121223555665554 5678999999998743 3446788899998 Q ss_pred ceEEeecCCCceeEEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_011801. 123 KVTVALDDYGKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHA 202 (386) Q Consensus 123 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng 202 (386) .+....+.... +.+. +....+ ..+++...+..++. ...+..+|.+.+..+......-....++...|.+.- T Consensus 160 ~f~~~~~~~~~-lr~~---~~~~~G--~~l~~~k~i~~~~~---~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~y 230 (512) T protein:vir:19 160 LFCANPDNLNE-LRLR---DASYHG--LELQPFGWFMHRAK---SRTGYVGTNGLVRTLIWPFIFKNYSVRDFAEFLEIY 230 (512) T ss_pred cceeccCCCcE-EEec---CCCCCc--eeecCCceEEEecc---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHc Confidence 88765544322 2221 111222 33555443333322 123446899999999999999999999999999998 Q ss_pred CCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCCh-hhHHHHHHHHHHHHHHHHHhCCCHH Q lcl|NC_011801. 203 IKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISP-NVTEFLQNVSFSQDQIAKAFGIPAD 281 (386) Q Consensus 203 ~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~-~d~~~~e~~~~~~~~Ia~~~gvp~~ 281 (386) |.|-.+-+.+. ..++++++.+.+.+.+... + ..++++.|++++-+..+. ....|.+..++.-++|+.+.-= .. T Consensus 231 G~P~~igky~~-~a~~~ek~~L~~al~~~~~--~--a~~iiP~~~~ie~~ea~~~~~~~y~~li~~~d~~Isk~iLG-qt 304 (512) T protein:vir:19 231 GLPMRVGKYPT-GSTNREKATLMQAVMDIGR--R--AGGIIPMGMTLDFQSAADGQSDPFMAMIGWAEKAISKAILG-GT 304 (512) T ss_pred CCCeeEEecCC-CCCHHHHHHHHHHHHHHhh--C--cEEEecCCceEEEeecCCCCHHHHHHHHHHHHHHHHHHHhh-hh Confidence 98877666654 4577888888888887532 2 367778887777665432 3334788899999999877310 11 Q ss_pred HhcCC-cCcccHH-HHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhhhcch--------------hhhccCHHHHHHHHHH Q lcl|NC_011801. 282 YLSGK-QDAQSNI-TMIRAFYQSSLSIYIKPIESELSQKLGT-DVKLDIA--------------SAIDSDNSELINNVQK 344 (386) Q Consensus 282 ~l~~~-~~~~~~~-~~~~~~~~~~l~P~~~~ie~~l~~~l~~-~~~fd~~--------------~~l~~d~~~~~~~~~~ 344 (386) +-... .+++++. +.......+.+.-.++.+++.||+.|.. .+.+++. ..-..|.+..++.+.+ T Consensus 305 lTs~~g~~Gs~a~~~vh~ev~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~f~~~e~eDl~~~a~~~~~ 384 (512) T protein:vir:19 305 LTTEAGDKGARSLGEVHDEVRREIRNADVGQLARSINRDLIYPLLALNSDSTIDINRLPGIVFDTSEAGDITALSDAIPK 384 (512) T ss_pred hcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCCChhhHHHHHHHHHH Confidence 11111 1222332 2335566778889999999999987642 2233321 1223577788888888 Q ss_pred HHhCCCcCHHHHHHHhccCCcC-CCCCCCccccccccCCCCCC Q lcl|NC_011801. 345 LASAGVLAPIQAQKLLKNRGVF-PELDLDEGTNLLDNTKNIND 386 (386) Q Consensus 345 ~~~~g~~t~nE~R~~lg~~p~~-p~~~~~~~~~~~~~~~~~~~ 386 (386) +..+--++..++|+.+|. |.+ ++++.. ...+..+..+... T Consensus 385 l~~G~~i~~~~i~e~~Gi-p~~~~~e~~~-~~~~~~~~~~~~~ 425 (512) T protein:vir:19 385 LAAGMRIPVSWIQEKLHI-PQPVGDEAVF-TIQPVVPDNGSQK 425 (512) T ss_pred HhcCCCCCHHHHHHHhCC-CCCCCccccc-cCCCccccccccc Confidence 764333799999999995 322 111111 1111111110000 No 125 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=99.56 E-value=5.7e-13 Score=87.73 Aligned_cols=366 Identities=8% Similarity=0.032 Sum_probs=206.0 Q ss_pred Cchhhh---hccccccCCccchhhhhh-----ccccccc-----------Ccc-cccHHHHhccHHHHHHHHHHHHhhcc Q lcl|NC_011801. 1 MAFLSN---LFKRQKMLSGSSPVWILN-----QGQPVSI-----------KPK-AITSAIALKNSDVYAVISRVSSDIAG 60 (386) Q Consensus 1 Mg~~~~---l~~~~~~~~~~~~~~~~~-----~~~~~~~-----------~~~-~i~~~~a~~~~~v~~~v~~ia~~ia~ 60 (386) =+|++. +++............... ....... .+. .+. +..++.+.|.+|++.+...|.+ T Consensus 3 ~~i~~~~g~~~~~~~~~~~~~~~ia~~~~~~~~~~~~~~~p~~~~il~~~~~~~~~y-~~m~~D~~i~s~l~~Rk~av~~ 81 (491) T protein:vir:79 3 KGLWVSPTEFVKFGEPDKSLSSQIATRARSIDFFALGMYLPNPDPVLKALGKDIRVY-RELRADAHVGGCVRRRKAAVKA 81 (491) T ss_pred CeeeCCCCCcccccccchhHHHHHhhhccccccccccccCcchhHHHhhccCCHHHH-HHHhhChHHHHHHHHHHHHHhC Confidence 233331 111111000000000000 0000000 000 112 2345789999999999999999 Q ss_pred Cceeecch----hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCC---ceEEEEEEcCcceEEeecCCCc Q lcl|NC_011801. 61 CRFVTNAQ----PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNG---YPVRIEPVPNEKVTVALDDYGK 133 (386) Q Consensus 61 ~p~~~~~~----~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g---~~~~l~~l~~~~v~~~~~~~~~ 133 (386) ++|.+... ...+.+...-+ .....+++..+ .+.+++|-+..++++...| .+..+.+.++.++.+..+. . T Consensus 82 ~~w~i~~~~~~~~~a~~i~e~l~-~~~~~~~i~~~-lda~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~d~~~--~ 157 (491) T protein:vir:79 82 LEWGLDRGKAKSRVAKSIADVFA-DLDLSRIATEM-LDAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFVYDPEN--Q 157 (491) T ss_pred CCcEEecCCCCHHHHHHHHHHHh-cCCHHHHHHHH-HHhhhhcceeEEEEEeecCCeeeEEeeeeecccceeeccCC--c Confidence 99998642 12222222111 12355666555 4577899999999876543 3457899999888764432 2 Q ss_pred eeEEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCC Q lcl|NC_011801. 134 DLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPN 213 (386) Q Consensus 134 ~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~ 213 (386) . .+... ++.. ....+++...++.++... .+..+|.|.+..+......-....++...|.+.-|.|-.+.+.+. T Consensus 158 l-~l~~~-~~~~--~g~~lp~~k~i~~~~~~~---~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~~ 230 (491) T protein:vir:79 158 L-RFRSK-EHWV--QGEELPARKFLVPRQEAT---YLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPR 230 (491) T ss_pred e-EEeec-CCCC--CceeecCCCeEEEEecCC---CCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCC Confidence 2 22211 1111 224566766666554322 233689999999999999999999999999999998877777754 Q ss_pred CCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCC---hhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcc Q lcl|NC_011801. 214 ATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNIS---PNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQ 290 (386) Q Consensus 214 ~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~---~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~ 290 (386) ..++++++.+.+.+.+.. .++ .++++.|++++-+..+ .....|.+..++.-++|+.+.- -..+....+++ T Consensus 231 -~a~~~ek~~l~~al~~~~--~~a--~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iL--GqtlTt~~~gs 303 (491) T protein:vir:79 231 -SASDAETNLLLDRLEDMV--QDA--VAVIPDDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIALL--GQNQTTEATST 303 (491) T ss_pred -CCCHHHHHHHHHHHHHHh--cCe--EEEecCCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHHh--hhhhccCcccc Confidence 457788888888777653 223 5677777777665432 2223377778888888877652 01111122333 Q ss_pred cHH-HHHHHHHHHHHHHHHHHHHHHHHHhhhhhhh----------hcchhhhccCHHHHHHHHHHHHhCCC-cCHHHHHH Q lcl|NC_011801. 291 SNI-TMIRAFYQSSLSIYIKPIESELSQKLGTDVK----------LDIASAIDSDNSELINNVQKLASAGV-LAPIQAQK 358 (386) Q Consensus 291 ~~~-~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~----------fd~~~~l~~d~~~~~~~~~~~~~~g~-~t~nE~R~ 358 (386) ++. +.......+.+.-.++.+++.||+.+-..+. |.+... ..+.+.+++.+.+++..|+ ++.+++|+ T Consensus 304 ~a~~~vh~~v~~~i~~~D~~~i~~tln~li~~l~~~N~~~~~~p~f~~~e~-ee~~~~~a~~~~~L~~~G~~i~~~~~~e 382 (491) T protein:vir:79 304 RASAQAGLEVTDDIRDGDKAIVVEAMNMLIRWICDLNFDGAARPVFDMWEQ-EQVDEIQAGRDEKLTRAGARFTPAYFKR 382 (491) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEeecCc-CchhHHHHHHHHHHHhCCCccCHHHHHH Confidence 332 2234455666777888888888863323233 333221 1123567899999999987 79999999 Q ss_pred HhccCCcCCCCCCCccc--------cccccCCCCCC Q lcl|NC_011801. 359 LLKNRGVFPELDLDEGT--------NLLDNTKNIND 386 (386) Q Consensus 359 ~lg~~p~~p~~~~~~~~--------~~~~~~~~~~~ 386 (386) .+|..+...+.+..... ..........+ T Consensus 383 ~~Gip~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 418 (491) T protein:vir:79 383 AYNLQDGDLDERPLPVSAVDAVGAASFAEFEAPDQD 418 (491) T ss_pred HhCCCCCCCCccccCcCcccccccccccccCCCCCc Confidence 99953321111110000 00000111111 No 126 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=99.54 E-value=2.7e-15 Score=100.43 Aligned_cols=370 Identities=14% Similarity=0.077 Sum_probs=199.8 Q ss_pred Cchhhhhcccc-ccCCc-----cchhhhhhcccccccCcc-------cccHHHHhccHHHHHHHHHHHHhhccCcee-ec Q lcl|NC_011801. 1 MAFLSNLFKRQ-KMLSG-----SSPVWILNQGQPVSIKPK-------AITSAIALKNSDVYAVISRVSSDIAGCRFV-TN 66 (386) Q Consensus 1 Mg~~~~l~~~~-~~~~~-----~~~~~~~~~~~~~~~~~~-------~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~-~~ 66 (386) .++ .+.|.-. ...++ ..-...++......++.. +-.....-+.|.++.|+..|++.+..- |. +. T Consensus 67 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~-w~~~~ 144 (698) T protein:vir:10 67 LRL-ARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRT-WGEAI 144 (698) T ss_pred ccc-cccceeccccCCccccchhhhhhcccccccccchhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhcc-cceec Confidence 222 2223211 00000 000000111111111110 111223346788899999999977533 31 10 Q ss_pred ----------------------chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecC--------------- Q lcl|NC_011801. 67 ----------------------AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDT--------------- 109 (386) Q Consensus 67 ----------------------~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~--------------- 109 (386) +.+-.++|...-..+.-+..+.+.+.+.. ++|-+.+++.-+. T Consensus 145 ~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eai~~aR-lfGGa~~~i~I~gdd~~l~~PL~~~~~~ 223 (698) T protein:vir:10 145 GGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQ-AFGRAHPYFKIKGDDQIMDTPLVPRPYT 223 (698) T ss_pred cccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcc-cccceEEEEEeecCcccccccccccccc Confidence 01223344444344444445555555555 5666655443211 Q ss_pred --CCceEEEEEEcCcceEEeec--------CCCceeEEEEeccCcccceeEEEcccceeeeccccccCc---cccccccc Q lcl|NC_011801. 110 --NGYPVRIEPVPNEKVTVALD--------DYGKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGES---DTQYMGIP 176 (386) Q Consensus 110 --~g~~~~l~~l~~~~v~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~---~~~~~G~s 176 (386) .|..+.|.+|+|..|.+... ..+.+.+|.+. +. .+..+.++.+.....++. ...+.|+| T Consensus 224 I~kGslKGL~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~------G~--~IH~SRL~~~vg~pvpd~LKp~y~f~G~S 295 (698) T protein:vir:10 224 VPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMI------GS--EVHATRLHTIVSRPVGDMLKPTYSFAGIS 295 (698) T ss_pred ccCccceeeeeecccccccchhhhccchhhccCCCceEEEe------cc--eecceeEEEecCCCchhhhcchhccCCcc Confidence 24455688888888876432 12223333332 11 245555544432222221 22246999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHH--HHhcccccCcceecC-CCceeeecc Q lcl|NC_011801. 177 PIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFE--EQTTGENAGRAVVLD-QSADVETTN 253 (386) Q Consensus 177 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~--~~~~~~~~g~~~vl~-~g~~~~~~~ 253 (386) ..+.+...+............+.+.-........+ -+.++......+..+++ +.+. +|. ++.+++ ++.+|++.+ T Consensus 296 v~q~~~e~V~~~~rT~~~v~~Li~~~~~~~l~~dl-a~aL~~g~~~~l~~R~eli~~~R-sn~-G~~llDk~~Eefeq~s 372 (698) T protein:vir:10 296 MTQLAMPYIDNWLRTRQSVSDIVKQFSVSGILMDL-AQALTPGANVDLSMRAELINRYR-DNR-NILFLDKATEEFFQFN 372 (698) T ss_pred HHHHHHHHHHHHHHHhhhHHHHHHHhhHHHHHHHH-HHhcCChhhHHHHHHHHHHHHhc-Ccc-ceEEEecCCcceEEEe Confidence 99999999998888777777776543332111000 01122222222333333 2232 333 477788 578999998 Q ss_pred CChhhHHHHHHHHHHHHHHHHHhCCCHHHhc-CCcCcc--cHHHHHHHHH-------HHHHHHHHHHHHHHHHHhhh--- Q lcl|NC_011801. 254 ISPNVTEFLQNVSFSQDQIAKAFGIPADYLS-GKQDAQ--SNITMIRAFY-------QSSLSIYIKPIESELSQKLG--- 320 (386) Q Consensus 254 ~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~-~~~~~~--~~~~~~~~~~-------~~~l~P~~~~ie~~l~~~l~--- 320 (386) .+...+ -+......+.||.+-+||...|- .+..+- ..+...+.|| +.-|.|.++.+-+.+-+..+ T Consensus 373 t~lSGL--ddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~i 450 (698) T protein:vir:10 373 TPLSGL--DALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAV 450 (698) T ss_pred cCcCCH--HHHHHHHHHHHHhhhcCchhhhhccCCcccCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 777666 45677788899999999987764 333322 2233344444 45688988888877766553 Q ss_pred -hhhhhcchhhhccCHHHHHHH-------HHHHHhCCCcCHHHHHHHhccCCcCCCCC-CCccccccccCCCCCC Q lcl|NC_011801. 321 -TDVKLDIASAIDSDNSELINN-------VQKLASAGVLAPIQAQKLLKNRGVFPELD-LDEGTNLLDNTKNIND 386 (386) Q Consensus 321 -~~~~fd~~~~l~~d~~~~~~~-------~~~~~~~g~~t~nE~R~~lg~~p~~p~~~-~~~~~~~~~~~~~~~~ 386 (386) +.+.|.+.++.+++.++++++ ...++..|+++++|+|++|..+|--+..+ .|.-++|..+..+..+ T Consensus 451 dp~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~d~~d~p~~~~~~~~~ 525 (698) T protein:vir:10 451 DPSIKWQWNALRELDDLEVAEARYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGAPADDDID 525 (698) T ss_pred CCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhccCCCccccccCCcccCCCCCCCcch Confidence 467788889999998887765 44577899999999999997654433322 3444455555555444 No 127 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=99.53 E-value=1.3e-12 Score=85.67 Aligned_cols=362 Identities=10% Similarity=0.043 Sum_probs=206.7 Q ss_pred Cchhhhh---ccccccCCccchhhhhh-----ccc--ccccC---------c-ccccHHHHhccHHHHHHHHHHHHhhcc Q lcl|NC_011801. 1 MAFLSNL---FKRQKMLSGSSPVWILN-----QGQ--PVSIK---------P-KAITSAIALKNSDVYAVISRVSSDIAG 60 (386) Q Consensus 1 Mg~~~~l---~~~~~~~~~~~~~~~~~-----~~~--~~~~~---------~-~~i~~~~a~~~~~v~~~v~~ia~~ia~ 60 (386) =+|++-. ..+.............. ... ..... + ..+. +..++.+.|.+|++.+...|.+ T Consensus 3 ~~i~~~~g~p~~~~~~~~~~~~~ia~~~~~~~~~~~~~~~~~~~~iLr~~~~~~~~y-~~m~~D~~i~s~l~~Rk~av~~ 81 (491) T protein:vir:10 3 KGLWVSPTEFVTFGEPDKSLSSQIATRARSIDFFALGMYLPNPDPVLKALGKDIRVY-RELRADAHVGGCVRRRKAAVKA 81 (491) T ss_pred CceeCCCCCccCcccCChHHHHHHHhhhcccccccccCCccchHHHHHhcCCCHHHH-HHHhhChHHHHHHHHHHHHHhC Confidence 1333211 11000000000000000 000 00000 0 0112 2245789999999999999999 Q ss_pred Cceeecch----h----HHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCC---ceEEEEEEcCcceEEeec Q lcl|NC_011801. 61 CRFVTNAQ----P----ITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNG---YPVRIEPVPNEKVTVALD 129 (386) Q Consensus 61 ~p~~~~~~----~----~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g---~~~~l~~l~~~~v~~~~~ 129 (386) ++|.+... . +...| .++ ...+++..+. +.+++|-+..++++...| .+..+.+.|+..+.+..+ T Consensus 82 ~~w~i~~~~~~~~~~e~v~e~l-~~~----~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~d~~ 155 (491) T protein:vir:10 82 LEWGLDRGKAKSRVAKSIADVF-ADL----DLSRIVTEML-DAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFVYDPE 155 (491) T ss_pred CCcEEecCCCCHHHHHHHHHHH-hcC----CHHHHHHHHH-HhhhhcceeEEEEEeecCCeeEEEEeeeecccceeeccC Confidence 99998631 1 22333 222 4667776665 678899999999886554 345788899988876443 Q ss_pred CCCceeEEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEE Q lcl|NC_011801. 130 DYGKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFI 209 (386) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l 209 (386) . . +.+.. .++.. ....+++...++.++.. ..+..+|.|.+..+......-....++...|.+.-|.|-.+. T Consensus 156 ~--~-l~~~~-~~~~~--~g~~l~~~k~i~~~~~~---~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~ig 226 (491) T protein:vir:10 156 N--Q-LRFRS-KDHWM--QGEELPARKFLVPRQEA---TYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVG 226 (491) T ss_pred C--c-eEEec-CCCCC--CcceecCCCEEEEEecC---CCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEE Confidence 2 2 22221 11111 22456676666555432 223468999999999999999999999999999999887777 Q ss_pred eeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCC--hh-hHHHHHHHHHHHHHHHHHhCCCHHHhcCC Q lcl|NC_011801. 210 KVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNIS--PN-VTEFLQNVSFSQDQIAKAFGIPADYLSGK 286 (386) Q Consensus 210 ~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~--~~-d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~ 286 (386) +.+. ..++++++++.+.+.+... + ..++++.|++++-+..+ .. ...|.+..++.-++|+.+.-= ..+... T Consensus 227 ky~~-~a~~~ek~~l~~al~~~~~--~--a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLG--qtlTt~ 299 (491) T protein:vir:10 227 KHPR-SASDGEKNLLLDCLEDMVQ--D--AVAVVPDDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIALLG--QNQTTE 299 (491) T ss_pred ecCC-CCCHHHHHHHHHHHHHHhc--C--cEEEecCCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHHhh--hhcccC Confidence 7754 4578888888888877532 2 25777777777766432 22 223777888888888776321 112212 Q ss_pred cCcccHH-HHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhc----------chhhhccCHHHHHHHHHHHHhCCC-cCHH Q lcl|NC_011801. 287 QDAQSNI-TMIRAFYQSSLSIYIKPIESELSQKLGTDVKLD----------IASAIDSDNSELINNVQKLASAGV-LAPI 354 (386) Q Consensus 287 ~~~~~~~-~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~fd----------~~~~l~~d~~~~~~~~~~~~~~g~-~t~n 354 (386) .+++++. +.......+.+.-.++.+++.||+.+-..+.++ +... ..+.+++++.+.+++..|+ ++.. T Consensus 300 ~~gs~a~~~vh~~v~~di~~~D~~~i~~tln~li~~l~~~N~~~~~~p~f~~~~~-~e~~~~~a~~~~~L~~~G~~i~~~ 378 (491) T protein:vir:10 300 ATSTRASAQAGLEVTDDIRDGDKAVVSEAMNMLIRWICDLNFDGADRPVFDMWEQ-EQVDEIQAGRDQKLTQAGARFTPA 378 (491) T ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEecCc-CchhHHHHHHHHHHHhCCCcCCHH Confidence 2333432 223445566677788888888886332223322 2221 2334678999999999997 7999 Q ss_pred HHHHHhccCCcCCCCCCCc-ccc-------ccccCCCCCC Q lcl|NC_011801. 355 QAQKLLKNRGVFPELDLDE-GTN-------LLDNTKNIND 386 (386) Q Consensus 355 E~R~~lg~~p~~p~~~~~~-~~~-------~~~~~~~~~~ 386 (386) ++|+.+|..+...+.+... ..+ .........+ T Consensus 379 ~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 418 (491) T protein:vir:10 379 YFKRAYNLQDGDLDERPLPVSAVDTVGAASFAEFEAPDQD 418 (491) T ss_pred HHHHHhCCCCCCcCccccccCCCCCcccccccccCCCCCC Confidence 9999999533221111100 000 0000011111 No 128 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.52 E-value=1.3e-13 Score=91.18 Aligned_cols=381 Identities=12% Similarity=0.080 Sum_probs=204.3 Q ss_pred CchhhhhccccccCCcc-chhhhhhcc-------------cccccCcc-----------cccHHHHhccHHHHHHHHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGS-SPVWILNQG-------------QPVSIKPK-----------AITSAIALKNSDVYAVISRVS 55 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~-~~~~~~~~~-------------~~~~~~~~-----------~i~~~~a~~~~~v~~~v~~ia 55 (386) |.-..|..+........ ........+ .+...+.. .-..+.+..++.+..+|+.+. T Consensus 2 ~~~~~r~~~~~a~~~~~~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~~~~ 81 (553) T protein:vir:63 2 TKVTVRKLSEVTSGRPEQSASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVGYQR 81 (553) T ss_pred cchhhhhhcccccccchhhhhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 11111111100000000 000000000 00011100 112223456788889999887 Q ss_pred HhhccCceeecch---------------hH----H---HHHhccC------cccCCHHHHHHHHHHHHHHhCCeEEEEee Q lcl|NC_011801. 56 SDIAGCRFVTNAQ---------------PI----T---DVLNAPL------GNLMSGFSVWQAMIVQMMLTGNAFAIIDR 107 (386) Q Consensus 56 ~~ia~~p~~~~~~---------------~~----~---~~l~~~P------N~~~s~~~f~~~~~~~~~l~G~a~~~~~~ 107 (386) ..+=+-.++.... .+ . +.+-..+ .-.++.+.+...++..++..|+||+.+.+ T Consensus 82 ~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~ 161 (553) T protein:vir:63 82 DSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLATAEW 161 (553) T ss_pred HhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEEEeee Confidence 7765555544311 11 1 2222323 34557888899999999999999998866 Q ss_pred cCC-C--ceEEEEEEcCcceE--------------EeecCCCceeEEEEeccCccc--------------ceeEEEcccc Q lcl|NC_011801. 108 DTN-G--YPVRIEPVPNEKVT--------------VALDDYGKDLTYTVHFDDSKR--------------SGDFLYDSSE 156 (386) Q Consensus 108 ~~~-g--~~~~l~~l~~~~v~--------------~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~ 156 (386) ... | .+..|..|+|+.+. |..|..+..+.|.+...-.+. .....+++++ T Consensus 162 ~~~~~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GVE~d~~Gr~vaY~i~~~hPgd~~~~~~~~~~~~r~~~~~~v~a~~ 241 (553) T protein:vir:63 162 DRAANRPYATCFQMVSTDRLSNPYQQLDTPTLRRGVQYDKRGRPQGYWIQVAHPGDLYQMAPDMYKWKFVQQSKPWGRRQ 241 (553) T ss_pred ccCCCCcccceEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEeeccCCCccccccccccceeeeccccccChhH Confidence 433 2 24578888888774 345666777777764432221 0123578999 Q ss_pred eeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHH---------- Q lcl|NC_011801. 157 VIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQ---------- 226 (386) Q Consensus 157 vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~---------- 226 (386) |+|+... ...+..+|+|.+..+...+..............+=.+...++|+.+.. ++...+.+.. T Consensus 242 vlH~f~~---~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~a~fi~~~~~--~~~~~~~~~~~~~~~~~~~~ 316 (553) T protein:vir:63 242 VIHILEP---REPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASYAAAIESELP--PEFIHSQMSGGSPNADMVGI 316 (553) T ss_pred heecccc---cCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCC--hhhhhhhccccccccccccc Confidence 9999633 345678999999999998888777776666666655555677765431 2222222111 Q ss_pred ------HHHHHhcc-----cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCc-ccH-- Q lcl|NC_011801. 227 ------SFEEQTTG-----ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDA-QSN-- 292 (386) Q Consensus 227 ------~~~~~~~~-----~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~-~~~-- 292 (386) .......+ -..|.+..|..|.+++.++.+-...+|.+..+...+.||+.+|||.+.|...-+. +++ T Consensus 317 ~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~nYSS~ 396 (553) T protein:vir:63 317 FGKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKPMGTPGGVGSEFEASLNRHLASAFGMSYEEFTRDFSKANYSSI 396 (553) T ss_pred ccccccccccccccccceeecCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHH Confidence 11111111 1245678889999999888775566789999999999999999999998654322 222 Q ss_pred HHH-----------HHHHHHHHHHHHHHHH-HHHHHHhhh--h----hhhh------------cch--hhhccCHHHHHH Q lcl|NC_011801. 293 ITM-----------IRAFYQSSLSIYIKPI-ESELSQKLG--T----DVKL------------DIA--SAIDSDNSELIN 340 (386) Q Consensus 293 ~~~-----------~~~~~~~~l~P~~~~i-e~~l~~~l~--~----~~~f------------d~~--~~l~~d~~~~~~ 340 (386) ... ...|....++|+.+.+ ++++-.... + .+-+ .+. ..-..|+..-++ T Consensus 397 R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~ 476 (553) T protein:vir:63 397 QAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQ 476 (553) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecCCccccChHHHHH Confidence 111 1223344455544433 333322110 0 0000 111 122348888889 Q ss_pred HHHHHHhCCCcCHHHHHHHhccC------------------CcCCCCCCC--ccccccccCCCCCC Q lcl|NC_011801. 341 NVQKLASAGVLAPIQAQKLLKNR------------------GVFPELDLD--EGTNLLDNTKNIND 386 (386) Q Consensus 341 ~~~~~~~~g~~t~nE~R~~lg~~------------------p~~p~~~~~--~~~~~~~~~~~~~~ 386 (386) +...++++|+.|.-|+-+..|.. +++...+.. -....-..+++.+| T Consensus 477 A~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~ 542 (553) T protein:vir:63 477 AAVMRIDAGLSTYEREIARLGGDFRKSFAQRAREDALLKKYGLTFNLSAKRSLGDGRDAATGIAED 542 (553) T ss_pred HHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCCCccccCCCcccCCCCCCC Confidence 99999999999988776666643 111110000 00000111111111 No 129 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=99.52 E-value=1.6e-13 Score=90.78 Aligned_cols=373 Identities=14% Similarity=0.114 Sum_probs=203.7 Q ss_pred CchhhhhccccccCCc---c----chhhhhh--------cccccc---------cCcccccHHHHhccHHHHHHHHHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSG---S----SPVWILN--------QGQPVS---------IKPKAITSAIALKNSDVYAVISRVSS 56 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~---~----~~~~~~~--------~~~~~~---------~~~~~i~~~~a~~~~~v~~~v~~ia~ 56 (386) |.--.| +.+...+. . .+..... ...... ...-.+. +..+..+.|.+|++.+.. T Consensus 1 m~k~~~--k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~~~~ly-~~m~~D~hi~s~l~~Rk~ 77 (448) T protein:vir:79 1 MAKRGR--KPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKDGLLVY-HKMLSDGTVKNALNYIFG 77 (448) T ss_pred CCCCCC--CCccccCcccccccccchhhhhhhhhhcccccccccccchhHhhccccchHHH-HHHhhChHHHHHHHHHHH Confidence 433221 11000000 0 0000000 000000 0001122 234568899999999999 Q ss_pred hhccCceeecch---h----HHHHH---hccCccc---CCHHHHHHHHHHHHHHhCCeEEEEeec--CCCc--eEEEEEE Q lcl|NC_011801. 57 DIAGCRFVTNAQ---P----ITDVL---NAPLGNL---MSGFSVWQAMIVQMMLTGNAFAIIDRD--TNGY--PVRIEPV 119 (386) Q Consensus 57 ~ia~~p~~~~~~---~----~~~~l---~~~PN~~---~s~~~f~~~~~~~~~l~G~a~~~~~~~--~~g~--~~~l~~l 119 (386) .|.+++|.+... + .+..+ ...++.. .++.+++.. +.+.+++|-++.++++. .+|. +..|.+. T Consensus 78 av~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~~~~f~~~~~~-~lda~~~G~s~~Eivw~~~~~g~~~~~~l~~r 156 (448) T protein:vir:79 78 RIRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAI-YENAYIYGMAAGEIVLTLGADGKLILDKIVPI 156 (448) T ss_pred HHhcCCceEecCCCCHHHHHHHHHHHHHhhhhhhhhccCCHHHHHHH-HHHhhhhcceeEEEEeeecCCCceeccccccc Confidence 999999998631 1 22222 1223322 234444433 44567899999999875 3554 3356666 Q ss_pred cCcceE-EeecCCCceeEEEEeccCcc----cceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 120 PNEKVT-VALDDYGKDLTYTVHFDDSK----RSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKL 194 (386) Q Consensus 120 ~~~~v~-~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~ 194 (386) ++..++ +..+.++....... ..... ......+|..-++|..+.. .+..+|.+.+..+......-....++ T Consensus 157 ~~~~~~~f~~~~d~~l~~~~~-~~~~~~~~~~~~~~~lP~~~~i~~~~~~----~g~p~g~gLlr~~~w~~~fK~~~~~~ 231 (448) T protein:vir:79 157 HPFNIDEVLYDEEGGPKALKL-SGEVKGGSQFVSGLEIPIWKTVVFLHND----DGSFTGQSALRAAVPHWLAKRALILL 231 (448) T ss_pred CCccccceeeecCCceEEeec-CCcccccccCCCccccccceEEEEecCc----cCCcccchhHHHHHHHHHHHHHHHHH Confidence 665322 22222332222111 11110 0112356778888876432 23458999999999999999999999 Q ss_pred HHHHHhccCCCceEEeeCCC-CCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHH Q lcl|NC_011801. 195 AISTLRHAIKPSIFIKVPNA-TLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIA 273 (386) Q Consensus 195 ~~~~~~ng~~~~~~l~~~~~-~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia 273 (386) ...|.+.-|.|--+.+.+.. ..++++++.+.+...+...|.+++ ++++.|++++-++.+....++.+..++.-++|+ T Consensus 232 w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~--~iiP~~~~ie~~ea~~~~~~~~~~i~~~d~~Is 309 (448) T protein:vir:79 232 INHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHG--IILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIA 309 (448) T ss_pred HHHHHHHcCCceEEEecCCCCCcCHHHHHHHHHHHHHHhcCCceE--EEecCCceEEEEecCCCcccHHHHHHHHHHHHH Confidence 99999998888777776543 335677777877777766666554 678888888877665555556778888888887 Q ss_pred HHhCCCHHHhcC-CcCcccHH--HHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhhhcc-----------hhhhccCHHHH Q lcl|NC_011801. 274 KAFGIPADYLSG-KQDAQSNI--TMIRAFYQSSLSIYIKPIESELSQKLGT-DVKLDI-----------ASAIDSDNSEL 338 (386) Q Consensus 274 ~~~gvp~~~l~~-~~~~~~~~--~~~~~~~~~~l~P~~~~ie~~l~~~l~~-~~~fd~-----------~~~l~~d~~~~ 338 (386) .+.-= ..+.. .+.++... ........+.++-.++++++.||+.|.. .+.+|+ +..-..|.++. T Consensus 310 k~iLG--qtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~li~~l~~lNfg~~~~~P~~~f~~~e~~Dl~~~ 387 (448) T protein:vir:79 310 RALGI--DFNTVQLNMGVQAINIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPSATRFPRLTFEMEERNDFSAA 387 (448) T ss_pred HHHhh--hhhccccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcCCCcEEEecCCChHHHHHH Confidence 76531 11221 11122221 2223455677888999999999987742 233332 22234577788 Q ss_pred HHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCc-cccc--cccCCCCCC Q lcl|NC_011801. 339 INNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDE-GTNL--LDNTKNIND 386 (386) Q Consensus 339 ~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~-~~~~--~~~~~~~~~ 386 (386) ++.+.+++..+-...+-.|+.+|.....|+.+..- +... -.......| T Consensus 388 a~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~~~~a~~~~~~~~~~~~~~~~ 438 (448) T protein:vir:79 388 ANLMGMLINAVKDSEDIPTELKALIDALPSKMRRALGVVDEVREAVRQPAD 438 (448) T ss_pred HHHhhhhhccchhhHHHHHHhhcCCCCCCCccccccCCCCcccccccCCcc Confidence 88888888766444445677766322222211100 0000 011111111 No 130 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.51 E-value=2.7e-13 Score=89.53 Aligned_cols=379 Identities=14% Similarity=0.119 Sum_probs=196.7 Q ss_pred CchhhhhccccccCC-c---cchhhhhhccc----ccccCcc-----------cccHHHHhccHHHHHHHHHHHHhhccC Q lcl|NC_011801. 1 MAFLSNLFKRQKMLS-G---SSPVWILNQGQ----PVSIKPK-----------AITSAIALKNSDVYAVISRVSSDIAGC 61 (386) Q Consensus 1 Mg~~~~l~~~~~~~~-~---~~~~~~~~~~~----~~~~~~~-----------~i~~~~a~~~~~v~~~v~~ia~~ia~~ 61 (386) |+|+++-+--..... . .........+. +..++.. .-..+.+..++.+..+|+.+...+=+- T Consensus 1 m~~~~~~~~a~~~~~~~~~~~~~y~aa~~~~~~~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~~vVG~ 80 (495) T protein:vir:10 1 MNMTPSGYQSLASGLLVPVGASAYEGASGGHRWQDIGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWVAAAVGN 80 (495) T ss_pred CCcccccccccchhhhhHHHhhhhhccccCcccCCCCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCC Confidence 999976221000000 0 00000000000 0111111 111223446788889999888877444 Q ss_pred ceeec----chhH----HHHH---hccC--cccCCHHHHHHHHHHHHHHhCCeEEEEeecC--CC--ceEEEEEEcCcce Q lcl|NC_011801. 62 RFVTN----AQPI----TDVL---NAPL--GNLMSGFSVWQAMIVQMMLTGNAFAIIDRDT--NG--YPVRIEPVPNEKV 124 (386) Q Consensus 62 p~~~~----~~~~----~~~l---~~~P--N~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~--~g--~~~~l~~l~~~~v 124 (386) .++.. ...+ ..++ ...+ .-.++.+.+.+.++..++..|+||+.+.... .| .+..|..|+|+.+ T Consensus 81 Gi~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~~~~~~lqliepd~l 160 (495) T protein:vir:10 81 GLTPRWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEGLSVPLQLQIIEPDML 160 (495) T ss_pred CcccccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccCCCCccceEEEEechhhc Confidence 44332 1112 2222 2222 3456788899999999999999999876543 33 3568888888887 Q ss_pred E-----------------EeecCCCceeEEEEeccCccc-------ceeEEEcccceeeeccccccCcccccccccHHHH Q lcl|NC_011801. 125 T-----------------VALDDYGKDLTYTVHFDDSKR-------SGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDS 180 (386) Q Consensus 125 ~-----------------~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~ 180 (386) . |..|..+..+.|.+.....+. .....+|+++|+|+ +.. ..+..+|+|.+.. T Consensus 161 ~~~~~~~~~~~g~~i~~GIe~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~~rvpA~~vlH~-f~~---r~gQ~RGis~la~ 236 (495) T protein:vir:10 161 ASDIPDETLPSGGYVKGGIRFSNGGKRKAYCFYRNHPAESSLIGDPVDTVWIKAEHVLHV-TVL---TVRSDAGAPWFQL 236 (495) T ss_pred CCCCCCCCCCCCCEEEeceEECCCCceEEEEEeecCCCcccccccccceeeechhheEec-ccc---CCCcccCcchhHH Confidence 4 223455666777765433321 12356999999999 432 2467899997654 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHH----HHHHHHHhcccccCcceecCCCceeeeccCCh Q lcl|NC_011801. 181 LLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENT----RQSFEEQTTGENAGRAVVLDQSADVETTNISP 256 (386) Q Consensus 181 ~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~----k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~ 256 (386) +. .+..............+=.+...++|+.+... +...... .+.-......-..|.+..|..|.+++.++.+. T Consensus 237 i~-~l~~l~~y~dael~~a~i~A~~~~fi~~~~~~--~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~ 313 (495) T protein:vir:10 237 LL-RLNELDQYEDAELVRKKTAALFAAFIQEATAD--STGGPTIGQPKRSKGGKRITGLNPGTLQYLQPGQEVKFSNPAD 313 (495) T ss_pred HH-HHHHhhHHHHHHHHHHHHhhhheeeeecCCCc--cccccccCccccccCcccceecCCceeeecCCCCeeeeeCCCC Confidence 44 34444444444444444444556666653221 1100000 00000011112345688899999999988765 Q ss_pred hhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcc-cH--HH----HHHH--------HHHHHHHHHHHH-HHHHHHHhhh Q lcl|NC_011801. 257 NVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQ-SN--IT----MIRA--------FYQSSLSIYIKP-IESELSQKLG 320 (386) Q Consensus 257 ~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~-~~--~~----~~~~--------~~~~~l~P~~~~-ie~~l~~~l~ 320 (386) ....|.+..+...+.||+.+|||.+.|...-..+ ++ .. ..+. +....+.|+.+. ++.++-.... T Consensus 314 p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i 393 (495) T protein:vir:10 314 VGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVNYSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAV 393 (495) T ss_pred CCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCC Confidence 5567889999999999999999999986543222 22 11 1111 233344454433 3333332211 Q ss_pred --h-h---------hhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccC------------------CcC---- Q lcl|NC_011801. 321 --T-D---------VKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNR------------------GVF---- 366 (386) Q Consensus 321 --~-~---------~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~------------------p~~---- 366 (386) + + +++-.-.....|+..-+++...++++|+.|.-|+-+..|.. +++ T Consensus 394 ~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~ 473 (495) T protein:vir:10 394 VIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAGFAPISDKQAERGYDMEELFDMISDANQLIDEYDLRLDSD 473 (495) T ss_pred CCCCchhhhHhhhccccccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCC Confidence 1 1 11111122345888888999999999999988766555543 111 Q ss_pred CCC-CCCccccccccCCCCCC Q lcl|NC_011801. 367 PEL-DLDEGTNLLDNTKNIND 386 (386) Q Consensus 367 p~~-~~~~~~~~~~~~~~~~~ 386 (386) |.. ...+..+.-....+-+| T Consensus 474 p~~~~~~~~~~~~~~~~~~~~ 494 (495) T protein:vir:10 474 PRYVNGSGAEQKSVMEAALNN 494 (495) T ss_pred CCcCCCccCCCCCCCCCCCCC Confidence 000 00000010111111111 No 131 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=99.51 E-value=1.1e-13 Score=91.69 Aligned_cols=360 Identities=11% Similarity=0.118 Sum_probs=200.4 Q ss_pred Cchhhhh---ccccccCCccchhhhhhcccccc---cCcc------cccHHHHhccHHHHHHHHHHHHhhccCceeecch Q lcl|NC_011801. 1 MAFLSNL---FKRQKMLSGSSPVWILNQGQPVS---IKPK------AITSAIALKNSDVYAVISRVSSDIAGCRFVTNAQ 68 (386) Q Consensus 1 Mg~~~~l---~~~~~~~~~~~~~~~~~~~~~~~---~~~~------~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~~ 68 (386) |-+-... +.+.................+.. ..+. .+..+...+.+.|.+|++.+...|.+++|.|... T Consensus 3 ~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~D~~lr~~gg~~~~~~~l~~~m~e~D~~v~s~l~~Rk~av~~~~w~V~p~ 82 (446) T protein:vir:98 3 MEVRNAPTPAIRRRTIYAMEHLGLATSYLSEDGGYKRAGKPTYQQLSAWDEAAQTEPIIAQGLDSIALSVLNKVGPYQHG 82 (446) T ss_pred ccccCCCchhhhhhhhhccccchhhcccCCcchHhhhcCCChHHHHHHHHHHHhcchHHHHHHHHHHHHhhcCCceecCc Confidence 2222111 00011111101000000000000 0111 1222333458999999999999999999998753 Q ss_pred h--HH----HHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCC-Cc--eE----EEEEEcCcceEEeecCCCcee Q lcl|NC_011801. 69 P--IT----DVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTN-GY--PV----RIEPVPNEKVTVALDDYGKDL 135 (386) Q Consensus 69 ~--~~----~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~-g~--~~----~l~~l~~~~v~~~~~~~~~~~ 135 (386) + .+ ..|.... .+.....+.+.+.+|-++.++++... |. +. .+....|..++...+.+...+ T Consensus 83 ~~~~a~~v~~~l~~~~------~~~~~~~~ldai~~G~s~~Eivw~~~~g~~~p~~~~d~~~~~~~~~~r~~~~~~~~~~ 156 (446) T protein:vir:98 83 DKRIKKFIDDQLRNRA------KTWISHCVKSIMTYGFSLSEQIYAHGARDNMPATVLDDIVNYHPLQVMLIANDNGRIV 156 (446) T ss_pred cHHHHHHHHHHHhhcC------chhHHHHHHHHHhhCceeeeEEEeecccccccchhhccccccccccceeeeccCCccc Confidence 2 22 3332221 23444557888999999999987532 11 11 111112222222222221111 Q ss_pred EE-EE---------------------eccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 136 TY-TV---------------------HFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSK 193 (386) Q Consensus 136 ~~-~~---------------------~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~ 193 (386) .. .. .......+..+.+|....++++|... .+..+|.|.+..+......-....+ T Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~iP~~kfi~~~~~~~---~~~p~G~gLlr~~~w~~~fK~~~~~ 233 (446) T protein:vir:98 157 DGDTVTASQYKSGYWVPLPPYRIGDPPKKVDVVGSHVRLPSHKRLFINYNTK---GNNPWGTSCLTSVLDYSIFKRAFRD 233 (446) T ss_pred cccccchhhcccccccCcccchhhhhhhhcccCcccccccccceEEEEecCC---CCCccccchHHHHHHHHHHHHhhHH Confidence 00 00 00001122345578888887776532 3446899999999999999999999 Q ss_pred HHHHHHhccCCCceEEeeCCCCC-----CHHH---HHHHHHHHHHHhcc--cccCcce---ecCCCceeeeccCChh-hH Q lcl|NC_011801. 194 LAISTLRHAIKPSIFIKVPNATL-----GKEA---KENTRQSFEEQTTG--ENAGRAV---VLDQSADVETTNISPN-VT 259 (386) Q Consensus 194 ~~~~~~~ng~~~~~~l~~~~~~~-----~~~~---~~~~k~~~~~~~~~--~~~g~~~---vl~~g~~~~~~~~~~~-d~ 259 (386) +...|.+.-|.|--+-+.+.+.. +++. .+...+.+.+.... ..++.++ +++.|++++-++.... .. T Consensus 234 ~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~av~~~~~da~~ii~~~~~P~g~eie~~ea~~~~~~ 313 (446) T protein:vir:98 234 MMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDALRRLSTDSGLVLTQLSKEQPVQVGALTTGNNFSD 313 (446) T ss_pred HHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHHHHHhccccceeeeecccCCCCceEEeeccccCChh Confidence 99999999898877777754332 1111 12222333333322 2233222 2488998887765432 23 Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHhcCC--cCcccHH-HHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhhhcchh------ Q lcl|NC_011801. 260 EFLQNVSFSQDQIAKAFGIPADYLSGK--QDAQSNI-TMIRAFYQSSLSIYIKPIESELSQKLGT-DVKLDIAS------ 329 (386) Q Consensus 260 ~~~e~~~~~~~~Ia~~~gvp~~~l~~~--~~~~~~~-~~~~~~~~~~l~P~~~~ie~~l~~~l~~-~~~fd~~~------ 329 (386) +|.+..++.-++|+.+.....-.++.. ..++++. +.......+.++-.+++|++.||+.|.. -+.+|+.+ T Consensus 314 ~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~ala~vh~~V~~d~~~aDa~~i~~tln~~Li~~l~~lNf~~~~~~~~ 393 (446) T protein:vir:98 314 SFERAISLCDNNMLMGMGIPNLLVQNRETTFGTGRASEIQLELFDGKINSIFDTVIHAFTEQVIGNLIRLNFDPALYPLA 393 (446) T ss_pred hHHHHHHHHHHHHHHHHhcccccccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccc Confidence 588889999999999887665444432 2244432 2334456677888999999999997742 22333211 Q ss_pred ----------hhccCHHHHHHHHHHHHhCCCcCH---HHHHHHhccCCcCCCC Q lcl|NC_011801. 330 ----------AIDSDNSELINNVQKLASAGVLAP---IQAQKLLKNRGVFPEL 369 (386) Q Consensus 330 ----------~l~~d~~~~~~~~~~~~~~g~~t~---nE~R~~lg~~p~~p~~ 369 (386) --..|.+..++++.+++..|+.++ +.+|+.+|..+.+|.- T Consensus 394 ~~~~~~~~~~~e~eDl~~~a~~~~~L~~~G~~~p~~~~~ire~~giP~~~~~~ 446 (446) T protein:vir:98 394 SNTGYITRLPGRATDLAALVEAIKQMHDMGFLVDGDKDHIRSITGLPDAISST 446 (446) T ss_pred cccccceeccCChhhHHHHHHHHHHHHhCCccccccHHHHHHHhCcCCCCCCC Confidence 124578889999999999998765 4599999853322211 No 132 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.51 E-value=3.6e-13 Score=88.77 Aligned_cols=381 Identities=13% Similarity=0.083 Sum_probs=206.4 Q ss_pred Cchhhhhccccc--c----------CCccchhhhhhcccccccCcc-----------cccHHHHhccHHHHHHHHHHHHh Q lcl|NC_011801. 1 MAFLSNLFKRQK--M----------LSGSSPVWILNQGQPVSIKPK-----------AITSAIALKNSDVYAVISRVSSD 57 (386) Q Consensus 1 Mg~~~~l~~~~~--~----------~~~~~~~~~~~~~~~~~~~~~-----------~i~~~~a~~~~~v~~~v~~ia~~ 57 (386) |--+.++.+-.. + ........ ....+...+.. .-..+.+..++.+..+|+.+... T Consensus 3 ~p~~~~~~~~~~~~~~~~~~~y~~~a~~~~~~~--~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~n 80 (533) T protein:vir:34 3 TPTIPTLLGPDGMTSLREYAGYHGGGSGFGGQL--RSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQDH 80 (533) T ss_pred CchhhhhhcccccchHHHHHhhhhccCCCCCcc--cccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHH Confidence 222221111000 0 00000000 00000111111 11223345678899999988887 Q ss_pred hccCceeecch--------------hHH-------HHHhccC------cccCCHHHHHHHHHHHHHHhCCeEEEEeecCC Q lcl|NC_011801. 58 IAGCRFVTNAQ--------------PIT-------DVLNAPL------GNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTN 110 (386) Q Consensus 58 ia~~p~~~~~~--------------~~~-------~~l~~~P------N~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~ 110 (386) +=+-.+++... .+. +.+-..| .-.++.+++.+.++..++..|+||+.+.+... T Consensus 81 vVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~ 160 (533) T protein:vir:34 81 IVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATWDTS 160 (533) T ss_pred hhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEeeeccC Confidence 75445543321 111 1222223 33557889999999999999999999876543 Q ss_pred -C--ceEEEEEEcCcceE--------------EeecCCCceeEEEEeccCccc---------ceeEEEcccceeeecccc Q lcl|NC_011801. 111 -G--YPVRIEPVPNEKVT--------------VALDDYGKDLTYTVHFDDSKR---------SGDFLYDSSEVIHFRCTV 164 (386) Q Consensus 111 -g--~~~~l~~l~~~~v~--------------~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~vih~~~~~ 164 (386) | .+..|..|+|+.+. |..|..+..+.|.+....... .....++.++|+|+... T Consensus 161 ~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~VlH~f~~- 239 (533) T protein:vir:34 161 SSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELPGGRASFIHVFEP- 239 (533) T ss_pred CCCccceEEEEechhhcCCCCCCCCCCceEeeeEECCCCCeEEEEEeecCCCCccccccceeeeeeccChhHeeeeccc- Confidence 2 24678888887764 445666777777765321110 01234677899999643 Q ss_pred ccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCC----------CHHHHHHHHHHH---HHH Q lcl|NC_011801. 165 SGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATL----------GKEAKENTRQSF---EEQ 231 (386) Q Consensus 165 ~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~----------~~~~~~~~k~~~---~~~ 231 (386) ...+..+|+|.+..+...+.....-........+=.+.-.++|+.+.... ..+..+.+.... ... T Consensus 240 --~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (533) T protein:vir:34 240 --VEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGANSQEQRERLTGWIGEIAAY 317 (533) T ss_pred --cCCCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeecCCCcccccccccCCCcccccccccccchhhhhc Confidence 34567899999999999988877777776666666666667776542211 111111111111 111 Q ss_pred hcc----cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCc-CcccH--HH---------- Q lcl|NC_011801. 232 TTG----ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQ-DAQSN--IT---------- 294 (386) Q Consensus 232 ~~~----~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~-~~~~~--~~---------- 294 (386) ..+ -..|.+..|..|.+++.++.+-...+|.+..+...+.||+.+|||.+.|...- +.+++ .. T Consensus 318 ~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~ 397 (533) T protein:vir:34 318 YAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFM 397 (533) T ss_pred cCcceeeccCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHH Confidence 111 12356778999999998887766677899999999999999999999986543 22222 11 Q ss_pred -HHHHHHHHHHHHHHHH-HHHHHHHhhhh-----hhhhcc------------hhhhccCHHHHHHHHHHHHhCCCcCHHH Q lcl|NC_011801. 295 -MIRAFYQSSLSIYIKP-IESELSQKLGT-----DVKLDI------------ASAIDSDNSELINNVQKLASAGVLAPIQ 355 (386) Q Consensus 295 -~~~~~~~~~l~P~~~~-ie~~l~~~l~~-----~~~fd~------------~~~l~~d~~~~~~~~~~~~~~g~~t~nE 355 (386) ....|....+.|+.+. +++++...... .+.|.. -.....|+..-+++...++++|+.|.-| T Consensus 398 ~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~ 477 (533) T protein:vir:34 398 GRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEK 477 (533) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccChHHHHHHHHHHHHcCCCCHHH Confidence 1122444445565543 33333332110 111111 1123348888889999999999999887 Q ss_pred HHHHhccCC------------------cCCCCCCC---ccccccccCCCCCC Q lcl|NC_011801. 356 AQKLLKNRG------------------VFPELDLD---EGTNLLDNTKNIND 386 (386) Q Consensus 356 ~R~~lg~~p------------------~~p~~~~~---~~~~~~~~~~~~~~ 386 (386) +-+..|..| +.+..+.. ..+..-..+...+| T Consensus 478 ~~a~~G~D~~ev~~q~a~e~~~~~~~gl~~~~~~~~~~~s~~~~~~~~~~~~ 529 (533) T protein:vir:34 478 ECAKRGDDYQEIFAQQVRETMERRAAGLKPPAWAAAAFESGLRQSTEEEKSD 529 (533) T ss_pred HHHHcCCCHHHHHHHHHHHHHHHHhcCCCCCCCCCcCccCCCCCCCCCCccc Confidence 766666431 11110000 00001111111111 No 133 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=99.48 E-value=3.7e-14 Score=94.22 Aligned_cols=370 Identities=14% Similarity=0.046 Sum_probs=195.4 Q ss_pred CchhhhhccccccCC-c--c---chhhhhhcccccccCcc-------cccHHHHhccHHHHHHHHHHHHhhccCcee-ec Q lcl|NC_011801. 1 MAFLSNLFKRQKMLS-G--S---SPVWILNQGQPVSIKPK-------AITSAIALKNSDVYAVISRVSSDIAGCRFV-TN 66 (386) Q Consensus 1 Mg~~~~l~~~~~~~~-~--~---~~~~~~~~~~~~~~~~~-------~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~-~~ 66 (386) -+=|.++|.-..... + . .-....+......++.. +-.....-+.|.++.|+..|++.+..- |. +. T Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~-w~~~~ 143 (694) T protein:vir:10 65 SLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRT-WGEAI 143 (694) T ss_pred chhhhhhccccccCCCccccchhhhhhccCcccccchhhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhcc-cceec Confidence 122233332111100 0 0 00000001111111100 112223346788999999999977533 31 10 Q ss_pred ----------------------chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecC--------------- Q lcl|NC_011801. 67 ----------------------AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDT--------------- 109 (386) Q Consensus 67 ----------------------~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~--------------- 109 (386) +.+-.++|...-..+.-+..|.+.+.+.. ++|-+.+++.-+. T Consensus 144 ~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eaik~aR-lfGGa~~~i~I~gdd~~l~~PL~~~~~~ 222 (694) T protein:vir:10 144 GGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQ-AFGRAHPYFKIKGDDQIMDTPLVPRPYT 222 (694) T ss_pred cccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhc-cccceEEEEEeecCcccccccccccccc Confidence 01223344444344444445555555555 5676665553222 Q ss_pred --CCceEEEEEEcCcceEEeec--------CCCceeEEEEeccCcccceeEEEcccceeeeccccccCc---cccccccc Q lcl|NC_011801. 110 --NGYPVRIEPVPNEKVTVALD--------DYGKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGES---DTQYMGIP 176 (386) Q Consensus 110 --~g~~~~l~~l~~~~v~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~---~~~~~G~s 176 (386) .|..+.|.+|+|..+.+... ..+.+.+|.+. + ..+..+.++.|.-...++. ...+.|+| T Consensus 223 I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~------G--~~IH~SRL~~f~g~plPd~LKp~y~~~G~S 294 (694) T protein:vir:10 223 VPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMI------G--TEVHATRLHTIVSRPVGDMLKPTYSFAGIS 294 (694) T ss_pred ccCcceeeeEeecccccccchhhhccchhhccCCCceEEEe------c--eEEeeeeEEEecCCCchhhhhcccccCccc Confidence 34455688888888877432 12223333331 1 1244454444432221111 22357999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeC-CCCCCHHHHHHHHHHHH--HHhcccccCcceecC-CCceeeec Q lcl|NC_011801. 177 PIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVP-NATLGKEAKENTRQSFE--EQTTGENAGRAVVLD-QSADVETT 252 (386) Q Consensus 177 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~-~~~~~~~~~~~~k~~~~--~~~~~~~~g~~~vl~-~g~~~~~~ 252 (386) ..+.+...+............+.+.-... ++ ++. -+.+.......+..+++ +.+. +|. ++.+++ +..+|++. T Consensus 295 v~q~~~e~V~~~~rT~~~v~~Li~~~~v~-~l-k~dla~~L~~g~~~~l~~R~eli~~~R-sn~-G~~llDk~~Eefeq~ 370 (694) T protein:vir:10 295 MTQLAMPYIDNWLRTRQSVSDIVKQFSVS-GI-LMDLAQALMPGANVDLSMRAELINRYR-DNR-NILFLDKATEEFFQF 370 (694) T ss_pred HHHHHHHHHHHHHHHHhHHHHHHHhhhhH-HH-HHHHHHhhcChhHHHHHHHHHHHHHhc-Ccc-ceEEEecCCcceEEE Confidence 99999999988888877777776553332 21 110 01122222222332232 2232 333 477888 47899999 Q ss_pred cCChhhHHHHHHHHHHHHHHHHHhCCCHHHh-cCCcCcc--cHHHHHHHHH-------HHHHHHHHHHHHHHHHHhhh-- Q lcl|NC_011801. 253 NISPNVTEFLQNVSFSQDQIAKAFGIPADYL-SGKQDAQ--SNITMIRAFY-------QSSLSIYIKPIESELSQKLG-- 320 (386) Q Consensus 253 ~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l-~~~~~~~--~~~~~~~~~~-------~~~l~P~~~~ie~~l~~~l~-- 320 (386) +.+...+ -+......+.||.+-+||...| |.+..+- ..+...+.|| +.-|.|.++.+-+.+-+..+ T Consensus 371 stslSGL--ddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~ 448 (694) T protein:vir:10 371 NTPLSGL--DALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGA 448 (694) T ss_pred ecccCCH--HHHHHHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 8777666 4557778889999999998776 3333332 2233344444 45688988888777766543 Q ss_pred --hhhhhcchhhhccCHHHHHHH-------HHHHHhCCCcCHHHHHHHhccCCcCCCCC-CCccccccccCCCCCC Q lcl|NC_011801. 321 --TDVKLDIASAIDSDNSELINN-------VQKLASAGVLAPIQAQKLLKNRGVFPELD-LDEGTNLLDNTKNIND 386 (386) Q Consensus 321 --~~~~fd~~~~l~~d~~~~~~~-------~~~~~~~g~~t~nE~R~~lg~~p~~p~~~-~~~~~~~~~~~~~~~~ 386 (386) +.+.|.+.++.+++.++++++ ...++..|+++++|+|..|..+|--+..+ .|..++|..+..+-.+ T Consensus 449 idp~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~ 524 (694) T protein:vir:10 449 VDPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDID 524 (694) T ss_pred CCCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhh Confidence 467788888999988887655 45677899999999999987543322222 2223333322222111 No 134 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=99.48 E-value=1.2e-12 Score=85.99 Aligned_cols=364 Identities=13% Similarity=0.089 Sum_probs=203.3 Q ss_pred CchhhhhccccccC-Cccc---hh------------hhhhccccc---------ccCcccccHHHHhccHHHHHHHHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKML-SGSS---PV------------WILNQGQPV---------SIKPKAITSAIALKNSDVYAVISRVS 55 (386) Q Consensus 1 Mg~~~~l~~~~~~~-~~~~---~~------------~~~~~~~~~---------~~~~~~i~~~~a~~~~~v~~~v~~ia 55 (386) |.-=++ +++.. +... +. ......... ....-.+. +..+..+.|.+|++.+. T Consensus 1 m~kk~~---k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~~~~ly-~~m~~D~hi~s~l~~Rk 76 (448) T protein:vir:77 1 MAKRGR---KPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKDGLLVY-HKMLSDGTVKNALNYIF 76 (448) T ss_pred CCCCCC---CCcccCCcccccchhhhhhhccchhhhcccccccccccchhHhhccccchHHH-HHHhhChHHHHHHHHHH Confidence 443221 11111 1000 00 000000000 00001122 23456889999999999 Q ss_pred HhhccCceeecch-------hHHHHHhc---cCc---ccCCHHHHHHHHHHHHHHhCCeEEEEeec--CCCce--EEEEE Q lcl|NC_011801. 56 SDIAGCRFVTNAQ-------PITDVLNA---PLG---NLMSGFSVWQAMIVQMMLTGNAFAIIDRD--TNGYP--VRIEP 118 (386) Q Consensus 56 ~~ia~~p~~~~~~-------~~~~~l~~---~PN---~~~s~~~f~~~~~~~~~l~G~a~~~~~~~--~~g~~--~~l~~ 118 (386) ..|.+++|.|... ..+..+.. .+. ...++.+++..+ .+.+.+|-+..++++. .+|.. ..|.+ T Consensus 77 ~av~~~~w~v~p~~~~~~d~~~ae~v~~~l~~~~~~~~~~~f~~~i~~~-lda~~~G~s~~Eivw~~~~dg~~~~~~l~~ 155 (448) T protein:vir:77 77 GRIRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIY-ENAYIYGMAAGEIVLTLGADGKLILDKIVP 155 (448) T ss_pred HHHhcCCceEecCCCCHHHHHHHHHHHHHhhchhhhhccCCHHHHHHHH-HHhhhhcceeEEEEEeecCCCceeeccccc Confidence 9999999998631 12222221 121 223566677666 5788999999999875 35543 35666 Q ss_pred EcCcceE-EeecCCCceeEEEEeccCcc----cceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 119 VPNEKVT-VALDDYGKDLTYTVHFDDSK----RSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSK 193 (386) Q Consensus 119 l~~~~v~-~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~ 193 (386) .++..++ ...+.++.... ........ ......+|..-++|.++.. .+..+|.|.+..+......-....+ T Consensus 156 r~~~~~~~f~~~~~~~l~~-~~~~~~~~~~~~~~~~~~lP~~~~i~~~~~~----~g~p~g~gLlr~~~w~~~fK~~~~~ 230 (448) T protein:vir:77 156 IHPFNIDEVLYDEEGGPKA-LKLSGEVKGGSQFVNGLEIPIWKTVVFLHND----DGSFTGQSALRAAVPHWLAKRALIL 230 (448) T ss_pred cCCCccceeeeecCCceEE-EecCCcccccccCCCccccccceEEEEecCC----cCCcccchHHHHHHHHHHHHHhhHH Confidence 6664332 22233332221 11111110 1112456778888876432 2335799999999999999999999 Q ss_pred HHHHHHhccCCCceEEeeCCC-CCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHH Q lcl|NC_011801. 194 LAISTLRHAIKPSIFIKVPNA-TLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQI 272 (386) Q Consensus 194 ~~~~~~~ng~~~~~~l~~~~~-~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~I 272 (386) +...|.+.-|.|--+.+.+.. ..++++++.+.+...+...|.+++ ++++.|++++-++.+....++.+..++.-++| T Consensus 231 ~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~--~iiP~g~~ie~~ea~~~~~~~~~~i~~~d~~I 308 (448) T protein:vir:77 231 LINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHG--IILPDDWKFDTVDLKSAMPDAIPYLTYHDAGI 308 (448) T ss_pred HHHHHHHHcCCceeEEecCCCCCCCHHHHHHHHHHHHHHhcCCceE--EEecCCceEEEEecCCCccCHHHHHHHHHHHH Confidence 999999998988777776543 345677777888777765565553 67888888877766555555677888888888 Q ss_pred HHHhCCCHHHhcCCcCc--ccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhhhcc-----------hhhhccCHHHH Q lcl|NC_011801. 273 AKAFGIPADYLSGKQDA--QSNITMIRAFYQSSLSIYIKPIESELSQKLGT-DVKLDI-----------ASAIDSDNSEL 338 (386) Q Consensus 273 a~~~gvp~~~l~~~~~~--~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~~-~~~fd~-----------~~~l~~d~~~~ 338 (386) +.+..-.- +-...+.+ +...........+.+.-.+++|++.||+.|.. .+.+|+ +..-..|.+++ T Consensus 309 sk~iLGqt-lTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~Li~~l~~lNfg~~~~~P~~~f~~~e~eDl~~~ 387 (448) T protein:vir:77 309 ARALGIDF-NTVQLNMGVQAVNIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPGATRFPRLTFEMEERNDFSAA 387 (448) T ss_pred HHHHhccc-cccccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCEEEecCCChhhHHHH Confidence 87764322 21111112 22222223456677888999999999987743 223332 11233577778 Q ss_pred HHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCccccccccCCCCCC Q lcl|NC_011801. 339 INNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTNLLDNTKNIND 386 (386) Q Consensus 339 ~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~~~~~~~~~~~ 386 (386) ++.+.+++ +-+|+.+|.....+++.......+ ......+ T Consensus 388 a~~~~~l~-------~~~~~~~~ip~~~~~~~~~~~~~~--~~~~~~~ 426 (448) T protein:vir:77 388 ANLMGMLI-------NAVKDSEDIPTELKALIDALPSKM--RRALGVV 426 (448) T ss_pred HHHhHHHH-------HHHHHHhcCCccCCcCCCCCchhc--ccccCCC Confidence 88888876 457888774221111111111111 1111111 No 135 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=99.47 E-value=5.2e-14 Score=93.43 Aligned_cols=369 Identities=14% Similarity=0.057 Sum_probs=194.9 Q ss_pred Cchhhhhcccc-ccCCc-----cchhhhhhcccccccCcc-------cccHHHHhccHHHHHHHHHHHHhhccCcee-ec Q lcl|NC_011801. 1 MAFLSNLFKRQ-KMLSG-----SSPVWILNQGQPVSIKPK-------AITSAIALKNSDVYAVISRVSSDIAGCRFV-TN 66 (386) Q Consensus 1 Mg~~~~l~~~~-~~~~~-----~~~~~~~~~~~~~~~~~~-------~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~-~~ 66 (386) .++ .+.|.-. ...++ ..-...++......++.. +-.....-+.|.++.|+..|++.+..- |. +. T Consensus 67 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~-w~~~~ 144 (695) T protein:vir:78 67 LRL-ARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRT-WGEAI 144 (695) T ss_pred ccc-ceeceeccccCCccccchhhhhhcccccccccchhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhcc-cceec Confidence 222 2222211 00000 000000111111111110 111223346788899999999977533 31 10 Q ss_pred ----------------------chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecC--------------- Q lcl|NC_011801. 67 ----------------------AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDT--------------- 109 (386) Q Consensus 67 ----------------------~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~--------------- 109 (386) +.+-.++|...-..+.-+..|.+.+.+.. ++|-+.+++.-+. T Consensus 145 ~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erL~V~~~l~eaik~aR-lfGGa~~~i~i~gdd~~l~~PL~~~~~~ 223 (695) T protein:vir:78 145 GGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQ-AFGRAHPYFKIKGDDQIMDTPLVPRPYT 223 (695) T ss_pred cccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhc-cccceEEEEEeccCcccccccccccccc Confidence 01223344444344444445555555555 5666665553322 Q ss_pred --CCceEEEEEEcCcceEEeec--------CCCceeEEEEeccCcccceeEEEcccceeeeccccccCc---cccccccc Q lcl|NC_011801. 110 --NGYPVRIEPVPNEKVTVALD--------DYGKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGES---DTQYMGIP 176 (386) Q Consensus 110 --~g~~~~l~~l~~~~v~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~---~~~~~G~s 176 (386) .|..+.|.+|+|..+.+... ..+.+.+|.+. + ..+..+.++.|.-...++. ...+.|+| T Consensus 224 I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~------G--~kIH~SRL~~f~g~plPd~LKp~y~~~GiS 295 (695) T protein:vir:78 224 VPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMI------G--TEVHATRLHTIVSRPVGDMLKPTYSFAGIS 295 (695) T ss_pred ccCcceeeeEeecccccccchhhhccchhhccCCCceEEEe------c--eEEeeeeEEEecCCCchhhhhcccccCccc Confidence 24455688888888877432 12223333331 1 1244454444432221111 22357999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeC-CCCCCHHHHHHHHHHHH--HHhcccccCcceecC-CCceeeec Q lcl|NC_011801. 177 PIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVP-NATLGKEAKENTRQSFE--EQTTGENAGRAVVLD-QSADVETT 252 (386) Q Consensus 177 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~-~~~~~~~~~~~~k~~~~--~~~~~~~~g~~~vl~-~g~~~~~~ 252 (386) ..+.+...+............+.+.-... ++ ++. -+.+.......+..+++ +.+. +|. ++.+++ +..+|++. T Consensus 296 v~q~~~e~V~~~~rT~~~v~~Li~~~~v~-~l-k~dla~~L~~g~~~~l~~R~eli~~~R-sn~-G~~llDk~~Eefeq~ 371 (695) T protein:vir:78 296 MTQLAMPYIDNWLRTRQSVSDIVKQFSVS-GI-LMDLAQALMPGANVDLSMRAELINRYR-DNR-NILFLDKATEEFFQF 371 (695) T ss_pred HHHHHHHHHHHHHHHHhHHHHHHHhhhhH-HH-HHHHHHhhcChhHHHHHHHHHHHHHhc-Ccc-ceEEEecCCcceEEE Confidence 99999999988888877777776553332 21 110 01122222222332232 2232 333 477888 47899999 Q ss_pred cCChhhHHHHHHHHHHHHHHHHHhCCCHHHhc-CCcCcc--cHHHHHHHHH-------HHHHHHHHHHHHHHHHHhhh-- Q lcl|NC_011801. 253 NISPNVTEFLQNVSFSQDQIAKAFGIPADYLS-GKQDAQ--SNITMIRAFY-------QSSLSIYIKPIESELSQKLG-- 320 (386) Q Consensus 253 ~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~-~~~~~~--~~~~~~~~~~-------~~~l~P~~~~ie~~l~~~l~-- 320 (386) +.+...+ -+......+.||.+-+||...|- .+..+- ..+...+.|| +.-|.|.++.+-+.+-+..+ T Consensus 372 stslSGL--ddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~ 449 (695) T protein:vir:78 372 NTPLSGL--DALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGA 449 (695) T ss_pred ecccCCH--HHHHHHHHHHHHhhhcCchhhhhccCCccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 8777666 45577788899999999987763 333332 2233344444 45688988888777766553 Q ss_pred --hhhhhcchhhhccCHHHHHHH-------HHHHHhCCCcCHHHHHHHhccCCcCCCCC-CCccccccccCCCCCC Q lcl|NC_011801. 321 --TDVKLDIASAIDSDNSELINN-------VQKLASAGVLAPIQAQKLLKNRGVFPELD-LDEGTNLLDNTKNIND 386 (386) Q Consensus 321 --~~~~fd~~~~l~~d~~~~~~~-------~~~~~~~g~~t~nE~R~~lg~~p~~p~~~-~~~~~~~~~~~~~~~~ 386 (386) +.+.|.+.++.+++.++++++ ...++..|+++++|+|..|..+|--+..+ .|..++|..+..+-.+ T Consensus 450 idpdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~ 525 (695) T protein:vir:78 450 VDPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDID 525 (695) T ss_pred CCCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhh Confidence 467788889999988887765 45677899999999999987543322221 2223333322222111 No 136 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=99.47 E-value=1.8e-13 Score=90.49 Aligned_cols=359 Identities=13% Similarity=0.047 Sum_probs=176.4 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHH----HHhccHHHHHHHHHHHHhhc-cCceeecch-----h- Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSA----IALKNSDVYAVISRVSSDIA-GCRFVTNAQ-----P- 69 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~----~a~~~~~v~~~v~~ia~~ia-~~p~~~~~~-----~- 69 (386) |||..-..+. ....+-....++.+ ..++.+ .|.++..++.+|+.+++.+- +.|..+... . T Consensus 23 d~l~~~~~gl----g~~r~~~~~~~g~~-----~~~~~~~l~~~Yr~~~ia~~iVd~~~d~~~~~~~~i~~g~~~~~~~~ 93 (449) T protein:vir:10 23 MGLMVPTMGL----DNKRHSAWCEYGFP-----ELVTYENLYSLYRRGGIAHGAVEKLVGKCWQTNPEIIEGDDADDSED 93 (449) T ss_pred HHHHHHHhcC----CcccchhhhhcCCc-----ccCCHHHHHHHHhcCchhHHHHHhhhhhhhhcCcccccCccccchhh Confidence 3333211110 00011111111111 122322 23457778899999998653 222211110 0 Q ss_pred ---HHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEe-ecC---------CCceEEEEEEcCcceEEe-------ec Q lcl|NC_011801. 70 ---ITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIID-RDT---------NGYPVRIEPVPNEKVTVA-------LD 129 (386) Q Consensus 70 ---~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~-~~~---------~g~~~~l~~l~~~~v~~~-------~~ 129 (386) +...+.. -+...-+..+.+..-+ -.++|-|++++. ++. .+.+..|.|+....+++. .. T Consensus 94 ~~~~e~~~~~-l~~~~~~~~l~ea~~~-~rl~Gga~i~i~v~d~~~l~~Pl~~~~~i~~i~v~~~~~i~~~~~~~dp~sp 171 (449) T protein:vir:10 94 ETSWEKKSKQ-VFTNRLWRSFAEADRR-RLVGRYAGILLHIRDEKDWNLPATKGRGLQKVSVSWAGSLKVAEWDTGINSK 171 (449) T ss_pred hHHHHHHHHH-HHHHHHHHHHHHHHHh-hhccCcEEEEEEecCCCCCCcccccCcceeeEEeeccccCChhhhhcCCCCC Confidence 1111110 0000112223323333 346777776663 332 224555666655433332 12 Q ss_pred CCCceeEEEEeccC-cccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHH-HHHHHHHhccCC--- Q lcl|NC_011801. 130 DYGKDLTYTVHFDD-SKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSS-KLAISTLRHAIK--- 204 (386) Q Consensus 130 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~-~~~~~~~~ng~~--- 204 (386) ..+.+..|.+.... +.....+.+.++.++||.. .+.-|.|.+..+...+.....+. .+...+++|..+ T Consensus 172 ~yg~P~~y~v~~~~~g~~~~~~~iH~SRl~~~~~-------~~~~g~~~L~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~ 244 (449) T protein:vir:10 172 TYGQPKLWKYTERLPNGSSRRVDIHPDRVFILGD-------YSEDAIGFLEPAYNAFVSLEKVEGGSGESFLKNAARQLN 244 (449) T ss_pred CCCCceEEEEeeeccCCCccceeeccceeEeecC-------CCCCChhHHHHHHHHhhhHHHhhhhHHHHHHHHHHHHHh Confidence 23445555544321 1122345688888887731 12347788888776554333322 222233333211 Q ss_pred --------CceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHh Q lcl|NC_011801. 205 --------PSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAF 276 (386) Q Consensus 205 --------~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~ 276 (386) ..++....+ .-.++..+++.++.+....+.+ .+.++.+.+|+.++.++.+. .+..+....+||++- T Consensus 245 ~~~~~~~~~~~l~~~~~-~~~e~~~~~~~~~~~~~~~~~~---~~~i~~~~d~~~~~~~~sgl--~d~l~~~~q~iaaa~ 318 (449) T protein:vir:10 245 VNFEKEIDFTNLASLYG-VSIDELQDKFNEVAGEINRGND---VLMTTQGATVTPLVTSVADP--TATYNVNLQTAAAGV 318 (449) T ss_pred hhhhhhhhhhhhhHHhh-CCchHHHHHHHHHHHHHhccch---heeecCCcceEEEecccCCh--hHHHHHHHHHHHHHh Confidence 111111111 1123334445444444333322 45566777899988887766 456777888899999 Q ss_pred CCCHHHhcCCcCc-ccHHHHHHHHHH------HHHHHHHHHHHHHHHHh-hh---hhhhhcchhhhccCHHHHHH----- Q lcl|NC_011801. 277 GIPADYLSGKQDA-QSNITMIRAFYQ------SSLSIYIKPIESELSQK-LG---TDVKLDIASAIDSDNSELIN----- 340 (386) Q Consensus 277 gvp~~~l~~~~~~-~~~~~~~~~~~~------~~l~P~~~~ie~~l~~~-l~---~~~~fd~~~~l~~d~~~~~~----- 340 (386) +||...|-+...+ -++.+..+.||. .-|.|.++.+-+.|-+. ++ ..+.|.+.++.+++.+++++ T Consensus 319 ~IP~t~L~Gqsp~glnst~D~~nyyd~i~~~Q~~l~p~le~l~~~l~~s~~g~~~~d~~i~f~pL~~~t~kEkAei~k~~ 398 (449) T protein:vir:10 319 DIPTRILIGNQQAERSSTEDQKYFNARCQSRRVDLSFEIEDFCDKLIELKIIDAVAKKAVIWDDLNEQTGTEKLTNAKTM 398 (449) T ss_pred CCCeeeeeccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCceeEEeCCCCCCCHHHHHHHHHHH Confidence 9998877443322 222222333333 23777777777666443 22 35788889999999888765 Q ss_pred --HHHHHHhCC---CcCHHHHHHHhccCCcCCCCCCCccccccccCCCCCC Q lcl|NC_011801. 341 --NVQKLASAG---VLAPIQAQKLLKNRGVFPELDLDEGTNLLDNTKNIND 386 (386) Q Consensus 341 --~~~~~~~~g---~~t~nE~R~~lg~~p~~p~~~~~~~~~~~~~~~~~~~ 386 (386) ++++++++| +++++|+|+.+|++|..+.+...+-++ ....+.| T Consensus 399 A~a~~~~~~ag~~~~~~~~EiR~~~~~~~~~~~~~~~e~~d---e~~~~~d 446 (449) T protein:vir:10 399 GEINQTMLGSGDNPAFSREEIRTAAGYDNDDEEPLGEEDGD---EEDKATD 446 (449) T ss_pred HHHHHHHHHccccCCcCHHHHHHHhcccCCCCCCCCCCCCc---cccccCC Confidence 445666666 889999999999887543322222111 1122223 No 137 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=99.46 E-value=6.5e-14 Score=92.89 Aligned_cols=369 Identities=14% Similarity=0.057 Sum_probs=193.3 Q ss_pred Cchhhhhcccccc-C-----CccchhhhhhcccccccCcc-------cccHHHHhccHHHHHHHHHHHHhhccCcee-ec Q lcl|NC_011801. 1 MAFLSNLFKRQKM-L-----SGSSPVWILNQGQPVSIKPK-------AITSAIALKNSDVYAVISRVSSDIAGCRFV-TN 66 (386) Q Consensus 1 Mg~~~~l~~~~~~-~-----~~~~~~~~~~~~~~~~~~~~-------~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~-~~ 66 (386) .++ .+.|.-... . ....-...++......++.. +-.....-+.|.++.|+..|++.+..- |. +. T Consensus 67 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~-w~~~~ 144 (695) T protein:vir:36 67 LRL-ARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRT-WGEAI 144 (695) T ss_pred ccc-ceeceecccccCccccchhhhhhcccccccccchhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhcc-cceec Confidence 222 222221100 0 00000000111111111110 111223346788899999999977533 31 10 Q ss_pred ----------------------chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecC--------------- Q lcl|NC_011801. 67 ----------------------AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDT--------------- 109 (386) Q Consensus 67 ----------------------~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~--------------- 109 (386) +.+-.++|...-..+.-+..|.+.+.+.. ++|-+.+++.-+. T Consensus 145 ~~~~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~erL~V~~~l~eaik~aR-lfGGa~~~i~i~gdd~~l~~PL~~~~~~ 223 (695) T protein:vir:36 145 GGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQ-AFGRAHPYFKIKGDDQIMDTPLVPRPYT 223 (695) T ss_pred ccchhhhhhccccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHhhc-cccceEEEEEeccCcccccccccccccc Confidence 01223344333333333444454444444 6776665553322 Q ss_pred --CCceEEEEEEcCcceEEeec--------CCCceeEEEEeccCcccceeEEEcccceeeeccccccCc---cccccccc Q lcl|NC_011801. 110 --NGYPVRIEPVPNEKVTVALD--------DYGKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGES---DTQYMGIP 176 (386) Q Consensus 110 --~g~~~~l~~l~~~~v~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~---~~~~~G~s 176 (386) .|..+.|.+|+|..+.+... ..+.+.+|.+. + ..+..+.++.|.-...++. ...+.|+| T Consensus 224 I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~------G--~kIH~SRL~~f~g~plPd~LKp~y~~~GiS 295 (695) T protein:vir:36 224 VPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMI------G--TEVHATRLHTIVSRPVGDMLKPTYSFAGIS 295 (695) T ss_pred ccCcceeeeEeecccccccchhhhccchhhccCCCceEEEe------c--eEEeeeeEEEecCCCchhhhhcccccCccc Confidence 24455688888888877432 12223333331 1 1244454444432221111 22357999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeC-CCCCCHHHHHHHHHHHH--HHhcccccCcceecC-CCceeeec Q lcl|NC_011801. 177 PIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVP-NATLGKEAKENTRQSFE--EQTTGENAGRAVVLD-QSADVETT 252 (386) Q Consensus 177 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~-~~~~~~~~~~~~k~~~~--~~~~~~~~g~~~vl~-~g~~~~~~ 252 (386) ..+.+...+............+.+.-... ++ ++. -+.+.......+..+++ +.+. +|. ++.+++ +..+|++. T Consensus 296 v~q~~~e~V~~~~rT~~~v~~Li~~~~v~-~l-k~dla~aL~~g~~~~l~~R~eli~~~R-sn~-G~~llDk~~Eefeq~ 371 (695) T protein:vir:36 296 MTQLAMPYIDNWLRTRQSVSDIVKQFSVS-GI-LMDLAQALMPGANVDLSMRAELINRYR-DNR-NILFLDKATEEFFQF 371 (695) T ss_pred HHHHHHHHHHHHHHHHhHHHHHHHhhhHH-HH-HHHHHHhhcChhHHHHHHHHHHHHHhc-Ccc-ceEEEecCCcceEEE Confidence 99999999888887777777776543322 21 110 01111112222222232 2232 333 477888 47899999 Q ss_pred cCChhhHHHHHHHHHHHHHHHHHhCCCHHHhc-CCcCcc--cHHHHHHHHH-------HHHHHHHHHHHHHHHHHhhh-- Q lcl|NC_011801. 253 NISPNVTEFLQNVSFSQDQIAKAFGIPADYLS-GKQDAQ--SNITMIRAFY-------QSSLSIYIKPIESELSQKLG-- 320 (386) Q Consensus 253 ~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~-~~~~~~--~~~~~~~~~~-------~~~l~P~~~~ie~~l~~~l~-- 320 (386) +.+...+ -+......+.||.+-+||...|- .+..+- ..+...+.|| +.-|.|.++.+-+.+-+..+ T Consensus 372 stslSGL--ddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~ 449 (695) T protein:vir:36 372 NTPLSGL--DALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGA 449 (695) T ss_pred ecccCCH--HHHHHHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 8777666 45577788899999999987763 333332 2233344444 45688988888777766553 Q ss_pred --hhhhhcchhhhccCHHHHHHH-------HHHHHhCCCcCHHHHHHHhccCCcCCCCC-CCccccccccCCCCCC Q lcl|NC_011801. 321 --TDVKLDIASAIDSDNSELINN-------VQKLASAGVLAPIQAQKLLKNRGVFPELD-LDEGTNLLDNTKNIND 386 (386) Q Consensus 321 --~~~~fd~~~~l~~d~~~~~~~-------~~~~~~~g~~t~nE~R~~lg~~p~~p~~~-~~~~~~~~~~~~~~~~ 386 (386) +.+.|.+.++.+++.++++++ ...++..|+++++|+|..|..+|--+..+ .|..++|..+..+-.+ T Consensus 450 idpdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~ 525 (695) T protein:vir:36 450 VDPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDID 525 (695) T ss_pred CCCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhh Confidence 467788889999988887765 45677899999999999987544322222 2233333322222222 No 138 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=99.34 E-value=4.8e-11 Score=77.17 Aligned_cols=373 Identities=10% Similarity=0.056 Sum_probs=191.7 Q ss_pred CchhhhhccccccCCccchhhhhhccc------ccc----------c-CcccccHHHHhccHHHHHHHHHHHHhhccCce Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQ------PVS----------I-KPKAITSAIALKNSDVYAVISRVSSDIAGCRF 63 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~------~~~----------~-~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~ 63 (386) |.= +....++-.|......+. ... + ..-.+. +..++.+.|.+|++.+...|.+++| T Consensus 1 ~~~------~~~~~~gl~p~rl~~i~~~~~~~~~~~~~~~~~~~Lr~~~~~~ly-~~m~~D~hi~s~l~~Rk~av~~~~w 73 (488) T protein:vir:95 1 MAD------ITETQESLPPFRMGEVGSLGLKVKNGRIYEEPRQALRFPESIKTF-QLMMRDPAVAASVNIIKMFVRKVNW 73 (488) T ss_pred CCC------ccccCCCCCHHHHHHHHHHhhccccchhhccchhhhcccchHHHH-HHHhhChHHHHHHHHHHHHHhcCCc Confidence 221 111222222221111110 000 0 011122 3345789999999999999999999 Q ss_pred eecch------h----HHHHHhcc-CcccCCHHHHHHHHHHHHHHhCCeEEEEeecCC-------------Cc--eEEEE Q lcl|NC_011801. 64 VTNAQ------P----ITDVLNAP-LGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTN-------------GY--PVRIE 117 (386) Q Consensus 64 ~~~~~------~----~~~~l~~~-PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~-------------g~--~~~l~ 117 (386) .+... . .+..+... -|-..++.+++..+. +.+++|-+..++++... |. +..|. T Consensus 74 ~v~p~~~~~~d~~~~~~a~~v~~~l~~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~~~~~~~~~~~~dg~~~~~~i~ 152 (488) T protein:vir:95 74 RFVPPKGKEQDPKMLERADFFNSLMDDMEHDWADFINSVM-SFCTYGFCVNEKVYKKRQGKKGKYQSKFDDGLIGWAKLP 152 (488) T ss_pred eEecCCCCchhHHHHHHHHHHHHHHhccCccHHHHHHHHH-HhhcccceeeeeeeeccccccccccccccCCeeeeeeee Confidence 98631 1 22333221 122345667776665 57789999999988532 22 33455 Q ss_pred EEcCcce-EEeecCCCceeEEE-EeccC----------cccceeEEEcccceeeeccccccCcccccccccHHHHHHHHH Q lcl|NC_011801. 118 PVPNEKV-TVALDDYGKDLTYT-VHFDD----------SKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEI 185 (386) Q Consensus 118 ~l~~~~v-~~~~~~~~~~~~~~-~~~~~----------~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i 185 (386) +.++.+. .+..+.++...... ..... ........+|....++.++.. ..+..+|.+.+..+.... T Consensus 153 ~Rpq~~~~~f~~d~d~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~kfi~~~~~~---~~g~p~g~gLlr~~~w~~ 229 (488) T protein:vir:95 153 IRNQSTLDKWYFDEDFRRVTGVRQNLRNVSHIAGAINLGERPLTRKLPRAKFMLFKYDD---EYGNPEGRSPLLNAYVPW 229 (488) T ss_pred ecCcccccceeeccCCCceeecccccccccccccccccccccccccccccceEEEeecC---CCCccchhhHHHHHHHHH Confidence 5554321 12222222221111 00000 001122346666554444332 234468999999999999 Q ss_pred HHHHHHHHHHHHHHhccCCCceEEeeCCC---CCCHHHHHHHHHHHHHH----hcccccCcceecCCCceee-------- Q lcl|NC_011801. 186 EVQDLSSKLAISTLRHAIKPSIFIKVPNA---TLGKEAKENTRQSFEEQ----TTGENAGRAVVLDQSADVE-------- 250 (386) Q Consensus 186 ~~~~~~~~~~~~~~~ng~~~~~~l~~~~~---~~~~~~~~~~k~~~~~~----~~~~~~g~~~vl~~g~~~~-------- 250 (386) ..-....++...|.+..+.|--+.+.+.. ..++++.+.+.+.+.+. ..+..+| ++++.|+++. T Consensus 230 ~fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~~~i~~~~~~~~~ag--~iiP~g~~~~~k~~~~e~ 307 (488) T protein:vir:95 230 KYKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYCKTVVNDMIANDRAG--LIWPRYIDPDTKEDIFEF 307 (488) T ss_pred HHHHHHHHHHHHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHHHHHHHHhhccchhh--eeeccccccccchhhhhh Confidence 99999999999999875544444444321 12344444444444433 2233333 4555555432 Q ss_pred -eccCC-hhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHH-HHHHHHHHHHHHHHHHHHHHHHHHhhhhh-hh-- Q lcl|NC_011801. 251 -TTNIS-PNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNI-TMIRAFYQSSLSIYIKPIESELSQKLGTD-VK-- 324 (386) Q Consensus 251 -~~~~~-~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~-~~~~~~~~~~l~P~~~~ie~~l~~~l~~~-~~-- 324 (386) .++.+ .....+.+..++.-++|..+.--.---.+....++++. +.......+.+.-.+++|++.||+.|... +. T Consensus 308 ~l~~~~~~~~~~~~~li~~~d~~Isk~iLGqtLT~~~~~~Gs~Al~~vh~ev~~~i~~aDa~~i~~tln~~li~~l~~~N 387 (488) T protein:vir:95 308 SLVSRQGAKAYDTGSIIDRYSKQIMMAFMSDVLAMGQSKYGSFSLADSKTSLLAMSVDILLKQIKNVINRDLVAQTYALN 387 (488) T ss_pred hccccccCCchhHHHHHHHHHHHHHHHHhccccccccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 22222 22334667777777888776532211111111233432 33455667778889999999999876432 22 Q ss_pred ---------hcchhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhccCCcCCCCCCCccccccccCCCCCC Q lcl|NC_011801. 325 ---------LDIASAIDSDNSELINNVQKLASAGVLAP-----IQAQKLLKNRGVFPELDLDEGTNLLDNTKNIND 386 (386) Q Consensus 325 ---------fd~~~~l~~d~~~~~~~~~~~~~~g~~t~-----nE~R~~lg~~p~~p~~~~~~~~~~~~~~~~~~~ 386 (386) |-++..-..|.+++++++.+++..|+.-+ +.+|+.+|..+...+.+....-.+-..+..... T Consensus 388 fg~~~~~P~~~~~~~e~~Dl~~~ae~~~~L~~~G~~i~~~~~~~~i~e~~gip~~~~~e~~~~~~~~~~~~~~~~~ 463 (488) T protein:vir:95 388 MWDDEEHVQITYDDIETPDLEAIGSYIQKTVAVGALEVDKELSNKLREHIGLPPADESQPVSEKLSPNSQSRSGDG 463 (488) T ss_pred CCCCCCccEEEecCcChhhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCccccccCCCCCCCCCCcc Confidence 22333345678889999999999998754 568999985432222211111000000011100 No 139 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=99.27 E-value=5.6e-11 Score=76.80 Aligned_cols=275 Identities=12% Similarity=0.075 Sum_probs=160.8 Q ss_pred EEEEeecCCC---ceEEEEEEcCcceE-EeecCCCceeEEEEeccCcccceeEEEcccceeeeccccccCcccccccccH Q lcl|NC_011801. 102 FAIIDRDTNG---YPVRIEPVPNEKVT-VALDDYGKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPP 177 (386) Q Consensus 102 ~~~~~~~~~g---~~~~l~~l~~~~v~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~ 177 (386) +.++++...+ .+..|.+.|+.++. ...+.++...........+. ....+|....++.++.. ..+..+|.|. T Consensus 1 v~Eivw~~~~g~~~~~~l~~r~~~~~~~f~~~~~~~l~~~~~~~~~g~--~~~~lp~~kfi~~~~~~---~~g~p~G~gL 75 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLAWRPPRTISRFDVAPDGGLVAIEQWGVFGK--ATVRIPVDRLVVFVNER---EGANWLGQSL 75 (355) T ss_pred CeEEEEEeeCCeEEEeeeeecCccceeeeeeccCCceeEEEecCCCCC--CcceeccCCEEEEEeCC---CCCCccchhh Confidence 6777775543 25567888887554 44455554444333222211 22457777666555432 2334689999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCC-CC-----------CHHHHHHHHHHHHHHhcccccCcceecCC Q lcl|NC_011801. 178 IDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNA-TL-----------GKEAKENTRQSFEEQTTGENAGRAVVLDQ 245 (386) Q Consensus 178 ~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~-~~-----------~~~~~~~~k~~~~~~~~~~~~g~~~vl~~ 245 (386) +..+......-....++...|.+.-+.|--+.+.+.. .. +.+..+.+.+.......|..+ .++++. T Consensus 76 lr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~~~~~i~~g~~a--~~iip~ 153 (355) T protein:vir:78 76 LRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGLQLAKEFRAGEAA--GGYIPH 153 (355) T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHHHHHHHhhCCcce--eEeecC Confidence 9999999999999999999999876444344444321 11 122233344434433344433 578888 Q ss_pred CceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC-cccH-HHHHHHHHHHHHHHHHHHHHHHHHHhhhhh- Q lcl|NC_011801. 246 SADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD-AQSN-ITMIRAFYQSSLSIYIKPIESELSQKLGTD- 322 (386) Q Consensus 246 g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~-~~~~-~~~~~~~~~~~l~P~~~~ie~~l~~~l~~~- 322 (386) |++++-+..+....++.+..++.-++|+.+.--.---.+..++ ++++ .+.......+.+.-.++.|++.||+.|... T Consensus 154 g~~ie~~ea~g~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~~gGS~Alg~vh~~v~~~~~~aD~~~i~~~ln~~li~~l 233 (355) T protein:vir:78 154 GANFTLTGVQGKLPEMDGPIRYHDEQIARAVLAHFLTLGGDKSTGSYALGDTFASFFTGSLNAVMKHIADVTQQHVVEDL 233 (355) T ss_pred CceEEEeecCCCcccHHHHHHHHHHHHHHHHhhhhhccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 8888887766566667888888889998887543211222222 3444 233456667788889999999999876432 Q ss_pred hhhcchh----------hhccCHHHHHHHHHHHHhCCCcCHH-----HHHHHhccCCcCCCCCCCccccccccCCCCCC Q lcl|NC_011801. 323 VKLDIAS----------AIDSDNSELINNVQKLASAGVLAPI-----QAQKLLKNRGVFPELDLDEGTNLLDNTKNIND 386 (386) Q Consensus 323 ~~fd~~~----------~l~~d~~~~~~~~~~~~~~g~~t~n-----E~R~~lg~~p~~p~~~~~~~~~~~~~~~~~~~ 386 (386) +.+|+.. ....+.+++++.+.+++..|+.-++ .+|+.+|. |.+..+ .++ ..+..++..... T Consensus 234 ~~lN~~~~~~~P~~~~~~~~~~~~~~a~~~~~l~~~G~~~~~~~~~~~~~e~~gi-p~p~~~-~~~-~~~~~~~~~~~~ 309 (355) T protein:vir:78 234 VDQNWGPEEPAPRLVPAQLGKEQPVTAEAIRALVECGAFTADPELEKDLRARYGL-PAPAER-DDG-ADAAAAKAAGRR 309 (355) T ss_pred HHhcCCCCCCCCEEEecCcChhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCC-CCCCCC-Ccc-cCCccccccccc Confidence 2222211 1234556788999999999987654 47999985 322111 111 111111110100 No 140 >protein:vir:106491 Length: 646 # NCBI annotation: Pas4 # Family: family:all:2798 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024790;genbank:gi:48697405;genbank:GeneID:2846148 Probab=98.91 E-value=1.1e-09 Score=69.68 Aligned_cols=371 Identities=11% Similarity=0.072 Sum_probs=195.2 Q ss_pred Cchhhhhcccccc-C-Cccchhhhh--------hccccccc---Cc----ccccHHHHhccHHHHHHHHHHHHhhccCce Q lcl|NC_011801. 1 MAFLSNLFKRQKM-L-SGSSPVWIL--------NQGQPVSI---KP----KAITSAIALKNSDVYAVISRVSSDIAGCRF 63 (386) Q Consensus 1 Mg~~~~l~~~~~~-~-~~~~~~~~~--------~~~~~~~~---~~----~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~ 63 (386) |.+++ ..+.+.. . ....+-... ........ ++ +.--.+-+-..|.++..+..|++.+|++.+ T Consensus 1 ~~~~r-Pk~~p~~p~~~~~arrr~LtaAsa~l~~~~~~~~kt~~~~~~~WQ~eAW~~~d~vpELry~vgW~~~a~SR~rL 79 (646) T protein:vir:10 1 MALLK-PKSAPPEPFGAEVARRIALAGATAQVDLGASSSWKTWKFGNKDWQTEGWRLYDIIPEHHFLAGRIGDSVAQARL 79 (646) T ss_pred CcccC-CCCCCCCcccccccchhhhhhccccccCCCcceeecCCCcchhhhHHHHHHHhhhhhHhhHhhhhhhhhceeee Confidence 88774 1111000 0 000000000 00000000 11 111111122357888889999999999988 Q ss_pred eec-------------chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEE----eecCCCceEEEEEEcCcceEE Q lcl|NC_011801. 64 VTN-------------AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAII----DRDTNGYPVRIEPVPNEKVTV 126 (386) Q Consensus 64 ~~~-------------~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~----~~~~~g~~~~l~~l~~~~v~~ 126 (386) .+- ++....+-...-..-....++++.+..++-.-|++|+.. ....++ --..+++-.+.|.. T Consensus 80 ~aseiddtG~~tg~v~~~~v~~iv~~~~Gg~~gQ~qlLkr~~~~ltV~GE~wiv~~~~~~~~~~~-~~~W~vvt~~Ev~~ 158 (646) T protein:vir:10 80 YVTEVDDTGEETGEVQDERIKRLAAVPLGTGSQRDDNLRLAGLDLAVGGECWIVGEGAATSPEAA-EGSWFVVTGSAISR 158 (646) T ss_pred eeeeecCCCCCcCccchHHHHHHhhhhccchhhHHHHHHHHHhheecccceEEeeccccCCCCCC-ccceeeecHHHhcc Confidence 542 223333433333344445788999999999999999974 111222 12234444555522 Q ss_pred eecCCCceeEEEEeccC-cccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCC Q lcl|NC_011801. 127 ALDDYGKDLTYTVHFDD-SKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKP 205 (386) Q Consensus 127 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~ 205 (386) .++ .+.+.... .+.+..+.++..++ .||.++ +.+....+--||+.++...+.-..-..+...+..+.-... T Consensus 159 --tg~----~~~i~~p~~~~g~~~v~~~~~d~-lvRiW~-P~Prr~~epDSpvra~l~~l~Ei~~lt~~I~aaakSRL~G 230 (646) T protein:vir:10 159 --TGD----EIAVRRPQQRGGSKLVLVDGQDI-LIRCWR-PHPNDTDQADSFTRSAIVPLREIELLTKREFAELDSRLTG 230 (646) T ss_pred --CCC----eeeeecCccCCCCCcceecCCce-EEEEec-CCcccccCCcchhHHHHHHHHHHHHhhhHhHHHHHHHHhc Confidence 111 11111111 11334455667776 467665 4566778899999999999998888888888777766666 Q ss_pred ceEEeeCCCC------CCHHHHHHHHHHHHH----Hhcc--cccC-cceecCC-Cc------eeeeccCC-hhhHHHHHH Q lcl|NC_011801. 206 SIFIKVPNAT------LGKEAKENTRQSFEE----QTTG--ENAG-RAVVLDQ-SA------DVETTNIS-PNVTEFLQN 264 (386) Q Consensus 206 ~~~l~~~~~~------~~~~~~~~~k~~~~~----~~~~--~~~g-~~~vl~~-g~------~~~~~~~~-~~d~~~~e~ 264 (386) .|++.+|... -++.....+...|-+ .+.. +.+- -++++.. |. +++.++.. .-+.--+++ T Consensus 231 nGvLfvP~e~s~p~~~~~~a~~~~l~~~l~qaa~tAi~De~S~aA~vPiia~~P~E~i~~~~~ik~l~f~~eite~aikt 310 (646) T protein:vir:10 231 AGIMFLPEGVDFPRGEEDPAGLAGFMAYLQRAAAASMADQSRASAMVPIMATIPNEMMEHLDKIKPLTFWSELSAEITPM 310 (646) T ss_pred CceeeeccccccCCCCCCCcchhHHHHHHHHHHHhhhcCCCCccceeeeEEeeChHHHhhhhcceeeccCchhhHHHhhh Confidence 7777776422 122333344444433 3322 2222 2333322 11 33333332 222334789 Q ss_pred HHHHHHHHHHHhCCCHHHhcCCcCcccHH--HHHHHHHHHHHHHHHHHHHHHHHHhhhhh--------------hhhcch Q lcl|NC_011801. 265 VSFSQDQIAKAFGIPADYLSGKQDAQSNI--TMIRAFYQSSLSIYIKPIESELSQKLGTD--------------VKLDIA 328 (386) Q Consensus 265 ~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~--~~~~~~~~~~l~P~~~~ie~~l~~~l~~~--------------~~fd~~ 328 (386) ++..+.+||...-|||+.|-+.++++--. +-...-++ -|.|.+..|++++++.++.. +=||.+ T Consensus 311 R~daI~RlA~glDIppE~LLGlgd~NHWtAWqI~de~vr-HI~P~l~~ic~AlT~~~Lrp~Le~eGi~dp~kyvvW~DaS 389 (646) T protein:vir:10 311 KDKAIARLASSAEIPGEVLTGIGDANHWTAWLISDEGIR-WIRGYLGLIADALTRGFLRRALESMGVTNPERYAFAFDTS 389 (646) T ss_pred HHHHHHHHHhccCCchhheeeccccceeeeeeeccccch-hhhhHHHHHHHHHHhhHHHHHHHHcCCCChhHeEEeecCc Confidence 99999999999999999985555432111 01111223 49999999999999876421 125655 Q ss_pred hhh-ccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCcccccc----------------------------- Q lcl|NC_011801. 329 SAI-DSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTNLL----------------------------- 378 (386) Q Consensus 329 ~~l-~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~~~----------------------------- 378 (386) .+. +.|. .+-+..++..|.+|-...|+.+|.. ...+.+..|+-.-+ T Consensus 390 ~Lt~~pd~---~deA~qa~drGAIt~eAlrk~~Gf~-~dd~pt~~E~~~~~~~~~v~~~P~Lil~P~~qa~~~~P~~~~~ 465 (646) T protein:vir:10 390 TLASKPNR---LDEAIQLHERNLIKDEEVVKAGAFS-VDQMPTVQERAVQILLGLVKTQPDLILDPAIQAALGLPAVQSV 465 (646) T ss_pred ccccCCCC---cHHHHHHHHcCCccHHHHHHHhccc-ccccCChHHHHHHHHHHHhcCCccccccchhhccccCCCcCcc Confidence 543 2232 2333457889999999999998842 11111111111000 Q ss_pred -------ccCCCCCC Q lcl|NC_011801. 379 -------DNTKNIND 386 (386) Q Consensus 379 -------~~~~~~~~ 386 (386) +.+.++.| T Consensus 466 ~lpp~~~~~~dg~~~ 480 (646) T protein:vir:10 466 GLPPTAAQRTDGDLD 480 (646) T ss_pred ccCCcccccccCCCC Confidence 00001111 No 141 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=98.89 E-value=2e-08 Score=62.82 Aligned_cols=342 Identities=11% Similarity=0.058 Sum_probs=158.2 Q ss_pred CchhhhhccccccCCccchh-hhhhcc-cccccCcccccHHHH----hccHHHHHHHHHHHHhhccCceeecchhHHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPV-WILNQG-QPVSIKPKAITSAIA----LKNSDVYAVISRVSSDIAGCRFVTNAQPITDVL 74 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~-~~~~~~-~~~~~~~~~i~~~~a----~~~~~v~~~v~~ia~~ia~~p~~~~~~~~~~~l 74 (386) .-++.+|.++.......-.. ...-.+ ......+..+-.+.. +...-..-+|+-+|+.+.=-.+...+..+..++ T Consensus 3 ~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~~~d~~l~~i~ 82 (409) T protein:vir:94 3 EKGIGYLRFKLSVHKRRAEMRYDQYAMKYVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFREFENDDFTVNEIF 82 (409) T ss_pred HHHHHHHHHHHHHHhHHHHHHHHHhcccCchhhcChhhhHHHHHHHhhhcchhHHHHHHhHhhcccCcccCCchHHHHHH Confidence 22233332221111000000 000000 011011111111110 011122234444444332223444444455555 Q ss_pred hccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCcc-cc---eeE Q lcl|NC_011801. 75 NAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDSK-RS---GDF 150 (386) Q Consensus 75 ~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~-~~---~~~ 150 (386) .. |.. ......+..+.+.+|.||+.+..+.+|.+ .+.+++|..+.+..|.....+.+.+.+.... .+ ... T Consensus 83 ~~--N~l---d~~~~~~~~~aliyG~sf~~v~~~~dg~~-~i~~~sp~~~~~i~D~~~~~~~~a~~~~~~d~~~~~~~~~ 156 (409) T protein:vir:94 83 EE--NNP---DIFFDSAVLSSLIASCSFTYISKGENDAV-RLQVIEAVNATGIIDPITGLLTEGYAVLERDENNNVVLEA 156 (409) T ss_pred Hh--cCh---hHHHHHHHHHHHHhcceeEEEecCCCCce-EEEEeccceEEEEEecCCCceeeeEEEEEecCCCceEEEE Confidence 33 333 23455777888999999999999999876 6788999998888776544333222211100 00 001 Q ss_pred EEcccc----------------------eeeeccccccCcccccccccHH----HHHHHHHHHHHHHHHHHHHHHhccCC Q lcl|NC_011801. 151 LYDSSE----------------------VIHFRCTVSGESDTQYMGIPPI----DSLLNEIEVQDLSSKLAISTLRHAIK 204 (386) Q Consensus 151 ~~~~~~----------------------vih~~~~~~~~~~~~~~G~s~~----~~~~~~i~~~~~~~~~~~~~~~ng~~ 204 (386) .+.++. |++|. +... .++.+|.|.+ ..+...+.....-......|+ +. T Consensus 157 ~~~~~~~~~~~~~~~~~~~~~n~~g~vPvV~f~--n~~~-~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~---a~ 230 (409) T protein:vir:94 157 HFLPDRTDYYYRDSRNNISIANPTGHPLLVPII--HRPD-AVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFY---SF 230 (409) T ss_pred EEecCcEEEEEecCceeEeeeCCCCCcceEEec--cccc-cccccCccccchhHHHHHHHHHHHHHHHHHHHHHh---cC Confidence 112222 33333 2222 2446787755 333444443333333344444 44 Q ss_pred CceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecC-----CCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_011801. 205 PSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLD-----QSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIP 279 (386) Q Consensus 205 ~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~-----~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp 279 (386) |..++..-+ .+.+..+.++..... ++.++ ++.++.++....- ..+++.++..+.++|..=++| T Consensus 231 pqr~i~G~d--~d~~~~~~~~~~~~~---------i~~~~~d~dg~~~~v~q~~~~~l-~~~~~~l~~~~~~~a~~t~lP 298 (409) T protein:vir:94 231 PQKYVTGLS--DDAEPMETWKATVSS---------MLQFTKDEDGDKPTLGQFTQPSM-SPFTEQLRTAAAGFAGETGLT 298 (409) T ss_pred hhheeEecC--CCCcccchhhhhHHH---------hhcCCCCCCCCCceEEecCCCCh-hHHHHHHHHHHHHHhhhcCCC Confidence 555554211 122333334433333 33332 2345655543222 248899999999999999999 Q ss_pred HHHhcCCcCcccHHHHHHHH---HHHHHHHHHHHHHHHHHHhh----------------hhhhhhcchhhhccCH---HH Q lcl|NC_011801. 280 ADYLSGKQDAQSNITMIRAF---YQSSLSIYIKPIESELSQKL----------------GTDVKLDIASAIDSDN---SE 337 (386) Q Consensus 280 ~~~l~~~~~~~~~~~~~~~~---~~~~l~P~~~~ie~~l~~~l----------------~~~~~fd~~~~l~~d~---~~ 337 (386) +..+|.......+.++.++- +.....-..+.|.+.+.+.+ +...++.+.+....+. .+ T Consensus 299 ~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~~~~~~~~a~ 378 (409) T protein:vir:94 299 LDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDAPYLREQFRKTKPKWEPLFEADASMLSL 378 (409) T ss_pred HHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccccccceEEeccCCCcchHHHHH Confidence 99998654321122222111 11111111222222222211 0122333444444454 44 Q ss_pred HHHHHHHHHhCC--CcCHHHHHHHhccCCcC Q lcl|NC_011801. 338 LINNVQKLASAG--VLAPIQAQKLLKNRGVF 366 (386) Q Consensus 338 ~~~~~~~~~~~g--~~t~nE~R~~lg~~p~~ 366 (386) .++++.|++++| ++..+-+++++|..+.+ T Consensus 379 ~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~~d 409 (409) T protein:vir:94 379 IGDGAIKLNQAIPEFINKDTIRDLTGIEGGE 409 (409) T ss_pred HHHHHHHHHHhcccccchhHHHHHcCCCCCC Confidence 567889999988 66778999999975432 No 142 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=98.83 E-value=3.3e-08 Score=61.63 Aligned_cols=374 Identities=11% Similarity=0.039 Sum_probs=154.1 Q ss_pred CchhhhhccccccCCccchhhhhhccc---ccccCcccccHHHH---hccHHHHHHHHHHHHhhccCceeecch-----h Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQ---PVSIKPKAITSAIA---LKNSDVYAVISRVSSDIAGCRFVTNAQ-----P 69 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~---~~~~~~~~i~~~~a---~~~~~v~~~v~~ia~~ia~~p~~~~~~-----~ 69 (386) +-++.+|.++.......-.. ....+. ....-+..+..+.. ..+....-+|+.+++.+---.+.+.++ . T Consensus 23 ~~~i~~L~~~~~~~~~r~~~-l~~YY~G~~~i~~~~~~~p~~~~~~~~v~n~~~~iVd~~a~rl~~~Gf~~~d~~~~~~~ 101 (504) T protein:vir:99 23 VDKVNGLYQQLVDRTPRNLL-RASFYDGKYAIRQIGNLIPPEYLRTATVLGWSAKAVDTLARRCNLESFVWPDGDYGSIG 101 (504) T ss_pred HHHHHHHHHHHHHHhHHHHH-HHHHHhccccchhccccccHHHHHHhhccCcHHHHHHHHHhhhccceeeCCCCChhhHH Confidence 32333222111100000000 000000 00001111111110 011112234555555433334444332 2 Q ss_pred HHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceE-EEEEEcCcceEEeecCCCceeEEEEe----ccCc Q lcl|NC_011801. 70 ITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPV-RIEPVPNEKVTVALDDYGKDLTYTVH----FDDS 144 (386) Q Consensus 70 ~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~-~l~~l~~~~v~~~~~~~~~~~~~~~~----~~~~ 144 (386) +.+++.. |... +....+..+.+.+|.||+.+..+.+|.+. .+.+++|..+.+..|.......+.+. ..++ T Consensus 102 l~~i~~~--N~ld---~~~~~~~~~a~iyG~af~~v~~~~d~~~~~~I~~~sP~~~~~iyD~~~~~~~~a~~~~~~d~~g 176 (504) T protein:vir:99 102 GPDVWDE--NFFA---TKANNAMVSSLIHGPAFLINTEGGAGEPDSLIHVKSAMQATGEWNSRRNAMDSLLSITSRDAEG 176 (504) T ss_pred HHHHHHh--cChh---hHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceeEEEEeCCCCceeEEEEEEEecCCC Confidence 3333322 4332 45668888999999999999998888764 67789999998877654433221111 1111 Q ss_pred ccceeEEEcccceeeeccccc---------------------cCcccccccccHHH----HHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 145 KRSGDFLYDSSEVIHFRCTVS---------------------GESDTQYMGIPPID----SLLNEIEVQDLSSKLAISTL 199 (386) Q Consensus 145 ~~~~~~~~~~~~vih~~~~~~---------------------~~~~~~~~G~s~~~----~~~~~i~~~~~~~~~~~~~~ 199 (386) .......+.+..++.++.... ....+..+|.|.+. .+...+.....-......+| T Consensus 177 ~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~gvPvV~~~n~~~~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~ 256 (504) T protein:vir:99 177 HPTGIALYEDGVTVTADMDDDGDWHADVRTHKLGVPVEVLPYKPREDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVY 256 (504) T ss_pred eEEEEEEEcCCcEEEEEEcCCceeeeccccCCCCcceEEecccccCccccCcccchhhHHHHHHHHHHHHHHHHHHHHHh Confidence 111111233333333321100 00123356777543 33333332222222233333 Q ss_pred hccCCCceEEee-CCCC---CCHHHHHHHHHHHHHHhcccccCccee-cCCCceeeeccCChhhH-HHHHHHHHHHHHHH Q lcl|NC_011801. 200 RHAIKPSIFIKV-PNAT---LGKEAKENTRQSFEEQTTGENAGRAVV-LDQSADVETTNISPNVT-EFLQNVSFSQDQIA 273 (386) Q Consensus 200 ~ng~~~~~~l~~-~~~~---~~~~~~~~~k~~~~~~~~~~~~g~~~v-l~~g~~~~~~~~~~~d~-~~~e~~~~~~~~Ia 273 (386) +.|..++.. .... .+......++........-.......+ -..+.++.++... ++ .+++.++..+.+|+ T Consensus 257 ---a~p~r~i~G~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~q~~~~--~l~~~~~~l~~~i~~~a 331 (504) T protein:vir:99 257 ---SFPQLILLGADAKNFRNKDGSMKPAWQIALARVFALPDDEDEPDAARARADVKQFPAS--SPQPHIEMLEQIAMMFS 331 (504) T ss_pred ---cchhhhhccCCccccccccccccchhhhhhhhhhcCCCccccccccCccceeeecCCC--ChHHHHHHHHHHHHHHH Confidence 333333321 1000 011112233333333221111111111 1223555555433 33 37899999999999 Q ss_pred HHhCCCHHHhcCCcCcccH-HHHHHHHHHHH----HHHHHHHHHHHHHHhh----------------hhhhhhcchhhhc Q lcl|NC_011801. 274 KAFGIPADYLSGKQDAQSN-ITMIRAFYQSS----LSIYIKPIESELSQKL----------------GTDVKLDIASAID 332 (386) Q Consensus 274 ~~~gvp~~~l~~~~~~~~~-~~~~~~~~~~~----l~P~~~~ie~~l~~~l----------------~~~~~fd~~~~l~ 332 (386) ..=++|+..+|.....++. .++.+ +-... +.-..+.|.+.+.+.+ ...+++.+.+... T Consensus 332 ~~t~~P~~~lG~~~~~n~sSa~Ai~-~~~~~L~~ka~~k~~~f~~~l~~~~rla~~~~~~~~~~~~~~~~~~v~w~d~~~ 410 (504) T protein:vir:99 332 GETSIPVESLGFSNRANPTSADAYI-ASREDLIAEAEGATDDWSPAFRRSMIRALAIKNGLDRIPPEWKTIDSKFRSPLY 410 (504) T ss_pred hhhCCCHHHhcccccccccHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceeEecCCCc Confidence 9999999999865432222 12211 11111 2222233333332211 0122334555556 Q ss_pred cCHHHHHHHHHHHHhCCC--------------cCHHHHHHHhcc----------------CCcCCCCCC---Cccccccc Q lcl|NC_011801. 333 SDNSELINNVQKLASAGV--------------LAPIQAQKLLKN----------------RGVFPELDL---DEGTNLLD 379 (386) Q Consensus 333 ~d~~~~~~~~~~~~~~g~--------------~t~nE~R~~lg~----------------~p~~p~~~~---~~~~~~~~ 379 (386) .+..++++++.|++++|. +|+.|+.+++.. .+....++. ...+++.. T Consensus 411 ~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg~~~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~e~a~ 490 (504) T protein:vir:99 411 LSKAAQADAGAKMLGAGPEWLKETEVGLELLGLTPQQAKRALAERRRASSVSIIEALNRRQQEAATAGEDQDQGAGEPPA 490 (504) T ss_pred cCHHHHHHHHHHHHhhccccccchHHHHhhcCCCHHHHHHHHHHHHHHhhHHHHHHHhcccCCCCCCCCCCCcCCCCCCC Confidence 678888999999988764 255554432210 010000100 00112222 Q ss_pred cCCCCCC Q lcl|NC_011801. 380 NTKNIND 386 (386) Q Consensus 380 ~~~~~~~ 386 (386) ++....+ T Consensus 491 ~~~~~~~ 497 (504) T protein:vir:99 491 NEPPAAL 497 (504) T ss_pred CCCCccC Confidence 2222222 No 143 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=98.78 E-value=5.1e-08 Score=60.55 Aligned_cols=335 Identities=11% Similarity=0.025 Sum_probs=159.4 Q ss_pred cCcccccHHH-Hh----ccHHHHHHHHHHHHhhccCceeecchh----HHHHHhccCcccCCHHHHHHHHHHHHHHhCCe Q lcl|NC_011801. 31 IKPKAITSAI-AL----KNSDVYAVISRVSSDIAGCRFVTNAQP----ITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNA 101 (386) Q Consensus 31 ~~~~~i~~~~-a~----~~~~v~~~v~~ia~~ia~~p~~~~~~~----~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a 101 (386) +-......+- .+ ......-+|+.+++.+--..+.+.+.. +.+++.. |.. ......+..+.+.+|.| T Consensus 1 ~l~~~~~~~~~~~~~~~v~n~~~~ivd~~~~~l~~~gf~~~d~~~~~~~~~i~~~--N~~---d~~~~~~~~~a~i~G~a 75 (434) T protein:vir:98 1 MLPKNAEQAFLDFQRKARTNFCGLIANASVHRLLALGVTGPDGEPDTRASRWWQA--NRL---DSRQKLVWRMAMAQSAG 75 (434) T ss_pred CCCCCccHHHHHhhhhhhccchHHHHHHHHhhhccCceecCCCchHHHHHHHHHh--cCh---hHHHHHHHHHHhhcCce Confidence 1111110000 00 111223456655554433445544433 3333322 322 34666788889999999 Q ss_pred EEEEeecCCCce------EEEEEEcCcceEEeecCCCceeEEEEec---cCcccce-eEE-------------------- Q lcl|NC_011801. 102 FAIIDRDTNGYP------VRIEPVPNEKVTVALDDYGKDLTYTVHF---DDSKRSG-DFL-------------------- 151 (386) Q Consensus 102 ~~~~~~~~~g~~------~~l~~l~~~~v~~~~~~~~~~~~~~~~~---~~~~~~~-~~~-------------------- 151 (386) |+.+.++.++.. ..+..++|..+.+..+.......+.+.+ ...+... ... T Consensus 76 y~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (434) T protein:vir:98 76 YMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHNDIDGFGYARVFFDDTSFPYRTRERTGARLPW 155 (434) T ss_pred EEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEeccCCceEEEEEEeCcEEEEEEeecccccccc Confidence 999987665432 2377889998888776543222111110 0000000 000 Q ss_pred ------------------EcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCC Q lcl|NC_011801. 152 ------------------YDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPN 213 (386) Q Consensus 152 ------------------~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~ 213 (386) +..=+|+||. +.+. .+ ..|.|.++.....++....+.-......+-.+.|..++.. T Consensus 156 ~~~~~~~~~~~~~~~~h~~g~vPvv~f~--N~~~-~~-~~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~G-- 229 (434) T protein:vir:98 156 GPDSWVYTGTADSGDVHDLGGMQLVEFA--RMPD-LG-EDPEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKWIKG-- 229 (434) T ss_pred ccccceecccccccccCCCCccceEEec--cCCC-cC-cCCcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcC-- Confidence 1111233443 2111 22 2588988888888888777766666666656666655542 Q ss_pred CCCCHH--HHHHHHHHHHHHhcccccCcceecC-CCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcc Q lcl|NC_011801. 214 ATLGKE--AKENTRQSFEEQTTGENAGRAVVLD-QSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQ 290 (386) Q Consensus 214 ~~~~~~--~~~~~k~~~~~~~~~~~~g~~~vl~-~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~ 290 (386) ..+.+. .....+..++ .+... .+.++.++ ++.++.++....- ..+++.++..+.+|+..=++|+..++...+.+ T Consensus 230 ~~~~~~~~~~~~~~~~~~-~~~~~-~~~i~~~~~~~~~~~q~~~~~~-~~~~~~l~~~i~~~~~~~~~p~~~~~~~~~n~ 306 (434) T protein:vir:98 230 HKFAKRTDPATGMTVVDQ-PFVPS-PSAVWASEGENTQFGQLDATDL-SGFLKEHASDVRDMLTISQTPTYLYATDLVNI 306 (434) T ss_pred CCcccccccccccchhhh-hhhcc-ccccccCCCCCceEEEecCcch-HHHHHHHHHHHHHHhcccCCCHHHhccccCCh Confidence 111111 1111111122 11111 23455555 4567766654332 23788889999999999999999998532211 Q ss_pred cHHHHHHHHHHHHHHH----HHHHHHHHHHHhh-------h-----hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHH Q lcl|NC_011801. 291 SNITMIRAFYQSSLSI----YIKPIESELSQKL-------G-----TDVKLDIASAIDSDNSELINNVQKLASAGVLAPI 354 (386) Q Consensus 291 ~~~~~~~~~~~~~l~P----~~~~ie~~l~~~l-------~-----~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~n 354 (386) ..++.. +....|.- ..+.|.+.|.+.+ + ..+++.+......+..+.++++.++++.|+ +.. T Consensus 307 Sg~Al~--~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~g~~~~~~~~~v~w~~~~~~s~~~~ada~~kl~~~g~-~~e 383 (434) T protein:vir:98 307 SADTIG--ALDILHVAKVREHIASFSEGLESVLALAAAQAGVPEDYTEAEVRWANPAHVTMAVKADAATKLKSIGY-PLD 383 (434) T ss_pred HHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCChhheeeeEEecCCCCCCHHHHHHHHHHHHhcCC-cHH Confidence 122111 11111222 2223333333211 1 123344555566788999999999998885 444 Q ss_pred HHHHHhccCCcCCCCCCC-------c--------cccccccCC-------CCCC Q lcl|NC_011801. 355 QAQKLLKNRGVFPELDLD-------E--------GTNLLDNTK-------NIND 386 (386) Q Consensus 355 E~R~~lg~~p~~p~~~~~-------~--------~~~~~~~~~-------~~~~ 386 (386) -+++++|..+ . +.. + ....-+++. +-.| T Consensus 384 ~~~~~lg~~~---~-e~~r~~~e~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~d 433 (434) T protein:vir:98 384 VIAEELDESP---A-RVRRIVAGAASQALLAASLLPAPGAPSAGNVPDSGGAVD 433 (434) T ss_pred HHHHhCCCCH---H-HHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCcccCCCC Confidence 4444444211 0 000 0 000001111 1111 No 144 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=98.71 E-value=2.8e-08 Score=61.99 Aligned_cols=382 Identities=11% Similarity=0.104 Sum_probs=182.7 Q ss_pred Cchhhhh-------------cc--ccccCCccchhhh------------hhcccccccCc---ccccHHHHhccHHHHHH Q lcl|NC_011801. 1 MAFLSNL-------------FK--RQKMLSGSSPVWI------------LNQGQPVSIKP---KAITSAIALKNSDVYAV 50 (386) Q Consensus 1 Mg~~~~l-------------~~--~~~~~~~~~~~~~------------~~~~~~~~~~~---~~i~~~~a~~~~~v~~~ 50 (386) |.+|.+. ++ .+....+.+++.. ..++.....+. .....+.|+.+|.|..| T Consensus 4 ~~~w~~~de~~~~~~~~~~~~~~~~p~~~dG~s~i~~~~~~~~~~~~~~~~~~gg~~~n~~eLI~~YR~ma~~~pEVd~A 83 (533) T protein:vir:58 4 LEKYKKLNEAVNFTNFLSPMYGMGAPHGAGGSSMIPINMYHPFATAGYASRFYGGIEFNRFFLYDMYDRMDYTDPLISTV 83 (533) T ss_pred cchhhhhhHHHHHHHhhchhhcccCccCCCCCccccCCCCcchhhhhhhhhhhccccccHHHHHHHHHHhhccCcchhhH Confidence 4555432 11 1111111111110 00111000000 01123445678999999 Q ss_pred HHHHHHhhccC-----ceeec--chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeec-CCCceEEEEEEcCc Q lcl|NC_011801. 51 ISRVSSDIAGC-----RFVTN--AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRD-TNGYPVRIEPVPNE 122 (386) Q Consensus 51 v~~ia~~ia~~-----p~~~~--~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~-~~g~~~~l~~l~~~ 122 (386) |+.|++.+.-. |+.+. +.++.+-....-...+....--...+..|...|..|..++.+ ..+...+|..|+|. T Consensus 84 ideIvneaiv~d~~~~pV~v~l~~~e~s~~iK~kI~~lldf~~~~~~~fR~WYVDGriy~Hkiik~~k~GI~elr~lDPr 163 (533) T protein:vir:58 84 LDIIADECTIPNENGNIVDVVTKDIELAKAILSYLDYVINIEKNAYPIIRNMIKYGDMFLHILEKGSDGTIEKFQVVSPY 163 (533) T ss_pred HHhhhceeeEecCCCceeEeecccccccHHHHHHHHHHhcchhhhhHHHHhhhhcceeEEEeccCCcccchhhheecCCe Confidence 99999876532 33332 111111111111112222223344566778899999888643 45556799999999 Q ss_pred ceEEeecCCCceeEEEEecc---CcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 123 KVTVALDDYGKDLTYTVHFD---DSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTL 199 (386) Q Consensus 123 ~v~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~ 199 (386) .++.+++..+...+|.|... +......+.++.+.|+|+..-. ...++-+++|-+..+.+.+.....+....--+= T Consensus 164 ~i~~vr~~~t~~eyyvy~~~~~~~~s~~~~~kI~~daI~y~~SGl--~d~~~~~iisyLhkAiKp~NQLkmiEDAlVIYR 241 (533) T protein:vir:58 164 IFSKRYNPETDTWYYVITDVYRNVVSGYFNEDIPEEDVIHFSHKI--DTNFFPYGRSYLESARAIWNQLRLMEDALMLYR 241 (533) T ss_pred eeEEEEeeccceEEEeecccccccccCccccccchhheeeeeecc--ccCCCCceehhhhHHHHHHHHHHHHHHHHHHHh Confidence 99998877666666655422 2233344678899999987442 223455777878877665555555544443332 Q ss_pred hccCCCceEEeeCCCCCC----HHHHHHHHHHHHHHhcc-cccCcc----------eec----------CCCceeeeccC Q lcl|NC_011801. 200 RHAIKPSIFIKVPNATLG----KEAKENTRQSFEEQTTG-ENAGRA----------VVL----------DQSADVETTNI 254 (386) Q Consensus 200 ~ng~~~~~~l~~~~~~~~----~~~~~~~k~~~~~~~~~-~~~g~~----------~vl----------~~g~~~~~~~~ 254 (386) -.-+.-+-++.+.-+.+. ++-.+.+-.++++.+-= .++|.+ ..+ +.|.++++|.- T Consensus 242 isRAPeRRvFYIDVGNlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRReGgrgTEI~TLpG 321 (533) T protein:vir:58 242 VVRSVDRRVFYVDVGNVPPDKINEYLTNIAMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRRGDRRAVEIDILQG 321 (533) T ss_pred hcCChhheEEEEeecCCCccCHHHHHHHHHHhcccceEEeccCCeEeeccchhhhhhhHhhhcccccCCCccceeeecCC Confidence 222322333333222222 22223333333332211 222322 122 23677887764 Q ss_pred ChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHH--HHHHHHHHHHHHHHHHHHHhhh-------hhhhh Q lcl|NC_011801. 255 SPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRA--FYQSSLSIYIKPIESELSQKLG-------TDVKL 325 (386) Q Consensus 255 ~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~--~~~~~l~P~~~~ie~~l~~~l~-------~~~~f 325 (386) . . +.-++-.++.++.+..+++||.+-++..+......+-.+. =+..-|.-+...|.+.|.+.|- ..|+| T Consensus 322 g-~-lgemeDV~YF~kkLy~ALnVP~sRl~~e~~fgr~~eItRDEiKF~KFI~rLR~rF~~ll~~qLilk~iit~eew~~ 399 (533) T protein:vir:58 322 S-K-VDLAEDVEYMLNRLISALKVPKAFIGYEGDVNAKNTLATQDIKFNNTIKRIQGFFVEELERMVRMNKEFADQDFRL 399 (533) T ss_pred C-C-CCcHHHHHHHHHHHHHHhCCCeeecCCCCCCccchhhhHHHHHHHHHHHHHHHHHHHHHhcccccccCcchhheee Confidence 3 3 5557788899999999999999999765443332222111 1344567777778888877662 12222 Q ss_pred c--chhhhcc--C---HHHHHHHHHHH---HhC------------CCcCHHHHHHHhccCCcCCCCCC----------Cc Q lcl|NC_011801. 326 D--IASAIDS--D---NSELINNVQKL---ASA------------GVLAPIQAQKLLKNRGVFPELDL----------DE 373 (386) Q Consensus 326 d--~~~~l~~--d---~~~~~~~~~~~---~~~------------g~~t~nE~R~~lg~~p~~p~~~~----------~~ 373 (386) + .+..... + +..|+.+++.+ |+. -+.+..|.-+.+++.|+.+.++. .+ T Consensus 400 ~f~~Dn~f~ElKe~Eil~~Ri~~l~~~dpyvgk~yi~k~ILr~tdei~~q~e~ie~E~~~~~~~~~~~~~e~~~~~~~~~ 479 (533) T protein:vir:58 400 VMNRSNSIVEGERFAVIEQRIGIAERLKGWVREDWIYSNILQIPYDLKPQEEVAEAAGGGGLFDTGGFGEETTPADFLGE 479 (533) T ss_pred eeeccchHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHhcCChhhhHHHHHHHHhhcCCCCCCCCcccccCCcccCcc Confidence 2 2211100 0 11122222111 100 11122233333445555444322 22 Q ss_pred cccccccCCCCCC Q lcl|NC_011801. 374 GTNLLDNTKNIND 386 (386) Q Consensus 374 ~~~~~~~~~~~~~ 386 (386) .++|++++....- T Consensus 480 ~~~p~~~~~~~~~ 492 (533) T protein:vir:58 480 RGSPIESPRGRTE 492 (533) T ss_pred ccCcccCCCChhh Confidence 2333333222211 No 145 >protein:vir:8654 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817773;genbank:gi:29566205;genbank:GeneID:1259465 Probab=98.69 E-value=5.9e-09 Score=65.72 Aligned_cols=372 Identities=13% Similarity=0.112 Sum_probs=195.0 Q ss_pred Cchhh-hhccccccCCccchhhhhhc-ccc------cccCcccccHH--------HHh-ccHHHHHHHHHHHHhhccCce Q lcl|NC_011801. 1 MAFLS-NLFKRQKMLSGSSPVWILNQ-GQP------VSIKPKAITSA--------IAL-KNSDVYAVISRVSSDIAGCRF 63 (386) Q Consensus 1 Mg~~~-~l~~~~~~~~~~~~~~~~~~-~~~------~~~~~~~i~~~--------~a~-~~~~v~~~v~~ia~~ia~~p~ 63 (386) |.=-+ |+.+|++..++......... ... ..+....++.+ .++ -++.++..+..|++.++++.+ T Consensus 1 ma~~~lr~~rrpk~~p~~~r~~al~aas~~i~~p~~~~~ks~~~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~Sr~rL 80 (629) T protein:vir:86 1 MAPTSLRIVRRPKSEPVSTRQRALVAASQPVENPGKAFRKAMGSSTRTDWQEDAWKAYDAVGELRYYVGWRSSSASRVRL 80 (629) T ss_pred CCccceeeeecCCCCChhhhhhhhhhhhhccccccchhhhhcCCCchhhhhHHHHHHHHhhhhHHHHhhhhhhhhceeee Confidence 65533 34555555444322111110 001 11000111111 122 367788888999999999988 Q ss_pred eecc-------------h--h----HHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCC------ceEE-EE Q lcl|NC_011801. 64 VTNA-------------Q--P----ITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNG------YPVR-IE 117 (386) Q Consensus 64 ~~~~-------------~--~----~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g------~~~~-l~ 117 (386) .+-. + + +.++...--.--+...++++.+..++-.-|++|+.+.....| .+.. .+ T Consensus 81 ~as~idpDtg~ptg~i~e~~~~~~~v~~~v~~i~gG~lgqa~lLkr~~~~ltV~GE~wiv~~~~~~~~~d~~~~~~~eW~ 160 (629) T protein:vir:86 81 IASAIDPDTGLPTGSIDEDDRVGARVQQIVNQIAGGALGQAQLIKRVVEQLTVAGETWVAILFTDKSRLDSNGNPVPEWL 160 (629) T ss_pred EeeeecCCCCCCccccCCCchhHHHHHHHHHhhcCChhhHHHHHHHHHhheecccceEEEEeecCCCccCCCCcchhhhe Confidence 5421 1 1 223333323455667899999999999999999988643332 2222 33 Q ss_pred EEcCcceEEeecCCCceeEEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 118 PVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAIS 197 (386) Q Consensus 118 ~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~ 197 (386) .+-++.++-. .++.. +... .....+..+..+++ +|.++ +.+....+--||+.++...+.-..-..+...+ T Consensus 161 ~vt~~ei~~~--~~~~~----i~lP--~g~~~e~~~~~d~l-~RiW~-P~Prr~~e~DSpvra~l~~l~Ei~~lt~~i~a 230 (629) T protein:vir:86 161 ALTPEEVRAS--EKKTI----IELP--TGDKHEFRDGLDGM-FRVWN-PRARRAREPDSPVRANLDSLKEIVRTTKTIAN 230 (629) T ss_pred eechHHhhhc--cCcee----eEcC--CCCcceeeCCCceE-EEeeC-CCcccccCCcchhHHHHHHHHHHHHhhhHHHH Confidence 3334443311 11111 1111 11223334444554 77665 45666788999999999998888877777777 Q ss_pred HHhccCCCceEEeeCCCC---------------------CCHHHHHHHHHHHH----HHhccc--ccC-cceecC----- Q lcl|NC_011801. 198 TLRHAIKPSIFIKVPNAT---------------------LGKEAKENTRQSFE----EQTTGE--NAG-RAVVLD----- 244 (386) Q Consensus 198 ~~~ng~~~~~~l~~~~~~---------------------~~~~~~~~~k~~~~----~~~~~~--~~g-~~~vl~----- 244 (386) ..+.-....|++.+|... ...-..+.+.+.|- ..+... .+- -++++. T Consensus 231 aakSRL~gnGvlflP~e~slP~~~~p~~~n~pg~~~p~~~~~pa~~~l~~~l~q~a~tAi~De~S~aA~vPiia~~P~E~ 310 (629) T protein:vir:86 231 ASKSRLIGNGVVFVPHEMSLPSMNAPVASNKPGAPAPPILGTPAVQQLQELLFQVAQTAYDDEDSMAALIPMFAAAPGEL 310 (629) T ss_pred HHHHHHhhCceeeeccCcccCccCCCCCCCCCCcccccccccchHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechHH Confidence 666555555665444211 00113344555444 333332 222 233331 Q ss_pred -CCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccH--H-HHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_011801. 245 -QSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSN--I-TMIRAFYQSSLSIYIKPIESELSQKLG 320 (386) Q Consensus 245 -~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~--~-~~~~~~~~~~l~P~~~~ie~~l~~~l~ 320 (386) ++++.-.+.....+. -+++++..+.+||...-|||+.|-+.++.+|- . +-...-++-.|.|.+..|++++++.++ T Consensus 311 i~~i~hlkf~~ei~e~-aiktR~daI~RlA~glDippE~LLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~AlT~~~L 389 (629) T protein:vir:86 311 IKNVTHLKFDNQVTEV-AIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVRLHILPPVEMLCEAITNQVL 389 (629) T ss_pred hcCeeEEeecCchhHH-HHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchHHHHHHHHHHhhHH Confidence 223333333333333 47899999999999999999987544322221 0 111223455699999999999998764 Q ss_pred hh-------------hhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCC--CCCccccccccCCCC- Q lcl|NC_011801. 321 TD-------------VKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPEL--DLDEGTNLLDNTKNI- 384 (386) Q Consensus 321 ~~-------------~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~--~~~~~~~~~~~~~~~- 384 (386) .. +=||.+.+. .|+. +.+-+..++..|.+|-...|+.+|.. .+.+- ..+|+-.-+....-. T Consensus 390 rp~Le~eGiDp~kYvvW~DaS~Lt-~dPd-~~deA~~a~drGAIt~eAlrk~lGf~-eD~~yd~tt~E~~~~~a~d~V~~ 466 (629) T protein:vir:86 390 RTVLMREGIDPNAYVVWHDASQLT-VDPD-KTDEARDAFDRGAITAEAMVKMLGLA-DDTVYDFTTPEGWAQWARDRVGQ 466 (629) T ss_pred HHHHHHhCCCHHHhEeeecCcccc-cCCC-CcHHHHHHHHcCCcCHHHHHHHhcCc-cccccCCCchHHHHHHHHHhhhh Confidence 21 124555442 2221 23333457889999999999999832 11111 111211111000000 Q ss_pred -----------------------------------CC Q lcl|NC_011801. 385 -----------------------------------ND 386 (386) Q Consensus 385 -----------------------------------~~ 386 (386) +| T Consensus 467 ~P~Li~~~a~l~~~~a~~~~P~~~~~~pp~~e~~~~d 503 (629) T protein:vir:86 467 DPNLLPTLAVLIPELADVEFPTPTVALPPAEEQDGDE 503 (629) T ss_pred CcchhhhhhhhhhhhcccccCccCCCCCccccCCCcc Confidence 01 No 146 >protein:vir:99088 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655692;genbank:gi:109521770;genbank:GeneID:4157810 Probab=98.67 E-value=6.7e-09 Score=65.38 Aligned_cols=372 Identities=13% Similarity=0.112 Sum_probs=195.1 Q ss_pred Cchhh-hhccccccCCccchhhhhhc-ccc------cccCcccccHH--------HHh-ccHHHHHHHHHHHHhhccCce Q lcl|NC_011801. 1 MAFLS-NLFKRQKMLSGSSPVWILNQ-GQP------VSIKPKAITSA--------IAL-KNSDVYAVISRVSSDIAGCRF 63 (386) Q Consensus 1 Mg~~~-~l~~~~~~~~~~~~~~~~~~-~~~------~~~~~~~i~~~--------~a~-~~~~v~~~v~~ia~~ia~~p~ 63 (386) |.=-+ |+.+|++..++......... ... ..+....++.+ .++ -++.++..+..|++.++++.+ T Consensus 1 ma~~~lr~~rrpk~~p~~~r~~al~aas~~i~~p~~~~~ks~~~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~Sr~rL 80 (629) T protein:vir:99 1 MAPTSLRIVRRPKSEPVSTRQRALVAASQPVENPGKAFRKAMGSSTRTDWQDDAWKAYDAVGELRYYVGWRSSSASRVRL 80 (629) T ss_pred CCccceeeeecCCCCChhhhhhhhhhhhhcccccchhhhhhcCCCchhhhhHHHHHHHHhhhhHHHHhhhhhhhhceeee Confidence 65533 34555555444322111110 001 11000111111 122 367788888999999999988 Q ss_pred eecc-------------h--h----HHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCC------ceEE-EE Q lcl|NC_011801. 64 VTNA-------------Q--P----ITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNG------YPVR-IE 117 (386) Q Consensus 64 ~~~~-------------~--~----~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g------~~~~-l~ 117 (386) .+-. + + +.++...--.--+...++++.+..++-.-|++|+.+.....| .+.. .+ T Consensus 81 ~as~idpDtg~ptg~i~e~~~~~~~v~~~v~~i~gG~lgqa~lLkr~~~~ltV~GE~wiv~~~~~~~~~d~~~~~~~eW~ 160 (629) T protein:vir:99 81 IASAIDPDTGLPTGSIDEDDRVGARVQQIVNQIAGGALGQAQLIKRVVEQLTVAGETWVAILFTDKSRLDSNGNPVPEWL 160 (629) T ss_pred EeeeecCCCCCCccccCCCchhHHHHHHHHHhhcCChhhHHHHHHHHHhheecccceEEEEeecCCCccCCCCcchhhhe Confidence 5421 1 1 223333323455667899999999999999999988743332 2222 33 Q ss_pred EEcCcceEEeecCCCceeEEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 118 PVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAIS 197 (386) Q Consensus 118 ~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~ 197 (386) .+-++.++-. .++.. +... .....+..+..+++ +|.++ +.+....+--||+.++...+.-..-..+...+ T Consensus 161 ~vt~~ei~~~--~~~~~----i~lP--~g~~~e~~~~~d~l-~RiW~-P~Prr~~e~DSpvra~l~~l~Ei~~lt~~i~a 230 (629) T protein:vir:99 161 ALTPEEVRAS--EKKTI----IELP--TGDKHEFRDGLDGM-FRVWN-PRARRAREPDSPVRANLDSLKEIVRTTKTIAN 230 (629) T ss_pred eechHHhhhc--cCcee----EEcC--CCCccceeCCCceE-EEeeC-CCcccccCCcchhHHHHHHHHHHHHhhhHHHH Confidence 3334443311 11111 1111 11223334445544 77665 45666788999999999999888877777777 Q ss_pred HHhccCCCceEEeeCCCC---------------------CCHHHHHHHHHHHH----HHhccc--ccC-cceecC----- Q lcl|NC_011801. 198 TLRHAIKPSIFIKVPNAT---------------------LGKEAKENTRQSFE----EQTTGE--NAG-RAVVLD----- 244 (386) Q Consensus 198 ~~~ng~~~~~~l~~~~~~---------------------~~~~~~~~~k~~~~----~~~~~~--~~g-~~~vl~----- 244 (386) ..+.-....|++.+|... ...-..+.+.+.|- ..+... .+- -++++. T Consensus 231 aakSRL~gnGvlflP~e~slP~~~~p~~~n~pg~~~p~~~~~pa~~~l~~~l~q~a~tAi~De~S~aA~vPiia~~P~E~ 310 (629) T protein:vir:99 231 ASKSRLIGNGVVFVPHEMSLPSMNAPVASNKPGAPAPPILGTPAVQQLQELLFQVAQTAYDDEDSMAALIPMFAAAPGEL 310 (629) T ss_pred HHHHHHhhCceeEeccCcccCccCCCCCCCCCCcccccccccchHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechHH Confidence 666555555665444211 00113344555444 333332 222 233331 Q ss_pred -CCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccH--H-HHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_011801. 245 -QSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSN--I-TMIRAFYQSSLSIYIKPIESELSQKLG 320 (386) Q Consensus 245 -~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~--~-~~~~~~~~~~l~P~~~~ie~~l~~~l~ 320 (386) ++++.-.+.....+. -+++++..+.+||...-|||+.|-+.++.+|- . +-...-++-.|.|.+..|++++++.++ T Consensus 311 i~~i~hlkf~~ei~e~-aiktR~daI~RlA~glDippE~LLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~AlT~~~L 389 (629) T protein:vir:99 311 IKNVTHLKFDNQVTEV-AIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVRLHILPPVEMLCEAITNQVL 389 (629) T ss_pred hcCeeEEeecCchhHH-HHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchhHHHHHHHHHhhHH Confidence 223333333333333 47899999999999999999987544322221 0 111223455699999999999998764 Q ss_pred hh-------------hhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCC--CCCccccccccCCCC- Q lcl|NC_011801. 321 TD-------------VKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPEL--DLDEGTNLLDNTKNI- 384 (386) Q Consensus 321 ~~-------------~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~--~~~~~~~~~~~~~~~- 384 (386) .. +=||.+.+. .|+. +.+-+..++..|.+|-...|+.+|.. .+.+- ..+|+-.-+....-. T Consensus 390 rp~Le~eGiDp~kYvvW~DaS~Lt-~dPd-~~deA~~a~drGAIt~eAlrk~lGf~-eD~~yd~tt~E~~~~~a~d~V~~ 466 (629) T protein:vir:99 390 RTVLMREGIDPNAYVVWHDASQLT-VDPD-KTDEARDAFDRGAITAEAMVKMLGLA-DDTVYDFTTPEGWAQWARDRVGQ 466 (629) T ss_pred HHHHHHhCCCHHHhEeeecCcccc-cCCC-CcHHHHHHHHcCCccHHHHHHHhcCc-cccccCCCchHHHHHHHHHhhhh Confidence 21 124555442 2221 23333457889999999999999832 11111 111211101000000 Q ss_pred -----------------------------------CC Q lcl|NC_011801. 385 -----------------------------------ND 386 (386) Q Consensus 385 -----------------------------------~~ 386 (386) +| T Consensus 467 ~P~Li~~~a~l~~~~a~~~~P~~~~~~pp~~e~~~~d 503 (629) T protein:vir:99 467 DPNLLPTLAVLIPELADVEFPTPTVALPPAEEQDGDE 503 (629) T ss_pred CcchhhhhhhhhhhhcccccCccCCCCCccccCCCcc Confidence 01 No 147 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=98.64 E-value=1e-09 Score=69.81 Aligned_cols=172 Identities=16% Similarity=0.149 Sum_probs=98.2 Q ss_pred EEeeCC--CCCCHHHHHHHHHHHHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhc Q lcl|NC_011801. 208 FIKVPN--ATLGKEAKENTRQSFEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLS 284 (386) Q Consensus 208 ~l~~~~--~~~~~~~~~~~k~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~ 284 (386) |+++++ ..++.. .+.++++++..... ++.+.+.+..++.+|+.++.+...+ .+........||++-|||...|- T Consensus 1 V~k~~~l~~~~~~~-~~~~~~r~~~~~~~~~~~~~~~ld~~~e~~e~~~~~lsGl--~d~l~~~~~~iaa~s~iP~t~Lf 77 (201) T protein:vir:10 1 MWKAKGLADLCDDS-DGAARLRLAQVDNNSGVGQAIGIDADSEEYNVLNSDIGGI--DTFLSQKFDRIVALSGIHEIILK 77 (201) T ss_pred CccchHHHHHhcCC-hHHHHHHHHHHHHhhhhhhhheeecCCcceeeeecCcCCh--HHHHHHHHHHHHhHhcCchhhhc Confidence 444332 111111 12344455433222 2223455566667899888877655 56788899999999999998874 Q ss_pred CCcC-cc--cHHHHHHHHHH-------HHHHHHHHHHHHHHHHhhhhhhhhcchhhhccCHHHHH-------HHHHHHHh Q lcl|NC_011801. 285 GKQD-AQ--SNITMIRAFYQ-------SSLSIYIKPIESELSQKLGTDVKLDIASAIDSDNSELI-------NNVQKLAS 347 (386) Q Consensus 285 ~~~~-~~--~~~~~~~~~~~-------~~l~P~~~~ie~~l~~~l~~~~~fd~~~~l~~d~~~~~-------~~~~~~~~ 347 (386) +... +- ..+...+.||. .-|.|.++.+-+.+.. -..+.|.+.++...+.++++ +++++++. T Consensus 78 G~sp~Glnatge~d~~nyyd~i~~~Qe~~l~p~le~l~~~~~~--~~~~~~~f~pL~~~s~kekAei~~~~a~a~~~~~~ 155 (201) T protein:vir:10 78 GKNVGGVSASQNTALETFYGYVDRKRKAELLPLLEFLLPFIVT--EQEWSVEFNPLSQVSDKDKSEILEKNVNSVAALIA 155 (201) T ss_pred CCCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC--CCCceEeeCCCCCCCHHHHHHHHHHHHHHHHHHHH Confidence 4332 22 22334444544 4466666654443221 24567788888888877765 45677889 Q ss_pred CCCcCHHHHHHHhccCCc---CCCCCCCccc------cccccCCCC Q lcl|NC_011801. 348 AGVLAPIQAQKLLKNRGV---FPELDLDEGT------NLLDNTKNI 384 (386) Q Consensus 348 ~g~~t~nE~R~~lg~~p~---~p~~~~~~~~------~~~~~~~~~ 384 (386) +|+++++|+|+.|-..|. .+.+..++.. ++...+.|+ T Consensus 156 ~g~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~~~e~~dp~~~~~~~ 201 (201) T protein:vir:10 156 AGIIDADEARDTLRAISTEVKIGEGSIQTEVVINESEDPLDVSANN 201 (201) T ss_pred cCCCCHHHHHHHHHhcCCcCCCCCCCCCccccccccCCCCCCCCCC Confidence 999999999998854332 3323333221 222222222 No 148 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=98.59 E-value=2.2e-07 Score=57.06 Aligned_cols=371 Identities=13% Similarity=0.032 Sum_probs=168.7 Q ss_pred CchhhhhccccccCCccchhhhhhcccc---cccCcccccHH-----HHhccHHHHHHHHHHHHhhccCceeecch---h Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQP---VSIKPKAITSA-----IALKNSDVYAVISRVSSDIAGCRFVTNAQ---P 69 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~---~~~~~~~i~~~-----~a~~~~~v~~~v~~ia~~ia~~p~~~~~~---~ 69 (386) .-++.+|..........- ......+.. ...-+.....+ .-..+.-..-+|+..++.+-.-|+.+... . T Consensus 7 ~~~~~~l~~~~~~~~~r~-~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~~~d~~ 85 (456) T protein:vir:10 7 AEWLPVLTKRIDDGMSRV-RLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADSD 85 (456) T ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCCCCCcc Confidence 223333221111000000 000000000 00001111110 00122334567777777777778865321 1 Q ss_pred ----HHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCce--eEE--EEec Q lcl|NC_011801. 70 ----ITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKD--LTY--TVHF 141 (386) Q Consensus 70 ----~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~--~~~--~~~~ 141 (386) +.+++.. |. ...+...+..+++.+|.||..+..+.+|.+ .+..++|..+.+..+..... ... .+.. T Consensus 86 ~~~~~~~i~~~--N~---~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~i~~~~~ 159 (456) T protein:vir:10 86 LALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRAAMRWWRD 159 (456) T ss_pred hHHHHHHHHHh--cC---hhhHHHHHHHHHhhcCeeEEEEeeCCCCce-EEEEEccceeEEEEcCCCCcceEEEEEEEEe Confidence 3334432 32 234566788899999999999988888876 46778898888877654321 100 0000 Q ss_pred cCccccee-------------------------EEEcc------cceeeeccccccCcccccccccHHHHHHHHHHHHHH Q lcl|NC_011801. 142 DDSKRSGD-------------------------FLYDS------SEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDL 190 (386) Q Consensus 142 ~~~~~~~~-------------------------~~~~~------~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~ 190 (386) .+...... ..... ...-|+-...+-.......|+|.++.....++.... T Consensus 160 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~g~gd~e~vi~liDa~~~ 239 (456) T protein:vir:10 160 LDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINR 239 (456) T ss_pred cCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecCCCCCchhhhhHHHHHHHHH Confidence 00000000 00000 000010000000001113578888777766666554 Q ss_pred HHHHHHHHHhccCCCceEEeeCCCC--CCHHHHHH--HHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHH Q lcl|NC_011801. 191 SSKLAISTLRHAIKPSIFIKVPNAT--LGKEAKEN--TRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVS 266 (386) Q Consensus 191 ~~~~~~~~~~ng~~~~~~l~~~~~~--~~~~~~~~--~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~ 266 (386) +.--........+.|..++...+.. ..++.-.. ....|+. ..+.++.++++.++.++....- ..+.+..+ T Consensus 240 ~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~-----~~~~~~~~~~~~~~~q~~~~~~-~~~~~~l~ 313 (456) T protein:vir:10 240 AELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEA-----APGALWELPPGVDIWESQANDF-TPMLSAIK 313 (456) T ss_pred HHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhh-----hccccccCCCCcceEEecccCh-hHHHHHHH Confidence 4433333333334443333321100 00111111 1112222 1234667788999887764322 23788999 Q ss_pred HHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhh-----------h----hhhhhcchhhh Q lcl|NC_011801. 267 FSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYIKPIESELSQKL-----------G----TDVKLDIASAI 331 (386) Q Consensus 267 ~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l-----------~----~~~~fd~~~~l 331 (386) ..+.+|++.=++|+..++...... +.++.+ +....+.-.+...+..|...| + ..++..+.+.. T Consensus 314 ~~i~~~~~~s~~p~~~~~~~~~N~-Sg~Ai~-~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~~~~~~~v~w~~~~ 391 (456) T protein:vir:10 314 EHIRQLSSATKTPLPMLMPDSANQ-SAEGAH-NIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPD 391 (456) T ss_pred HHHHHHHhccCCChHHhcccccCh-HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCC Confidence 999999999999999997542211 211111 111122222222222222211 1 12334454555 Q ss_pred ccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcC-CCCCCC--------ccccccccCCCCCC Q lcl|NC_011801. 332 DSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVF-PELDLD--------EGTNLLDNTKNIND 386 (386) Q Consensus 332 ~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~-p~~~~~--------~~~~~~~~~~~~~~ 386 (386) ..+..+.++++.++++.|+.+..-+++++|..+.. +..+.. .++++.+...-+.. T Consensus 392 ~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~~~~~~~ 455 (456) T protein:vir:10 392 RVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQEDGS 455 (456) T ss_pred CcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhcCCCCCC Confidence 67788999999999999999888888877754310 000000 01222222222222 No 149 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=98.59 E-value=2.2e-07 Score=57.06 Aligned_cols=371 Identities=13% Similarity=0.032 Sum_probs=168.7 Q ss_pred CchhhhhccccccCCccchhhhhhcccc---cccCcccccHH-----HHhccHHHHHHHHHHHHhhccCceeecch---h Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQP---VSIKPKAITSA-----IALKNSDVYAVISRVSSDIAGCRFVTNAQ---P 69 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~---~~~~~~~i~~~-----~a~~~~~v~~~v~~ia~~ia~~p~~~~~~---~ 69 (386) .-++.+|..........- ......+.. ...-+.....+ .-..+.-..-+|+..++.+-.-|+.+... . T Consensus 7 ~~~~~~l~~~~~~~~~r~-~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~~~d~~ 85 (456) T protein:vir:10 7 AEWLPVLTKRIDDGMSRV-RLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADSD 85 (456) T ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCCCCCcc Confidence 223333221111000000 000000000 00001111110 00122334567777777777778865321 1 Q ss_pred ----HHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCce--eEE--EEec Q lcl|NC_011801. 70 ----ITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKD--LTY--TVHF 141 (386) Q Consensus 70 ----~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~--~~~--~~~~ 141 (386) +.+++.. |. ...+...+..+++.+|.||..+..+.+|.+ .+..++|..+.+..+..... ... .+.. T Consensus 86 ~~~~~~~i~~~--N~---~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~i~~~~~ 159 (456) T protein:vir:10 86 LALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRAAMRWWRD 159 (456) T ss_pred hHHHHHHHHHh--cC---hhhHHHHHHHHHhhcCeeEEEEeeCCCCce-EEEEEccceeEEEEcCCCCcceEEEEEEEEe Confidence 3334432 32 234566788899999999999988888876 46778898888877654321 100 0000 Q ss_pred cCccccee-------------------------EEEcc------cceeeeccccccCcccccccccHHHHHHHHHHHHHH Q lcl|NC_011801. 142 DDSKRSGD-------------------------FLYDS------SEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDL 190 (386) Q Consensus 142 ~~~~~~~~-------------------------~~~~~------~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~ 190 (386) .+...... ..... ...-|+-...+-.......|+|.++.....++.... T Consensus 160 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~g~gd~e~vi~liDa~~~ 239 (456) T protein:vir:10 160 LDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINR 239 (456) T ss_pred cCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecCCCCCchhhhhHHHHHHHHH Confidence 00000000 00000 000010000000001113578888777766666554 Q ss_pred HHHHHHHHHhccCCCceEEeeCCCC--CCHHHHHH--HHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHH Q lcl|NC_011801. 191 SSKLAISTLRHAIKPSIFIKVPNAT--LGKEAKEN--TRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVS 266 (386) Q Consensus 191 ~~~~~~~~~~ng~~~~~~l~~~~~~--~~~~~~~~--~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~ 266 (386) +.--........+.|..++...+.. ..++.-.. ....|+. ..+.++.++++.++.++....- ..+.+..+ T Consensus 240 ~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~-----~~~~~~~~~~~~~~~q~~~~~~-~~~~~~l~ 313 (456) T protein:vir:10 240 AELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEA-----APGALWELPPGVDIWESQANDF-TPMLSAIK 313 (456) T ss_pred HHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhh-----hccccccCCCCcceEEecccCh-hHHHHHHH Confidence 4433333333334443333321100 00111111 1112222 1234667788999887764322 23788999 Q ss_pred HHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhh-----------h----hhhhhcchhhh Q lcl|NC_011801. 267 FSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYIKPIESELSQKL-----------G----TDVKLDIASAI 331 (386) Q Consensus 267 ~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l-----------~----~~~~fd~~~~l 331 (386) ..+.+|++.=++|+..++...... +.++.+ +....+.-.+...+..|...| + ..++..+.+.. T Consensus 314 ~~i~~~~~~s~~p~~~~~~~~~N~-Sg~Ai~-~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~~~~~~~v~w~~~~ 391 (456) T protein:vir:10 314 EHIRQLSSATKTPLPMLMPDSANQ-SAEGAH-NIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPD 391 (456) T ss_pred HHHHHHHhccCCChHHhcccccCh-HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCC Confidence 999999999999999997542211 211111 111122222222222222211 1 12334454555 Q ss_pred ccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcC-CCCCCC--------ccccccccCCCCCC Q lcl|NC_011801. 332 DSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVF-PELDLD--------EGTNLLDNTKNIND 386 (386) Q Consensus 332 ~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~-p~~~~~--------~~~~~~~~~~~~~~ 386 (386) ..+..+.++++.++++.|+.+..-+++++|..+.. +..+.. .++++.+...-+.. T Consensus 392 ~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~~~~~~~ 455 (456) T protein:vir:10 392 RVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQEDGS 455 (456) T ss_pred CcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhcCCCCCC Confidence 67788999999999999999888888877754310 000000 01222222222222 No 150 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=98.57 E-value=2.6e-07 Score=56.72 Aligned_cols=368 Identities=10% Similarity=0.013 Sum_probs=171.8 Q ss_pred Cchhh---hhcccc-----------ccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceee- Q lcl|NC_011801. 1 MAFLS---NLFKRQ-----------KMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVT- 65 (386) Q Consensus 1 Mg~~~---~l~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~- 65 (386) +-+.+ ++..+. .-..+.+++....................-+.++-...+|+..++-+-+-|+.+ T Consensus 34 e~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~G~p~~~~ 113 (483) T protein:vir:12 34 ETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFK 113 (483) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHhhhhcccCceec Confidence 11100 000000 000000000000000000000000000001123445567777777776677654 Q ss_pred -cchhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCC--ceeEEEEecc Q lcl|NC_011801. 66 -NAQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYG--KDLTYTVHFD 142 (386) Q Consensus 66 -~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~~~~~~~~~~ 142 (386) .+.+....|..--+.. .......+..+...+|.||..+..+.+|.+ .+..++|..+.+..+... ........+. T Consensus 114 ~~d~~~~~~l~~~~~n~--~~~~~~~~~~~~~~~G~~y~~v~~d~d~~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~ 190 (483) T protein:vir:12 114 HTDDEVVKRIDEVLGNR--FDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYK 190 (483) T ss_pred cCChHHHHHHHHHHhcc--HHHHHHHHHHHHhhCCeEEEEEEEcCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEE Confidence 3344444442211112 234455677888999999999999888876 577899999888776432 2221111111 Q ss_pred CcccceeEEEcccceeeeccccc------------------cCcc---------cccccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 143 DSKRSGDFLYDSSEVIHFRCTVS------------------GESD---------TQYMGIPPIDSLLNEIEVQDLSSKLA 195 (386) Q Consensus 143 ~~~~~~~~~~~~~~vih~~~~~~------------------~~~~---------~~~~G~s~~~~~~~~i~~~~~~~~~~ 195 (386) .........+.+..+.|+.+... .++. +...|.|.+..+...++....+.... T Consensus 191 ~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~ 270 (483) T protein:vir:12 191 LENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDL 270 (483) T ss_pred eecceEEEEEecCeEEEEEEeCCeeeecccccccccccccccCCCCccceEEecCCCCCCCchhhHHHHHHHHHHHHHHH Confidence 11111112233333333221100 0000 12357888888888888777776666 Q ss_pred HHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 196 ISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKA 275 (386) Q Consensus 196 ~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~ 275 (386) .+.++..+.|..+++ +.. .+..+.....+. ..+++.++++.+++.+.....+..+....+...+.|+.. T Consensus 271 ~~~~~~~~~~~lv~~--g~~--~~~~~~~~~~~~-------~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~ 339 (483) T protein:vir:12 271 SNTFKDSNELTYVLT--NYD--DQELPEFKRLLR-------YYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLF 339 (483) T ss_pred HHHHHHhcCceeeee--cCC--cccchhHHHhhh-------hccccccCCCCcceEEeecCCHHHHHHHHHHHHHHHHHH Confidence 666677777766554 221 222222222222 123455555556655555555666778888888999999 Q ss_pred hCCCHHHhcCCcCcccHHHH--------------HHHHHHHHHHHHHHHHHHHHHHhh-hhhhhhcchhhhccCHHHHHH Q lcl|NC_011801. 276 FGIPADYLSGKQDAQSNITM--------------IRAFYQSSLSIYIKPIESELSQKL-GTDVKLDIASAIDSDNSELIN 340 (386) Q Consensus 276 ~gvp~~~l~~~~~~~~~~~~--------------~~~~~~~~l~P~~~~ie~~l~~~l-~~~~~fd~~~~l~~d~~~~~~ 340 (386) -++|..-.+..+ ++-+..+ .+..+...+..+++.+...+..+. ...+++.+..-+..|..+.++ T Consensus 340 s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~ 418 (483) T protein:vir:12 340 GQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQ 418 (483) T ss_pred hCCCCCCccccc-cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccceeeEEeCCCCCCCHHHHHH Confidence 999864332111 1111111 122344444444444444333221 123455566667788999999 Q ss_pred HHHHHHhCCCcCHHHHHHHhccCCcCCCCC-----------------CCccc---cccccCCCCCC Q lcl|NC_011801. 341 NVQKLASAGVLAPIQAQKLLKNRGVFPELD-----------------LDEGT---NLLDNTKNIND 386 (386) Q Consensus 341 ~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~-----------------~~~~~---~~~~~~~~~~~ 386 (386) .+.++ .|+++...+.++++.-+ +|... .+++. ..-.-..++.+ T Consensus 419 ~~~kl--~GiiS~et~~~~~~~v~-d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~e 481 (483) T protein:vir:12 419 TAQQS--MGIVSHETVLENHPFVE-DLQAELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNNKE 481 (483) T ss_pred HHHHH--hccCchHHHHHhCCCCC-CHHHHHHHHHHHHHHHHhhcccccccccCCcccCCCCCccc Confidence 98888 47888777776654211 00000 00000 00000111111 No 151 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=98.57 E-value=2.7e-07 Score=56.58 Aligned_cols=371 Identities=11% Similarity=0.072 Sum_probs=172.3 Q ss_pred CchhhhhccccccCCccc-------------h-hhhh-hccccc-cc----------Cccccc----HHHHhccHHHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSS-------------P-VWIL-NQGQPV-SI----------KPKAIT----SAIALKNSDVYAV 50 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~-------------~-~~~~-~~~~~~-~~----------~~~~i~----~~~a~~~~~v~~~ 50 (386) |.+...+........... + .... ..+... .+ .+.... ...-+.++-...+ T Consensus 13 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~~~i 92 (503) T protein:vir:59 13 EELNEIIVESAKEIAEPDTTMIQKLIDEHNPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKTNNRTSHAWHKLF 92 (503) T ss_pred HhHHHhhhhhhhhccchhHHHHHHHHHhhcHHHHHHHHHHhccccchhhccchhcccccccccccccccceeecchHHHH Confidence 332221111100000000 0 0000 000000 00 000000 0000123345567 Q ss_pred HHHHHHhhccCceee--cchhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEee Q lcl|NC_011801. 51 ISRVSSDIAGCRFVT--NAQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVAL 128 (386) Q Consensus 51 v~~ia~~ia~~p~~~--~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~ 128 (386) |+..++-+-+-|+.+ .+....+.|..--+. ........+..+...+|.||+.+..+.+|++ .+..++|..+.+.. T Consensus 93 vd~~~~yl~g~~~~~~~~d~~~~~~l~~~~~n--~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~-~i~~~~p~~~~~i~ 169 (503) T protein:vir:59 93 VDQKTQYLVGEPVTFTSDNKTLLEYVNELADD--DFDDILNETVKNMSNKGIEYWHPFVDEEGEF-DYVIFPAEEMIVVY 169 (503) T ss_pred HHHHHhhhhcCCeeeccCcHHHHHHHHHHHhc--CHHHHHHHHHHHHhhCCeEEEEEeecCCCce-EEEEEccceeEEEE Confidence 777777777777653 333333333221111 3445666788889999999999999888876 57889999988876 Q ss_pred cCCC-c-eeEE--EEeccCcc---cceeEEEcccceeeeccccc----------------------c-----Cc----cc Q lcl|NC_011801. 129 DDYG-K-DLTY--TVHFDDSK---RSGDFLYDSSEVIHFRCTVS----------------------G-----ES----DT 170 (386) Q Consensus 129 ~~~~-~-~~~~--~~~~~~~~---~~~~~~~~~~~vih~~~~~~----------------------~-----~~----~~ 170 (386) +... . .... .+...... ......+.+..+.+++.... . .| .+ T Consensus 170 d~~~~~~~~~~ir~~~~~~~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~~~n 249 (503) T protein:vir:59 170 KDNTRRDILFALRYYSYKGIMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQAIGWGRVPIIPFKN 249 (503) T ss_pred eCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEEcCCcccccccccccccccceeecceeccCCccceEEecC Confidence 6532 1 1111 11111100 00111233333333221100 0 00 01 Q ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceee Q lcl|NC_011801. 171 QYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVE 250 (386) Q Consensus 171 ~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~ 250 (386) ...|.|.+..+...++....+.....+.++..+.|-.+++ +....+ .+.....+. .++++.++++.+++ T Consensus 250 n~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~--g~~~~~--~~~~~~~~~-------~~~~~~~~~~~~~~ 318 (503) T protein:vir:59 250 NEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLK--NYDGEN--PKEFTANLR-------YHSVIKVSGDGGVD 318 (503) T ss_pred CCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEee--cCCccc--cchhhhhhh-------cccceeccCCCcce Confidence 2358888888888888877776666666777777766554 322111 111222222 12355566655555 Q ss_pred eccCChhhHHHHHHHHHHHHHHHHHhCCCH---HHhcCCcCcccH----------HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 251 TTNISPNVTEFLQNVSFSQDQIAKAFGIPA---DYLSGKQDAQSN----------ITMIRAFYQSSLSIYIKPIESELSQ 317 (386) Q Consensus 251 ~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~---~~l~~~~~~~~~----------~~~~~~~~~~~l~P~~~~ie~~l~~ 317 (386) .+........+....+...+.|...-++|. ...+...++... .+..+..+...|.-+++.+...++. T Consensus 319 ~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~ 398 (503) T protein:vir:59 319 TLRAEIPVDSAAKELERIQDELYKSAQAVDNSPETIGGGATGPALENLYALLDLKANMAERKIRAGLRLFFWFFAEYLRN 398 (503) T ss_pred eEeccCCHHHHHHHHHHHHHHHHHHhcccCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 554444444456666666666766666663 333221111110 1112334455555555555544443 Q ss_pred hhh------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCC-------------------CC Q lcl|NC_011801. 318 KLG------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELD-------------------LD 372 (386) Q Consensus 318 ~l~------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~-------------------~~ 372 (386) .-. ..+++.+..-+..|..+.++++.+++.+|+++...+.++++.-+ +|... .+ T Consensus 399 ~~~~~~~~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~GiiS~et~l~~l~~v~-d~~~E~~ri~~E~~~~~~~~~~~~~~ 477 (503) T protein:vir:59 399 TGKGDFNPDKELTMTFTRTRIQNDSEIVQSLVQGVTGGIMSKETAVARNPFVQ-DPEEELARIEEEMNQYAEMQGNLLDD 477 (503) T ss_pred ccCcccccccceeEEeCCCCCCCHHHHHHHHHHHHhCCCCchHHHHHhCCCCC-CHHHHHHHHHHHHHHHHhhhccccCc Confidence 221 12455666677889999999999999999988776666654211 00000 00 Q ss_pred cccc-ccccCCCCCC Q lcl|NC_011801. 373 EGTN-LLDNTKNIND 386 (386) Q Consensus 373 ~~~~-~~~~~~~~~~ 386 (386) +++. .-..+...++ T Consensus 478 ~~~~~~~~~~~~~~~ 492 (503) T protein:vir:59 478 EGGDDDLEEDDPNAG 492 (503) T ss_pred cCCCCCCCcCCCCCC Confidence 0000 0000000000 No 152 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=98.55 E-value=3e-07 Score=56.38 Aligned_cols=370 Identities=14% Similarity=0.059 Sum_probs=165.7 Q ss_pred CchhhhhccccccCCcc-chhhhhhccc-ccccCcccccH-----HHHhccHHHHHHHHHHHHhhccCceeecc---h-- Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGS-SPVWILNQGQ-PVSIKPKAITS-----AIALKNSDVYAVISRVSSDIAGCRFVTNA---Q-- 68 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~-~~~~~~~~~~-~~~~~~~~i~~-----~~a~~~~~v~~~v~~ia~~ia~~p~~~~~---~-- 68 (386) +-++.++.+........ .-......+. .....+..... ..-..+.-...+|+..++.+-+-|+.+.. . T Consensus 7 ~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~~~d~~~ 86 (456) T protein:vir:79 7 AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADSDL 86 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhccCCeecCCCCCccH Confidence 22222221111000000 0000000000 00000011100 00011223456777777777777886532 1 Q ss_pred --hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCce-e-EE-EEe-cc Q lcl|NC_011801. 69 --PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKD-L-TY-TVH-FD 142 (386) Q Consensus 69 --~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~-~-~~-~~~-~~ 142 (386) .+.+++.. |. ...+...+..+++.+|.||+.+-.+.+|.+ .+..++|..+.+..+..... + .. .+. .. T Consensus 87 ~~~~~~~~~~--n~---~d~~~~~~~~~a~~~G~a~~~~~~~edg~~-~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~ 160 (456) T protein:vir:79 87 ALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWWRDL 160 (456) T ss_pred HHHHHHHHHh--cC---hhHHHHHHHHHHhhcCeeEEEEeeCCCCce-EEEEeccceeEEEEcCCCCCceEEEEEEEEec Confidence 23344433 32 235667788999999999999988989987 57888899888776643221 1 10 000 00 Q ss_pred CcccceeEEEc-------------------------------ccceeeeccccccCcccccccccHHHHHHHHHHHHHHH Q lcl|NC_011801. 143 DSKRSGDFLYD-------------------------------SSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLS 191 (386) Q Consensus 143 ~~~~~~~~~~~-------------------------------~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~ 191 (386) +........+. ..++-|.-...+-.......|+|.+......++....+ T Consensus 161 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~~~gd~e~v~~liD~~~~~ 240 (456) T protein:vir:79 161 DAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRA 240 (456) T ss_pred CCceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeecccccCCCCceeEEEecCCCCCchhhhhHHHHHHHHHH Confidence 00000000000 00111110000000011124677777666655554443 Q ss_pred HHHHHHHHhccCCCceEEeeCC--CCCCHHHHHH--HHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHH Q lcl|NC_011801. 192 SKLAISTLRHAIKPSIFIKVPN--ATLGKEAKEN--TRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSF 267 (386) Q Consensus 192 ~~~~~~~~~ng~~~~~~l~~~~--~~~~~~~~~~--~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~ 267 (386) .--.....+..+.|..++...+ ....++.-+. ....|... .+.++.++++.++.++....- ..+.+.++. T Consensus 241 ~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i~~~~~~~~~-----~~~~~~~~~~~~~~q~~~~~~-~~~~~~l~~ 314 (456) T protein:vir:79 241 ELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAIDYASIFEAA-----PGALWELPPGVDIWESQTNDF-TPMLSAIKE 314 (456) T ss_pred HHHHHHHHHHHhhHHHHHhcCCcccccccccccccchhhhhhhh-----ccccccCCCCcceeeecccCh-HHHHHHHHH Confidence 3222222233333333332110 0000110011 11122211 234667788888877654322 337889999 Q ss_pred HHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhh-----------h----hhhhhcchhhhc Q lcl|NC_011801. 268 SQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYIKPIESELSQKL-----------G----TDVKLDIASAID 332 (386) Q Consensus 268 ~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l-----------~----~~~~fd~~~~l~ 332 (386) .+.+|+..=++|+..++........+ +.+ +....+.-.+...+..|...| + ..++..+.+... T Consensus 315 ~i~~i~~~t~~p~~~~~~~~~N~Sg~-Al~-~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~g~~~~~~i~v~w~~~~~ 392 (456) T protein:vir:79 315 HIRQLSSATKTPLPMLMPDSANQSAE-GAH-NIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDR 392 (456) T ss_pred HHHHHHhhcCCChhHhcccccCcHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceEEeCCCCC Confidence 99999999999999997542212111 111 111112222222222222211 1 123344445556 Q ss_pred cCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcC-CCCCC----C----ccccccccCCCCCC Q lcl|NC_011801. 333 SDNSELINNVQKLASAGVLAPIQAQKLLKNRGVF-PELDL----D----EGTNLLDNTKNIND 386 (386) Q Consensus 333 ~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~-p~~~~----~----~~~~~~~~~~~~~~ 386 (386) .+..+.++++.++++.|+++...+++++|..+.. +..+. . .++++ .+....| T Consensus 393 ~s~~~~ada~~kl~~~G~~~~~~~~~~lg~~~~~i~~~e~~r~~~e~~~~~~~~--~~~~~~~ 453 (456) T protein:vir:79 393 VTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNP--VQRPQED 453 (456) T ss_pred cCHHHHHHHHHHHHhcCCChHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhhhH--hhcCCCC Confidence 6778899999999999999888888877754320 00010 0 11122 2334444 No 153 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=98.55 E-value=3.1e-07 Score=56.29 Aligned_cols=363 Identities=13% Similarity=0.088 Sum_probs=157.1 Q ss_pred CchhhhhccccccCCccchhhhhhccc---ccccCcccccHH---HHhccHHHHHHHHHHHHhhccCceeecch-hHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQ---PVSIKPKAITSA---IALKNSDVYAVISRVSSDIAGCRFVTNAQ-PITDV 73 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~---~~~~~~~~i~~~---~a~~~~~v~~~v~~ia~~ia~~p~~~~~~-~~~~~ 73 (386) .-++.+|.+........... ....+. .....+..+..+ .-..+.-..-+|+..++.+--..+.+.+. .+..+ T Consensus 6 ~~~i~~l~~~~~~~~~r~~~-l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~g~~~~d~~~l~~i 84 (441) T protein:vir:80 6 LALIEGMYDRIQRLSSWHCC-IEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVDALEERLDWLGWTNGDGYGLDGV 84 (441) T ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHHHHHhhhccccccCCChHHHHHH Confidence 11222221111000000000 000000 000001111110 00111222344554444432223333332 34444 Q ss_pred HhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCcee--EEEEec-cCcccceeE Q lcl|NC_011801. 74 LNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDL--TYTVHF-DDSKRSGDF 150 (386) Q Consensus 74 l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~--~~~~~~-~~~~~~~~~ 150 (386) +.. | ........+..+++.+|.||+.+.++.+|.+ .+..++|..+.+..+...... .+.+.. ......... T Consensus 85 ~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~~~~~~~~ 158 (441) T protein:vir:80 85 YAA--N---RLATASCDVHLDALIFGLSFVAIIPHGDGTV-SVRPQSPKNCTGKFSADGSRLDAGLVVQQTCDPEVVEAE 158 (441) T ss_pred HHh--c---CHHHHHHHHHHHHhhcCeeEEEEEeCCCCce-EEEEEccceEEEEEeCCCCceeEEEEEEEEecCceEEEE Confidence 432 3 2456777888999999999999999999877 578899999888776543211 111111 000000001 Q ss_pred EEcc--------------------------cceeeeccccccCcccccccccHHHH-HHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_011801. 151 LYDS--------------------------SEVIHFRCTVSGESDTQYMGIPPIDS-LLNEIEVQDLSSKLAISTLRHAI 203 (386) Q Consensus 151 ~~~~--------------------------~~vih~~~~~~~~~~~~~~G~s~~~~-~~~~i~~~~~~~~~~~~~~~ng~ 203 (386) .+.+ =+|+||. +. ...++.+|.|.+.. +...++.......-.....+..+ T Consensus 159 vy~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~--n~-~~~~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~ 235 (441) T protein:vir:80 159 LLLPDVIVQVERRGSREWVEVDRIPNVLGAVPLVPIV--NR-RRTSRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYA 235 (441) T ss_pred EEecCeEEEEEEcCCcceeeccccccCCCceeEEEee--cc-ccCCccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhc Confidence 1111 1234433 21 12344568776543 33444444444333334444455 Q ss_pred CCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCC-----CceeeeccCChhhHHHHHHHHHHHHHHHHHhCC Q lcl|NC_011801. 204 KPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQ-----SADVETTNISPNVTEFLQNVSFSQDQIAKAFGI 278 (386) Q Consensus 204 ~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~-----g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gv 278 (386) .|..++. +..++++..+..+.. .++++.++. +.++.++..... ..+++..+..+..|+..-++ T Consensus 236 ~~~~~i~--G~~~~~~~~~~~~~~---------~~~i~~~~~~~~~~~~~~~~~~~~~~-~~~~~~l~~~i~~~~~~~~~ 303 (441) T protein:vir:80 236 YPQRWVT--GVSADEFSQPGWVLS---------MASVWAVDKDDDGDTPNVGSFPVNSP-TPYSDQMRLLAQLTAGEAAV 303 (441) T ss_pred Cceeeee--cCCccccccchhhhc---------ccccccCCCCCCCCcceeEecCccch-HHHHHHHHHHHHHHhcccCC Confidence 5655553 333333322221111 123444332 234443332222 23678888999999999999 Q ss_pred CHHHhcCCcCcccHHHHHH--------------HHHHHHHHHHHHHHHHHHHHhhh-----hhhhhcchhhhccCHHHHH Q lcl|NC_011801. 279 PADYLSGKQDAQSNITMIR--------------AFYQSSLSIYIKPIESELSQKLG-----TDVKLDIASAIDSDNSELI 339 (386) Q Consensus 279 p~~~l~~~~~~~~~~~~~~--------------~~~~~~l~P~~~~ie~~l~~~l~-----~~~~fd~~~~l~~d~~~~~ 339 (386) |+..++..+....+.++.+ ..+...|.-.++.+...++..-. ..+++.+...+..+..+.+ T Consensus 304 p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~~i~~~f~~~~~~~~~e~a 383 (441) T protein:vir:80 304 PERYFGFITSNPPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKALDSRVDEADFFGDVGLRWRDASTPTRAATA 383 (441) T ss_pred CHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccceeeeEEeCCCCCcCHHHHH Confidence 9999976553222222211 12222222222222222211111 1234555556677888999 Q ss_pred HHHHHHHhCCCcC--HHHHHHHhccCCcCCCCC--------CCccccccccCCCCCC Q lcl|NC_011801. 340 NNVQKLASAGVLA--PIQAQKLLKNRGVFPELD--------LDEGTNLLDNTKNIND 386 (386) Q Consensus 340 ~~~~~~~~~g~~t--~nE~R~~lg~~p~~p~~~--------~~~~~~~~~~~~~~~~ 386 (386) +++.+++.+|+.. ...+++.+|.-+. +... .+.............| T Consensus 384 d~~~kl~~~g~~~~s~~~~~~~l~~~~~-e~~~~~~e~~e~~~~~~~~~~~~~~~~~ 439 (441) T protein:vir:80 384 DAVTKLVGAGILPADSRTVLEMLGLDDV-QVEAVMRHRAESSDPLAVLAGAISRQTN 439 (441) T ss_pred HHHHHHHhcCcccccHHHHHHhCCCCHH-HHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 9999999988753 3345555543210 0000 0001111122222222 No 154 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=98.53 E-value=3.5e-07 Score=56.00 Aligned_cols=341 Identities=11% Similarity=0.074 Sum_probs=155.3 Q ss_pred CchhhhhccccccCCccchhhhhhcc---cccccCcccccHHHH--hc--cHHHHHHHHHHHHhhccCceeecchhHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQG---QPVSIKPKAITSAIA--LK--NSDVYAVISRVSSDIAGCRFVTNAQPITDV 73 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~---~~~~~~~~~i~~~~a--~~--~~~v~~~v~~ia~~ia~~p~~~~~~~~~~~ 73 (386) ..++.+|.++-......-.. ....+ ......+..+..+.. ++ ..-..-+|+-+++.+.=-.|...+..+.++ T Consensus 3 ~~~i~~L~~~~~~~~~r~~~-~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~~~d~~l~~i 81 (409) T protein:vir:16 3 EKGIGYLRFKLSVHKRRAEM-RYEQYAMKHVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFREFENDDFTVNEI 81 (409) T ss_pred HHHHHHHHHHHHHHhHHHHH-HHHHHhccCchhhcchhhhHHHHHHHhhhcChhHHHHHHhHhhcccccccCcchHHHHH Confidence 22233332221111000000 00000 011011111111110 01 112223444444433222344444445555 Q ss_pred HhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEE--EEeccCcccc--ee Q lcl|NC_011801. 74 LNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTY--TVHFDDSKRS--GD 149 (386) Q Consensus 74 l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~--~~~~~~~~~~--~~ 149 (386) +.. |... .....+..+.+.+|.||+.+..+.+|.+ .+.+++|..+.+..|........ .+........ .. T Consensus 82 ~~~--N~ld---~~~~~~~~~al~yG~sf~~v~~~~dg~~-~i~~~sP~~~~~i~D~~~~~~~~a~~~~~~d~~~~~~~~ 155 (409) T protein:vir:16 82 FEE--NNPD---IFFDSTVLSALIASCSFTYISKGENDAV-RLQVIEATNATGIIDPITGLLTEGYAVLERDENNNVVLE 155 (409) T ss_pred HHh--cChh---HHHHHHHHHHHHhCceeEEEecCCCCce-EEEEEcccceEEEeecccccceeeeEEEEecCCCceEEE Confidence 532 3332 3555778888999999999999888875 67888888888776654333221 1111110000 00 Q ss_pred EEEcccc----------------------eeeeccccccCcccccccccHH----HHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_011801. 150 FLYDSSE----------------------VIHFRCTVSGESDTQYMGIPPI----DSLLNEIEVQDLSSKLAISTLRHAI 203 (386) Q Consensus 150 ~~~~~~~----------------------vih~~~~~~~~~~~~~~G~s~~----~~~~~~i~~~~~~~~~~~~~~~ng~ 203 (386) ..+.++. |++|. +... .++.+|.|.+ ..+...+.....-......|+ + T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~--n~~~-~~~~~G~seI~~~v~~l~da~~r~~~~~~~~~e~~---a 229 (409) T protein:vir:16 156 AHFLPDRTDYYYRDSRNNISIANPTGNPLLVPII--HRPD-AVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFY---S 229 (409) T ss_pred EEEecCcEEEEEecCccccceecCCCCcceEEec--cccc-ccccCCccccchhHHHHHHHHHHHHHHHHHHHHHh---c Confidence 1111221 33332 2222 2345787744 444444444444444455555 3 Q ss_pred CCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecC-----CCceeeeccCChhhHHHHHHHHHHHHHHHHHhCC Q lcl|NC_011801. 204 KPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLD-----QSADVETTNISPNVTEFLQNVSFSQDQIAKAFGI 278 (386) Q Consensus 204 ~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~-----~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gv 278 (386) .|..++..-+ .+.+..+.++.... +++.++ ++.++.++....-+ .+++.++..+.++|..=++ T Consensus 230 ~pqr~i~G~d--~d~~~~~~~~~~~~---------~i~~~~~d~~g~~~~v~q~~~~~l~-~~~~~l~~~~~~~a~~s~l 297 (409) T protein:vir:16 230 FPQKYVTGLS--DDAEPMETWKATVS---------SMLQFTKDEDGDKPTLGQFTQPSMS-PFTEQLRTAAAGFAGETGL 297 (409) T ss_pred ChhheeEecC--CCCCccchhhhhhh---------HhhccCCCCCCCCceEEecCCCChh-HHHHHHHHHHHHHhhhcCC Confidence 4555554221 12222223332222 344443 23456555443222 4899999999999999999 Q ss_pred CHHHhcCCcCcccHHHHHH---HHHHHHHHHHHHHHHHHHHHhh----------------hhhhhhcchhhhcc---CHH Q lcl|NC_011801. 279 PADYLSGKQDAQSNITMIR---AFYQSSLSIYIKPIESELSQKL----------------GTDVKLDIASAIDS---DNS 336 (386) Q Consensus 279 p~~~l~~~~~~~~~~~~~~---~~~~~~l~P~~~~ie~~l~~~l----------------~~~~~fd~~~~l~~---d~~ 336 (386) |+..+|.....-.+.++.+ .-+.....-..+.|.+.+.+.+ ...+++.+.+.... +.. T Consensus 298 P~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~v~W~~~~~~~~~s~a 377 (409) T protein:vir:16 298 TLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDVPYLREQFSKTKPKWEPLFEADASMLS 377 (409) T ss_pred CHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccchhhccceEEecCCCCcchhhHH Confidence 9999986542111111111 1111111112222222222211 01123334433323 356 Q ss_pred HHHHHHHHHHhCC--CcCHHHHHHHhccCCcC Q lcl|NC_011801. 337 ELINNVQKLASAG--VLAPIQAQKLLKNRGVF 366 (386) Q Consensus 337 ~~~~~~~~~~~~g--~~t~nE~R~~lg~~p~~ 366 (386) +.++++.|++++| ++..+-+++++|+...+ T Consensus 378 ~~aDa~~Kl~~a~~~~~~~~v~~~~~g~~~~d 409 (409) T protein:vir:16 378 LIGDGAIKLNQAIPEFINKDTIRDLTGIKGAE 409 (409) T ss_pred HHHHHHHHHHhhcccccchhHHHHhccCCCCC Confidence 6788899999886 34557789999865432 No 155 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=98.52 E-value=2.8e-07 Score=56.53 Aligned_cols=373 Identities=11% Similarity=0.061 Sum_probs=173.7 Q ss_pred Cchhhhh---cccc-------cc--------CCccchhhh-------hhcccccccCcccc----cHHHHhccHHHHHHH Q lcl|NC_011801. 1 MAFLSNL---FKRQ-------KM--------LSGSSPVWI-------LNQGQPVSIKPKAI----TSAIALKNSDVYAVI 51 (386) Q Consensus 1 Mg~~~~l---~~~~-------~~--------~~~~~~~~~-------~~~~~~~~~~~~~i----~~~~a~~~~~v~~~v 51 (386) ||+|+++ |++. +. ......... ...+.+........ ..+..........++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 9999865 3321 10 000000000 00000000000000 011111223334555 Q ss_pred HHHHHhhccCc--eeecchhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeec Q lcl|NC_011801. 52 SRVSSDIAGCR--FVTNAQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALD 129 (386) Q Consensus 52 ~~ia~~ia~~p--~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~ 129 (386) +.+|+-+.+-| +.+.+..+.+.|.+--. .-.....+...+.+.+..|.+++.+..+. |. +.+..++|+.+-+... T Consensus 81 ~~~A~lv~~e~~~i~~~d~~~~~~l~~il~-~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~-~~I~~v~ad~~~P~~~ 157 (500) T protein:vir:30 81 KKIASLVFNEQAEIKVDDDAANEFISETLK-NDRFNKNFERYLESCLALGGLAMRPYVDG-DK-VRVAFVQAPVFLPLQS 157 (500) T ss_pred HHHhhhhcCCcceEecCChHHHHHHHHHHh-hccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cc-eEEEEEcCCeeEEEEE Confidence 66666655433 45555555555543111 11123445566666777788887776654 33 3456667776654322 Q ss_pred C-CCce----------------eEEE---Eec-cCc----------------ccceeEEE-------------cc---cc Q lcl|NC_011801. 130 D-YGKD----------------LTYT---VHF-DDS----------------KRSGDFLY-------------DS---SE 156 (386) Q Consensus 130 ~-~~~~----------------~~~~---~~~-~~~----------------~~~~~~~~-------------~~---~~ 156 (386) . .+.. .+|. ++. .+. ..|..+.+ .. -. T Consensus 158 d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~p~ 237 (500) T protein:vir:30 158 NTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTRPI 237 (500) T ss_pred cCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEeccCCCcc Confidence 1 1110 0110 000 000 00111000 00 01 Q ss_pred eeeeccccccCc-ccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCC-------CHHHHHHHHHHH Q lcl|NC_011801. 157 VIHFRCTVSGES-DTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATL-------GKEAKENTRQSF 228 (386) Q Consensus 157 vih~~~~~~~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~-------~~~~~~~~k~~~ 228 (386) ..|++....++. .+...|+|....+...++..........+-|+.|.. ..++ +...+ +.+......-.. T Consensus 238 f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~-~i~v--~~~~l~~~~~~~~g~~~~~~~~d~ 314 (500) T protein:vir:30 238 FTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQR-RVAV--PESLTALTVRTTDGDVVPRPRFES 314 (500) T ss_pred EEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcc-eeee--chHHhcccCCCCCccccCCcccCC Confidence 123332221111 123469999999999999988887777777776544 3333 11111 000000000000 Q ss_pred H-HHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCccc-HHH------------ Q lcl|NC_011801. 229 E-EQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQS-NIT------------ 294 (386) Q Consensus 229 ~-~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~-~~~------------ 294 (386) . ..+..-+ .-.+++..++.++....+-++.+..+...++|+...|+++..++..+.+.. ..+ T Consensus 315 ~~~~~~~~~----~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~ 390 (500) T protein:vir:30 315 DQNVYIRMG----GRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMR 390 (500) T ss_pred CcceEEEcC----CCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHH Confidence 0 0010000 001233457777777778889999999999999999999999987654422 111 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHH-hh-------hhhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCc Q lcl|NC_011801. 295 -MIRAFYQSSLSIYIKPIESELSQ-KL-------GTDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGV 365 (386) Q Consensus 295 -~~~~~~~~~l~P~~~~ie~~l~~-~l-------~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~ 365 (386) ..+..++.+|..++..+.+..+. .+ ...+.+++++-+..|.++.++...+++.+|+|+.-+++..+ .+. T Consensus 391 ~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~--~g~ 468 (500) T protein:vir:30 391 NSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKV--LNV 468 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhc--CCC Confidence 11224444555555444432221 11 12345777777888999999999999999999988877543 111 Q ss_pred CCCCCC----CccccccccCCCCCC Q lcl|NC_011801. 366 FPELDL----DEGTNLLDNTKNIND 386 (386) Q Consensus 366 ~p~~~~----~~~~~~~~~~~~~~~ 386 (386) . +++. .+-.....+.-+..| T Consensus 469 ~-eeea~~~l~~i~~E~~~~~~~~~ 492 (500) T protein:vir:30 469 T-EEKAQEIAAEINTGIVDEINQQR 492 (500) T ss_pred C-HHHHHHHHHHHHHhccccCCCCC Confidence 1 1110 000011111222222 No 156 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=98.52 E-value=2.8e-07 Score=56.53 Aligned_cols=373 Identities=11% Similarity=0.061 Sum_probs=173.7 Q ss_pred Cchhhhh---cccc-------cc--------CCccchhhh-------hhcccccccCcccc----cHHHHhccHHHHHHH Q lcl|NC_011801. 1 MAFLSNL---FKRQ-------KM--------LSGSSPVWI-------LNQGQPVSIKPKAI----TSAIALKNSDVYAVI 51 (386) Q Consensus 1 Mg~~~~l---~~~~-------~~--------~~~~~~~~~-------~~~~~~~~~~~~~i----~~~~a~~~~~v~~~v 51 (386) ||+|+++ |++. +. ......... ...+.+........ ..+..........++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 9999865 3321 10 000000000 00000000000000 011111223334555 Q ss_pred HHHHHhhccCc--eeecchhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeec Q lcl|NC_011801. 52 SRVSSDIAGCR--FVTNAQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALD 129 (386) Q Consensus 52 ~~ia~~ia~~p--~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~ 129 (386) +.+|+-+.+-| +.+.+..+.+.|.+--. .-.....+...+.+.+..|.+++.+..+. |. +.+..++|+.+-+... T Consensus 81 ~~~A~lv~~e~~~i~~~d~~~~~~l~~il~-~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~-~~I~~v~ad~~~P~~~ 157 (500) T protein:vir:98 81 KKIASLVFNEQAEIKVDDDAANEFISETLK-NDRFNKNFERYLESCLALGGLAMRPYVDG-DK-VRVAFVQAPVFLPLQS 157 (500) T ss_pred HHHhhhhcCCcceEecCChHHHHHHHHHHh-hccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cc-eEEEEEcCCeeEEEEE Confidence 66666655433 45555555555543111 11123445566666777788887776654 33 3456667776654322 Q ss_pred C-CCce----------------eEEE---Eec-cCc----------------ccceeEEE-------------cc---cc Q lcl|NC_011801. 130 D-YGKD----------------LTYT---VHF-DDS----------------KRSGDFLY-------------DS---SE 156 (386) Q Consensus 130 ~-~~~~----------------~~~~---~~~-~~~----------------~~~~~~~~-------------~~---~~ 156 (386) . .+.. .+|. ++. .+. ..|..+.+ .. -. T Consensus 158 d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~p~ 237 (500) T protein:vir:98 158 NTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTRPI 237 (500) T ss_pred cCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEeccCCCcc Confidence 1 1110 0110 000 000 00111000 00 01 Q ss_pred eeeeccccccCc-ccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCC-------CHHHHHHHHHHH Q lcl|NC_011801. 157 VIHFRCTVSGES-DTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATL-------GKEAKENTRQSF 228 (386) Q Consensus 157 vih~~~~~~~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~-------~~~~~~~~k~~~ 228 (386) ..|++....++. .+...|+|....+...++..........+-|+.|.. ..++ +...+ +.+......-.. T Consensus 238 f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~-~i~v--~~~~l~~~~~~~~g~~~~~~~~d~ 314 (500) T protein:vir:98 238 FTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQR-RVAV--PESLTALTVRTTDGDVVPRPRFES 314 (500) T ss_pred EEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcc-eeee--chHHhcccCCCCCccccCCcccCC Confidence 123332221111 123469999999999999988887777777776544 3333 11111 000000000000 Q ss_pred H-HHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCccc-HHH------------ Q lcl|NC_011801. 229 E-EQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQS-NIT------------ 294 (386) Q Consensus 229 ~-~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~-~~~------------ 294 (386) . ..+..-+ .-.+++..++.++....+-++.+..+...++|+...|+++..++..+.+.. ..+ T Consensus 315 ~~~~~~~~~----~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~ 390 (500) T protein:vir:98 315 DQNVYIRMG----GRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMR 390 (500) T ss_pred CcceEEEcC----CCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHH Confidence 0 0010000 001233457777777778889999999999999999999999987654422 111 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHH-hh-------hhhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCc Q lcl|NC_011801. 295 -MIRAFYQSSLSIYIKPIESELSQ-KL-------GTDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGV 365 (386) Q Consensus 295 -~~~~~~~~~l~P~~~~ie~~l~~-~l-------~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~ 365 (386) ..+..++.+|..++..+.+..+. .+ ...+.+++++-+..|.++.++...+++.+|+|+.-+++..+ .+. T Consensus 391 ~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~--~g~ 468 (500) T protein:vir:98 391 NSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKV--LNV 468 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhc--CCC Confidence 11224444555555444432221 11 12345777777888999999999999999999988877543 111 Q ss_pred CCCCCC----CccccccccCCCCCC Q lcl|NC_011801. 366 FPELDL----DEGTNLLDNTKNIND 386 (386) Q Consensus 366 ~p~~~~----~~~~~~~~~~~~~~~ 386 (386) . +++. .+-.....+.-+..| T Consensus 469 ~-eeea~~~l~~i~~E~~~~~~~~~ 492 (500) T protein:vir:98 469 T-EEKAQEIAAEINTGIVDEINQQR 492 (500) T ss_pred C-HHHHHHHHHHHHHhccccCCCCC Confidence 1 1110 000011111222222 No 157 >protein:vir:102426 Length: 631 # NCBI annotation: gp11 # Family: family:all:2798 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655288;genbank:gi:109521851;genbank:GeneID:4157741 Probab=98.51 E-value=5.9e-08 Score=60.24 Aligned_cols=371 Identities=11% Similarity=0.102 Sum_probs=196.0 Q ss_pred Cchhh--hhccccccCCcc---------chh----h--hhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCce Q lcl|NC_011801. 1 MAFLS--NLFKRQKMLSGS---------SPV----W--ILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRF 63 (386) Q Consensus 1 Mg~~~--~l~~~~~~~~~~---------~~~----~--~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~ 63 (386) |.--. |+.+|++...+. .+. . ....+......++.--.+-+--++.++..+..|++.++++.+ T Consensus 1 ~~a~~~lr~~rrpkg~~~a~~r~L~aAs~~~~dpg~~~~~~~g~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~sr~rL 80 (631) T protein:vir:10 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (631) T ss_pred CCcccceeeeecCCCCCccchhhhhhhhccccchhhhhhhhcCCcccchhhHHHHHHHHhhhhHHHHhhhhhhhhceeee Confidence 54322 122333311110 000 0 000000000111111111222457888899999999999987 Q ss_pred eec--------------c-----hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEee-cCC---Cc------e- Q lcl|NC_011801. 64 VTN--------------A-----QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDR-DTN---GY------P- 113 (386) Q Consensus 64 ~~~--------------~-----~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~-~~~---g~------~- 113 (386) .+- + ..+..+...-+.--+...++++.+..++-.-|++|+.+.- ... +. + T Consensus 81 ~as~idpDtg~ptg~iee~~~~~~~v~~~~~~i~gG~lgQ~~llkrl~~~ltV~GE~wiv~l~~p~~~~~~~pd~~~r~~ 160 (631) T protein:vir:10 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (631) T ss_pred EeeeeccCCCCCccccccCCchhHHHHHHHHhcCCCcchHHHHHHHHHhheecccceEEEEEeccCcCCCCCcccccccc Confidence 532 1 1233444445777888899999999999999999998742 221 11 1 Q ss_pred EEEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 114 VRIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSK 193 (386) Q Consensus 114 ~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~ 193 (386) -+.+++....++.....++..+. ... ..........+ +.||.++ +.+....+--||+.++...+.-..-..+ T Consensus 161 ~~W~~vt~~ei~~~~~g~g~~v~--lp~----g~~h~~~~~~D-~l~RiW~-P~prr~~e~dSpvra~l~~l~Ei~~~t~ 232 (631) T protein:vir:10 161 QEWYAVSKEEIKKSNKGSGTNIV--LPT----GEEHEFVKGTD-IIFRVWI-PKPRKASEPDSPVRAVLDSIREIVRTTK 232 (631) T ss_pred cceeeccHHHHhcccCcccceee--cCC----CCccceecCCc-eEEEeeC-CCcccccCCcchhHHHHHHHHHHHHhhh Confidence 23444555555433222222211 111 11122233334 4466665 4566677899999999999988888887 Q ss_pred HHHHHHhccCCCceEEeeCCCC--------------------CCHHHHHHHHHHHH----HHhcc--cccC-cceecC-- Q lcl|NC_011801. 194 LAISTLRHAIKPSIFIKVPNAT--------------------LGKEAKENTRQSFE----EQTTG--ENAG-RAVVLD-- 244 (386) Q Consensus 194 ~~~~~~~ng~~~~~~l~~~~~~--------------------~~~~~~~~~k~~~~----~~~~~--~~~g-~~~vl~-- 244 (386) ...+..+.-....|++.+|.+. ...-...++.+.+= ..+.. +.+- -++++. T Consensus 233 ~i~aaakSRl~gnGvlflP~els~P~~~~~~~~~~g~~v~~~~g~pa~~~l~~~l~q~a~tai~De~S~aA~vPii~~~p 312 (631) T protein:vir:10 233 TIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVP 312 (631) T ss_pred HHHHHHHHHHhhCceeEeccccccCCCCCCCCCcCCccCCccccchhHHHHHHHHHHHHhhhhcCCCCccceeeeeEeec Confidence 7777777666666777665421 11123344444442 22322 2222 233332 Q ss_pred ----CCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccH--H-HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 245 ----QSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSN--I-TMIRAFYQSSLSIYIKPIESELSQ 317 (386) Q Consensus 245 ----~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~--~-~~~~~~~~~~l~P~~~~ie~~l~~ 317 (386) ++++.-.+.....+. -+++++..+.+||...-|||+.|-+.++.+|- . +-...-++-.|.|.+..|++++++ T Consensus 313 ~E~i~~i~hlkf~~ei~e~-aiktR~daI~RlA~glDi~pE~LLGlGsd~NHWsAWqI~dedVrlHI~P~l~lic~AlT~ 391 (631) T protein:vir:10 313 GEQIKDVKHIRFDNEITEV-AIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTD 391 (631) T ss_pred hHHhcCeeEEeecCchhHH-HHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchHHHHHHHHHHh Confidence 223333333333333 47899999999999999999987544322221 0 111223455699999999999998 Q ss_pred hhhhh-------------hhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCccc-----ccc- Q lcl|NC_011801. 318 KLGTD-------------VKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGT-----NLL- 378 (386) Q Consensus 318 ~l~~~-------------~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~-----~~~- 378 (386) .++.. +=||.+.+. .|+. +.+-+..++..|.+|-...|+.+|.. .++++... ..+ T Consensus 392 q~Lrp~Le~eGvDp~kYvvW~DaS~Lt-~dPd-r~deA~qa~drGAIt~eAlrk~lGf~----eDd~yd~~t~e~~~~~a 465 (631) T protein:vir:10 392 QILRVTLAREGIDPSKYVVWYDPSQLT-IDPD-KSDEAKFAYENGAINGEALRKYLGLG----DDAGYDFTTREGWVMWA 465 (631) T ss_pred hHHHHHHHHhCCCHHHhEeeecCcccc-cCCC-CcHHHHHHHHcCCcCHHHHHHHhcCc----hhcccCcCchHHHHHHH Confidence 76421 124555442 2221 23333458889999999999999832 22221100 000 Q ss_pred ----------------------------cc----CCCCCC Q lcl|NC_011801. 379 ----------------------------DN----TKNIND 386 (386) Q Consensus 379 ----------------------------~~----~~~~~~ 386 (386) .+ +.++.| T Consensus 466 ~~av~~dpaLip~lApl~~~~~~~v~~P~~~a~~~~g~ed 505 (631) T protein:vir:10 466 QDAVSKDPTLIPMLAPLIAGVLKQIEFPQQQAIDSGGNED 505 (631) T ss_pred HHHhhcccCcchhhHHHHHHHhhhccCCCCCCCCCCCCCc Confidence 00 000111 No 158 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=98.50 E-value=4.3e-07 Score=55.50 Aligned_cols=370 Identities=12% Similarity=0.086 Sum_probs=171.3 Q ss_pred Cchhhhh---ccccc-------cCC-------c-cchhh--------hhhcccccccCcccccH----HHHhccHHHHHH Q lcl|NC_011801. 1 MAFLSNL---FKRQK-------MLS-------G-SSPVW--------ILNQGQPVSIKPKAITS----AIALKNSDVYAV 50 (386) Q Consensus 1 Mg~~~~l---~~~~~-------~~~-------~-~~~~~--------~~~~~~~~~~~~~~i~~----~~a~~~~~v~~~ 50 (386) ||+|+|+ |++.- +.. . ..+.. ..-.+............ +..........+ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~~~~~~sln~~~~i 80 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQASDGIKKKRLKNTINMAKTA 80 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccCCCCccccceeecchHHHH Confidence 9998775 43310 000 0 00000 00011111111111100 000111233456 Q ss_pred HHHHHHhhccCc--eeecchhHH-HHHhc--cCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceE Q lcl|NC_011801. 51 ISRVSSDIAGCR--FVTNAQPIT-DVLNA--PLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVT 125 (386) Q Consensus 51 v~~ia~~ia~~p--~~~~~~~~~-~~l~~--~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~ 125 (386) ++..|+-+..=| +.+.++... ..|.. .-|. ...-++..+.+.+..|.+++.+..+..+ +.+..++|+.+- T Consensus 81 ~~~~A~lv~~e~~~i~v~~~~~~~e~l~~il~~n~---f~~~~~~~~e~a~a~G~~~~k~~~d~~~--~~i~~v~ad~~~ 155 (508) T protein:vir:15 81 ARRIASVVFNEKAEIHVKDNNEADKFLNDVLEDND---FKNKFEEALEKGVALGGFAMRPYIDGNH--IKIAWVRADQFY 155 (508) T ss_pred HHHHHhhhhCCCceEEeCCchHHHHHHHHHHHhcc---HHHHHHHHHHHHhhcCceEEEEEEeCCe--eEEEEEcCCeeE Confidence 666666654433 455343322 23322 1122 2334455667777788888877666432 345666666654 Q ss_pred Ee-ecCCCce----------------eEEE---Ee--------------cc-Cc--ccceeEEE---c------------ Q lcl|NC_011801. 126 VA-LDDYGKD----------------LTYT---VH--------------FD-DS--KRSGDFLY---D------------ 153 (386) Q Consensus 126 ~~-~~~~~~~----------------~~~~---~~--------------~~-~~--~~~~~~~~---~------------ 153 (386) +. .+..... ..|. ++ +. .. ..|.++.+ + T Consensus 156 P~~~d~~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~ 235 (508) T protein:vir:15 156 PLQSNTNDISEAAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKELAPQVTIS 235 (508) T ss_pred EEEEcCCCeEEEEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhcccccCCCcceEec Confidence 42 2221110 0110 00 00 00 00111110 0 Q ss_pred c---cceeeeccccccCc-ccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCC--CHHHHHHHHHH Q lcl|NC_011801. 154 S---SEVIHFRCTVSGES-DTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATL--GKEAKENTRQS 227 (386) Q Consensus 154 ~---~~vih~~~~~~~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~--~~~~~~~~k~~ 227 (386) . -...||+....++. .+...|+|.+..+...++..+.......+-|+.|. +..++. ...+ +++....+... T Consensus 236 g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~-~~i~v~--~~~l~~d~~~~~~~~~~ 312 (508) T protein:vir:15 236 GLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEIRLGQ-KHIAVQ--PGMLRFDDEHKPTFDTE 312 (508) T ss_pred CCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHHHhcc-cceeec--hHHhcCCCCCccccCCC Confidence 0 00123332111111 12347999999999999998888777777776544 445442 1111 11111000000 Q ss_pred HHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccH-HHH----------- Q lcl|NC_011801. 228 FEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSN-ITM----------- 295 (386) Q Consensus 228 ~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~-~~~----------- 295 (386) .+.+.+-+. --++|..++.++....+-++.+..+...+.|....|+++..++..+.+..+ .+. T Consensus 313 -~~~~~~~~~----~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei~s~~~~~~~t~ 387 (508) T protein:vir:15 313 -QNVYVGVLS----DDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVKTATEVVSNNSMTYQTR 387 (508) T ss_pred -CeeEEeccC----CCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCccccHHHHHHHHHHHHHHH Confidence 001110000 013345677777777778889999999999999999999999876554322 221 Q ss_pred --HHHHHHHHHHHHHHHHHHHHHHhh----h------------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHH Q lcl|NC_011801. 296 --IRAFYQSSLSIYIKPIESELSQKL----G------------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQ 357 (386) Q Consensus 296 --~~~~~~~~l~P~~~~ie~~l~~~l----~------------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R 357 (386) ....++.+|..++..|....+... + ..+.+++++-+-.|.++.++...+++.+|+|+.-+++ T Consensus 388 ~~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v~aGi~s~e~~i 467 (508) T protein:vir:15 388 SSYLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVLAIGALSKQTFL 467 (508) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHH Confidence 122333444444444333222110 0 1234677777888999999999999999999888776 Q ss_pred HHhccCCcCCCCCC-------------CccccccccCCCCCC Q lcl|NC_011801. 358 KLLKNRGVFPELDL-------------DEGTNLLDNTKNIND 386 (386) Q Consensus 358 ~~lg~~p~~p~~~~-------------~~~~~~~~~~~~~~~ 386 (386) ... .+.. +++. ...........+++| T Consensus 468 ~~~--~g~~-deea~~el~ri~~E~~~~~~~~~~~~~~~g~~ 506 (508) T protein:vir:15 468 QRN--YGMT-DEQAAEELAKIQSEAPTDTFEGGRSAILNGGD 506 (508) T ss_pred Hhc--CCCC-hHHHHHHHHHHHHhccccCccccccccCCCCC Confidence 543 1111 1100 001111222233333 No 159 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=98.49 E-value=4.5e-07 Score=55.39 Aligned_cols=368 Identities=10% Similarity=0.004 Sum_probs=170.0 Q ss_pred Cchhhhhcccccc----------------------------CCccchhhhhhcccccccCcccccHHHHhccHHHHHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKM----------------------------LSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVIS 52 (386) Q Consensus 1 Mg~~~~l~~~~~~----------------------------~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~ 52 (386) =..|+.+|..... ..+.+++....................-+.++-...+|+ T Consensus 29 ~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd 108 (492) T protein:vir:94 29 TEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVD 108 (492) T ss_pred hhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHH Confidence 1111111100000 000000000000000000000000001122344556777 Q ss_pred HHHHhhccCceee--cchhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecC Q lcl|NC_011801. 53 RVSSDIAGCRFVT--NAQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDD 130 (386) Q Consensus 53 ~ia~~ia~~p~~~--~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~ 130 (386) ..++-+-+-|+.+ .+.+..+.|..--+. ........+..+.+.+|.||..+..+.+|.+ .+..++|..+.+..+. T Consensus 109 ~~~~yl~G~p~~~~~~d~~~~~~l~~~~~n--~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~~~~~~p~~~~~v~d~ 185 (492) T protein:vir:94 109 QKVSYIVGKPIAFKHTDDEVVKRIDEVLGN--RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTD 185 (492) T ss_pred HHHhhhcccCceeccCchHHHHHHHHHHhc--cHHHHHHHHHHHHhhCCeEEEEEEecCCCce-EEEEEcccceEEEEcC Confidence 7777776777654 333443433221111 2334566788889999999999988888875 5777899998887653 Q ss_pred C--CceeEEEEeccCcccceeEEEcccceeeeccccc------------------cCcc---------cccccccHHHHH Q lcl|NC_011801. 131 Y--GKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVS------------------GESD---------TQYMGIPPIDSL 181 (386) Q Consensus 131 ~--~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~------------------~~~~---------~~~~G~s~~~~~ 181 (386) . .........+..........+....+.++.+... +++. +...|.|.+..+ T Consensus 186 ~~~~~~~a~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~sd~e~v 265 (492) T protein:vir:94 186 KEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMY 265 (492) T ss_pred CCCCceEEEEEEEeeccceeEEEEecCeEEEEEEecCeeeeccccccccccccccccCCCccceEEecCCCCCCCchHHH Confidence 2 2222211111111111112222333333221100 0000 112588889888 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHH Q lcl|NC_011801. 182 LNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEF 261 (386) Q Consensus 182 ~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~ 261 (386) ...++....+..-..+.++..+.|..+++.-+ .+....+...+.. .+++.++++.+++.+........+ T Consensus 266 ~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~----~~~~~~~~~~~~~-------~~~~~~~~~~~~~~l~~~~~~~~~ 334 (492) T protein:vir:94 266 KTLIDAYNRRLSDLSNTFKDSNELTYVLKNYD----DQELPEFKRLLRY-------YGAIKVSDNGGVDTIQVEVPVENS 334 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeeecCC----cccchhhHHHHhh-------ccceecCCCCcceeEeccCCHHHH Confidence 88888888777777777777777766654321 2222222322221 234555555555555444444556 Q ss_pred HHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHH--------------HHHHHHHHHHHHHHHHHHHHHhh-hhhhhhc Q lcl|NC_011801. 262 LQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMI--------------RAFYQSSLSIYIKPIESELSQKL-GTDVKLD 326 (386) Q Consensus 262 ~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~--------------~~~~~~~l~P~~~~ie~~l~~~l-~~~~~fd 326 (386) ....+...+.|+..-++|..-.+..+ ++-+..+. +..+...|...++.+...+..+. ...+++. T Consensus 335 ~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~i~v~ 413 (492) T protein:vir:94 335 KKYLDELYQKIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDIS 413 (492) T ss_pred HHHHHHHHHHHHHHhCCcCCCccccc-cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccceeeEE Confidence 67778888888888888853222111 11111111 22333344444444443333221 1234555 Q ss_pred chhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCC-------ccccccc-cCCCCCC Q lcl|NC_011801. 327 IASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLD-------EGTNLLD-NTKNIND 386 (386) Q Consensus 327 ~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~-------~~~~~~~-~~~~~~~ 386 (386) +..-+..|..+.++++.+++ |+++...+.++++.-+ +|..... +..+... ....+.| T Consensus 414 f~~~~p~~~~e~~~~~~kl~--giiS~et~~~~l~~v~-d~~~E~eri~~E~~~~~~~~~~~~~~~~~ 478 (492) T protein:vir:94 414 FNYNKVANTELQVQTAQQSM--GIVSHETVLENHPFVE-DLQAELERIEQEQMEYNKQLPNLDDGGAD 478 (492) T ss_pred ecCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCCC-CHHHHHHHHHHHHHHHHhhccccccccCC Confidence 66667788899999988885 7788777776654321 0111100 0000000 0000000 No 160 >protein:vir:4073 Length: 279 # NCBI annotation: minor structural protein # Family: family:all:11744 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043552;genbank:gi:9628686;genbank:GeneID:1261159 Probab=98.49 E-value=9e-10 Score=70.18 Aligned_cols=270 Identities=13% Similarity=0.144 Sum_probs=147.2 Q ss_pred hccHHHHHHHHHHHHhhccCceeecchh-------HHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceE Q lcl|NC_011801. 42 LKNSDVYAVISRVSSDIAGCRFVTNAQP-------ITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPV 114 (386) Q Consensus 42 ~~~~~v~~~v~~ia~~ia~~p~~~~~~~-------~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~ 114 (386) |. ...+.+.|++++=--+.+.+.. +.-+...--|...+-..-++.+..+.+..-+.| .+.. .|--. T Consensus 1 ~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~--~~~~~ 73 (279) T protein:vir:40 1 MS----LFNLSRRAEDVSFSTFTVQDPTTDLLLGKLLGLVSYFDNVDYSEASKLEDLFYWALQGKEVY-RVWY--GGFKY 73 (279) T ss_pred Cc----ccccchhhcccceeeeeecCcchhHHHHHHHHHHHHhhcccchhhhhhhhhhhhhhccceee-hhhh--hhHHH Confidence 00 1223445555554444444321 222222234666666666666665553333333 1111 11000 Q ss_pred EEEEEcCcceEEeecCCCceeEEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 115 RIEPVPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKL 194 (386) Q Consensus 115 ~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~ 194 (386) --..+.++.+.+. +. .+....+++|-.++..+. ++++|.-+- ....-+++. .+. T Consensus 74 ~~~~~~~d~fn~~-----------vr---~~~~~~vtVP~~Dv~Iie--------NPlv~v~~e-e~~kM~~la---~na 127 (279) T protein:vir:40 74 YAQRVNADQFNIV-----------VR---EPNRREVTIRTNDYEMLL--------NPFYGANPQ-RFGVMFGMA---SNG 127 (279) T ss_pred HHhhcCcchhhhh-----------ee---cCCcceeEeecchhhhhh--------cchheeccc-hhhHHHHHH---Hhh Confidence 0001111111100 00 112233556666665442 334555443 222223332 333 Q ss_pred HHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccc-cCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHH Q lcl|NC_011801. 195 AISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGEN-AGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIA 273 (386) Q Consensus 195 ~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~-~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia 273 (386) ..+-+.+.+..+++++++...-.++..++.+.+++++...++ -+++.+++.|.+++++..+-... ..+-.++.+...+ T Consensus 128 i~~KLD~~~qIk~fIKTd~d~glee~kekaR~rIk~mlalAk~~nGityid~~ddItQL~kDYSts-lk~die~lkS~l~ 206 (279) T protein:vir:40 128 IGRRLDSQAQIKIYWKTKVSSGLKEVWDRIRERLTQQQQLAREFNGVSVIGSDDDIKQIQPDYSGS-LQNDANLAIEIAL 206 (279) T ss_pred hhhhhcccceeeeEEecCcchhHHHHHHHHHHHHHHHHHHHHhcCCeeeecCCceeEeeccccccc-cHHHHHHHHHHHH Confidence 444457778888999987655568888999999998887765 36799999999999998765444 4677889999999 Q ss_pred HHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhcchhhhccCHHHHHHHHHHHHhCCCcCH Q lcl|NC_011801. 274 KAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYIKPIESELSQKLGTDVKLDIASAIDSDNSELINNVQKLASAGVLAP 353 (386) Q Consensus 274 ~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~ 353 (386) ..||||..+|. ++.++++..+|+..+|.|++++.+..|.. .-||.+. ...+-.++|.+.. T Consensus 207 Sq~GinekIL~----GsAtE~q~iAyy~rtVePILkQyek~liY----~~E~fv~------------y~ttta~gg~~~s 266 (279) T protein:vir:40 207 SEYGMPRELLY----GQSNEVTIIAFAIQKVLPLLKQHDKNIIF----NQENFVA------------YISTTAKGGAIES 266 (279) T ss_pred hhcCCchhhcc----ccCchhhhhhHHHhhHHHHHHHhcccccc----hhhhhhh------------hheecccCccccc Confidence 99999999996 56788999999999999999997765432 1122221 1111112333211 Q ss_pred -HHHHHHhccCCcCCC Q lcl|NC_011801. 354 -IQAQKLLKNRGVFPE 368 (386) Q Consensus 354 -nE~R~~lg~~p~~p~ 368 (386) .-.| +-+|+..+ T Consensus 267 ~~~~~---~~~~~~~~ 279 (279) T protein:vir:40 267 KSSKR---DSEPVGND 279 (279) T ss_pred ccccc---cCCCCCCC Confidence 1122 22333111 No 161 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=98.49 E-value=4.6e-07 Score=55.33 Aligned_cols=357 Identities=13% Similarity=0.121 Sum_probs=157.4 Q ss_pred CchhhhhccccccCCccc-hhhhhhcc-cccccCcccccHHH-HhccH---HHHHHHHHHHHhhccCceeecchhHHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSS-PVWILNQG-QPVSIKPKAITSAI-ALKNS---DVYAVISRVSSDIAGCRFVTNAQPITDVL 74 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~-~~~~~~~~-~~~~~~~~~i~~~~-a~~~~---~v~~~v~~ia~~ia~~p~~~~~~~~~~~l 74 (386) |++ ++|.+........- .....-.+ ......+..+..+. .+... -..-+|+-+++.+-=..+.+.+..+...+ T Consensus 4 ~~i-~~L~~~~~~~~~r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vd~~a~rl~~~Gf~~~d~~l~~~w 82 (422) T protein:vir:97 4 MGM-GYLRRKLALFKTGVDKRYRYYAMDDRDDTRSIVMPNNVREMYRSVLEWTAKGVDSLADRIIFREFTNDDFNAWEIF 82 (422) T ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHhcCCChhhcCccccHHHHHHHHhhcchhHHHHHHHHhccccceeeCCchhHHHHH Confidence 332 12211110000000 00000000 11111111122111 11111 11234444444322223444455566666 Q ss_pred hccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecC-CCceEEEEEEcCcceEEeecCCCceeEEEEe---ccCcccce-e Q lcl|NC_011801. 75 NAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDT-NGYPVRIEPVPNEKVTVALDDYGKDLTYTVH---FDDSKRSG-D 149 (386) Q Consensus 75 ~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~-~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~---~~~~~~~~-~ 149 (386) .. |.. ......+..+.+.+|.||+.+..+. +|.+ .+.+++|..+....|.....+...+. .+..+... . T Consensus 83 ~~--N~l---d~~~~~~~~~al~~G~sf~~v~~~~~~~~p-~i~~~sp~~~~~i~D~~~~~~~~a~~~~~~~~~~~~~~~ 156 (422) T protein:vir:97 83 KA--NNP---DIFFDTAIQSALIASCCFVYIMPGAEDGLP-KMQVIEASKATGILDPTTFLLTEGYAILESDSNGNPTLE 156 (422) T ss_pred Hh--cCh---HHHHHHHHHHHHHhcceeEEEeeCCCCCee-EEEEechhhEEEEEeCCCCcceeeEEEEEecCCCcEEEE Confidence 43 443 2345567788899999999998875 5554 68889999998887765443322111 11111100 1 Q ss_pred EEEcc---------------------cceeeeccccccCcccccccccHH-HHHHHHHHHHHHHHHHHHHHHhccCCCce Q lcl|NC_011801. 150 FLYDS---------------------SEVIHFRCTVSGESDTQYMGIPPI-DSLLNEIEVQDLSSKLAISTLRHAIKPSI 207 (386) Q Consensus 150 ~~~~~---------------------~~vih~~~~~~~~~~~~~~G~s~~-~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ 207 (386) ..++. =.|++|.+. .. ..+.+|.|.+ ..+...++......-......+-.+.|.. T Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~--~~-~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr 233 (422) T protein:vir:97 157 AYFTDKDIWYYPKKGKPYNIKNPTGHPLLVPIIHR--PD-AVRPFGRSRITKAGMYHQKAAKRTLERAEVTAEFYSFPQK 233 (422) T ss_pred EEEcCceEEEEcCCCccccccCCCCCcceEEeccc--CC-CccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhh Confidence 11111 123444322 11 2345787755 23333333333332222222233344544 Q ss_pred EEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCC-----CceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHH Q lcl|NC_011801. 208 FIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQ-----SADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADY 282 (386) Q Consensus 208 ~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~-----g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~ 282 (386) ++.. ...+....+.++.... +++.++. +.++.++..+.-+ .+++.++..+.++++.=++|+.. T Consensus 234 ~i~G--~d~d~~~~~~~~~~~~---------~i~~~~~de~~~~~~v~q~~~~~l~-~~~~~l~~~~~~~a~~s~lP~~~ 301 (422) T protein:vir:97 234 YVLG--MDPDAKPMEKWRATVS---------TLLEISKDEDGDKPTVGQFTTASMA-PFMEHLKMYASLFAGGSGLTLDD 301 (422) T ss_pred hhcc--cCcccccCchhhhhhh---------hhhccCCCCCCCcceeeecCCCChh-HHHHHHHHHHHHHhcccCCCHHH Confidence 4432 1122222223333222 3444432 3456555443222 38999999999999999999999 Q ss_pred hcCCcCcccHHHHHH---HHHHHHHHHHHHHHHHHHHHhh----------------hhhhhhcchhhhccCH---HHHHH Q lcl|NC_011801. 283 LSGKQDAQSNITMIR---AFYQSSLSIYIKPIESELSQKL----------------GTDVKLDIASAIDSDN---SELIN 340 (386) Q Consensus 283 l~~~~~~~~~~~~~~---~~~~~~l~P~~~~ie~~l~~~l----------------~~~~~fd~~~~l~~d~---~~~~~ 340 (386) +|.......+.++.+ .-+.....-..+.|.+.+.+.+ ...+++.+.+....|. .+.++ T Consensus 302 lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~~~w~p~~~~~~~s~a~~aD 381 (422) T protein:vir:97 302 LGFPSDNPSSVESIKAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDEFPYLRNQFMDTVIKWEPLFEADANMLTLVGD 381 (422) T ss_pred hccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccchhhccceEEEccCCCCChHHHHHHHH Confidence 986543211222211 1111112222222222222211 0112344444445563 44567 Q ss_pred HHHHHHhC--CCcCHHHHHHHhccCCcCCCCCCCccccccccCCCC Q lcl|NC_011801. 341 NVQKLASA--GVLAPIQAQKLLKNRGVFPELDLDEGTNLLDNTKNI 384 (386) Q Consensus 341 ~~~~~~~~--g~~t~nE~R~~lg~~p~~p~~~~~~~~~~~~~~~~~ 384 (386) ++.|++++ |++..+-+++++|..+.. .+.-+.-..+..+ T Consensus 382 a~~Kl~~a~~~~~~~~~~~~~lg~~~~~-----~~~~~~~~~~~d~ 422 (422) T protein:vir:97 382 GAIKLNQAIPGFMDADVIRDLTGVKGAD-----KPIPAITEVTTDG 422 (422) T ss_pred HHHHHHhhccccccHHHHHHHcCCCchh-----HHHHHHHhhhccC Confidence 77888887 788889999999864321 1111111111111 No 162 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=98.45 E-value=6.2e-07 Score=54.63 Aligned_cols=365 Identities=13% Similarity=0.083 Sum_probs=152.8 Q ss_pred Cch----------------hhhhccccccCCccchhhhhhccc---ccccCcccccHHH---HhccHHHHHHHHHHHHhh Q lcl|NC_011801. 1 MAF----------------LSNLFKRQKMLSGSSPVWILNQGQ---PVSIKPKAITSAI---ALKNSDVYAVISRVSSDI 58 (386) Q Consensus 1 Mg~----------------~~~l~~~~~~~~~~~~~~~~~~~~---~~~~~~~~i~~~~---a~~~~~v~~~v~~ia~~i 58 (386) |.- +.+.+...... -... ...+. .....+..+..+. ...+.-..-+|+.+++.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~r---l~~l-~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l 76 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTERTQD---LGDN-TAYYESERRPDAVGVTVPQQMQKLLAHVGYPRLYIDAIAARQ 76 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHHHHHH---HHHH-HHHHhccccchhcccccchhHHhhhhhcCcHHHHHHHHHhhh Confidence 111 11111110000 0000 00000 0000111111110 111122234555555544 Q ss_pred ccCceeecch-----hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCce-------EEEEEEcCcceEE Q lcl|NC_011801. 59 AGCRFVTNAQ-----PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYP-------VRIEPVPNEKVTV 126 (386) Q Consensus 59 a~~p~~~~~~-----~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~-------~~l~~l~~~~v~~ 126 (386) -...+.+-+. .+.+++.. |. .......+..+.+.+|.||+.+.++..|.. ..+.+++|..+.+ T Consensus 77 ~~~g~~~~~~~~~~~~l~~i~~~--N~---~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~ 151 (484) T protein:vir:77 77 ELEGFRLGGADKADEQLWDWWQA--ND---LDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVEPPTNLYA 151 (484) T ss_pred ccCceecCCcchhHHHHHHHHHh--cC---HhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEEEeccceeEE Confidence 3344544332 23333322 32 235677888999999999999988888754 2477888888887 Q ss_pred eecCCCceeEEEE--eccC--cccceeEEEcc-------------------------cceeeeccccccCcccccccccH Q lcl|NC_011801. 127 ALDDYGKDLTYTV--HFDD--SKRSGDFLYDS-------------------------SEVIHFRCTVSGESDTQYMGIPP 177 (386) Q Consensus 127 ~~~~~~~~~~~~~--~~~~--~~~~~~~~~~~-------------------------~~vih~~~~~~~~~~~~~~G~s~ 177 (386) ..+.......+.+ .... ........+.+ =+|+||.+ ....++..|.|. T Consensus 152 ~~D~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~N---~~~~~~~~G~s~ 228 (484) T protein:vir:77 152 QIDPRTRQVMRAIRAIEDEEGNEVIGATLYLPNNTVIWNREDGQWVQVANVAHNLEMVPVIPIPN---RTRLSDLYGTTE 228 (484) T ss_pred EecCCCCceEEEEEEEEeecCCcEEEEEEEecCeEEEEEecCCceEeeccccCCCCCcceEEecc---ccccCccCCccc Confidence 7665432221111 0000 00000001111 12344431 122344578776 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHH--HHHHHHHHHhcccccCcceecC-CCceeeecc Q lcl|NC_011801. 178 IDS-LLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKE--NTRQSFEEQTTGENAGRAVVLD-QSADVETTN 253 (386) Q Consensus 178 ~~~-~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~--~~k~~~~~~~~~~~~g~~~vl~-~g~~~~~~~ 253 (386) +.- +...++....+.-......+..+.|..++. +...++...+ .-...|+.. .+.++.++ ++.++.++. T Consensus 229 i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~--G~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~q~~ 301 (484) T protein:vir:77 229 ITPELRSVTDAAARTLMLMQATAELMGVPQRLLF--GVKGEELGVDPETGQTLFDAY-----LARILAFEDHESKAQQFS 301 (484) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHh--CCCcchhcccccccchhhhhh-----hhhhcccCCCCceeEeec Confidence 652 333334433333333333333344544443 2111111111 111112211 23455555 467777766 Q ss_pred CChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHH----HHHHHHHHHHhh--------h- Q lcl|NC_011801. 254 ISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIY----IKPIESELSQKL--------G- 320 (386) Q Consensus 254 ~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~----~~~ie~~l~~~l--------~- 320 (386) ...-+ .+++.++..+.+|+..=++|+..++.......+.++.+ +....|.-. .+.|...|.+.+ + T Consensus 302 ~~~~e-~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~-~~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~~~~~ 379 (484) T protein:vir:77 302 AAELR-NFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEAIR-SSESRLVKTVERKNKIFGGAWEQAMRVAYKVMNGG 379 (484) T ss_pred CCChH-HHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC Confidence 44333 37888999999999999999999975432211211211 111111111 112222222111 0 Q ss_pred ------hhhhhcchhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhccCCcC-------------------------- Q lcl|NC_011801. 321 ------TDVKLDIASAIDSDNSELINNVQKLASAG--VLAPIQAQKLLKNRGVF-------------------------- 366 (386) Q Consensus 321 ------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g--~~t~nE~R~~lg~~p~~-------------------------- 366 (386) ..+++.+.+....+..+.++.+.+++.+| +++..-+++++|.-+.+ T Consensus 380 ~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~s~et~~~~l~~~~~~~~e~~~~~~ee~~~~~~~~~~~~~~~ 459 (484) T protein:vir:77 380 DIPPEYYRMESIWRDPSTPTYAAKADAATKLYNNGQGVIPKERARIDMGYSITEREEMRKWDEEEQAQGLGLMGTMFGTD 459 (484) T ss_pred CcccccccceEEecCCCCCCHHHHHHHHHHHHhccCCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccc Confidence 12344555556678888999999998875 66554444444321100 Q ss_pred CCCCCCc-cccccccCCCCCC Q lcl|NC_011801. 367 PELDLDE-GTNLLDNTKNIND 386 (386) Q Consensus 367 p~~~~~~-~~~~~~~~~~~~~ 386 (386) +..+++. ..++-....+... T Consensus 460 ~~~~~~~~~~~~~~~~~~~~~ 480 (484) T protein:vir:77 460 PSGGGNPDNPETPEPQPNPAE 480 (484) T ss_pred ccCCCCCCCCCcccccCCCcc Confidence 0000000 0000000000000 No 163 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=98.44 E-value=6.5e-07 Score=54.52 Aligned_cols=368 Identities=10% Similarity=0.028 Sum_probs=167.1 Q ss_pred Cch----hhhhcccccc----------------------------CCccchhhhhhcccccccCcccccHHHHhccHHHH Q lcl|NC_011801. 1 MAF----LSNLFKRQKM----------------------------LSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVY 48 (386) Q Consensus 1 Mg~----~~~l~~~~~~----------------------------~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~ 48 (386) |-- |..+|+.... ..+.+++................-...-+.++-.. T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~ 84 (472) T protein:vir:93 5 QPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHA 84 (472) T ss_pred CCcchhhhhceeeecCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhccccccccccccccccchHH Confidence 000 0001100000 00000000000000000000000000011234455 Q ss_pred HHHHHHHHhhccCceee--cchhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEE Q lcl|NC_011801. 49 AVISRVSSDIAGCRFVT--NAQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTV 126 (386) Q Consensus 49 ~~v~~ia~~ia~~p~~~--~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~ 126 (386) .+|+..++-+-+-|+.+ .+.+....|..--+.. .......+..+.+.+|.||+.+..+.+|.+ .+..++|..+.+ T Consensus 85 ~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~~~n~--~~~~~~~~~~~~~~~G~~~~~v~~d~d~~~-~i~~~~p~~~~~ 161 (472) T protein:vir:93 85 NLVDQKVSYIVGKPIAFKHTDDEVVKRIDEVLGNR--FDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIP 161 (472) T ss_pred HHHHHHhhhhcccCeeeccCChHHHHHHHHHHhcc--HHHHHHHHHHHHhhcCeEEEEEEECCCCce-EEEEEcccceEE Confidence 67777777776667654 3334444442211111 334555677889999999999999888875 577789999888 Q ss_pred eecCC-CceeEEEE-eccCcccceeEEEcccceeeeccccc------------------cCc---------ccccccccH Q lcl|NC_011801. 127 ALDDY-GKDLTYTV-HFDDSKRSGDFLYDSSEVIHFRCTVS------------------GES---------DTQYMGIPP 177 (386) Q Consensus 127 ~~~~~-~~~~~~~~-~~~~~~~~~~~~~~~~~vih~~~~~~------------------~~~---------~~~~~G~s~ 177 (386) ..+.. ...+.+.+ .+..........+....+.++++... .++ .+...|.|. T Consensus 162 i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~s~ 241 (472) T protein:vir:93 162 IWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISD 241 (472) T ss_pred EEcCCCCCceEEEEEEEEeecceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecCCCCCCCc Confidence 87542 22111111 11100111111122222222211100 000 012368888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCChh Q lcl|NC_011801. 178 IDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPN 257 (386) Q Consensus 178 ~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~ 257 (386) +..+...++....+.....+.++..+.|..+++ +.. .+....+...+. ..+++.++++.++..+..... T Consensus 242 ~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~--g~~--~~~~~~~~~~~~-------~~~~~~~~~~~~~~~l~~~~~ 310 (472) T protein:vir:93 242 IFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLT--NYD--DQELPEFKRLLR-------YYGAIKVSDNGGVDTIQVEVP 310 (472) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHhcCceeEee--cCC--cccchhhHHHHh-------hccccccCCCCcceeEeecCC Confidence 988888888777776666777777777766654 221 222222222222 123555565556665555556 Q ss_pred hHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHH--------------HHHHHHHHHHHHHHHHHHHHHhh-hhh Q lcl|NC_011801. 258 VTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMI--------------RAFYQSSLSIYIKPIESELSQKL-GTD 322 (386) Q Consensus 258 d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~--------------~~~~~~~l~P~~~~ie~~l~~~l-~~~ 322 (386) +..+....+...+.|+..-++|..-.+..+ ++.+..+. +..+...+.-+++.+...+.... ... T Consensus 311 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~ 389 (472) T protein:vir:93 311 VENSKKYLDELYQKIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKD 389 (472) T ss_pred HHHHHHHHHHHHHHHHHHhCCCCCCccccc-cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccce Confidence 677788889999999999999864332211 11111111 12333333333333333332211 123 Q ss_pred hhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCC-----------------CCc-cccccccCCCC Q lcl|NC_011801. 323 VKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELD-----------------LDE-GTNLLDNTKNI 384 (386) Q Consensus 323 ~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~-----------------~~~-~~~~~~~~~~~ 384 (386) +++.+..-+..|..+.++++.++ .|+++..-+.++++.-. +|... .++ +.+.-...... T Consensus 390 i~v~f~~~~p~~~~~~~~~~~k~--~giis~et~l~~l~~~~-d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~ 466 (472) T protein:vir:93 390 VDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVE-DLQAELERIEQEQMEYNKQLPNLDDGGADGAQQQERS 466 (472) T ss_pred eeEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCC-CHHHHHHHHHHHHHHHHHhccCcCcccCCCCCCCCCC Confidence 44555666677888889988887 46777655555543210 00000 000 00000000000 Q ss_pred CC Q lcl|NC_011801. 385 ND 386 (386) Q Consensus 385 ~~ 386 (386) +| T Consensus 467 ~~ 468 (472) T protein:vir:93 467 NN 468 (472) T ss_pred Cc Confidence 00 No 164 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=98.41 E-value=7.8e-07 Score=54.06 Aligned_cols=370 Identities=11% Similarity=0.072 Sum_probs=153.4 Q ss_pred Cc-------hhhhhccccccCCcc-chhhhhhccc-ccccCcccccHHH---HhccHHHHHHHHHHHHhhccCceeecch Q lcl|NC_011801. 1 MA-------FLSNLFKRQKMLSGS-SPVWILNQGQ-PVSIKPKAITSAI---ALKNSDVYAVISRVSSDIAGCRFVTNAQ 68 (386) Q Consensus 1 Mg-------~~~~l~~~~~~~~~~-~~~~~~~~~~-~~~~~~~~i~~~~---a~~~~~v~~~v~~ia~~ia~~p~~~~~~ 68 (386) |. +++.|.......... ........+. .....+..+..+. ...+.-..-+|+.+++.+--.++.+-++ T Consensus 8 ~~~~~~~~~~~~~l~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~~ 87 (485) T protein:vir:10 8 QEEIEDPAIARDEMVSAFEDSTQNLKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGYPRLYVDSIAERQAVEGFRFGDA 87 (485) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcchhcCCCCChhhhhhhhhcCcHHHHHHHHHhhhcccceecCCC Confidence 11 011111100000000 0000000000 0000011111110 0011122345555555443334444332 Q ss_pred -h----HHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCc-------eEEEEEEcCcceEEeecCCCceeE Q lcl|NC_011801. 69 -P----ITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGY-------PVRIEPVPNEKVTVALDDYGKDLT 136 (386) Q Consensus 69 -~----~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~-------~~~l~~l~~~~v~~~~~~~~~~~~ 136 (386) + +.+++.. | ....+...+..+++.+|.||+.+.++..+. ...+.+++|..+.+..+....... T Consensus 88 ~~~~~~~~~i~~~--N---~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~~~D~~~~~~~ 162 (485) T protein:vir:10 88 DEADEELWQWWQA--N---NLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRVEPPTRMYAEIDPRIGRVS 162 (485) T ss_pred chhHHHHHHHHHh--c---CHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEEEccceeEEEEcCCCCcee Confidence 1 2333322 2 233567788899999999999998876542 235788889888877764332221 Q ss_pred --EEEeccCccc--ceeEEEccc-------------------------ceeeeccccccCcccccccccHHHH-HHHHHH Q lcl|NC_011801. 137 --YTVHFDDSKR--SGDFLYDSS-------------------------EVIHFRCTVSGESDTQYMGIPPIDS-LLNEIE 186 (386) Q Consensus 137 --~~~~~~~~~~--~~~~~~~~~-------------------------~vih~~~~~~~~~~~~~~G~s~~~~-~~~~i~ 186 (386) +.+....... .....+... +|+||.+. . ...+.+|.|.+.. +...++ T Consensus 163 ~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~--~-~~~~~~G~s~i~~~v~~liD 239 (485) T protein:vir:10 163 KAIRVAYDAEGNEIQAATLYTPNDIFGWYRVENEWQEWFNNPHGLGVVPVVPIPNR--T-RLSDLYGTSEITPELRSMTD 239 (485) T ss_pred EEEEEEEeeCCCeEEEEEEEeCCeEEEEEEcCCceEEeccccCCCCcccEEEeccc--c-ccCCCCCccchhHHHHHHHH Confidence 1111111000 001112222 23333321 1 2234578886543 333334 Q ss_pred HHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHH--HHHHHHHHHhcccccCcceecC-CCceeeeccCChhhHHHHH Q lcl|NC_011801. 187 VQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKE--NTRQSFEEQTTGENAGRAVVLD-QSADVETTNISPNVTEFLQ 263 (386) Q Consensus 187 ~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~--~~k~~~~~~~~~~~~g~~~vl~-~g~~~~~~~~~~~d~~~~e 263 (386) .......-.....+-.+.|..++. +........+ .-+..|+. ..+.++.++ ++.+|.++....-+ .+++ T Consensus 240 a~~~~~s~~~~~~~~~a~p~~~i~--G~~~~~~~~~~~~~~~~~~~-----~~~~i~~~~~~d~k~~q~~~~~~~-~~~~ 311 (485) T protein:vir:10 240 AAARILMLMQATAELMGVPQRLIF--GIKPEEIGVDPETGQTLFDA-----YLARILAFEDAEGKIQQFSAAELA-NFTN 311 (485) T ss_pred HHHHHHHHHHHHHHhhcchHHHHh--cCCcccccccccccchhhhh-----cccceeccCCCCceEEeecccchH-HHHH Confidence 333333222233333344544443 2111111000 01111221 123456654 56677666543322 3788 Q ss_pred HHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHH--------------HHHHHHHHHHHHHHHHHHHHHh----hhhhhhh Q lcl|NC_011801. 264 NVSFSQDQIAKAFGIPADYLSGKQDAQSNITMI--------------RAFYQSSLSIYIKPIESELSQK----LGTDVKL 325 (386) Q Consensus 264 ~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~--------------~~~~~~~l~P~~~~ie~~l~~~----l~~~~~f 325 (386) .++....+|+..=++|+..++.......+..+. +..+...+..+++.+....+.. -...+++ T Consensus 312 ~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v 391 (485) T protein:vir:10 312 ALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYRMMKGGDVPPDMLRMET 391 (485) T ss_pred HHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCcccceeeeE Confidence 889999999999999999997543211111111 1222233333332221111100 0013344 Q ss_pred cchhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhccCCcC------------------------CCCCCC----ccc Q lcl|NC_011801. 326 DIASAIDSDNSELINNVQKLASAG--VLAPIQAQKLLKNRGVF------------------------PELDLD----EGT 375 (386) Q Consensus 326 d~~~~l~~d~~~~~~~~~~~~~~g--~~t~nE~R~~lg~~p~~------------------------p~~~~~----~~~ 375 (386) .+.+.+..+..+.++++.+++.+| +++...+++++|..+-. +.++.+ +.. T Consensus 392 ~w~~~~~~~~~~~ada~~kl~~ag~~~~s~et~~~~lg~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~ 471 (485) T protein:vir:10 392 VWRDPSTPTYAAKADAASKLYNGGTGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLIGTMVDPNPTVPGSPSPAP 471 (485) T ss_pred EecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCCHhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCCccc Confidence 555556778889999999999866 66655555554432210 000000 000 Q ss_pred cccccC-CCCCC Q lcl|NC_011801. 376 NLLDNT-KNIND 386 (386) Q Consensus 376 ~~~~~~-~~~~~ 386 (386) .+-.+. .++.| T Consensus 472 ~~~~~~~~~~~~ 483 (485) T protein:vir:10 472 APKPAALESGGD 483 (485) T ss_pred cccCcCCCCCCC Confidence 111111 11111 No 165 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=98.39 E-value=8.9e-07 Score=53.75 Aligned_cols=372 Identities=12% Similarity=0.112 Sum_probs=173.0 Q ss_pred Cchhhhh---ccccc-------cCC-------cc-chhhh--------hhcccccccCcccccH----HHHhccHHHHHH Q lcl|NC_011801. 1 MAFLSNL---FKRQK-------MLS-------GS-SPVWI--------LNQGQPVSIKPKAITS----AIALKNSDVYAV 50 (386) Q Consensus 1 Mg~~~~l---~~~~~-------~~~-------~~-~~~~~--------~~~~~~~~~~~~~i~~----~~a~~~~~v~~~ 50 (386) ||+|+++ |++.- ... .. .+... +..+.+.......+.. +..........+ T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~~~~~~slnl~~~i 80 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNSYGDTQKHELQSVNVTKLA 80 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccccCCCccccceeecchHHHH Confidence 9999875 33310 000 00 01000 0001111111000000 001111233455 Q ss_pred HHHHHHhhccCc--eeecchhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEee Q lcl|NC_011801. 51 ISRVSSDIAGCR--FVTNAQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVAL 128 (386) Q Consensus 51 v~~ia~~ia~~p--~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~ 128 (386) ++.+|+-+.+=| +.+.+.+..+.|+.-- ..-.....++..+.+....|.+++.+..+. |. +.+..++|+.+-+.. T Consensus 81 ~~~~A~ll~~e~~~i~~~d~~~~e~l~~i~-~~n~f~~~~~~~~e~a~a~G~~~~k~~~D~-~~-~~i~~v~ad~~~P~~ 157 (505) T protein:vir:79 81 SAKLASLIFNEQCQVTVSDETANDFLDDVF-QQNDFYTTFEEKLEEWIALGSGCVRPYVDS-GK-IKLAWATADQVYPLQ 157 (505) T ss_pred HHHHHhhhcCCCceeecCChHHHHHHHHHH-HhccHHHHHHHHHHHHhhcCCeEEEEEEeC-Cc-eEEEEEcCCeeEEEE Confidence 666666655433 3455555555553311 111123445666777777888887776653 33 345556666655432 Q ss_pred -cCCCce-e---------------EEE-----------Eecc-------C-cccceeE---------------EEc---c Q lcl|NC_011801. 129 -DDYGKD-L---------------TYT-----------VHFD-------D-SKRSGDF---------------LYD---S 154 (386) Q Consensus 129 -~~~~~~-~---------------~~~-----------~~~~-------~-~~~~~~~---------------~~~---~ 154 (386) +.+... . +|. +.+. + ...|.++ .+. . T Consensus 158 ~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g~~~ 237 (505) T protein:vir:79 158 ADTNQVNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQYEGLEPQVKITGLKH 237 (505) T ss_pred EcCCCeEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhcccccccCcceeecCCCc Confidence 221110 0 010 0000 0 0001110 000 0 Q ss_pred cceeeeccccccCc-ccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEE-----eeCCCCCCHHHHHHHHHHH Q lcl|NC_011801. 155 SEVIHFRCTVSGES-DTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFI-----KVPNATLGKEAKENTRQSF 228 (386) Q Consensus 155 ~~vih~~~~~~~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l-----~~~~~~~~~~~~~~~k~~~ 228 (386) ....||+....++. .....|+|.+..+...++..........+-|+.|.. ..++ .... ....+....-...+ T Consensus 238 p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~-~i~v~~~~l~~~~-~~~~~~~~~~~~~f 315 (505) T protein:vir:79 238 PLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQR-RLIVPAEWLKTGS-SYGGQASETHPPMF 315 (505) T ss_pred ceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhccc-ceeechHHhcccC-CCCcccccccccCC Confidence 11223432111111 112379999999999999988887777777776654 3333 1110 00000000000001 Q ss_pred H---HHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccH-HHH--------- Q lcl|NC_011801. 229 E---EQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSN-ITM--------- 295 (386) Q Consensus 229 ~---~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~-~~~--------- 295 (386) . ..+.+ +..=+++..++.++....+.++.+..+...++|+...|+++..++..+.+..+ .+. T Consensus 316 d~~~~~y~~-----~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei~s~~~~l~~ 390 (505) T protein:vir:79 316 DPDETVYQA-----MYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQTATEVVTNNSQTYQ 390 (505) T ss_pred Cccceeeee-----ccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccchHHHHHHHHhHHHH Confidence 0 00111 00112234677777777788889999999999999999999999876554322 111 Q ss_pred ----HHHHHHHHHHHHHHHHHHHHHHhhh--------------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHH Q lcl|NC_011801. 296 ----IRAFYQSSLSIYIKPIESELSQKLG--------------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQ 357 (386) Q Consensus 296 ----~~~~~~~~l~P~~~~ie~~l~~~l~--------------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R 357 (386) .+..++.+|..++..+........+ ..+.+++++-+-.|.++.++...+++.+|+|++-+++ T Consensus 391 t~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~v~~Gi~s~e~~l 470 (505) T protein:vir:79 391 TRSSYITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQESKRAADLQAVQAQVMPKKQFL 470 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHH Confidence 1223344444444444433222111 1344677777888999999999999999999887776 Q ss_pred HHhccCCcCCCCCCC--------c-c-ccccccCCCCCC Q lcl|NC_011801. 358 KLLKNRGVFPELDLD--------E-G-TNLLDNTKNIND 386 (386) Q Consensus 358 ~~lg~~p~~p~~~~~--------~-~-~~~~~~~~~~~~ 386 (386) ... .+. .+++.. | . ..+- .+.-+.| T Consensus 471 ~~~--~~~-~eeea~~el~ri~~E~~~~~p~-~~~~gg~ 505 (505) T protein:vir:79 471 MRN--YGL-DEEEADEWLAQIDAENSTAEPE-FNQFGGD 505 (505) T ss_pred Hhc--CCC-ChHHHHHHHHHHHHhccccCCC-chhccCC Confidence 543 111 111110 0 0 0011 1111112 No 166 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=98.38 E-value=9.5e-07 Score=53.61 Aligned_cols=367 Identities=13% Similarity=0.093 Sum_probs=153.6 Q ss_pred Cch------------h-hhhccccccCCccchhhhhhcccccccCcccccHHH---HhccHHHHHHHHHHHHhhccCcee Q lcl|NC_011801. 1 MAF------------L-SNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAI---ALKNSDVYAVISRVSSDIAGCRFV 64 (386) Q Consensus 1 Mg~------------~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~---a~~~~~v~~~v~~ia~~ia~~p~~ 64 (386) +|+ | +.+-.+........... .........+..+..+. ...+.-..-+|+..++.+-..++. T Consensus 6 ~~~~~~~~~~~~~~~L~~~~~~~~~r~~~~~~YY--~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~ 83 (485) T protein:vir:24 6 PGQEEIADPAIARDEMVSAFEDQNQNLRSNTSYY--EAERRPEAIGVTVPVQMQSLLAHVGYPRLYVDSIAERQAVEGFR 83 (485) T ss_pred CCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHH--hccCchhhcCcccchhhhhhhhccchHHHHHHHHhhhhccCcee Confidence 221 1 11100000000000000 00000000111111110 011122234555555544444666 Q ss_pred ecchh-----HHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCce-------EEEEEEcCcceEEeecCCC Q lcl|NC_011801. 65 TNAQP-----ITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYP-------VRIEPVPNEKVTVALDDYG 132 (386) Q Consensus 65 ~~~~~-----~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~-------~~l~~l~~~~v~~~~~~~~ 132 (386) +.+.+ +.+++.. |. .......+..+++.+|.||+.+.++..+.. ..+.+++|..+.+..+... T Consensus 84 ~~~~~~~~~~l~~i~~~--N~---~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~i~D~~~ 158 (485) T protein:vir:24 84 LGDADEADEELWQWWQA--NN---LDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIRVEPPTRMYAEIDPRI 158 (485) T ss_pred cCCCchhHHHHHHHHHh--cC---hhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceEEEeccceeEEEeeCCc Confidence 54422 3344432 32 335678899999999999999988776543 2577888988887776543 Q ss_pred ceeE--EEEeccC--cccceeEEEccc-------------------------ceeeeccccccCcccccccccHHHH-HH Q lcl|NC_011801. 133 KDLT--YTVHFDD--SKRSGDFLYDSS-------------------------EVIHFRCTVSGESDTQYMGIPPIDS-LL 182 (386) Q Consensus 133 ~~~~--~~~~~~~--~~~~~~~~~~~~-------------------------~vih~~~~~~~~~~~~~~G~s~~~~-~~ 182 (386) .... +.+.... ........+..+ +|+||++ .. ...+.+|.|.+.- +. T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n--~~-~~~~~~G~s~i~~~v~ 235 (485) T protein:vir:24 159 GRPAKAIRVAYDAEGNEIQAATLYTPNETFGWFRAEGEWVEWFSDPHGLGAVPVVPLPN--RT-RLSDLYGTSEITPELR 235 (485) T ss_pred CceeEEEEEEEeecCCeEEEEEEEcCCcEEEEEecCCceEeecccccCCCcccEEEecc--Cc-ccCCcCCcccchhhHH Confidence 2211 1111000 000000111111 2334431 11 2344578887653 34 Q ss_pred HHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHH--HHHHHHHHHhcccccCcceecC-CCceeeeccCChhhH Q lcl|NC_011801. 183 NEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKE--NTRQSFEEQTTGENAGRAVVLD-QSADVETTNISPNVT 259 (386) Q Consensus 183 ~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~--~~k~~~~~~~~~~~~g~~~vl~-~g~~~~~~~~~~~d~ 259 (386) ..++....+..-.....+..+.|..++. +........+ .-+..|+. ..+.++.++ ++.++.++....-+ T Consensus 236 ~liDa~~~~~s~~~~~~~~~a~p~~~i~--G~~~~~~~~~~~~~~~~~~~-----~~~~i~~~~~~~~~~~q~~~~~~e- 307 (485) T protein:vir:24 236 SMTDAAARILMLMQATAELMGVPQRLIF--GIKPEEIGVDPETGQTLFDA-----YLARILAFEDAEGKIQQFSAAELA- 307 (485) T ss_pred HHHHHHHHHHHHHHHHHHhhcchhhhhc--cCCccccccccccccchhhh-----cccceeccCCCCceEEeecccchH- Confidence 4444444443333333444455555543 2111111000 01111221 123455554 56677666543323 Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHH--------------HHHHHHHHHHHHHHHHHHHHHhhh----- Q lcl|NC_011801. 260 EFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMI--------------RAFYQSSLSIYIKPIESELSQKLG----- 320 (386) Q Consensus 260 ~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~--------------~~~~~~~l~P~~~~ie~~l~~~l~----- 320 (386) .+++.++..+.+++..=++|+..++.......+..+. +..+...|.-+++.+....+. .. T Consensus 308 ~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~-~~~~~d~ 386 (485) T protein:vir:24 308 NFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNAIFGGAWEEAMRLAYRLMKG-GDVPPDM 386 (485) T ss_pred HHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CCCcccc Confidence 3788888888889999999999997543211121121 112222233222222211110 00 Q ss_pred hhhhhcchhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhccCCc--------------------------CCCCCC- Q lcl|NC_011801. 321 TDVKLDIASAIDSDNSELINNVQKLASAG--VLAPIQAQKLLKNRGV--------------------------FPELDL- 371 (386) Q Consensus 321 ~~~~fd~~~~l~~d~~~~~~~~~~~~~~g--~~t~nE~R~~lg~~p~--------------------------~p~~~~- 371 (386) ..+++.+..-...+..+.++.+.+++.+| +++..-+++++|..+- .+..+. T Consensus 387 ~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l~~~~d~~~e~~~~~ee~~~~~~~~~~~~~~~~~~~~~~ 466 (485) T protein:vir:24 387 LRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLLGTMVDADPTVPGS 466 (485) T ss_pred ceeeEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHhhCCCCHhHHHHHHHHHHHHhhhhhhHHHhhcccCCCCCCC Confidence 12334444445567888888889988765 5554444444332110 000000 Q ss_pred Ccccccc--ccCCCCCC Q lcl|NC_011801. 372 DEGTNLL--DNTKNIND 386 (386) Q Consensus 372 ~~~~~~~--~~~~~~~~ 386 (386) +..++.- .+..+..| T Consensus 467 ~~~~e~~~~~~~~~~~~ 483 (485) T protein:vir:24 467 PNPTPAPKPQPAIEGGD 483 (485) T ss_pred CCCCCCCCCccCCCCCC Confidence 0000000 11111112 No 167 >protein:vir:97900 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655104;genbank:gi:109391854;genbank:GeneID:4157263 Probab=98.38 E-value=4.6e-07 Score=55.34 Aligned_cols=371 Identities=12% Similarity=0.095 Sum_probs=192.7 Q ss_pred Cchhh-hhccccccCCccchhhhhhc-cccc---------c------cCcccccHHHHhccHHHHHHHHHHHHhhccCce Q lcl|NC_011801. 1 MAFLS-NLFKRQKMLSGSSPVWILNQ-GQPV---------S------IKPKAITSAIALKNSDVYAVISRVSSDIAGCRF 63 (386) Q Consensus 1 Mg~~~-~l~~~~~~~~~~~~~~~~~~-~~~~---------~------~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~ 63 (386) |.--+ |+.+|++..+.......... .+.. + .+++.--.+-+--++.++..+..+++.++++.+ T Consensus 1 ma~~~lr~~rrpk~~p~~~rr~~ltaAsq~~~~p~~~~kt~~~~~ar~~WQ~eAW~~~d~v~Elry~vgW~~~s~sr~rL 80 (639) T protein:vir:97 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYVSWRANSCSRTTL 80 (639) T ss_pred CCccceeeeecCCCCCcchhhHHHhhhhhccCCcccchhhhccccchhhhhhhhhhhhhhhhhHHHHhhhhhhhhceeee Confidence 65543 34455544433221111100 0000 0 011111111223357888889999999999987 Q ss_pred eec----------------c----hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEee-cCCCc------eEEE Q lcl|NC_011801. 64 VTN----------------A----QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDR-DTNGY------PVRI 116 (386) Q Consensus 64 ~~~----------------~----~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~-~~~g~------~~~l 116 (386) .+- + +.+......--.--+...++++.+..++-.-|++|+.++- ...+. +.+- T Consensus 81 ~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~iagG~lGqa~llkr~~~~ltV~GE~wi~~l~r~~k~~~~~~~~~~~~ 160 (639) T protein:vir:97 81 IPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (639) T ss_pred EeeeeccccCCCCCccccccccCcchHHHHHHhhcCccchHHHHHHHHHhheecccceEEEEEEecCccccCcccccccc Confidence 532 1 1122222222345566788999999999999999987543 33221 2333 Q ss_pred EE-EcCcceEEeecCCCceeEEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 117 EP-VPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLA 195 (386) Q Consensus 117 ~~-l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~ 195 (386) |+ +..+.+... .+.... +... .|....|.-+.-+.+|.++ +.+.....--||+.++...+.-..-..+.. T Consensus 161 W~vvs~~Ei~~~---~~~~~~--i~lP---dG~~he~~~~~d~l~RvW~-P~prr~~e~dSpvra~l~~l~Ei~~~t~~i 231 (639) T protein:vir:97 161 WYAVTREEIKSK---AGETAE--ISLP---DGKTHEFNRDLDSLVRIWN-PRPRKASQATSPVRACLETLREIERTTRKI 231 (639) T ss_pred eeeeeHHHhccc---CCCeeE--eecC---CCCCccccCCCceEEEEeC-CCcccccCCcchhHHHHHHHHHHHHhhhHH Confidence 33 223333311 111111 1111 1222234444444577665 456667788999999999988888777777 Q ss_pred HHHHhccCCCceEEeeCCCC------------------------CCHHHHHHHHHHHH----HHhccc--ccC-cceecC Q lcl|NC_011801. 196 ISTLRHAIKPSIFIKVPNAT------------------------LGKEAKENTRQSFE----EQTTGE--NAG-RAVVLD 244 (386) Q Consensus 196 ~~~~~ng~~~~~~l~~~~~~------------------------~~~~~~~~~k~~~~----~~~~~~--~~g-~~~vl~ 244 (386) .+..+.-....|++.+|... .+....+.+...|- ..+... .+- -++++. T Consensus 232 ~aaakSRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA~vPiia~ 311 (639) T protein:vir:97 232 KNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVAS 311 (639) T ss_pred HHHHHHHHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhhcCCCCccceeeeeEe Confidence 77666555555665554211 01122334444443 333322 222 233332 Q ss_pred C----CceeeeccCCh-hhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccH-H-HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 245 Q----SADVETTNISP-NVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSN-I-TMIRAFYQSSLSIYIKPIESELSQ 317 (386) Q Consensus 245 ~----g~~~~~~~~~~-~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~-~-~~~~~~~~~~l~P~~~~ie~~l~~ 317 (386) . .-+++.+.... -+.--+++++..+.+||...-|||+.|-+.++++-- . +-...-++--|.|.+..|+++|++ T Consensus 312 ~p~E~l~~ikhl~f~~ei~e~aiktR~daI~RlA~glDi~pE~LLGl~d~NHWsAWqI~dedvrlHI~P~l~~icdAlT~ 391 (639) T protein:vir:97 312 VAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYN 391 (639) T ss_pred echHHhcCeeeeeecCchhHHHHhhHHHHHHHHHhccCCchhheeecccccceEEEEecccceeeecchhHHHHHHHHHh Confidence 1 22444444432 223347899999999999999999987544443211 0 111223455699999999999998 Q ss_pred hhhhh-------------hhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCC-----ccccccc Q lcl|NC_011801. 318 KLGTD-------------VKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLD-----EGTNLLD 379 (386) Q Consensus 318 ~l~~~-------------~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~-----~~~~~~~ 379 (386) .++.. +=||.+.+. .|+. +.+-+..++..|.+|-.=-|+.+|. .+.+++ ++-..+. T Consensus 392 ~~Lrp~Le~eGvDp~kYvvW~DaS~Lt-~dPd-~~deA~qa~drGAIt~eAlR~~lG~----~edd~yd~~t~e~~~~~A 465 (639) T protein:vir:97 392 DILTPLLAREGIDPTKYILWYDASGLT-SDPD-LSDEAVEAHDRGAITSAALRRLLNV----GEDSGYDLTTLDGCREFA 465 (639) T ss_pred hHHHHHHHHhCCCHHHhEeeecCcccc-cCCC-CcHHHHHHHHcCCccHHHHHHHhcc----ccccCCCCCCcHHHHHHH Confidence 76421 124555442 2221 2333345788999998888998883 222222 1111111 Q ss_pred cCCCCCC Q lcl|NC_011801. 380 NTKNIND 386 (386) Q Consensus 380 ~~~~~~~ 386 (386) ...-..| T Consensus 466 ~~~V~~~ 472 (639) T protein:vir:97 466 ADVVTKN 472 (639) T ss_pred HHHhcCC Confidence 1111112 No 168 >protein:vir:107517 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943786;genbank:gi:38638411;genbank:GeneID:2657197 Probab=98.38 E-value=4.6e-07 Score=55.34 Aligned_cols=371 Identities=12% Similarity=0.095 Sum_probs=192.7 Q ss_pred Cchhh-hhccccccCCccchhhhhhc-cccc---------c------cCcccccHHHHhccHHHHHHHHHHHHhhccCce Q lcl|NC_011801. 1 MAFLS-NLFKRQKMLSGSSPVWILNQ-GQPV---------S------IKPKAITSAIALKNSDVYAVISRVSSDIAGCRF 63 (386) Q Consensus 1 Mg~~~-~l~~~~~~~~~~~~~~~~~~-~~~~---------~------~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~ 63 (386) |.--+ |+.+|++..+.......... .+.. + .+++.--.+-+--++.++..+..+++.++++.+ T Consensus 1 ma~~~lr~~rrpk~~p~~~rr~~ltaAsq~~~~p~~~~kt~~~~~ar~~WQ~eAW~~~d~v~Elry~vgW~~~s~sr~rL 80 (639) T protein:vir:10 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYVSWRANSCSRTTL 80 (639) T ss_pred CCccceeeeecCCCCCcchhhHHHhhhhhccCCcccchhhhccccchhhhhhhhhhhhhhhhhHHHHhhhhhhhhceeee Confidence 65543 34455544433221111100 0000 0 011111111223357888889999999999987 Q ss_pred eec----------------c----hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEee-cCCCc------eEEE Q lcl|NC_011801. 64 VTN----------------A----QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDR-DTNGY------PVRI 116 (386) Q Consensus 64 ~~~----------------~----~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~-~~~g~------~~~l 116 (386) .+- + +.+......--.--+...++++.+..++-.-|++|+.++- ...+. +.+- T Consensus 81 ~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~iagG~lGqa~llkr~~~~ltV~GE~wi~~l~r~~k~~~~~~~~~~~~ 160 (639) T protein:vir:10 81 IPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (639) T ss_pred EeeeeccccCCCCCccccccccCcchHHHHHHhhcCccchHHHHHHHHHhheecccceEEEEEEecCccccCcccccccc Confidence 532 1 1122222222345566788999999999999999987543 33221 2333 Q ss_pred EE-EcCcceEEeecCCCceeEEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 117 EP-VPNEKVTVALDDYGKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLA 195 (386) Q Consensus 117 ~~-l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~ 195 (386) |+ +..+.+... .+.... +... .|....|.-+.-+.+|.++ +.+.....--||+.++...+.-..-..+.. T Consensus 161 W~vvs~~Ei~~~---~~~~~~--i~lP---dG~~he~~~~~d~l~RvW~-P~prr~~e~dSpvra~l~~l~Ei~~~t~~i 231 (639) T protein:vir:10 161 WYAVTREEIKSK---AGETAE--ISLP---DGKTHEFNRDLDSLVRIWN-PRPRKASQATSPVRACLETLREIERTTRKI 231 (639) T ss_pred eeeeeHHHhccc---CCCeeE--eecC---CCCCccccCCCceEEEEeC-CCcccccCCcchhHHHHHHHHHHHHhhhHH Confidence 33 223333311 111111 1111 1222234444444577665 456667788999999999988888777777 Q ss_pred HHHHhccCCCceEEeeCCCC------------------------CCHHHHHHHHHHHH----HHhccc--ccC-cceecC Q lcl|NC_011801. 196 ISTLRHAIKPSIFIKVPNAT------------------------LGKEAKENTRQSFE----EQTTGE--NAG-RAVVLD 244 (386) Q Consensus 196 ~~~~~ng~~~~~~l~~~~~~------------------------~~~~~~~~~k~~~~----~~~~~~--~~g-~~~vl~ 244 (386) .+..+.-....|++.+|... .+....+.+...|- ..+... .+- -++++. T Consensus 232 ~aaakSRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA~vPiia~ 311 (639) T protein:vir:10 232 KNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVAS 311 (639) T ss_pred HHHHHHHHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhhcCCCCccceeeeeEe Confidence 77666555555665554211 01122334444443 333322 222 233332 Q ss_pred C----CceeeeccCCh-hhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccH-H-HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 245 Q----SADVETTNISP-NVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSN-I-TMIRAFYQSSLSIYIKPIESELSQ 317 (386) Q Consensus 245 ~----g~~~~~~~~~~-~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~-~-~~~~~~~~~~l~P~~~~ie~~l~~ 317 (386) . .-+++.+.... -+.--+++++..+.+||...-|||+.|-+.++++-- . +-...-++--|.|.+..|+++|++ T Consensus 312 ~p~E~l~~ikhl~f~~ei~e~aiktR~daI~RlA~glDi~pE~LLGl~d~NHWsAWqI~dedvrlHI~P~l~~icdAlT~ 391 (639) T protein:vir:10 312 VAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYN 391 (639) T ss_pred echHHhcCeeeeeecCchhHHHHhhHHHHHHHHHhccCCchhheeecccccceEEEEecccceeeecchhHHHHHHHHHh Confidence 1 22444444432 223347899999999999999999987544443211 0 111223455699999999999998 Q ss_pred hhhhh-------------hhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCC-----ccccccc Q lcl|NC_011801. 318 KLGTD-------------VKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLD-----EGTNLLD 379 (386) Q Consensus 318 ~l~~~-------------~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~-----~~~~~~~ 379 (386) .++.. +=||.+.+. .|+. +.+-+..++..|.+|-.=-|+.+|. .+.+++ ++-..+. T Consensus 392 ~~Lrp~Le~eGvDp~kYvvW~DaS~Lt-~dPd-~~deA~qa~drGAIt~eAlR~~lG~----~edd~yd~~t~e~~~~~A 465 (639) T protein:vir:10 392 DILTPLLAREGIDPTKYILWYDASGLT-SDPD-LSDEAVEAHDRGAITSAALRRLLNV----GEDSGYDLTTLDGCREFA 465 (639) T ss_pred hHHHHHHHHhCCCHHHhEeeecCcccc-cCCC-CcHHHHHHHHcCCccHHHHHHHhcc----ccccCCCCCCcHHHHHHH Confidence 76421 124555442 2221 2333345788999998888998883 222222 1111111 Q ss_pred cCCCCCC Q lcl|NC_011801. 380 NTKNIND 386 (386) Q Consensus 380 ~~~~~~~ 386 (386) ...-..| T Consensus 466 ~~~V~~~ 472 (639) T protein:vir:10 466 ADVVTKN 472 (639) T ss_pred HHHhcCC Confidence 1111112 No 169 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=98.36 E-value=1.1e-06 Score=53.26 Aligned_cols=367 Identities=8% Similarity=-0.016 Sum_probs=162.9 Q ss_pred Cchh-------------hhhccc-------------------------cccCCccchhhhhhcccccccCccccc-HHHH Q lcl|NC_011801. 1 MAFL-------------SNLFKR-------------------------QKMLSGSSPVWILNQGQPVSIKPKAIT-SAIA 41 (386) Q Consensus 1 Mg~~-------------~~l~~~-------------------------~~~~~~~~~~~~~~~~~~~~~~~~~i~-~~~a 41 (386) |.=. +.+... ..-..+.+.+. ..............+ ...- T Consensus 1 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~-~~~~~~~~~~~~~~~~~~~k 79 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDIL-DAPPKRDVNGDYDETKPDWR 79 (478) T ss_pred CccccCCCCchhHHHHHHHHhhccCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchh-ccccccccccccccccccce Confidence 1111 111000 00000000000 000000000000000 0000 Q ss_pred hccHHHHHHHHHHHHhhccCceee--cchhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEE Q lcl|NC_011801. 42 LKNSDVYAVISRVSSDIAGCRFVT--NAQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPV 119 (386) Q Consensus 42 ~~~~~v~~~v~~ia~~ia~~p~~~--~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l 119 (386) +.++-...+|+..+.-+-+-|+.+ .+....+.|..--+. ...+....+..+.+.+|.||+.+..+.+|.+ .+..+ T Consensus 80 i~~n~~~~ivd~~~~~l~g~~~~~~~~~d~~~~~l~~~~~n--~~~~~~~~~~~~~~~~G~~~~~~~~d~~g~~-~~~~~ 156 (478) T protein:vir:10 80 MYTNYHQNLVDQKVAYAVANPVTFGVDNDKALKQIQHTLNH--KWDDKLVDILTAASNKGIEWVQPYVDEEGEF-KTFRV 156 (478) T ss_pred eccchHHHHHHHHHhhhccCCeeeecCChHHHHHHHHHHhc--CHHHHHHHHHHHHHhcCeEEEEEEecCCCee-EEEEE Confidence 122334456666666666666654 333333333221111 2445666778899999999999988888876 56778 Q ss_pred cCcceEEeecCCC-c-eeEEEEeccCcccceeEEEcccceeeeccccc----------------------cCcc------ Q lcl|NC_011801. 120 PNEKVTVALDDYG-K-DLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVS----------------------GESD------ 169 (386) Q Consensus 120 ~~~~v~~~~~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~----------------------~~~~------ 169 (386) +|..+.+..+... . .......+..........+....+.++++... .++. T Consensus 157 ~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv 236 (478) T protein:vir:10 157 PAEQAVPIWTNKERDELQAFIRVYELDGAERVEYWTKDDVTYYELKEGQLIPDFYRSDDHIQPHYYQGNKLMSWGRVPFI 236 (478) T ss_pred cccceEEEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCeeeccccccccccccceecccccccCCccceE Confidence 8998888765432 1 22111111111111111222333322221100 0000 Q ss_pred ---cccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceec--C Q lcl|NC_011801. 170 ---TQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVL--D 244 (386) Q Consensus 170 ---~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl--~ 244 (386) +...|.|.+..+...++....+.....+.++..+.|..+++ +...++ .+.....++. ++++.+ + T Consensus 237 ~~~n~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~--g~~~~~--~~~~~~~~~~-------~~~~~~~~~ 305 (478) T protein:vir:10 237 PFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILK--GYEGED--MKDFMHNLKY-------YKAISVAGE 305 (478) T ss_pred EeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeee--cCCccc--cchhhhhhhh-------cceEEecCC Confidence 22368888888888888877776666666676677755554 322221 1111111111 123323 2 Q ss_pred CCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHH--------------HHHHHHHHHHHHHH Q lcl|NC_011801. 245 QSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMI--------------RAFYQSSLSIYIKP 310 (386) Q Consensus 245 ~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~--------------~~~~~~~l~P~~~~ 310 (386) +|.++..+........+....+...+.|...-++|..-....+ ++-+..+. +..+..+++.+++. T Consensus 306 ~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~l 384 (478) T protein:vir:10 306 SGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFG-NSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQY 384 (478) T ss_pred CCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccCccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3444444444444555677888888889888888863322111 11121111 22333333333333 Q ss_pred HHHHHHHhh-hhhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCC------------------- Q lcl|NC_011801. 311 IESELSQKL-GTDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELD------------------- 370 (386) Q Consensus 311 ie~~l~~~l-~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~------------------- 370 (386) +...+.... ...+++.+..-+..|..+.++++.++ +|+++...+.++++.-. +|... T Consensus 385 i~~~~g~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~-D~~~E~~ri~~E~~~~~~~~~~~~ 461 (478) T protein:vir:10 385 IIDFYRLDVKVQDIEITFNFNVMVNELENSQIAMNS--TGLLSKETILSNHAWVE-DPVAEMERIEQENIELNQQLPDIE 461 (478) T ss_pred HHHHhCCCcccccceEEecCCCCCCHHHHHHHHHHH--hCCCChHHHHHhCCCCC-CHHHHHHHHHHHHHHHHhhccccc Confidence 332221100 12234555555667888999988887 67888777777664311 00000 Q ss_pred CCccccccccCCCCCC Q lcl|NC_011801. 371 LDEGTNLLDNTKNIND 386 (386) Q Consensus 371 ~~~~~~~~~~~~~~~~ 386 (386) ....++.-....++++ T Consensus 462 ~~~~~~~~~~~~~~~~ 477 (478) T protein:vir:10 462 EGLNGEQQRQSENNQP 477 (478) T ss_pred cccCCCCCCCCCCCCC Confidence 0001111111112222 No 170 >protein:vir:106027 Length: 629 # NCBI annotation: gp9 # Family: family:all:2798 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654906;genbank:gi:109392362;genbank:GeneID:4157055 Probab=98.35 E-value=3.6e-07 Score=55.89 Aligned_cols=370 Identities=10% Similarity=0.064 Sum_probs=192.5 Q ss_pred Cchhh-hhccccccCCccchhhhhhccc-c-----c-------ccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec Q lcl|NC_011801. 1 MAFLS-NLFKRQKMLSGSSPVWILNQGQ-P-----V-------SIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN 66 (386) Q Consensus 1 Mg~~~-~l~~~~~~~~~~~~~~~~~~~~-~-----~-------~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~ 66 (386) |.--+ |+.+|++..+............ + . ..+++.--.+-+--++.++..+..+++.++++.+..- T Consensus 1 ma~~~lrv~rrpk~~p~~r~l~aasqp~~P~~~~~~~~~g~~~~~~WQ~eAW~~~d~VgElryyvgW~~ss~Sr~rL~as 80 (629) T protein:vir:10 1 MAASTLRVSRRPKGSPARRSLTAASQPMEPGRTPSRQVAGTVVRTSWQNEAWECMDLVGELRYYVGWRASSCSRVELIAS 80 (629) T ss_pred CCccceeEEecCCCccceeeeccccCCCCcchhhchhhhhhhhhhhhhHHHHHHHHhhhhHHHHhhhhhhhheeeeEEEe Confidence 55432 2344444332211110000000 0 0 0011111111122346777788888999999987532 Q ss_pred ---------------chh----HHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCc----eEEEE-EEcCc Q lcl|NC_011801. 67 ---------------AQP----ITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGY----PVRIE-PVPNE 122 (386) Q Consensus 67 ---------------~~~----~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~----~~~l~-~l~~~ 122 (386) ++| ..+....--.--+...++++.+..++-.-|+.|+.++....+. +..-| .+..+ T Consensus 81 ~idpDtg~ptg~i~ed~p~~~~v~~~v~~iagG~lGqaqLlkr~~~~ltV~GE~~i~il~~~~~~pd~~~r~~W~vVt~~ 160 (629) T protein:vir:10 81 ELDPDTGKPTGGIRDDDPDGLRFLEIVKTMAGGPLGQAQLQKRAAECLTVPGEHRICLLDQGDKNPDGSVRHNWYVVTND 160 (629) T ss_pred eecCCCCCCccccccCchhHHHHHHHHHHhcCccchHHHHHHHHHhheeccCceEEEEeecCCCCCCcccccceeeecHH Confidence 122 1233333345556678899999999999999999886544442 33222 33333 Q ss_pred ceEEeecCCCceeEEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_011801. 123 KVTVALDDYGKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHA 202 (386) Q Consensus 123 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng 202 (386) .++. +...... +... .+....|..+.-+.+|.++ +.+.....--||+.++...+.-..-..+...+..+.- T Consensus 161 Ei~~---kg~g~~~--i~lp---dg~~he~~~~~D~l~RvW~-P~Prr~~e~DSpvra~l~~lrEi~r~tk~i~~aakSR 231 (629) T protein:vir:10 161 EVKN---KGAGKTD--IELP---DGTIHEYSKGRDVMFRVWN-PRPRRAKEPDSPVRACLDSLREIIRTTKKIRNASKSR 231 (629) T ss_pred Hhcc---ccCceeE--EEcC---CCceeeeeCCCeeEEEeeC-CCcccccCCcchhHHHHHHHHHHHHhhhHhHHHHHhH Confidence 3331 1111111 1111 1223444444445567665 4566667889999999999888877777777666655 Q ss_pred CCCceEEeeCCCC-----------C----------CHHHHHHHHHHHH----HHhccc--ccCc-ceecC-C---Cceee Q lcl|NC_011801. 203 IKPSIFIKVPNAT-----------L----------GKEAKENTRQSFE----EQTTGE--NAGR-AVVLD-Q---SADVE 250 (386) Q Consensus 203 ~~~~~~l~~~~~~-----------~----------~~~~~~~~k~~~~----~~~~~~--~~g~-~~vl~-~---g~~~~ 250 (386) ....|++.+|... - +....+.+...|- ..+... .+-- ++++. . --+++ T Consensus 232 L~gnGvlflP~e~slp~~~ap~~~~~Pg~~~p~~~g~aa~d~l~~~l~q~a~aAi~De~S~aA~vPiia~vP~E~l~~ik 311 (629) T protein:vir:10 232 LIGNGVVFLPQELSLPRATAPVADNQPGAPVPIVDGVAAADELSNLLFQTAAAAVDDEDSQAALIPLLATVPGEHLQKIF 311 (629) T ss_pred HhhCceeEeccCcccccccCCCCCCCCcccccccCCCcchHHHHHHHHHHHHhhhcCCCCccceeeeEEeechHHhcCee Confidence 5555555443211 0 1123344444443 333322 2222 33331 1 12444 Q ss_pred eccCC--hhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccH--H-HHHHHHHHHHHHHHHHHHHHHHHHhhhhh--- Q lcl|NC_011801. 251 TTNIS--PNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSN--I-TMIRAFYQSSLSIYIKPIESELSQKLGTD--- 322 (386) Q Consensus 251 ~~~~~--~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~--~-~~~~~~~~~~l~P~~~~ie~~l~~~l~~~--- 322 (386) .+... ..+. -+++++..+.++|...-|||+.|-+.++.+|- . +-...-++--|.|.++.|++++++.++.. T Consensus 312 hLkf~~eite~-~iktR~daI~RlAmglDispErLLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~Ait~~~Lrp~L~ 390 (629) T protein:vir:10 312 HLKIGNEITEV-EIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVQLHIKPVMEVLCAAIYREVLVATLR 390 (629) T ss_pred eeeecCchhHH-HHhhHHHHHHHHHhccCCChhheeeccCCccceeeEEecccceeeecchHHHHHHHHHHhHHHHHHHH Confidence 44443 3333 47899999999999999999987544322221 0 11122345569999999999999876421 Q ss_pred ----------hhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCC-----ccccccccCCCCCC Q lcl|NC_011801. 323 ----------VKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLD-----EGTNLLDNTKNIND 386 (386) Q Consensus 323 ----------~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~-----~~~~~~~~~~~~~~ 386 (386) +=||.+.+ ..|+. +.+-+..++..|.+|-..-|+.+|.- ..+++ ++-+.+....--.| T Consensus 391 ~eGiDp~~Yvvw~DaS~L-t~dPd-~~deA~~a~drGaIt~eAlRr~lG~~----~dd~y~~~t~~~~q~~A~~~v~~~ 463 (629) T protein:vir:10 391 AEGIDPDRYVLWYDASGL-TVDPD-KTDEATAAKEQGAITHEAYRRYLGLA----DEDGYDLETLEGAQAWARDAIVAD 463 (629) T ss_pred HhCCCHHHhEeeecCccc-ccCCC-CcHHHHHHHHcCCccHHHHHHHhccc----cccCCCcCCcHHHHHHHHHHhcCC Confidence 12454443 22332 23334558889999999999999842 22222 11121111111112 No 171 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=98.34 E-value=1.2e-06 Score=53.05 Aligned_cols=367 Identities=9% Similarity=0.000 Sum_probs=170.2 Q ss_pred Cchhhhhcccc-----------------c-----------cCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQ-----------------K-----------MLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVIS 52 (386) Q Consensus 1 Mg~~~~l~~~~-----------------~-----------~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~ 52 (386) =..|..+|... . -..+.+++....................-+.+.-..-+|+ T Consensus 29 ~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd 108 (492) T protein:vir:97 29 TEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVD 108 (492) T ss_pred hhHhhhcccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHH Confidence 00111111000 0 0000000000000000000000000000112344556777 Q ss_pred HHHHhhccCceeec--chhHHHHHhccC-cccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeec Q lcl|NC_011801. 53 RVSSDIAGCRFVTN--AQPITDVLNAPL-GNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALD 129 (386) Q Consensus 53 ~ia~~ia~~p~~~~--~~~~~~~l~~~P-N~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~ 129 (386) ..++-+-+-|+.+. +......|..-- |. .......+..+++.+|.||..+..+.+|.+ .+..++|..+.+..+ T Consensus 109 ~~~~yl~g~p~~~~~~d~~~~~~l~~~~~n~---~~~~~~~~~~~~~~~G~a~~~v~~d~dg~~-~~~~~~p~~~~~i~d 184 (492) T protein:vir:97 109 QKVSYIVGKPIAFKHTDDEVVKRIDEVLGNR---FDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWT 184 (492) T ss_pred HHhhhhcccCceeccCchHHHHHHHHHHhcc---HHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEEcccceEEEEc Confidence 77777666776543 333433332211 32 234555678889999999999999888875 577789999988876 Q ss_pred CCC--ceeEEEEeccCcccceeEEEcccceeeeccccc------------------cCcc---------cccccccHHHH Q lcl|NC_011801. 130 DYG--KDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVS------------------GESD---------TQYMGIPPIDS 180 (386) Q Consensus 130 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~------------------~~~~---------~~~~G~s~~~~ 180 (386) ... ........+..........+....+.++.+... +++. +...|.|.+.. T Consensus 185 ~~~~~~~~~~vr~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~ 264 (492) T protein:vir:97 185 DKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFM 264 (492) T ss_pred CCCCCceEEEEEEEeeccceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecCCCCCCCchHh Confidence 432 222211111111111112233333333321100 0000 11358888888 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHH Q lcl|NC_011801. 181 LLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTE 260 (386) Q Consensus 181 ~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~ 260 (386) +...++....+..-..+.++..+.|-.+++ +. +.+....++..+.. .+++.++++.+++.+.....+.. T Consensus 265 v~~liDa~d~~~S~~~~~~~~~~~~~l~~~--g~--~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~l~~~~~~~~ 333 (492) T protein:vir:97 265 YKTLIDAYNRRLSDLSNTFKDSNELTYVLK--NY--DDQELPEFKRLLRY-------YGAIKVSDNGGVDTIQVEVPVEN 333 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccceeeee--cC--CcccchhHHHHHhh-------ccceecCCCCcceeEeccCCHHH Confidence 888888877776666777777777766554 21 12222222222221 23555666656665555555666 Q ss_pred HHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHH--------------HHHHHHHHHHHHHHHHHHHHHhh-hhhhhh Q lcl|NC_011801. 261 FLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMI--------------RAFYQSSLSIYIKPIESELSQKL-GTDVKL 325 (386) Q Consensus 261 ~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~--------------~~~~~~~l~P~~~~ie~~l~~~l-~~~~~f 325 (386) +....+...+.|+..-++|..-....+ ++-+..+. +..+...+...++.+...++.+- ...+++ T Consensus 334 ~~~~~~~L~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~i~v 412 (492) T protein:vir:97 334 SKKYLDELYQKIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDI 412 (492) T ss_pred HHHHHHHHHHHHHHHhCCCCCCccccc-cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccceeeE Confidence 778888888999999998863322111 11111111 22333344444444433332211 123345 Q ss_pred cchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCC----c---ccc-ccccCCCCCC Q lcl|NC_011801. 326 DIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLD----E---GTN-LLDNTKNIND 386 (386) Q Consensus 326 d~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~----~---~~~-~~~~~~~~~~ 386 (386) .+..-+..|..+.++++.++ .|+++...+.++++.-+ +|..... | ..+ .-....++.| T Consensus 413 ~f~~~~p~~~~e~a~~~~kl--~G~iS~et~l~~l~~v~-d~~~Eleri~~E~~~~~~~~~~~~~~~~~ 478 (492) T protein:vir:97 413 SFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVE-DLQAELERIEQEQTEYNKQLPNLDDGGAD 478 (492) T ss_pred EecCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCC-CHHHHHHHHHHHHHHHHHhhhccccCCCC Confidence 55666677888999998888 47787776666554211 0111100 0 000 0011111111 No 172 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=98.30 E-value=1.5e-06 Score=52.49 Aligned_cols=347 Identities=11% Similarity=0.083 Sum_probs=158.8 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHH----hccHHHHHHHHHHHHhhccCceeecchhHHHHHhc Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIA----LKNSDVYAVISRVSSDIAGCRFVTNAQPITDVLNA 76 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a----~~~~~v~~~v~~ia~~ia~~p~~~~~~~~~~~l~~ 76 (386) |..... .... - ...-.........+..+..+.. +......-+|+-+++.+.=-.|...+..+..++.. T Consensus 1 l~~~~~----r~~~---~-~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vds~a~rl~~~Gf~~~d~~l~~i~~~ 72 (410) T protein:vir:95 1 MNLYQS----RVNL---R-YKHYAMQHYEAPTGITIPAHIRAKYQAVLGWAAKGVDSLADRLIFRAFANDDFNVTEIFDR 72 (410) T ss_pred CCcchh----hHHH---H-HHHhcCCCCccccchhccHHHHhHHHhhcchhHHHHHHhHhhhccccccCCCchHHHHHhh Confidence 332210 0000 0 0000000000001111111110 01112223444444433323344445556665533 Q ss_pred cCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCc-ccc---eeEEE Q lcl|NC_011801. 77 PLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDS-KRS---GDFLY 152 (386) Q Consensus 77 ~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~-~~~---~~~~~ 152 (386) |... .....+..+.+.+|.||+.+..+.+|.+ .+.+++|..+....|.......+.+.+... ..+ ....+ T Consensus 73 --N~ld---~~~~~~~~~al~~G~sf~~v~~~~d~~~-~i~~~sP~~~~~i~Dp~~~~~~~al~~~~~~~~~~~~~~~~~ 146 (410) T protein:vir:95 73 --NNPD---IFFDSAILSALIGSCSFVYISKGEDDEV-RLQVIESSNATGVIDPITGLLVEGYAVLARDDYNRPTLEAYF 146 (410) T ss_pred --cChH---HHHHHHHHHHHHhCceeEEEecCCCCce-EEEEEcccceEEEEeCCCCceEEEEEEEEecCCCeEEEEEEE Confidence 3332 3555778889999999999999888875 678899999988877655544332221110 111 11122 Q ss_pred cccc---------------------eeeeccccccCcccccccccH----HHHHHHHHHHHHHHHHHHHHHHhccCCCce Q lcl|NC_011801. 153 DSSE---------------------VIHFRCTVSGESDTQYMGIPP----IDSLLNEIEVQDLSSKLAISTLRHAIKPSI 207 (386) Q Consensus 153 ~~~~---------------------vih~~~~~~~~~~~~~~G~s~----~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ 207 (386) .++. |++|.+ ... .++.+|.|. +..+...+.....-......|+ +.|.. T Consensus 147 ~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n--~~~-l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~---a~pqr 220 (410) T protein:vir:95 147 EPNATHFIPKDGEPYSVTNETGIPLLVPVIH--RPD-AVRPFGRSRITRAGMYYQKYAKRTLERADITAEFY---SWPQK 220 (410) T ss_pred eCCcEEEEeeCCccccccCCCCCcceEEecc--ccc-CCccCCccccchhHHHHHHHHHHHHHHHHHHHHHh---cchhh Confidence 2222 333332 111 234467774 4444444444444444445554 44545 Q ss_pred EEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCC-----CceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHH Q lcl|NC_011801. 208 FIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQ-----SADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADY 282 (386) Q Consensus 208 ~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~-----g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~ 282 (386) ++..- ..+.+..+.++... ++++.++. +.++.++....-+ .|++.++.....||..=++|+.. T Consensus 221 ~i~G~--d~d~~~~~~~~~~~---------~~i~~~~~~~~~~~~~v~q~~~~~l~-~~~~~l~~l~~~~a~~s~lP~~~ 288 (410) T protein:vir:95 221 YILGL--DPDAEPMEKWKATV---------SSLLTISSSDKGVKPSVGQFTTASMS-PFTEQLRTAAAGFAGEMGLTLDD 288 (410) T ss_pred eeecc--CCCCCcCchhhhhh---------hhheeccCCCCCCcceEEecCCCChH-HHHHHHHHHHHHHhhhcCCCHHH Confidence 44321 11222222222222 23555542 2456555432222 48999999999999999999999 Q ss_pred hcCCcCcccHHHHHHH---HHHHHHHHHHHHHHHHHHHhh--h--------------hhhhhcchhhh---ccCHHHHHH Q lcl|NC_011801. 283 LSGKQDAQSNITMIRA---FYQSSLSIYIKPIESELSQKL--G--------------TDVKLDIASAI---DSDNSELIN 340 (386) Q Consensus 283 l~~~~~~~~~~~~~~~---~~~~~l~P~~~~ie~~l~~~l--~--------------~~~~fd~~~~l---~~d~~~~~~ 340 (386) +|.......+.++.++ =+.....-..+.|.+.+.+.+ . ...+..+.+.. ..+..+.++ T Consensus 289 lg~~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~d~~~~s~a~~aD 368 (410) T protein:vir:95 289 LGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRYTRSQFVRTAVKWEPLFEADANTMTMIGD 368 (410) T ss_pred hccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccccceeeEEeeecCCcchhhHHHHHH Confidence 9865431112222111 111111222222223332211 0 11122233221 224566788 Q ss_pred HHHHHHhC--CCcCHHHHHHHhccCCcCCCCCCCcccc-ccccCCCCC Q lcl|NC_011801. 341 NVQKLASA--GVLAPIQAQKLLKNRGVFPELDLDEGTN-LLDNTKNIN 385 (386) Q Consensus 341 ~~~~~~~~--g~~t~nE~R~~lg~~p~~p~~~~~~~~~-~~~~~~~~~ 385 (386) ++.|++++ |+....-+++++|+.+.. +-.. .-....++. T Consensus 369 a~~Kl~~a~~g~~~~~~~~~~lg~~~~~------~~~~~~~e~~~~g~ 410 (410) T protein:vir:95 369 GVVKLNQALPGYINAETIRDLTGIAGDM------SAKPVVSEGGSNGE 410 (410) T ss_pred HHHHHHHhccCCccHHHHHHhcCCChHH------HHHHHHHHHHhCCC Confidence 88899987 788888899988864321 1000 111222222 No 173 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=98.30 E-value=1.6e-06 Score=52.43 Aligned_cols=351 Identities=12% Similarity=0.079 Sum_probs=168.3 Q ss_pred Cchhhhhccccc-------cCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec--chhHH Q lcl|NC_011801. 1 MAFLSNLFKRQK-------MLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN--AQPIT 71 (386) Q Consensus 1 Mg~~~~l~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~--~~~~~ 71 (386) ..|.+....+.. -..+.+.+.. ... ......+. -+.++-...+|+..++-+-+-|+.+. +.... T Consensus 7 ~~~i~~~~~~~~r~~~l~~yy~g~~~il~----~~~-~~~~~~~~--ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~~~~ 79 (429) T protein:vir:98 7 SELIQKHRSFNLSYSAYKQLYEGDHAILQ----QKQ-KEQYKPDN--RLVVNFAKYIVDTFNGYFIGVPVQTSHENKQVS 79 (429) T ss_pred HHHHHHHHHHHHHHHHHHHHhcccccccc----ccc-cccCCCcc--eeecchHHHHHHHHhhhhcccCceeecCChHHH Confidence 111111100000 0000011000 000 00000011 12344556778877777777776543 22222 Q ss_pred HHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCc--eeEEEEeccC-cccce Q lcl|NC_011801. 72 DVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGK--DLTYTVHFDD-SKRSG 148 (386) Q Consensus 72 ~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~--~~~~~~~~~~-~~~~~ 148 (386) ..|..- ...-........+..+.+.+|.||+.+..+.+|.+ .+..++|..+.+..+.... .......+.. ..... T Consensus 80 ~~l~~~-~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~~~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~ 157 (429) T protein:vir:98 80 NYLELL-DGYNDQDDNNAELSKICSIYGHGYELVFNDENAEA-GITYLTPLEAFIVYDDSIRQKPLFAVRYFYNKGGVLE 157 (429) T ss_pred HHHHHH-HhhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCcE-EEEEEcccceEEEEeCCCCCceEEEEEEEEecCceEE Confidence 222111 11112335667788899999999999999999876 4677888888877664322 2211111111 01011 Q ss_pred eEEEccc-------------------------ceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_011801. 149 DFLYDSS-------------------------EVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAI 203 (386) Q Consensus 149 ~~~~~~~-------------------------~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~ 203 (386) ...+..+ +|++++ +...|.|.+..+...++....+.....+.++.++ T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~--------n~~~g~sd~e~v~~liD~~d~~~s~~~~~~~~~~ 229 (429) T protein:vir:98 158 GSYSDASNITYFKDGEKGIEIGESEPHPFDGVPMIEYV--------ENEERQSLLASVVTLINAFNKAISEKANDVEYFA 229 (429) T ss_pred EEEEeCceEEEEEecCCceEecccccccCCccceEEec--------CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 1111111 122221 1236888888888888887777777777777777 Q ss_pred CCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCC----CceeeeccCChhhHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_011801. 204 KPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQ----SADVETTNISPNVTEFLQNVSFSQDQIAKAFGIP 279 (386) Q Consensus 204 ~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~----g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp 279 (386) .|-.+++ +...+++..+.++. ++++.+++ +.++..+........+....+...+.|+..-++| T Consensus 230 ~p~~~i~--g~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p 296 (429) T protein:vir:98 230 DAYLKIL--GAELDDETLKSLRD-----------TRIINLKDTDAQQLTVEFLQKPDADATQEHLLDRLENLIFRTAMVA 296 (429) T ss_pred Cceeeee--cCCCCcchhhhHhh-----------CceeeccCCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCcc Confidence 7766665 33344443332211 12333321 2244444444444445667889999999999998 Q ss_pred HHHhcCCcCcccHHHHH--------------HHHHHHHHHHHHHHHHHHHHHhhh----hhhhhcchhhhccCHHHHHHH Q lcl|NC_011801. 280 ADYLSGKQDAQSNITMI--------------RAFYQSSLSIYIKPIESELSQKLG----TDVKLDIASAIDSDNSELINN 341 (386) Q Consensus 280 ~~~l~~~~~~~~~~~~~--------------~~~~~~~l~P~~~~ie~~l~~~l~----~~~~fd~~~~l~~d~~~~~~~ 341 (386) ..-.... ++-+..+. +..+...+.-.++.+...++..-. ..+++.+...+..|..+.++. T Consensus 297 ~~~~~~~--gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~d~~~i~v~f~~~~p~~~~~~a~~ 374 (429) T protein:vir:98 297 NISDESF--GTASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSKIGPKDWIGIKYKFTRNLPANLLEESQI 374 (429) T ss_pred ccCcccc--ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccceEEeCCCCCcCHHHHHHH Confidence 5322211 11121111 123333333333333333222111 124555666677889999999 Q ss_pred HHHHHhCCCcCHHHHHHHhccCCcCCCCCCC----c-------cccccccCCCCCC Q lcl|NC_011801. 342 VQKLASAGVLAPIQAQKLLKNRGVFPELDLD----E-------GTNLLDNTKNIND 386 (386) Q Consensus 342 ~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~----~-------~~~~~~~~~~~~~ 386 (386) +.++ +|+++..-+.++++.-+ +|..... | -...+..+.+.+| T Consensus 375 ~~kl--~g~is~et~~~~l~~v~-d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 427 (429) T protein:vir:98 375 AGNL--AGIVSEETQVGVLSIVE-NPQKEIERKNSDKSTLISRQAGGLNGQNTTTI 427 (429) T ss_pred HHHH--hccCchHHHHHhCCCCC-CHHHHHHHHHHHHHHHHHHHHhhhcCCCCCCC Confidence 8888 57898877777665321 1111100 0 0111122222222 No 174 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=98.29 E-value=1.6e-06 Score=52.36 Aligned_cols=374 Identities=13% Similarity=0.101 Sum_probs=170.7 Q ss_pred Cchhh---hhcccccc-CCccc----hhhhh-hcccccccCccc--cc----HHHHhccHHHHHHHHHHHHhhccCcee- Q lcl|NC_011801. 1 MAFLS---NLFKRQKM-LSGSS----PVWIL-NQGQPVSIKPKA--IT----SAIALKNSDVYAVISRVSSDIAGCRFV- 64 (386) Q Consensus 1 Mg~~~---~l~~~~~~-~~~~~----~~~~~-~~~~~~~~~~~~--i~----~~~a~~~~~v~~~v~~ia~~ia~~p~~- 64 (386) |+.-+ .+..+... .+... ..+.. ..+....+.... .. ....+...-...+++..|+-+.+-|.. T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~~~~~~n~~k~i~~~~a~~l~~~p~~i 95 (496) T protein:vir:38 16 MGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNRRQLSMNLPKVTAKYMSKLLFNEKVKI 95 (496) T ss_pred hccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccccceeecchHHHHHHHHhhhhhCCcceE Confidence 33321 11111111 11000 00000 000001010000 00 001112234456777777777665554 Q ss_pred -ecchhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCcee---E---- Q lcl|NC_011801. 65 -TNAQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDL---T---- 136 (386) Q Consensus 65 -~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~---~---- 136 (386) +.+....+.|..- -..-....-...++.+...+|.+|+.+..|.+|.+ .+..++|+.+-+.....+... + T Consensus 96 ~~~d~~~~e~l~~~-~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~-~i~~v~~~~~~P~~~~~~~~~~~~f~~~~ 173 (496) T protein:vir:38 96 NIDDKAAEEFVLNV-LKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNV-KVSFATADCMYPLSNDSENVDECVIANSF 173 (496) T ss_pred eeCChHHHHHHHHH-HhccCHHHHHHHHHHHHhhhCcEEEEEEEcCCCcE-EEEEEcccceEEEEecCCcEEEEEEEEEE Confidence 4455555555331 11122445566788888899999999999888765 456677777665433322211 0 Q ss_pred ------EE------------------EeccCc-ccceeEE-------------Ecccc---eeeeccccccC-ccccccc Q lcl|NC_011801. 137 ------YT------------------VHFDDS-KRSGDFL-------------YDSSE---VIHFRCTVSGE-SDTQYMG 174 (386) Q Consensus 137 ------~~------------------~~~~~~-~~~~~~~-------------~~~~~---vih~~~~~~~~-~~~~~~G 174 (386) |. +...+. ..+..+. +..-+ +.|++....+. ......| T Consensus 174 ~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G 253 (496) T protein:vir:38 174 HKNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFDDIEPVVPLPDFTRPTFIYIKPNIANNKNLTSPLG 253 (496) T ss_pred EeCCeEEEEEEEEEEeCceEEEEEEEEecCCccccCccccccccccccccceeecCCCcceEEEecCCcccccccCCcCC Confidence 00 000000 0011100 00001 22232111111 1223479 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCC------CHHHHHHHHHHHHHHhcccccCcceecCCCce Q lcl|NC_011801. 175 IPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATL------GKEAKENTRQSFEEQTTGENAGRAVVLDQSAD 248 (386) Q Consensus 175 ~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~------~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~ 248 (386) +|.+..+...++....+.....+-|+.| .+..++ +...+ +.+....+....+ .+. .....-.+++.. T Consensus 254 ~Sd~~~~~~lid~ld~~~s~~~~~~~~~-~~~i~v--~~~~l~~~~~~~g~~~~~~~~~~~-~~~---~~~~~~~~~~~~ 326 (496) T protein:vir:38 254 ISVYANALDTLKTLDLMFDSYYQEFKLG-KKKVLV--PSSFVKTAVNLDGSTTQYFDSTDE-AFF---LYQGDQDDNGKA 326 (496) T ss_pred CchHhhHHHHHHHHHHHHHHHHHHHhhc-ccceec--chHHhhccCCCCCccccCCCCccc-eEE---EeecCCCccccc Confidence 9999999999998877766666666654 334433 11100 0000000000000 000 000111223345 Q ss_pred eeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccH-HH---HH----------HHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 249 VETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSN-IT---MI----------RAFYQSSLSIYIKPIESE 314 (386) Q Consensus 249 ~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~-~~---~~----------~~~~~~~l~P~~~~ie~~ 314 (386) ++.++......++.+..+...++|+...|+|+..++....+..+ .+ .. ...++.+|..++..+.+. T Consensus 327 i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~ 406 (496) T protein:vir:38 327 IKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEV 406 (496) T ss_pred ceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 66666666667788899999999999999999999865544321 11 11 122334444444444322 Q ss_pred HHHhh--------hhhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCC--------cccccc Q lcl|NC_011801. 315 LSQKL--------GTDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLD--------EGTNLL 378 (386) Q Consensus 315 l~~~l--------~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~--------~~~~~~ 378 (386) .+... ...+.+.++.-+..|..+.++.+.+++.+|+|+.-.++..+ |-..+++.. |....+ T Consensus 407 ~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~GiiS~et~l~~~---~~~~d~ea~~el~ri~~E~~~~~ 483 (496) T protein:vir:38 407 GKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRYTNAKNQGMIPLKIALQRA---WNITEAEADEWAEMLAKEKQAEM 483 (496) T ss_pred HHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhc---CCCChHHHHHHHHHHHHhhhccC Confidence 22111 12344566666778889999999999999998877766532 111111110 000000 Q ss_pred -ccCCC--CCC Q lcl|NC_011801. 379 -DNTKN--IND 386 (386) Q Consensus 379 -~~~~~--~~~ 386 (386) ....+ ..| T Consensus 484 ~~~d~~~~~~~ 494 (496) T protein:vir:38 484 PNNDMNGIFGE 494 (496) T ss_pred ccccccCCCCC Confidence 00101 111 No 175 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=98.29 E-value=1.7e-06 Score=52.26 Aligned_cols=369 Identities=15% Similarity=0.085 Sum_probs=177.3 Q ss_pred Cchhhhhccc--cccCC-----------ccchhhh--hhccc--cc---ccCc---------------c--cccHHHHhc Q lcl|NC_011801. 1 MAFLSNLFKR--QKMLS-----------GSSPVWI--LNQGQ--PV---SIKP---------------K--AITSAIALK 43 (386) Q Consensus 1 Mg~~~~l~~~--~~~~~-----------~~~~~~~--~~~~~--~~---~~~~---------------~--~i~~~~a~~ 43 (386) |.+..-+..- ...+. ....... ...+. .. .... . ...+..=+. T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~ 80 (474) T protein:vir:94 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLN 80 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccc Confidence 4433211000 00000 0000000 00000 00 0000 0 000000012 Q ss_pred cHHHHHHHHHHHHhhccCceeecc-------hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEE Q lcl|NC_011801. 44 NSDVYAVISRVSSDIAGCRFVTNA-------QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRI 116 (386) Q Consensus 44 ~~~v~~~v~~ia~~ia~~p~~~~~-------~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l 116 (386) ++-...+|+..++-+-+-|+.+.- ..+...|.+- ............+..+...+|.||..+..+.+|.+ .+ T Consensus 81 ~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~-~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~-~~ 158 (474) T protein:vir:94 81 NSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNF-AIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGDI-RI 158 (474) T ss_pred cchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHH-HhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCee-EE Confidence 344456677777777677776531 1233333221 11123445677788889999999999988888875 56 Q ss_pred EEEcCcceEEeecCCCceeEEEEec--cCccccee----EEEcccceeeecccc---------ccCc---------cccc Q lcl|NC_011801. 117 EPVPNEKVTVALDDYGKDLTYTVHF--DDSKRSGD----FLYDSSEVIHFRCTV---------SGES---------DTQY 172 (386) Q Consensus 117 ~~l~~~~v~~~~~~~~~~~~~~~~~--~~~~~~~~----~~~~~~~vih~~~~~---------~~~~---------~~~~ 172 (386) ..++|..+-+..+.....+.....+ .....+.. ..+....+.+++... ..++ .+.. T Consensus 159 ~~i~p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~ 238 (474) T protein:vir:94 159 KNIDPYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYNPLFGVPNNK 238 (474) T ss_pred EEEcccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCccceEEecCCC Confidence 7788888877765544332211000 00000000 011111111111000 0000 1123 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeec Q lcl|NC_011801. 173 MGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETT 252 (386) Q Consensus 173 ~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~ 252 (386) .|.|.+..+...++....+..-..+.++..+.|-.+++ +...+++....++ ..|.+.+.+++.+++.+ T Consensus 239 ~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~--g~~~~~~~~~~~~----------~~~~i~~~~~~~~~~~l 306 (474) T protein:vir:94 239 EMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLR--GMGMSEEMIQETQ----------KSGAFELFDKDMDVKYL 306 (474) T ss_pred CCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc--cCCCCchhhhhhh----------hcceeEecCCCCceeEE Confidence 57888888888888777766666666666666655554 4344444333221 12345556667777776 Q ss_pred cCChhhHHHHHHHHHHHHHHHHHhCCCHHHhc-CCcCcccH------------HHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011801. 253 NISPNVTEFLQNVSFSQDQIAKAFGIPADYLS-GKQDAQSN------------ITMIRAFYQSSLSIYIKPIESELSQKL 319 (386) Q Consensus 253 ~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~-~~~~~~~~------------~~~~~~~~~~~l~P~~~~ie~~l~~~l 319 (386) .....+..+....+...+.|...-++|..-.+ ..++.+.. ....+..+..++.-.++.|...++.+- T Consensus 307 ~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~ 386 (474) T protein:vir:94 307 TKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKG 386 (474) T ss_pred eccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 66555566788888899999999999874432 22211110 011223455555555555555554431 Q ss_pred h-------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCC-----------------CCCccc Q lcl|NC_011801. 320 G-------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPEL-----------------DLDEGT 375 (386) Q Consensus 320 ~-------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~-----------------~~~~~~ 375 (386) . ..+++.+..-+..|..+.++++.++. |+++...+.++++.-+ +|.. +..+++ T Consensus 387 ~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~--g~iS~et~~~~l~~v~-d~~~E~eri~~E~~e~~~~~~~~~~~~ 463 (474) T protein:vir:94 387 YNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK--GQVSERTRLGQSQLVD-DVDYELDEMEKESLEFNDKLPDIDEGD 463 (474) T ss_pred CCCCccccccceEEeCCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCCC-CHHHHHHHHHHHHHHHHhhcccccCCC Confidence 1 13455666667788999999999884 7888777777654211 0000 001111 Q ss_pred cccccCCCCCC Q lcl|NC_011801. 376 NLLDNTKNIND 386 (386) Q Consensus 376 ~~~~~~~~~~~ 386 (386) ..=..+.+-+| T Consensus 464 ~~~~~~~~~s~ 474 (474) T protein:vir:94 464 ANDKSQNNQSE 474 (474) T ss_pred cCCCCccccCC Confidence 11111222222 No 176 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=98.29 E-value=1.7e-06 Score=52.26 Aligned_cols=369 Identities=15% Similarity=0.085 Sum_probs=177.3 Q ss_pred Cchhhhhccc--cccCC-----------ccchhhh--hhccc--cc---ccCc---------------c--cccHHHHhc Q lcl|NC_011801. 1 MAFLSNLFKR--QKMLS-----------GSSPVWI--LNQGQ--PV---SIKP---------------K--AITSAIALK 43 (386) Q Consensus 1 Mg~~~~l~~~--~~~~~-----------~~~~~~~--~~~~~--~~---~~~~---------------~--~i~~~~a~~ 43 (386) |.+..-+..- ...+. ....... ...+. .. .... . ...+..=+. T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~ 80 (474) T protein:vir:10 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLN 80 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccc Confidence 4433211000 00000 0000000 00000 00 0000 0 000000012 Q ss_pred cHHHHHHHHHHHHhhccCceeecc-------hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEE Q lcl|NC_011801. 44 NSDVYAVISRVSSDIAGCRFVTNA-------QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRI 116 (386) Q Consensus 44 ~~~v~~~v~~ia~~ia~~p~~~~~-------~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l 116 (386) ++-...+|+..++-+-+-|+.+.- ..+...|.+- ............+..+...+|.||..+..+.+|.+ .+ T Consensus 81 ~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~-~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~-~~ 158 (474) T protein:vir:10 81 NSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNF-AIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGDI-RI 158 (474) T ss_pred cchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHH-HhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCee-EE Confidence 344456677777777677776531 1233333221 11123445677788889999999999988888875 56 Q ss_pred EEEcCcceEEeecCCCceeEEEEec--cCccccee----EEEcccceeeecccc---------ccCc---------cccc Q lcl|NC_011801. 117 EPVPNEKVTVALDDYGKDLTYTVHF--DDSKRSGD----FLYDSSEVIHFRCTV---------SGES---------DTQY 172 (386) Q Consensus 117 ~~l~~~~v~~~~~~~~~~~~~~~~~--~~~~~~~~----~~~~~~~vih~~~~~---------~~~~---------~~~~ 172 (386) ..++|..+-+..+.....+.....+ .....+.. ..+....+.+++... ..++ .+.. T Consensus 159 ~~i~p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~ 238 (474) T protein:vir:10 159 KNIDPYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYNPLFGVPNNK 238 (474) T ss_pred EEEcccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCccceEEecCCC Confidence 7788888877765544332211000 00000000 011111111111000 0000 1123 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeec Q lcl|NC_011801. 173 MGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETT 252 (386) Q Consensus 173 ~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~ 252 (386) .|.|.+..+...++....+..-..+.++..+.|-.+++ +...+++....++ ..|.+.+.+++.+++.+ T Consensus 239 ~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~--g~~~~~~~~~~~~----------~~~~i~~~~~~~~~~~l 306 (474) T protein:vir:10 239 EMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLR--GMGMSEEMIQETQ----------KSGAFELFDKDMDVKYL 306 (474) T ss_pred CCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc--cCCCCchhhhhhh----------hcceeEecCCCCceeEE Confidence 57888888888888777766666666666666655554 4344444333221 12345556667777776 Q ss_pred cCChhhHHHHHHHHHHHHHHHHHhCCCHHHhc-CCcCcccH------------HHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011801. 253 NISPNVTEFLQNVSFSQDQIAKAFGIPADYLS-GKQDAQSN------------ITMIRAFYQSSLSIYIKPIESELSQKL 319 (386) Q Consensus 253 ~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~-~~~~~~~~------------~~~~~~~~~~~l~P~~~~ie~~l~~~l 319 (386) .....+..+....+...+.|...-++|..-.+ ..++.+.. ....+..+..++.-.++.|...++.+- T Consensus 307 ~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~ 386 (474) T protein:vir:10 307 TKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKG 386 (474) T ss_pred eccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 66555566788888899999999999874432 22211110 011223455555555555555554431 Q ss_pred h-------hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCC-----------------CCCccc Q lcl|NC_011801. 320 G-------TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPEL-----------------DLDEGT 375 (386) Q Consensus 320 ~-------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~-----------------~~~~~~ 375 (386) . ..+++.+..-+..|..+.++++.++. |+++...+.++++.-+ +|.. +..+++ T Consensus 387 ~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~--g~iS~et~~~~l~~v~-d~~~E~eri~~E~~e~~~~~~~~~~~~ 463 (474) T protein:vir:10 387 YNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK--GQVSERTRLGQSQLVD-DVDYELDEMEKESLEFNDKLPDIDEGD 463 (474) T ss_pred CCCCccccccceEEeCCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCCC-CHHHHHHHHHHHHHHHHhhcccccCCC Confidence 1 13455666667788999999999884 7888777777654211 0000 001111 Q ss_pred cccccCCCCCC Q lcl|NC_011801. 376 NLLDNTKNIND 386 (386) Q Consensus 376 ~~~~~~~~~~~ 386 (386) ..=..+.+-+| T Consensus 464 ~~~~~~~~~s~ 474 (474) T protein:vir:10 464 ANDKSQNNQSE 474 (474) T ss_pred cCCCCccccCC Confidence 11111222222 No 177 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=98.26 E-value=1.9e-06 Score=51.93 Aligned_cols=370 Identities=11% Similarity=0.053 Sum_probs=165.7 Q ss_pred Cchhhhhcc---ccccCCcc------chh-----------hhhhccc-ccccCccc--ccHHHHhccHHHHHHHHHHHHh Q lcl|NC_011801. 1 MAFLSNLFK---RQKMLSGS------SPV-----------WILNQGQ-PVSIKPKA--ITSAIALKNSDVYAVISRVSSD 57 (386) Q Consensus 1 Mg~~~~l~~---~~~~~~~~------~~~-----------~~~~~~~-~~~~~~~~--i~~~~a~~~~~v~~~v~~ia~~ 57 (386) ||+|+++.+ .+....+. .+. +....+. ..+..... +.. .-+..+....+++.+|+- T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~-~~~~~~l~~~i~~~~A~l 79 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTVHD-KLMNSGTGNEIVVVAAEY 79 (518) T ss_pred CcchhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCcccc-ccccCChHHHHHHHHHHh Confidence 999986532 22221111 000 0000000 01000000 110 111222345567777776 Q ss_pred hccCc--eeec-----c-hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeec Q lcl|NC_011801. 58 IAGCR--FVTN-----A-QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALD 129 (386) Q Consensus 58 ia~~p--~~~~-----~-~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~ 129 (386) +..=| +.+. + ..+.+.|.+--... ....-+...+.+.+-.|.+++.+..+ +|++ .+..++++.+-+... T Consensus 80 l~~e~~~i~v~~~~~~d~e~~~~~l~~il~~n-~f~~~~~~~~e~a~a~G~~~~k~~~d-~~~~-~i~~v~ad~~~P~~~ 156 (518) T protein:vir:78 80 ISGKPLSIDVTGVNGSKDENLTKQLKEALRID-NFDSKSVKIVELAGGSGVSAVKINIL-NGRP-SISVHSSSQFWIDFK 156 (518) T ss_pred hcCCCceEEecCccccCcHHHHHHHHHHHHhc-cHHHHHHHHHHHhhccCceEEEEEEE-CCee-EEEEEcCCeeEEEee Confidence 65433 3332 1 22222332211111 12223344555566667777665544 3443 566677766655322 Q ss_pred CC---------------CceeE------------------------EEEeccCccccee---EEE--------ccc---- Q lcl|NC_011801. 130 DY---------------GKDLT------------------------YTVHFDDSKRSGD---FLY--------DSS---- 155 (386) Q Consensus 130 ~~---------------~~~~~------------------------~~~~~~~~~~~~~---~~~--------~~~---- 155 (386) .+ ....+ |.....+.+.+.. ... ..+ T Consensus 157 ~g~~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~~~~~e 236 (518) T protein:vir:78 157 NNEPFRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGDKTTPISAERLPEQITSYLHTNDIQL 236 (518) T ss_pred cCcEEEEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEeeecCcccccccccccccccccccccccCcc Confidence 11 00000 0000000000000 000 000 Q ss_pred -----------ceeeeccccccCc-ccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCC------ Q lcl|NC_011801. 156 -----------EVIHFRCTVSGES-DTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLG------ 217 (386) Q Consensus 156 -----------~vih~~~~~~~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~------ 217 (386) -+.|++....++. .+.-.|+|.+..+...++..+........-|+.|. +..++. ...+. T Consensus 237 ~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~-~~i~v~--~~~l~~~~~~~ 313 (518) T protein:vir:78 237 NHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEKTK-TKIAAS--ERMFRKKVNKS 313 (518) T ss_pred ceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHhCC-ceeeec--hhHhccCCCCC Confidence 0112221111111 11235999999999999998888777777777644 444442 11110 Q ss_pred -HHHHHHHHHHHHHHhcccccCcceecCCCc----eeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccH Q lcl|NC_011801. 218 -KEAKENTRQSFEEQTTGENAGRAVVLDQSA----DVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSN 292 (386) Q Consensus 218 -~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~----~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~ 292 (386) ....-.+... .+.+..-+ .-.++|. .++.++....+.++.+..+...++|....|+++..++..+....+ T Consensus 314 ~~~~~~~fd~~-~~~y~~i~----~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~~~TA 388 (518) T protein:vir:78 314 TDKEEWSMNVD-EDYFMQFK----GTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGNREVKA 388 (518) T ss_pred CCccccccCCC-CceEEEec----CcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCcccccccH Confidence 0000000000 00010000 0012222 366777777788899999999999999999999999754322222 Q ss_pred HH-------------HHHHHHHHHHHHHHHHHHHHHHHhhh----------hhhhhcchhhhccCHHHHHHHHHHHHhCC Q lcl|NC_011801. 293 IT-------------MIRAFYQSSLSIYIKPIESELSQKLG----------TDVKLDIASAIDSDNSELINNVQKLASAG 349 (386) Q Consensus 293 ~~-------------~~~~~~~~~l~P~~~~ie~~l~~~l~----------~~~~fd~~~~l~~d~~~~~~~~~~~~~~g 349 (386) .+ ..+..++.+|.-++..+...+....+ ..+.+++++.+-.|.++.++...+++.+| T Consensus 389 Tei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~~~~~~v~aG 468 (518) T protein:vir:78 389 TEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSSTLNNMNSAL 468 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhcC Confidence 11 11223334444444444333222111 13567788888999999999999999999 Q ss_pred CcCHHHHHHHhccCCcCCCCCCC--------ccc---------cccccCCCC Q lcl|NC_011801. 350 VLAPIQAQKLLKNRGVFPELDLD--------EGT---------NLLDNTKNI 384 (386) Q Consensus 350 ~~t~nE~R~~lg~~p~~p~~~~~--------~~~---------~~~~~~~~~ 384 (386) +|++.++-+++. |-..+++.. |-+ ---.+++++ T Consensus 469 imS~e~~i~~~~--~~~~deea~~e~~ri~~E~~~~~~~~p~~~~g~~~~~g 518 (518) T protein:vir:78 469 AMSVEEKVKLIH--PKWEDEEIQAEVKRIYLENAIGEVPDPEAIGGMETKGG 518 (518) T ss_pred CCCHHHHHHHhC--CCCCHHHHHHHHHHHHHHhcccCCCCCccccCCCCCCC Confidence 999888555442 101111100 000 001333333 No 178 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=98.25 E-value=2e-06 Score=51.83 Aligned_cols=379 Identities=12% Similarity=0.041 Sum_probs=177.1 Q ss_pred Cchhh-hhccccccCCccchhhh----------------hhcccc-cc-cCcccccH---HHHhccHHHHHHHHHHHHhh Q lcl|NC_011801. 1 MAFLS-NLFKRQKMLSGSSPVWI----------------LNQGQP-VS-IKPKAITS---AIALKNSDVYAVISRVSSDI 58 (386) Q Consensus 1 Mg~~~-~l~~~~~~~~~~~~~~~----------------~~~~~~-~~-~~~~~i~~---~~a~~~~~v~~~v~~ia~~i 58 (386) |.+.. ++|..+..-.-...... ...+.. .. ........ ..-+.++-....|+..++-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l 80 (453) T protein:vir:73 1 MNLKPIKLMTYSRDEEITDKVVNDFMKKHQEEVERYEYLGNMYKGIMEISSQKAKDSWKPDNRLTNNFAKYIVDTFVGYF 80 (453) T ss_pred CccccceeeeccccccCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcCCCCCccCccceeecchHHHHHHHhhhhh Confidence 55442 22222111000000000 000000 00 00000000 00012333445566666655 Q ss_pred ccCceeec--chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCce-e Q lcl|NC_011801. 59 AGCRFVTN--AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKD-L 135 (386) Q Consensus 59 a~~p~~~~--~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~-~ 135 (386) -+-|+.+. +......|.. -............+..+.+.+|.||..+..+.+|.+ .+..++|..+.+..+..... . T Consensus 81 ~g~~~~~~~~d~~~~~~l~~-~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~v~dd~~~~~~ 158 (453) T protein:vir:73 81 NGIPIKKTHDDKSVLEAMQL-FDNLNDMEDEESELAKIACVYGRAYELMYQNESTES-EVIYCSPLNVFMVYDDSIKQKP 158 (453) T ss_pred cccCceeecCChHHHHHHHH-HHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCce-EEEEEcccceEEEEeCCCCcee Confidence 55565442 3333333322 111112334566788899999999999999988876 46678888887776554221 1 Q ss_pred EEE--EeccCcccceeEEEcccceeeeccccc--------cCc---------ccccccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 136 TYT--VHFDDSKRSGDFLYDSSEVIHFRCTVS--------GES---------DTQYMGIPPIDSLLNEIEVQDLSSKLAI 196 (386) Q Consensus 136 ~~~--~~~~~~~~~~~~~~~~~~vih~~~~~~--------~~~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~ 196 (386) .+. +.....+......+....+++++.... +++ .+...|.|.+..+...++....+..... T Consensus 159 ~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~S~~~ 238 (453) T protein:vir:73 159 LFAVYYGFDEEGNLSGTVYTLLETISITGKAGEVKFGESTYNVYSDLPIVEYNFNEERQSIFEPVHSLINSYNKVTSEKA 238 (453) T ss_pred EEEEEEEEecCceEEEEEEeCCeEEEEEecCCceEEccceeccCCceeEEEecCCCCCCcchhhHHHHHHHHHHHHHHHH Confidence 111 111111111122233333333321100 000 1113578888888888877777666666 Q ss_pred HHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHh Q lcl|NC_011801. 197 STLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAF 276 (386) Q Consensus 197 ~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~ 276 (386) +..+..+.|..+++ +...+++....++..---.......+.....+.+.++..+.....+..+....+...+.|+..- T Consensus 239 ~~~~~~~~~~l~~~--g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s 316 (453) T protein:vir:73 239 NDVEYFSDQYLVFL--GAEVDEEDAKNIKDNRLINFFDKNSNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFT 316 (453) T ss_pred HHHHHhccceeeee--cCCCCchhhhcccccccccccccccccccccccCceeEEeeecCCHHHHHHHHHHHHHHHHHHh Confidence 66666677766654 4344555555544321111111222233444556666666555556666777888888998888 Q ss_pred CCCHHHhcCCcCccc-H-----------HHHHHHHHHHHHHHHHHHHHHHHHHhh----hhhhhhcchhhhccCHHHHHH Q lcl|NC_011801. 277 GIPADYLSGKQDAQS-N-----------ITMIRAFYQSSLSIYIKPIESELSQKL----GTDVKLDIASAIDSDNSELIN 340 (386) Q Consensus 277 gvp~~~l~~~~~~~~-~-----------~~~~~~~~~~~l~P~~~~ie~~l~~~l----~~~~~fd~~~~l~~d~~~~~~ 340 (386) ++|..-....++.+. + .+..+..+...+...++.+...+...- ...+++.+..-+..|..+.++ T Consensus 317 ~~p~~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~ 396 (453) T protein:vir:73 317 MAANISDENFGNSSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNASNKDAWKDIEYTFTRNEPKDIKEQAE 396 (453) T ss_pred CCcccCcccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCCCHHHHHH Confidence 888532221111110 0 011223444455555554443333221 123456666677788999999 Q ss_pred HHHHHHhCCCcCHHHHHHHhccCCcCCCCCCC----cccc--ccccCCCCCC Q lcl|NC_011801. 341 NVQKLASAGVLAPIQAQKLLKNRGVFPELDLD----EGTN--LLDNTKNIND 386 (386) Q Consensus 341 ~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~----~~~~--~~~~~~~~~~ 386 (386) .+.+++ |+++..-+.++++.-+ +|..... |-.+ ....+.+..+ T Consensus 397 ~~~k~~--giis~et~~~~~~~~~-d~~~E~~ri~~E~~~~~~~~~~~~~~~ 445 (453) T protein:vir:73 397 TANILK--GITSEETALSVISVIP-DVQAEMEKIKKKKLLQLSLTRTSNLVR 445 (453) T ss_pred HHHHHh--ccCcHHHHHHhCCCCC-CHHHHHHHHHHHHHHHHHHHHhccCCc Confidence 999886 7888766666554211 0111100 0000 0111111111 No 179 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=98.25 E-value=2e-06 Score=51.78 Aligned_cols=355 Identities=12% Similarity=0.107 Sum_probs=150.9 Q ss_pred CchhhhhccccccCCccchhhhhhccc-ccccCcccccHH-HHh----ccHHHHHHHHHHHHhhccCceeecch----hH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQ-PVSIKPKAITSA-IAL----KNSDVYAVISRVSSDIAGCRFVTNAQ----PI 70 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~-~~~~~~~~i~~~-~a~----~~~~v~~~v~~ia~~ia~~p~~~~~~----~~ 70 (386) =.|+..+-++.... ........+. .....+.....+ ... .+.-..-+|+.+++.+--..+.+.+. .+ T Consensus 33 ~~l~~~~~~~~~rl---~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~~v~n~~~~ivd~~a~~l~~~gf~~~d~~~~~~l 109 (501) T protein:vir:25 33 ADMWRLHISERQWL---DRIYEYTKGLRGRPEVPEGASDEVKELAKLSVKNVLSLVRDSFAQNLSVVGYRNALAKENDPA 109 (501) T ss_pred HHHHHHHHHHHHHH---HHHHHHHhcCCCchhccccCChhhhhhHhhhhcChHHHHHHHHHhhhcccceecCCccchHHH Confidence 01111110000000 0000000000 000011111111 000 01122334554444332234444332 23 Q ss_pred HHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCC-c--ee---EEEEeccC- Q lcl|NC_011801. 71 TDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYG-K--DL---TYTVHFDD- 143 (386) Q Consensus 71 ~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~-~--~~---~~~~~~~~- 143 (386) ..++.. |.. ......+..+.+.+|.||+.+.++..|. .+..++|..+.+..++.. . .. .|...... T Consensus 110 ~~i~~~--N~~---d~~~~~~~~~a~i~G~ay~~v~~de~~~--~i~~~sp~~~~~iy~D~~~~~~~~~ai~~~~~~~~~ 182 (501) T protein:vir:25 110 WEMWQR--NRM---DARQAEVHRPALTYGASYVTVTPTDEGP--VFRTRSPRQILAVYADPSVDAWPQYALETWVAQKDA 182 (501) T ss_pred HHHHHh--cCh---hHHHHHHHHHHhhcCceEEEEecCCCCC--eEEEeccccEEEEEecCCCCcceeEEEEEEeecccc Confidence 333322 332 3455678888999999999999988884 366678988886653321 1 11 11111100 Q ss_pred cccceeEEEccc----------------------------------------------ceeeeccccccCcccccccccH Q lcl|NC_011801. 144 SKRSGDFLYDSS----------------------------------------------EVIHFRCTVSGESDTQYMGIPP 177 (386) Q Consensus 144 ~~~~~~~~~~~~----------------------------------------------~vih~~~~~~~~~~~~~~G~s~ 177 (386) ........+.+. +|+||... . .....|.|. T Consensus 183 ~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~--~--~~~~~g~sd 258 (501) T protein:vir:25 183 KPHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNG--R--DADDMIVGE 258 (501) T ss_pred CcceeEEEecCeeEEEEecCceeeeeccccccccccccccccccccccccccCCccceeeEeccCc--c--ccCccccch Confidence 000000111111 23333211 1 112257787 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecC-CCceeeeccCCh Q lcl|NC_011801. 178 IDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLD-QSADVETTNISP 256 (386) Q Consensus 178 ~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~-~g~~~~~~~~~~ 256 (386) ++.+...++..+..........+-.+.|..++. +.. .++.+.+ +. ..++++.++ ++.++.++. . T Consensus 259 ie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~--G~~--~~~~~~~----~~-----~~~~i~~~~~~~~~~~q~~--~ 323 (501) T protein:vir:25 259 VAPLILLQQAINSVNFDRLIVSRFGANPQRVIS--GWT--GSKAEVL----KA-----SALRVWTFEDPEVKAQAFP--P 323 (501) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHh--CCC--CCccchh----hh-----cccceeccCCCCceEEEec--c Confidence 766665555555554444444444444544443 222 2222221 11 123466665 456666554 3 Q ss_pred hhHH-HHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHHHHH----HHHHHHhh-------h---- Q lcl|NC_011801. 257 NVTE-FLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYIKPI----ESELSQKL-------G---- 320 (386) Q Consensus 257 ~d~~-~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~~~i----e~~l~~~l-------~---- 320 (386) .+++ +.+.++....+|++.=++|+..++........+ +. .+....|.-.+... ...|.+.+ + T Consensus 324 ~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N~Sg~-Al-~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~~~~~~ 401 (501) T protein:vir:25 324 ASVEPYNLILEEMLQHVAMVAQISPAQVTGKMINVSAE-AL-AAAEANQQRKLAAKRESFGESWEQLLRLAAEMDDDPDT 401 (501) T ss_pred cChHHHHHHHHHHHHHHHhhcCCChhhhccccCChHHH-HH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcc Confidence 3444 789999999999999999999987543222111 11 11122222222222 22222211 1 Q ss_pred ---hhhhhcchhhhccCHHHHHHHHHHHHhCCC-----------cCHHHHHHHhccC-------------CcCCCCCCCc Q lcl|NC_011801. 321 ---TDVKLDIASAIDSDNSELINNVQKLASAGV-----------LAPIQAQKLLKNR-------------GVFPELDLDE 373 (386) Q Consensus 321 ---~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~-----------~t~nE~R~~lg~~-------------p~~p~~~~~~ 373 (386) ..+++.+.+....+..+.++++.++++.|+ +|+.|+.++.... +-.+.+..+. T Consensus 402 ~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~gis~et~~~~~~g~~~~~ie~~~~~~~e~~~~~~~~~~~~~~~~~~~~~ 481 (501) T protein:vir:25 402 AADSGAEVLWRDTEARSFGAVVDGITKLASAGIPIEHLLSMVPGMTQQTIQAIKDSLRGGEVKSLVDKLLSNEPAPVPPP 481 (501) T ss_pred ccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCCHHHHHHHcCCCCHHHHHHHHHHHHHHhHHHHHHHhhccCcCCCCCC Confidence 123445555667788999999999988773 4655543322100 0001111111 Q ss_pred ccccc--ccCCCCCC Q lcl|NC_011801. 374 GTNLL--DNTKNIND 386 (386) Q Consensus 374 ~~~~~--~~~~~~~~ 386 (386) .++.. ..+.+.++ T Consensus 482 ~~~~~~~~~~~~~~~ 496 (501) T protein:vir:25 482 PPQAAAQALNEGGVN 496 (501) T ss_pred CCCCCccccccccCC Confidence 11111 11111222 No 180 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=98.23 E-value=2.3e-06 Score=51.52 Aligned_cols=370 Identities=12% Similarity=0.075 Sum_probs=151.2 Q ss_pred Cc---------hhhhhccccccCCccc-hhhhhhcc-cccccCcccccHHH---HhccHHHHHHHHHHHHhhccCceeec Q lcl|NC_011801. 1 MA---------FLSNLFKRQKMLSGSS-PVWILNQG-QPVSIKPKAITSAI---ALKNSDVYAVISRVSSDIAGCRFVTN 66 (386) Q Consensus 1 Mg---------~~~~l~~~~~~~~~~~-~~~~~~~~-~~~~~~~~~i~~~~---a~~~~~v~~~v~~ia~~ia~~p~~~~ 66 (386) +| ++.++..........- .......+ ......+..+-.+. -..+.-..-+|+.+++.+--..+.+. T Consensus 6 ~~~~e~~~~~~~~~~l~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~~~~~~~~~~v~n~~~~iVd~~~~~l~~~g~~~~ 85 (486) T protein:vir:42 6 PGMEEIEDPAVVREEMISAFEDASKDLASNTSYYDAERRPEAIGVTVPREMQQLLAHVGYPRLYVDSVAERQAVEGFRLG 85 (486) T ss_pred CCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcccccchhHhhhhhccchHHHHHHHHHhhhcccceecC Confidence 11 1111111100000000 00000000 00000011111110 00112233455555554433445443 Q ss_pred chh-----HHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCce-------EEEEEEcCcceEEeecCCCce Q lcl|NC_011801. 67 AQP-----ITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYP-------VRIEPVPNEKVTVALDDYGKD 134 (386) Q Consensus 67 ~~~-----~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~-------~~l~~l~~~~v~~~~~~~~~~ 134 (386) ..+ +.+++.. |.. ......+..+++.+|.||+.+.++..|.. ..+.+++|..+.+..+..... T Consensus 86 ~~~~~~~~~~~i~~~--N~~---d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~i~d~~~~~ 160 (486) T protein:vir:42 86 DADEADEELWQWWQA--NNL---DIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVEPPTRMHAEIDPRINR 160 (486) T ss_pred CCchhHHHHHHHHHh--cCh---hHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEecccceEEEEeCCCCC Confidence 322 3344422 332 24566788899999999999987764432 357778888888777643322 Q ss_pred eEEEE--eccC-c-ccceeEEEcccc-------------------------eeeeccccccCcccccccccHHHH-HHHH Q lcl|NC_011801. 135 LTYTV--HFDD-S-KRSGDFLYDSSE-------------------------VIHFRCTVSGESDTQYMGIPPIDS-LLNE 184 (386) Q Consensus 135 ~~~~~--~~~~-~-~~~~~~~~~~~~-------------------------vih~~~~~~~~~~~~~~G~s~~~~-~~~~ 184 (386) ..+.+ .... . .......+.+.. |++|. +. ....+.+|.|.+.. +... T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~~~--n~-~~~~~~~G~s~i~~~v~~l 237 (486) T protein:vir:42 161 VSKAIRVAYDKEGNEIQAATLYTPMETIGWFRADGEWAEWFNVPHGLGVVPVVPLP--NR-TRLSDLYGTSEITPELRSM 237 (486) T ss_pred eEEEEEEEEecCCCeEEEEEEEcCCcEEEEEecCCcEEeecceecCCCCceEEEec--cc-cccCCCCCcccchhhHHHH Confidence 21111 1100 0 000001122222 23332 11 12344578876652 3333 Q ss_pred HHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHH--HHHHHHHHHHhcccccCcceecC-CCceeeeccCChhhHHH Q lcl|NC_011801. 185 IEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAK--ENTRQSFEEQTTGENAGRAVVLD-QSADVETTNISPNVTEF 261 (386) Q Consensus 185 i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~--~~~k~~~~~~~~~~~~g~~~vl~-~g~~~~~~~~~~~d~~~ 261 (386) ++......-......+-.+.|..++.. ........ .+-+..|+. ..+++++++ ++.+|.++.....+ .+ T Consensus 238 iDa~~~~~s~~~~~~e~~a~p~~~i~G--~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~q~~~~~~e-~~ 309 (486) T protein:vir:42 238 TDAAARILMLMQATAELMGVPQRLIFG--IKPEEIGVDSETGQTLFDA-----YLARILAFEDAEGKIQQFSAAELA-NF 309 (486) T ss_pred HHHHHHHHHHHHHHHHhhcchHHHhhc--CCccccccccccccchhhh-----hhchhcccCCCCceEEeecccCHH-HH Confidence 333333333333333334445444432 11111100 011111221 123456554 45677665543222 37 Q ss_pred HHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHH--------------HHHHHHHHHHHHHHHHHHHHHhh----hhhh Q lcl|NC_011801. 262 LQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMI--------------RAFYQSSLSIYIKPIESELSQKL----GTDV 323 (386) Q Consensus 262 ~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~--------------~~~~~~~l~P~~~~ie~~l~~~l----~~~~ 323 (386) ++.++..+.+++..=++|+..++.......+.++. +..+...|.-+++.+....+..- ...+ T Consensus 310 ~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~~d~~~i 389 (486) T protein:vir:42 310 TNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNLMFGGAWEEAMRIAYRIMKGGDVPPDMLRM 389 (486) T ss_pred HHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceee Confidence 88899999999999999999997543211111111 12222233333322221111000 0123 Q ss_pred hhcchhhhccCHHHHHHHHHHHHhC--CCcCHHHHHHHhccCCcC--------------------------CCCCC--Cc Q lcl|NC_011801. 324 KLDIASAIDSDNSELINNVQKLASA--GVLAPIQAQKLLKNRGVF--------------------------PELDL--DE 373 (386) Q Consensus 324 ~fd~~~~l~~d~~~~~~~~~~~~~~--g~~t~nE~R~~lg~~p~~--------------------------p~~~~--~~ 373 (386) ++.+..-...+..+.++++.+++++ |+++..-+++++|.-+-. +..++ .. T Consensus 390 ~v~w~~~~~~s~~~~ad~~~kl~~~~~g~~s~et~~~~lg~~~d~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 469 (486) T protein:vir:42 390 ETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARIDMGYSVKEREEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPSP 469 (486) T ss_pred eEEecCCCCCCHHHHHHHHHHHHhcccCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCCCC Confidence 3444445567888889999999876 566544444444321110 00000 00 Q ss_pred ccccccc---CCCCCC Q lcl|NC_011801. 374 GTNLLDN---TKNIND 386 (386) Q Consensus 374 ~~~~~~~---~~~~~~ 386 (386) ..++... +..+.| T Consensus 470 ~~~~~~~~~~~~~~~~ 485 (486) T protein:vir:42 470 TAPPKPQPAIESSGGD 485 (486) T ss_pred CCCCCCCcccCCCCCC Confidence 0011110 111111 No 181 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=98.23 E-value=2.3e-06 Score=51.46 Aligned_cols=373 Identities=13% Similarity=0.107 Sum_probs=169.3 Q ss_pred Cchhhhhc----cccccCCccc----hhhhh-hcccccccCc-------ccccHHHHhccHHHHHHHHHHHHhhccCc-- Q lcl|NC_011801. 1 MAFLSNLF----KRQKMLSGSS----PVWIL-NQGQPVSIKP-------KAITSAIALKNSDVYAVISRVSSDIAGCR-- 62 (386) Q Consensus 1 Mg~~~~l~----~~~~~~~~~~----~~~~~-~~~~~~~~~~-------~~i~~~~a~~~~~v~~~v~~ia~~ia~~p-- 62 (386) |+....+- +..-..+... ..+.. -.+....+.. .... +.-+...-...+++-.|+-+.+=| T Consensus 16 ~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~-~~~~s~n~~~~iv~~~a~~l~~ep~~ 94 (499) T protein:vir:80 16 MGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVN-RRQLSMNLPKVTAKYMSKLLFNEKVK 94 (499) T ss_pred hccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCccc-cceeecchHHHHHHHHHHhhhCCcce Confidence 33211110 0000000000 00000 0010010100 0000 111122334556677777666544 Q ss_pred eeecchhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCcee--E---- Q lcl|NC_011801. 63 FVTNAQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDL--T---- 136 (386) Q Consensus 63 ~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~--~---- 136 (386) +.+.+.+..+.|..-- ..-....-+..++......|.+|+.+..|.+|.+ .+..++|+.+-+...+.+... . T Consensus 95 i~~~d~~~~e~l~~~~-~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~-~i~~v~a~~~~Pi~~d~~~~~~~~f~~~ 172 (499) T protein:vir:80 95 INIDDETAEEFVLNVL-KTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNV-KVSFATADCMYPLSNDSENVDECLIANS 172 (499) T ss_pred EeeCCHHHHHHHHHHH-hhccHHHHHHHHHHHHhhcCcEEEEEEECCCCcE-EEEEEcCCceEEEEecCCCeEEEEEEEE Confidence 4455556666654311 1122344566677777889999999999888765 456677777665433222110 0 Q ss_pred -----------------------EEEec------cCcccceeEE-------------E---cccceeeeccccccCc-cc Q lcl|NC_011801. 137 -----------------------YTVHF------DDSKRSGDFL-------------Y---DSSEVIHFRCTVSGES-DT 170 (386) Q Consensus 137 -----------------------~~~~~------~~~~~~~~~~-------------~---~~~~vih~~~~~~~~~-~~ 170 (386) |.+.. .....|.++. + ..-...|++....+.. .+ T Consensus 173 ~~~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~p~f~~~~~~~~N~~~~~ 252 (499) T protein:vir:80 173 FHKNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVVPLPSLTRPTFIYIKPNIANNKNLT 252 (499) T ss_pred EeecCeEEEEEEEEEecccceeeEEEEEEEEeccCccccCcccchhhhccCcCCceeecCCCccceEeecCCccccccCC Confidence 00000 0000011110 0 0001233332211111 12 Q ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCC------CHHHHHHHHHHHHHHhcccccCcceecC Q lcl|NC_011801. 171 QYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATL------GKEAKENTRQSFEEQTTGENAGRAVVLD 244 (386) Q Consensus 171 ~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~------~~~~~~~~k~~~~~~~~~~~~g~~~vl~ 244 (386) ...|+|.+..+...++..........+-|+.|. ...++ +...+ +.+....+.... +.+.. .....-+ T Consensus 253 splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~-~~i~v--~~~~l~~~~~~~g~~~~~~~~~~-~~~~~---~~~~~~~ 325 (499) T protein:vir:80 253 SPLGISVYANALDTLKTLDLMFDSYYQEFKLGK-KKVLV--PSSFVKTAVNLDGSTTQYFDSTD-EAFFL---YQGEQDD 325 (499) T ss_pred CccCCchHhhHHHHHHHHHHHHHHHHHHHHhcc-cceec--chhhhhccCCCCCCcccCCCccc-ceeeE---eeccCCC Confidence 246999999999999988888766667777654 33333 11110 000000000000 00000 0001112 Q ss_pred CCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHH--------------HHHHHHHHHHHHHH Q lcl|NC_011801. 245 QSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMI--------------RAFYQSSLSIYIKP 310 (386) Q Consensus 245 ~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~--------------~~~~~~~l~P~~~~ 310 (386) ++..++.++....+-++.+..+...++|....|+++..++....+..+.... ...++.+|..++.. T Consensus 326 ~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~~~~l~~~~~~~~~~~~~~l~~l~~~ 405 (499) T protein:vir:80 326 NGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVS 405 (499) T ss_pred CcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2335777776666677888999999999999999999998765543221111 11222333333333 Q ss_pred HHHHHHHhh--------hhhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCC----C----cc Q lcl|NC_011801. 311 IESELSQKL--------GTDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDL----D----EG 374 (386) Q Consensus 311 ie~~l~~~l--------~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~----~----~~ 374 (386) |....+... ...+.++++.-+..|.++.++...+++.+|+|+.-+++... +-.++.+. + |- T Consensus 406 il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~---~~~~d~ea~~el~~i~~E~ 482 (499) T protein:vir:80 406 ILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTINRYTTAKNQGMIPLKIALQRA---WNITEAEADEWAEMLAKEK 482 (499) T ss_pred HHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHhhc---CCCChHHHHHHHHHHHHHh Confidence 322211111 12355677777788999999999999999999877765432 10111110 0 00 Q ss_pred cccc-ccC--C-CCCC Q lcl|NC_011801. 375 TNLL-DNT--K-NIND 386 (386) Q Consensus 375 ~~~~-~~~--~-~~~~ 386 (386) ...+ .+. + .+.+ T Consensus 483 ~~~~~~~d~~g~~ge~ 498 (499) T protein:vir:80 483 QAEIPNNDMTGIFGEE 498 (499) T ss_pred hcCCCCCCccccCCCC Confidence 0000 000 0 0111 No 182 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=98.21 E-value=2.5e-06 Score=51.28 Aligned_cols=367 Identities=11% Similarity=0.051 Sum_probs=174.5 Q ss_pred Cch-----hhhhcccccc---------CCccchhhhhhcccccccCccc-cc---HHHHhccHHHHHHHHHHHHhhccCc Q lcl|NC_011801. 1 MAF-----LSNLFKRQKM---------LSGSSPVWILNQGQPVSIKPKA-IT---SAIALKNSDVYAVISRVSSDIAGCR 62 (386) Q Consensus 1 Mg~-----~~~l~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~-i~---~~~a~~~~~v~~~v~~ia~~ia~~p 62 (386) +.. ++.+..+... ..+.+.+... .......... .. ...-+.++-...+|+..++-+-+-| T Consensus 20 ~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~--~~~~~~~~~~~~~~~~~~~ki~~~~~~~Ivd~~~~~l~g~p 97 (479) T protein:vir:79 20 STINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNK--RRYYLLDGAKVDDFTKVNNKAINNYHKLLVDQKVGYSVGNP 97 (479) T ss_pred ChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCccccc--ccccccccccccccccCcceeecchHHHHHHHHHhhhhcCC Confidence 111 1111100000 0000000000 0000000000 00 0001123444567777777777777 Q ss_pred eeec--chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCc--eeEEE Q lcl|NC_011801. 63 FVTN--AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGK--DLTYT 138 (386) Q Consensus 63 ~~~~--~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~--~~~~~ 138 (386) +.+. +.....++..--+. ........+..+.+.+|.+|..+..+..|.+ .+..++|..+.+..+.... ..... T Consensus 98 ~~~~~~~~~~~~~~~~~~~n--~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~i 174 (479) T protein:vir:79 98 IVFNADDDNLTKLLNDLLGE--EFDDTITELYLNASNKGVEWLHPYINRKGEF-KYVIIPAEEAIPIWDSKRQRELVAFI 174 (479) T ss_pred ceeccCCHHHHHHHHHHHhc--CHHHHHHHHHHHHHhcCeEEEEEEeCCCCce-EEEEEccceeEEEEeCCCCCceEEEE Confidence 6543 33333333221111 2345556777889999999999988888876 4777899988887654321 22111 Q ss_pred --Eecc--Cc-ccceeEEEcccceeeecccccc---------------------------Cc---------ccccccccH Q lcl|NC_011801. 139 --VHFD--DS-KRSGDFLYDSSEVIHFRCTVSG---------------------------ES---------DTQYMGIPP 177 (386) Q Consensus 139 --~~~~--~~-~~~~~~~~~~~~vih~~~~~~~---------------------------~~---------~~~~~G~s~ 177 (386) +... +. .......+....+.|++..... ++ .+...|.|. T Consensus 175 r~y~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~sd 254 (479) T protein:vir:79 175 RFYYIEDIDGNKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKVPFIPFKNNEKCVSD 254 (479) T ss_pred EEEEEeecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCcccEEEecCCCCCCcc Confidence 1111 10 0011112233333333211100 00 011257888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCChh Q lcl|NC_011801. 178 IDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPN 257 (386) Q Consensus 178 ~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~ 257 (386) +..+...++....+.....+.++..+.|-.+++.-+....++ +...+ ..++++.++++.+++.+..+.. T Consensus 255 ~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~----~~~~~-------~~~~~i~~~~~~~~~~l~~~~~ 323 (479) T protein:vir:79 255 LTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQE----FIDNI-------RYYKSIKVDGGGGVDKLEINIP 323 (479) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCcccccc----chhhh-------hhccceecCCCCcceEEeccCC Confidence 888888888777776666767777777766654311111222 11111 1234666777666666665555 Q ss_pred hHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccH------------HHHHHHHHHHHHHHHHHHHHHHHHHhhh----- Q lcl|NC_011801. 258 VTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSN------------ITMIRAFYQSSLSIYIKPIESELSQKLG----- 320 (386) Q Consensus 258 d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~------------~~~~~~~~~~~l~P~~~~ie~~l~~~l~----- 320 (386) +..+....+...+.|...-++|..-.+..++.+.. ....+..+...+.-+++.+...++..-+ T Consensus 324 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~ 403 (479) T protein:vir:79 324 VEAKKELLDRLEKNIIIFGQGVNPESQNTGDKSGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKISGNKSYDY 403 (479) T ss_pred HHHHHHHHHHHHHHHHHHhCccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccc Confidence 66677788888899998888886433322221110 0111233444455555554444433221 Q ss_pred hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCC----c------cccccccC-CCCCC Q lcl|NC_011801. 321 TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLD----E------GTNLLDNT-KNIND 386 (386) Q Consensus 321 ~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~----~------~~~~~~~~-~~~~~ 386 (386) ..+++.+..-+..|.++.++.+.++ .|+++...+.++++.-+ ++..... | ..+..... ++..| T Consensus 404 ~~i~i~f~~~~p~~~~~~a~~~~kl--~g~iS~et~l~~l~~v~-d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 477 (479) T protein:vir:79 404 KTVQITFNHSMIINEAEKIDMAAKS--TGIVSDETIVSNHPWVE-DVNDELERLKKQEDTQKEYDDLIPNNQDGVID 477 (479) T ss_pred ccceEEeCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhCCCCC-CHHHHHHHHHHHHHHHHHHHhccCcccCCCcC Confidence 2345556666777889999998888 47898877777654211 0011100 0 01111111 11111 No 183 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=98.14 E-value=3.7e-06 Score=50.34 Aligned_cols=365 Identities=11% Similarity=0.053 Sum_probs=164.0 Q ss_pred Cchhhhhcccc--------------------------------------ccCCccchhhhhhcccccccCcc-ccc-HHH Q lcl|NC_011801. 1 MAFLSNLFKRQ--------------------------------------KMLSGSSPVWILNQGQPVSIKPK-AIT-SAI 40 (386) Q Consensus 1 Mg~~~~l~~~~--------------------------------------~~~~~~~~~~~~~~~~~~~~~~~-~i~-~~~ 40 (386) -..|++...++ .=..+.+.+... .......+. ... +.. T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r--~~~~~~~~~~~~~~~~~ 79 (474) T protein:vir:95 2 FNIIRMPWDKPYGEEVVEQLKPQFETQEEMIIRLIDDHRKQLDKITVGQRYYDKDNDIVKQ--MKKVDVYGNIDYDKPDW 79 (474) T ss_pred cceeecCCCCchhhHHHHhhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccCchhcc--ccccccccccccccccc Confidence 00011000000 000000000000 000000000 000 000 Q ss_pred HhccHHHHHHHHHHHHhhccCceeec--chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEE Q lcl|NC_011801. 41 ALKNSDVYAVISRVSSDIAGCRFVTN--AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEP 118 (386) Q Consensus 41 a~~~~~v~~~v~~ia~~ia~~p~~~~--~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~ 118 (386) -+.++-...+++..++-+-+-|+.+. +....+.|..--+. ........+..+...+|.||+.+..+.+|++ .+.. T Consensus 80 ki~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n--~~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~-~i~~ 156 (474) T protein:vir:95 80 RITTNFHQNLVDQKVSYVASKPVTYSCEDESVLKIIHDVLDT--RWDNKLIDILTATSNKGIDWLQVYINENGEM-KLFR 156 (474) T ss_pred eeccchHHHHHHHHHhhhccCCceeccCchHHHHHHHHHHhc--cHHHHHHHHHHHHhhcCcEEEEEEecCCCce-EEEE Confidence 11233445677777776666676543 33333333221111 2334556678889999999999988888875 5777 Q ss_pred EcCcceEEeecCCC--ceeEEEEeccCcccceeEEEcccceeeecccccc------------------Cc---------c Q lcl|NC_011801. 119 VPNEKVTVALDDYG--KDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSG------------------ES---------D 169 (386) Q Consensus 119 l~~~~v~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~------------------~~---------~ 169 (386) ++|..+-+..+... ........+..........+....+.+++..... ++ . T Consensus 157 ~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~ 236 (474) T protein:vir:95 157 VPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHIQSHFSNGNWGRVPFIAFK 236 (474) T ss_pred EcccceEEEEcCCCCCceEEEEEEEEEcCeeEEEEEeCCeEEEEEEcCCccccccccCcccccccccccCCCccceEeec Confidence 88888887766432 2211111111111111122333333332211000 00 0 Q ss_pred cccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCcee Q lcl|NC_011801. 170 TQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADV 249 (386) Q Consensus 170 ~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~ 249 (386) +...|.|.+..+...++....+.....+.++..+.|..+++ +...++ .+.+.... ...+++.++++.++ T Consensus 237 nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~--g~~~~~--~~~~~~~~-------~~~~~i~~~~~~~~ 305 (474) T protein:vir:95 237 NNPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILK--GYEGQD--LEEFMRGL-------KYYKAINVDGDGGV 305 (474) T ss_pred CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee--cCCccc--chhhhhhh-------hccceeeccCCCce Confidence 11358888888888888877776666666777777765554 322221 11112211 12346667777677 Q ss_pred eeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHH--------------HHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 250 ETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMI--------------RAFYQSSLSIYIKPIESEL 315 (386) Q Consensus 250 ~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~--------------~~~~~~~l~P~~~~ie~~l 315 (386) +.+........+....+...+.|+..-++|..-.+ ...++-+..+. +..+...|..+++.|.+.+ T Consensus 306 ~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~ 384 (474) T protein:vir:95 306 ETIQVEVPVSSTKEYIDLMRAYIMEFGQGVDFQTD-KFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFN 384 (474) T ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccc-cccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 66665556666788889999999999999963221 11111111111 2233344444444444332 Q ss_pred HHhh-hhhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCC-----------------Cc-ccc Q lcl|NC_011801. 316 SQKL-GTDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDL-----------------DE-GTN 376 (386) Q Consensus 316 ~~~l-~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~-----------------~~-~~~ 376 (386) .... ...+++.++.-+..|..+.+++ ++++|+++...+.++++.-+ +|.... .. +.+ T Consensus 385 g~~~d~~~i~v~f~~~~p~d~~e~a~~---~~~~g~iS~et~i~~l~~v~-d~~~E~~ri~~E~~~~~~~~~~~~~~~~d 460 (474) T protein:vir:95 385 NLKMDVKDIEISFNFNRMMNDAEQSQI---IAQSQYLSRETLVKSSPLVD-DYKAELERIEQEQMEYNKQLPNLDDGGAD 460 (474) T ss_pred CCCcccceeeEEeccCCCcCHHHHHHH---HHhcCCCchHHHHHhCCCCC-CHHHHHHHHHHHHHHHHhcccccccccCC Confidence 2211 1122334444444555555554 55678888777766553211 000000 00 001 Q ss_pred ccccCCCCCC Q lcl|NC_011801. 377 LLDNTKNIND 386 (386) Q Consensus 377 ~~~~~~~~~~ 386 (386) ...+....+| T Consensus 461 ~~~~~~~~~~ 470 (474) T protein:vir:95 461 GAQQQERSND 470 (474) T ss_pred CCcCCCCCcc Confidence 1111111111 No 184 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=98.11 E-value=4.4e-06 Score=49.95 Aligned_cols=366 Identities=10% Similarity=-0.010 Sum_probs=162.0 Q ss_pred Cchhhhhcccc---------------------------cc-----------CCccchhhhhhcccccccCcccccHHHHh Q lcl|NC_011801. 1 MAFLSNLFKRQ---------------------------KM-----------LSGSSPVWILNQGQPVSIKPKAITSAIAL 42 (386) Q Consensus 1 Mg~~~~l~~~~---------------------------~~-----------~~~~~~~~~~~~~~~~~~~~~~i~~~~a~ 42 (386) |-=.++...++ .. ..+.+++.................+..=+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki 80 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYETQEEMIIRLINDHKPKIDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKPDWRM 80 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhccCCcchhccchhcccccccccccchhc Confidence 22111110000 00 00000000000000000000000000011 Q ss_pred ccHHHHHHHHHHHHhhccCceeec--chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEc Q lcl|NC_011801. 43 KNSDVYAVISRVSSDIAGCRFVTN--AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVP 120 (386) Q Consensus 43 ~~~~v~~~v~~ia~~ia~~p~~~~--~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~ 120 (386) .++-...+++..++-+-+-|+... +......|..--+. ........+..++..+|.||..+..+.+|++ .+..++ T Consensus 81 ~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n--~~~~~~~~~~~~~~~~G~~~~~~y~d~~~~~-~i~~~~ 157 (474) T protein:vir:96 81 FTNYHQNLVDQKVAYAVANPVTFSSDDDKSLKTIQEVLNH--KWDDKLVDILTAASNKGIEWLQPYIDENGEF-KTFRVP 157 (474) T ss_pred ccchHHHHHHhhhhhhcccCceeecCchHHHHHHHHHHhc--CHHHHHHHHHHHHHhcCeeEEEEEecCCCce-EEEEEc Confidence 234445566666666666666532 33333333221121 2234455667888899999999988888876 478899 Q ss_pred CcceEEeecCC--CceeEEEEeccCcccceeEEEcccceeeecccc----------------------ccCcc------- Q lcl|NC_011801. 121 NEKVTVALDDY--GKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTV----------------------SGESD------- 169 (386) Q Consensus 121 ~~~v~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~----------------------~~~~~------- 169 (386) |..+.+..+.. .........+..........+....+.|+.+.. ..++. T Consensus 158 p~~~~~v~d~~~~~~~~~~vr~~~~~~~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~ 237 (474) T protein:vir:96 158 AEQAIPIWTNKERDTLKAFIRYYRLDGAERVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQSHYYVGNKRVSWGRVPFIP 237 (474) T ss_pred ccceEEEEcCCCCCceEEEEEEEeecCceEEEEEeCCeEEEEEecCCceeeccccccccccccccccccccCCCceeEEE Confidence 99988887642 222211111111111111122222222221100 00000 Q ss_pred --cccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecC-CC Q lcl|NC_011801. 170 --TQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLD-QS 246 (386) Q Consensus 170 --~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~-~g 246 (386) +...|.|.+......++....+.....+.++..+.|-.+++ +.... ..+.+...+. .++++.++ .| T Consensus 238 ~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~--g~~~~--~~~~~~~~~~-------~~~~i~~~~~~ 306 (474) T protein:vir:96 238 FKNNPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILK--GYEGQ--DLDEFMRNLK-------YYKAINVDGDG 306 (474) T ss_pred eccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeee--cCCcc--cccchhhhhh-------cCceEEecCCC Confidence 11357888888888888877777777777777777765554 32211 1111111111 13456554 45 Q ss_pred ceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHH--------------HHHHHHHHHHHHHHHH Q lcl|NC_011801. 247 ADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMI--------------RAFYQSSLSIYIKPIE 312 (386) Q Consensus 247 ~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~--------------~~~~~~~l~P~~~~ie 312 (386) .+++.+........+.+..+...+.|+..-++|..-....+ ++-+..+. +..+...|..+++.|. T Consensus 307 ~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~ 385 (474) T protein:vir:96 307 SGVDTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQQDKFG-NSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYII 385 (474) T ss_pred CceeEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 66666665555556778889999999999999964332111 11121222 1233333333333332 Q ss_pred HHHHHhhh-hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCC-------------------CCC Q lcl|NC_011801. 313 SELSQKLG-TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPEL-------------------DLD 372 (386) Q Consensus 313 ~~l~~~l~-~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~-------------------~~~ 372 (386) ..+..... ..+++.+..-+..|..+.++ .+..+|+++...+.++++.-. +|.. ..+ T Consensus 386 ~~~~~~~~~~~i~i~f~~~~p~~~~e~~~---~~~~ag~iS~et~~~~~~~v~-d~~~E~~ri~~E~~e~~~~~~~~~~~ 461 (474) T protein:vir:96 386 DFYKLNIKVQDVEITFNFNVMVNELEQSQ---IGVQSQYLSKETVVTNHPWVD-DPVAELERIEQDNIDFNKQLPPLEGD 461 (474) T ss_pred HHhCCCcccceeeEEeccCCCcCHHHHHH---HHHhcCCCchHHHHHhCCCCC-CHHHHHHHHHHHHHHHHhcccccccc Confidence 22211100 12233344444455555554 456689998887777654211 0000 011 Q ss_pred ccccccccCCCCC Q lcl|NC_011801. 373 EGTNLLDNTKNIN 385 (386) Q Consensus 373 ~~~~~~~~~~~~~ 385 (386) +.+..-...+.|+ T Consensus 462 ~~~~~~d~~~e~~ 474 (474) T protein:vir:96 462 ANGRAQDNESETN 474 (474) T ss_pred cccccCCCcccCC Confidence 1111111111111 No 185 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=98.11 E-value=4.4e-06 Score=49.92 Aligned_cols=365 Identities=8% Similarity=0.064 Sum_probs=155.8 Q ss_pred Cchhh-hhccccccCCcc-chhhhhhccc-cc-ccCccccc-HHHHh----ccHHHHHHHHHHHHhhccCceeecch--- Q lcl|NC_011801. 1 MAFLS-NLFKRQKMLSGS-SPVWILNQGQ-PV-SIKPKAIT-SAIAL----KNSDVYAVISRVSSDIAGCRFVTNAQ--- 68 (386) Q Consensus 1 Mg~~~-~l~~~~~~~~~~-~~~~~~~~~~-~~-~~~~~~i~-~~~a~----~~~~v~~~v~~ia~~ia~~p~~~~~~--- 68 (386) ..++. ++++........ ......-.+. .. ........ ....+ .+....-+|+.+++.+--..+.+.+. T Consensus 15 ~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~gf~~~d~~~~ 94 (479) T protein:vir:99 15 AKYLETKVFPKMNTECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMVNSFAQQLIVDGYRKTGTNEN 94 (479) T ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHHHHHHhhcccccccCCCchhh Confidence 11110 111000000000 0000000000 00 00000000 01111 11223446666655443333444332 Q ss_pred -hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEee-----cCCCceEEEEEEcCcceEEeecCCCce--eEEEEe Q lcl|NC_011801. 69 -PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDR-----DTNGYPVRIEPVPNEKVTVALDDYGKD--LTYTVH 140 (386) Q Consensus 69 -~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~-----~~~g~~~~l~~l~~~~v~~~~~~~~~~--~~~~~~ 140 (386) .+.+++.. |.. ......+..+++.+|.||+.+.. +..|.+ .+..++|..+.+..++.... ..|.+. T Consensus 95 ~~~~~i~~~--N~~---d~~~~~~~~~a~~~G~af~~v~~~~~~~d~~g~~-~i~~~~p~~~~~iydd~~~~~~~~~~~~ 168 (479) T protein:vir:99 95 AKGWDTWRL--NQM---DKQQFWLNRAVLTFGYAFIKVTSGISPLDGTTVA-RIKCIDPRDAFAIWEDPYWDEWPKYLLE 168 (479) T ss_pred HHHHHHHHh--cCh---hHHHHHHHHHHhhcCceEEEEecCCCCcCCCCce-EEEEechhheEEEecCCcccceeeEEEe Confidence 23344433 322 24556788889999999998764 333443 46777888888776543321 122211 Q ss_pred ccCcc----------------cceeE-------EEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 141 FDDSK----------------RSGDF-------LYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAIS 197 (386) Q Consensus 141 ~~~~~----------------~~~~~-------~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~ 197 (386) ....+ .+... .+..=+|++|++.. . .+..|.|.+..+...++.......-..+ T Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~--~--~~~~g~sd~e~v~~liDa~~~~~s~~~~ 244 (479) T protein:vir:99 169 RQPNGQYWWWTEEDYSIFEFKQGKFIYRETVSHDYGHIPFVRYVNVM--D--LRGVCYGDVEPLVTVAKAIDKTGLDILL 244 (479) T ss_pred ecCceeEEEEecceEEEEEecCCceeeccccccCCCCcceEEeecCC--C--cCcCCcchhHHHHHHHHHHHHHHHHHHH Confidence 11100 00000 01112244444321 1 1236899888888777777766666556 Q ss_pred HHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCccee-cCCCceeeeccCChhhHHHHHHHHHHHHHHHHHh Q lcl|NC_011801. 198 TLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVV-LDQSADVETTNISPNVTEFLQNVSFSQDQIAKAF 276 (386) Q Consensus 198 ~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~v-l~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~ 276 (386) .++-.+.|..++.. ....++.... ...+.. ..++++. -+++.++.++... .-..+.+.++....+|+..= T Consensus 245 ~~~~~a~p~~~i~G--~~~~~~~~~~-~~~~~~-----~~~~i~~~~~~~~~~~q~~~~-~~~~~~~~l~~~i~~i~~~t 315 (479) T protein:vir:99 245 VQHHQSFQIRWATG--LMLPEGANAD-QEKMRF-----AQESMLISQNEKASFGAIPAA-PLDGLLNAYKESLLEFLALA 315 (479) T ss_pred HHHHhhchhhhhcC--CCcccccccc-hhcccc-----ccccceeecCCCceEEEeccc-chHHHHHHHHHHHHHHhccC Confidence 66666666655542 2111111000 011111 1123433 3556676655532 22336788888889999999 Q ss_pred CCCHHHhcCCcCcccHHHHH--------------HHHHHHHHHHHHHHHHHHHHHhh---hhhhhhcchhhhccCHHHHH Q lcl|NC_011801. 277 GIPADYLSGKQDAQSNITMI--------------RAFYQSSLSIYIKPIESELSQKL---GTDVKLDIASAIDSDNSELI 339 (386) Q Consensus 277 gvp~~~l~~~~~~~~~~~~~--------------~~~~~~~l~P~~~~ie~~l~~~l---~~~~~fd~~~~l~~d~~~~~ 339 (386) ++|+..+|..++.+ .++. +..+..+|.-.++.+........ ...+++.+.+....+..+.+ T Consensus 316 ~~p~~~~g~~~n~S--g~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~~~~~~~~~~~~i~~~w~~~~~~s~~~~a 393 (479) T protein:vir:99 316 QLPPHIAGQIVNVA--ADALAAGTRQTMQKLFEKQATWKASHNQTMRLVNKIEGRTEEATDLDFTITWQDVTIQSLAQFA 393 (479) T ss_pred CCCHHHcccccchH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccceeeeEEecCCCCCCHHHHH Confidence 99999987544321 1111 11222222222222211111000 01223333344456788889 Q ss_pred HHHHHHHhCCCcCHHHHHHHh-ccCC--------------------------cCCCCC--CCccccccccCCCCCC Q lcl|NC_011801. 340 NNVQKLASAGVLAPIQAQKLL-KNRG--------------------------VFPELD--LDEGTNLLDNTKNIND 386 (386) Q Consensus 340 ~~~~~~~~~g~~t~nE~R~~l-g~~p--------------------------~~p~~~--~~~~~~~~~~~~~~~~ 386 (386) +++.+++++|+++...+.+++ |..+ ..|... ..++......+.+..+ T Consensus 394 d~~~kl~~ag~is~et~l~~l~gv~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 469 (479) T protein:vir:99 394 DAWAKMVESLKIPAEGVWDMIPNLDQSTVNGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGPNGATNMQQANNKTG 469 (479) T ss_pred HHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCCCCCCCCCCCCCCCCc Confidence 999999988876554444433 2110 011100 1111111122222111 No 186 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=98.07 E-value=5.3e-06 Score=49.52 Aligned_cols=361 Identities=10% Similarity=0.041 Sum_probs=164.6 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec--chhHHHHHhccC Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN--AQPITDVLNAPL 78 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~--~~~~~~~l~~~P 78 (386) +--+.++............. ....+ ......+ + .-+.+.-....++..++-+-+-|+.+. +....+.|..-- T Consensus 46 ~~~l~~Yy~g~~~i~~~~~~-~~~~~--~~~~~~~-~--~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~~~~l~~~~ 119 (474) T protein:vir:96 46 INVGQKYYDKDNDINYQAYK-QDLHG--NIDYTKP-D--WRITTNFHQNLVDQKVSYVAGKPVTYAHDDDKVLDVIHQVL 119 (474) T ss_pred HHHHHHHhcccCccccccch-hhhcc--ccccccc-c--cccccchHHHHHHhhhhhhcccCceeccCChHHHHHHHHHH Confidence 11111111111000000000 00000 0000000 0 001233445667777777767776643 333333332211 Q ss_pred cccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCC--CceeEEEEeccCcccceeEEEcccc Q lcl|NC_011801. 79 GNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDY--GKDLTYTVHFDDSKRSGDFLYDSSE 156 (386) Q Consensus 79 N~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (386) +. ........+..++..+|.||..+-++.+|.+ .+..++|..+-+..+.. ...+.+...+.......-..+.... T Consensus 120 ~n--~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~ 196 (474) T protein:vir:96 120 DT--RWDNKLIDILTAASNKGIDWLQVYINEDGEL-KLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTAET 196 (474) T ss_pred hc--cHHHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeCCe Confidence 11 2445566778899999999999989888875 56778999988877543 2222221111111111112233444 Q ss_pred eeeecccccc------------------Cc---------ccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEE Q lcl|NC_011801. 157 VIHFRCTVSG------------------ES---------DTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFI 209 (386) Q Consensus 157 vih~~~~~~~------------------~~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l 209 (386) +.++.+.... ++ .+...|.|.+......++....+..-..+.++..+.|-.++ T Consensus 197 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~ 276 (474) T protein:vir:96 197 VTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYIL 276 (474) T ss_pred EEEEEEcCCceeeccccccccccCcccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhh Confidence 4433321100 00 01235788888888888887766666666667667665554 Q ss_pred eeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCc Q lcl|NC_011801. 210 KVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDA 289 (386) Q Consensus 210 ~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~ 289 (386) + +... +....+...+. ..+++.++++.+++.+.....+..+....+...+.|...-++|..-....+ + T Consensus 277 ~--g~~~--~~~~~~~~~~~-------~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~ 344 (474) T protein:vir:96 277 R--GYEG--EDLSEFMEGLK-------YYKAINVSSDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFG-S 344 (474) T ss_pred c--CCCc--ccccchhhhhh-------ccceeeccCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccccc-c Confidence 3 3221 22222222222 224666666666666665555666778888889999999999864322111 1 Q ss_pred ccHHHHH--------------HHHHHHHHHHHHHHHHHHHHHhh-hhhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHH Q lcl|NC_011801. 290 QSNITMI--------------RAFYQSSLSIYIKPIESELSQKL-GTDVKLDIASAIDSDNSELINNVQKLASAGVLAPI 354 (386) Q Consensus 290 ~~~~~~~--------------~~~~~~~l~P~~~~ie~~l~~~l-~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~n 354 (386) +.+..+. +..+...|..+++.+...+.... ...+++.+..-+..|..+.++. +..+|+++.. T Consensus 345 n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~~d~~~i~i~f~~~~p~~~~e~a~~---~~~~giiS~e 421 (474) T protein:vir:96 345 ATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIKLDAKEIEITFNFNVMVNDLEQSQI---GAQSQYLSKE 421 (474) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecCCCccCHHHHHHH---HHHcCCCChH Confidence 1222222 12233333333333332221111 0123344444445566555555 4557888887 Q ss_pred HHHHHhccCCcCCCCCCC-----------------c-cccccccCCCCCC Q lcl|NC_011801. 355 QAQKLLKNRGVFPELDLD-----------------E-GTNLLDNTKNIND 386 (386) Q Consensus 355 E~R~~lg~~p~~p~~~~~-----------------~-~~~~~~~~~~~~~ 386 (386) .++++++.-. +|...+. . +.+...+.....| T Consensus 422 t~~~~lp~v~-D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 470 (474) T protein:vir:96 422 TLVRHHPWVD-DPKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQSEN 470 (474) T ss_pred HHHHhCCCCC-CHHHHHHHHHHHHHHHHhhccccccccCCCCCCcCCCCc Confidence 7776654211 0111000 0 0000011111111 No 187 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=98.07 E-value=5.3e-06 Score=49.52 Aligned_cols=361 Identities=10% Similarity=0.041 Sum_probs=164.6 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec--chhHHHHHhccC Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN--AQPITDVLNAPL 78 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~--~~~~~~~l~~~P 78 (386) +--+.++............. ....+ ......+ + .-+.+.-....++..++-+-+-|+.+. +....+.|..-- T Consensus 46 ~~~l~~Yy~g~~~i~~~~~~-~~~~~--~~~~~~~-~--~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~~~~l~~~~ 119 (474) T protein:vir:95 46 INVGQKYYDKDNDINYQAYK-QDLHG--NIDYTKP-D--WRITTNFHQNLVDQKVSYVAGKPVTYAHDDDKVLDVIHQVL 119 (474) T ss_pred HHHHHHHhcccCccccccch-hhhcc--ccccccc-c--cccccchHHHHHHhhhhhhcccCceeccCChHHHHHHHHHH Confidence 11111111111000000000 00000 0000000 0 001233445667777777767776643 333333332211 Q ss_pred cccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCC--CceeEEEEeccCcccceeEEEcccc Q lcl|NC_011801. 79 GNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDY--GKDLTYTVHFDDSKRSGDFLYDSSE 156 (386) Q Consensus 79 N~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (386) +. ........+..++..+|.||..+-++.+|.+ .+..++|..+-+..+.. ...+.+...+.......-..+.... T Consensus 120 ~n--~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~ 196 (474) T protein:vir:95 120 DT--RWDNKLIDILTAASNKGIDWLQVYINEDGEL-KLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTAET 196 (474) T ss_pred hc--cHHHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeCCe Confidence 11 2445566778899999999999989888875 56778999988877543 2222221111111111112233444 Q ss_pred eeeecccccc------------------Cc---------ccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEE Q lcl|NC_011801. 157 VIHFRCTVSG------------------ES---------DTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFI 209 (386) Q Consensus 157 vih~~~~~~~------------------~~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l 209 (386) +.++.+.... ++ .+...|.|.+......++....+..-..+.++..+.|-.++ T Consensus 197 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~ 276 (474) T protein:vir:95 197 VTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYIL 276 (474) T ss_pred EEEEEEcCCceeeccccccccccCcccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhh Confidence 4433321100 00 01235788888888888887766666666667667665554 Q ss_pred eeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCc Q lcl|NC_011801. 210 KVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDA 289 (386) Q Consensus 210 ~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~ 289 (386) + +... +....+...+. ..+++.++++.+++.+.....+..+....+...+.|...-++|..-....+ + T Consensus 277 ~--g~~~--~~~~~~~~~~~-------~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~ 344 (474) T protein:vir:95 277 R--GYEG--EDLSEFMEGLK-------YYKAINVSSDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFG-S 344 (474) T ss_pred c--CCCc--ccccchhhhhh-------ccceeeccCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccccc-c Confidence 3 3221 22222222222 224666666666666665555666778888889999999999864322111 1 Q ss_pred ccHHHHH--------------HHHHHHHHHHHHHHHHHHHHHhh-hhhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHH Q lcl|NC_011801. 290 QSNITMI--------------RAFYQSSLSIYIKPIESELSQKL-GTDVKLDIASAIDSDNSELINNVQKLASAGVLAPI 354 (386) Q Consensus 290 ~~~~~~~--------------~~~~~~~l~P~~~~ie~~l~~~l-~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~n 354 (386) +.+..+. +..+...|..+++.+...+.... ...+++.+..-+..|..+.++. +..+|+++.. T Consensus 345 n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~~d~~~i~i~f~~~~p~~~~e~a~~---~~~~giiS~e 421 (474) T protein:vir:95 345 ATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIKLDAKEIEITFNFNVMVNDLEQSQI---GAQSQYLSKE 421 (474) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecCCCccCHHHHHHH---HHHcCCCChH Confidence 1222222 12233333333333332221111 0123344444445566555555 4557888887 Q ss_pred HHHHHhccCCcCCCCCCC-----------------c-cccccccCCCCCC Q lcl|NC_011801. 355 QAQKLLKNRGVFPELDLD-----------------E-GTNLLDNTKNIND 386 (386) Q Consensus 355 E~R~~lg~~p~~p~~~~~-----------------~-~~~~~~~~~~~~~ 386 (386) .++++++.-. +|...+. . +.+...+.....| T Consensus 422 t~~~~lp~v~-D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 470 (474) T protein:vir:95 422 TLVRHHPWVD-DPKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQSEN 470 (474) T ss_pred HHHHhCCCCC-CHHHHHHHHHHHHHHHHhhccccccccCCCCCCcCCCCc Confidence 7776654211 0111000 0 0000011111111 No 188 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=98.05 E-value=6.1e-06 Score=49.19 Aligned_cols=374 Identities=12% Similarity=0.051 Sum_probs=156.0 Q ss_pred CchhhhhccccccCCccchhh-hhhcc-cccccCcccccHHH-H--hccHHHHHHHHHHHHhhccCceeecch-----hH Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVW-ILNQG-QPVSIKPKAITSAI-A--LKNSDVYAVISRVSSDIAGCRFVTNAQ-----PI 70 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~-~~~~~-~~~~~~~~~i~~~~-a--~~~~~v~~~v~~ia~~ia~~p~~~~~~-----~~ 70 (386) +-++.+|.+........-... ..-.+ .....-+..+..+. . .......-||+-+++.+.--.|.+-+. .+ T Consensus 17 ~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~~~v~nw~~~~Vd~~a~rl~~~Gf~~~d~~~~~~~l 96 (474) T protein:vir:81 17 NALINGLLAQIENLRWKNLLRTSYYENKRTIQYVGTLIPPQYFNLGLVLGWTGKAVDALARRCNLEGFVWPDGDLDSLGG 96 (474) T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHhccCCChhhccccccHHHHHHHhhcChHHHHHHHHHhhhcccceECCCCCccchHH Confidence 333333322111110000000 00000 00000111111110 0 111222345555555444344443221 23 Q ss_pred HHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCce-EEEEEEcCcceEEeecCCCceeEEEEe---ccCccc Q lcl|NC_011801. 71 TDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYP-VRIEPVPNEKVTVALDDYGKDLTYTVH---FDDSKR 146 (386) Q Consensus 71 ~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~-~~l~~l~~~~v~~~~~~~~~~~~~~~~---~~~~~~ 146 (386) .+++.. |... .....+..+.+.+|.||+.+..+.+|.+ ..+.+++|..+....|.......+.+. .+..+. T Consensus 97 ~~iw~~--N~ld---~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp~~~~~~~D~~~~~~~~al~~~~~~~~g~ 171 (474) T protein:vir:81 97 TEVVDD--NHLL---SEIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDASEATGEWNRRRRGLNNLLSIIDKDKEGK 171 (474) T ss_pred HHHHHh--cChh---HHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceEEEEEeCCCCcceeeeEEEEEcCCCc Confidence 344422 3332 3556677889999999999998888775 567889999988777654443222111 110000 Q ss_pred -ceeEEEcccc-------------------------eeeeccccccCcccccccccHH----HHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 147 -SGDFLYDSSE-------------------------VIHFRCTVSGESDTQYMGIPPI----DSLLNEIEVQDLSSKLAI 196 (386) Q Consensus 147 -~~~~~~~~~~-------------------------vih~~~~~~~~~~~~~~G~s~~----~~~~~~i~~~~~~~~~~~ 196 (386) .....+.++. |++|.+. + ..++.+|.|.+ ..+...+.....-..... T Consensus 172 ~~~~~ly~~~~~~~~~~~~~~~~w~~~~~~~~~gvPvV~~~n~--~-~~~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~ 248 (474) T protein:vir:81 172 VLSLALYLDNETVTAQRDKATLKWQVDRDEHVYGVPAQVLPYK--P-APKRPFGQSRITKPMMGLQDAGVRELARREGHM 248 (474) T ss_pred EEEEEEEeCCcEEEEEEcCccceeeeccCCCCCCcceEEeccc--c-cccCcCCccccchhHHHHHHHHHHHHHHHHHHH Confidence 0011122222 3333221 1 12234677754 333333333333333344 Q ss_pred HHHhccCCCceEEee-CCCCC---CHHHHHHHHHHHHHHhcc-cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHH Q lcl|NC_011801. 197 STLRHAIKPSIFIKV-PNATL---GKEAKENTRQSFEEQTTG-ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQ 271 (386) Q Consensus 197 ~~~~ng~~~~~~l~~-~~~~~---~~~~~~~~k~~~~~~~~~-~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~ 271 (386) .|+ +.|.-++.. ..... +......++........- .+.........+.++-++.... -..|++.++..+.. T Consensus 249 e~~---a~pqr~i~G~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~~a~-l~~~~~~l~~~~~~ 324 (474) T protein:vir:81 249 DVF---SYPEFWLLGADESALKNADGTIKSVWEARLGRIKGLPDDADADIPQLARADVKQFPAAS-PDAHWSDINGLAKL 324 (474) T ss_pred HHh---cchhheeecCChhhcccccccccchhhhhHHHHhcCCCcccccccccccccccccCCCC-hhHHHHHHHHHHHH Confidence 444 344444431 11111 111223444444433221 1111122222345666555432 22388999999999 Q ss_pred HHHHhCCCHHHhcCCcCcc-cHHHHHH---HHHHHHHHHHHHHHHHHHHHhh-------h-----------hhhhhcchh Q lcl|NC_011801. 272 IAKAFGIPADYLSGKQDAQ-SNITMIR---AFYQSSLSIYIKPIESELSQKL-------G-----------TDVKLDIAS 329 (386) Q Consensus 272 Ia~~~gvp~~~l~~~~~~~-~~~~~~~---~~~~~~l~P~~~~ie~~l~~~l-------~-----------~~~~fd~~~ 329 (386) +|..=++|+..||.....+ .+.+..+ .=+.....-..+.|.+.+.+.+ + ..++..+.+ T Consensus 325 ~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~~~v~W~d 404 (474) T protein:vir:81 325 FAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKVAIDEIPDEWKSIDAKWRD 404 (474) T ss_pred HHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccchhhccceeEecC Confidence 9999999999998542111 1111111 1111112222222333332211 0 112223444 Q ss_pred hhccCHHHHHHHHHHHHhCCC--cCHHHHHHHhccCCcCCC-----CCCCccccccc--cCCCCCC Q lcl|NC_011801. 330 AIDSDNSELINNVQKLASAGV--LAPIQAQKLLKNRGVFPE-----LDLDEGTNLLD--NTKNIND 386 (386) Q Consensus 330 ~l~~d~~~~~~~~~~~~~~g~--~t~nE~R~~lg~~p~~p~-----~~~~~~~~~~~--~~~~~~~ 386 (386) ....+..++++++.|++++|. ....-+++++|..|..-. ....++..++. ..+.+.. T Consensus 405 ~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~t~~~i~~~~~~~~~~~~~~~~~~l~~~~~~~ 470 (474) T protein:vir:81 405 PRYLSKSAQADAGMKQLAAVPWLAETEVGLELIGLTPQQARRAMADKRRVQGRGTLQALIDRSNNG 470 (474) T ss_pred CCccCHHHHHHHHHHHHhcccCCCcHHHHHhhcCCCHHHHHHHHHHHHHHhHHHHHHHHHhcCCCC Confidence 456678889999999998763 333445665554322100 00000111110 0111111 No 189 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=98.00 E-value=7.6e-06 Score=48.64 Aligned_cols=368 Identities=11% Similarity=0.060 Sum_probs=147.2 Q ss_pred CchhhhhccccccCCccchh-hhhhcc-cccccCcccccHHHH---hccHHHHHHHHHHHHhhccCceee---------- Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPV-WILNQG-QPVSIKPKAITSAIA---LKNSDVYAVISRVSSDIAGCRFVT---------- 65 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~-~~~~~~-~~~~~~~~~i~~~~a---~~~~~v~~~v~~ia~~ia~~p~~~---------- 65 (386) ..++++|..........-.. .....+ ......+..+..+.. ..+.-..-+|+.+++.+---.+.+ T Consensus 10 ~~~i~~L~~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~a~~l~~~Gf~~~~~~~~~~~~ 89 (488) T protein:vir:23 10 EKLRDQLLDAFENKQNELKSSKAYYDAERRPDAIGLAVPLDMRKYLAHVGYPRTYVDAIAERQELEGFRIPSANGEEPES 89 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhcCcccchhhhhhhhhcchHHHHHHHHHHhhhccceeccCCccccccc Confidence 22333222111000000000 000000 000001111111110 112222345555554332111211 Q ss_pred -cch----hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecC--------CCceEEEEEEcCcceEEeecCCC Q lcl|NC_011801. 66 -NAQ----PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDT--------NGYPVRIEPVPNEKVTVALDDYG 132 (386) Q Consensus 66 -~~~----~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~--------~g~~~~l~~l~~~~v~~~~~~~~ 132 (386) .+. .+.+++.. | ........+..+.+.+|.||+.+..+. .|. ..+.+++|..+.+..+... T Consensus 90 ~~d~~~~~~l~~i~~~--N---~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~-~~i~~~~p~~~~~~~d~~~ 163 (488) T protein:vir:23 90 GGENDPASELWDWWQA--N---NLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEV-PLIRVEPPTALYAEVDPRT 163 (488) T ss_pred ccchhHHHHHHHHHHh--c---ChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCc-ceEEEeccceeEEEEecCC Confidence 111 23333322 2 234566778889999999999886543 222 2466788888777766433 Q ss_pred ceeEEEEe----ccCcccceeEEEcccc-------------------------eeeeccccccCcccccccccHHHH-HH Q lcl|NC_011801. 133 KDLTYTVH----FDDSKRSGDFLYDSSE-------------------------VIHFRCTVSGESDTQYMGIPPIDS-LL 182 (386) Q Consensus 133 ~~~~~~~~----~~~~~~~~~~~~~~~~-------------------------vih~~~~~~~~~~~~~~G~s~~~~-~~ 182 (386) ....+.+. ..+........+.++. |++|++. ....+.+|.|.+.- +. T Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~---~~~~~~~G~s~i~~~v~ 240 (488) T protein:vir:23 164 RKVLYAIRAIYGADGNEIVSATLYLPDTTMTWLRAEGEWEAPTSTPHGLEMVPVIPISNR---TRLSDLYGTSEISPELR 240 (488) T ss_pred CceEEEEEEEEecCCCcEEEEEEEecCcEEEEEecCCceEeccccccCCCCcceEEeccc---cccCCcCCccchhhhHH Confidence 22221111 1110000011122222 3333321 12334577776642 22 Q ss_pred HHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHH--HHHHHHHHHhcccccCcceecCCC--ceeeeccCChhh Q lcl|NC_011801. 183 NEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKE--NTRQSFEEQTTGENAGRAVVLDQS--ADVETTNISPNV 258 (386) Q Consensus 183 ~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~--~~k~~~~~~~~~~~~g~~~vl~~g--~~~~~~~~~~~d 258 (386) ..++.............+-.+.|..++. +...++...+ .-+..|+.. .++++.+++| .+|.++..... T Consensus 241 ~l~Da~~~~~s~~~~~~~~~a~p~~~i~--G~~~~~~~~~~~~~~~~~~~~-----~~~v~~~~~g~~~~~~q~~~~~~- 312 (488) T protein:vir:23 241 SVTDAAAQILMNMQGTANLMAIPQRLIF--GAKPEELGINAETGQRMFDAY-----MARILAFEGGEGAHAEQFSAAEL- 312 (488) T ss_pred HHHHHHHHHHHHHHHHHHHhhhHHHHHh--CCCcccccccccccchhhhhh-----hhhhccCCCCCCceeEecCCCCh- Confidence 3333333332222222232333433332 2111111111 011112211 2346666665 45655544332 Q ss_pred HHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHH--------------HHHHHHHHHHHHHHHHHHHHh----hh Q lcl|NC_011801. 259 TEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIR--------------AFYQSSLSIYIKPIESELSQK----LG 320 (386) Q Consensus 259 ~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~--------------~~~~~~l~P~~~~ie~~l~~~----l~ 320 (386) ..+++.++..+.+|+..=++|+..++......-+.++.+ ..+...|.-+++.+....... .. T Consensus 313 ~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~ 392 (488) T protein:vir:23 313 RNFVDALDALDRKAASYSGLPPQYLSSSSDNPASAEAIKAAESRLVKKVERKNKIFGGAWEQAMRLAYKMVKGGDIPTEY 392 (488) T ss_pred HHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcchhh Confidence 237888999999999999999999975432211221211 122222222222222111110 00 Q ss_pred hhhhhcchhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhccCCcCCCCCCC----------------------cccc Q lcl|NC_011801. 321 TDVKLDIASAIDSDNSELINNVQKLASAG--VLAPIQAQKLLKNRGVFPELDLD----------------------EGTN 376 (386) Q Consensus 321 ~~~~fd~~~~l~~d~~~~~~~~~~~~~~g--~~t~nE~R~~lg~~p~~p~~~~~----------------------~~~~ 376 (386) ..+++.+......+..+.++++.+++++| +++..-+++++|.-+.+ ...+. +.+. T Consensus 393 ~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~~~l~~~~d~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 471 (488) T protein:vir:23 393 YRMETVWRDPSTPTYAAKADAAAKLFANGAGLIPRERGWVDMGYTIVE-REQMRQWLEQDQKQGLGLIGSLYGASTPEGK 471 (488) T ss_pred ccceEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHHhCCCCchH-HHHHHHHHHHHHHHHHHHHHHHhccCCCccc Confidence 12233444444567888899999998865 67777777776543211 00000 0000 Q ss_pred ccccCCCCCC Q lcl|NC_011801. 377 LLDNTKNIND 386 (386) Q Consensus 377 ~~~~~~~~~~ 386 (386) .-..+.+..+ T Consensus 472 ~~~~~~~~~~ 481 (488) T protein:vir:23 472 PGEAPVGEPP 481 (488) T ss_pred CCCCCCCCCC Confidence 0000001111 No 190 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=98.00 E-value=7.7e-06 Score=48.62 Aligned_cols=375 Identities=11% Similarity=0.072 Sum_probs=175.8 Q ss_pred Cchhhh--------hccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec--chhH Q lcl|NC_011801. 1 MAFLSN--------LFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN--AQPI 70 (386) Q Consensus 1 Mg~~~~--------l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~--~~~~ 70 (386) ..|.++ +-+-..-..+.+++.......... .....+ +.+.-....++..++-+-+-|+.+. +... T Consensus 46 ~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~---~~~~~k--i~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~ 120 (511) T protein:vir:93 46 SKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEE---YMADNR--VAHDYASYISDFINGYFLGNPIQYQDDDKDV 120 (511) T ss_pred HHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCccc---ccCcce--eecchHHHHHHHHhhhhcccCeeeccCChHH Confidence 111110 000000000000000000000000 000011 1233445566666666667776653 3333 Q ss_pred HHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCC--ceeEEE--Eecc--C- Q lcl|NC_011801. 71 TDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYG--KDLTYT--VHFD--D- 143 (386) Q Consensus 71 ~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~~~~~~--~~~~--~- 143 (386) ...|.. -+..-........+..++..+|.||..+.++.+|.+ .+..++|..+.+..+... ...... +... . T Consensus 121 ~~~l~~-~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~de~~~~-~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~~~ 198 (511) T protein:vir:93 121 LEVIEA-FNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDK 198 (511) T ss_pred HHHHHH-HHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccc Confidence 333322 112223445667788899999999999999888875 577889999888776542 222211 1110 0 Q ss_pred cccc---eeEEEcccceeeecccccc-------------Cc---------ccccccccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 144 SKRS---GDFLYDSSEVIHFRCTVSG-------------ES---------DTQYMGIPPIDSLLNEIEVQDLSSKLAIST 198 (386) Q Consensus 144 ~~~~---~~~~~~~~~vih~~~~~~~-------------~~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~ 198 (386) .... ....+.+..+.+++..... ++ .+...|.|.+..+...++....+..-..+. T Consensus 199 ~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~ 278 (511) T protein:vir:93 199 TDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANY 278 (511) T ss_pred cccceEEEEEEEeCCcEEEEEecCCCccccccccccccccCCCccceEEecCCCCCCCchhhHHHHHHHHHHHHHHHHHH Confidence 0000 0112344444433211100 00 011358888888888888877776666666 Q ss_pred HhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhc-ccc-cCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHh Q lcl|NC_011801. 199 LRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTT-GEN-AGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAF 276 (386) Q Consensus 199 ~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~-~~~-~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~ 276 (386) ++..+.|-.+++.. ...+.+.....++.-.-... ... .+...-.+++.++..++....+..+....+...+.|+..- T Consensus 279 ~~~~~~~~lv~~G~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s 357 (511) T protein:vir:93 279 MSDLNDAMLLIKGN-LNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFT 357 (511) T ss_pred HHHhhCcceeeecC-cccCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHh Confidence 77777765655421 22333333222211100000 000 0111223445566666555455556778888899999999 Q ss_pred CCCHHHhcC-CcCcccH------------HHHHHHHHHHHHHHHHHHHHHHHHHhhh-------hhhhhcchhhhccCHH Q lcl|NC_011801. 277 GIPADYLSG-KQDAQSN------------ITMIRAFYQSSLSIYIKPIESELSQKLG-------TDVKLDIASAIDSDNS 336 (386) Q Consensus 277 gvp~~~l~~-~~~~~~~------------~~~~~~~~~~~l~P~~~~ie~~l~~~l~-------~~~~fd~~~~l~~d~~ 336 (386) ++|..-.+. .++.+-. ....+..+..+|.-.++.|...+..... ..+++.+..-+..|.. T Consensus 358 ~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n~~ 437 (511) T protein:vir:93 358 NTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLI 437 (511) T ss_pred CCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccccceEEeCCCCCCCHH Confidence 998743321 1221100 0112334555566655555554443221 1345556666777888 Q ss_pred HHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCC------------------------CCccccccccCCCCCC Q lcl|NC_011801. 337 ELINNVQKLASAGVLAPIQAQKLLKNRGVFPELD------------------------LDEGTNLLDNTKNIND 386 (386) Q Consensus 337 ~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~------------------------~~~~~~~~~~~~~~~~ 386 (386) +.++++.++ .|+++...+.++++.-+ +|... ....+..-..+.+++| T Consensus 438 e~~~~~~kl--~g~iS~et~~~~l~~v~-d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (511) T protein:vir:93 438 EELKAYIDS--GGKISQTTLMSLFSFFQ-DPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVD 508 (511) T ss_pred HHHHHHHHH--hccCchHHHHHhCCCCC-CHHHHHHHHHHHHHHHHHHHhhhcccCCCCCCCCCCCCccccccc Confidence 899988888 47787766666653211 00000 0001111122222222 No 191 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=97.96 E-value=9.1e-06 Score=48.21 Aligned_cols=379 Identities=12% Similarity=0.043 Sum_probs=174.1 Q ss_pred Cchhhhhc-----------cccccC-Ccc-chhhhhhcccc--cccCcccccH---HHHhccHHHHHHHHHHHHhhccCc Q lcl|NC_011801. 1 MAFLSNLF-----------KRQKML-SGS-SPVWILNQGQP--VSIKPKAITS---AIALKNSDVYAVISRVSSDIAGCR 62 (386) Q Consensus 1 Mg~~~~l~-----------~~~~~~-~~~-~~~~~~~~~~~--~~~~~~~i~~---~~a~~~~~v~~~v~~ia~~ia~~p 62 (386) |..+..+. .+.... ... ........+.. .......... ..-+.+.-....++..++-+-+-| T Consensus 31 ~~~~~~~~~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p 110 (502) T protein:vir:48 31 ADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNP 110 (502) T ss_pred ccchhhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccceeecchHHHHHHHHhhhhcccC Confidence 22222111 000000 000 00000000000 0000000000 001123344567777777777777 Q ss_pred eeecc------hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCC--ce Q lcl|NC_011801. 63 FVTNA------QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYG--KD 134 (386) Q Consensus 63 ~~~~~------~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~~ 134 (386) +.+.. .++...|.. ....-........+..++..+|.||+.+.++.+|.+ .+..++|..+.++.+... .. T Consensus 111 ~~~~~~d~~~~~~~~~~l~~-~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~-~i~~~~p~~~~~vydd~~~~~~ 188 (502) T protein:vir:48 111 IRVEYDDNEDNSQNDDAIKR-IGRINDIDTHNRNLIRDLSQTGRAYEVIYRSEYDET-RIKRLSPLETFVIYDNSLEDNS 188 (502) T ss_pred eeEecCCccchhHHHHHHHH-HHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCce-EEEEEcccceEEEEcCCCCCce Confidence 76532 234444432 222223445778889999999999999999888875 467789999888776532 22 Q ss_pred eEE-E-Eec--cCcccceeEEEcccceeeecccc-------ccCc---------ccccccccHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 135 LTY-T-VHF--DDSKRSGDFLYDSSEVIHFRCTV-------SGES---------DTQYMGIPPIDSLLNEIEVQDLSSKL 194 (386) Q Consensus 135 ~~~-~-~~~--~~~~~~~~~~~~~~~vih~~~~~-------~~~~---------~~~~~G~s~~~~~~~~i~~~~~~~~~ 194 (386) ... . +.. ..........+....++++.... .+++ .+...|.|.+..+...++....+... T Consensus 189 ~~~ir~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~ 268 (502) T protein:vir:48 189 IAAVRYYNRGTLQNAKDVVEIYTNQHIYTLDASDSFNEISVTPHAFGTVPITEFLNNADGIGDYETELYLIDLYDSAESD 268 (502) T ss_pred EEEEEEEEEeecCCcEEEEEEEeCCeEEEEEeCCceeeccceecCCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHH Confidence 211 1 111 11111111123333333222110 0000 11236888898888888888877777 Q ss_pred HHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHH Q lcl|NC_011801. 195 AISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAK 274 (386) Q Consensus 195 ~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~ 274 (386) ..+.++....|-.++........++....+++... .. ....+..-..+++.+++.++....+..+....+...+.|+. T Consensus 269 ~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~ 346 (502) T protein:vir:48 269 TANHMSDMADAILAIYGDLALPQGMQASDMKRTRL-MQ-LKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHV 346 (502) T ss_pred HHHHHHHhcCceeeeecCcccccccchhhhhhcce-ee-ccccccccccccCcceeEeeecCCHHHHHHHHHHHHHHHHH Confidence 77777877777666553221112222222221100 00 00000111123445555555544445566778889999999 Q ss_pred HhCCCHHHhcC-CcCcccHHH-------------HHHHHHHHHHHHHHHHHHHHHHHhhh------hhhhhcchhhhccC Q lcl|NC_011801. 275 AFGIPADYLSG-KQDAQSNIT-------------MIRAFYQSSLSIYIKPIESELSQKLG------TDVKLDIASAIDSD 334 (386) Q Consensus 275 ~~gvp~~~l~~-~~~~~~~~~-------------~~~~~~~~~l~P~~~~ie~~l~~~l~------~~~~fd~~~~l~~d 334 (386) .-++|....+. .++. ..++ ..+..+...|.-.++.+...+...-. ..+++.+...+..| T Consensus 347 ~s~~p~~~~~~~~~n~-Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d 425 (502) T protein:vir:48 347 FTNTPDMSDNHFSGNA-SGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESRLKITFTPNLPKS 425 (502) T ss_pred HhCCCCcCccccccCc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcC Confidence 99999744432 1211 1111 11234444455555444444433211 12445566667778 Q ss_pred HHHHHHHHHHHHhCCCcCHHHHHHHhccCC--------------------cCCCCCCC--cccccc--ccCCCCCC Q lcl|NC_011801. 335 NSELINNVQKLASAGVLAPIQAQKLLKNRG--------------------VFPELDLD--EGTNLL--DNTKNIND 386 (386) Q Consensus 335 ~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p--------------------~~p~~~~~--~~~~~~--~~~~~~~~ 386 (386) .++.++++.++ .|+++...+.++++.-. .++..+.. .+.+.. .++...+| T Consensus 426 ~~e~a~~~~kl--~g~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~d~~~e~~~~~~~~ 499 (502) T protein:vir:48 426 LYEQVSILNDL--GGQVSQETALSLSGLVENPTEELDKINEESSKIDFKGYPSYFYDNVGKYTDEVKETHTDDFER 499 (502) T ss_pred HHHHHHHHHHH--hccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhhhcccccccccccccCCCccCCCCcCcCC Confidence 99999998888 46777666666554211 10000000 000000 11111111 No 192 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=97.93 E-value=1.1e-05 Score=47.85 Aligned_cols=371 Identities=11% Similarity=0.017 Sum_probs=174.6 Q ss_pred hhhhhcccccc--------CCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec--ch---h Q lcl|NC_011801. 3 FLSNLFKRQKM--------LSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN--AQ---P 69 (386) Q Consensus 3 ~~~~l~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~--~~---~ 69 (386) |+..+...... ..+......... .......+ +.+ +.+.-....|+..++-+-+-|+.+. +. . T Consensus 1 ~~~~~~~~~~~r~~~l~~yy~g~~~~~~~~~--~~~~~~~~-~~k--i~~n~~~~ivd~~~~~l~g~~~~~~~~~~~~~~ 75 (440) T protein:vir:95 1 MLAAFLGSQKQRLAILASYAQGDNFSILSGH--RRLDDEKA-DYR--VRHKWGGYISSFATGYVIGNPVSIGVMEGGSAD 75 (440) T ss_pred ChhhHHHHHHHHHHHHHHHhccCCccccccc--ccccccCC-cce--eecchHHHHHHhhhhheeccCceEeeCCCccHH Confidence 44332211000 000000000000 00000000 001 1233445566666666655565542 21 1 Q ss_pred HHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCc--eeEEEEeccCcccc Q lcl|NC_011801. 70 ITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGK--DLTYTVHFDDSKRS 147 (386) Q Consensus 70 ~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~--~~~~~~~~~~~~~~ 147 (386) ....|.. --..-........+..+.+.+|.||..+..+.+|.+ .+..++|..+.+..+.... .............. T Consensus 76 ~~~~l~~-~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~-~i~~~~p~~~~~~~d~~~~~~~~~~i~~~~~~~~~ 153 (440) T protein:vir:95 76 QLSTIKD-IEWQNDINALNSDLAFDASVYGRAYEYHFRDKDKVD-RVVLISPLEMFVIRDLTVEQNIIAAVHLPIYADKV 153 (440) T ss_pred HHHHHHH-HHHhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEecCce Confidence 2222211 000112334556777889999999999999888876 4677899999888766442 22111111111111 Q ss_pred eeEEEcccceeeecccc-----------ccCc---------ccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCce Q lcl|NC_011801. 148 GDFLYDSSEVIHFRCTV-----------SGES---------DTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSI 207 (386) Q Consensus 148 ~~~~~~~~~vih~~~~~-----------~~~~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ 207 (386) ....+....+++++... .+++ .+...|.|.+..+...++....+.....+..+..+.|-. T Consensus 154 ~~~vyt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~lida~~~~~s~~~~~~~~~~~~~~ 233 (440) T protein:vir:95 154 NMTVYTKDKVITYKPYSNNSVRLVVDDVKKHSYNDVPVVEWWNNRFRMGDYESEISLIDAYDAGQSDTANYMSDLNDAML 233 (440) T ss_pred EEEEEeCCeEEEEEEecCCccceeecceeeccCceeeEEEeeCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhccee Confidence 11123333333221100 0000 112357888888888888877777666777777777766 Q ss_pred EEeeC--CCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcC Q lcl|NC_011801. 208 FIKVP--NATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSG 285 (386) Q Consensus 208 ~l~~~--~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~ 285 (386) +++.. ....++++...+++.-.-.. .........+++.+++.+........+....+...+.|+..-++|..-.+. T Consensus 234 v~~g~~~~~~~~~e~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~ 311 (440) T protein:vir:95 234 LVKGDLDGIKLSPEDAAKMKDANMLFL--KTGISTTGQQTTADASYIYKQYDVNGTEAYKNRLANDIHRFSRIPNLDDDR 311 (440) T ss_pred eeecccccCCCCccchhhhhhccceec--ccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc Confidence 66532 12335555554443222111 111112223344455555544455557788889999999999999743321 Q ss_pred -CcCccc-H-----------HHHHHHHHHHHHHHHHHHHHHHHHHhhh-----hhhhhcchhhhccCHHHHHHHHHHHHh Q lcl|NC_011801. 286 -KQDAQS-N-----------ITMIRAFYQSSLSIYIKPIESELSQKLG-----TDVKLDIASAIDSDNSELINNVQKLAS 347 (386) Q Consensus 286 -~~~~~~-~-----------~~~~~~~~~~~l~P~~~~ie~~l~~~l~-----~~~~fd~~~~l~~d~~~~~~~~~~~~~ 347 (386) .++.+. . .+..+..+...+..+++.+...++..-+ ..+++.+...+..|..+.++.+.++ T Consensus 312 ~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~i~f~~~~p~~~~~~ad~~~kl-- 389 (440) T protein:vir:95 312 FNSTSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAINGPVIEANKLTFTFHPNIPQDVWTEIKAYIEA-- 389 (440) T ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccceEEeCCCCCCCHHHHHHHHHHH-- Confidence 111110 0 0112334455555555555555443322 2345556666778899999999888 Q ss_pred CCCcCHHHHHHHhccCCcCCCCC-----------CCcccccc-ccCCCCCC Q lcl|NC_011801. 348 AGVLAPIQAQKLLKNRGVFPELD-----------LDEGTNLL-DNTKNIND 386 (386) Q Consensus 348 ~g~~t~nE~R~~lg~~p~~p~~~-----------~~~~~~~~-~~~~~~~~ 386 (386) .|+++...+.++++.- ++... ..+..+.. ....++.| T Consensus 390 ~g~iS~et~~~~l~~~--d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 438 (440) T protein:vir:95 390 GGEISQETLMENASFT--DYKTEHSRILKQGGSSDLEIGQIVGDADVGQAD 438 (440) T ss_pred hccCcHHHHHHhCCCC--CcHHHHHHHHHHHHHhhhhHHhhccCCCCCCcC Confidence 5788876666665421 11000 00000000 11111122 No 193 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=97.92 E-value=1.1e-05 Score=47.79 Aligned_cols=374 Identities=10% Similarity=0.077 Sum_probs=171.8 Q ss_pred Cchhhhh---ccccc------cC--------Cccchh-hhhhcccccccCcc-------cc----cHHHHhccHHHHHHH Q lcl|NC_011801. 1 MAFLSNL---FKRQK------ML--------SGSSPV-WILNQGQPVSIKPK-------AI----TSAIALKNSDVYAVI 51 (386) Q Consensus 1 Mg~~~~l---~~~~~------~~--------~~~~~~-~~~~~~~~~~~~~~-------~i----~~~~a~~~~~v~~~v 51 (386) ||+|+++ +++-- .. ...++. ..........+.+. .. ..+..........++ T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIKSRPMNHLPIARTAS 80 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccccCcchhcccceecchHHHHH Confidence 9998764 22110 00 000000 00000111111110 00 011111223334555 Q ss_pred HHHHHhhccCc--eeecchhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEe-e Q lcl|NC_011801. 52 SRVSSDIAGCR--FVTNAQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVA-L 128 (386) Q Consensus 52 ~~ia~~ia~~p--~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~-~ 128 (386) +.+|+-+..=| +.+.+..+.+.|..--. .-.....+...+......|.+++.+..+. |. +.+..++++.+-+. . T Consensus 81 ~~~A~lv~~e~~~i~v~d~~~~~~l~~~l~-~n~f~~~~~~~~e~a~a~G~~a~k~~~d~-~~-~~i~~v~ad~~~P~~~ 157 (522) T protein:vir:47 81 KKIASLVYNEQATITTKNEILQKFLDDMLT-NDRFNKNFERYLESCLALGGLAMRPYIDG-DK-VRVAFIQAPVFFPLES 157 (522) T ss_pred HHHhhhhcCCcceeecCChHHHHHHHHHHh-hcchHHHHHHHHHHhhccCCEEEEEEEcC-Cc-eEEEEEcCCceEEEEE Confidence 56666555433 34445555544432110 11123345556666666777777666653 32 33444555554442 1 Q ss_pred cCCCc----------------eeEEE-----------------------Eec------c-C-cccceeEE---------- Q lcl|NC_011801. 129 DDYGK----------------DLTYT-----------------------VHF------D-D-SKRSGDFL---------- 151 (386) Q Consensus 129 ~~~~~----------------~~~~~-----------------------~~~------~-~-~~~~~~~~---------- 151 (386) +..+. ..+|. +.+ . + ...|.++. T Consensus 158 ~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l 237 (522) T protein:vir:47 158 NTQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRVNLSELDKYKNL 237 (522) T ss_pred cCCceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCccccccccccccCC Confidence 11111 00111 000 0 0 00011110 Q ss_pred -----Ecc--cc-eeeeccccccCcc-cccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEE-----eeCCCCCC Q lcl|NC_011801. 152 -----YDS--SE-VIHFRCTVSGESD-TQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFI-----KVPNATLG 217 (386) Q Consensus 152 -----~~~--~~-vih~~~~~~~~~~-~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l-----~~~~~~~~ 217 (386) ++. .. ..||+....+... +..+|+|....+...++..+..-.....-|+-|... .++ .......+ T Consensus 238 ~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~-i~v~~~~l~~~~~~~~ 316 (522) T protein:vir:47 238 EPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQRR-VIVPEHLTQRQYQRPD 316 (522) T ss_pred CCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccce-eecchHHhccCCCCCC Confidence 000 00 1244432222221 234799999999999998888777777777766542 222 21111000 Q ss_pred HHHHHHHHHHH---HHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCccc-HH Q lcl|NC_011801. 218 KEAKENTRQSF---EEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQS-NI 293 (386) Q Consensus 218 ~~~~~~~k~~~---~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~-~~ 293 (386) .+. .....| +..|.+-+. -.+++.+++.++....+-++.+..+...+.|+...|+++..++..+.+.. .. T Consensus 317 g~~--~~~~~fd~~~~~f~~~~~----~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~~kTAt 390 (522) T protein:vir:47 317 GTI--DFRPRFDVEQNVYMQIGG----SSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQGMKTAT 390 (522) T ss_pred ccc--ccccccCcccceEeecCC----CCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCccccccccHH Confidence 000 000011 111111100 01233457777777788889999999999999999999999987654332 21 Q ss_pred HH-------------HHHHHHHHHHHHHHHHHHHHHHh-h-------hhhhhhcchhhhccCHHHHHHHHHHHHhCCCcC Q lcl|NC_011801. 294 TM-------------IRAFYQSSLSIYIKPIESELSQK-L-------GTDVKLDIASAIDSDNSELINNVQKLASAGVLA 352 (386) Q Consensus 294 ~~-------------~~~~~~~~l~P~~~~ie~~l~~~-l-------~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t 352 (386) +. .+..++.+|..++..+.+..+.. + ...+.+++++-+..|.++.++...+++.+|+|+ T Consensus 391 Ei~s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~~~~~~~~~~v~aG~~s 470 (522) T protein:vir:47 391 EIVSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAELDYWAKMVAAGFST 470 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCCHHHHHHHHHHHHhcCCCC Confidence 21 23345555555555555333321 1 123457888888899999999999999999998 Q ss_pred HHHHHHHh-ccC----------------CcCCCCCCCccccccccCCCCCC Q lcl|NC_011801. 353 PIQAQKLL-KNR----------------GVFPELDLDEGTNLLDNTKNIND 386 (386) Q Consensus 353 ~nE~R~~l-g~~----------------p~~p~~~~~~~~~~~~~~~~~~~ 386 (386) +-+++..+ |.. +..|.+..-.+++ .++..-+| T Consensus 471 ~e~~i~~~~g~~eeea~~el~ri~~E~~~~~~~~~~~~~~~--~~~~~~~d 519 (522) T protein:vir:47 471 KKRAIGKTLNISGVEAEKELNAINSELLPMNDAELAIYGMH--DQNEEKAD 519 (522) T ss_pred HHHHHHhcCCCChHHHHHHHHHHHHhhccCCCCCCCCCCCC--CcccccCC Confidence 87765543 210 0000000000111 11111222 No 194 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=97.87 E-value=1.4e-05 Score=47.25 Aligned_cols=373 Identities=10% Similarity=0.029 Sum_probs=167.2 Q ss_pred Cchhhhhcccccc--C------Cccch----------------hh--hhhcccc---cccCccc-c-c----HHHHhccH Q lcl|NC_011801. 1 MAFLSNLFKRQKM--L------SGSSP----------------VW--ILNQGQP---VSIKPKA-I-T----SAIALKNS 45 (386) Q Consensus 1 Mg~~~~l~~~~~~--~------~~~~~----------------~~--~~~~~~~---~~~~~~~-i-~----~~~a~~~~ 45 (386) |+.++-+|.--.. . ..-.+ .+ ....+.. ....... . . ...-+.++ T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~ki~~n 85 (481) T protein:vir:10 6 INNINTKFSPLANDDFVVSDLAELLKEENLRNFISRHQTEQVPRLEMLESYYLNRNTDILAGERRLQKYGDKADHRAVHN 85 (481) T ss_pred eehhchhcccccCceeeeecchhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCccccccccccccceeecc Confidence 5555433211000 0 00000 00 0000000 0000000 0 0 00012234 Q ss_pred HHHHHHHHHHHhhccCceee--cchhH----HHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEE Q lcl|NC_011801. 46 DVYAVISRVSSDIAGCRFVT--NAQPI----TDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPV 119 (386) Q Consensus 46 ~v~~~v~~ia~~ia~~p~~~--~~~~~----~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l 119 (386) -...+|+..++-+-+-|+.+ .+... ..++.. | ....+...+..+.+.+|.||+.+..+.+|.+ .+..+ T Consensus 86 ~~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~~~~d~dg~~-~i~~~ 159 (481) T protein:vir:10 86 YAKYVSRFIVGYLTGNPITITHQDNQTNDKIIELNDL--N---DADEVNSDLALNLSIYGRAYEIVYRDFEDRD-TFKVL 159 (481) T ss_pred hHHHHHHHHHhhhccCCceEecCChhHHHHHHHHHHh--c---ChhHHHHHHHHHHHhcCeEEEEEEeCCCCeE-EEEEE Confidence 45567777777666666543 23333 333322 2 2446778889999999999999999888876 57788 Q ss_pred cCcceEEeecCCCc--eeEE--EEeccCccc---ceeEEEcccceeeeccccc--------cCc---------ccccccc Q lcl|NC_011801. 120 PNEKVTVALDDYGK--DLTY--TVHFDDSKR---SGDFLYDSSEVIHFRCTVS--------GES---------DTQYMGI 175 (386) Q Consensus 120 ~~~~v~~~~~~~~~--~~~~--~~~~~~~~~---~~~~~~~~~~vih~~~~~~--------~~~---------~~~~~G~ 175 (386) +|..+.+..+.... .... .+....... .....+....+.|++.... +++ .+...|. T Consensus 160 ~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~ 239 (481) T protein:vir:10 160 DPKSTFVVYDQTLDKKVVAGVRYFEKQDKDKVPVQHVEVYTTDKIYYIEIKGGTYHRVEEVEHYYNDVPIIEYLNDQFKQ 239 (481) T ss_pred cccceEEEEcCCCCCceEEEEEEEEEeeCCCceEEEEEEEecCeEEEEEecCCceeecccccccCCceeEEEeecCCCCC Confidence 99998887765431 1111 111111110 0111233333333321100 000 0123577 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCC Q lcl|NC_011801. 176 PPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNIS 255 (386) Q Consensus 176 s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~ 255 (386) |.+..+...++..........+.++..+.|..++... ...+++..+.++..-.- ..... ......+++.++.-+... T Consensus 240 ~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~-~~~~~~~~~~~~~~~~~-~~~~~-~~~~~~~~~~~~~~l~~~ 316 (481) T protein:vir:10 240 GDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGN-VDLDSEDAKAFRDANMI-HLEPG-TNANGSEGKAEVKYVYKQ 316 (481) T ss_pred CchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecC-cCCCccchhhhhhccce-ecccc-ccccCCCCCcceeEEeec Confidence 8777777777766555544555556556666655421 12344433333321100 00000 001112233445444444 Q ss_pred hhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHH--------------HHHHHHHHHHHHHHHHHHHHHhhh- Q lcl|NC_011801. 256 PNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMI--------------RAFYQSSLSIYIKPIESELSQKLG- 320 (386) Q Consensus 256 ~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~--------------~~~~~~~l~P~~~~ie~~l~~~l~- 320 (386) .....+.+..+...+.|+..-++|....+... ++.+..+. +..+...++-+++.+...++..-+ T Consensus 317 ~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~ 395 (481) T protein:vir:10 317 YDVAGVEAYKKRLQNDIHKYTNTPDLNDEQFS-GVQSGESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLTGLK 395 (481) T ss_pred CCHHHHHHHHHHHHHHHHHHhCCccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC Confidence 45566778889999999999999975554222 11121111 112222222222222222221111 Q ss_pred ----hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCc---------------CCCCCCCccccccccC Q lcl|NC_011801. 321 ----TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGV---------------FPELDLDEGTNLLDNT 381 (386) Q Consensus 321 ----~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~---------------~p~~~~~~~~~~~~~~ 381 (386) ..+++.+...+..|..+.++++.++. |+++...+.++++.-.- .+..+....++.. ++ T Consensus 396 ~~~~~~i~v~f~~~~~~~~~~~a~~~~kl~--g~is~et~~~~l~~i~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~-~~ 472 (481) T protein:vir:10 396 QHNYAELTITFTPNLPKSMMESINAFNALS--GGVSESTRLSLLDFIDNPKEELEKMQEEEAQREKQADKRGYGEAF-EN 472 (481) T ss_pred ccccceeeEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhhhccCCccC-CC Confidence 12345555666778888999988884 67876666655442110 0000011111111 11 Q ss_pred CCCCC Q lcl|NC_011801. 382 KNIND 386 (386) Q Consensus 382 ~~~~~ 386 (386) .+..| T Consensus 473 ~~~~d 477 (481) T protein:vir:10 473 HLNVD 477 (481) T ss_pred CCCCC Confidence 22222 No 195 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=97.87 E-value=1.4e-05 Score=47.24 Aligned_cols=365 Identities=11% Similarity=0.028 Sum_probs=163.0 Q ss_pred Cchhh---------------------------hhccccc-----------cCCccchhhhhhcccccccCcc-ccc-HHH Q lcl|NC_011801. 1 MAFLS---------------------------NLFKRQK-----------MLSGSSPVWILNQGQPVSIKPK-AIT-SAI 40 (386) Q Consensus 1 Mg~~~---------------------------~l~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~-~i~-~~~ 40 (386) .++|+ ++..+.. -..+.+.+.. ........+. ... ... T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~--~~~~~~~~~~~~~~~~~~ 79 (474) T protein:vir:94 2 FNIIRMPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVK--QMKKVDVHGNIDYDKPDW 79 (474) T ss_pred cccccccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhc--ccchhccccccccccCcc Confidence 01111 0000000 0000000000 0000000000 000 000 Q ss_pred HhccHHHHHHHHHHHHhhccCceeec--chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEE Q lcl|NC_011801. 41 ALKNSDVYAVISRVSSDIAGCRFVTN--AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEP 118 (386) Q Consensus 41 a~~~~~v~~~v~~ia~~ia~~p~~~~--~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~ 118 (386) -+.++-...+|+..++-+-+-|+.+. +....+.|..- .. .........+..+...+|.||+.+..+.+|.+ .+.. T Consensus 80 ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~-~~-n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~-~i~~ 156 (474) T protein:vir:94 80 RITTNFHQNLVDQKVSYVASKPVTYSCEDENVLKVIHDV-LD-TRWDNKLIDILTATSNKGIDWLQVYINENGEM-KLFR 156 (474) T ss_pred eeecchHHHHHHHHHhhhhcCCceeccCcHHHHHHHHHH-Hh-ccHHHHHHHHHHHHhhcCceEEEEEecCCCee-EEEE Confidence 01234445677777777767776543 33333333221 11 12335556677888999999999999888875 5677 Q ss_pred EcCcceEEeecCCC--ceeEEEEeccCcccceeEEEcccceeeeccccc-----------------------cCc----c Q lcl|NC_011801. 119 VPNEKVTVALDDYG--KDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVS-----------------------GES----D 169 (386) Q Consensus 119 l~~~~v~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~-----------------------~~~----~ 169 (386) ++|..+-+..+... ........+..........+....+.+++.... ..| . T Consensus 157 ~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~ 236 (474) T protein:vir:94 157 VPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIAFK 236 (474) T ss_pred EcccceEEEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCCccceEEec Confidence 88999888876432 221111111111111111122222222211000 000 0 Q ss_pred cccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCcee Q lcl|NC_011801. 170 TQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADV 249 (386) Q Consensus 170 ~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~ 249 (386) +...|.|.+..+...++....+.....+.++..+.|..+++ +...++ .+.+.... ...+++.+++|.++ T Consensus 237 nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~--g~~~~~--~~~~~~~~-------~~~~~i~~~~~~~~ 305 (474) T protein:vir:94 237 NNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILK--GYEGED--LEEFMRGL-------KYYKAINVDGDGGV 305 (474) T ss_pred CCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee--cCCccc--chhhhhhh-------hccceeeccCCCce Confidence 11368888888888888877776666666777777766554 322221 11122211 12346667777676 Q ss_pred eeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHH--------------HHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 250 ETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITM--------------IRAFYQSSLSIYIKPIESEL 315 (386) Q Consensus 250 ~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~--------------~~~~~~~~l~P~~~~ie~~l 315 (386) +.+........+.+..+...+.|...-++|..-....+ ++.+..+ .+..+...|+.+++.+.+.+ T Consensus 306 ~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~ 384 (474) T protein:vir:94 306 ETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFG-SAPSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFN 384 (474) T ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 66665555555677788888889888888853221111 1111111 12234444444444444333 Q ss_pred HHhhh-hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCC-------cccccc-ccCCCCCC Q lcl|NC_011801. 316 SQKLG-TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLD-------EGTNLL-DNTKNIND 386 (386) Q Consensus 316 ~~~l~-~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~-------~~~~~~-~~~~~~~~ 386 (386) ..... ..+++.+..-+..|..+.++ .++.+|+++...+.++++.-+ +|..... +..+.. ...+...| T Consensus 385 ~~~~d~~~i~v~f~~~~p~~~~e~a~---~~~~~g~iS~et~l~~l~~v~-D~~~E~eri~~E~~~~~~~~~~~~~~~~~ 460 (474) T protein:vir:94 385 NLKTDVKDIEISFNFNRMMNDAEQSQ---IIAQSQYLSRETLVKSSPLVD-DYKAELERIEQEQMEYNKQLPNLDDGGAD 460 (474) T ss_pred CCCcccceeeEEeccCcccCHHHHHH---HHHHcCCCCHHHHHHhCCCCC-CHHHHHHHHHHHHHHHHhhccccCCCCCC Confidence 22111 12333333334445555444 455678888877777654211 0000000 000000 00111111 No 196 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=97.87 E-value=1.4e-05 Score=47.24 Aligned_cols=365 Identities=11% Similarity=0.028 Sum_probs=163.0 Q ss_pred Cchhh---------------------------hhccccc-----------cCCccchhhhhhcccccccCcc-ccc-HHH Q lcl|NC_011801. 1 MAFLS---------------------------NLFKRQK-----------MLSGSSPVWILNQGQPVSIKPK-AIT-SAI 40 (386) Q Consensus 1 Mg~~~---------------------------~l~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~-~i~-~~~ 40 (386) .++|+ ++..+.. -..+.+.+.. ........+. ... ... T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~--~~~~~~~~~~~~~~~~~~ 79 (474) T protein:vir:97 2 FNIIRMPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVK--QMKKVDVHGNIDYDKPDW 79 (474) T ss_pred cccccccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhc--ccchhccccccccccCcc Confidence 01111 0000000 0000000000 0000000000 000 000 Q ss_pred HhccHHHHHHHHHHHHhhccCceeec--chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEE Q lcl|NC_011801. 41 ALKNSDVYAVISRVSSDIAGCRFVTN--AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEP 118 (386) Q Consensus 41 a~~~~~v~~~v~~ia~~ia~~p~~~~--~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~ 118 (386) -+.++-...+|+..++-+-+-|+.+. +....+.|..- .. .........+..+...+|.||+.+..+.+|.+ .+.. T Consensus 80 ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~-~~-n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~-~i~~ 156 (474) T protein:vir:97 80 RITTNFHQNLVDQKVSYVASKPVTYSCEDENVLKVIHDV-LD-TRWDNKLIDILTATSNKGIDWLQVYINENGEM-KLFR 156 (474) T ss_pred eeecchHHHHHHHHHhhhhcCCceeccCcHHHHHHHHHH-Hh-ccHHHHHHHHHHHHhhcCceEEEEEecCCCee-EEEE Confidence 01234445677777777767776543 33333333221 11 12335556677888999999999999888875 5677 Q ss_pred EcCcceEEeecCCC--ceeEEEEeccCcccceeEEEcccceeeeccccc-----------------------cCc----c Q lcl|NC_011801. 119 VPNEKVTVALDDYG--KDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVS-----------------------GES----D 169 (386) Q Consensus 119 l~~~~v~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~-----------------------~~~----~ 169 (386) ++|..+-+..+... ........+..........+....+.+++.... ..| . T Consensus 157 ~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~ 236 (474) T protein:vir:97 157 VPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIAFK 236 (474) T ss_pred EcccceEEEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCCccceEEec Confidence 88999888876432 221111111111111111122222222211000 000 0 Q ss_pred cccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCcee Q lcl|NC_011801. 170 TQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADV 249 (386) Q Consensus 170 ~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~ 249 (386) +...|.|.+..+...++....+.....+.++..+.|..+++ +...++ .+.+.... ...+++.+++|.++ T Consensus 237 nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~--g~~~~~--~~~~~~~~-------~~~~~i~~~~~~~~ 305 (474) T protein:vir:97 237 NNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILK--GYEGED--LEEFMRGL-------KYYKAINVDGDGGV 305 (474) T ss_pred CCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee--cCCccc--chhhhhhh-------hccceeeccCCCce Confidence 11368888888888888877776666666777777766554 322221 11122211 12346667777676 Q ss_pred eeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHH--------------HHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 250 ETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITM--------------IRAFYQSSLSIYIKPIESEL 315 (386) Q Consensus 250 ~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~--------------~~~~~~~~l~P~~~~ie~~l 315 (386) +.+........+.+..+...+.|...-++|..-....+ ++.+..+ .+..+...|+.+++.+.+.+ T Consensus 306 ~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~ 384 (474) T protein:vir:97 306 ETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFG-SAPSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFN 384 (474) T ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 66665555555677788888889888888853221111 1111111 12234444444444444333 Q ss_pred HHhhh-hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCC-------cccccc-ccCCCCCC Q lcl|NC_011801. 316 SQKLG-TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLD-------EGTNLL-DNTKNIND 386 (386) Q Consensus 316 ~~~l~-~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~-------~~~~~~-~~~~~~~~ 386 (386) ..... ..+++.+..-+..|..+.++ .++.+|+++...+.++++.-+ +|..... +..+.. ...+...| T Consensus 385 ~~~~d~~~i~v~f~~~~p~~~~e~a~---~~~~~g~iS~et~l~~l~~v~-D~~~E~eri~~E~~~~~~~~~~~~~~~~~ 460 (474) T protein:vir:97 385 NLKTDVKDIEISFNFNRMMNDAEQSQ---IIAQSQYLSRETLVKSSPLVD-DYKAELERIEQEQMEYNKQLPNLDDGGAD 460 (474) T ss_pred CCCcccceeeEEeccCcccCHHHHHH---HHHHcCCCCHHHHHHhCCCCC-CHHHHHHHHHHHHHHHHhhccccCCCCCC Confidence 22111 12333333334445555444 455678888877777654211 0000000 000000 00111111 No 197 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=97.83 E-value=1.6e-05 Score=46.86 Aligned_cols=363 Identities=12% Similarity=0.118 Sum_probs=171.0 Q ss_pred Cc----hh----hhhcccc------------ccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhcc Q lcl|NC_011801. 1 MA----FL----SNLFKRQ------------KMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAG 60 (386) Q Consensus 1 Mg----~~----~~l~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~ 60 (386) |. +. .++.... .-..+.+++.. .......+ ..-+.++-...+|+..++-+-+ T Consensus 19 ~~~~~~~~~~~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~i~~-----~~~~~~~~---~~ki~~n~~~~Ivd~~~~~l~g 90 (470) T protein:vir:99 19 FPKGEKLTSNELLGFIAYNETVLKPRYRENMKLYLGKHKILT-----APEKETGA---DNRIVVNSAKYVVDVYNGYFCG 90 (470) T ss_pred eCCCCCcCHHHHHHHHHHHHHhhHHHHHHHHHHhcccccccc-----CcccccCC---cceeecchHHHHHHHHhhhhcc Confidence 11 10 0000000 00000010000 00000000 1111233445666666666666 Q ss_pred Cceee--cch-hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCc--ee Q lcl|NC_011801. 61 CRFVT--NAQ-PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGK--DL 135 (386) Q Consensus 61 ~p~~~--~~~-~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~--~~ 135 (386) -|+.+ .++ .....|.. -............+..+.+.+|.||+.+..+.+|.+ .+..++|..+.+..+.... .. T Consensus 91 ~p~~~~~~~d~~~~~~l~~-~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~-~i~~~~p~~~~~i~d~~~~~~~~ 168 (470) T protein:vir:99 91 IEPKLALLNDSSKIDEIAR-WNRQENFFDTINEISKQCDIFGRSIASIYQGEDARP-HLMYSSPNHAFIIYDDTVQRQPL 168 (470) T ss_pred CCeeEeeCCchhHHHHHHH-HHHhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCeE-EEEEEccceeEEEEcCCCCcceE Confidence 66543 222 22222221 111224456777888899999999999999888876 5777899998887766432 11 Q ss_pred EEE-EeccCcccce---eEEEcccceeeecccc----------ccCc---------ccccccccHHHHHHHHHHHHHHHH Q lcl|NC_011801. 136 TYT-VHFDDSKRSG---DFLYDSSEVIHFRCTV----------SGES---------DTQYMGIPPIDSLLNEIEVQDLSS 192 (386) Q Consensus 136 ~~~-~~~~~~~~~~---~~~~~~~~vih~~~~~----------~~~~---------~~~~~G~s~~~~~~~~i~~~~~~~ 192 (386) ... +......... ...+..+.+++++... ..++ .+...|.|.+..+...++....+. T Consensus 169 ~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~ 248 (470) T protein:vir:99 169 AFVHYQIDNSNNWTDAYGVIQYADKFYKFKGYDIEEDTNAAGYAINPYGLVPAVEFFENEERQGIFDSIKTLINALDKVI 248 (470) T ss_pred EEEEEEEEecCCeeEEEEEEEecCeEEEEEecccccccccccccccCCCccceEeecCCCCCCcchHhHHHHHHHHHHHH Confidence 111 1111100000 1112222222211000 0000 112368888888888888887777 Q ss_pred HHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceec-----CCCceeeeccCChhhHHHHHHHHH Q lcl|NC_011801. 193 KLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVL-----DQSADVETTNISPNVTEFLQNVSF 267 (386) Q Consensus 193 ~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl-----~~g~~~~~~~~~~~d~~~~e~~~~ 267 (386) ....+.++..+.|..++.. ....++........+.. .+++.+ +++.++..+........+....+. T Consensus 249 s~~~~~~~~~~~~~~~i~g--~~~~~~~~g~~~~~~~~-------~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 319 (470) T protein:vir:99 249 SQKANQVEYFDNAYMYMIG--FKLPEDDEGNPKFDFKN-------NRVLYVSQLDPDTNPQIGFIAKPDADQMQENLIQH 319 (470) T ss_pred HHHHHHHHHhcCceeeeec--CCcccccccchhhhhhh-------cceeeecCCCCCCCCcceEEeecCChHHHHHHHHH Confidence 7777777777777666653 22222211111111211 122222 345556666655555556777889 Q ss_pred HHHHHHHHhCCCHHHhcCCcCcccHHH-------------HHHHHHHHHHHHHHHHHHHHHHHhhh-----hhhhhcchh Q lcl|NC_011801. 268 SQDQIAKAFGIPADYLSGKQDAQSNIT-------------MIRAFYQSSLSIYIKPIESELSQKLG-----TDVKLDIAS 329 (386) Q Consensus 268 ~~~~Ia~~~gvp~~~l~~~~~~~~~~~-------------~~~~~~~~~l~P~~~~ie~~l~~~l~-----~~~~fd~~~ 329 (386) ..+.|+..-++|....+...+.....+ ..+..+..+|.-.++.+...+...-. ..+++.+.. T Consensus 320 l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~ 399 (470) T protein:vir:99 320 LTDFIFMMAMVPNIQDKNFAGNSSGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFNNKQDQELWSELDFKFTR 399 (470) T ss_pred HHHHHHHHhCCccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccceEEeCC Confidence 999999999999744332111111111 11233444444444444444333211 234556666 Q ss_pred hhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCC----cc-------------ccccccCCCCCC Q lcl|NC_011801. 330 AIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLD----EG-------------TNLLDNTKNIND 386 (386) Q Consensus 330 ~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~----~~-------------~~~~~~~~~~~~ 386 (386) -+..|..+.++++.++. |+++...+.++++.- +|...+. |- .+......+..| T Consensus 400 ~~p~~~~e~a~~~~kl~--giis~et~l~~l~~v--d~~~E~eri~~E~~~~~~~~~~~~~~~d~~~~d~~~ee 469 (470) T protein:vir:99 400 NLPEDMASAIDNAKNAE--GIVSKKTQLGMIPDI--EPDAEMKQIAKEKADAIKQTQQLSMPIDILKRDNNAEE 469 (470) T ss_pred CCCcCHHHHHHHHHHHh--ccCCHHHHHHhCCCC--CHHHHHHHHHHHHHHHHHHHHhhcCCCCcCCCCCCccC Confidence 67778899999988885 678877777765431 1111100 00 000011111111 No 198 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=97.76 E-value=2.2e-05 Score=46.16 Aligned_cols=367 Identities=11% Similarity=0.049 Sum_probs=168.8 Q ss_pred Cch------------------hhhhcccccc-----------CCccchhhhhhcccccccCcccccHHHHhccHHHHHHH Q lcl|NC_011801. 1 MAF------------------LSNLFKRQKM-----------LSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVI 51 (386) Q Consensus 1 Mg~------------------~~~l~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v 51 (386) |.. +.++...... ..+.+++. ......... .+.+ +.++-....| T Consensus 1 ~~~~~~~~~~~p~d~~~~~~~l~~~i~~~~~~~~r~~~~~~yy~g~~~i~----~~~~~~~~~-~~~k--i~~n~~~~iv 73 (453) T protein:vir:39 1 MKYKPPKLMTFPKDEPITNEVVTKFMEKHRLEVARYEYLKNMYRGIMAID----AEPTKDLWK-PDNR--LTVNFTKYIV 73 (453) T ss_pred CeecCCcceEcCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhhccCchh----cCCCccccC-ccce--eecchHHHHH Confidence 211 1111100000 00000000 000000000 0111 1233445667 Q ss_pred HHHHHhhccCceeec--chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeec Q lcl|NC_011801. 52 SRVSSDIAGCRFVTN--AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALD 129 (386) Q Consensus 52 ~~ia~~ia~~p~~~~--~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~ 129 (386) +..++-+-+-|+.+. +......|.+--+. -........+..+.+.+|.||+.+..+.+|.+ .+..++|..+.+..+ T Consensus 74 d~~~~~l~g~~~~~~~~d~~~~~~l~~i~~~-N~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d 151 (453) T protein:vir:39 74 DTFTGYFNGIPVKKSHSDKETLSKLQEFDNL-NDMEDEESELAKMACIYGRAFELLYQNEETQT-NVIYNTPENMFMVYD 151 (453) T ss_pred HHHhhhhcccCceeccCChHHHHHHHHHHHh-cChhHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEEcccceEEEec Confidence 777776666676543 33333333221111 12334667788899999999999999888875 466688888888776 Q ss_pred CCCc-eeEEE--EeccCcccceeEEEcccceeeeccccc--------cCc---------ccccccccHHHHHHHHHHHHH Q lcl|NC_011801. 130 DYGK-DLTYT--VHFDDSKRSGDFLYDSSEVIHFRCTVS--------GES---------DTQYMGIPPIDSLLNEIEVQD 189 (386) Q Consensus 130 ~~~~-~~~~~--~~~~~~~~~~~~~~~~~~vih~~~~~~--------~~~---------~~~~~G~s~~~~~~~~i~~~~ 189 (386) .... ...+. +............+.+..+.++..... +++ .+...|.|.+..+...++... T Consensus 152 ~~~~~~~~~~ir~~~~~~~~~~~~~yt~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~ 231 (453) T protein:vir:39 152 DTIKQEPLFAVRYGYDDDYKLYGEVYTKETTYALNGTMGFYNMTEQAPNPFDDLPVVEFYFNEERMSIFESVISLVNAFN 231 (453) T ss_pred CCCCCeEEEEEEEEEeCCeEEEEEEEeCCeEEEEEecCCceeeecccccCCCceeEEEecCCCCCCcchhhhHHHHHHHH Confidence 5332 11111 111111110111122222222221100 000 011358888888888887777 Q ss_pred HHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHH Q lcl|NC_011801. 190 LSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQ 269 (386) Q Consensus 190 ~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~ 269 (386) .+..-..+.++..+.|..+++ +..++++..+.++..- ..... . ....+++.++..++.......+.+..+... T Consensus 232 ~~~s~~~~~~~~~~~p~~~~~--g~~~~~~~~~~~~~~~--~~~~~-~--~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~ 304 (453) T protein:vir:39 232 KAISEKANDVDYFSDQYLTFL--GAAVEEEDLKNIRSNR--VINYY-G--ESSEAKNVDVKFLEKPDSDSQTENLLDRLT 304 (453) T ss_pred HHHHHHHHHHHHhhCceeeee--cCCCCchhhhhhhhcc--eeeec-C--CCCCCCCCceeEEeecCCHHHHHHHHHHHH Confidence 666666666667777766654 3344555444433210 00000 0 011123444444444444555677788888 Q ss_pred HHHHHHhCCCHHHhcCCcCcccH------------HHHHHHHHHHHHHHHHHHHHHHHHHhh----hhhhhhcchhhhcc Q lcl|NC_011801. 270 DQIAKAFGIPADYLSGKQDAQSN------------ITMIRAFYQSSLSIYIKPIESELSQKL----GTDVKLDIASAIDS 333 (386) Q Consensus 270 ~~Ia~~~gvp~~~l~~~~~~~~~------------~~~~~~~~~~~l~P~~~~ie~~l~~~l----~~~~~fd~~~~l~~ 333 (386) +.|+..-++|..-.+..++.+.. ....+..+..++..+++.+...++..- ...+++.+..-+.. T Consensus 305 ~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~v~f~~~~p~ 384 (453) T protein:vir:39 305 KLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVSNKEAWKDIEYTFTRNEPK 384 (453) T ss_pred HHHHHHhCCcccccccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCc Confidence 88888888885322211111110 011233445555655555554433221 12345566666777 Q ss_pred CHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCC----c---cc---cccccCCCCCC Q lcl|NC_011801. 334 DNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLD----E---GT---NLLDNTKNIND 386 (386) Q Consensus 334 d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~----~---~~---~~~~~~~~~~~ 386 (386) |..+.++++.++ .|+++...+.++++.-+ +|...+. | .. ....++..+.+ T Consensus 385 ~~~~~a~~~~kl--~g~is~et~l~~l~~v~-D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 444 (453) T protein:vir:39 385 DIKEQAETANIL--MGITSQETALSVISVIP-DVQAEMEKIKKEEASTAIFDKDKQPSEKGTD 444 (453) T ss_pred CHHHHHHHHHHH--hccCChHHHHHhCCCCC-CHHHHHHHHHHHHHHHHHHHHhccCCCCCCC Confidence 889999998888 57888877776654211 0111100 0 00 00011111111 No 199 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=97.72 E-value=2.5e-05 Score=45.77 Aligned_cols=375 Identities=11% Similarity=0.067 Sum_probs=175.5 Q ss_pred Cchhhh--------hccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec--chhH Q lcl|NC_011801. 1 MAFLSN--------LFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN--AQPI 70 (386) Q Consensus 1 Mg~~~~--------l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~--~~~~ 70 (386) ..|+++ +.+-..-..+.+++.... .......-...-+.+.-....++..++-+-+-|+.+. +... T Consensus 46 ~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~-----~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~ 120 (511) T protein:vir:96 46 SKYIEHHMDYQRPRLKVLSDYYEGKTKNLVEL-----TRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDV 120 (511) T ss_pred HHHHHHHHHhhHHHHHHHHHHhcccCcccccc-----CcCcccccCcceeecchHHHHHHHHHhhhccCCceeecCchHH Confidence 111110 000000000000000000 0000000000001123345566666766667776543 3334 Q ss_pred HHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCC--ceeEE--EEecc--Cc Q lcl|NC_011801. 71 TDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYG--KDLTY--TVHFD--DS 144 (386) Q Consensus 71 ~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~~~~~--~~~~~--~~ 144 (386) ...|.. -...-........+..++..+|.||..+-++.+|.+ .+..++|..+.+..+... ..+.. .+... .. T Consensus 121 ~~~l~~-~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~-~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~ 198 (511) T protein:vir:96 121 LEAIEA-FNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDK 198 (511) T ss_pred HHHHHH-HHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccc Confidence 444432 122223445667788899999999999999888865 577889999888776532 11111 11110 00 Q ss_pred ccce----eEEEcccceeeecccccc-------------C-----c----ccccccccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 145 KRSG----DFLYDSSEVIHFRCTVSG-------------E-----S----DTQYMGIPPIDSLLNEIEVQDLSSKLAIST 198 (386) Q Consensus 145 ~~~~----~~~~~~~~vih~~~~~~~-------------~-----~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~ 198 (386) .... ...+.+..+.+++..... + | .+...|.|.+..+...++....+..-..+. T Consensus 199 ~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~ 278 (511) T protein:vir:96 199 TDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANY 278 (511) T ss_pred cccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCCceeeEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHH Confidence 0000 012333333332210000 0 0 011368888888888888877776666777 Q ss_pred HhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcc-cc-cCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHh Q lcl|NC_011801. 199 LRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTG-EN-AGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAF 276 (386) Q Consensus 199 ~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~-~~-~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~ 276 (386) ++..+.|-.+++.. ...+.++....++.-.-.... .. .+...-.+++.++..++.......+....+...+.|...- T Consensus 279 ~~~~~~~~lv~~g~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s 357 (511) T protein:vir:96 279 MSDLNDAMLLIKGN-LNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFT 357 (511) T ss_pred HHHhhCceeeeecC-ccCCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHh Confidence 77777776655431 223333332222111000000 00 0111223445566666655555666788888999999999 Q ss_pred CCCHHHhcC-CcCcccH------------HHHHHHHHHHHHHHHHHHHHHHHHHhhh-------hhhhhcchhhhccCHH Q lcl|NC_011801. 277 GIPADYLSG-KQDAQSN------------ITMIRAFYQSSLSIYIKPIESELSQKLG-------TDVKLDIASAIDSDNS 336 (386) Q Consensus 277 gvp~~~l~~-~~~~~~~------------~~~~~~~~~~~l~P~~~~ie~~l~~~l~-------~~~~fd~~~~l~~d~~ 336 (386) ++|..-.+. .++.+-. .+..+..+..+|.-.++.|...+..... ..+++.+..-+..|.. T Consensus 358 ~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~~ 437 (511) T protein:vir:96 358 NTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLI 437 (511) T ss_pred CCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccccccceEEeCCCCCCCHH Confidence 998744321 1211100 0112334555555555555554443321 1345556666777889 Q ss_pred HHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCC------------------------CCccccccccCCCCCC Q lcl|NC_011801. 337 ELINNVQKLASAGVLAPIQAQKLLKNRGVFPELD------------------------LDEGTNLLDNTKNIND 386 (386) Q Consensus 337 ~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~------------------------~~~~~~~~~~~~~~~~ 386 (386) +.++++.++ .|+++...+.++++.-. +|... ....++.-...++.+| T Consensus 438 e~~~~~~kl--~G~iS~et~l~~l~~v~-D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (511) T protein:vir:96 438 EELKAYIDS--GGKISQTTLMSLFSFFQ-DPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVD 508 (511) T ss_pred HHHHHHHHH--hccCChHHHHHhCCCCC-CHHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCccccccc Confidence 999988887 57888777766654211 00000 0001111111222222 No 200 >protein:vir:97376 Length: 320 # NCBI annotation: putative portal protein # Family: family:all:11744 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762589;genbank:gi:115304290;genbank:GeneID:5130579 Probab=97.71 E-value=1e-06 Score=53.43 Aligned_cols=312 Identities=12% Similarity=0.132 Sum_probs=146.5 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeecch--hHHHHHhccC Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTNAQ--PITDVLNAPL 78 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~~--~~~~~l~~~P 78 (386) ||+|. +.+|...++.-...+..... .. .+....-.+-+.+-+-+|+.||.- +-.++. ....+|.. T Consensus 1 ~~~~~-~~~~~~~~~~~~~~~~~~~~----~~-----~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~-- 67 (320) T protein:vir:97 1 MGIFN-FKKRETLTPELKESIIRQVT----IE-----DESPFTGTTDFNVRNEVAESIATY-LGAYKTSAKRLSLLTN-- 67 (320) T ss_pred CCccc-cccccccChhHHhhhhheee----ec-----cCCCcccccccchhhHHHHHHHHH-hhhhccccceeeeeeC-- Confidence 99995 33333333322211111110 00 000000001111122222322210 000111 11122322 Q ss_pred cccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEEEeccCc-ccceeEEEcccce Q lcl|NC_011801. 79 GNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYTVHFDDS-KRSGDFLYDSSEV 157 (386) Q Consensus 79 N~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~v 157 (386) -..|++.++.+.+..-..|++..... |+.+ .+.+++.- -...+.+. ..+. ..-....+|-.|+ T Consensus 68 -----~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~------~~~~~~~~--~~~~~~~~--~~D~FN~~V~mtvpfyD~ 131 (320) T protein:vir:97 68 -----NPSFLRRLVKHALHNKTTYVYKSPTY-GWLI------TDSMTIEG--LRARLTFT--LPDPFNSAVTMTVPFYDV 131 (320) T ss_pred -----CHHHHHHHHHHhhcccceEEeeCCcc-ceee------ecceeeee--eeeeEEEe--cCcccceeEEEEeeeech Confidence 23699999999999888998875532 3221 12222210 00011110 0000 0001122222222 Q ss_pred eeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccc- Q lcl|NC_011801. 158 IHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGEN- 236 (386) Q Consensus 158 ih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~- 236 (386) .-..++++|..+-++ -.+...+.....+-+.|-+....+++.+...-=++-.++....+.++..-.+ T Consensus 132 --------~ILdnpl~gv~tqe~----gkM~g~a~~~v~kkL~~~~~IKafi~Tdid~GLee~kD~~~~kIk~mq~~A~~ 199 (320) T protein:vir:97 132 --------GIIDSPLVEVDTEEA----NKMLEAAYSAVMKKLHNTGAIKAFISSDIDVGLEKMKEESDSKIKAMLATAEL 199 (320) T ss_pred --------hhhhhhhcccChHHh----hHHHHHHhhhhhhhccccceeEEEEecccchhHHHHHHHHHHHHHHHHHHHHH Confidence 122345677776632 2233444455677778888888888765321115666666666655443332 Q ss_pred cCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHHHHH---HH Q lcl|NC_011801. 237 AGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYIKPI---ES 313 (386) Q Consensus 237 ~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~~~i---e~ 313 (386) -.++-+++.|-+++++...-.-.. ..-.+..+...+.-|+||..+|. ++.++.+..+|+...+.|+++|+ |. T Consensus 200 ~nG~T~i~~~dDI~Qi~pDYS~sn-~~D~~l~~t~alS~y~m~~~IL~----GsAte~~~Iaf~~~~V~PLL~Q~~~~Ek 274 (320) T protein:vir:97 200 LSGYTYIQRGDDVTQMMPDYTTSN-VTDFAAMRTFAASQLSVSDKILD----GSATDGEKVAVMFRFVEPILEQFREYEP 274 (320) T ss_pred hcCcccccCCcceeeecccccccc-hhHHHHHHHHHHhhcCCchhhcc----ccCCcceeeehhhHhHHHHHHHhhhcCc Confidence 234888899999999876543332 33455667788889999999986 44556677889999999999997 43 Q ss_pred HHHHhhhhhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCC-CCCCcccc Q lcl|NC_011801. 314 ELSQKLGTDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPE-LDLDEGTN 376 (386) Q Consensus 314 ~l~~~l~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~-~~~~~~~~ 376 (386) .|..++-. ||-+. ++. -+|-+..|.+-. +|++-.+.+ -+++-|+. T Consensus 275 ~Lvy~m~~--E~FVs-~mt--------------TGG~l~S~~~~~-~~~~~~~~~~~~~~~~~~ 320 (320) T protein:vir:97 275 SLIYAMRD--EFFVS-FMT--------------TGGMLNSNRVDG-WGKEKAPNESKGGDVGDV 320 (320) T ss_pred ceeeeecc--ceeee-eee--------------cCceeecccccc-cccccCCccccCCcccCC Confidence 43332211 11111 110 144444444332 243332211 11222222 No 201 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=97.70 E-value=2.7e-05 Score=45.63 Aligned_cols=380 Identities=11% Similarity=0.049 Sum_probs=175.9 Q ss_pred CchhhhhccccccC-Ccc-chhhhhhcc-cccccC----cccccHHHHhccHHHHHHHHHHHHhhccCceeec--chhHH Q lcl|NC_011801. 1 MAFLSNLFKRQKML-SGS-SPVWILNQG-QPVSIK----PKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN--AQPIT 71 (386) Q Consensus 1 Mg~~~~l~~~~~~~-~~~-~~~~~~~~~-~~~~~~----~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~--~~~~~ 71 (386) +..+.++..+.... ... ........+ ...... ....-...-+.+.-..-.|+..++-+-+-|+.+. +.... T Consensus 42 ~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~ 121 (512) T protein:vir:97 42 INEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVL 121 (512) T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeccCChHHH Confidence 11111111100000 000 000000000 000000 0000000001123345566777776667776653 33333 Q ss_pred HHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCC--ceeEEE--EeccCc--c Q lcl|NC_011801. 72 DVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYG--KDLTYT--VHFDDS--K 145 (386) Q Consensus 72 ~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~~~~~~--~~~~~~--~ 145 (386) ..|..- ...-........+..+...+|.||..+.++.+|.+ .+..++|..+.+..+... ...... +..... . T Consensus 122 ~~l~~~-~~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded~~~-~i~~~~p~~~~~iyd~~~~~~~~~~vr~~~~~~~~~~ 199 (512) T protein:vir:97 122 EAIEAF-NDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKT 199 (512) T ss_pred HHHHHH-HhhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEeeecccc Confidence 344321 11123445667788889999999999999888875 578899999888876543 122111 111000 0 Q ss_pred c----ceeEEEcccceeeeccccc-------------cCc---------ccccccccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 146 R----SGDFLYDSSEVIHFRCTVS-------------GES---------DTQYMGIPPIDSLLNEIEVQDLSSKLAISTL 199 (386) Q Consensus 146 ~----~~~~~~~~~~vih~~~~~~-------------~~~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~ 199 (386) . .....+....+.+++.... .++ .+...|.|.+..+...++....+..-..+.+ T Consensus 200 ~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~gd~e~v~~liDa~d~~~S~~~~~~ 279 (512) T protein:vir:97 200 DEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYM 279 (512) T ss_pred ccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcccceEeecCCCCCCCchhhhHHHHHHHHHHHHHHHHHH Confidence 0 0011334444444321100 000 0113588888888888888887766666667 Q ss_pred hccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcc---cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHh Q lcl|NC_011801. 200 RHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTG---ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAF 276 (386) Q Consensus 200 ~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~---~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~ 276 (386) +..+.|-.+++.. ...+.+.....+....-.... .+.+...-.++|.+++.+........+....+...+.|+..- T Consensus 280 ~~~~~~~lv~~G~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s 358 (512) T protein:vir:97 280 SDLNDAMLLIKGN-LNLDPVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFT 358 (512) T ss_pred HHhcCceeeeecC-ccCCchhhhhhhhcccccccccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHh Confidence 7777776665432 223343333332211111111 011111123456666666655455556777888889999988 Q ss_pred CCCHHHhcCC-cCcccHH-------------HHHHHHHHHHHHHHHHHHHHHHHHhhh-------hhhhhcchhhhccCH Q lcl|NC_011801. 277 GIPADYLSGK-QDAQSNI-------------TMIRAFYQSSLSIYIKPIESELSQKLG-------TDVKLDIASAIDSDN 335 (386) Q Consensus 277 gvp~~~l~~~-~~~~~~~-------------~~~~~~~~~~l~P~~~~ie~~l~~~l~-------~~~~fd~~~~l~~d~ 335 (386) ++|..-.+.. ++.+ .. +..+..+...|.-.++.+...+...-. ..+++.+..-+..|. T Consensus 359 ~~p~~~~~~~~gn~S-g~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~~~ 437 (512) T protein:vir:97 359 NTPNMKDDNFSGTQS-GEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSL 437 (512) T ss_pred CCcccCcccccccch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCH Confidence 9887443221 2111 11 111234444444444444444332211 123455555566788 Q ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhccCC-----------------------cCCCCCCCccccccccCCCCCC Q lcl|NC_011801. 336 SELINNVQKLASAGVLAPIQAQKLLKNRG-----------------------VFPELDLDEGTNLLDNTKNIND 386 (386) Q Consensus 336 ~~~~~~~~~~~~~g~~t~nE~R~~lg~~p-----------------------~~p~~~~~~~~~~~~~~~~~~~ 386 (386) .+.++++.++. |+++...+.++++.-+ ....++....++.-..+.+.+| T Consensus 438 ~e~~~~~~kl~--giiS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (512) T protein:vir:97 438 IEELKAYIDSG--GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVD 509 (512) T ss_pred HHHHHHHHHHh--ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcccccc Confidence 88888888884 7788776666654211 0000000011111122222222 No 202 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=97.70 E-value=2.7e-05 Score=45.61 Aligned_cols=381 Identities=10% Similarity=0.044 Sum_probs=174.9 Q ss_pred Cch---hhhhccccccC-Cccc-hhhhhhccc-ccc----cCcccccHHHHhccHHHHHHHHHHHHhhccCceeec--ch Q lcl|NC_011801. 1 MAF---LSNLFKRQKML-SGSS-PVWILNQGQ-PVS----IKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN--AQ 68 (386) Q Consensus 1 Mg~---~~~l~~~~~~~-~~~~-~~~~~~~~~-~~~----~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~--~~ 68 (386) |.. +.++..+.... ...- .......+. ... ......-...-+.+.-....++..++-+-+-|+.+. +. T Consensus 39 ~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~ 118 (511) T protein:vir:10 39 LQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDK 118 (511) T ss_pred ccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeecCch Confidence 111 11110000000 0000 000000000 000 000000000001123345566666666666676543 33 Q ss_pred hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCC--ceeEEE--Eecc-- Q lcl|NC_011801. 69 PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYG--KDLTYT--VHFD-- 142 (386) Q Consensus 69 ~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~~~~~~--~~~~-- 142 (386) .....|..- ...-........+..++..+|.||..+.++.+|.+ .+..++|..+.+..+... ...... +... T Consensus 119 ~~~~~l~~~-~~~n~~~~~~~~~~~~~~i~G~ay~~vy~dedg~~-~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~ 196 (511) T protein:vir:10 119 DVLEAIEAF-NDLNDVESHNRSLGLDLSIYGKAYEIMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPI 196 (511) T ss_pred HHHHHHHHH-HhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEccceeEEEEcCCCCCceEEEEEEEEeeec Confidence 333444321 11112334666788889999999999999888865 567788998888776543 121111 1110 Q ss_pred Cccc-c-e--eEEEcccceeeeccccc-------------cCc---------ccccccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 143 DSKR-S-G--DFLYDSSEVIHFRCTVS-------------GES---------DTQYMGIPPIDSLLNEIEVQDLSSKLAI 196 (386) Q Consensus 143 ~~~~-~-~--~~~~~~~~vih~~~~~~-------------~~~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~ 196 (386) .... . . ...+.+..+.++..... +++ .+...|.|.+..+...++....+..-.. T Consensus 197 d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f~nn~~g~gd~e~v~~liDa~d~~~S~~~ 276 (511) T protein:vir:10 197 DKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTA 276 (511) T ss_pred ccCccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcceeEEEecCCCCCCCchhhhHHHHHHHHHHHHHHH Confidence 0000 0 0 01233333333321100 000 0113588888888888887777766666 Q ss_pred HHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhc-ccc-cCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHH Q lcl|NC_011801. 197 STLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTT-GEN-AGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAK 274 (386) Q Consensus 197 ~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~-~~~-~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~ 274 (386) +.++..+.|-.+++.. ...+.++....++.-.-... ... .+...-.++|.+++.++....+..+....+...+.|+. T Consensus 277 ~~~~~~~~~~lv~~g~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~ 355 (511) T protein:vir:10 277 NYMSDLNDAMLLIKGN-LNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHM 355 (511) T ss_pred HHHHHhhCceeeeecc-ccCCchhhccchhccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHH Confidence 6677777775655431 22334433332221110010 000 01112234456666666555556667788888999999 Q ss_pred HhCCCHHHhcC-CcCccc-H-----------HHHHHHHHHHHHHHHHHHHHHHHHHhhh-------hhhhhcchhhhccC Q lcl|NC_011801. 275 AFGIPADYLSG-KQDAQS-N-----------ITMIRAFYQSSLSIYIKPIESELSQKLG-------TDVKLDIASAIDSD 334 (386) Q Consensus 275 ~~gvp~~~l~~-~~~~~~-~-----------~~~~~~~~~~~l~P~~~~ie~~l~~~l~-------~~~~fd~~~~l~~d 334 (386) .-++|..-.+. .++.+- + ....+..+..+|.-.++.+...+..... ..+++.+..-+..| T Consensus 356 ~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~i~f~~~~p~d 435 (511) T protein:vir:10 356 FTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKS 435 (511) T ss_pred HhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccccceeeEEeCCCCCcC Confidence 99998743321 121110 0 1112334455555555555554443322 13455666667788 Q ss_pred HHHHHHHHHHHHhCCCcCHHHHHHHhccCC-----------------------cCCCCCCCccccccccCCCCCC Q lcl|NC_011801. 335 NSELINNVQKLASAGVLAPIQAQKLLKNRG-----------------------VFPELDLDEGTNLLDNTKNIND 386 (386) Q Consensus 335 ~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p-----------------------~~p~~~~~~~~~~~~~~~~~~~ 386 (386) ..+.++++.+++ |+++...+.++++.-+ ....++..+.++.-..+++.+| T Consensus 436 ~~~~~~~~~kl~--G~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (511) T protein:vir:10 436 LIEELKAYIDSG--GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVD 508 (511) T ss_pred HHHHHHHHHHHh--ccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhhcccCCCCCCCCCCCCcccCccc Confidence 999999999885 6777766666553211 0000111111112222222222 No 203 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=97.69 E-value=2.8e-05 Score=45.52 Aligned_cols=369 Identities=10% Similarity=0.043 Sum_probs=165.1 Q ss_pred Cch-----------------------hhhhccccccCCcc-chhhhhhcccccccCc-ccccHHHHhccHHHHHHHHHHH Q lcl|NC_011801. 1 MAF-----------------------LSNLFKRQKMLSGS-SPVWILNQGQPVSIKP-KAITSAIALKNSDVYAVISRVS 55 (386) Q Consensus 1 Mg~-----------------------~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~-~~i~~~~a~~~~~v~~~v~~ia 55 (386) |-+ +.++.......... ................ .......-+.++-...+|+..+ T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~ 80 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKK 80 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhh Confidence 211 11111111000000 0000000000000000 0000000112334455666666 Q ss_pred HhhccCceeec--chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecC-CCceEEEEEEcCcceEEeecCCC Q lcl|NC_011801. 56 SDIAGCRFVTN--AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDT-NGYPVRIEPVPNEKVTVALDDYG 132 (386) Q Consensus 56 ~~ia~~p~~~~--~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~-~g~~~~l~~l~~~~v~~~~~~~~ 132 (386) +-+-+-|+.+. +....+.|..--+.. .......+...+..+|.||..+.++. +|. ..+..++|..+-+..+... T Consensus 81 ~yl~G~p~~~~~~~~~~~~~l~~~~~n~--~~~~~~~~~~~~~~~G~~~~~v~~d~~~g~-~~~~~~~p~~~~~i~d~~~ 157 (471) T protein:vir:10 81 AYALTYPPTFDVDDKKVNDMIVDVLGDD--YERISKQLCVNAGNAGIAWLHVWKDASDNS-FRYACVDSKEVIPIYSKSL 157 (471) T ss_pred hhhcccCceeccCChHHHHHHHHHHhcC--HHHHHHHHHHHHhhCCeEEEEEEeeCCCCe-eEEEEEcccceEEEEcCCC Confidence 66666666543 333444443221212 33455667788999999999998875 454 4678889999888776543 Q ss_pred c--eeE---EEEeccC-cccc--eeEEEcccceeeeccccc----------------------------cCcc------- Q lcl|NC_011801. 133 K--DLT---YTVHFDD-SKRS--GDFLYDSSEVIHFRCTVS----------------------------GESD------- 169 (386) Q Consensus 133 ~--~~~---~~~~~~~-~~~~--~~~~~~~~~vih~~~~~~----------------------------~~~~------- 169 (386) . ... ++..... .... ....+....+.|++.... .++. T Consensus 158 ~~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~ 237 (471) T protein:vir:10 158 DKKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSFKHDFGLVPFIP 237 (471) T ss_pred CCceEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCCceeEEE Confidence 1 111 1111100 0000 111223333443322110 0000 Q ss_pred --cccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecC--- Q lcl|NC_011801. 170 --TQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLD--- 244 (386) Q Consensus 170 --~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~--- 244 (386) +...|.|.+......++....+.....+.++..+.|-.+++..+....++ ....+.. ++++.++ T Consensus 238 ~~n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~----~~~~~~~-------~~~i~~~~~~ 306 (471) T protein:vir:10 238 FKNNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQE----FLEDLKR-------YKMIKMDNDG 306 (471) T ss_pred eccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccch----hHHHhhc-------CCeEEecCCC Confidence 11347788888888888777766666666676666655554322111222 2222211 1122221 Q ss_pred --CCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccH------------HHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 245 --QSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSN------------ITMIRAFYQSSLSIYIKP 310 (386) Q Consensus 245 --~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~------------~~~~~~~~~~~l~P~~~~ 310 (386) .+.+++.+........+....+...+.|...-++|..-....++.+.. ....+..+...+.-.++. T Consensus 307 ~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~l 386 (471) T protein:vir:10 307 MGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVNPETDKLGNSSGVALKFLYSLLELKAGNMETQFRSGYATLVKM 386 (471) T ss_pred CccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcCCCcccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 223344444333344567778888888988888886422211221110 011122333444444444 Q ss_pred HHHHHHHhhhhhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCC-------Cc-cccccccCC Q lcl|NC_011801. 311 IESELSQKLGTDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDL-------DE-GTNLLDNTK 382 (386) Q Consensus 311 ie~~l~~~l~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~-------~~-~~~~~~~~~ 382 (386) +...+...-...+++.+...+..|..+.++.+.++ .|+++...+.++++.-. +|.... .+ ......... T Consensus 387 i~~~~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~~p~v~-D~~~E~eri~~E~~~~~~~~~~~~~ 463 (471) T protein:vir:10 387 ILKHLGLSDKLKIKQTWTRNSINNDTEMAQVVSTL--ATITSRENVAKSNPIVE-DWQDELRLQKAEQEGRSEKLYDMEE 463 (471) T ss_pred HHHHhccCCCceeEEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCC-CHHHHHHHHHHHHHHHHhcccccCC Confidence 44333222123455666677788999999998887 57899888777654211 011000 00 111111222 Q ss_pred CCCC Q lcl|NC_011801. 383 NIND 386 (386) Q Consensus 383 ~~~~ 386 (386) ...| T Consensus 464 ~~~~ 467 (471) T protein:vir:10 464 VEHE 467 (471) T ss_pred CCCc Confidence 2222 No 204 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=97.64 E-value=3.3e-05 Score=45.12 Aligned_cols=368 Identities=9% Similarity=0.009 Sum_probs=163.2 Q ss_pred Cc-----hhhhhcccccc-----------CCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCcee Q lcl|NC_011801. 1 MA-----FLSNLFKRQKM-----------LSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFV 64 (386) Q Consensus 1 Mg-----~~~~l~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~ 64 (386) .+ ++.++..+... ..+.+.+................-...-+.++-...+|+..++-+-+-|+. T Consensus 23 ~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~ivd~~~~yl~g~p~~ 102 (478) T protein:vir:10 23 KYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPFKRDVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVANPVT 102 (478) T ss_pred ccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccchhhhcccccccccccceeccchHHHHHHHHhhhhcccCce Confidence 00 11111100000 000000000000000000000000000112344556777777777677765 Q ss_pred ec--chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCC-C-ceeEEEEe Q lcl|NC_011801. 65 TN--AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDY-G-KDLTYTVH 140 (386) Q Consensus 65 ~~--~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~-~-~~~~~~~~ 140 (386) +. +....+.|..--+ .........+..+...+|.+|+.+-.+.+|.+ .+..++|..+.+..+.. . ........ T Consensus 103 ~~~~~~~~~~~l~~~~~--n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~~~~~~p~~~~~v~d~~~~~~~~~~ir~ 179 (478) T protein:vir:10 103 FGVDNDKALKQIQHTLN--HKWDDKLVDILTAASNKGIEWVQPYVDEEGEF-KTFRVPAEQAVPIWTNKERDELQAFIRV 179 (478) T ss_pred eecCChHHHHHHHHHHh--ccHHHHHHHHHHHHhhCCeEEEEEEecCCCce-EEEEEcccceEEEEcCCCCCceEEEEEE Confidence 43 3334443322212 13445666678889999999999988888875 57778999988776532 2 22221111 Q ss_pred ccCcccceeEEEcccceeeeccccc----------------------cCc---------ccccccccHHHHHHHHHHHHH Q lcl|NC_011801. 141 FDDSKRSGDFLYDSSEVIHFRCTVS----------------------GES---------DTQYMGIPPIDSLLNEIEVQD 189 (386) Q Consensus 141 ~~~~~~~~~~~~~~~~vih~~~~~~----------------------~~~---------~~~~~G~s~~~~~~~~i~~~~ 189 (386) +..........+....|.+++.... .++ .+...|.|.+..+...++... T Consensus 180 ~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~ 259 (478) T protein:vir:10 180 YELDGAERVEYWTKDDVTFYELKEGQLIPDFYRSEDHIQPHYYQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALD 259 (478) T ss_pred EeeeCceEEEEEeCCcEEEEEecCCeeeccccccccccccceecccccccCCcceEEEeccCCCCCCcHHHHHHHHHHHH Confidence 1111111112233444433322100 000 012358888888888888777 Q ss_pred HHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceec--CCCceeeeccCChhhHHHHHHHHH Q lcl|NC_011801. 190 LSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVL--DQSADVETTNISPNVTEFLQNVSF 267 (386) Q Consensus 190 ~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl--~~g~~~~~~~~~~~d~~~~e~~~~ 267 (386) .+.....+.++..+.|-.+++ +...++ .......+.. .+++.+ ++|.+++.+........+.+..+. T Consensus 260 ~~~S~~~~~~~~~~~~~~~~~--g~~~~~--~~~~~~~~~~-------~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 328 (478) T protein:vir:10 260 KRLSDTQNTFDESVELIYILK--GYEGED--MKDFMHNLKY-------YKAISVAGESGSGVDTIKVEVPIDSVKEYTKM 328 (478) T ss_pred HHHHHHHHHHHHhhCcceeee--cCCccc--ccchhhhhhh-------CceeEecCCCCCcceEEeecCCHHHHHHHHHH Confidence 766666666666666655544 322111 1111111211 123333 334445545544556667788888 Q ss_pred HHHHHHHHhCCCHHHhcCCcCcccHHHHH--------------HHHHHHHHHHHHHHHHHHHHHhh-hhhhhhcchhhhc Q lcl|NC_011801. 268 SQDQIAKAFGIPADYLSGKQDAQSNITMI--------------RAFYQSSLSIYIKPIESELSQKL-GTDVKLDIASAID 332 (386) Q Consensus 268 ~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~--------------~~~~~~~l~P~~~~ie~~l~~~l-~~~~~fd~~~~l~ 332 (386) ..+.|...-++|..-....+ ++-+..+. ...+..++.-+++.+.+.+.... ...+++.+..-+. T Consensus 329 l~~~I~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~d~~~i~i~f~~~~p 407 (478) T protein:vir:10 329 LRDYIIEFGQGVDFQQDKFG-NSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRLDVRVQDIEITFNFNVM 407 (478) T ss_pred HHHHHHHHhCCcCcCccccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccceEEeCCCCC Confidence 99999999998863222111 11111111 12223333333333322221111 0133445555566 Q ss_pred cCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCC-------ccccc-cccCCCCCC Q lcl|NC_011801. 333 SDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLD-------EGTNL-LDNTKNIND 386 (386) Q Consensus 333 ~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~-------~~~~~-~~~~~~~~~ 386 (386) .|..+.++.+.++ .|+++...+.++++.-. +|..... +.... ....++.+| T Consensus 408 ~~~~e~~~~~~~~--~g~iS~et~i~~~~~v~-d~~~E~~ri~~E~~~~~~~~~~~~~~~~d 466 (478) T protein:vir:10 408 VNELENSQIAMNS--TGLLSKETILGNHSWVQ-DPVAEMERIEQENIELNQQLPDIEEGLND 466 (478) T ss_pred CCHHHHHHHHHHH--hCCCChHHHHHhCCCCC-CHHHHHHHHHHHHHHHHHhccccCCCCcc Confidence 7888888888776 57888766666553210 0100000 00000 000011111 No 205 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=97.60 E-value=4e-05 Score=44.72 Aligned_cols=374 Identities=10% Similarity=0.042 Sum_probs=170.6 Q ss_pred Cchhhhhccc-ccc-------CCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec--chhH Q lcl|NC_011801. 1 MAFLSNLFKR-QKM-------LSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN--AQPI 70 (386) Q Consensus 1 Mg~~~~l~~~-~~~-------~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~--~~~~ 70 (386) ..|+++.... .+. ..+.+++.... . ......-...-+.+.-..-.|+..++-+-+-|+.+. +..+ T Consensus 46 ~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~----~-~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~ 120 (511) T protein:vir:99 46 SKYIEHHMDYQRPRLKVLSDYYEGKTKNLVEL----T-RRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDV 120 (511) T ss_pred HHHHHHHHHhhHHHHHHHHHHhcccCcccccc----C-cccccccCcceeecchHHHHHHHHHhhhcccCceeecCchHH Confidence 1111110000 000 00000000000 0 000000000001223344566666666666676543 3333 Q ss_pred HHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCC--ceeEEE--Eecc--C- Q lcl|NC_011801. 71 TDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYG--KDLTYT--VHFD--D- 143 (386) Q Consensus 71 ~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~~~~~~--~~~~--~- 143 (386) ...|..--+. -........+..++..+|.||..+.++.+|.+ .+..++|..+-+..+... ...... +... . T Consensus 121 ~~~l~~~~~~-n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~-~i~~~~p~~~~~vyd~~~~~~~~~~vr~~~~~~~~~ 198 (511) T protein:vir:99 121 LEAIEAFNDL-NDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDK 198 (511) T ss_pred HHHHHHHHhh-cCHhHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccc Confidence 3333221111 12345666788899999999999999888865 577889999888776542 221111 1100 0 Q ss_pred cccc---eeEEEcccceeeecccccc-------------Cc---------ccccccccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 144 SKRS---GDFLYDSSEVIHFRCTVSG-------------ES---------DTQYMGIPPIDSLLNEIEVQDLSSKLAIST 198 (386) Q Consensus 144 ~~~~---~~~~~~~~~vih~~~~~~~-------------~~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~ 198 (386) .... ....+.+..+.+++..... ++ .+...|.|.+..+...++....+..-..+. T Consensus 199 ~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~ 278 (511) T protein:vir:99 199 TDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANY 278 (511) T ss_pred CccceEEEEEEEeCCcEEEEEecCCccccccccccccccCCCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHH Confidence 0000 0112344444443211000 00 011358888888888888777766666666 Q ss_pred HhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcc--cccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHh Q lcl|NC_011801. 199 LRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTG--ENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAF 276 (386) Q Consensus 199 ~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~--~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~ 276 (386) ++..+.|-.+++.. ...+.++....++.-.-.... .-.+...-.++|.+++.++....+..+....+...+.|+..- T Consensus 279 ~~~~~~~~lv~~G~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s 357 (511) T protein:vir:99 279 MSDLNDAMLLIKGN-LNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFT 357 (511) T ss_pred HHHhhchhhhhccC-cccCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHh Confidence 66656665544321 223333333222210000000 000111223455666666655555666778888899999999 Q ss_pred CCCHHHhcC-CcCcccHHHH--------------HHHHHHHHHHHHHHHHHHHHHHhhh-------hhhhhcchhhhccC Q lcl|NC_011801. 277 GIPADYLSG-KQDAQSNITM--------------IRAFYQSSLSIYIKPIESELSQKLG-------TDVKLDIASAIDSD 334 (386) Q Consensus 277 gvp~~~l~~-~~~~~~~~~~--------------~~~~~~~~l~P~~~~ie~~l~~~l~-------~~~~fd~~~~l~~d 334 (386) ++|..-.+. .++. +..+ .+..+...|.-.++.|...+...-. ..+++.+..-+..| T Consensus 358 ~~P~~~~~~~~gn~--Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n 435 (511) T protein:vir:99 358 NTPNMKDDNFSGTQ--SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDVSKDFNTVRYVYNRNLPKS 435 (511) T ss_pred CCcccccccccccc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcC Confidence 998744321 2221 1111 1223344444444444443332211 12455565666778 Q ss_pred HHHHHHHHHHHHhCCCcCHHHHHHHhccCC-----------------------cCCCCCCCccccccccCCCCCC Q lcl|NC_011801. 335 NSELINNVQKLASAGVLAPIQAQKLLKNRG-----------------------VFPELDLDEGTNLLDNTKNIND 386 (386) Q Consensus 335 ~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p-----------------------~~p~~~~~~~~~~~~~~~~~~~ 386 (386) ..+.++.+.++. |+++...+.++++.-+ ..+.++..+.++.-..+..+.| T Consensus 436 ~~e~~~~~~kl~--GiiS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 508 (511) T protein:vir:99 436 LIEELKAYIDSG--GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKNMYQDPRNINDDEQDDSTKDSID 508 (511) T ss_pred HHHHHHHHHHHh--ccCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCCCCCCCCcCccc Confidence 889999988884 7888776766653211 0000011111111122222222 No 206 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=97.57 E-value=4.3e-05 Score=44.53 Aligned_cols=379 Identities=12% Similarity=0.070 Sum_probs=173.0 Q ss_pred Cch----hhhhcccccc--CCccchhhhhhccccccc--Ccccc---cHHHHhccHHHHHHHHHHHHhhccCceeecc-- Q lcl|NC_011801. 1 MAF----LSNLFKRQKM--LSGSSPVWILNQGQPVSI--KPKAI---TSAIALKNSDVYAVISRVSSDIAGCRFVTNA-- 67 (386) Q Consensus 1 Mg~----~~~l~~~~~~--~~~~~~~~~~~~~~~~~~--~~~~i---~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~-- 67 (386) +.. +.++...... .+.-........+..... ..... ....-+.+.-..-+|+..++-+-+-|+.+.. T Consensus 37 ~~~~~~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d 116 (501) T protein:vir:27 37 MVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLQFGRRKDREMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDD 116 (501) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccCccCccccccceeccchHHHHHHHHhhhhcccCeeEecCC Confidence 221 1111100000 000000000000000000 00000 0001122344556777777777677765432 Q ss_pred ----hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCC-c-eeEE--EE Q lcl|NC_011801. 68 ----QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYG-K-DLTY--TV 139 (386) Q Consensus 68 ----~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~-~-~~~~--~~ 139 (386) ..+...|.. -...-........+..++..+|.||+.+.++.+|.+ .+..++|..+.+..+... . .... .+ T Consensus 117 ~~~~~~~~~~l~~-~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~ded~~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~~ 194 (501) T protein:vir:27 117 NDNNSQNDDTIKR-IGRINDIDSHNRTLIRDLSQTGRAYEVIYRNEYDET-RIKRLNPLETFVIYDNSLEDNSIAAVRYY 194 (501) T ss_pred ccchHHHHHHHHH-HHHhcChhHHHHHHHHHHhhCCeEEEEEEeCCCCce-EEEEEccceeEEEecCCCCCceEEEEEEE Confidence 123333322 111123445777889999999999999999888875 467788999888776532 1 2111 11 Q ss_pred eccC--cccceeEEEcccceeeecccc-------ccCc---------ccccccccHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_011801. 140 HFDD--SKRSGDFLYDSSEVIHFRCTV-------SGES---------DTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRH 201 (386) Q Consensus 140 ~~~~--~~~~~~~~~~~~~vih~~~~~-------~~~~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n 201 (386) .... ........+..+.+.++.... .+++ .+...|.|.+..+...++....+..-..+.+.. T Consensus 195 ~~~~~~~~~~~~~vyt~~~v~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~ 274 (501) T protein:vir:27 195 NRGTLQNAKDVVEIYTNEHIYTLDASDDFNEISVTTHAFGTVPITEFLNNVDGIGDYETELYLIDLYDSAESDTANHMSD 274 (501) T ss_pred EeeecCCcEEEEEEEeCCeEEEEEeCCceeeccccccCCCcccEEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHH Confidence 1111 100011112222221111000 0000 112368888988888888887777777777777 Q ss_pred cCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHH Q lcl|NC_011801. 202 AIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPAD 281 (386) Q Consensus 202 g~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~ 281 (386) ...|-.++........++....++.. ....-...+.....+++.++..++....+..+....+...+.|+..-++|.. T Consensus 275 ~~~~~~v~~g~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~ 352 (501) T protein:vir:27 275 MADAILAIYGDLALPKGMQASDMKRT--RLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDM 352 (501) T ss_pred hcCceeeeecCccCCcccchhhhhhc--CceeecccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCccc Confidence 67666655432212222222222221 0111111122233445566666665555556677788888999999999864 Q ss_pred HhcC-CcCcccH------------HHHHHHHHHHHHHHHHHHHHHHHHHhh-h-----hhhhhcchhhhccCHHHHHHHH Q lcl|NC_011801. 282 YLSG-KQDAQSN------------ITMIRAFYQSSLSIYIKPIESELSQKL-G-----TDVKLDIASAIDSDNSELINNV 342 (386) Q Consensus 282 ~l~~-~~~~~~~------------~~~~~~~~~~~l~P~~~~ie~~l~~~l-~-----~~~~fd~~~~l~~d~~~~~~~~ 342 (386) -.+. .++.+.. ....+..+...|.-+++.+...++..- + ..+++.+...+..|..+.++++ T Consensus 353 ~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~v~f~~~~p~n~~e~ad~~ 432 (501) T protein:vir:27 353 SDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSIL 432 (501) T ss_pred CccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHH Confidence 3322 1211110 011123444445555544444443221 1 1244556666777888999988 Q ss_pred HHHHhCCCcCHHHHHHHhccCCcCCCCCCC-------------ccccccccCCCCCC Q lcl|NC_011801. 343 QKLASAGVLAPIQAQKLLKNRGVFPELDLD-------------EGTNLLDNTKNIND 386 (386) Q Consensus 343 ~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~-------------~~~~~~~~~~~~~~ 386 (386) .++ .|+++..-+.++++.-. +|..... ..+......+...| T Consensus 433 ~kl--~g~iS~et~l~~l~~v~-D~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~d 486 (501) T protein:vir:27 433 TGL--GGQVSQETALSLSGLVE-SPNEELDKINKEVSEIDFKGYSNDFNEHVGKYTD 486 (501) T ss_pred HHH--hccCcHHHHHHhCCCCC-CHHHHHHHHHHHHHhhhHhhhcCccccccccccC Confidence 887 46777766666543211 0011100 00001111111111 No 207 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=97.57 E-value=4.4e-05 Score=44.46 Aligned_cols=375 Identities=10% Similarity=0.055 Sum_probs=170.8 Q ss_pred Cchhhhhccc-cc-------cCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec--chhH Q lcl|NC_011801. 1 MAFLSNLFKR-QK-------MLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN--AQPI 70 (386) Q Consensus 1 Mg~~~~l~~~-~~-------~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~--~~~~ 70 (386) ..|.++...+ .. -..+.+++... .. ......-...-+.+.-..-.++..++-+-+-|+.+. +... T Consensus 46 ~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~----~~-~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~ 120 (511) T protein:vir:96 46 SKYIEHHMDYQRPRLKVLSDYYEGKTKNLVE----LT-RRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDV 120 (511) T ss_pred HHHHHHHHHhhhHHHHHHHHHhhccCccccc----cC-cccccccCcceeecchHHHHHHHHhhhhcccCceeecCchHH Confidence 1111110000 00 00000000000 00 000000000001223344566666666666676543 3334 Q ss_pred HHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCC--ceeEEE--EeccC--c Q lcl|NC_011801. 71 TDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYG--KDLTYT--VHFDD--S 144 (386) Q Consensus 71 ~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~~~~~~--~~~~~--~ 144 (386) ...|.. -...-....+...+..++..+|.||..+.++.+|.+ .+..++|..+.+..+... ...... +.... . T Consensus 121 ~~~l~~-~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~-~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~ 198 (511) T protein:vir:96 121 LEAIEA-FNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPIDK 198 (511) T ss_pred HHHHHH-HHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccc Confidence 344322 111112334666788889999999999999888865 567889999888876543 122211 11110 0 Q ss_pred cc-c---eeEEEcccceeeeccccc-------------cCc---------ccccccccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 145 KR-S---GDFLYDSSEVIHFRCTVS-------------GES---------DTQYMGIPPIDSLLNEIEVQDLSSKLAIST 198 (386) Q Consensus 145 ~~-~---~~~~~~~~~vih~~~~~~-------------~~~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~ 198 (386) .. . ....+.+..+.++..... +++ .+...|.|.+..+...++....+..-..+. T Consensus 199 ~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~~liDa~~~~~S~~~~~ 278 (511) T protein:vir:96 199 TDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANY 278 (511) T ss_pred cccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHH Confidence 00 0 011334444444322110 000 012358888888888888777766666666 Q ss_pred HhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhc-ccc-cCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHh Q lcl|NC_011801. 199 LRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTT-GEN-AGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAF 276 (386) Q Consensus 199 ~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~-~~~-~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~ 276 (386) ++..+.|-.+++.. ...+.++.+..++.-.-... +.. .+.-.-.+++.+++.++.......+....+...+.|+..- T Consensus 279 ~~~~~~~~lv~~G~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s 357 (511) T protein:vir:96 279 MSDLNDAMLLIKGN-LNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFT 357 (511) T ss_pred HHHhhcchhheecC-ccCCchhhcccccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHh Confidence 66666665555431 22333333332221110000 000 0000112234444444444445556778888899999999 Q ss_pred CCCHHHhcCCcCcccHHH--------------HHHHHHHHHHHHHHHHHHHHHHHhhh-------hhhhhcchhhhccCH Q lcl|NC_011801. 277 GIPADYLSGKQDAQSNIT--------------MIRAFYQSSLSIYIKPIESELSQKLG-------TDVKLDIASAIDSDN 335 (386) Q Consensus 277 gvp~~~l~~~~~~~~~~~--------------~~~~~~~~~l~P~~~~ie~~l~~~l~-------~~~~fd~~~~l~~d~ 335 (386) ++|..-.+... ++-+.. ..+..+...|.-.++.|...+...-. ..+++.+..-+..|. T Consensus 358 ~~P~~~~~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~ 436 (511) T protein:vir:96 358 NTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSL 436 (511) T ss_pred CCccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCH Confidence 99975432211 111111 12234445555555555544443221 124555666677788 Q ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhccCC-----------------------cCCCCCCCccccccccCCCCCC Q lcl|NC_011801. 336 SELINNVQKLASAGVLAPIQAQKLLKNRG-----------------------VFPELDLDEGTNLLDNTKNIND 386 (386) Q Consensus 336 ~~~~~~~~~~~~~g~~t~nE~R~~lg~~p-----------------------~~p~~~~~~~~~~~~~~~~~~~ 386 (386) .+.++.+.++. |+++...+.++++.-+ .....+....++.-..+++.++ T Consensus 437 ~e~~d~~~kl~--G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (511) T protein:vir:96 437 IEELKAYIDSG--GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVD 508 (511) T ss_pred HHHHHHHHHHh--ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCCccCccc Confidence 89999988885 6777666655543211 0000111111122222222222 No 208 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=97.57 E-value=4.4e-05 Score=44.46 Aligned_cols=375 Identities=10% Similarity=0.055 Sum_probs=170.8 Q ss_pred Cchhhhhccc-cc-------cCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec--chhH Q lcl|NC_011801. 1 MAFLSNLFKR-QK-------MLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN--AQPI 70 (386) Q Consensus 1 Mg~~~~l~~~-~~-------~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~--~~~~ 70 (386) ..|.++...+ .. -..+.+++... .. ......-...-+.+.-..-.++..++-+-+-|+.+. +... T Consensus 46 ~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~----~~-~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~ 120 (511) T protein:vir:78 46 SKYIEHHMDYQRPRLKVLSDYYEGKTKNLVE----LT-RRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDV 120 (511) T ss_pred HHHHHHHHHhhhHHHHHHHHHhhccCccccc----cC-cccccccCcceeecchHHHHHHHHhhhhcccCceeecCchHH Confidence 1111110000 00 00000000000 00 000000000001223344566666666666676543 3334 Q ss_pred HHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCC--ceeEEE--EeccC--c Q lcl|NC_011801. 71 TDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYG--KDLTYT--VHFDD--S 144 (386) Q Consensus 71 ~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~~~~~~--~~~~~--~ 144 (386) ...|.. -...-....+...+..++..+|.||..+.++.+|.+ .+..++|..+.+..+... ...... +.... . T Consensus 121 ~~~l~~-~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~-~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~ 198 (511) T protein:vir:78 121 LEAIEA-FNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPIDK 198 (511) T ss_pred HHHHHH-HHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccc Confidence 344322 111112334666788889999999999999888865 567889999888876543 122211 11110 0 Q ss_pred cc-c---eeEEEcccceeeeccccc-------------cCc---------ccccccccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 145 KR-S---GDFLYDSSEVIHFRCTVS-------------GES---------DTQYMGIPPIDSLLNEIEVQDLSSKLAIST 198 (386) Q Consensus 145 ~~-~---~~~~~~~~~vih~~~~~~-------------~~~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~ 198 (386) .. . ....+.+..+.++..... +++ .+...|.|.+..+...++....+..-..+. T Consensus 199 ~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~~liDa~~~~~S~~~~~ 278 (511) T protein:vir:78 199 TDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANY 278 (511) T ss_pred cccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHH Confidence 00 0 011334444444322110 000 012358888888888888777766666666 Q ss_pred HhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhc-ccc-cCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHh Q lcl|NC_011801. 199 LRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTT-GEN-AGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAF 276 (386) Q Consensus 199 ~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~-~~~-~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~ 276 (386) ++..+.|-.+++.. ...+.++.+..++.-.-... +.. .+.-.-.+++.+++.++.......+....+...+.|+..- T Consensus 279 ~~~~~~~~lv~~G~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s 357 (511) T protein:vir:78 279 MSDLNDAMLLIKGN-LNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFT 357 (511) T ss_pred HHHhhcchhheecC-ccCCchhhcccccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHh Confidence 66666665555431 22333333332221110000 000 0000112234444444444445556778888899999999 Q ss_pred CCCHHHhcCCcCcccHHH--------------HHHHHHHHHHHHHHHHHHHHHHHhhh-------hhhhhcchhhhccCH Q lcl|NC_011801. 277 GIPADYLSGKQDAQSNIT--------------MIRAFYQSSLSIYIKPIESELSQKLG-------TDVKLDIASAIDSDN 335 (386) Q Consensus 277 gvp~~~l~~~~~~~~~~~--------------~~~~~~~~~l~P~~~~ie~~l~~~l~-------~~~~fd~~~~l~~d~ 335 (386) ++|..-.+... ++-+.. ..+..+...|.-.++.|...+...-. ..+++.+..-+..|. T Consensus 358 ~~P~~~~~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~ 436 (511) T protein:vir:78 358 NTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSL 436 (511) T ss_pred CCccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCH Confidence 99975432211 111111 12234445555555555544443221 124555666677788 Q ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhccCC-----------------------cCCCCCCCccccccccCCCCCC Q lcl|NC_011801. 336 SELINNVQKLASAGVLAPIQAQKLLKNRG-----------------------VFPELDLDEGTNLLDNTKNIND 386 (386) Q Consensus 336 ~~~~~~~~~~~~~g~~t~nE~R~~lg~~p-----------------------~~p~~~~~~~~~~~~~~~~~~~ 386 (386) .+.++.+.++. |+++...+.++++.-+ .....+....++.-..+++.++ T Consensus 437 ~e~~d~~~kl~--G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (511) T protein:vir:78 437 IEELKAYIDSG--GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVD 508 (511) T ss_pred HHHHHHHHHHh--ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCCccCccc Confidence 89999988885 6777666655543211 0000111111122222222222 No 209 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=97.50 E-value=5.5e-05 Score=43.94 Aligned_cols=365 Identities=9% Similarity=0.016 Sum_probs=168.4 Q ss_pred Cchhhhhccc-----------cccCCccchhhhh------hcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCce Q lcl|NC_011801. 1 MAFLSNLFKR-----------QKMLSGSSPVWIL------NQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRF 63 (386) Q Consensus 1 Mg~~~~l~~~-----------~~~~~~~~~~~~~------~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~ 63 (386) ..+.++.... ..-..+.+.+... ..+..........+.+ +.+.-....|+..++-+-+-|+ T Consensus 7 ~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~k--i~~n~~k~Iv~~~~~yl~G~p~ 84 (470) T protein:vir:10 7 KKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNR--IPSNFYQLLVDQEAGYVASVFP 84 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcc--cccchHHHHHHhhhhheeccce Confidence 1111111000 0000000100000 0000000000001111 1233344566666666667776 Q ss_pred ee--cchhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCC--ceeEEE- Q lcl|NC_011801. 64 VT--NAQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYG--KDLTYT- 138 (386) Q Consensus 64 ~~--~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~~~~~~- 138 (386) .+ .+......|..--+. ...+-...+..++..+|.||..+-.+..|.+ .+..++|..+-+..+... ...... T Consensus 85 ~~~~~d~~~~~~l~~~~~~--~~~~~~~~l~~~~~~~G~a~~~~y~d~~~~~-~~~~~~p~~~~~v~d~~~~~~~~a~ir 161 (470) T protein:vir:10 85 DIDVGKDADNKKIIDVLGD--DRALTLNGLLVDSSNAGRAWLHYWIDEDGNF-RYGIIQPDQITPIYATTLDNKLLGILR 161 (470) T ss_pred eeecCchHHHHHHHHHHhh--hHHHHHHHHHHHHhhcCeeEEEEEecCCCce-EEEEEcccceEEEEcCCCCCceEEEEE Confidence 54 333333333221121 2334445677888999999999999988875 567789999888876542 122111 Q ss_pred -EeccCcccc----eeEEEcccceeeeccccc----------------------------cCc---------cccccccc Q lcl|NC_011801. 139 -VHFDDSKRS----GDFLYDSSEVIHFRCTVS----------------------------GES---------DTQYMGIP 176 (386) Q Consensus 139 -~~~~~~~~~----~~~~~~~~~vih~~~~~~----------------------------~~~---------~~~~~G~s 176 (386) +........ ....+....+.|++.... .++ .+...|.| T Consensus 162 ~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~s 241 (470) T protein:vir:10 162 SYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFSKNKYRLP 241 (470) T ss_pred EEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccccccccccccccCCCeeeEEEeecCCCCCC Confidence 111110100 011223333333321100 000 01125888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecC-----CCceeee Q lcl|NC_011801. 177 PIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLD-----QSADVET 251 (386) Q Consensus 177 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~-----~g~~~~~ 251 (386) .+..+...++....+..-..+.++..+.|-.+++.-+..-.++ ....+... +.+.++ .+.+++- T Consensus 242 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~----~~~~~~~~-------~~i~~~~~~~~~~~~~~~ 310 (470) T protein:vir:10 242 ELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLHQ----FMNDLRKY-------KSIKINNTGNGDNSGVDK 310 (470) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccch----hhhhhhhc-------CeEeccCCCCCcCceeEE Confidence 8888888888877777777777777777766665322111122 22222221 122221 2334444 Q ss_pred ccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHH--------------HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 252 TNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNIT--------------MIRAFYQSSLSIYIKPIESELSQ 317 (386) Q Consensus 252 ~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~--------------~~~~~~~~~l~P~~~~ie~~l~~ 317 (386) +........+....+...+.|...-++|..-. ...++-+.. ..+..+..+|+-.++.|...++. T Consensus 311 lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~--~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~~ 388 (470) T protein:vir:10 311 LQIDIPVEARDDALKITRKNIFLFGQGIDPAN--FESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNF 388 (470) T ss_pred EeecCChHHHHHHHHHHHHHHHHHhCCCCCCc--cccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 44433444567778888888988888885322 111111111 11234445555555555444332 Q ss_pred hh--hhhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCC-----------CccccccccCCCC Q lcl|NC_011801. 318 KL--GTDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDL-----------DEGTNLLDNTKNI 384 (386) Q Consensus 318 ~l--~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~-----------~~~~~~~~~~~~~ 384 (386) .- ...+++.+..-+..|..+.++.+.++ .|+++..-+.++++.-. +|.... +...+.-.....+ T Consensus 389 ~~~d~~~i~i~f~~~~p~d~~e~~~~~~~~--~g~iS~et~l~~~p~v~-D~~~E~eri~~E~~e~~~~~~~~~~~~~~~ 465 (470) T protein:vir:10 389 SDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIVD-DWQQELKDLAKDKEENDPYSNQADELNGKG 465 (470) T ss_pred cCcccceeeEEeccCCCCCHHHHHHHHHHH--hccCcHHHHHHhCCCCC-CHHHHHHHHHHHHHHHHHhhccccccCCCC Confidence 11 12445666667788999999998887 57888877776654211 111110 1111111222223 Q ss_pred CC Q lcl|NC_011801. 385 ND 386 (386) Q Consensus 385 ~~ 386 (386) .| T Consensus 466 ~d 467 (470) T protein:vir:10 466 VN 467 (470) T ss_pred CC Confidence 33 No 210 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=97.48 E-value=5.9e-05 Score=43.79 Aligned_cols=367 Identities=14% Similarity=0.084 Sum_probs=148.3 Q ss_pred Cchh----hhhccccccCCccchhhhhhcc---cccccCcccccHHHH---hccHHHHHHHHHHHHhhccCceeecch-- Q lcl|NC_011801. 1 MAFL----SNLFKRQKMLSGSSPVWILNQG---QPVSIKPKAITSAIA---LKNSDVYAVISRVSSDIAGCRFVTNAQ-- 68 (386) Q Consensus 1 Mg~~----~~l~~~~~~~~~~~~~~~~~~~---~~~~~~~~~i~~~~a---~~~~~v~~~v~~ia~~ia~~p~~~~~~-- 68 (386) |+-- .+|.+........- .-....+ ......+..+..+.. ..+.-...+|+..++.+-..++.+..+ T Consensus 1 ~~t~~~~i~~L~~~~~~~~~r~-~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~d~~ 79 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNL-LEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSE 79 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHH-HHHHHHHhccccccccccccchhHhhhhhhcchHHHHHHHHHhhhccCceecCCCch Confidence 4321 11111100000000 0000000 000011111111110 111223445555555443334444322 Q ss_pred ---hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeec------CCCceEEEEEEcCcceEEeecCC-C-ceeEE Q lcl|NC_011801. 69 ---PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRD------TNGYPVRIEPVPNEKVTVALDDY-G-KDLTY 137 (386) Q Consensus 69 ---~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~------~~g~~~~l~~l~~~~v~~~~~~~-~-~~~~~ 137 (386) .+..++.. |. .......+..+.+.+|.||..+.++ ..|.+ .+..++|..+.+..+.. . ....+ T Consensus 80 ~~~~l~~i~~~--N~---~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~-~i~~~~p~~~~~~~D~~~~~~~~~~ 153 (480) T protein:vir:78 80 GLEELWNWWQA--ND---LDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRA 153 (480) T ss_pred hHHHHHHHHHh--cC---HHHHHHHHHHHHhhcCceEEEEecCccccCCCCCee-EEEEEcccceEEEEcCCCccceEEE Confidence 23344432 32 2356677889999999999988653 34444 46788888888877643 1 11111 Q ss_pred E-Ee--ccC-cccceeEEEccc-----------------------------ceeeeccccccCcccccccccHHHH-HHH Q lcl|NC_011801. 138 T-VH--FDD-SKRSGDFLYDSS-----------------------------EVIHFRCTVSGESDTQYMGIPPIDS-LLN 183 (386) Q Consensus 138 ~-~~--~~~-~~~~~~~~~~~~-----------------------------~vih~~~~~~~~~~~~~~G~s~~~~-~~~ 183 (386) . +. ... ........+.+. +|+||.+. ...++.+|.|.+.- +.. T Consensus 154 i~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~---~~~~~~~G~s~i~~~v~~ 230 (480) T protein:vir:78 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTND---PRLGNRYGRSEISPELRK 230 (480) T ss_pred EEEEEeecCCCceEEEEEEeCCeEEEEEecCCCccccccccccccCCCCCcceEEeecc---cccCCccCcccchhhHHH Confidence 1 00 000 000000111112 23333311 12344578877653 444 Q ss_pred HHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecC-CCceeeeccCChhhHHHH Q lcl|NC_011801. 184 EIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLD-QSADVETTNISPNVTEFL 262 (386) Q Consensus 184 ~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~-~g~~~~~~~~~~~d~~~~ 262 (386) .++.............+-.+.|..+|. +....+...+.-...|... .+.++.++ ++.++.++....-+ .++ T Consensus 231 l~Da~~~~~s~~~~~~~~~a~p~~~i~--G~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~-~~~ 302 (480) T protein:vir:78 231 VTDAASRTLMNLQSASQILGTPLRVIS--GVTTDELTNDGENTTLDIY-----YGRILTLASEAAKISEFKAAELR-NFA 302 (480) T ss_pred HHHHHHHHHHHHHHHHHhhcchhhhhh--cCCccccccccccchhhhh-----hhhhccCCCCCceEEecCccCHH-HHH Confidence 455444443333444444455555443 3222222111111112211 12344443 45677666543222 267 Q ss_pred HHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHHHHHH----HHHHHhh-------h-------hhhh Q lcl|NC_011801. 263 QNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYIKPIE----SELSQKL-------G-------TDVK 324 (386) Q Consensus 263 e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~~~ie----~~l~~~l-------~-------~~~~ 324 (386) +..+..+.+|+..=++|+..++.......+.++.+. ....|.-.+...+ ..|.+.+ + ..++ T Consensus 303 ~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~-~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~g~~~~~~~~~i~ 381 (480) T protein:vir:78 303 EEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIA-TDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLE 381 (480) T ss_pred HHHHHHHHHHhcccCCChHHhccccCcchHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccceeee Confidence 888899999999999999999754322112222211 1111222222222 2222211 1 1123 Q ss_pred hcchhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhccCCc----------------------CCCCCCCcc-----c Q lcl|NC_011801. 325 LDIASAIDSDNSELINNVQKLASAG--VLAPIQAQKLLKNRGV----------------------FPELDLDEG-----T 375 (386) Q Consensus 325 fd~~~~l~~d~~~~~~~~~~~~~~g--~~t~nE~R~~lg~~p~----------------------~p~~~~~~~-----~ 375 (386) +.+.+....+..+.++.+.+++.+| +++..-+++.+|.-+- ...++.+.. + T Consensus 382 v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) T protein:vir:78 382 TVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) T ss_pred EEecCCCCCCHHHHHHHHHHHHHhccccCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCCCCC Confidence 3333344567778888888888765 4443333333321110 000000000 0 Q ss_pred cccccC---CCCCC Q lcl|NC_011801. 376 NLLDNT---KNIND 386 (386) Q Consensus 376 ~~~~~~---~~~~~ 386 (386) +....+ .++-. T Consensus 462 ~~~~~~~~~~~~~~ 475 (480) T protein:vir:78 462 ETKTETQTSPSGFN 475 (480) T ss_pred CCCCccccccCCCC Confidence 000000 00000 No 211 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=97.43 E-value=7e-05 Score=43.37 Aligned_cols=370 Identities=13% Similarity=0.140 Sum_probs=160.7 Q ss_pred Cchhh--hhccccccC------------Cccchhh--hhhccccc-c------cCcccccHHHHhccHHHHHHHHHHHHh Q lcl|NC_011801. 1 MAFLS--NLFKRQKML------------SGSSPVW--ILNQGQPV-S------IKPKAITSAIALKNSDVYAVISRVSSD 57 (386) Q Consensus 1 Mg~~~--~l~~~~~~~------------~~~~~~~--~~~~~~~~-~------~~~~~i~~~~a~~~~~v~~~v~~ia~~ 57 (386) |.++- .+...-+.. ....+.. ....+... . ......+.+ +.+.-...+|+..++- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~k--i~~n~~~~Iv~~~~~~ 78 (499) T protein:vir:10 1 MAVVIDKDLLDDVNEPNIEAINYAIRELQNRKKRLDKLSDYYNGKQEIEKHEFDNATVEAAN--VMVNHAKYITDMNVGF 78 (499) T ss_pred CccchhhhHHhhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcCCcCcCCCCcce--eecchHHHHHHHHhhh Confidence 44421 000000000 0000000 00000000 0 000000111 1123344566666666 Q ss_pred hccCceee--cchhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCce----------------EEEEEE Q lcl|NC_011801. 58 IAGCRFVT--NAQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYP----------------VRIEPV 119 (386) Q Consensus 58 ia~~p~~~--~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~----------------~~l~~l 119 (386) +-+-|+.+ ........|..-.+. -....+...+..+...+|.||.++..+.+|.+ ..+..+ T Consensus 79 l~g~p~~~~~~~~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~v 157 (499) T protein:vir:10 79 MTGNPVKYVAEKGKNIDDILEVFNQ-IDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLTPNTELKIEVI 157 (499) T ss_pred hcccCceeecCChhHHHHHHHHHhh-cCHhHHHHHHHHHHHhcCceEEEEEecccccccccccccccccccccceEEEEE Confidence 65666654 233333333221111 12334677788889999999999988887753 346677 Q ss_pred cCcceEEeecCCCce-----eEEEEecc-Ccccc--eeEEEcccceeeeccccc-------------cCc---------c Q lcl|NC_011801. 120 PNEKVTVALDDYGKD-----LTYTVHFD-DSKRS--GDFLYDSSEVIHFRCTVS-------------GES---------D 169 (386) Q Consensus 120 ~~~~v~~~~~~~~~~-----~~~~~~~~-~~~~~--~~~~~~~~~vih~~~~~~-------------~~~---------~ 169 (386) +|..+-+..+..... +.|+.... ..... ....+.+..+.+++.... +++ . T Consensus 158 ~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~ 237 (499) T protein:vir:10 158 DPRATVVVCDDTVEHDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTMEVSANDPIVYDGENLFGAVPIIEFR 237 (499) T ss_pred cccceEEEecCCCCcceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCCccccCcceecccccCCCCccceEEec Confidence 887776665543221 11111110 00000 001223333333211000 000 0 Q ss_pred cccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCccee--cCCCc Q lcl|NC_011801. 170 TQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVV--LDQSA 247 (386) Q Consensus 170 ~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~v--l~~g~ 247 (386) +...|.|.+..+...++....+.....+.++..+.|..+++ +...+++... ...+. .+.+.. .++|. T Consensus 238 n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~--G~~~~~~~~~--~~~~~-------~~~~~~~~~~~~~ 306 (499) T protein:vir:10 238 NNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTF--GFGLGDDKDD--IQRLK-------RGAIEAPPREEGA 306 (499) T ss_pred CCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeee--cCccccccch--hhhhh-------hcceeccCCCCCC Confidence 12357788888888888777776666666677777766654 3333221110 11111 112222 34556 Q ss_pred eeeeccCChhhHHHHHHHHHHHHHHHHHhCCCH---HHhcCCcCcccH----------HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 248 DVETTNISPNVTEFLQNVSFSQDQIAKAFGIPA---DYLSGKQDAQSN----------ITMIRAFYQSSLSIYIKPIESE 314 (386) Q Consensus 248 ~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~---~~l~~~~~~~~~----------~~~~~~~~~~~l~P~~~~ie~~ 314 (386) +++.+........+....+...+.|...-++|. .-++...++... .+..+..+..++.-+++.+... T Consensus 307 d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~ 386 (499) T protein:vir:10 307 DIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMNDEKFMGNVSGEAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTI 386 (499) T ss_pred cceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCchhhcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 666665544455566777788888888888774 222111111000 1112234444455555544444 Q ss_pred HHHhh----hhhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccC-----------------------Cc-- Q lcl|NC_011801. 315 LSQKL----GTDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNR-----------------------GV-- 365 (386) Q Consensus 315 l~~~l----~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~-----------------------p~-- 365 (386) ++..- ...+++.+..-+..|..+.++.+.++ .|+++..-+.++++.- +. T Consensus 387 ~~~~~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~ 464 (499) T protein:vir:10 387 VNIKGANDDASGCKISLVANIPSNLSDVVNNVKNA--DGIIPRKYTYSWLPDVDNPQDVIDEMNQQDAETIKKNQEALRG 464 (499) T ss_pred HhccCCccccccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhcc Confidence 33211 12334555555677888999998888 5677655444433210 00 Q ss_pred -CCCCC----CCccccccccCCCCCC Q lcl|NC_011801. 366 -FPELD----LDEGTNLLDNTKNIND 386 (386) Q Consensus 366 -~p~~~----~~~~~~~~~~~~~~~~ 386 (386) .|... ..+..++-....+.++ T Consensus 465 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (499) T protein:vir:10 465 QDPDRLELEDKQDDSSENDKEAGSNH 490 (499) T ss_pred CCCCCCCCCCCCcccCCCCCCCcccc Confidence 01010 1111111122222111 No 212 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=97.42 E-value=7.1e-05 Score=43.34 Aligned_cols=369 Identities=8% Similarity=0.076 Sum_probs=159.7 Q ss_pred Cchhhhh---cccc-------ccC------Cccchh--------hh-hhcccccccCcccccHH----HHhccHHHHHHH Q lcl|NC_011801. 1 MAFLSNL---FKRQ-------KML------SGSSPV--------WI-LNQGQPVSIKPKAITSA----IALKNSDVYAVI 51 (386) Q Consensus 1 Mg~~~~l---~~~~-------~~~------~~~~~~--------~~-~~~~~~~~~~~~~i~~~----~a~~~~~v~~~v 51 (386) ||+|+++ |++. +.. ....+. +. +..+.+.++....+... ..+.......+. T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~~~~~i~ 80 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLNLRKLSA 80 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccccceeecCcHHHHH Confidence 9998765 2221 000 000010 00 00111111111111000 011111122333 Q ss_pred HHHHHhhcc--Cceeecch-----------hHHHHHhc--cCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEE Q lcl|NC_011801. 52 SRVSSDIAG--CRFVTNAQ-----------PITDVLNA--PLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRI 116 (386) Q Consensus 52 ~~ia~~ia~--~p~~~~~~-----------~~~~~l~~--~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l 116 (386) ..+|+.+-. +.+.+.+. .....|+. +-|. ....++..+.+.+..|.+++.+..+.. . +.+ T Consensus 81 ~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~---f~~~~~~~~e~a~a~G~~a~k~~~d~~-~-~~I 155 (517) T protein:vir:98 81 DVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNK---FIKNLSDYLEPTFALGGLTVRPYVDNG-E-IEF 155 (517) T ss_pred HHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHHhcc---HHHHHHHHHHHHhhhCCEEEEEEEeCC-e-eEE Confidence 333333322 12232211 11222211 0011 223344455666667887877766543 2 335 Q ss_pred EEEcCcceEEee-cCCCc----------------eeEEE---Ee-------------c--------cCcccceeE----- Q lcl|NC_011801. 117 EPVPNEKVTVAL-DDYGK----------------DLTYT---VH-------------F--------DDSKRSGDF----- 150 (386) Q Consensus 117 ~~l~~~~v~~~~-~~~~~----------------~~~~~---~~-------------~--------~~~~~~~~~----- 150 (386) ..++++.+-+.. +..+. ..+|. ++ + .....|.++ T Consensus 156 ~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v~L~~~ 235 (517) T protein:vir:98 156 SWALANAFYPLRSNSNGISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEGEIGKRIPLEEL 235 (517) T ss_pred EEEcCCeeEEEEecCCCeEEEEEEEEEEEeecCCceEEEEEEEEecCceeccCCcEEEEEEEEecCCCcccccccccccc Confidence 556666554321 11110 00010 00 0 000011111 Q ss_pred --------EEcc--cc-eeeeccccccCcc-cccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCC-- Q lcl|NC_011801. 151 --------LYDS--SE-VIHFRCTVSGESD-TQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATL-- 216 (386) Q Consensus 151 --------~~~~--~~-vih~~~~~~~~~~-~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~-- 216 (386) .++. .+ ..|++....+... +...|+|....+...++..+..-.....-|+.|.+ +.++ +...+ T Consensus 236 ~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~-~i~v--p~~~l~~ 312 (517) T protein:vir:98 236 YEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKMGQR-TVFV--SDVMLRT 312 (517) T ss_pred ccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCc-ceec--Chhhhcc Confidence 1111 01 1233322111111 22479999999999999888777666677777554 3332 22111 Q ss_pred --CHHHHHHHHHHHH---HHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCccc Q lcl|NC_011801. 217 --GKEAKENTRQSFE---EQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQS 291 (386) Q Consensus 217 --~~~~~~~~k~~~~---~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~ 291 (386) +..... ....|+ ..+.+-+ .-+++-.++.++....+-++.+..+...+.|+...|+++..++..+.+.. T Consensus 313 ~~~~~g~~-~~~~~d~~~~~y~~~~-----~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~~k 386 (517) T protein:vir:98 313 VPDESGMP-PPQVFDPDVNVYKSIR-----MGTDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRSMK 386 (517) T ss_pred ccCCCCcc-cCCCCCcccceeeecc-----CCCCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCcccccccccccc Confidence 000000 000000 0011000 00122346667777777889999999999999999999999987655432 Q ss_pred -HHHHH--HHHHHHHHHHHHHHHHHHHHHh------------h-------hhhhhhcchhhhccCHHHHHHHHHHHHhCC Q lcl|NC_011801. 292 -NITMI--RAFYQSSLSIYIKPIESELSQK------------L-------GTDVKLDIASAIDSDNSELINNVQKLASAG 349 (386) Q Consensus 292 -~~~~~--~~~~~~~l~P~~~~ie~~l~~~------------l-------~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g 349 (386) ..+.. +.-.-.++.-+...++.+|... + ...+.+++++-+..|.++.++...+++.+| T Consensus 387 TATEi~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~D~~~~~~~~~~~v~aG 466 (517) T protein:vir:98 387 TATEIVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQDRSALLRFYGQAKTFG 466 (517) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCCCHHHHHHHHHHHHhcC Confidence 22211 1111123333333333333321 1 123567888888999999999999999999 Q ss_pred CcCHHHHHHHhccCCcCCCCCCC--------c--cccccc---cCCCCCC Q lcl|NC_011801. 350 VLAPIQAQKLLKNRGVFPELDLD--------E--GTNLLD---NTKNIND 386 (386) Q Consensus 350 ~~t~nE~R~~lg~~p~~p~~~~~--------~--~~~~~~---~~~~~~~ 386 (386) +|++-+++.++- +.. +++.. | ..++.. ...+..+ T Consensus 467 ~ms~~~~i~~~~--g~~-eeeA~~e~~~i~~E~~~~~~~~~~~~~~~~~~ 513 (517) T protein:vir:98 467 FIPTVEAIQRIF--KVP-KKTAEQWLEEIRKDQIELDPVTISQRAQKRMF 513 (517) T ss_pred CCCHHHHHHHhC--CCC-hHHHHHHHHHHHHhccccCCCCccccccCCCC Confidence 998888765541 111 11100 0 001111 1111111 No 213 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=97.42 E-value=7.1e-05 Score=43.33 Aligned_cols=379 Identities=12% Similarity=0.048 Sum_probs=169.4 Q ss_pred Cch---hhhhccccccC-Ccc-chhhhhhccc-ccccCcc----cccHHHHhccHHHHHHHHHHHHhhccCceeecc--- Q lcl|NC_011801. 1 MAF---LSNLFKRQKML-SGS-SPVWILNQGQ-PVSIKPK----AITSAIALKNSDVYAVISRVSSDIAGCRFVTNA--- 67 (386) Q Consensus 1 Mg~---~~~l~~~~~~~-~~~-~~~~~~~~~~-~~~~~~~----~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~~--- 67 (386) +.. +.++-.+.... ... ........+. ....... ......-+.++-...+|+..++-+-+-|+.+.- T Consensus 38 ~~~~~~i~~~i~~~~~~~~~r~~~~~~yY~g~~~~i~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~~~ 117 (501) T protein:vir:96 38 VNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDN 117 (501) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccCccccCccccccceeecchHHHHHHHHhhhhcccCeeEeeCCc Confidence 221 11111000000 000 0000000000 0000000 000011123444556777777766666765531 Q ss_pred ---hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCC--ceeEEE--Ee Q lcl|NC_011801. 68 ---QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYG--KDLTYT--VH 140 (386) Q Consensus 68 ---~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~~~~~~--~~ 140 (386) ..+...|.. ....-........+..+++.+|.||+.+.++.+|.+ .+..++|..+.+..+... ...... +. T Consensus 118 ~~~~~~~~~l~~-~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~ 195 (501) T protein:vir:96 118 DDNSQNDDAIKR-IGRINDLDSLNRTLIRDLSQTGRAYEVIYRSEYDET-RIKRLSPLETFVIYDNSLEDNSIAAVRYYN 195 (501) T ss_pred cchhHHHHHHHH-HHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcCCCce-EEEEEccceeEEEEcCCCCCceEEEEEEEE Confidence 223333322 222223445777889999999999999999988876 567789999988876532 222111 11 Q ss_pred ccC--cccceeEEEcccceeeecccc-------ccCc---------ccccccccHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_011801. 141 FDD--SKRSGDFLYDSSEVIHFRCTV-------SGES---------DTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHA 202 (386) Q Consensus 141 ~~~--~~~~~~~~~~~~~vih~~~~~-------~~~~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng 202 (386) ... ........+.++.+.++.... .+++ .+...|.|.+..+...++....+.....+.+... T Consensus 196 ~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~ 275 (501) T protein:vir:96 196 RGTLQSAKDVVEIYTDEHIYTLDASDDFNEISVTTHAFGTVPITEYLNNIDGIGDYETELYLIDLYDSAESDTANHMSDM 275 (501) T ss_pred eecCCCcEEEEEEEcCCcEEEEeeCCCceeccccccCCCccceEEecCCccCCCchhhhHHHHHHHHHHHHHHHHHHHHh Confidence 101 000011112222222211000 0000 0123588888888888888777777777777777 Q ss_pred CCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHH Q lcl|NC_011801. 203 IKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADY 282 (386) Q Consensus 203 ~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~ 282 (386) +.|..++.........+....++..- ...-...+.......+.++.-++....+..+....+...+.|+..-++|..- T Consensus 276 ~~~~l~i~G~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~ 353 (501) T protein:vir:96 276 ADAILAIYGDLALPKGMQASDMKRTR--LMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMS 353 (501) T ss_pred cCceeeeecccccCcccchhhhhhcC--eeeecccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccC Confidence 77766654321111122222221100 0000111112223344555555544455556777888889999999998644 Q ss_pred hcCCcCcccHHH-------------HHHHHHHHHHHHHHHHHHHHHHHhh-h-----hhhhhcchhhhccCHHHHHHHHH Q lcl|NC_011801. 283 LSGKQDAQSNIT-------------MIRAFYQSSLSIYIKPIESELSQKL-G-----TDVKLDIASAIDSDNSELINNVQ 343 (386) Q Consensus 283 l~~~~~~~~~~~-------------~~~~~~~~~l~P~~~~ie~~l~~~l-~-----~~~~fd~~~~l~~d~~~~~~~~~ 343 (386) .+........++ ..+..+..+|+-.++.+...++..- + ..+++.+...+..|.++.++++. T Consensus 354 ~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~ad~~~ 433 (501) T protein:vir:96 354 DTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILT 433 (501) T ss_pred cccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccceEEeCCCCCcCHHHHHHHHH Confidence 432211111111 1123444445554444444433321 1 12445566667778899999988 Q ss_pred HHHhCCCcCHHHHHHHhccCCcCCCCC----------------CCc----cc-cccccCCCCCC Q lcl|NC_011801. 344 KLASAGVLAPIQAQKLLKNRGVFPELD----------------LDE----GT-NLLDNTKNIND 386 (386) Q Consensus 344 ~~~~~g~~t~nE~R~~lg~~p~~p~~~----------------~~~----~~-~~~~~~~~~~~ 386 (386) ++. |+++..-+.++++.-. +|... .+. .+ ..-.++....| T Consensus 434 kl~--g~iS~et~~~~l~~v~-D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~e~~~d 494 (501) T protein:vir:96 434 GLG--GQVSQETALSLSGLVE-SPNEELDKINKEMSEIDFKGYSNDFNEHVGKYTDEVKETHTD 494 (501) T ss_pred HHh--ccCchHHHHHhCCCCC-CHHHHHHHHHHHHHHhhccccccchhhcccccCCcCCCCCCC Confidence 885 6676555544433110 00000 000 00 00001111111 No 214 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=97.35 E-value=8.8e-05 Score=42.81 Aligned_cols=361 Identities=12% Similarity=0.066 Sum_probs=165.3 Q ss_pred Cc------------------hhhhhcccccc-----------CCccchhhhhhcccccccCcccccHHHHhccHHHHHHH Q lcl|NC_011801. 1 MA------------------FLSNLFKRQKM-----------LSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVI 51 (386) Q Consensus 1 Mg------------------~~~~l~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v 51 (386) |. .+.++...... ..+.+.+. ......... ...+ +..+-....| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~----~~~~~~~~~-~~~k--i~~n~~~~iv 73 (452) T protein:vir:36 1 MKYKPPKLMTFSKDEPITVEVVTKFMEKHKLEVARYEYLKNMYLGIMAID----DEPAKDSWK-PDNR--LAVNFTKYIV 73 (452) T ss_pred CcccCceeEEcCCccCCCHHHHHHHHHHHHHHHHHHHHHHHHhccccccc----cCccccccC-ccce--eecchHHHHH Confidence 11 11111100000 00000000 000000000 0111 1233444566 Q ss_pred HHHHHhhccCceee--cchhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeec Q lcl|NC_011801. 52 SRVSSDIAGCRFVT--NAQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALD 129 (386) Q Consensus 52 ~~ia~~ia~~p~~~--~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~ 129 (386) +..+.-+-+-|+.+ .+......|..--+ .-........+..+.+.+|.||..+..+.+|.+ .+..++|..+.+..+ T Consensus 74 d~~~~~l~g~~~~~~~~d~~~~~~l~~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d 151 (452) T protein:vir:36 74 DTFTGYFNGIPVKKSHSDKEILTKLQEFDN-LNDMEDEESELAKMACIYGRAFEFLYQDEDTQT-NVVYNSPENMFMVYD 151 (452) T ss_pred HHHhhhhcccCceeecCChhHHHHHHHHHh-hcChhHHHHHHHHHHHhcCeEEEEEEecCCCee-EEEEEcccceEEEEc Confidence 66666665666554 33333333322111 112334667788889999999999999888876 467788988887766 Q ss_pred CCCc--eeEEE-EeccCcccceeEEEcccceeeeccccc--------cCc---------ccccccccHHHHHHHHHHHHH Q lcl|NC_011801. 130 DYGK--DLTYT-VHFDDSKRSGDFLYDSSEVIHFRCTVS--------GES---------DTQYMGIPPIDSLLNEIEVQD 189 (386) Q Consensus 130 ~~~~--~~~~~-~~~~~~~~~~~~~~~~~~vih~~~~~~--------~~~---------~~~~~G~s~~~~~~~~i~~~~ 189 (386) .... ..... +............+....++++..... +++ .+...|.|.+......++... T Consensus 152 ~~~~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~sd~e~v~~liDa~d 231 (452) T protein:vir:36 152 DTVKQEPLFAVRYGVDEDKKLQGEVYTLLETIKISGENDEISFGEGTYNPYPDLPVVEFYFNEERMSIFESVISLVNAFN 231 (452) T ss_pred CCCCCceEEEEEEEEecCceEEEEEEecCeEEEEEEcCCceEEecceeccCCcccEEEecCCCCCCcchHHHHHHHHHHH Confidence 5321 22111 111110111111122222222211000 000 111357788887777777777 Q ss_pred HHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecC-----CCceeeeccCChhhHHHHHH Q lcl|NC_011801. 190 LSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLD-----QSADVETTNISPNVTEFLQN 264 (386) Q Consensus 190 ~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~-----~g~~~~~~~~~~~d~~~~e~ 264 (386) .+.....+.++..+.|..+++ +...+++....++. ++++.++ .+.++..+.....+..+... T Consensus 232 ~~~s~~~~~~~~~~~p~~~~~--g~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 298 (452) T protein:vir:36 232 KAISEKANDVDYFSDQYLTFL--GAAVEEEDLKNIRS-----------NRVINYYADGEGKNVDVKFLEKPDSDSQTENL 298 (452) T ss_pred HHHHHHHHHHHHhcCceeEee--cCCcCchhhhhhhh-----------cceEEecCCCCccCCcceeEeecCCHHHHHHH Confidence 776666666677777766554 44445544333221 1122222 12334444444445556777 Q ss_pred HHHHHHHHHHHhCCCHHHhcCCcCcccH------------HHHHHHHHHHHHHHHHHHHHHHHHHhh----hhhhhhcch Q lcl|NC_011801. 265 VSFSQDQIAKAFGIPADYLSGKQDAQSN------------ITMIRAFYQSSLSIYIKPIESELSQKL----GTDVKLDIA 328 (386) Q Consensus 265 ~~~~~~~Ia~~~gvp~~~l~~~~~~~~~------------~~~~~~~~~~~l~P~~~~ie~~l~~~l----~~~~~fd~~ 328 (386) .+...+.|+..-++|..-.+..++.+.. ....+..+..++...++.+...+...- ...+++.+. T Consensus 299 ~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~i~f~ 378 (452) T protein:vir:36 299 LDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNVSNKDSWKDIEYTFT 378 (452) T ss_pred HHHHHHHHHHHhCccccCcccccCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeC Confidence 8888999999999986322221221110 011123444455555554444333221 123455555 Q ss_pred hhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCC-----------------ccccccccCCCCCC Q lcl|NC_011801. 329 SAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLD-----------------EGTNLLDNTKNIND 386 (386) Q Consensus 329 ~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~-----------------~~~~~~~~~~~~~~ 386 (386) .-+..|..+.++.+.++ .|+++..-+.++++.-. +|..... .+++........+| T Consensus 379 ~~~p~d~~~~a~~~~k~--~g~iS~et~~~~~~~~~-d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~ 450 (452) T protein:vir:36 379 RNEPKDIKEQAETANIL--MGITSQETALSVISVIP-DVQAEMEKIKKEEASTAIFDKDKQPSEKGTDTVVSETN 450 (452) T ss_pred CCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCC-CHHHHHHHHHHHHHHHHHHHhhccCCCCcccccCcccc Confidence 66777889999998887 57788766666654211 0110000 00011111111111 No 215 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=97.30 E-value=0.0001 Score=42.49 Aligned_cols=366 Identities=13% Similarity=0.077 Sum_probs=149.7 Q ss_pred Cchh----hhhccccccCCccchhhhhhcc---cccccCcccccHHHH---hccHHHHHHHHHHHHhhccCceeecch-- Q lcl|NC_011801. 1 MAFL----SNLFKRQKMLSGSSPVWILNQG---QPVSIKPKAITSAIA---LKNSDVYAVISRVSSDIAGCRFVTNAQ-- 68 (386) Q Consensus 1 Mg~~----~~l~~~~~~~~~~~~~~~~~~~---~~~~~~~~~i~~~~a---~~~~~v~~~v~~ia~~ia~~p~~~~~~-- 68 (386) |+-- .+|.+........-.. ....+ ......+..+..+.. ..+.-..-+|+..++.+-..++.+..+ T Consensus 1 ~~t~~d~i~~L~~~~~~~~~r~~~-~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~d~~ 79 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLE-AEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSE 79 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHH-HHHHHhccccchhcccccchhhhhhhhhcchHHHHHHHHHhhhccCceecCCCch Confidence 4431 1111110000000000 00000 000001111111110 011122345555555443334444322 Q ss_pred ---hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeec------CCCceEEEEEEcCcceEEeecCCC--ceeEE Q lcl|NC_011801. 69 ---PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRD------TNGYPVRIEPVPNEKVTVALDDYG--KDLTY 137 (386) Q Consensus 69 ---~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~------~~g~~~~l~~l~~~~v~~~~~~~~--~~~~~ 137 (386) .+..++.. |. .......+..+.+.+|.||+.+.++ .+|.+ .+..++|..+.+..+... ....+ T Consensus 80 ~~~~l~~i~~~--N~---~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~i~D~~~~~~~~~~ 153 (480) T protein:vir:78 80 GLEELWNWWQA--ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRA 153 (480) T ss_pred hHHHHHHHHHh--cC---HHHHHHHHHHHHhhcCceEEEeecCccccCCCCCee-EEEEEcccceEEEEcCCCccceEEE Confidence 23344432 32 3446677889999999999888653 34444 577888999888876532 11111 Q ss_pred E-Ee--ccCcc-cceeEEEccc-----------------------------ceeeeccccccCcccccccccHHHH-HHH Q lcl|NC_011801. 138 T-VH--FDDSK-RSGDFLYDSS-----------------------------EVIHFRCTVSGESDTQYMGIPPIDS-LLN 183 (386) Q Consensus 138 ~-~~--~~~~~-~~~~~~~~~~-----------------------------~vih~~~~~~~~~~~~~~G~s~~~~-~~~ 183 (386) . +. ..+.+ ......+.++ +|+||.+ ....++.+|.|.+.- +.. T Consensus 154 i~~~~~~d~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n---~~~~~~~~G~sdi~~~i~~ 230 (480) T protein:vir:78 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTN---DPRLGNRYGRSEISPELRK 230 (480) T ss_pred EEEEEeecCCcceEEEEEEeCCeEEEEEecCCCcccccccccccccCCCCcceEEeec---ccccCCccCccchhHHHHH Confidence 1 10 00000 0001112222 2333331 112344568876653 444 Q ss_pred HHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecC-CCceeeeccCChhhHHHH Q lcl|NC_011801. 184 EIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLD-QSADVETTNISPNVTEFL 262 (386) Q Consensus 184 ~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~-~g~~~~~~~~~~~d~~~~ 262 (386) .++.............+-.+.|..++. +..+++...+.-...|... .+.++.++ ++.+|.++....-+ .++ T Consensus 231 l~Da~~~~~s~~~~~~~~~a~p~~~i~--G~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~-~~~ 302 (480) T protein:vir:78 231 VTDAASRTLMNLQSASQILGTPLRVIS--GVTTDELTNDGENTTLDIY-----YGRILTLASEAAKISEFKAAELR-NFA 302 (480) T ss_pred HHHHHHHHHHHHHHHHHhhcchhhhhh--CCCccccccccccchhhhh-----hhhhccCCCCCceEEecCccCHH-HHH Confidence 445444443333333344445544443 3222221111111112211 12344544 34667665543322 378 Q ss_pred HHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhh-----------h-------hhhh Q lcl|NC_011801. 263 QNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIRAFYQSSLSIYIKPIESELSQKL-----------G-------TDVK 324 (386) Q Consensus 263 e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l-----------~-------~~~~ 324 (386) +..+..+.+|+..=++|+..++.......+..+.+ +....|.-.+...+..|...| + ..++ T Consensus 303 ~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~-~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~~~~~~~~~~~i~ 381 (480) T protein:vir:78 303 EEMEVFRKEAASITGLPPQYLSSSSENPASAEAII-ATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLE 381 (480) T ss_pred HHHHHHHHHHhcccCCCHHHhccccCchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccceeee Confidence 88899999999999999999985432111222221 111222222222222222211 1 1223 Q ss_pred hcchhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhccCCcCCCCCC------------------CccccccccCCCC Q lcl|NC_011801. 325 LDIASAIDSDNSELINNVQKLASAG--VLAPIQAQKLLKNRGVFPELDL------------------DEGTNLLDNTKNI 384 (386) Q Consensus 325 fd~~~~l~~d~~~~~~~~~~~~~~g--~~t~nE~R~~lg~~p~~p~~~~------------------~~~~~~~~~~~~~ 384 (386) +.+.+....+..+.++.+.+++.+| +++..-+++++|.-+- +...+ .++.-.-.+..+. T Consensus 382 v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d-~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (480) T protein:vir:78 382 TVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTAT-QREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTV 460 (480) T ss_pred EEecCCCCCCHHHHHHHHHHHHHhcccCCCHHHHHhcCCCCHh-HHHHHHHHHHHHHHHHHHHhhccccCCCccccCCCC Confidence 3444444567778888888888765 4554444544442210 00000 0000000000111 Q ss_pred CC Q lcl|NC_011801. 385 ND 386 (386) Q Consensus 385 ~~ 386 (386) .| T Consensus 461 ~~ 462 (480) T protein:vir:78 461 TE 462 (480) T ss_pred CC Confidence 11 No 216 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=97.28 E-value=0.00011 Score=42.32 Aligned_cols=367 Identities=10% Similarity=0.023 Sum_probs=155.0 Q ss_pred Cc---------hhhhhcccc------------------c-----------cCCccchhhhhhcccccccCcccccHHHHh Q lcl|NC_011801. 1 MA---------FLSNLFKRQ------------------K-----------MLSGSSPVWILNQGQPVSIKPKAITSAIAL 42 (386) Q Consensus 1 Mg---------~~~~l~~~~------------------~-----------~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~ 42 (386) |. |+.+.|+.. . -..+.+.+.................+..=+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki 80 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIKPQYETQEEMILRLITKHKENVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDWRM 80 (468) T ss_pred CccccCCcCceeehheeecccccccCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccccccccccccccccc Confidence 10 011111000 0 000000000000000000000000001111 Q ss_pred ccHHHHHHHHHHHHhhccCceee--cchhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEc Q lcl|NC_011801. 43 KNSDVYAVISRVSSDIAGCRFVT--NAQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVP 120 (386) Q Consensus 43 ~~~~v~~~v~~ia~~ia~~p~~~--~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~ 120 (386) .++-...+++..++-+-+-|..+ .+....+.|..--+. ........+..+...+|.+|+.+..+.+|.+ .+..++ T Consensus 81 ~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n--~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~ 157 (468) T protein:vir:96 81 YTNYHQNLVDQKVAYAVANPVTYGTEDEKSLKTIQEVLNH--KWDDKLVDILTAASNKGVEWIQPYVDEQGEF-KTFRVP 157 (468) T ss_pred ccchHHHHHHHHHhhhccCCceeccCChHHHHHHHHHHhc--CHHHHHHHHHHHHhhcCeEEEEEEEcCCCce-EEEEEc Confidence 23344556666666665666654 233333333221111 2344556688889999999999888888765 577788 Q ss_pred CcceEEeecCC--CceeEEEEeccCcccceeEEEcccceeeeccccc----------------------cCc-------- Q lcl|NC_011801. 121 NEKVTVALDDY--GKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVS----------------------GES-------- 168 (386) Q Consensus 121 ~~~v~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~----------------------~~~-------- 168 (386) |..+.+..+.. +....+...+..........+....+.+++.... +++ T Consensus 158 p~~~~~v~~~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~ 237 (468) T protein:vir:96 158 AEQAIPIWTNKERDELKAFIRLYELDGGERVEYWTANDVTFYELKDGQLIPDYYQGEEHVQAHYYVGNKSMSWNRVPFIP 237 (468) T ss_pred ccceEEEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCceeecccccccccccceeeccccccCCcccEEE Confidence 88887776532 1222111111111111111222333332221100 000 Q ss_pred -ccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecC--C Q lcl|NC_011801. 169 -DTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLD--Q 245 (386) Q Consensus 169 -~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~--~ 245 (386) .+...|.|.+..+...++....+..-..+.++..+.|..+++ +....+ .+.+...++ .++++.++ + T Consensus 238 ~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~--g~~~~~--~~~~~~~~~-------~~~~i~~~~d~ 306 (468) T protein:vir:96 238 FKNNPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLK--GYEGED--LEEFMYNLK-------YYKAINVDGDG 306 (468) T ss_pred ecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee--cCCccc--cchhhhhhh-------cCceEEecCCC Confidence 012357888888777777777766666666777777765554 322221 111111111 12344442 3 Q ss_pred CceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHH--------------HHHHHHHHHHHHHHH Q lcl|NC_011801. 246 SADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMI--------------RAFYQSSLSIYIKPI 311 (386) Q Consensus 246 g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~--------------~~~~~~~l~P~~~~i 311 (386) +.+++.+........+....+...+.|...-++|..-.. ...++-+..+. +..+...++-+++.+ T Consensus 307 ~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li 385 (468) T protein:vir:96 307 SGGVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQQD-KFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYI 385 (468) T ss_pred CCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccccc-ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 344444444444555677788888999999898863221 11111121121 122233333333332 Q ss_pred HHHHHHhh-hhhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCC----cccc--cc-ccCCC Q lcl|NC_011801. 312 ESELSQKL-GTDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLD----EGTN--LL-DNTKN 383 (386) Q Consensus 312 e~~l~~~l-~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~----~~~~--~~-~~~~~ 383 (386) ...+.... ...+++.++.-+..|..+.+++ +..+|+++.-.+.++++.-. +|..... |-.. .. ..-.+ T Consensus 386 ~~~~g~~~d~~~i~i~f~~~~p~d~~e~a~~---~~~~g~iS~et~i~~l~~v~-D~~~E~~ri~~E~~~~~~~~~~~~~ 461 (468) T protein:vir:96 386 IDFYKLSIKVQDVEITFNFNVMVNELEQSQI---GVNSQYLSKETVVTNHPWVD-DPVAEMERIDQEELALPSIEEGLNG 461 (468) T ss_pred HHHhCCCcccceeeEEecCCCCcCHHHHHHH---HHhcCCCchHHHHHhCCCCC-CHHHHHHHHHHHHHHHHHHhhccCC Confidence 22211110 0122333444445555555554 56679999888877653211 0111100 0000 00 11111 Q ss_pred C-CC Q lcl|NC_011801. 384 I-ND 386 (386) Q Consensus 384 ~-~~ 386 (386) . +| T Consensus 462 ~~~~ 465 (468) T protein:vir:96 462 KENN 465 (468) T ss_pred CCCC Confidence 1 11 No 217 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=97.20 E-value=0.00013 Score=41.86 Aligned_cols=372 Identities=11% Similarity=0.098 Sum_probs=163.4 Q ss_pred Cch------hhhhcccc------c-------cCCccchhhhhhccccccc---CcccccHHHHhccHHHHHHHHHHHHhh Q lcl|NC_011801. 1 MAF------LSNLFKRQ------K-------MLSGSSPVWILNQGQPVSI---KPKAITSAIALKNSDVYAVISRVSSDI 58 (386) Q Consensus 1 Mg~------~~~l~~~~------~-------~~~~~~~~~~~~~~~~~~~---~~~~i~~~~a~~~~~v~~~v~~ia~~i 58 (386) |-+ |..-+... + =..+.+.+........... ......+..=+.+.-..-.|+..++-+ T Consensus 8 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~k~Ivd~~~~yl 87 (537) T protein:vir:78 8 KPIDQLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFFTELVDQLAQYL 87 (537) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccchHHHHHHHHhhhh Confidence 111 11100000 0 0000010000000000000 000000000011223344666666666 Q ss_pred ccCceeecc-----hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCc Q lcl|NC_011801. 59 AGCRFVTNA-----QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGK 133 (386) Q Consensus 59 a~~p~~~~~-----~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~ 133 (386) -+-|+++.- ..+...|+..-+ .........+..++..+|.||.++-.+.+|.+ .+..++|..+-++.+.... T Consensus 88 ~G~Pv~~~~~d~~~~e~~~~l~~~~~--~~~~~~~~el~~~~s~~G~ay~~~y~de~~~~-~~~~i~p~~~~pv~d~~~~ 164 (537) T protein:vir:78 88 LSNGVEVKVKDEDNTQLDEILQEYFD--EDFQATIDTLVTNASKKGFEGIFARTTSEGKL-KFQTVDGLTLIPVFDDYGV 164 (537) T ss_pred cccCceeecCcchhHHHHHHHHHHhh--ccHHHHHHHHHHHHhhcCeeEEEeeecCCCce-EEEEEccceeEEEEcCCCC Confidence 677776532 234445543222 12234556777889999999999988888865 4677888888777665433 Q ss_pred eeE----EEEeccCcc--c----ceeEEEcccceeeecccccc------------------------------------- Q lcl|NC_011801. 134 DLT----YTVHFDDSK--R----SGDFLYDSSEVIHFRCTVSG------------------------------------- 166 (386) Q Consensus 134 ~~~----~~~~~~~~~--~----~~~~~~~~~~vih~~~~~~~------------------------------------- 166 (386) ... |........ . .....+.+..|.+++..... T Consensus 165 ~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 244 (537) T protein:vir:78 165 LKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAIEESTDADFEDTDGYQ 244 (537) T ss_pred ceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccceeeecccccccccccccccc Confidence 211 110000000 0 00112334444443211100 Q ss_pred ---Ccc---------cccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcc Q lcl|NC_011801. 167 ---ESD---------TQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTG 234 (386) Q Consensus 167 ---~~~---------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~ 234 (386) +++ +...|.|.+..+...++....+....++.++..+.|-.++. +....+ ...++..+.. T Consensus 245 ~~~~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~--g~~~~~--~~~~~~~l~~---- 316 (537) T protein:vir:78 245 VLGRSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVK--GFSGDS--TDKLRQNIKA---- 316 (537) T ss_pred ccccCCcceeEEEeccCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeee--cCCCcc--chhHHHHHhh---- Confidence 000 11357888888888888888877777777776666655544 332221 1122222222 Q ss_pred cccCcceec-CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccH------------HHHHHHHHH Q lcl|NC_011801. 235 ENAGRAVVL-DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSN------------ITMIRAFYQ 301 (386) Q Consensus 235 ~~~g~~~vl-~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~------------~~~~~~~~~ 301 (386) . +++.+ +.+.+++.+.....+.......+...+.|...-.+|-.--...++.+.. ....+..+. T Consensus 317 --~-~~i~v~~d~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~~~~~~~gn~SGvAlk~~~~~l~~ka~~ke~~f~ 393 (537) T protein:vir:78 317 --K-KMIGVNGDNAGMEIQTVSIPYEARKAKMDIDVENIYRSGMGFNSTAVGDGNVTNVVIKSRYTLLAMKARKMETSLR 393 (537) T ss_pred --c-CceeecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHhcCCCCCccccccCCcHHHHHHHHhhHHHHHHHHHHHHH Confidence 1 23333 2343444444433333344556666666655444332111111111110 011233445 Q ss_pred HHHHHHHHHHHHHHHHhhh-----hhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcC---------- Q lcl|NC_011801. 302 SSLSIYIKPIESELSQKLG-----TDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVF---------- 366 (386) Q Consensus 302 ~~l~P~~~~ie~~l~~~l~-----~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~---------- 366 (386) ..|+-.++.|...++.+-. ..+++.+..-+..|..+.++.+.++++.|+++...+.++++.-.-+ T Consensus 394 ~~l~~~~~~i~~~~~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~l~~~giiS~eT~l~~~p~vdd~e~ek~~~ee~ 473 (537) T protein:vir:78 394 KVLRWCADMVVSDIALRGLGEYDSNDICFEIEPHVLANELDIATTRKTEAETEALKIGNIMTVAPRIGDDETLKLIAEEL 473 (537) T ss_pred HHHHHHHHHHHHHHhhcCCcccccceeeEEeccCCCCCHHHHHHHHHHHHhcCcchHHHHHHhCCCCCCHHHHHHHHHHH Confidence 5555555555555443211 2345556666778999999999999999999877666544210000 Q ss_pred ----------------CCCCC----Cc---cc------cccccCCCCCC Q lcl|NC_011801. 367 ----------------PELDL----DE---GT------NLLDNTKNIND 386 (386) Q Consensus 367 ----------------p~~~~----~~---~~------~~~~~~~~~~~ 386 (386) +.++. +. +. .+..++....| T Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~ 522 (537) T protein:vir:78 474 DLDYNELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQPPVDPNQPVAD 522 (537) T ss_pred HhhhhhhhhhhhhhcccccCcCcchhhhcCCCCCCCCCCCCCccCCCCC Confidence 00000 00 00 00001111111 No 218 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=97.06 E-value=0.00019 Score=41.01 Aligned_cols=369 Identities=12% Similarity=0.075 Sum_probs=162.3 Q ss_pred Cchhhhhcc-ccccCCccchhhhhhcccccc------cCcccccHHHHhccHHHHHHHHHHHHhhccCceee--cchhHH Q lcl|NC_011801. 1 MAFLSNLFK-RQKMLSGSSPVWILNQGQPVS------IKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVT--NAQPIT 71 (386) Q Consensus 1 Mg~~~~l~~-~~~~~~~~~~~~~~~~~~~~~------~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~--~~~~~~ 71 (386) ..|+++... +.... ........+.-.. ......-...-+.++-....|+..++-+-+-|+.+ .+.... T Consensus 28 ~~li~~~~~~~~~r~---~~l~~YY~g~~~~i~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~G~p~~~~~~d~~~~ 104 (506) T protein:vir:94 28 MKFITHHFNYQRPRL---EMLDDYYQGYNLKILDKQSRRHEDGKADHRATHSFAKYIADFQTSYSVGNPINVKLPDDGSN 104 (506) T ss_pred HHHHHHHHHHHHHHH---HHHHHHhcCCCccccccccccccccCCcceeecchHHHHHHHhhhhhcccCceeecCcchHH Confidence 111111000 00000 0000000000000 00000000111234455667777777766667653 333333 Q ss_pred HHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCc--eeEEE--EeccC-ccc Q lcl|NC_011801. 72 DVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGK--DLTYT--VHFDD-SKR 146 (386) Q Consensus 72 ~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~--~~~~~--~~~~~-~~~ 146 (386) ..|.. -...-........+..+.+.+|.||+.+..+.+|.+ .+..++|..+.+..+.... ..... +.... .+. T Consensus 105 ~~l~~-~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ded~~~-~i~~~~p~~~~~v~dd~~~~~~~~~v~~~~~~~~~~~ 182 (506) T protein:vir:94 105 SGFDT-FNKANDVDAENYDLFLDMSRYGRAYEYVYRGEDNEE-HLAKLDPLDTFVIYSTDVDPKPIMAVRYHQIELVDDN 182 (506) T ss_pred HHHHH-HHhccCHhHHHHHHHHHHHhcCeEEEEEEecCCCee-EEEEEcccceEEEecCCCCCceEEEEEEEeeeeccCC Confidence 33322 111112334566788888899999999999888865 5677889888887664321 11100 00000 000 Q ss_pred ce------eEEEcccc-------------------------eeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 147 SG------DFLYDSSE-------------------------VIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLA 195 (386) Q Consensus 147 ~~------~~~~~~~~-------------------------vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~ 195 (386) .. ...+.... |++++ +.-.|.|.+......++....+..-. T Consensus 183 ~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~--------n~~~~~sd~e~~~~liDa~d~~~S~~ 254 (506) T protein:vir:94 183 QVSTINYVPETWTADTYTLYNPTPIMGKMQVDTTKPITTFPVVEFK--------NSNFRLGDFENVLPLIDLYDAAQSDT 254 (506) T ss_pred ceeEEEEEEEEEeCceEEEeccccCccceeccccccCCccceEEec--------CCCCCCCchhhhHHHHHHHHHHHHHH Confidence 00 00011111 12221 11246666766666666655554444 Q ss_pred HHHHhccCCCceEEeeCCC---------------------CCCHHHHHHHHHHHHHH--hcccccCcceecCCCceeeec Q lcl|NC_011801. 196 ISTLRHAIKPSIFIKVPNA---------------------TLGKEAKENTRQSFEEQ--TTGENAGRAVVLDQSADVETT 252 (386) Q Consensus 196 ~~~~~ng~~~~~~l~~~~~---------------------~~~~~~~~~~k~~~~~~--~~~~~~g~~~vl~~g~~~~~~ 252 (386) .+..+..+.|-.+++.... ....+..+ +...+... +.-...+.+...+.+.+++-+ T Consensus 255 ~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l 333 (506) T protein:vir:94 255 ANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLE-LIKEMKDANMLLLKSGMTVNGTQTSVDAKYI 333 (506) T ss_pred HHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhH-HHhhhhhcCeeeecccccccCccccccceee Confidence 4433333333333221000 00111111 11111111 000111122233344556656 Q ss_pred cCChhhHHHHHHHHHHHHHHHHHhCCCHHHhc-CCcCcccH------------HHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011801. 253 NISPNVTEFLQNVSFSQDQIAKAFGIPADYLS-GKQDAQSN------------ITMIRAFYQSSLSIYIKPIESELSQKL 319 (386) Q Consensus 253 ~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~-~~~~~~~~------------~~~~~~~~~~~l~P~~~~ie~~l~~~l 319 (386) ........+....+.....|...-++|..-.. ..++.+.. ....+..+...++..++.+...+...- T Consensus 334 ~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Aik~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~ 413 (506) T protein:vir:94 334 NKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENFASNSSGVAMQYKVLGTVELASTKRRMFERGLYARYQIISDIENSIH 413 (506) T ss_pred eecCCHHHHHHHHHHHHHHHHHHhCccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 66556666788889999999999999973221 11111100 012234555666666666555544321 Q ss_pred ------hhhhhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCC-------ccccccccCCCCCC Q lcl|NC_011801. 320 ------GTDVKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLD-------EGTNLLDNTKNIND 386 (386) Q Consensus 320 ------~~~~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~-------~~~~~~~~~~~~~~ 386 (386) ...+++.+..-+..|..+.++++.++ .|+++...++++++.-. +|..... +...........++ T Consensus 414 ~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~lp~v~-d~~~E~~ri~~E~~~~~~~~~~~~~~~~ 490 (506) T protein:vir:94 414 GDWTFDPQELTFTFRDNLPADNISQIKALVQA--GATLPQKYLYQQLPGVT-NPQDIVDMMKEQSANGDYSFDQNGVISN 490 (506) T ss_pred CccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCC-CHHHHHHHHHHHHHHHhhcchhhcCCCc Confidence 12345566666778899999998888 57899988888764211 0111000 00011111111111 No 219 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=96.28 E-value=0.00079 Score=37.60 Aligned_cols=368 Identities=9% Similarity=0.017 Sum_probs=159.1 Q ss_pred Cchhhh-------hccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceeec--c-hhH Q lcl|NC_011801. 1 MAFLSN-------LFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAGCRFVTN--A-QPI 70 (386) Q Consensus 1 Mg~~~~-------l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~~p~~~~--~-~~~ 70 (386) -.|+++ +.+-..-..+.+.+.................+..-+.+.-..-+|+..++-+-+-|+... + ... T Consensus 7 ~~~i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~p~~~~~~~~~~~ 86 (451) T protein:vir:10 7 RAIISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMFTYPVLFDIDNNKEL 86 (451) T ss_pred HHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhheecccceeecCCcHHH Confidence 111111 100000000111000000000000000000000111234445677777777777776543 2 223 Q ss_pred HHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCC-------ceEEEEEEcCcceEEeecCCC-ceeEEEE--- Q lcl|NC_011801. 71 TDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNG-------YPVRIEPVPNEKVTVALDDYG-KDLTYTV--- 139 (386) Q Consensus 71 ~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g-------~~~~l~~l~~~~v~~~~~~~~-~~~~~~~--- 139 (386) .+++..--+. ........+..+...+|.||..+.++... ....+..++|..+-+..+... ....+.+ T Consensus 87 ~~~~~~~~~n--~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i~p~~~~~vydd~~~~~~~~~ir~~ 164 (451) T protein:vir:10 87 NEKVTDVLGN--EFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVVNTEEIIPIYRNGIERELEAVIRYY 164 (451) T ss_pred HHHHHHHhcc--CHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEEEcccceEEEEcCCCCCceEEEEEEE Confidence 3444321111 24456667888899999999988877642 133467788888877665422 1111111 Q ss_pred e--ccCccc--c----eeEEEcccceeeecccccc------------Cc---------ccccccccHHHHHHHHHHHHHH Q lcl|NC_011801. 140 H--FDDSKR--S----GDFLYDSSEVIHFRCTVSG------------ES---------DTQYMGIPPIDSLLNEIEVQDL 190 (386) Q Consensus 140 ~--~~~~~~--~----~~~~~~~~~vih~~~~~~~------------~~---------~~~~~G~s~~~~~~~~i~~~~~ 190 (386) . ....+. + ....+....+.+++..... ++ .+...|.|.+..+...++.... T Consensus 165 ~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~~d~e~v~~liDa~~~ 244 (451) T protein:vir:10 165 IQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITVQHRFNSVPFVEFSNNIKKQSDLSKYKKILDLYDR 244 (451) T ss_pred EeeecccccccceEEEEEEEEeCCeEEEEEecccCccccccccccccCCCCeeeEEEeccCCCCCCchhhHHHHHHHHHH Confidence 0 000000 0 0011233333332210000 00 0122467778887777777776 Q ss_pred HHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecC-----CCceeeeccCChhhHHHHHHH Q lcl|NC_011801. 191 SSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLD-----QSADVETTNISPNVTEFLQNV 265 (386) Q Consensus 191 ~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~-----~g~~~~~~~~~~~d~~~~e~~ 265 (386) +..-..+.++..+.|-.+++.-+...+++.... +.. .+++.+. .|.++.-+........+.... T Consensus 245 ~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~----~~~-------~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 313 (451) T protein:vir:10 245 VMSGFANDLEDIQQIIYILENFGGEDTSEFLKE----LKR-------YKTIKTETDSEGDSGGLKTMQIEIPTEARKIIL 313 (451) T ss_pred HHHHHHHHHHHhccceeeeecCCcccchhhHHH----Hhh-------CCeEEecCcCCccCCcceEEeecCCHHHHHHHH Confidence 666666666666666555442121222332222 211 1223332 223333333333444467778 Q ss_pred HHHHHHHHHHhCCCHHHhcCCcCcccHHHHH--------------HHHHHHHHHHHHHHHHHHHHHhhhhhhhhcchhhh Q lcl|NC_011801. 266 SFSQDQIAKAFGIPADYLSGKQDAQSNITMI--------------RAFYQSSLSIYIKPIESELSQKLGTDVKLDIASAI 331 (386) Q Consensus 266 ~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~--------------~~~~~~~l~P~~~~ie~~l~~~l~~~~~fd~~~~l 331 (386) +...+.|...-++|.. .....++-+..+. +..+..+++-.++.+...++..-...+++.+..-+ T Consensus 314 ~~l~~~I~~~s~~p~~--~~~~~gn~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~d~~~i~i~f~~~~ 391 (451) T protein:vir:10 314 EILKKQIYESGQGLQQ--DTENFGNASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFLGVTDYKKIQQTYTRNM 391 (451) T ss_pred HHHHHHHHHHhCcccc--cccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccceeEEecCCC Confidence 8888999999999852 2111111111111 22333334443333333332211123455566667 Q ss_pred ccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCC----c----cccccccCCCCCC Q lcl|NC_011801. 332 DSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLD----E----GTNLLDNTKNIND 386 (386) Q Consensus 332 ~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~----~----~~~~~~~~~~~~~ 386 (386) ..|..+.++.+.++. |+++...+.++++.-. +|..... + ..+....-..-+| T Consensus 392 p~n~~e~~~~~~kl~--g~iS~et~~~~~p~v~-d~~~e~~~~~ee~~~~~~~~~~~~~~~~~ 451 (451) T protein:vir:10 392 MSNDLEDADIATKSV--GIIPTKIILRHHPWVD-DVEEAEKLYLEEKKIQASKVSDDYNNFTE 451 (451) T ss_pred CCCHHHHHHHHHHHh--ccCchHHHHHhCCCCC-CHHHHHHHHHHHHHHHHHHHHhhcCCCCC Confidence 778899999999885 7888877776654211 0111100 0 0001111111112 No 220 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=93.82 E-value=0.0063 Score=32.66 Aligned_cols=377 Identities=12% Similarity=0.105 Sum_probs=159.3 Q ss_pred Cchhhhhcc-ccccCCccchhhhhhccc-ccccCccccc---HHHHhccHHHHHHHHHHHHhhccCceeec--chhHHHH Q lcl|NC_011801. 1 MAFLSNLFK-RQKMLSGSSPVWILNQGQ-PVSIKPKAIT---SAIALKNSDVYAVISRVSSDIAGCRFVTN--AQPITDV 73 (386) Q Consensus 1 Mg~~~~l~~-~~~~~~~~~~~~~~~~~~-~~~~~~~~i~---~~~a~~~~~v~~~v~~ia~~ia~~p~~~~--~~~~~~~ 73 (386) +.++.+... +.... ........+. .......... ...-+.++-..-+|+..++-+-+-|+.+. +..+... T Consensus 21 ~~~i~~~~~~~~~r~---~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~n~~~~iv~~~~~~l~g~~~~~~~~d~~~~~~ 97 (489) T protein:vir:99 21 KNYISRFKAEQLERL---KELKRYYLGDNNIKYRPAKTDKYAADNRIASDFAKYITVFEQGYMLGVPVEYKNENKDLQAA 97 (489) T ss_pred HHHHHHHHHHHHHHH---HHHHHHhcccCccccccccccccCCcceeecchHHHHHHHHhhhhccCCceeecCChhHHHH Confidence 222222110 00000 0000000000 0000000000 00001234445677777776666666543 3333333 Q ss_pred HhccCcccCCHHHHHHHHHHHHHHhCCeEEEEee----cCCCceEEEEEEcCcceEEeecCCC-c-eeEEE--EeccCcc Q lcl|NC_011801. 74 LNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDR----DTNGYPVRIEPVPNEKVTVALDDYG-K-DLTYT--VHFDDSK 145 (386) Q Consensus 74 l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~----~~~g~~~~l~~l~~~~v~~~~~~~~-~-~~~~~--~~~~~~~ 145 (386) |..--+ .-....+...+..+++.+|.||..+.. +..|. ..+..++|..+.+..+... . ..... +...... T Consensus 98 l~~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~-~~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~ 175 (489) T protein:vir:99 98 IDLMSV-RNNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKTE-VKLYQLPAEQTFVIYDDTYQRNSLMAVHFYDIDYGS 175 (489) T ss_pred HHHHHh-hcChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCcc-eEEEEEcccceEEEEcCCCCCceEEEEEEEEEecCC Confidence 322111 112335667888889999999977653 23333 4678888998887776433 1 22111 1111100 Q ss_pred ---cceeEEEcccceeeeccccc-----------cCc---------ccccccccHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_011801. 146 ---RSGDFLYDSSEVIHFRCTVS-----------GES---------DTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHA 202 (386) Q Consensus 146 ---~~~~~~~~~~~vih~~~~~~-----------~~~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng 202 (386) ......+.+..+.+++.... +++ .+...|.|.+..+...++....+.....+..... T Consensus 176 ~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~ 255 (489) T protein:vir:99 176 GKRKQIIKAYTSDTIYTYEDYNLETKGMRLKDYEGHFFKGVPVNEYANNEERTGAYESVLDNIDAYDLSQSELANFQQDS 255 (489) T ss_pred CceEEEEEEEeCCcEEEEEecCCCcccceecccccccCCceeEEEeecCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHh Confidence 00111233333333221100 000 0112477777777777766666555555544555 Q ss_pred CCCceEEeeCCCCCCHHHHHHHHHHHHHHhc-------ccccCcceecCCC-------ceeeeccCChhhHHHHHHHHHH Q lcl|NC_011801. 203 IKPSIFIKVPNATLGKEAKENTRQSFEEQTT-------GENAGRAVVLDQS-------ADVETTNISPNVTEFLQNVSFS 268 (386) Q Consensus 203 ~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~-------~~~~g~~~vl~~g-------~~~~~~~~~~~d~~~~e~~~~~ 268 (386) +.|-.+++ +.....+........+..... ....++++.++.+ .+++.+.....+..+....+.. T Consensus 256 ~~~~l~i~--g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l 333 (489) T protein:vir:99 256 VNALLVIA--GNAYTGADENDYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRL 333 (489) T ss_pred hhhhhhhc--cCCcccccchhhhhhcccccccccccccccccceeeeeccccCccccccceeeeeecCChHHHHHHHHHH Confidence 55544443 222222222222222221111 1112234444332 2333333333444456677888 Q ss_pred HHHHHHHhCCCHHHh-cCCcCcccH------------HHHHHHHHHHHHHHHHHHHHHHHHHhhh--------hhhhhcc Q lcl|NC_011801. 269 QDQIAKAFGIPADYL-SGKQDAQSN------------ITMIRAFYQSSLSIYIKPIESELSQKLG--------TDVKLDI 327 (386) Q Consensus 269 ~~~Ia~~~gvp~~~l-~~~~~~~~~------------~~~~~~~~~~~l~P~~~~ie~~l~~~l~--------~~~~fd~ 327 (386) .+.|...-++|..-. +..++.+-. .+..+..+...|.-+++.+...+...-+ ..+++.+ T Consensus 334 ~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f 413 (489) T protein:vir:99 334 VADILRFTFTPDTQDMKFSGVQSGESMKYKLMASDNYREKQERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVF 413 (489) T ss_pred HHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEe Confidence 889999999885322 111211100 0111234445555555555544432211 1245555 Q ss_pred hhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCC-cCCCCCCC----c---cccccc--cCCCCCC Q lcl|NC_011801. 328 ASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRG-VFPELDLD----E---GTNLLD--NTKNIND 386 (386) Q Consensus 328 ~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p-~~p~~~~~----~---~~~~~~--~~~~~~~ 386 (386) ..-+..|..+.++++.+++ |+++...+.++++.-. ..+...+. | -....+ ..++.+| T Consensus 414 ~~~~p~d~~~~~~~~~kl~--giis~et~~~~l~~v~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 480 (489) T protein:vir:99 414 TPNLPQNDNEIVTAAQNLY--GIVSDQTIFEILNTVTGVDAEAELKRLKEEADKKQSLPEPRLVGDASG 480 (489) T ss_pred CCCCCcCHHHHHHHHHHHh--ccCCHHHHHHhcCCCCchhHHHHHHHHHHHHHHHhccccccccCCCCC Confidence 6666778899999988885 7888887777654210 01111000 0 000001 1111111 No 221 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=91.34 E-value=0.016 Score=30.38 Aligned_cols=372 Identities=13% Similarity=0.046 Sum_probs=152.5 Q ss_pred Cchhhhh-------ccccc----cCCccchhhhhhcccccc-cCc---ccccHH---HHhc----cHHHHHHHHHHHHhh Q lcl|NC_011801. 1 MAFLSNL-------FKRQK----MLSGSSPVWILNQGQPVS-IKP---KAITSA---IALK----NSDVYAVISRVSSDI 58 (386) Q Consensus 1 Mg~~~~l-------~~~~~----~~~~~~~~~~~~~~~~~~-~~~---~~i~~~---~a~~----~~~v~~~v~~ia~~i 58 (386) |.-.+.. ..++. ...++. .+......+.+ ... ..-+.+ .++. .+.+.+.++.+.+.+ T Consensus 1 m~~V~~~hp~y~~~~~~W~~ird~~~G~~-~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~~l~G~v 79 (501) T protein:vir:95 1 MPNVSFIRPELGKLLPLYYLIRDAIAGEP-TVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLFGLVGQV 79 (501) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHhcChH-HHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHHHHhhhh Confidence 5522110 00000 000000 00000000100 000 000111 1122 233344444444444 Q ss_pred ccCceeec-chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCc----e----------EEEEEEcCcc Q lcl|NC_011801. 59 AGCRFVTN-AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGY----P----------VRIEPVPNEK 123 (386) Q Consensus 59 a~~p~~~~-~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~----~----------~~l~~l~~~~ 123 (386) -.-|..+. ...+..++..-=-...+-.+|.+.++...+.+|-|++++.....+. . -.+..+.|.. T Consensus 80 f~k~p~~~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a~~~~~~~rPy~~~~~~~~ 159 (501) T protein:vir:95 80 FMRDPVVKVPALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIADLEAGRIRPTLYVYSPTE 159 (501) T ss_pred hcCCcceeCcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHHHHHhccCCcEEEEecHhh Confidence 43343333 2335555544334666889999999999999999999997643321 0 1133333322 Q ss_pred e-----------------EE-----eecCC-C---------------ceeEEE-EeccCccc--------ce----eEEE Q lcl|NC_011801. 124 V-----------------TV-----ALDDY-G---------------KDLTYT-VHFDDSKR--------SG----DFLY 152 (386) Q Consensus 124 v-----------------~~-----~~~~~-~---------------~~~~~~-~~~~~~~~--------~~----~~~~ 152 (386) | .+ ..++. + ....+. +.....+. +. .... T Consensus 160 IinW~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~ 239 (501) T protein:vir:95 160 IINWRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYYVHEIWREPQPTKADGSKIPKGNYQQYVVYK 239 (501) T ss_pred hcCcceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceEEEEEEEecCCcccCcceecCCcccccceee Confidence 2 00 01110 0 000011 11100000 00 0000 Q ss_pred c------ccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHH Q lcl|NC_011801. 153 D------SSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQ 226 (386) Q Consensus 153 ~------~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~ 226 (386) + .-..|=|-+. .....+...|.+|+..++..--.....+.-....+...+.|-.+++-.+ ++..+.... T Consensus 240 ~~~~g~~~l~~IPfv~~-~~~~~~~~~~~pPLl~lA~lni~hy~~ssd~~~~l~~~~~P~l~i~G~~----~~~~~~~~~ 314 (501) T protein:vir:95 240 PTDAQGKRLTEIPFMFI-GSENNDSNPDNPNFYDLASLNMAHYRNSADYEESCYIVGQPTPVLIGLT----EEWVTNVLK 314 (501) T ss_pred eeccCCCcCCeeeEEEE-ecCCCCCCCCccchHHHHHHHHHHHhhhhHHHHHHHHcccceeeeeCCc----ccccccCCC Confidence 0 1111112211 1223344567788777664433333332323344556677777765322 221111100 Q ss_pred HHHHHhcccccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHH--HHHHHHHHHH Q lcl|NC_011801. 227 SFEEQTTGENAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNIT--MIRAFYQSSL 304 (386) Q Consensus 227 ~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~--~~~~~~~~~l 304 (386) ....-|++ ..+.++.|.++.-+..+++-+. .+.++...+++..+ | ..++.........++ ....--...| T Consensus 315 --~~i~~G~~--~~~~lP~~~~~~~ie~~~~~i~-~~~l~~l~~~m~~~-G--a~ll~~~~~~~Ta~~~~~~~~~~~S~L 386 (501) T protein:vir:95 315 --GSVNFGSR--GGIPLPVGADAKLLQASENTML-KEAMDTKERQMVAL-G--AKLVEQKEVQRTATEAELEAASEGSTL 386 (501) T ss_pred --Cceeeccc--ccccCCCCCceeEEecChhhHH-HHHHHHHHHHHHHH-H--HhhccCCccchhHHHHHHHHHHHhHHH Confidence 01222333 3567777766666655554443 34444444444332 3 334432211111222 1223345668 Q ss_pred HHHHHHHHHHHHHhhhhh------------hhhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCC Q lcl|NC_011801. 305 SIYIKPIESELSQKLGTD------------VKLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLD 372 (386) Q Consensus 305 ~P~~~~ie~~l~~~l~~~------------~~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~ 372 (386) .-++..++++|++.|-.. |+++.+-.........++++.+++..|.++..+.++.|....+++....+ T Consensus 387 ~~~a~~le~al~~~l~~~a~w~g~~~~~~~v~i~~df~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~~v~~~~~~~ 466 (501) T protein:vir:95 387 SSATKNVSAAFEWALKWAARWVGQADSGVKFELNTDFDIARMTPDERRSLVEEWQKGAITFEEMRTGLRKAGVATEDDSK 466 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHcCCCCCceEEEEecccccccCCHHHHHHHHHHHhCCCCcHHHHHHHHHhCCCCChhHHH Confidence 888888898888766321 22222211112223345666778889988888888888766655422110 Q ss_pred ----------c------cccccccCCCCCC Q lcl|NC_011801. 373 ----------E------GTNLLDNTKNIND 386 (386) Q Consensus 373 ----------~------~~~~~~~~~~~~~ 386 (386) + .++.-..+.++.| T Consensus 467 e~e~i~~~~~~~~~~~~~~~~~~~~~gg~~ 496 (501) T protein:vir:95 467 AKEKIAKDTAEAMALATPANVPGDGSGGDN 496 (501) T ss_pred HHHHHHhhhcCcccccccCCCCCCCccccc Confidence 0 0011111111111 No 222 >protein:vir:105154 Length: 525 # NCBI annotation: conserved phage-related protein # Family: family:all:6660 # MgeID: mge:1466 # MgeName: C-St # Cross-refs: genbank:acc:YP_398597;genbank:gi:80159853;genbank:GeneID:3772992 Probab=87.28 E-value=0.04 Score=28.26 Aligned_cols=373 Identities=13% Similarity=0.148 Sum_probs=145.3 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHh--ccHHHHHHHHHHHHhhccCce---eec--------c Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIAL--KNSDVYAVISRVSSDIAGCRF---VTN--------A 67 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~--~~~~v~~~v~~ia~~ia~~p~---~~~--------~ 67 (386) =||...+|..-............-+..+...-..-++..++. ....|+...++| -++|- ++. + T Consensus 47 ~gfv~~~~~ng~i~~v~~~~l~~~f~npd~~~~~i~~l~~y~yi~~~~v~ql~~li----~~lp~l~y~i~~~~~~k~~~ 122 (525) T protein:vir:10 47 DGFVMDLCNNGKIKTVNLDTLQLWFNNPDKYINNIVNLLTYYYIIDGNVFQLYDLI----FSLPPLDYQIKVLKRDKDYK 122 (525) T ss_pred HHHHHHhhcCCceeeeeHHHHHhhhcChHHHHHHHHHHHHHhhhhcchHHHHHHHH----HhcCCcceeehhhhhccchh Confidence 344444442222222111111111111111100001111111 123344544444 33442 211 1 Q ss_pred hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeE--------------------EEEeecCCCceEEEEEEcCcceE-- Q lcl|NC_011801. 68 QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAF--------------------AIIDRDTNGYPVRIEPVPNEKVT-- 125 (386) Q Consensus 68 ~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~--------------------~~~~~~~~g~~~~l~~l~~~~v~-- 125 (386) .++ .+++..-..-.--.++-+.+..++...|.-. +++-....|..+.. ++-..+. T Consensus 123 ~~~-s~~n~~l~k~i~hk~ltrdll~q~a~~gtlig~wlg~~~~py~~vf~~~kyvfp~~r~~g~~v~v--id~~~f~~~ 199 (525) T protein:vir:10 123 EDL-STINLYLEKKIQHKQLTRDLLVQLAHSGTLIGTWLGSKREPYFNVFNNLKYVFPYGRAKGKMVAV--IDLQWFDEM 199 (525) T ss_pred hHH-HHHHHHHHHhHHHHHHHHHHHHHhhccCceeEeeecCCCCcchhhhhhhhhhccccccCCceEEE--EehHHhhhh Confidence 111 1111111111112223333333443334311 11111112221111 1111110 Q ss_pred -------------EeecCCCceeEEEEeccCcccceeEEEcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHH Q lcl|NC_011801. 126 -------------VALDDYGKDLTYTVHFDDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSS 192 (386) Q Consensus 126 -------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~ 192 (386) +........-+-.|...+...-.-..+|.+.++|+|+.... .+..-|.|-...+...+....... T Consensus 200 ~~~~r~~~~~~lsp~i~~~~y~~~~~~~~~~~~~~r~i~LP~e~t~~lr~~tl~--rnqrlG~s~vtp~l~dI~hk~klr 277 (525) T protein:vir:10 200 SELERKLTFENLSPLITENKYKKWKEYNGENEDALRYIMLPISKTLVARIHTLS--RNQRLGIPYGTQTLFDIQHKQKLR 277 (525) T ss_pred hHHHHHHHHHhhchhhhhhhhhHHhhcccccchhheeeecccceeEEeeecccc--cCcccCcchhhhHHHHHHHHHHHH Confidence 00000000001111112222222346888999999976533 344458888888888777766665 Q ss_pred HHHHHHHhccCCCceEEeeCCC-----CCCHHHHHH----HHHHHHHHhcccccCccee--cCCCceee--eccCChhhH Q lcl|NC_011801. 193 KLAISTLRHAIKPSIFIKVPNA-----TLGKEAKEN----TRQSFEEQTTGENAGRAVV--LDQSADVE--TTNISPNVT 259 (386) Q Consensus 193 ~~~~~~~~ng~~~~~~l~~~~~-----~~~~~~~~~----~k~~~~~~~~~~~~g~~~v--l~~g~~~~--~~~~~~~d~ 259 (386) ....+..+.=..|-.++++.+. .+-+...++ .|.+++...... +| +.+ ++.=++++ .+.....-. T Consensus 278 d~EqsIA~kii~a~avLk~gg~~gn~mk~p~~~kqkil~gVk~aleK~~kdK-~G-i~vi~~Pdfa~~efp~ik~~~~gl 355 (525) T protein:vir:10 278 DLEQSIADKIIKAMAVLKFRGKDDNDSKVKESAKRKVLAGVKRALEKGVKDK-NG-IACIAMPDFATFEFPEIKNGDKTL 355 (525) T ss_pred HHHHHHHHHhhhhheeeeeccccCccccCchHHHHHHHHHHHHHHhcccccc-cC-eEEEeccceeecccccccCcccCC Confidence 5555555444556666766431 122323333 344444322222 23 333 23222222 222111111 Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHhcCCc-CcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhh--------hhhhhcchhh Q lcl|NC_011801. 260 EFLQNVSFSQDQIAKAFGIPADYLSGKQ-DAQSNITMIRAFYQSSLSIYIKPIESELSQKLG--------TDVKLDIASA 330 (386) Q Consensus 260 ~~~e~~~~~~~~Ia~~~gvp~~~l~~~~-~~~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~--------~~~~fd~~~~ 330 (386) + -.-.+...++|..|+|++..++++++ +++.+.-...-||.. +.-+++.||+..+..+- ..+-|+.+.- T Consensus 356 D-g~K~d~I~~DI~~A~GlS~sL~nGdggNyAtaslnld~fykk-igVm~e~Iee~y~kL~d~Vl~~~k~~nyifnydkd 433 (525) T protein:vir:10 356 D-PKKYDSIDNDITNATGISQVLTNGTKGNYASAKLNLDVFYKK-IGVMLEIIEEIYNQLIDIILGEEKGCNYIFQYNKD 433 (525) T ss_pred C-chhhhhhhhhhhhhhccceeeecCCCCceeeeeeeHHHHHHH-HHHHHHHHHHHHHHHHhhhcCcccCcceEEecCCC Confidence 1 01345566789999999999997553 333333334556765 66778888866655431 1223454443 Q ss_pred hccCHHHHHHHHHHHHhCCCc----------CHHHH-----HHH----hc---cCCc----CCCCCCCccccccccCCCC Q lcl|NC_011801. 331 IDSDNSELINNVQKLASAGVL----------APIQA-----QKL----LK---NRGV----FPELDLDEGTNLLDNTKNI 384 (386) Q Consensus 331 l~~d~~~~~~~~~~~~~~g~~----------t~nE~-----R~~----lg---~~p~----~p~~~~~~~~~~~~~~~~~ 384 (386) ..-+.+++.+.+-++...||. +-++- ||. +. ..|+ ..+-++.+-+.|......+ T Consensus 434 ~pi~~kkk~d~LIkL~d~g~s~k~vldl~gis~e~y~E~s~yEtE~lkl~EKi~pp~~~~v~SGk~~n~iG~P~~dd~~~ 513 (525) T protein:vir:10 434 TPIEREKKLDTLIKLEAQGYSAKYVLDILGISSEEYFEESIYEIEKLKLREKIMPPLNTNVLSGKDGNDIGSPKLDDSDS 513 (525) T ss_pred chhhhhhhhhhhhhhhccchhhhhhhhhhccCcchHHHHHHHHHHHHHHhhhccccccceeeeccccccccCCccCCCcc Confidence 444566666666666666652 11111 111 11 1111 0111222323333222222 Q ss_pred CC Q lcl|NC_011801. 385 ND 386 (386) Q Consensus 385 ~~ 386 (386) +| T Consensus 514 ~d 515 (525) T protein:vir:10 514 SD 515 (525) T ss_pred hh Confidence 23 No 223 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=80.63 E-value=0.093 Score=26.24 Aligned_cols=364 Identities=13% Similarity=0.084 Sum_probs=158.0 Q ss_pred Cchhhh------hcccc----ccCCccchhhhhhcc-cccccCcccccH-HHHhc----cHHHHHHHHHHHHhhccCcee Q lcl|NC_011801. 1 MAFLSN------LFKRQ----KMLSGSSPVWILNQG-QPVSIKPKAITS-AIALK----NSDVYAVISRVSSDIAGCRFV 64 (386) Q Consensus 1 Mg~~~~------l~~~~----~~~~~~~~~~~~~~~-~~~~~~~~~i~~-~~a~~----~~~v~~~v~~ia~~ia~~p~~ 64 (386) |+.-.+ ....+ ....++ ..+..... ...-..++.-.. +..++ .+.+...++.+++.+-+-|.. T Consensus 1 m~V~~~hp~y~a~~~~W~~~rd~~~G~-~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~~G~vf~k~p~ 79 (452) T protein:vir:94 1 MPIETKHPEYLAYENDWIDCRVASLGQ-REVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSALSGMVLDQPPV 79 (452) T ss_pred CCCCCcCHHHHHHHHHHHHHHHHhcCh-HHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHHhchhhcCCce Confidence 664221 00000 000111 01111000 011111111111 11222 344556666666666555554 Q ss_pred ec-chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceE------------------ Q lcl|NC_011801. 65 TN-AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVT------------------ 125 (386) Q Consensus 65 ~~-~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~------------------ 125 (386) +. ...+.++..+ ....+-.+|.+.++...+.+|-|++.+.....|.---+..+.|..|. T Consensus 80 ~~~p~~l~~~~~D--~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy~~~~~~~~Ii~W~~~~~g~l~~v~lre~ 157 (452) T protein:vir:94 80 ITHPDAMSKYFED--QSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPYISVYTTENILNWEEDEDGRLLMVVLREF 157 (452) T ss_pred ecccHHHHHHHhc--ccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceEEEEechhhhcCccccccCCeeEEEEEEE Confidence 43 3334444322 56778899999999999999999999988766642233333333321 Q ss_pred -EeecC-CC----ceeEEE----------Ee-ccCcccce-----eEEEccc----ceeeeccccccCcccccccccHHH Q lcl|NC_011801. 126 -VALDD-YG----KDLTYT----------VH-FDDSKRSG-----DFLYDSS----EVIHFRCTVSGESDTQYMGIPPID 179 (386) Q Consensus 126 -~~~~~-~~----~~~~~~----------~~-~~~~~~~~-----~~~~~~~----~vih~~~~~~~~~~~~~~G~s~~~ 179 (386) ...+. +. ....|. +. +.....+. ....... ..|=|-+. .....+...|.||+. T Consensus 158 ~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~IP~v~~-~~~~~~~~~~~pPLl 236 (452) T protein:vir:94 158 YTVRDTADRYVQNIRVRYRCLELVDGLLQITVHETQDGKVWELAKTSTIQNVGVTMDYIPFFCI-TPSGLSMTPAKPPMI 236 (452) T ss_pred EEEecCCCcccceeEEEEEEEEEeCCeEEEEEEEccCCceeeeccceeecCCCcccceeEEEEE-cCCCCCCCCCccchH Confidence 01111 00 000111 00 00000000 0000000 11212111 122334567889988 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceecCC-CceeeeccCChhh Q lcl|NC_011801. 180 SLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVLDQ-SADVETTNISPNV 258 (386) Q Consensus 180 ~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl~~-g~~~~~~~~~~~d 258 (386) .++..-......+.-....+...+.|-.++.--+ +.. ...-|.+ .++.+++ |.++.-+..+..- T Consensus 237 ~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~g~~---~~~----------~i~iG~~--~~~~lpe~~~~~~yie~~g~~ 301 (452) T protein:vir:94 237 DIVDINYSHYRTSADLEHGRHFTGLPTPWITGAE---SQS----------TMHIGST--KAWVIPEVAAKVGFLEFTGQG 301 (452) T ss_pred HHHHHHHHHhcchhHHHHHHHHcccceeEeecCc---CCC----------ceEeccc--ccccCCCCCCcceEEccCchh Confidence 7666544444443334455566677866665221 111 1223433 3677774 6554444443332 Q ss_pred HH-HHHHHHHHHHHHHHHhCCCHHHhcCCcCcc-cHHHH--HHHHHHHHHHHHHHHHHHHHHHhhhh-----------hh Q lcl|NC_011801. 259 TE-FLQNVSFSQDQIAKAFGIPADYLSGKQDAQ-SNITM--IRAFYQSSLSIYIKPIESELSQKLGT-----------DV 323 (386) Q Consensus 259 ~~-~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~-~~~~~--~~~~~~~~l~P~~~~ie~~l~~~l~~-----------~~ 323 (386) .. ..+..+...++. ...| ..++....... ..+.. ..+-.+..|.-++..+++++++.|-. .| T Consensus 302 i~~~~~~l~~le~~m-~~~G--a~ll~~~~~~~~s~ea~~~~~~~~~s~L~~~a~~~e~al~~~l~~~a~w~g~~~~~~v 378 (452) T protein:vir:94 302 LQSLEKALSEKQAQL-ASLS--ARLIDNSTRGSEATETVKLRYMSETASLKSVTRAVEALLNKAYSCIMDMESMGGTLNI 378 (452) T ss_pred HHHHHHHHHHHHHHH-HHHH--HHhhccCCCcchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCceEE Confidence 21 122222222222 1112 13333222121 22222 22233577888888889888876521 12 Q ss_pred hhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCccc-c--c-cccCCCCCC Q lcl|NC_011801. 324 KLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGT-N--L-LDNTKNIND 386 (386) Q Consensus 324 ~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~-~--~-~~~~~~~~~ 386 (386) +++.+-..........+++-+++..|.++....++.|.+.++++.+...+.. . + -.++..|+. T Consensus 379 ~~n~dF~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~gvl~~~~e~~~i~~E~~~~~~~~~~~~ 445 (452) T protein:vir:94 379 KLNSAFLDSKLTAAELKAWVEAYLSGGISKEIYIHALKVGKVLPPPGESMGVIPDPPAPEPSPSNTP 445 (452) T ss_pred EeccccccccCCHHHHHHHHHHHhcCCCcHHHHHHHHHhCCCCCCccCHHHHHHHhhccCcccCCCC Confidence 2233322222223455666678999999999999998766655333221110 0 0 011111221 No 224 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=77.56 E-value=0.12 Score=25.56 Aligned_cols=364 Identities=13% Similarity=0.071 Sum_probs=146.7 Q ss_pred hhhhhccccccCCccchh-------hhh---hc----------ccc-cccCcc---cccHH--HHhccHHHHHHHHHHHH Q lcl|NC_011801. 3 FLSNLFKRQKMLSGSSPV-------WIL---NQ----------GQP-VSIKPK---AITSA--IALKNSDVYAVISRVSS 56 (386) Q Consensus 3 ~~~~l~~~~~~~~~~~~~-------~~~---~~----------~~~-~~~~~~---~i~~~--~a~~~~~v~~~v~~ia~ 56 (386) .-++-. +.....++. |.. .. ..+ ...... .+..+ .|.-.+.+...++.++. T Consensus 1 m~~~~~---~~v~~~h~~y~a~~~~W~~ird~~~G~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G 77 (513) T protein:vir:97 1 MADKDP---KSPATTSGAYDQMLPRWHVIETLLGGTEAMREAGETYLPRHQEETDKGYQERLASAVLLNMVEQTLDTLSG 77 (513) T ss_pred CCCCCC---CCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhhcccCCCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhh Confidence 111100 000001111 000 00 000 011111 11111 12223445566777666 Q ss_pred hhccCceeecc---hhHHH-HHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCce-----------------EE Q lcl|NC_011801. 57 DIAGCRFVTNA---QPITD-VLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYP-----------------VR 115 (386) Q Consensus 57 ~ia~~p~~~~~---~~~~~-~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~-----------------~~ 115 (386) .+.+-|..+.. ..+.. ++..-=-...+-.+|.+.++...+.+|-|++++.....+.+ -- T Consensus 78 ~vf~k~p~~~~~~p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~~~~T~Ade~~~~~rPy 157 (513) T protein:vir:97 78 KPFSEPIKLNEDVPKAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPREDGQPRTLADDRREGLRPY 157 (513) T ss_pred hhhhcCcccCcCchHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccchhHHhHHHHHhhccCce Confidence 66666665542 23444 33333346677899999999999999999999976543211 11 Q ss_pred EEEEcCcceE----------------------EeecCCCceeEEEEe-----------ccCcc---cceeEEEccc---- Q lcl|NC_011801. 116 IEPVPNEKVT----------------------VALDDYGKDLTYTVH-----------FDDSK---RSGDFLYDSS---- 155 (386) Q Consensus 116 l~~l~~~~v~----------------------~~~~~~~~~~~~~~~-----------~~~~~---~~~~~~~~~~---- 155 (386) +..+.|..|. ...|+.+......+. ....+ .......... T Consensus 158 ~~~~~~e~IinW~~~~v~G~~~L~~v~l~E~~~~~Dgf~~~~~~q~rvL~~g~~~v~r~~~~~~~~~~e~~~~~~g~~~l 237 (513) T protein:vir:97 158 WVMIKPECLLFARSEVINGVEVLQHVRIIEHYMEQDGFAEVCKRRIRVLEPGLVQLWEPVKKSNAQKEEWALADEWATGL 237 (513) T ss_pred EEEecHhhhcCcceeccCcceeeeeEEEEEEEeecCCCcceEEEEEEEEeCceEEEEEeecCCCccccceEEecCCCCcC Confidence 2333333320 011211111110000 00000 0000111100 Q ss_pred ceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhccc Q lcl|NC_011801. 156 EVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGE 235 (386) Q Consensus 156 ~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~ 235 (386) ..|-|-+. .....+...|.||+..++..-........-....+...+.|-.++..-+ ++..+ ...-|. T Consensus 238 ~~IP~v~~-~~~~~~~~~~~pPLl~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~G~~----~~~~~-------~i~iG~ 305 (513) T protein:vir:97 238 NYVPLVTF-YADRQGFMMGKPPLLDLAHLNVAHWQSASDQRHILTVSRFPILACSGAS----GEDSD-------PVVVGP 305 (513) T ss_pred CceeEEEE-ecCCCCCCCCccchHHHHHHHHHHHhhhhhHHHHHHhcccceeeeecCC----cCCCC-------ceEeec Confidence 01111111 1123445678888876665544444444444444566677877775221 11111 122344 Q ss_pred ccCcceecCC-CceeeeccCChhhHH-HHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHH--HHHHHHHHHHHHHHH Q lcl|NC_011801. 236 NAGRAVVLDQ-SADVETTNISPNVTE-FLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMI--RAFYQSSLSIYIKPI 311 (386) Q Consensus 236 ~~g~~~vl~~-g~~~~~~~~~~~d~~-~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~--~~~~~~~l~P~~~~i 311 (386) + .++.+++ |.++.-+..+.+.++ ..+..+...+++ ...| ..+|.........++.. ..--...|.-++..+ T Consensus 306 ~--~~~~lpe~~~~~~yie~~g~~i~~~~~~l~~le~qm-~~~G--a~ll~~~~~~~Ta~a~~~~~~~~~S~L~~~a~~l 380 (513) T protein:vir:97 306 N--KVLYNPDPAGRFYYVEHTGQAIAAGRTDLKDLEEQM-AGYG--AEFLKRKTGGQTATARALDSAEATSDLSAMTGLF 380 (513) T ss_pred c--ccccCCCCCCcceeeccCchhHHHHHHHHHHHHHHH-HHHH--HHhhccCCccccHHHHHHHHHHHHHHHHHHHHHH Confidence 3 3566764 555544444333222 123333333333 2233 23343222112222222 233445677788888 Q ss_pred HHHHHHhhhhh----------hhhcchh--hhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCC-CCC-------- Q lcl|NC_011801. 312 ESELSQKLGTD----------VKLDIAS--AIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFP-ELD-------- 370 (386) Q Consensus 312 e~~l~~~l~~~----------~~fd~~~--~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p-~~~-------- 370 (386) ++++++.|-.. .+|.+.. ....-....++++-+++..|.++....++.+.+.++++ ..+ T Consensus 381 e~al~~~l~~~a~wlg~~~~~~~v~in~dF~~~~~~~~~~~al~~a~~~G~is~~t~~~~L~r~gvl~~d~d~~~~~e~~ 460 (513) T protein:vir:97 381 EDALAQALDITADWLRLGPNGGTVELVKDYDLEEMDAPGLQALQVAREKRDISRKTYLNGLRLRGVLPEDFDEDEDWEEL 460 (513) T ss_pred HHHHHHHHHHHHHHhCCCCCccEEEeccccCcccCCHHHHHHHHHHHhCCCCCHHHHHHHHHhccCCCccCCHHHHHHHH Confidence 88888765321 1222211 11111123345555667777777666666554433332 111 Q ss_pred --------CCcccc--ccc---cCCCC---CC Q lcl|NC_011801. 371 --------LDEGTN--LLD---NTKNI---ND 386 (386) Q Consensus 371 --------~~~~~~--~~~---~~~~~---~~ 386 (386) ++.+.+ ++. +.+.. .| T Consensus 461 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 492 (513) T protein:vir:97 461 MEEISEAMGRAGLDLDPAQKNPPEGGEGEGEG 492 (513) T ss_pred HHhhhhccCCCCccccccCCCCCCCCCCCCCC Confidence 000000 111 00001 01 No 225 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=76.69 E-value=0.13 Score=25.39 Aligned_cols=371 Identities=14% Similarity=0.080 Sum_probs=153.9 Q ss_pred hhhhhccccccCCccchh-------hh---hhc-c----------cc--cccCccc-ccHHH--HhccHHHHHHHHHHHH Q lcl|NC_011801. 3 FLSNLFKRQKMLSGSSPV-------WI---LNQ-G----------QP--VSIKPKA-ITSAI--ALKNSDVYAVISRVSS 56 (386) Q Consensus 3 ~~~~l~~~~~~~~~~~~~-------~~---~~~-~----------~~--~~~~~~~-i~~~~--a~~~~~v~~~v~~ia~ 56 (386) .+..--++.+.. ..+|. |. ... | .+ .....+. +.... |.=.+.+...++.+++ T Consensus 1 ~~~~~~~~~~V~-~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G 79 (489) T protein:vir:78 1 MLTENGQGSGVK-TKHREWLHYAPKWQKVRHALAGELVSYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSGMVG 79 (489) T ss_pred CccCCCccCCCC-ccCHHHHHHHHHHHHHHHHhcCcccccccCCCCCCCCCCCChHHHHHHHhccccCChHHHHHHHHhc Confidence 111000111110 01110 00 000 0 00 0000000 11000 1112344556666555 Q ss_pred hhccCceeec-chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCc-----------eEEEEEEcCcce Q lcl|NC_011801. 57 DIAGCRFVTN-AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGY-----------PVRIEPVPNEKV 124 (386) Q Consensus 57 ~ia~~p~~~~-~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~-----------~~~l~~l~~~~v 124 (386) .+-.-|..+. ...+..++..-=-...+-.+|.+.++...+.+|-|++++.....|. ---+..+.|..| T Consensus 80 ~vfrk~p~~~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~ade~~~~~rPy~~~~~~~~I 159 (489) T protein:vir:78 80 SVMRKEPEINIPKELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGAATAAEQNAGLLNPTIAFYTTENI 159 (489) T ss_pred hhhcCCcceeccHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCCcCHHHHHHhcCCcEEEEechhhh Confidence 5555544443 3445566644445677789999999999999999999998765542 112333333332 Q ss_pred ---E--------------Eee-----cC-CC--ce--eEEEE--------------eccCcccce--eEEE-c-----cc Q lcl|NC_011801. 125 ---T--------------VAL-----DD-YG--KD--LTYTV--------------HFDDSKRSG--DFLY-D-----SS 155 (386) Q Consensus 125 ---~--------------~~~-----~~-~~--~~--~~~~~--------------~~~~~~~~~--~~~~-~-----~~ 155 (386) + +.. +. +. .. ..|.+ .....+... ...+ + +- T Consensus 160 inW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~~~~g~~~~~~~r~~~~g~~~~~~~~~~~~~g~~~l 239 (489) T protein:vir:78 160 VNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDIDSDGNYRQRLFRFDAEGGAQEDVVEIYPDLGESLR 239 (489) T ss_pred cCceeeeeCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEecCCCcceEEEEEEeecCCcccceeeEEeccCCCCcc Confidence 1 111 10 01 00 00000 000000000 0001 0 01 Q ss_pred ceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhccc Q lcl|NC_011801. 156 EVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGE 235 (386) Q Consensus 156 ~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~ 235 (386) ..|=|-+.. ....+...|.+|+..++..--.....+.-....+...+.|-.++...+ ..+++..+.... ....-|+ T Consensus 240 ~~IPfv~~~-~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~G~d-~~~~~~~~~~~~--~~i~~g~ 315 (489) T protein:vir:78 240 GVIPFTFIG-ATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGE-NLTPQAFKEANP--NGIKFGS 315 (489) T ss_pred CeeeEEEEe-cCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCc-cCCcccccccCc--cceeeCC Confidence 111111111 122345578888877665544444444445555566777877776322 233433332221 1122233 Q ss_pred ccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHH--HHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 236 NAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNIT--MIRAFYQSSLSIYIKPIES 313 (386) Q Consensus 236 ~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~--~~~~~~~~~l~P~~~~ie~ 313 (386) + ..+.++.+.++.-+..++..+. .+.++....+.+ ..| ..++.-... ...++ .....-...|.-++..+|+ T Consensus 316 ~--~~~~lp~~~~~~~ie~~~~~~~-r~~l~~le~qm~-~lG--a~l~~~~~~-~Ta~~~~~~~~~~~S~L~~~a~~~e~ 388 (489) T protein:vir:78 316 R--RGHNLGYGGSAQLIQAGENNLA-RQNMLDKEQQAI-QIG--AQLITPTQQ-ITAQSARIQRGADTSVMATIARNVSQ 388 (489) T ss_pred c--ccccCCCCCCcceeccCcchHH-HHHHHHHHHHHH-HHh--hhhccCCcc-hhHHHHHHHHHHhhHHHHHHHHHHHH Confidence 3 3566766665555554444442 222222222222 222 334432211 11122 2234456778888899999 Q ss_pred HHHHhhhhhh-----------hh--cchhhh-ccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCccccc-- Q lcl|NC_011801. 314 ELSQKLGTDV-----------KL--DIASAI-DSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTNL-- 377 (386) Q Consensus 314 ~l~~~l~~~~-----------~f--d~~~~l-~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~~-- 377 (386) ++++.|-... +| +.+-.. ..|.. ..+.+-+++..|.++..+.++.|...++++..+-++-++. T Consensus 389 al~~~l~~~a~w~G~~~~~~~~i~~n~dF~~~~~d~~-~~~al~~~~~~G~is~~t~~~~L~~~gv~d~~~e~~~~ei~~ 467 (489) T protein:vir:78 389 AYTDALRWVAVMLGKPEDTEVEFRLNMDFFLEPMTAQ-DRAAWMADINAGLLPATAYYAALRKAGVTDWTDADIKDAVAD 467 (489) T ss_pred HHHHHHHHHHHHcCCCCCCceEEEeecccCcccCCHH-HHHHHHHHHhcCCCCHHHHHHHHHhCCCCCccHHHHHHHHhh Confidence 9988663221 11 222111 12332 3455566777777777777776665555422211110000 Q ss_pred --c---------ccCCCCCC Q lcl|NC_011801. 378 --L---------DNTKNIND 386 (386) Q Consensus 378 --~---------~~~~~~~~ 386 (386) . .+..+..+ T Consensus 468 ~~~~~~~~~~g~~~~~~q~~ 487 (489) T protein:vir:78 468 QPLPVATEVQGEIPQSAQQQ 487 (489) T ss_pred cCCCcccCCcccCCCCcccc Confidence 0 00111111 No 226 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=76.24 E-value=0.14 Score=25.31 Aligned_cols=368 Identities=14% Similarity=0.086 Sum_probs=155.4 Q ss_pred hhhhhccccccCCccchh-------hh--------------hhcccc--cccCccc-ccHH--HHhccHHHHHHHHHHHH Q lcl|NC_011801. 3 FLSNLFKRQKMLSGSSPV-------WI--------------LNQGQP--VSIKPKA-ITSA--IALKNSDVYAVISRVSS 56 (386) Q Consensus 3 ~~~~l~~~~~~~~~~~~~-------~~--------------~~~~~~--~~~~~~~-i~~~--~a~~~~~v~~~v~~ia~ 56 (386) .+..--++.+.. ..+|. |. ...+.+ .....+. +... .|.-.+.+...++.+++ T Consensus 1 ~~~~~~~~~~V~-~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G 79 (491) T protein:vir:95 1 MLTANGQGSGVK-TKHREWLHYAPKWQKVRHALAGDLVGYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSGMVG 79 (491) T ss_pred CcccCCccCCCC-ccCHHHHHHHHHHHHHHHHhcCcchhhcccCCCcCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhc Confidence 111000111110 00110 00 000000 0000010 1111 11112344556666666 Q ss_pred hhccCceeec-chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCc-----------eEEEEEEcCcce Q lcl|NC_011801. 57 DIAGCRFVTN-AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGY-----------PVRIEPVPNEKV 124 (386) Q Consensus 57 ~ia~~p~~~~-~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~-----------~~~l~~l~~~~v 124 (386) .+-.-|..+. ...+..++..-=-...+-.+|.+.++...+.+|-|++++.....+. ---+..+.|..| T Consensus 80 ~vfrk~p~~~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~Ade~~~~~rPy~~~~~~~~I 159 (491) T protein:vir:95 80 SVMRKEPEINIPKELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETAAATAAEQNAGLLNPTIAFYTTENI 159 (491) T ss_pred hhhcCCceeeccHHHHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCcccCHHHHHHhcCCcEEEEechhhh Confidence 6655555443 3445566644445677789999999999999999999998755432 112333333333 Q ss_pred ---E--------------Eee-----cC-C--Cce--eEEE-E-------------eccCccccee--EE-Ec-----cc Q lcl|NC_011801. 125 ---T--------------VAL-----DD-Y--GKD--LTYT-V-------------HFDDSKRSGD--FL-YD-----SS 155 (386) Q Consensus 125 ---~--------------~~~-----~~-~--~~~--~~~~-~-------------~~~~~~~~~~--~~-~~-----~~ 155 (386) + +.. +. + +.. ..|. . .....+.... .. ++ +- T Consensus 160 inW~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~~~~~~qyRvL~l~~~g~~~~~v~r~~~~g~~~~~~~~~~~~~g~~~l 239 (491) T protein:vir:95 160 VNWRLTRVGSVNRVTMVVLRETWEYHEPGNEFETKYGEQYRVLDIDTDGNYRQRLFRFDAEGGAQEEVVEIYPDLGESLR 239 (491) T ss_pred cCceeeeeCCceeeeEEEEEEeEEeecCCCCcccceEEEEEEEeecCCCceEEEEEEEcCCCcceeeeeeeeecCCCccc Confidence 1 110 10 0 000 0010 0 0000000000 00 00 01 Q ss_pred ceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhccc Q lcl|NC_011801. 156 EVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGE 235 (386) Q Consensus 156 ~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~ 235 (386) ..|-|-+.. ....+...|.+|+..++..--.....+.-....+...+.|-.+++..+ ..+++..+.... ....-|+ T Consensus 240 ~~IPfv~~~-~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~~~G~d-~~~~~~~~~~~~--~~i~~g~ 315 (491) T protein:vir:95 240 GVIPFTFIG-ATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGD-NLTPQSFKEANP--NGIKFGS 315 (491) T ss_pred CeeEEEEEe-cCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCc-ccCcchhhccCc--ceeEecC Confidence 111111111 122344578888877665544444444445555567777877775422 234433332211 1122233 Q ss_pred ccCcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHH--HHHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 236 NAGRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNIT--MIRAFYQSSLSIYIKPIES 313 (386) Q Consensus 236 ~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~--~~~~~~~~~l~P~~~~ie~ 313 (386) ++ .+.++.|.++.-+..+++.+. .+.++....+.. ..| ..++.-... ...++ .....-...|.-++..+++ T Consensus 316 ~~--~~~lP~~~~~~~ie~~~~~~~-~~~l~~~e~qm~-~~G--a~l~~~~~~-~Ta~~~~~~~~~~~S~L~~~a~~~e~ 388 (491) T protein:vir:95 316 RC--GHNLGYGGSAQLIQAGENNLA-RQNMLDKEQQAI-QIG--AQLITPSQQ-ITAESARIQRGADTSVMATIARNVSQ 388 (491) T ss_pred cC--CcCCCCCCccceeecCcchHH-HHHHHHHHHHHH-HHH--HHhccCCcc-hhHHHHHHHHHHhhHHHHHHHHHHHH Confidence 33 466666666655555443332 112222112111 112 123322211 11122 2234456778888899999 Q ss_pred HHHHhhhhh-----------hh--hcchhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCcccc--- Q lcl|NC_011801. 314 ELSQKLGTD-----------VK--LDIASAID-SDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTN--- 376 (386) Q Consensus 314 ~l~~~l~~~-----------~~--fd~~~~l~-~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~--- 376 (386) ++++.|-.. ++ .+.+-... .|.. ....+-+++.+|.++....++.|....+++..+-++-.. T Consensus 389 al~~~l~~~a~w~G~~~~~~v~i~~n~dF~~~~~~~~-~~~all~~~~~G~is~~t~~~~L~~~~vl~~~~e~~~~~ie~ 467 (491) T protein:vir:95 389 AYTDALRWVAMMLGKPEDSEVEFQLNMDFFLQPMTAQ-DRAAWMADINAGLLPATAYYAALRKAGVTDWTDEDILNAIED 467 (491) T ss_pred HHHHHHHHHHHHcCCCCCCceEEEeecccccccCCHH-HHHHHHHHHhcCCCCHHHHHHHHHhCCCCCccHHHHHHHHHh Confidence 988866321 11 22222222 2333 455666677888888888888776555543222111100 Q ss_pred -----------------ccccCCC Q lcl|NC_011801. 377 -----------------LLDNTKN 383 (386) Q Consensus 377 -----------------~~~~~~~ 383 (386) ..+.+.. T Consensus 468 ~~~~~~~~~~~~~~~~~~~~~~~~ 491 (491) T protein:vir:95 468 APLPSGAVTQVAGEIPQAAQQQQE 491 (491) T ss_pred cCCCCCccccccccchhhhhhccC Confidence 0011111 No 227 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=62.34 E-value=0.34 Score=23.18 Aligned_cols=366 Identities=10% Similarity=0.025 Sum_probs=153.6 Q ss_pred Cchhh------hhccccccC-Cccchhhhh--hccccccc--Ccc------------ccc-------HHHHhccHHHHHH Q lcl|NC_011801. 1 MAFLS------NLFKRQKML-SGSSPVWIL--NQGQPVSI--KPK------------AIT-------SAIALKNSDVYAV 50 (386) Q Consensus 1 Mg~~~------~l~~~~~~~-~~~~~~~~~--~~~~~~~~--~~~------------~i~-------~~~a~~~~~v~~~ 50 (386) |..-. .....+... ......... ..+-+-.. ... .++ .+.|.=.+....+ T Consensus 14 m~V~~~hp~y~a~~~~W~~~~d~g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y~~~~~~rA~~~n~~~~t 93 (488) T protein:vir:96 14 MLTPIYHPDYLVNAPQWLRNLDCVMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDWEDLTWRLANYVNIVNPT 93 (488) T ss_pred ecccccCHHHHHHhhhhhHhhhhhhHHHHHhhhhcCCCCCCccccccCcchhhhhhccchhhhHhhhhhccccCchhHHH Confidence 33210 000000000 000000000 00111000 000 000 0012223455566 Q ss_pred HHHHHHhhccCceeecch---hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCce----------EEEE Q lcl|NC_011801. 51 ISRVSSDIAGCRFVTNAQ---PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYP----------VRIE 117 (386) Q Consensus 51 v~~ia~~ia~~p~~~~~~---~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~----------~~l~ 117 (386) ++.+++.+.+-|..+... .+..++..-=-...+-.+|.+.++...+.+|-|++++.....+.. --+. T Consensus 94 l~~l~G~vfrk~p~~~~~~~~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~T~ade~~~~~rPy~~ 173 (488) T protein:vir:96 94 MNAITGAVMRREPEFDTMDNPVLIGLRDNIDGKGNGIDQECKQALNALQWGSRCGWLVRSHPESATMADWNKGKKLPTAA 173 (488) T ss_pred HHHhcchhhccCceeccCCcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCcCCHHHHHHhcCCcEEE Confidence 666666666666554422 255666444457777899999999999999999999876543210 1122 Q ss_pred EEcCcce-----------------EE-----eecCCCc--eeEEEE-eccC----------cccceeEEEcc------cc Q lcl|NC_011801. 118 PVPNEKV-----------------TV-----ALDDYGK--DLTYTV-HFDD----------SKRSGDFLYDS------SE 156 (386) Q Consensus 118 ~l~~~~v-----------------~~-----~~~~~~~--~~~~~~-~~~~----------~~~~~~~~~~~------~~ 156 (386) .+.|..| .+ ..|.... ...+.+ ...+ ...... ..+. -. T Consensus 174 ~~~a~~IinW~~~~v~G~~~L~~v~lrE~~~~~D~~~~~~~~~~~~~~l~~g~~~v~~~~~~~~~~e-~~~~~~g~~~l~ 252 (488) T protein:vir:96 174 FYDALHIIDWEVEYIDGEEKLTYLSLLEDYQERDGGTYVSKQRLINHRLVDGLCEFQEVTDDEYSDE-WTPVLINSKQSD 252 (488) T ss_pred EechhhhcCcceeccCCceeeEEEEEEEEEEeccCCCcccceEEEEEEEECcEEEEEEEecCCcccc-eEeecCCCcccC Confidence 2222221 11 1121110 111110 0000 000110 1110 01 Q ss_pred eeeeccccccCcccccccccHHHHHHH-HHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhccc Q lcl|NC_011801. 157 VIHFRCTVSGESDTQYMGIPPIDSLLN-EIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGE 235 (386) Q Consensus 157 vih~~~~~~~~~~~~~~G~s~~~~~~~-~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~ 235 (386) .|=|-+. .....+...|.+|+..++. .+...+....+... +...+.|..++...+ .+++..+.... .|- T Consensus 253 ~IP~v~~-~~~~~~~~~~~pPLldLA~lnl~Hy~~ssd~~~i-l~~~~~p~lv~~~~~--~~~~~~~~~~~------~g~ 322 (488) T protein:vir:96 253 TIPFFLA-SSQSNEWCIDSTPLTSLAEISLSIYVMNAYSNKA-MILANEAKWMVDMGD--MNKTMASEMNP------LGF 322 (488) T ss_pred eeEEEEE-ecCCCCCCCCCCchHHHHHHHHHHHhhhhHHHHH-HHhcCCceeeeccCC--CCccccccccc------cee Confidence 1111111 1122344577787776554 44445555555444 446667767654433 34433332211 121 Q ss_pred ccCc--ceecCCC-ceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHH--HHHHHHHHHHHHHHHH Q lcl|NC_011801. 236 NAGR--AVVLDQS-ADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNIT--MIRAFYQSSLSIYIKP 310 (386) Q Consensus 236 ~~g~--~~vl~~g-~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~--~~~~~~~~~l~P~~~~ 310 (386) ..+. +...+.| ++|.+.+ ++.+ ..+.++...++.+. .| ..++..... ...++ .....-+..|.-++.. T Consensus 323 ~~~~~~~~~~~~g~~~~~e~~--~~~l-~~~~l~~l~~qm~~-~G--a~l~~~~~~-~Ta~~~~~~~~~~~S~L~~~a~~ 395 (488) T protein:vir:96 323 TLAGRMPYYVKNGDVKVIQAQ--FSPE-TENKVEKLFEQAVK-VG--ASLFTQQSN-ETATGAAIRSGSSTASMATLGNN 395 (488) T ss_pred eecccccccccCCceeecCCc--hhHH-HHHHHHHHHHHHHH-Hh--HhhccCCCc-chHHHHHHHHHHhhHHHHHHHHH Confidence 1211 2333344 4455443 3322 12223333333211 12 233332211 11222 2233456778888999 Q ss_pred HHHHHHHhhhhh---------------hhhcchh--hh-ccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCC-CCCC Q lcl|NC_011801. 311 IESELSQKLGTD---------------VKLDIAS--AI-DSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFP-ELDL 371 (386) Q Consensus 311 ie~~l~~~l~~~---------------~~fd~~~--~l-~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p-~~~~ 371 (386) +|+++++.|... .+|+++. .. ..|. ..++++-+++.+|.++..+.++.+.+.+++. +-+. T Consensus 396 le~al~~~l~~~A~w~g~~~~~~~~~~~~~~in~dF~~~~ld~-~~~~al~~~~~~G~Is~~t~~~~L~~~gvl~~d~~~ 474 (488) T protein:vir:96 396 VEDTVRNMLRFIMRYFEGTNLYVNPDELVFKLNRDYFDVEVNP-QMLQVAYAAMMEGNLPQVSWFELLKRARVVRGDMSK 474 (488) T ss_pred HHHHHHHHHHHHHHHcCCCCCCcCccceEEEeccCCCCccCCH-HHHHHHHHHHhcCCCCHHHHHHHHHhCCcCCccCCH Confidence 999998866321 2233221 11 2233 3466677788999999999988887766652 2222 Q ss_pred CccccccccCCCCC Q lcl|NC_011801. 372 DEGTNLLDNTKNIN 385 (386) Q Consensus 372 ~~~~~~~~~~~~~~ 385 (386) .+-.+.+.....+- T Consensus 475 e~~~~~ie~~g~~~ 488 (488) T protein:vir:96 475 EEFDEHIAELGFGM 488 (488) T ss_pred HHHHHHHhhcCCCC Confidence 11111111111111 No 228 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=62.07 E-value=0.34 Score=23.14 Aligned_cols=349 Identities=11% Similarity=0.052 Sum_probs=137.0 Q ss_pred Cchh-hhhccccc---cCCccch---------hhhhhcccccccCcccccHHHH-------hccHHHHHHHHHHHHhhcc Q lcl|NC_011801. 1 MAFL-SNLFKRQK---MLSGSSP---------VWILNQGQPVSIKPKAITSAIA-------LKNSDVYAVISRVSSDIAG 60 (386) Q Consensus 1 Mg~~-~~l~~~~~---~~~~~~~---------~~~~~~~~~~~~~~~~i~~~~a-------~~~~~v~~~v~~ia~~ia~ 60 (386) ..+. +.--++.. ...++.. ........ ..|..-.++.... .....|..-+..+.+ T Consensus 40 ~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltP~-~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~---- 114 (522) T protein:vir:94 40 SLFPKESDNSSTEYTTPWQAVGARCLNNLAAKLMLALFPQ-SPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVER---- 114 (522) T ss_pred cccCCCCCcccccccccccccHHHHHHHHHHHHHhhcCCC-CcccccccchhhhhccCcccchhHHHHHHHHHHHH---- Confidence 1110 00000000 0000000 00011110 1122212221100 011112221211111 Q ss_pred CceeecchhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCceeEEE-- Q lcl|NC_011801. 61 CRFVTNAQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDLTYT-- 138 (386) Q Consensus 61 ~p~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~-- 138 (386) -+...| .+.| .+.-+..+..+++.+|||.+++..+..|.+..+..+|-..+-+..|..+...... T Consensus 115 --------~~~~~~-~~sn----f~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~~~~~pl~~y~v~~d~~G~vd~i~r~ 181 (522) T protein:vir:94 115 --------VLMAYM-ETNS----FRVPLFEALKQLIVSGNCLLYIPEPEQGTYSPMRMYRLVSYVVQRDAFGNILQIVTI 181 (522) T ss_pred --------HHHHHH-HhcC----cHHHHHHHHHHHHhhCcEeEeeeccCCCceeeEEEEEcceEEEeeCCCcCeEEEeee Confidence 112223 2234 3344556678888899999998777777665554455555655555555331111 Q ss_pred --Ee-----------------------------ccCccc--------cee-------EEEcccceeeeccccccCccccc Q lcl|NC_011801. 139 --VH-----------------------------FDDSKR--------SGD-------FLYDSSEVIHFRCTVSGESDTQY 172 (386) Q Consensus 139 --~~-----------------------------~~~~~~--------~~~-------~~~~~~~vih~~~~~~~~~~~~~ 172 (386) +. .....+ +.. ..+..-+.+.+||.. ..+.. T Consensus 182 ~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~P~~~~Rw~~---~~ge~ 258 (522) T protein:vir:94 182 DKVAFSALPEDVKSQLNADDYEPDTELEVYTHIYRQDDEYLRYEEVEGIEVTGTDGSYPLTACPYIPVRMVR---LDGED 258 (522) T ss_pred eeccHHhcchHHHHHHhcccCCccceEEEEEEEEeeCCceeEEeeccCceecccCCCCccccCCceeeeeee---cCCCc Confidence 00 000000 000 011122334444432 23457 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceec--CCCceee Q lcl|NC_011801. 173 MGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVL--DQSADVE 250 (386) Q Consensus 173 ~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl--~~g~~~~ 250 (386) ||.||...+...+...+...+.......-...|..++. ++........ ..+.+ +.++- ++++... T Consensus 259 YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~-~~g~~~~~~~----------~~~~~--g~~v~g~~~~v~~~ 325 (522) T protein:vir:94 259 YGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVN-PNGITQPRRL----------NKAAT--GEFVAGRVEDINFL 325 (522) T ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeec-ccccccchhe----------eccCC--ceeecCCcccceee Confidence 99999999999999999999999999988888876553 3434333322 11111 12222 2233344 Q ss_pred eccCChhhHHH-HHHHHHHHHHHHHHhCCCHHHhcCCcCcccHH-HHHHHHHHHHHHHHHHHHHHHHHHhhhhhh----- Q lcl|NC_011801. 251 TTNISPNVTEF-LQNVSFSQDQIAKAFGIPADYLSGKQDAQSNI-TMIRAFYQSSLSIYIKPIESELSQKLGTDV----- 323 (386) Q Consensus 251 ~~~~~~~d~~~-~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~-~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~----- 323 (386) ++.. +.+.+. .+..+..+.+|-.+|-+.....-..+.-+-.| ..+..-....|.|....+.++|-.-|...+ T Consensus 326 ~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~ 404 (522) T protein:vir:94 326 QLTK-GQDFTIAKSVADAIEQRLGWAFLLNSAVQRNAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPIVRVLMNQLQ 404 (522) T ss_pred eccc-ccchhHHHHHHHHHHHHHHHHHhhhhhccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHH Confidence 4333 234443 56677788889999966531111112112222 123344556677777777776654332110 Q ss_pred ------hhcchhhhccCHHHHHHHHHHHHhC-CCc-CHHHHHHHhccCCcCCCCCCCccccccccCCC---------CCC Q lcl|NC_011801. 324 ------KLDIASAIDSDNSELINNVQKLASA-GVL-APIQAQKLLKNRGVFPELDLDEGTNLLDNTKN---------IND 386 (386) Q Consensus 324 ------~fd~~~~l~~d~~~~~~~~~~~~~~-g~~-t~nE~R~~lg~~p~~p~~~~~~~~~~~~~~~~---------~~~ 386 (386) ++. +.+++.+.-+-...+.++... .++ ..+.+- .++-+.+.+..+.++..+.+....+ ..+ T Consensus 405 r~g~lP~~p-~~~v~v~~~s~La~~qr~~~~~~l~~~~~~ia-~l~P~~~~~~id~d~~~~~~a~~~Gv~~~~ivr~~ee 482 (522) T protein:vir:94 405 SAGMIPDLP-KEAVEPTVSTGLEALGRGQDLEKLTQAVNMMT-GLQPLSQDPDINLPTLKLRLLNALGIDTAGLLLTQDE 482 (522) T ss_pred hcCCCCCCC-cccEEeeEecHHHHHHHHHHHHHHHHHHHHHH-hccchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHH Confidence 000 001111111111111111100 000 001110 0110000011111111111111110 001 No 229 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=61.73 E-value=0.35 Score=23.10 Aligned_cols=372 Identities=15% Similarity=0.088 Sum_probs=146.0 Q ss_pred Cchhhhh-------cccc----ccCCccchhhhhhcccccc-cCcc---cccHH---HHhc----cHHHHHHHHHHHHhh Q lcl|NC_011801. 1 MAFLSNL-------FKRQ----KMLSGSSPVWILNQGQPVS-IKPK---AITSA---IALK----NSDVYAVISRVSSDI 58 (386) Q Consensus 1 Mg~~~~l-------~~~~----~~~~~~~~~~~~~~~~~~~-~~~~---~i~~~---~a~~----~~~v~~~v~~ia~~i 58 (386) |.-.++. ..++ ....+. ..+......+.+ .... .-+.. .++. .+.....++.+++.+ T Consensus 32 m~dV~~~hp~y~a~~~~W~~ird~~~G~-~~~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~v 110 (535) T protein:vir:80 32 LPNVGYQRVEFGEMLPKWRKIMDCLSGQ-EAIKAKREEYLPMPSVDSRDEEQRRRYETYLQRAIFYNVTARTLDGMMGQV 110 (535) T ss_pred CCCCCcCCHHHHHHHHHHHHHHHHhcCh-HHHHhcccccCCCCCcccCCcCCHHHHHHHHhhccCCChhHHHHHHHhchh Confidence 5532210 0000 000100 011000000000 0100 00111 1122 233445555555544 Q ss_pred ccCceeec-chhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCce------------EEEEEEcCcce- Q lcl|NC_011801. 59 AGCRFVTN-AQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYP------------VRIEPVPNEKV- 124 (386) Q Consensus 59 a~~p~~~~-~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~------------~~l~~l~~~~v- 124 (386) -+-|..+. ...+..++..-=-...+-.+|.+.++...+.+|-|++++.....|.. --+..+.|..| T Consensus 111 frk~p~~~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~~~~~t~ade~~~~~rPy~~~y~ae~Ii 190 (535) T protein:vir:80 111 FSRDPIRQLPPALEAIVEDIDGEGVSLDQQAKKALGYTMGFGRAAIFTDYPNVGRPVTVLEQKLGLYRPTITLVHPTSII 190 (535) T ss_pred hcCCcceeccHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccHHHHHhcCCCcEEEEechhhcc Confidence 44443332 33455555433346667899999999999999999999976554431 11222222211 Q ss_pred ----------------EE-----eecCC-CceeE--EEE--------------eccCcc---cceeEEEcc------cce Q lcl|NC_011801. 125 ----------------TV-----ALDDY-GKDLT--YTV--------------HFDDSK---RSGDFLYDS------SEV 157 (386) Q Consensus 125 ----------------~~-----~~~~~-~~~~~--~~~--------------~~~~~~---~~~~~~~~~------~~v 157 (386) .+ ..++. +.... |.+ ...... ......++. -.. T Consensus 191 nW~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~RvL~~~~~G~y~v~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~ 270 (535) T protein:vir:80 191 NWRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQWRVLQLNAEGNYQVERWRRETQEEMYYSYSKHVPTDGNGNPFKE 270 (535) T ss_pred CccccccCCccceeEEEEEEEEEecCCCcccceeEEEEEEEecCCceEEEEEEEeecCCccccccceeecccCCCcccCe Confidence 00 11111 11110 100 000000 000001111 111 Q ss_pred eeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhccccc Q lcl|NC_011801. 158 IHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENA 237 (386) Q Consensus 158 ih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~ 237 (386) |=|-+. .....+...|-+|+..++..--.....+.-....+...+.|-.+++-.+... .+...+ -....-|.+ T Consensus 271 IPfv~~-~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~il~~~~~P~l~i~G~~~~~----~~~~~~-~~~i~iG~~- 343 (535) T protein:vir:80 271 IPFQFI-GPLDNNADIDHPPLLDLCEVNIGHYRNSADYEEMAFVAGQPTAFFTGLTKDW----VEDVFK-DFKVHLGSR- 343 (535) T ss_pred eEEEEe-ecCCCCCCCCccchHHHHHHHHHHhhchhHHHHHHHHhcCceeeeecCchhh----hhcCCC-CcceEecCc- Confidence 212111 1223445678888876665544433333334444556677877776332111 110000 001222333 Q ss_pred CcceecCCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHHHHH--HHHHHHHHHHHHHHHHHH Q lcl|NC_011801. 238 GRAVVLDQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNITMIR--AFYQSSLSIYIKPIESEL 315 (386) Q Consensus 238 g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~~~~--~~~~~~l~P~~~~ie~~l 315 (386) ..+.++.|.++.-+..+++-+. .+..+...+++++ .| ..++..........+... .--...|.-++..++++| T Consensus 344 -~~~~lP~~~~~~~~e~~~~~~a-~~~l~~~e~qM~~-lG--a~ll~~~~~~~Ta~~a~~~~~~~~S~L~~~a~~le~al 418 (535) T protein:vir:80 344 -AIIPLPQGATAGILQITPNSVP-FEAMTHKESQMIA-MG--ANLLVKSGGNRTFGEAQQEEASEQSILSACTKNVSMAF 418 (535) T ss_pred -ccccCCCCCCcceeeeccchhH-HHHHHHHHHHHHH-HH--HHhhccCcccccHHHHHHHHHHHhHHHHHHHHHHHHHH Confidence 3567776665554444443333 2233333333333 22 222321111111222221 122456777888888888 Q ss_pred HHhhhhhh--------------hhcchhhh-ccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCC-------- Q lcl|NC_011801. 316 SQKLGTDV--------------KLDIASAI-DSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLD-------- 372 (386) Q Consensus 316 ~~~l~~~~--------------~fd~~~~l-~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~-------- 372 (386) ++.|-... +.+.+-.. ..|..+ ...+-+++..|.++..+.++.|.+.++++..... T Consensus 419 ~~aL~~~A~w~G~~~~~~~~~i~~n~dF~~~~ld~~~-~~all~~~~~G~Is~et~~~~L~r~gvl~~~~~~eee~~ri~ 497 (535) T protein:vir:80 419 RKALRWANQFQTGIVNDETVEYNLNTDFPAARLTPNE-RAELILEWQQGAITFKEMRAGLRRAGVASEDDAKAETEGKAT 497 (535) T ss_pred HHHHHHHHHHcCCccCCCceEEEeccccccccCCHHH-HHHHHHHHhcCCCCHHHHHHHHHhCCCCCcccchHHHHHHHH Confidence 87663221 11211111 223433 4444567777777777776666544443211100 Q ss_pred -c-------cccccccCCCCC------------C Q lcl|NC_011801. 373 -E-------GTNLLDNTKNIN------------D 386 (386) Q Consensus 373 -~-------~~~~~~~~~~~~------------~ 386 (386) | ++.......+++ | T Consensus 498 ~E~~~~~~~~g~~~d~~~~g~~~~~~~~~~~~~~ 531 (535) T protein:vir:80 498 VEFIAKTAAAGKVGDAASGGTNKAKLNNGNGGGN 531 (535) T ss_pred hhhhhccccCCCCCCCCCCCCCcCcccCCccccc Confidence 0 111111111111 1 No 230 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=57.43 E-value=0.43 Score=22.57 Aligned_cols=337 Identities=13% Similarity=0.079 Sum_probs=134.2 Q ss_pred Cchhhhhcccc------------ccCCcc------chhhhhhcccccccCcccccHHHHhc-------cHHHHHHHHHHH Q lcl|NC_011801. 1 MAFLSNLFKRQ------------KMLSGS------SPVWILNQGQPVSIKPKAITSAIALK-------NSDVYAVISRVS 55 (386) Q Consensus 1 Mg~~~~l~~~~------------~~~~~~------~~~~~~~~~~~~~~~~~~i~~~~a~~-------~~~v~~~v~~ia 55 (386) ..- +|.+. ..+... .......+.. ..|..-.++.....+ ...|..-+..+. T Consensus 40 lP~---~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~-~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve 115 (535) T protein:vir:33 40 IPS---LFPKESDNESTDYTTPWQAVGARGLNNLASKLMLALFPM-QSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVE 115 (535) T ss_pred ccc---ccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCC-CcccccccChHHHhccccCcchHHHHHHHHHHHH Confidence 110 00000 000000 0001111111 122222222221111 111222111111 Q ss_pred HhhccCceeecchhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCcee Q lcl|NC_011801. 56 SDIAGCRFVTNAQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDL 135 (386) Q Consensus 56 ~~ia~~p~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~ 135 (386) +-+...| .+-| .+.-+..+..+++.+||+.+++..+.. ....+..++-..+-+..|..+... T Consensus 116 ------------~~~~~~~-~~sn----f~~~~~~~~~~L~~~G~a~l~~~~~~~-~~~~f~~~pl~~~~v~~d~~G~vd 177 (535) T protein:vir:33 116 ------------RIIMNYI-ESNS----YRVTLFECLKQLIVAGNALLYLPEPEG-SYNPMKLYRLSSYVVQRDAYGNVL 177 (535) T ss_pred ------------HHHHHHH-HhcC----cHHHHHHHHHHHHhhCceeEEeecCCC-CceeeEEEEcCeeEEeeCCCCCee Confidence 1122233 2234 334455667888889999888866543 233333444455555444444221 Q ss_pred EEE-----------------------------------EeccCcccce-----------------eEEEcccceeeeccc Q lcl|NC_011801. 136 TYT-----------------------------------VHFDDSKRSG-----------------DFLYDSSEVIHFRCT 163 (386) Q Consensus 136 ~~~-----------------------------------~~~~~~~~~~-----------------~~~~~~~~vih~~~~ 163 (386) ... .-......+. ...+..-+.+..||. T Consensus 178 ~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~i~~Rw~ 257 (535) T protein:vir:33 178 QIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVYTHVYLDEESGDYLKYEEVEDVEIDGSDATYPTDAMPYIPVRMV 257 (535) T ss_pred EEEeeEeecHHHHHHHhhhhhcccccccccccCCeEEEEEEeeCCCCcEEEEEEEeCccccccccccccccCCceeeeee Confidence 100 0000000000 001112233444443 Q ss_pred cccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCccee- Q lcl|NC_011801. 164 VSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVV- 242 (386) Q Consensus 164 ~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~v- 242 (386) . ..+..||.||...+...+...+...+.......-...|..++. ++........ ..+.+ +.++ T Consensus 258 ~---~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~-~~g~~~~~~~----------~~~~~--g~~v~ 321 (535) T protein:vir:33 258 R---IDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVN-PAGITQPRRL----------TKAQT--GDFVP 321 (535) T ss_pred e---cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec-cccccchhhc----------ccCCc--eeeec Confidence 2 2345799999999999999999999999999888888876653 3333333211 11211 1222 Q ss_pred -cCCCceeeeccCChhhHH-HHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHH--HHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011801. 243 -LDQSADVETTNISPNVTE-FLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNIT--MIRAFYQSSLSIYIKPIESELSQK 318 (386) Q Consensus 243 -l~~g~~~~~~~~~~~d~~-~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~--~~~~~~~~~l~P~~~~ie~~l~~~ 318 (386) -.+++...++...+ +.+ ..+..+..+..|-.+|-+.. +.......-+.+| .+..-....|.|....+.++|-.- T Consensus 322 g~~~~v~~~~~~~~~-~~~~~~~~i~~~~~~I~~af~~~~-~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~P 399 (535) T protein:vir:33 322 GRREDIDFLQLEKQA-DFTVAKAVSDQIEARLSYAFMLNS-AVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLP 399 (535) T ss_pred CCcccceeeeccccc-chhHHHHHHHHHHHHHHHHHhhhh-cccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHH Confidence 23334444444333 343 35667788888988885442 2111222222322 234445566777777777776543 Q ss_pred hhhhh-----------hhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCCCCCCccccccc-------- Q lcl|NC_011801. 319 LGTDV-----------KLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPELDLDEGTNLLD-------- 379 (386) Q Consensus 319 l~~~~-----------~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~~~~~~~~~~~~-------- 379 (386) |...+ ++.- ..++...-+-...+.+... +-+.-..-..+.. +.|. ..+..++ T Consensus 400 li~r~~~il~r~g~lP~~p~-~~v~~~yis~La~aqr~~~--~~~l~~~~~~la~--~~P~----~~d~~id~d~~~~~~ 470 (535) T protein:vir:33 400 LVRVLLKQLQATSQIPELPK-EAVEPTISTGLEAIGRGQD--LDKLERCISAWAA--LAPM----QGDPDINLAVIKLRI 470 (535) T ss_pred HHHHHHHHHHhcCCCCCCCc-cceeEEEecHHHHHHHHHH--HHHHHHHHHHHHh--hChh----hhhccCCHHHHHHHH Confidence 32111 0100 1112222222222222211 1111111112221 1110 0110010 Q ss_pred cCCCCCC Q lcl|NC_011801. 380 NTKNIND 386 (386) Q Consensus 380 ~~~~~~~ 386 (386) -...+.| T Consensus 471 a~~~Gvp 477 (535) T protein:vir:33 471 ANAIGID 477 (535) T ss_pred HHHcCCC Confidence 0001111 No 231 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=55.80 E-value=0.47 Score=22.38 Aligned_cols=359 Identities=11% Similarity=0.097 Sum_probs=155.7 Q ss_pred Cchhhhhccc-------cccCCccchhhhhh-----------------cccccccCccc-----ccH-HHHhccHHHHHH Q lcl|NC_011801. 1 MAFLSNLFKR-------QKMLSGSSPVWILN-----------------QGQPVSIKPKA-----ITS-AIALKNSDVYAV 50 (386) Q Consensus 1 Mg~~~~l~~~-------~~~~~~~~~~~~~~-----------------~~~~~~~~~~~-----i~~-~~a~~~~~v~~~ 50 (386) |.||.+-=.+ ....+...|..... .+.....++.. |.. +....+|.|..| T Consensus 1 ~~~w~~~de~~~~~~~~~~~~S~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~A 80 (511) T protein:vir:56 1 MKFWTKEEEQDIQKIEKNPVRSFSAPDNVDGAKEIHTNLLAPQLGHAIIPSDAQSEGTIPVKELIKSYRALAEYHEVDDA 80 (511) T ss_pred CCCccchhhhhhhhhccCCcccccCCCCCCCceEEecccccceecceeccccccccCccchHHHHHHHHHHhhccchhhH Confidence 8888642111 11111111110000 01111111111 111 223568999999 Q ss_pred HHHHHHhhccC-----ceeec--chh------------HHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCC Q lcl|NC_011801. 51 ISRVSSDIAGC-----RFVTN--AQP------------ITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNG 111 (386) Q Consensus 51 v~~ia~~ia~~-----p~~~~--~~~------------~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g 111 (386) |+.|.+.+.-. |+.+. +.+ ..++|+. -+-...+++ .+..|...|..|..++-+... T Consensus 81 v~eIvne~iv~d~~~~pV~l~ld~~~~s~~iK~kI~eeF~~Il~l-l~F~~~~~~----~fR~WYVDgRi~fHkiid~k~ 155 (511) T protein:vir:56 81 IQEIVDEAIVYENDKEVVWLNLDNTDFSENIKAKINEEFDRVVSL-LQMRKHGYK----WFRKWYVDSRIYFHKILDKDN 155 (511) T ss_pred HHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHH-hccchhhhH----HHhhhhhcceEEEEEEecccc Confidence 99999876532 33322 111 1122211 233333444 455667789999988877776 Q ss_pred ceEEEEEEcCcceEEeecC-----------CCceeEEEEeccCcc----------cceeEEEcccceeeeccccccCccc Q lcl|NC_011801. 112 YPVRIEPVPNEKVTVALDD-----------YGKDLTYTVHFDDSK----------RSGDFLYDSSEVIHFRCTVSGESDT 170 (386) Q Consensus 112 ~~~~l~~l~~~~v~~~~~~-----------~~~~~~~~~~~~~~~----------~~~~~~~~~~~vih~~~~~~~~~~~ 170 (386) ...+|..|+|..++.++.- .+...+|.|...+.. ....+.++.+.|.|...-....+.+ T Consensus 156 GI~eLr~lDPr~i~~vr~i~~~~~~~~~v~~~~~ey~~Y~~~~~~~~~~~~~~~~~~~~vkI~~daI~y~hSGL~d~~~~ 235 (511) T protein:vir:56 156 NIIELRPLNPMKMELVREIQKETIDGVEVVKGTLEYYVYKQSDYKMPSWMSATNRAQTSFRIPKDAIVFAHSGLMRGCAD 235 (511) T ss_pred ceeehhhcCcccchhhhhhhcccccccccccceeeeeEecCCCcccCcccccccccccceeechhheeeecccceeccCC Confidence 7889999999998764321 111223333221111 1233678888887664322111234 Q ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHH-HHHHHHHHhc----ccccC------c Q lcl|NC_011801. 171 QYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKEN-TRQSFEEQTT----GENAG------R 239 (386) Q Consensus 171 ~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~-~k~~~~~~~~----~~~~g------~ 239 (386) ..+.+|-+..|.+.+.....++...-=+=-.-+.-+-++.+.-+.+.+..+++ ++..+..... ...+| + T Consensus 236 ~g~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk 315 (511) T protein:vir:56 236 DPYIIGYLDRAIKPANQLKMLEDALVIYRLARAPERRVFYVDVGNLPTQKAQQYVNGIMQNVKNRVVYDTQTGQVKNTTN 315 (511) T ss_pred CCeeeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccchh Confidence 44567777777776666655555444332233333444443323333332322 3333322110 01112 1 Q ss_pred -ceec----------CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcC-ccc----HHHHHHH--HHH Q lcl|NC_011801. 240 -AVVL----------DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQD-AQS----NITMIRA--FYQ 301 (386) Q Consensus 240 -~~vl----------~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~-~~~----~~~~~~~--~~~ 301 (386) +..+ +.|.++++|.-..+ +.-++-.++.++.+..+++||.+-|...+. +.. +.+-.+. =+. T Consensus 316 ~msMlEDyWLpRReGgrgTEItTLpGgqn-lgem~DV~YF~kKLy~aLnVP~SRl~~e~q~~~f~~Gr~~EItRDEiKF~ 394 (511) T protein:vir:56 316 AMSMLEDYYLPRREGSKGTEVSTLPGGQS-LGDIEDVLYFNRKLYKAMRIPTSRAASEDQTGGINFGQGAEITRDELKFT 394 (511) T ss_pred hhhhHhhhcccccCCCCccceeeccccCC-cChHHHHHHHHHHHHHHhCCCcccccCCCCccccccccchhhhHHHHHHH Confidence 1122 23667777654322 223456678889999999999999874431 111 1221111 122 Q ss_pred HHHHHHHHHHHHHHHHhhhhh------------------hhhcchhhhccCH-HHH--HHHHHHHHhCCCcCHHHHHHHh Q lcl|NC_011801. 302 SSLSIYIKPIESELSQKLGTD------------------VKLDIASAIDSDN-SEL--INNVQKLASAGVLAPIQAQKLL 360 (386) Q Consensus 302 ~~l~P~~~~ie~~l~~~l~~~------------------~~fd~~~~l~~d~-~~~--~~~~~~~~~~g~~t~nE~R~~l 360 (386) .-|.-+...|...|...|-.. ++|++. +.+. .++ ++++..-++ +-+.+ T Consensus 395 KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~---~Dn~f~ElKe~Eil~~Rl~--------~l~~~ 463 (511) T protein:vir:56 395 KFVKRLQTKFETVITDPLKHQLIVNNIITEEEWDANHEKLYVVFN---QDSYFEEAKELEILNSRMN--------AMRDI 463 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEee---ecchHHHHHHHHHHHHHHH--------HHHHh Confidence 334555555555544433111 111110 1111 111 122111111 11110 Q ss_pred ccCCcCCCCCCCccccccccCCCCCC Q lcl|NC_011801. 361 KNRGVFPELDLDEGTNLLDNTKNIND 386 (386) Q Consensus 361 g~~p~~p~~~~~~~~~~~~~~~~~~~ 386 (386) .| ..|..+....--.+ T Consensus 464 --dp--------yvGky~S~~yi~k~ 479 (511) T protein:vir:56 464 --QD--------YAGKYYSHKYIQKN 479 (511) T ss_pred --cc--------hhccccchHHHHHH Confidence 11 11122211111111 No 232 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=54.66 E-value=0.5 Score=22.24 Aligned_cols=377 Identities=10% Similarity=0.089 Sum_probs=147.1 Q ss_pred Cchhhh----hcccc-ccCCccc-----hhhh-hhcccccccCccccc-------HHHHhccHHHHHHHHHHHHhhccC- Q lcl|NC_011801. 1 MAFLSN----LFKRQ-KMLSGSS-----PVWI-LNQGQPVSIKPKAIT-------SAIALKNSDVYAVISRVSSDIAGC- 61 (386) Q Consensus 1 Mg~~~~----l~~~~-~~~~~~~-----~~~~-~~~~~~~~~~~~~i~-------~~~a~~~~~v~~~v~~ia~~ia~~- 61 (386) |||.-. +..+. +..++.. ++.. ...+......+..-+ -+....+|.|..||+.|.+.+.-. T Consensus 5 fgf~~~~~~~~~~~~~s~~~p~~ddg~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d 84 (558) T protein:vir:10 5 FGFSIEETQKKSTSIISPVPKNNEDGVDNFISSGFYGQYVDIEGAYRSEYDLIRRYREMALHPEADGAIEDVVNEAIVSD 84 (558) T ss_pred hcchhhhhhhhccCCccccCCCccccccceeccceeeeeecccchhhhHHHHHHHHHHHhhccchhhHHHHhhcceeEec Confidence 666521 11110 1111110 0100 111111111122111 112356899999999998876532 Q ss_pred ----ceeec--chh----HH--------HHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecC---CCceEEEEEEc Q lcl|NC_011801. 62 ----RFVTN--AQP----IT--------DVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDT---NGYPVRIEPVP 120 (386) Q Consensus 62 ----p~~~~--~~~----~~--------~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~---~g~~~~l~~l~ 120 (386) |+.+. ..+ +. ++|+. -+-...+++ .+..|...|..|..++-|. .....+|..|+ T Consensus 85 ~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~l-l~F~~~~~e----~fR~WYVDgRiyfHKiid~k~pk~GI~ELr~lD 159 (558) T protein:vir:10 85 LYDSPVEVELSNLNASNTLKKKIREEFRYIKEM-MDFDKKSHE----IFRNWYVDGRVFYLKVIDTKNPQEGIQDLRYID 159 (558) T ss_pred CCCceEEEEecccCcchHHHHHHHHHHHHHHHH-hccchhhhH----HHhhheeeeEEEEEEEEeCCCccccceeeeeeC Confidence 33221 111 11 22211 233333333 4556677899888886653 33578999999 Q ss_pred CcceEEeecCC----------------------CceeEEEEeccCc---------ccceeEEEcccceeeeccccccCcc Q lcl|NC_011801. 121 NEKVTVALDDY----------------------GKDLTYTVHFDDS---------KRSGDFLYDSSEVIHFRCTVSGESD 169 (386) Q Consensus 121 ~~~v~~~~~~~----------------------~~~~~~~~~~~~~---------~~~~~~~~~~~~vih~~~~~~~~~~ 169 (386) |..++.++.-. +...+|.|..... ..+..+.++.+-|. +-+.-.-+ . T Consensus 160 Pr~i~~Vr~i~~~~~~~~~~~~~~~~~~~~~~~~~~eyy~Y~~~~~~~~~~~~~~~~~~~vkI~~dAI~-y~hSGL~d-~ 237 (558) T protein:vir:10 160 PLKIKFIRQEKRKPGNQDPAIRVRSEQDVVPNPEFEEFYIYTPKVQHPTGMVGQMGGKNSIKIAKDSIT-MCTSGLVD-R 237 (558) T ss_pred cccceeeeeeccccccccceeeeecccceeeccceeEeeeecCCcccccccceeecCCCceeechhhee-eeccccee-c Confidence 99997654321 1112333322111 11122344444443 32221111 1 Q ss_pred cccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHH-HHHHHHHHhc----ccccC------ Q lcl|NC_011801. 170 TQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKEN-TRQSFEEQTT----GENAG------ 238 (386) Q Consensus 170 ~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~-~k~~~~~~~~----~~~~g------ 238 (386) ++-.=+|-+..|.+.+.....++...-=+=-.-+.-+-++.+.-+.+.+...++ ++....+... ...+| T Consensus 238 ~~~~i~syLhkAIKp~NQLkmlEDAlVIYRitRAPERRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddr 317 (558) T protein:vir:10 238 NKNRVLSYLHKAIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLKEVMSRYRNKLVYDANTGEVRDDR 317 (558) T ss_pred CCCeeeecchHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccc Confidence 112223444444444444433333332221122223334433322333332222 3332222110 01111 Q ss_pred c-ceec----------CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCccc--HHHHHH--HHHHHH Q lcl|NC_011801. 239 R-AVVL----------DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQS--NITMIR--AFYQSS 303 (386) Q Consensus 239 ~-~~vl----------~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~--~~~~~~--~~~~~~ 303 (386) + +..+ +.|.++++|.-.. .+.=++-.++.++.+..+++||.+-|...+..+. +.+-.+ .=+..- T Consensus 318 k~msMlEDyWLpRReGgrgTEItTLpGgq-nLgem~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KF 396 (558) T protein:vir:10 318 KFMSMMEDFWLPRREGGRGTEITTLPGGQ-NLGELSDVDYFQKKLYRALGVPESRIAAEGGFNLGRSSEILRDELKFAKF 396 (558) T ss_pred hhhhhHhhhcccccCCCCccceeeccccC-CcchHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHH Confidence 1 1122 2366777765443 2222446678889999999999998864432111 111111 112223 Q ss_pred HHHHHHHHHHHHHHhhh------------------hhhhhc--------------------------------------- Q lcl|NC_011801. 304 LSIYIKPIESELSQKLG------------------TDVKLD--------------------------------------- 326 (386) Q Consensus 304 l~P~~~~ie~~l~~~l~------------------~~~~fd--------------------------------------- 326 (386) |.-+...|...|...|- ..++|+ T Consensus 397 I~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi 476 (558) T protein:vir:10 397 VGRLRKRFAAMFNDMLKTQLVLKNIVTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYYSTEYV 476 (558) T ss_pred HHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHH Confidence 44444444444443220 011111 Q ss_pred chhhhccC---HHHHHHHHHHHHhCCCcC-HHHHHHHhccCCcCCCCCCC--c-cccccccCCCCC----C Q lcl|NC_011801. 327 IASAIDSD---NSELINNVQKLASAGVLA-PIQAQKLLKNRGVFPELDLD--E-GTNLLDNTKNIN----D 386 (386) Q Consensus 327 ~~~~l~~d---~~~~~~~~~~~~~~g~~t-~nE~R~~lg~~p~~p~~~~~--~-~~~~~~~~~~~~----~ 386 (386) ...+|+.. .++..+.+++-...|++- |+|.--+-| .|+++.+++. . +..+..+.-..+ | T Consensus 477 ~k~ILr~tDeeI~~~~kqI~~E~k~~~~~~p~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 546 (558) T protein:vir:10 477 RKRVLRQTDMEIEEIDTQIEDEIQKGIIPDPSQIDPITG-EPLPQEGDPAMEGMGEQPVDPDLEAQAQAVD 546 (558) T ss_pred HHHHhccCHHHHHHHHHHHHHHHhCCCCCCccccChhhc-cccCccCCchhccCCCCCcccccccchhhhh Confidence 11122222 222223334444445443 443322221 2333322221 1 111111111111 1 No 233 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=54.44 E-value=0.5 Score=22.22 Aligned_cols=376 Identities=12% Similarity=0.139 Sum_probs=148.2 Q ss_pred Cchhhh--hccccccCCccch-----------hhhh-hcccccccCccc------ccH-HHHhccHHHHHHHHHHHHhhc Q lcl|NC_011801. 1 MAFLSN--LFKRQKMLSGSSP-----------VWIL-NQGQPVSIKPKA------ITS-AIALKNSDVYAVISRVSSDIA 59 (386) Q Consensus 1 Mg~~~~--l~~~~~~~~~~~~-----------~~~~-~~~~~~~~~~~~------i~~-~~a~~~~~v~~~v~~ia~~ia 59 (386) |+=+-. |.+........++ +... ..+......+.. |.. +....+|.|..||+.|.+.+. T Consensus 1 m~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~i~~~~~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneai 80 (533) T protein:vir:10 1 MSQLFGFSLERAKKAPKGPSFVQKDNLDGSQPVSGGGYYGYTVDFDGQVRNEYQLISRYREMVLQPECDSAVDDIVNETI 80 (533) T ss_pred CccccccccccccccccCCCCCCCCcccccceeecccccceeeecccccchHHHHHHHHHHHhhccchhhHHHHhhccee Confidence 543321 1111111111111 1110 111111222211 111 223468999999999988765 Q ss_pred cC-----ceeec--ch------------hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecC---CCceEEEE Q lcl|NC_011801. 60 GC-----RFVTN--AQ------------PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDT---NGYPVRIE 117 (386) Q Consensus 60 ~~-----p~~~~--~~------------~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~---~g~~~~l~ 117 (386) -. |+.+. .- ...++|+. -+-...+++ .+..|...|..|..++-+. .....+|. T Consensus 81 v~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~l-l~F~~~~~e----~fR~WYVDgRi~fHkiid~~~pk~GI~ELr 155 (533) T protein:vir:10 81 CGNFDDVPVSVELSNLKVSDKIKKLIREEFGEILRL-LDFENRSYE----IFRRWYVDGRLFYHKVIDPDNPQGGLIELR 155 (533) T ss_pred eecCCCceEEEEecccccchHHHHHHHHHHHHHHHH-hccchhhhH----HHhhhhhcceEEEEEEecCCCccccceeee Confidence 32 33322 11 11122211 233333444 4556667888888876653 34578999 Q ss_pred EEcCcceEEeecC-----CC-------------ceeEEEEecc--CcccceeEEEcccceeeeccccccCcccccccccH Q lcl|NC_011801. 118 PVPNEKVTVALDD-----YG-------------KDLTYTVHFD--DSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIPP 177 (386) Q Consensus 118 ~l~~~~v~~~~~~-----~~-------------~~~~~~~~~~--~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s~ 177 (386) .|+|..++.++.. ++ ...+|.|... ....+..+.++.+-|. +-+... ...++-.=+|- T Consensus 156 ~lDPr~i~~vr~i~~~~~~~~~~~~~~~~v~~~~~eyf~Ynp~g~~~~~~~~vkI~~dAI~-y~hSGl-~d~~~~~i~sy 233 (533) T protein:vir:10 156 YIDPRKIRKINETEQKRPEQLRGLPLNQQLSPKSAEYFLYDPKGLKNSTTQGLKIAPDSIC-YVHSGI-MDLNKNMTLSH 233 (533) T ss_pred eccccceeeeeeeeccCCCccceeecchhhhccceeeeeeccccccccCCCceecchhhee-eeeccc-eeCCCCceecc Confidence 9999999874322 11 1111222110 1122334556665444 333221 11222223444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHH-HHHHHHHHhc----ccccC------c-ceec-- Q lcl|NC_011801. 178 IDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKEN-TRQSFEEQTT----GENAG------R-AVVL-- 243 (386) Q Consensus 178 ~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~-~k~~~~~~~~----~~~~g------~-~~vl-- 243 (386) +..|.+.+.....++...-=+=-.-+.-+-++.+.-+.+.+..+++ ++....+... ...+| + +..+ T Consensus 234 LhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlED 313 (533) T protein:vir:10 234 LHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLED 313 (533) T ss_pred chHhHHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhh Confidence 5555555444444443333222222223344433323333332322 3332222110 01111 1 1122 Q ss_pred --------CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCccc--HHHHHH--HHHHHHHHHHHHHH Q lcl|NC_011801. 244 --------DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQS--NITMIR--AFYQSSLSIYIKPI 311 (386) Q Consensus 244 --------~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~--~~~~~~--~~~~~~l~P~~~~i 311 (386) +.|.++++|.-..+ +.-++-.++.++.+..+++||.+-|...+..+. +.+-.+ .=+..-|.-+...| T Consensus 314 yWLPRReGgrgTEItTLpGgqn-Lgem~DV~YF~kKLY~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rF 392 (533) T protein:vir:10 314 FWLPRREGGRGTEITTLPGGQN-LGELEDVKYFQKKLYKSLNVPGSRLETETTFNVGRAAEITRDEVKFQKFVARLRKRF 392 (533) T ss_pred hcccccCCCCccceeeccccCC-cChHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHH Confidence 23667777654322 223456678889999999999998864432111 111111 11222344444444 Q ss_pred HHHHHHhhh------------------hhhhhc--c-------------------------------------hhhhccC Q lcl|NC_011801. 312 ESELSQKLG------------------TDVKLD--I-------------------------------------ASAIDSD 334 (386) Q Consensus 312 e~~l~~~l~------------------~~~~fd--~-------------------------------------~~~l~~d 334 (386) ...|...|- ..++|+ . ..+|+.. T Consensus 393 s~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~t 472 (533) T protein:vir:10 393 SELFTDLLKTQLVLKGVISIEEWDQMKEHIQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFSVEYMRRQVLKQT 472 (533) T ss_pred HHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccC Confidence 444443321 011111 1 1122221 Q ss_pred ---HHHHHHHHHHHHhCCCcC-HH-HHHHHhccCCcCCCCCCCccccccccCCCCCC Q lcl|NC_011801. 335 ---NSELINNVQKLASAGVLA-PI-QAQKLLKNRGVFPELDLDEGTNLLDNTKNIND 386 (386) Q Consensus 335 ---~~~~~~~~~~~~~~g~~t-~n-E~R~~lg~~p~~p~~~~~~~~~~~~~~~~~~~ 386 (386) .++..+.+++-...|++- |+ |.-.. ..+..|..++...++.......-++ T Consensus 473 Deei~~~~kqI~~E~k~~~~~~p~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (533) T protein:vir:10 473 DVEMKEIDKQIESEMESGIIADPAAEMDPA--MAAGDPDAGGAPAEEVAPEGPDPSD 527 (533) T ss_pred HHHHHHHHHHHHHHHhCCCCCCCcchhhHH--hcCCCCCcCCcccccCCCCCCCcch Confidence 122222333333344331 11 11111 1122233333222221111111111 No 234 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=44.13 E-value=0.82 Score=21.06 Aligned_cols=343 Identities=13% Similarity=0.061 Sum_probs=132.2 Q ss_pred Cc-hh-----------hhhccccccCCcc------chhhhhhcccccccCcccccHHHHh-------ccHHHHHHHHHHH Q lcl|NC_011801. 1 MA-FL-----------SNLFKRQKMLSGS------SPVWILNQGQPVSIKPKAITSAIAL-------KNSDVYAVISRVS 55 (386) Q Consensus 1 Mg-~~-----------~~l~~~~~~~~~~------~~~~~~~~~~~~~~~~~~i~~~~a~-------~~~~v~~~v~~ia 55 (386) .. +| .++| ..+... .......+.. ..|..-.++..... ....|..-+..+. T Consensus 40 lP~~~~~~~~~~~~~~~~~~---dst~~~a~~~Laa~l~~~ltP~-~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve 115 (535) T protein:vir:15 40 IPSLFPKESDNESTDYTTPW---QAVGARGLNNLASKLMLALFPM-QSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVE 115 (535) T ss_pred cccccCCCCCcccccccccc---cccHHHHHHHHHHHHHHhhcCC-CcccccccChHHHhccCCCcchHHHHHHHHHHHH Confidence 11 00 0010 000000 0001111111 12222222221111 1112222222111 Q ss_pred HhhccCceeecchhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCcee Q lcl|NC_011801. 56 SDIAGCRFVTNAQPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGKDL 135 (386) Q Consensus 56 ~~ia~~p~~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~ 135 (386) +-+...| .+.| .+.-+..+..+++.+||+.+++..+..+ ...+..++-..+-+..|..+... T Consensus 116 ------------~~~~~~l-~~sn----f~~~~~~~~~~L~~~G~a~l~~~~~~~~-~~~f~~~pl~~~~v~~d~~G~vd 177 (535) T protein:vir:15 116 ------------RIIMNYI-ESNS----YRVTLFECLKQLIVAGNALLYLPEPEGS-YNPMKLYRLSSYVVQRDAYGNVL 177 (535) T ss_pred ------------HHHHHHH-HhcC----cHHHHHHHHHHHHhhCceeEEeecCCCC-ceeeEEEEcCeeEEeeCCCCCee Confidence 1122233 2234 3345556677888899998877654432 22223333344444444433211 Q ss_pred E-----------------------------------EEEeccCcccc-----------------eeEEEcccceeeeccc Q lcl|NC_011801. 136 T-----------------------------------YTVHFDDSKRS-----------------GDFLYDSSEVIHFRCT 163 (386) Q Consensus 136 ~-----------------------------------~~~~~~~~~~~-----------------~~~~~~~~~vih~~~~ 163 (386) . |..-.....++ ....+..-+.+..||. T Consensus 178 ~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~g~~~~~~~~~~~~~~~P~i~~Rw~ 257 (535) T protein:vir:15 178 QIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHVYLDEESGDYLKYEEVEDVEIDGSDATYPTDAMPYIPVRMV 257 (535) T ss_pred EEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEEEEEEecCCCcEEEEEEeeCccccccccccccccCCceeeeee Confidence 1 00000000000 0001112233444443 Q ss_pred cccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHHHhcccccCcceec Q lcl|NC_011801. 164 VSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEEQTTGENAGRAVVL 243 (386) Q Consensus 164 ~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~~~~~~~~g~~~vl 243 (386) . ..+..||.||...+...+...+...+.......-...|..++. ++........ ..+.+..-+.-- T Consensus 258 ~---~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~-~~g~~~~~~l----------~~~~~g~~v~g~ 323 (535) T protein:vir:15 258 R---IDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVN-PAGITQPRRL----------TKAQTGDFVPGR 323 (535) T ss_pred e---cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec-ccccccchhc----------ccCCceeeecCC Confidence 2 2345799999999999999999999999999888888876653 3333333211 112111101112 Q ss_pred CCCceeeeccCChhhHH-HHHHHHHHHHHHHHHhCCCHHHhcCCcCcccHHH--HHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_011801. 244 DQSADVETTNISPNVTE-FLQNVSFSQDQIAKAFGIPADYLSGKQDAQSNIT--MIRAFYQSSLSIYIKPIESELSQKLG 320 (386) Q Consensus 244 ~~g~~~~~~~~~~~d~~-~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~~--~~~~~~~~~l~P~~~~ie~~l~~~l~ 320 (386) .+++...++...+ +.+ ..+..+..+..|-.+|-+.. +.......-+.+| .+..-....|.|....+.++|-.-|. T Consensus 324 ~~~v~~~~~~~~~-~~~~~~~~i~~~~~~I~~af~~~~-~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli 401 (535) T protein:vir:15 324 REDIDFLQLEKQA-DFTVAKAVSDQIEARLSYAFMLNS-AVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLV 401 (535) T ss_pred cccceeeeccccc-chhHHHHHHHHHHHHHHHHHhhhh-cccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHH Confidence 3334444444333 333 35667778888988885442 2111222222322 23444556677777777777654332 Q ss_pred hhh-----------hhcchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhccCCcCCC-CCCC-cccccc--ccCCCCC Q lcl|NC_011801. 321 TDV-----------KLDIASAIDSDNSELINNVQKLASAGVLAPIQAQKLLKNRGVFPE-LDLD-EGTNLL--DNTKNIN 385 (386) Q Consensus 321 ~~~-----------~fd~~~~l~~d~~~~~~~~~~~~~~g~~t~nE~R~~lg~~p~~p~-~~~~-~~~~~~--~~~~~~~ 385 (386) ..+ ++.- ..++...-+-...+.+... +-+.-..-..+.. +.|. .+.. -.++.+ .-...+. T Consensus 402 ~r~~~il~r~g~lP~~p~-~~v~~~yis~La~aqr~~~--~~~l~~~~~~la~--~~P~~ld~~id~d~~~~~~a~~~Gv 476 (535) T protein:vir:15 402 RVLLKQLQATSQIPELPK-EAVEPTISTGLEAIGRGQD--LDKLERCISAWAA--LAPMQGDPDINLAVIKLRIANAIGI 476 (535) T ss_pred HHHHHHHHhcCCCCCCCc-cceeEEEecHHHHHHHHHH--HHHHHHHHHHHHh--cChhhhhccCCHHHHHHHHHHHcCC Confidence 111 0100 1122222222222222211 1111111112221 1111 0000 000000 0000111 Q ss_pred C Q lcl|NC_011801. 386 D 386 (386) Q Consensus 386 ~ 386 (386) | T Consensus 477 p 477 (535) T protein:vir:15 477 D 477 (535) T ss_pred C Confidence 1 No 235 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=43.44 E-value=0.84 Score=20.99 Aligned_cols=351 Identities=13% Similarity=0.093 Sum_probs=150.0 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhcc------Cce-e--ecc---- Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAG------CRF-V--TNA---- 67 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~------~p~-~--~~~---- 67 (386) ...|..+- .-..+...+........ ......-+.+ +=+++-..|++.+|..+.+ -|| + +.+ T Consensus 21 e~~w~e~~--~~~lP~~~~~~~~~~~~--~~~~~~~~~~--i~dst~~~a~~~Las~L~~~ltPp~~~WF~l~~~d~~~~ 94 (547) T protein:vir:10 21 EQIWDCIR--KYIMPMRSDFFSDLRSE--GSINWNQNRE--VFDSTAGDGLETLSSSLHGSLTSPATKWFELAFRDKELN 94 (547) T ss_pred HHHHHHHH--HHhcccccccccCCCCC--cccccccccc--cccchHHHHHHHHHHHHHHhhcCCCCcccccccCCcccc Confidence 22222110 00111111110000000 0000000000 0123334555555544432 233 1 111 Q ss_pred -------------hhHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecC-CCceEEEEEEcCcceEEeecCCCc Q lcl|NC_011801. 68 -------------QPITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDT-NGYPVRIEPVPNEKVTVALDDYGK 133 (386) Q Consensus 68 -------------~~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~-~g~~~~l~~l~~~~v~~~~~~~~~ 133 (386) +-+...|. +-| .+.-+..++.+++.+|+|.+++..+. ......+..++...+-+..+..+. T Consensus 95 ~~~~v~~~L~~ve~~i~~~l~-~sn----f~~~~~~~~~~L~~~G~a~l~~~~d~~~~~~~r~~~~pl~~~~v~~d~~G~ 169 (547) T protein:vir:10 95 SDDECRKWLENATHDVYSALQ-DSN----FNLEANETYIDLCGYGNAIMVEEEDEDEEGSVVFQSSPIQDSYFEEDSRGQ 169 (547) T ss_pred chHHHHHHHHHHHHHHHHHHH-hcC----cHHHHHHHHHHHHhHCcEeEEeccCCCCCCceeEEEeecceEEEeeCCCcC Confidence 11222332 233 33345667889999999998887653 223445666666776666666554 Q ss_pred eeEEEE----------------------------------------ec----cCccc-----------c---eeEE---- Q lcl|NC_011801. 134 DLTYTV----------------------------------------HF----DDSKR-----------S---GDFL---- 151 (386) Q Consensus 134 ~~~~~~----------------------------------------~~----~~~~~-----------~---~~~~---- 151 (386) ...... +. .+... . ..+. T Consensus 170 v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~~~~~~~~~~~~~~~~~~~~p~~s~~~e~~ 249 (547) T protein:vir:10 170 VVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDKKQNRNAGTVLAPTERPFGKKWILKE 249 (547) T ss_pred eeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEeeccCCCCCccccceeeccccceeEEEEEec Confidence 321110 00 00000 0 0000 Q ss_pred ----------EcccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHH Q lcl|NC_011801. 152 ----------YDSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAK 221 (386) Q Consensus 152 ----------~~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~ 221 (386) |..-+.+.+||.. ..+..||.||...+...+...+...+.......-...|...+. ++....+ T Consensus 250 ~~~~~l~esg~~e~P~~~~Rw~~---~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~-~~g~~~~--- 322 (547) T protein:vir:10 250 GAVQLGEEGGYYEMPAYAIRWRK---SAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVIDPAIMVT-ERGLISD--- 322 (547) T ss_pred CceeeeecCCcccCCeeeeeeee---cCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceecc-ccccccc--- Confidence 1111233344432 2345799999999999999999999988888888777766543 2323221 Q ss_pred HHHHHHHHHHhcccccCcceecCCCceeeeccCChhhHHH-HHHHHHHHHHHHHHhCCCHHHhcCCcCcccHH-HHHHHH Q lcl|NC_011801. 222 ENTRQSFEEQTTGENAGRAVVLDQSADVETTNISPNVTEF-LQNVSFSQDQIAKAFGIPADYLSGKQDAQSNI-TMIRAF 299 (386) Q Consensus 222 ~~~k~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~-~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~~~-~~~~~~ 299 (386) ++. + .|++.+.++.-.++++...+ +.+. .+..+..+..|-.+|-+........+.-+..| .++..- T Consensus 323 ------~~~---~--pgg~~~~~~~~~v~pl~~~~-~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~TAtEV~~r~~E 390 (547) T protein:vir:10 323 ------IDL---G--ASGLTVVRDMESMKPFESRA-RFDVSSIQLTDLRSAVRRIYYVDQLQMKDSPAMTATEVQVRYEL 390 (547) T ss_pred ------cee---c--CCeeeecCCcccceeeeccc-chHHHHHHHHHHHHHHHHHhhhhhhhcCCCccccHHHHHHHHHH Confidence 111 1 23455666666777776553 4443 57788888899999987665443222222223 233455 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhh-h-hcchhhhccCHHHHHH---------HHHHHHhCCCcCHHHHHHHhccCCcCCC Q lcl|NC_011801. 300 YQSSLSIYIKPIESELSQKLGTDV-K-LDIASAIDSDNSELIN---------NVQKLASAGVLAPIQAQKLLKNRGVFPE 368 (386) Q Consensus 300 ~~~~l~P~~~~ie~~l~~~l~~~~-~-fd~~~~l~~d~~~~~~---------~~~~~~~~g~~t~nE~R~~lg~~p~~p~ 368 (386) ....|.|....+.++|-.-|...+ . ..-.+.+..-+++..+ .+..+-++. -..++-......... T Consensus 391 ~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~~~~~~v~~is~Laraq--~~~~~~~i~~~~~~v-- 466 (547) T protein:vir:10 391 MQRLLGPTLGRLENDFLSPMIQRTFNIRFRAGKLGELPSKLLESGKAAMDIVYTGPLSRAQ--KIDQAASIERWAGST-- 466 (547) T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhhccCcceEEEEeccHHHHHH--HHHHHHHHHHHHHHH-- Confidence 667788888888877654332211 1 0111111111111110 000000000 001121111110000 Q ss_pred CCCCccccccccCCCCCC Q lcl|NC_011801. 369 LDLDEGTNLLDNTKNIND 386 (386) Q Consensus 369 ~~~~~~~~~~~~~~~~~~ 386 (386) ....+. +|=....-+-| T Consensus 467 ~~laq~-~P~vld~id~d 483 (547) T protein:vir:10 467 AQLAEI-NPEVLDIPDWD 483 (547) T ss_pred HHhhcc-ChhhhhcCCHH Confidence 000000 00000011112 No 236 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=39.93 E-value=0.99 Score=20.60 Aligned_cols=365 Identities=9% Similarity=0.056 Sum_probs=153.1 Q ss_pred Cchhhhhccc-------cccCCccchhhhh-------------hccccc----ccCccc------cc-HHHHhccHHHHH Q lcl|NC_011801. 1 MAFLSNLFKR-------QKMLSGSSPVWIL-------------NQGQPV----SIKPKA------IT-SAIALKNSDVYA 49 (386) Q Consensus 1 Mg~~~~l~~~-------~~~~~~~~~~~~~-------------~~~~~~----~~~~~~------i~-~~~a~~~~~v~~ 49 (386) +.+|.+--+. ++..+...|.... ..+... ...+.. |. -+....+|.|.. T Consensus 13 ~~~~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~y~~~e~~~~~~~eLI~~YR~ma~~pEvd~ 92 (524) T protein:vir:98 13 FKNFAREDEIELEQQLKNDTGSVAPPKNNDGAYEIETDLNNQKYAGVFQQFYSGQDPAIQNKEQLINTYRGIMSYPEVEN 92 (524) T ss_pred hhhhhhhhhhhHhhhhcCCcccccCCCCCCCceeecCCCCcceecceeeeeccccccccchHHHHHHHHHHHhhccchhh Confidence 8998753111 1111111111000 000000 001100 11 122356899999 Q ss_pred HHHHHHHhhccC-----ceeec--chh------------HHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCC Q lcl|NC_011801. 50 VISRVSSDIAGC-----RFVTN--AQP------------ITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTN 110 (386) Q Consensus 50 ~v~~ia~~ia~~-----p~~~~--~~~------------~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~ 110 (386) ||+.|.+.+.-. |+.+. ..+ ..++|+. -+-...+++ .+..|...|..|..++.+.+ T Consensus 93 Av~eIVneaIv~~~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~l-l~F~~~~~~----~fR~WYVDgRi~fhkiid~~ 167 (524) T protein:vir:98 93 AVSEIIDDAIVNEQGKDIITMDLAKTNFSKAIQDKIVEEFDNVLNI-YDFDNMGAR----LFRDWYVDSRIYFHKIMHKD 167 (524) T ss_pred HHHhhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHH-hccchhhhH----HHhhhhhcceeEEEEEEcCC Confidence 999998876422 33322 111 1122211 233333433 45566778999998886544 Q ss_pred Cc--eEEEEEEcCcceEEeecC------CC------ceeEEEEeccC---------cccceeEEEcccceeeeccccccC Q lcl|NC_011801. 111 GY--PVRIEPVPNEKVTVALDD------YG------KDLTYTVHFDD---------SKRSGDFLYDSSEVIHFRCTVSGE 167 (386) Q Consensus 111 g~--~~~l~~l~~~~v~~~~~~------~~------~~~~~~~~~~~---------~~~~~~~~~~~~~vih~~~~~~~~ 167 (386) .. ..+|..|+|..++.++.. .+ ...+|.|.... ...+..+.++.+.|.|....... T Consensus 168 ~~kGI~ELr~lDPr~i~~vr~~~~~~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~dAIvy~hSGL~d- 246 (524) T protein:vir:98 168 ESKGIRELRQLDPRCMELIRESITETLDGGVKVFRGYREFFVYSAPKAGYTYNGQIYQANQKIKIPRSAIVYAHSGLED- 246 (524) T ss_pred CCcceeeeeeeCCccceeeeeccccccccchhhccceeeeeeeccCCCccccccceecCCCceeechhheeeeccCccc- Confidence 33 789999999999765311 11 11122222100 12334467888888886522211 Q ss_pred cccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHH-HHHHHHHHh--------cc--cc Q lcl|NC_011801. 168 SDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKEN-TRQSFEEQT--------TG--EN 236 (386) Q Consensus 168 ~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~-~k~~~~~~~--------~~--~~ 236 (386) ...++ +|-+..|.+.+.....++...-=+=-.-+.-+-++.+.-+.+.+..+++ +++...+.. .| .+ T Consensus 247 ~~~~i--isyLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGevrd 324 (524) T protein:vir:98 247 CSNNI--IGYLHRAVKPANQLRLLEDAMVIYRITRAPERRVFYIDVGQMGGNKATQYVNNIAQGLKNRVVYDARTGTVKN 324 (524) T ss_pred CCCCe--eeehhHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeeccCceeec Confidence 11111 2444444444444333333332222122333344443323343333333 333333321 11 11 Q ss_pred cCc-ceec----------CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCccc---HHHHH--HHHH Q lcl|NC_011801. 237 AGR-AVVL----------DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQS---NITMI--RAFY 300 (386) Q Consensus 237 ~g~-~~vl----------~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~---~~~~~--~~~~ 300 (386) ..+ +..+ +.|.++++|.-..+ +.-++-.++.++.+..+++||.+=|...+.+-+ +.+-. ..=+ T Consensus 325 drk~msMlEDyWLpRReGgrgTEItTLpggqn-lgem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItRDEiKF 403 (524) T protein:vir:98 325 QQNNLSMTEDYWLMRRDGKAITEVSTLPGGQN-FSDMDDIKWFNRKLYEALRVPLSRMPRDDGGMQIGGGGEITRDELKF 403 (524) T ss_pred cccccchhhhhcccccCCCCccceeeccccCC-cChHHHHHHHHHHHHHHhCCCceeccCCCCccccccccchhHHHHHH Confidence 111 2222 23667777764432 223456678889999999999998864321111 11111 1112 Q ss_pred HHHHHHHHHHHHHHHHHhhh------------------hhhhhc--chhhhcc-----CHHHHHHHHHHHHh--CCCc-- Q lcl|NC_011801. 301 QSSLSIYIKPIESELSQKLG------------------TDVKLD--IASAIDS-----DNSELINNVQKLAS--AGVL-- 351 (386) Q Consensus 301 ~~~l~P~~~~ie~~l~~~l~------------------~~~~fd--~~~~l~~-----d~~~~~~~~~~~~~--~g~~-- 351 (386) ..-|.-+...|...|...|- ..++|+ .+..... =+..|+.+++.+-. +.++ T Consensus 404 ~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~ 483 (524) T protein:vir:98 404 SKFIRTLQIQFSPVLSDPLKTNLIAKKIITEDEWEENVSKISFVFQQDSYYAEVKDIEILERRLNLMSQVEGVVGKYVSH 483 (524) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhccccccccch Confidence 23344455555554444331 112222 2211100 11223333332211 1122 Q ss_pred ----------CHHHHHH---Hhc---cCCcC--CCCCCCcc Q lcl|NC_011801. 352 ----------APIQAQK---LLK---NRGVF--PELDLDEG 374 (386) Q Consensus 352 ----------t~nE~R~---~lg---~~p~~--p~~~~~~~ 374 (386) |=.|+-+ .+. +.|.. |..+.+++ T Consensus 484 dyi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~p~~e~~~f 524 (524) T protein:vir:98 484 KYIMKEILRMSDEDIDEQAKLIEEESKEERFKNPEAEEENF 524 (524) T ss_pred HHHHHHHhccCHHHHHHHHHHHHHHHhCCCCcCCccccccC Confidence 3333322 111 12332 23333333 No 237 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=37.30 E-value=1.1 Score=20.30 Aligned_cols=374 Identities=10% Similarity=0.054 Sum_probs=152.4 Q ss_pred Cchhhhhcccc--------------ccCCccchhh----------------hhhcccccccCccccc-------HHHHhc Q lcl|NC_011801. 1 MAFLSNLFKRQ--------------KMLSGSSPVW----------------ILNQGQPVSIKPKAIT-------SAIALK 43 (386) Q Consensus 1 Mg~~~~l~~~~--------------~~~~~~~~~~----------------~~~~~~~~~~~~~~i~-------~~~a~~ 43 (386) |++++ +|+-+ +..+...|.. ....++.....+..-+ -+.... T Consensus 1 ~~~~~-lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~ 79 (516) T protein:vir:10 1 MKFLD-LFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLIN 79 (516) T ss_pred CCchH-hcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhh Confidence 55543 22221 1111111110 0111111111111111 122357 Q ss_pred cHHHHHHHHHHHHhhccC-----ceeec--chh------------HHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEE Q lcl|NC_011801. 44 NSDVYAVISRVSSDIAGC-----RFVTN--AQP------------ITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAI 104 (386) Q Consensus 44 ~~~v~~~v~~ia~~ia~~-----p~~~~--~~~------------~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~ 104 (386) +|.|..||+.|.+.+.-. |+.+. +.+ ..++|+. -+-...+++ .+..|...|..|.. T Consensus 80 ~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~l-l~F~~~~~~----~fR~WYVDgRi~fh 154 (516) T protein:vir:10 80 NPEVERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRL-LDASRKLDT----LFRRWYVDSRIFFH 154 (516) T ss_pred ccchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHH-hccchhhhH----HHhhhhhcceEEEE Confidence 899999999998876532 33322 111 1122221 233334444 44556677888887 Q ss_pred Eeec-CCCceEEEEEEcCcceEEeecC-----CCce------eEEEEeccC---------cccceeEEEcccceeeeccc Q lcl|NC_011801. 105 IDRD-TNGYPVRIEPVPNEKVTVALDD-----YGKD------LTYTVHFDD---------SKRSGDFLYDSSEVIHFRCT 163 (386) Q Consensus 105 ~~~~-~~g~~~~l~~l~~~~v~~~~~~-----~~~~------~~~~~~~~~---------~~~~~~~~~~~~~vih~~~~ 163 (386) ++.+ ......+|..|+|..++.++.- .+.. .+|.|...+ ...+..+.++.+-|.|.. . T Consensus 155 Kiid~~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~h-S 233 (516) T protein:vir:10 155 KIMPNPKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSAVVYAS-S 233 (516) T ss_pred EEecCccccceeeeeeCCcceeeEeeecccccccchhhhhhhheeeeccCccccccccceeCCCcceeechhheeeec-c Confidence 5544 3455789999999998765322 1111 122222111 122234556666655443 1 Q ss_pred cccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHH-HHHHHHHHhc----ccccC Q lcl|NC_011801. 164 VSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKEN-TRQSFEEQTT----GENAG 238 (386) Q Consensus 164 ~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~-~k~~~~~~~~----~~~~g 238 (386) -.-+..++. =+|-+..|.+.+.....++...-=+=-.-+.-+-++.+.-+.+.+..+++ +++...+... ..++| T Consensus 234 GL~d~~~~~-i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TG 312 (516) T protein:vir:10 234 GLMDCSDRG-IIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTG 312 (516) T ss_pred cceeCCCCc-eeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCC Confidence 111112222 24555555555544444444333222222333344433323333332322 3333222110 01122 Q ss_pred ------c-ceec----------CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcc----cHHHHHH Q lcl|NC_011801. 239 ------R-AVVL----------DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQ----SNITMIR 297 (386) Q Consensus 239 ------~-~~vl----------~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~----~~~~~~~ 297 (386) + +..+ +.|.++++|.-..+ +.-++-.++.++.+..+++||.+=|...+..+ .+.+-.+ T Consensus 313 ev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqn-lgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItR 391 (516) T protein:vir:10 313 TVKNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQT-MGDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITR 391 (516) T ss_pred eeccchhhhhhHhhhcccccCCCCccceeeccccCC-cChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhH Confidence 1 1122 23667777654322 22345667888999999999999986544321 2222221 Q ss_pred H--HHHHHHHHHHHHHHHHHHHhhh------------------hhhhhc--chhhhc----c-CHHHHHHHHHHHH--hC Q lcl|NC_011801. 298 A--FYQSSLSIYIKPIESELSQKLG------------------TDVKLD--IASAID----S-DNSELINNVQKLA--SA 348 (386) Q Consensus 298 ~--~~~~~l~P~~~~ie~~l~~~l~------------------~~~~fd--~~~~l~----~-d~~~~~~~~~~~~--~~ 348 (386) . =+..-|.-+...|...|...|- ..++|+ .+.... . =+..|+.+++.+- -+ T Consensus 392 DEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvG 471 (516) T protein:vir:10 392 DELDFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVG 471 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhc Confidence 1 1233355555555555544331 112222 221110 0 1233444443332 23 Q ss_pred CCcCHHHHHHHhccCCcCCCCCCC--------cccccc-ccCCCCCC Q lcl|NC_011801. 349 GVLAPIQAQKLLKNRGVFPELDLD--------EGTNLL-DNTKNIND 386 (386) Q Consensus 349 g~~t~nE~R~~lg~~p~~p~~~~~--------~~~~~~-~~~~~~~~ 386 (386) ++++.+=+|+.+=.. .+++.. |..+++ ..+..-.| T Consensus 472 ky~s~~yi~k~ILr~---tDeei~~e~k~I~~E~~~~~~~~p~~~~~ 515 (516) T protein:vir:10 472 KYVSHDYVMKNILQM---TEEQIAQEEKQIEQEAGIKRFQNPENEDD 515 (516) T ss_pred cccchHHHHHHHhcC---CHhhHHHHHHHHHHhhhCCCCCCCCcccc Confidence 455544444322100 000000 111111 11111111 No 238 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=37.30 E-value=1.1 Score=20.30 Aligned_cols=374 Identities=10% Similarity=0.054 Sum_probs=152.4 Q ss_pred Cchhhhhcccc--------------ccCCccchhh----------------hhhcccccccCccccc-------HHHHhc Q lcl|NC_011801. 1 MAFLSNLFKRQ--------------KMLSGSSPVW----------------ILNQGQPVSIKPKAIT-------SAIALK 43 (386) Q Consensus 1 Mg~~~~l~~~~--------------~~~~~~~~~~----------------~~~~~~~~~~~~~~i~-------~~~a~~ 43 (386) |++++ +|+-+ +..+...|.. ....++.....+..-+ -+.... T Consensus 1 ~~~~~-lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~ 79 (516) T protein:vir:10 1 MKFLD-LFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLIN 79 (516) T ss_pred CCchH-hcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhh Confidence 55543 22221 1111111110 0111111111111111 122357 Q ss_pred cHHHHHHHHHHHHhhccC-----ceeec--chh------------HHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEE Q lcl|NC_011801. 44 NSDVYAVISRVSSDIAGC-----RFVTN--AQP------------ITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAI 104 (386) Q Consensus 44 ~~~v~~~v~~ia~~ia~~-----p~~~~--~~~------------~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~ 104 (386) +|.|..||+.|.+.+.-. |+.+. +.+ ..++|+. -+-...+++ .+..|...|..|.. T Consensus 80 ~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~l-l~F~~~~~~----~fR~WYVDgRi~fh 154 (516) T protein:vir:10 80 NPEVERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRL-LDASRKLDT----LFRRWYVDSRIFFH 154 (516) T ss_pred ccchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHH-hccchhhhH----HHhhhhhcceEEEE Confidence 899999999998876532 33322 111 1122221 233334444 44556677888887 Q ss_pred Eeec-CCCceEEEEEEcCcceEEeecC-----CCce------eEEEEeccC---------cccceeEEEcccceeeeccc Q lcl|NC_011801. 105 IDRD-TNGYPVRIEPVPNEKVTVALDD-----YGKD------LTYTVHFDD---------SKRSGDFLYDSSEVIHFRCT 163 (386) Q Consensus 105 ~~~~-~~g~~~~l~~l~~~~v~~~~~~-----~~~~------~~~~~~~~~---------~~~~~~~~~~~~~vih~~~~ 163 (386) ++.+ ......+|..|+|..++.++.- .+.. .+|.|...+ ...+..+.++.+-|.|.. . T Consensus 155 Kiid~~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~h-S 233 (516) T protein:vir:10 155 KIMPNPKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSAVVYAS-S 233 (516) T ss_pred EEecCccccceeeeeeCCcceeeEeeecccccccchhhhhhhheeeeccCccccccccceeCCCcceeechhheeeec-c Confidence 5544 3455789999999998765322 1111 122222111 122234556666655443 1 Q ss_pred cccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHH-HHHHHHHHhc----ccccC Q lcl|NC_011801. 164 VSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKEN-TRQSFEEQTT----GENAG 238 (386) Q Consensus 164 ~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~-~k~~~~~~~~----~~~~g 238 (386) -.-+..++. =+|-+..|.+.+.....++...-=+=-.-+.-+-++.+.-+.+.+..+++ +++...+... ..++| T Consensus 234 GL~d~~~~~-i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TG 312 (516) T protein:vir:10 234 GLMDCSDRG-IIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTG 312 (516) T ss_pred cceeCCCCc-eeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCC Confidence 111112222 24555555555544444444333222222333344433323333332322 3333222110 01122 Q ss_pred ------c-ceec----------CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcc----cHHHHHH Q lcl|NC_011801. 239 ------R-AVVL----------DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQ----SNITMIR 297 (386) Q Consensus 239 ------~-~~vl----------~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~----~~~~~~~ 297 (386) + +..+ +.|.++++|.-..+ +.-++-.++.++.+..+++||.+=|...+..+ .+.+-.+ T Consensus 313 ev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqn-lgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItR 391 (516) T protein:vir:10 313 TVKNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQT-MGDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITR 391 (516) T ss_pred eeccchhhhhhHhhhcccccCCCCccceeeccccCC-cChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhH Confidence 1 1122 23667777654322 22345667888999999999999986544321 2222221 Q ss_pred H--HHHHHHHHHHHHHHHHHHHhhh------------------hhhhhc--chhhhc----c-CHHHHHHHHHHHH--hC Q lcl|NC_011801. 298 A--FYQSSLSIYIKPIESELSQKLG------------------TDVKLD--IASAID----S-DNSELINNVQKLA--SA 348 (386) Q Consensus 298 ~--~~~~~l~P~~~~ie~~l~~~l~------------------~~~~fd--~~~~l~----~-d~~~~~~~~~~~~--~~ 348 (386) . =+..-|.-+...|...|...|- ..++|+ .+.... . =+..|+.+++.+- -+ T Consensus 392 DEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvG 471 (516) T protein:vir:10 392 DELDFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVG 471 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhc Confidence 1 1233355555555555544331 112222 221110 0 1233444443332 23 Q ss_pred CCcCHHHHHHHhccCCcCCCCCCC--------cccccc-ccCCCCCC Q lcl|NC_011801. 349 GVLAPIQAQKLLKNRGVFPELDLD--------EGTNLL-DNTKNIND 386 (386) Q Consensus 349 g~~t~nE~R~~lg~~p~~p~~~~~--------~~~~~~-~~~~~~~~ 386 (386) ++++.+=+|+.+=.. .+++.. |..+++ ..+..-.| T Consensus 472 ky~s~~yi~k~ILr~---tDeei~~e~k~I~~E~~~~~~~~p~~~~~ 515 (516) T protein:vir:10 472 KYVSHDYVMKNILQM---TEEQIAQEEKQIEQEAGIKRFQNPENEDD 515 (516) T ss_pred cccchHHHHHHHhcC---CHhhHHHHHHHHHHhhhCCCCCCCCcccc Confidence 455544444322100 000000 111111 11111111 No 239 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=34.59 E-value=1.3 Score=19.99 Aligned_cols=377 Identities=13% Similarity=0.147 Sum_probs=149.0 Q ss_pred Cc--hhh-hhccccccCCc-----------cchhhh-hhcccccccCccc------ccH-HHHhccHHHHHHHHHHHHhh Q lcl|NC_011801. 1 MA--FLS-NLFKRQKMLSG-----------SSPVWI-LNQGQPVSIKPKA------ITS-AIALKNSDVYAVISRVSSDI 58 (386) Q Consensus 1 Mg--~~~-~l~~~~~~~~~-----------~~~~~~-~~~~~~~~~~~~~------i~~-~~a~~~~~v~~~v~~ia~~i 58 (386) |. +|- .+.+.+..... +.+... ...+......+.. |+. +....+|.|..||+.|.+.+ T Consensus 1 ~~~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~g~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVnea 80 (537) T protein:vir:10 1 MAQQLFGFSLQRAKKVPKGPSFVQKDSLDGSQPIVGGGYFGYSVDFDGTIRNDHELITRYREMVLNPECDSAVDDVVNET 80 (537) T ss_pred CccccccceeecccccccCCcccCCCcccccceeecccccccccccccccchHHHHHHHHHHHhhccchhhHHHHhhcce Confidence 32 110 02111111100 111111 1222222222222 111 23456899999999998876 Q ss_pred ccC-----ceeec--ch------------hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecC---CCceEEE Q lcl|NC_011801. 59 AGC-----RFVTN--AQ------------PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDT---NGYPVRI 116 (386) Q Consensus 59 a~~-----p~~~~--~~------------~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~---~g~~~~l 116 (386) .-. |+.+. +- ...++|+. -+-...+++ .+..|...|..|..++-|. .....+| T Consensus 81 iv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~l-l~F~~~~~e----~fR~WYVDgRi~fhKiid~k~pk~GI~EL 155 (537) T protein:vir:10 81 ICGNFDDVPISIDLHNLKQSEKIKKLIRSEFDEILRL-LDFDNRAYE----IFRRWYVDGRLFFHKVIDPKKPRQGLVEL 155 (537) T ss_pred eEecCCCceEEEEecccccchHHHHHHHHHHHHHHHH-hccchhhhH----HHhhheeeeEEEEEEEEeCCCccccceee Confidence 532 33222 11 11122211 233333443 4556677888888886653 3357899 Q ss_pred EEEcCcceEEeecC---C--Cce------------e-EEEEec--cCcccceeEEEcccceeeeccccccCccccccccc Q lcl|NC_011801. 117 EPVPNEKVTVALDD---Y--GKD------------L-TYTVHF--DDSKRSGDFLYDSSEVIHFRCTVSGESDTQYMGIP 176 (386) Q Consensus 117 ~~l~~~~v~~~~~~---~--~~~------------~-~~~~~~--~~~~~~~~~~~~~~~vih~~~~~~~~~~~~~~G~s 176 (386) ..|+|..++.++.- . ... . +|.|.. .....+..+.++.+-| ++-+.-.-+ .++-+.+| T Consensus 156 r~lDPr~i~~vR~i~~~~~~~~~~~~~~~~v~~~~~eyf~ynp~g~~~~~~~~vkI~~dAI-~y~hSGl~d-~n~~~i~s 233 (537) T protein:vir:10 156 RYVDPRKIRKVTEYEAKRPEALRTQDLNQQLTQQSASYFLYNPKGLKNSTNQGMKIAPDSI-AYCHSGIQD-LNKNMVLS 233 (537) T ss_pred eeeCCccceeeEeecccCCccceEEecceeeeecccceeeeccccccccCCCceeccHhhe-eeeccccee-CCCCeeee Confidence 99999998654431 1 111 0 111110 0112233455666444 333211111 22345667 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHH-HHHHHHHHhc----ccccC------c-ceec- Q lcl|NC_011801. 177 PIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKEN-TRQSFEEQTT----GENAG------R-AVVL- 243 (386) Q Consensus 177 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~-~k~~~~~~~~----~~~~g------~-~~vl- 243 (386) -+..|.+.+.....++...-=+=-.-+.-+-++.+.-+.+.+..+++ ++....+... ...+| + +..+ T Consensus 234 yLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlE 313 (537) T protein:vir:10 234 HLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLE 313 (537) T ss_pred eehhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhhh Confidence 67777666665555554443332222333344443323333333322 3332222110 01111 1 1122 Q ss_pred ---------CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCccc--HHHHHH--HHHHHHHHHHHHH Q lcl|NC_011801. 244 ---------DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQS--NITMIR--AFYQSSLSIYIKP 310 (386) Q Consensus 244 ---------~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~~--~~~~~~--~~~~~~l~P~~~~ 310 (386) +.|.++++|.-..+ +.-++-.++.++.+..+++||.+-|...+..+. +.+-.+ .=+..-|.-+... T Consensus 314 DyWLPRReGgrgTEItTLpGgqn-lgem~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~r 392 (537) T protein:vir:10 314 DFWLPRREGGRGTEISTLPGGQN-LGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKR 392 (537) T ss_pred hhcccccCCCcccceeeccccCC-cChHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHH Confidence 23667777654322 223456678889999999999998864432111 111111 1122234444444 Q ss_pred HHHHHHHhhh------------------hhhhhc--ch-------------------------------------hhhcc Q lcl|NC_011801. 311 IESELSQKLG------------------TDVKLD--IA-------------------------------------SAIDS 333 (386) Q Consensus 311 ie~~l~~~l~------------------~~~~fd--~~-------------------------------------~~l~~ 333 (386) |...|...|- ..++|+ .+ .+|+. T Consensus 393 Fs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~dyi~k~ILr~ 472 (537) T protein:vir:10 393 FSELFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSANYIRTKVLKQ 472 (537) T ss_pred HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccchHHHHHHHhcc Confidence 4444443321 011111 11 11221 Q ss_pred CH---HHHHHHHHHHHhCCCc-CHHHHHHHhc-----cCCcCCCCCCCccc----cccccCCCCCC Q lcl|NC_011801. 334 DN---SELINNVQKLASAGVL-APIQAQKLLK-----NRGVFPELDLDEGT----NLLDNTKNIND 386 (386) Q Consensus 334 d~---~~~~~~~~~~~~~g~~-t~nE~R~~lg-----~~p~~p~~~~~~~~----~~~~~~~~~~~ 386 (386) .- ++..+.+++-...|.+ .|+|.-+ ++ ..|++|++..++-+ ..-..+++++= T Consensus 473 tDeeI~~~~k~I~~E~k~~~~~~p~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 537 (537) T protein:vir:10 473 TESEIKEIDKEIKQEIADGVIMDPQAMQA-MEMGIGDEEPVPEGGEEPQTDPNSAVSPADQKRGEL 537 (537) T ss_pred CHHHHHHHHHHHHHHhhCCCCCCcccccc-cccCCCCcccCCCCCCCcccCCccCCCCCCccCCCC Confidence 11 1112222222233332 1211111 11 11222222211100 00011111111 No 240 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=25.85 E-value=2 Score=18.92 Aligned_cols=368 Identities=11% Similarity=0.046 Sum_probs=151.4 Q ss_pred Cchhhhhccc-------cccCCccch--------hhh----------hhcccccccCccc------cc-HHHHhccHHHH Q lcl|NC_011801. 1 MAFLSNLFKR-------QKMLSGSSP--------VWI----------LNQGQPVSIKPKA------IT-SAIALKNSDVY 48 (386) Q Consensus 1 Mg~~~~l~~~-------~~~~~~~~~--------~~~----------~~~~~~~~~~~~~------i~-~~~a~~~~~v~ 48 (386) |.+|.+.-.. .+..+...| +.. ..........+.. |. -+....+|.|. T Consensus 8 ~~~~~~~d~~~~~e~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~~~g~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd 87 (521) T protein:vir:65 8 LARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNSPASSWNSLTQQFYSTDQKISTTKQLVNTYRGLMNNHEVE 87 (521) T ss_pred hhhccCchhhHHHhhhccCCCcccCCCCCCCceeecccCCccccccccceeeeccccchhhhHHHHHHHHHHHhhccchh Confidence 6666432110 011111111 100 0000111111111 11 11235689999 Q ss_pred HHHHHHHHhhccC-----ceeec--ch------------hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecC Q lcl|NC_011801. 49 AVISRVSSDIAGC-----RFVTN--AQ------------PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDT 109 (386) Q Consensus 49 ~~v~~ia~~ia~~-----p~~~~--~~------------~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~ 109 (386) .||+.|.+.+.-. |+.+. +. ...++|+. -+-...+++ .+..|...|..|..++.+. T Consensus 88 ~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~l-l~F~~~~~~----~fR~WYVDgRi~fhkiid~ 162 (521) T protein:vir:65 88 NAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNT-IQFDRRGQD----MFRRWYVDSRIFFHKIIGK 162 (521) T ss_pred hHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHH-hccchhhhH----HHhhhhhcceeEEEEEEcC Confidence 9999998876532 33322 11 11122211 233333433 4556677899999888553 Q ss_pred C--CceEEEEEEcCcceEEeecCCC-----------ceeEEEEeccC---------cccceeEEEcccceeeeccccccC Q lcl|NC_011801. 110 N--GYPVRIEPVPNEKVTVALDDYG-----------KDLTYTVHFDD---------SKRSGDFLYDSSEVIHFRCTVSGE 167 (386) Q Consensus 110 ~--g~~~~l~~l~~~~v~~~~~~~~-----------~~~~~~~~~~~---------~~~~~~~~~~~~~vih~~~~~~~~ 167 (386) + ....+|..|+|..++.++.-.. ...+|.|...+ ...+..+.++.+-|.+. +... . T Consensus 163 ~pk~GI~ELr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI~y~-hSGl-~ 240 (521) T protein:vir:65 163 NPKDGIVELRQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAITYA-HSGL-M 240 (521) T ss_pred CccccceeeeeeCCcceeeeeeecccccCCcceecceeeeeeeecCCcceeccceeecCCcceeechhheeee-eccc-e Confidence 3 4578999999999876532211 11223332111 12223344555555433 2221 1 Q ss_pred cccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHH-HHHHHHHHh--------ccc--c Q lcl|NC_011801. 168 SDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKEN-TRQSFEEQT--------TGE--N 236 (386) Q Consensus 168 ~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~-~k~~~~~~~--------~~~--~ 236 (386) ..++-.=+|-+..|.+.+.....++...-=+=-.-+.-+-++.+.-+.+.+..+++ +++...+.. .|. + T Consensus 241 d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~d 320 (521) T protein:vir:65 241 DCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKN 320 (521) T ss_pred eCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccc Confidence 12222234555555555544444444333222222333344443323333333333 333333221 111 1 Q ss_pred cCc-ceec----------CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcc----cHHHHHH--HH Q lcl|NC_011801. 237 AGR-AVVL----------DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQ----SNITMIR--AF 299 (386) Q Consensus 237 ~g~-~~vl----------~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~----~~~~~~~--~~ 299 (386) ..+ +..+ +.|.++++|.-..+ +.-++-.++.++.+..+++||.+=+...+.+. ...+-.+ .= T Consensus 321 drk~msMlEDyWLpRReGgrgTEItTLpGgqn-lgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr~~EItRDEiK 399 (521) T protein:vir:65 321 QQANLSMTEDYWLQRRDGKAITDVTTLPGASG-MSDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELE 399 (521) T ss_pred cccccchhhhhcccccCCCCccceeecccCCC-cChHHHHHHHHHHHHHHhCCCceeccCCCCcceeccccchhhHHHHH Confidence 111 2222 23667777654322 22245667888999999999999875443321 1112111 11 Q ss_pred HHHHHHHHHHHHHHHHHHhhh------------------hhhhhc--chhhhcc-----CHHHHHHHHHHHHh--CCC-- Q lcl|NC_011801. 300 YQSSLSIYIKPIESELSQKLG------------------TDVKLD--IASAIDS-----DNSELINNVQKLAS--AGV-- 350 (386) Q Consensus 300 ~~~~l~P~~~~ie~~l~~~l~------------------~~~~fd--~~~~l~~-----d~~~~~~~~~~~~~--~g~-- 350 (386) +..-|.-+...|...|...|- ..++|+ .+..... =+..|+.+++.+-- +.. T Consensus 400 F~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~S 479 (521) T protein:vir:65 400 FSKFIRTLQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFS 479 (521) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 223345555555555544331 112222 2211100 11223333332210 111 Q ss_pred ----------cCHHHHHHH------hccCCcCCCCCCCcccc Q lcl|NC_011801. 351 ----------LAPIQAQKL------LKNRGVFPELDLDEGTN 376 (386) Q Consensus 351 ----------~t~nE~R~~------lg~~p~~p~~~~~~~~~ 376 (386) ||=.|+-++ +.++|+.+.++.++.+= T Consensus 480 ~dyi~k~ILr~tDeei~~~~k~I~~E~~~~~~~~p~~~~~~f 521 (521) T protein:vir:65 480 NQTVMRDILKYTDDQMDTEKKQIEEEANDPRFKQTPDEIEDF 521 (521) T ss_pred hHHHHHHHhccCHHHHHHHHHHHHHhhhCCCCCCCcccccCC Confidence 233333221 12344433333222111 No 241 >protein:vir:101418 Length: 569 # NCBI annotation: Prt # Family: family:all:9458 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006480;genbank:gi:46401636;genbank:GeneID:2777482 Probab=24.89 E-value=2.1 Score=18.79 Aligned_cols=375 Identities=12% Similarity=0.038 Sum_probs=138.9 Q ss_pred Cchhhhhcccc-ccCCccchhhhhh--ccc--ccccCcccc--cHHHHhccHHHHHHHHHHHHhh-cc----------Cc Q lcl|NC_011801. 1 MAFLSNLFKRQ-KMLSGSSPVWILN--QGQ--PVSIKPKAI--TSAIALKNSDVYAVISRVSSDI-AG----------CR 62 (386) Q Consensus 1 Mg~~~~l~~~~-~~~~~~~~~~~~~--~~~--~~~~~~~~i--~~~~a~~~~~v~~~v~~ia~~i-a~----------~p 62 (386) |+++- +++ ..+...+...... .+. ..+..+..+ -.+.....|.+.+++++--... ++ .| T Consensus 53 ~~~~~---~~~~~~t~~~D~~~~g~~~~~~~~~~pr~R~qiY~~~eeM~~~p~Ia~AlniHVtaALggde~TGd~vfI~p 129 (569) T protein:vir:10 53 SGFLG---GKPGDSGMAGDGLVDGSRFIFDEVQLPEDRLQRYPLLEEMAVYSTIATALNIHITHALSFDKKTGQTFSIVP 129 (569) T ss_pred hhhhc---cCccccchhhhhHHHHHHHHhhhccCchhHHHHHHHHHHHhcCchhhhhhhhhhheeecccccccceEEEEe Confidence 33331 111 1111111111110 000 111111111 1223445677778877754431 11 12 Q ss_pred eeec---chhHHHHHhccC-c---ccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEE---EEcCcceEEeecCCC Q lcl|NC_011801. 63 FVTN---AQPITDVLNAPL-G---NLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIE---PVPNEKVTVALDDYG 132 (386) Q Consensus 63 ~~~~---~~~~~~~l~~~P-N---~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~---~l~~~~v~~~~~~~~ 132 (386) .+-- +.+..+.+...- | +..+ .-...+..++..+|.+|+.|--+..-.++.|+ +.-|.-++...-.+. T Consensus 130 ~~~~~~a~~daakai~~el~~dl~~~iN--r~~~~lA~~~~aFGdsYaRiY~~~~~GV~dl~~s~yt~PsfIqpFE~g~~ 207 (569) T protein:vir:10 130 VHNGNDSDYDAAQALCGELMNDIGRTIN--KEVAGWAFIMSVFGVAYVRPYAKEGIGITSFECSYYTLPSFIKEFEVSGN 207 (569) T ss_pred ecCCCCCcchHHHHHHHHHHHHHHHHHH--HHhhHHHHHHHhhhhhheeeeccCCceeEEEEecccccccccchhhhcCc Confidence 1110 112221111100 0 0000 13456788899999999988655443344443 233444444322222 Q ss_pred ceeEEEEeccCcccceeEEEcccc-------------------eeeeccccccCcc------cccccccHHHHHHHHHHH Q lcl|NC_011801. 133 KDLTYTVHFDDSKRSGDFLYDSSE-------------------VIHFRCTVSGESD------TQYMGIPPIDSLLNEIEV 187 (386) Q Consensus 133 ~~~~~~~~~~~~~~~~~~~~~~~~-------------------vih~~~~~~~~~~------~~~~G~s~~~~~~~~i~~ 187 (386) ..-+......+.. .......+.. ..|..+....+.. -..+|-|-+..+.+...- T Consensus 208 tvGF~~~~~~~~~-~ti~~l~p~qm~rmKmPrm~~i~q~~~v~~g~~~~~L~~d~~~~~Pi~psn~GgSFL~~ae~pf~~ 286 (569) T protein:vir:10 208 LAGFSGDYLKDAS-GKMVFADPWAIIPMKIPYWRPKSNLMPVHTGHKAYSLLDNPEERTPIETQNYGTSLLEYAYEPYMN 286 (569) T ss_pred eEEeecccCCccc-cceeeechhhhhhhcccceeeccccchhhhhhhheeecccccccccccchhhhhHHHHHHHhHHHH Confidence 2211110011100 0001111111 1122111111111 124788878777766655 Q ss_pred HHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHH-----------HHHHHHHHHHhccccc------CcceecCCCceee Q lcl|NC_011801. 188 QDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAK-----------ENTRQSFEEQTTGENA------GRAVVLDQSADVE 250 (386) Q Consensus 188 ~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~-----------~~~k~~~~~~~~~~~~------g~~~vl~~g~~~~ 250 (386) ...+.....+.=-+..+-..+|.+.-..+++++. |+-++.+++...|.++ +-.++.+++.-.. T Consensus 287 l~~Al~sL~~qri~dSv~~~~Itlnm~gM~p~qr~~y~r~lt~~LKr~~d~ie~a~~gg~~~~~~~~H~LPv~gekq~~~ 366 (569) T protein:vir:10 287 LRSAIRSLKATRFNASKIDRIIGLAMNSLDPVKAADYSRTITQTLKRAADLMERRARGANNMPTVTNTLLPIMGDGKGQM 366 (569) T ss_pred HHHHHHhccchhhHHHHHhHHhhccccCCCHHHHhHHHHHHHHHHHHHHHHHHHHhccCccccccceeeeeeecCccccc Confidence 4444332222111222223344443334455443 4455666655544332 1123445543222 Q ss_pred ec--cCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcC-------CcCcccHH-----HHHHHHHHHHHHHHHHHHHH-HH Q lcl|NC_011801. 251 TT--NISPNVTEFLQNVSFSQDQIAKAFGIPADYLSG-------KQDAQSNI-----TMIRAFYQSSLSIYIKPIES-EL 315 (386) Q Consensus 251 ~~--~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~-------~~~~~~~~-----~~~~~~~~~~l~P~~~~ie~-~l 315 (386) ++ +.-+++.--+|-.-+..+.+|.++|+.+.+||- .+.+.... .++...++..+.-++..+-+ .+ T Consensus 367 tvDt~~~~A~~~gIEdvM~~~R~LagaLGlD~SMlGwAD~LsGGLGeGG~frtSaQaa~RS~~iRqa~~e~in~iidiH~ 446 (569) T protein:vir:10 367 TIDTQTIQADINGIEDILTYMRQLAAALGLDYTLLGWADQMSGGLGEGGFLRTAIQAAMRASWIQQGVEEFIQRAIDIHL 446 (569) T ss_pred cccccccccCcccHHHHHHHHHHHHhhhccchhHhhHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 22 223344445667778899999999999999963 22333221 12233444444433332221 12 Q ss_pred HHhhhh---------hhhhcch--hhhcc---CHHHHHHHHHHHH-------hCCCcCHHH--HHHHhccCCcCCCCCCC Q lcl|NC_011801. 316 SQKLGT---------DVKLDIA--SAIDS---DNSELINNVQKLA-------SAGVLAPIQ--AQKLLKNRGVFPELDLD 372 (386) Q Consensus 316 ~~~l~~---------~~~fd~~--~~l~~---d~~~~~~~~~~~~-------~~g~~t~nE--~R~~lg~~p~~p~~~~~ 372 (386) ..|.+. .++|.-. .+-+. ...++++.+..++ ++.++--|| .|.++- . ..++| T Consensus 447 ~fKYgevf~~~drP~~V~F~s~~tAl~~E~~~n~~~raN~a~i~~Q~la~l~e~n~Lg~de~~m~y~l~--d---~~~~D 521 (569) T protein:vir:10 447 AFKYGKVYPEGDRPYKIEFHSVNTALQQEHNDNRDSQANYATIVTQILDAVSNNSVLANSDAFKRYLFS--D---VLEID 521 (569) T ss_pred hhhcCcccCCCCcceEEEeccchHHHHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccHHHHHHHHH--H---Hhhcc Confidence 222221 1223211 12111 2233333333222 222322232 222221 0 11122 Q ss_pred ccc--cccccC-CCCCC Q lcl|NC_011801. 373 EGT--NLLDNT-KNIND 386 (386) Q Consensus 373 ~~~--~~~~~~-~~~~~ 386 (386) |.. ..+.-- ..-.| T Consensus 522 e~~~e~l~ae~~akp~D 538 (569) T protein:vir:10 522 EKISEALVNELKAKSED 538 (569) T ss_pred hhHHHHHHhhcCCCcch Confidence 211 111111 11122 No 242 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=23.75 E-value=2.3 Score=18.64 Aligned_cols=366 Identities=11% Similarity=0.046 Sum_probs=151.0 Q ss_pred Cchhhhhcc-c------cccCCccchhhhh------------------hcccccccCccc------cc-HHHHhccHHHH Q lcl|NC_011801. 1 MAFLSNLFK-R------QKMLSGSSPVWIL------------------NQGQPVSIKPKA------IT-SAIALKNSDVY 48 (386) Q Consensus 1 Mg~~~~l~~-~------~~~~~~~~~~~~~------------------~~~~~~~~~~~~------i~-~~~a~~~~~v~ 48 (386) |.+|.+.-. + ....+...|.... ..+......+.. |. -+....+|.|. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~s~~~P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd 87 (521) T protein:vir:81 8 LARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNLPASAWNSLTQQFYSTDQKISTTKQLVNTYRGLMNNHEVE 87 (521) T ss_pred hHhhcCchhhhHHhhhccCccccccCCCCCCceEecccCCCcceeecceeeeecccccchhhHHHHHHHHHHHhhccchh Confidence 666643211 0 0111111111000 000111111111 11 11235689999 Q ss_pred HHHHHHHHhhccC-----ceeec--ch------------hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecC Q lcl|NC_011801. 49 AVISRVSSDIAGC-----RFVTN--AQ------------PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDT 109 (386) Q Consensus 49 ~~v~~ia~~ia~~-----p~~~~--~~------------~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~ 109 (386) .||+.|.+.+.-. |+.+. +. ...++|+. -+-...+++ .+..|...|..|..++.+. T Consensus 88 ~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~l-l~F~~~~~~----~fR~WYVDgRi~fhkiid~ 162 (521) T protein:vir:81 88 NAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNT-IQFDRRGQD----MFRRWYVDSRIFFHKIIGK 162 (521) T ss_pred hHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHH-hccchhhhH----HHhhhhhcceEEEEEEEcC Confidence 9999998876532 33322 11 11122211 233333433 4556677899999888553 Q ss_pred C--CceEEEEEEcCcceEEeecCCC-----------ceeEEEEeccC---------cccceeEEEcccceeeeccccccC Q lcl|NC_011801. 110 N--GYPVRIEPVPNEKVTVALDDYG-----------KDLTYTVHFDD---------SKRSGDFLYDSSEVIHFRCTVSGE 167 (386) Q Consensus 110 ~--g~~~~l~~l~~~~v~~~~~~~~-----------~~~~~~~~~~~---------~~~~~~~~~~~~~vih~~~~~~~~ 167 (386) + ....+|..|+|..++.++.-.. ...+|.|...+ ...+..+.++.+-|.+. +... . T Consensus 163 ~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI~y~-hSGl-~ 240 (521) T protein:vir:81 163 NPKDGIVELRQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAITYA-HSGL-M 240 (521) T ss_pred CccccceeeeeeCCcceeeeeeecccccCccceecceeeeeeeecCCccccccceeecCCcceeechhheeee-eccc-e Confidence 3 4578999999999876532211 11223332211 12223344555555432 2221 1 Q ss_pred cccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHH-HHHHHHHHh--------ccc--c Q lcl|NC_011801. 168 SDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKEN-TRQSFEEQT--------TGE--N 236 (386) Q Consensus 168 ~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~-~k~~~~~~~--------~~~--~ 236 (386) ..++-.=+|-+..|.+.+.....++...-=+=-.-+.-+-++.+.-+.+.+..+++ +++...+.. .|. + T Consensus 241 d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~im~k~kNklvYDa~TGev~d 320 (521) T protein:vir:81 241 DCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKN 320 (521) T ss_pred eCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccc Confidence 12222234445555555444444443333222222333344443323333333333 333333221 111 1 Q ss_pred cCc-ceec----------CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcc----cHHHHHH--HH Q lcl|NC_011801. 237 AGR-AVVL----------DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQ----SNITMIR--AF 299 (386) Q Consensus 237 ~g~-~~vl----------~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~----~~~~~~~--~~ 299 (386) ..+ +..+ +.|.++++|.-..+ +.-++-.++.++.+..+++||.+=|...+++. ...+-.+ .= T Consensus 321 drk~msMlEDyWLpRReGgrgTEItTLpGgqn-lgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiK 399 (521) T protein:vir:81 321 QQANLSMTEDYWLQRRDGKAITDVTTLPGASG-MSDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELE 399 (521) T ss_pred cccccchhhhhcccccCCCcccceeecccCCC-CChHHHHHHHHHHHHHHhCCccccccCCCCcceeccccchhhHHHHH Confidence 111 2222 23667777654322 22245667888999999999999995433221 1112111 11 Q ss_pred HHHHHHHHHHHHHHHHHHhhh------------------hhhhhc--chhhhcc-----CHHHHHHHHHHHHh--CCC-- Q lcl|NC_011801. 300 YQSSLSIYIKPIESELSQKLG------------------TDVKLD--IASAIDS-----DNSELINNVQKLAS--AGV-- 350 (386) Q Consensus 300 ~~~~l~P~~~~ie~~l~~~l~------------------~~~~fd--~~~~l~~-----d~~~~~~~~~~~~~--~g~-- 350 (386) +..-|.-+...|...|...|- ..++|+ .+..... =+..|+.+++.+-- +.. T Consensus 400 F~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s 479 (521) T protein:vir:81 400 FSKFIRTRQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFS 479 (521) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 223344555555555544331 112222 2211100 01223333332210 111 Q ss_pred ----------cCHHHHHHH------hccCCcCCCCC--CCcc Q lcl|NC_011801. 351 ----------LAPIQAQKL------LKNRGVFPELD--LDEG 374 (386) Q Consensus 351 ----------~t~nE~R~~------lg~~p~~p~~~--~~~~ 374 (386) ||=.|+-++ +.++|+.+.++ ..++ T Consensus 480 ~dyi~k~ILr~tDeei~~~~k~I~~E~~~~~~~~p~~~~~~f 521 (521) T protein:vir:81 480 NQTVMRDILKYTDDQMDTEKKQIEEEANDPRFKQTPDEIEDF 521 (521) T ss_pred hHHHHHHHhccCHHHHHHHHHHHHHHhhCCCCCCCcccccCC Confidence 233333221 12344433332 3333 No 243 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=23.69 E-value=2.3 Score=18.63 Aligned_cols=374 Identities=10% Similarity=0.087 Sum_probs=147.9 Q ss_pred Cchhhhhcccc--------------ccCCccchhh----------------hhhcccccccCccc------ccH-HHHhc Q lcl|NC_011801. 1 MAFLSNLFKRQ--------------KMLSGSSPVW----------------ILNQGQPVSIKPKA------ITS-AIALK 43 (386) Q Consensus 1 Mg~~~~l~~~~--------------~~~~~~~~~~----------------~~~~~~~~~~~~~~------i~~-~~a~~ 43 (386) |++++ +|+-+ +..+...|.. ....++.....+.. |+. +.... T Consensus 1 ~~~~~-lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~d~~~~~~~~~~LI~~YR~ma~ 79 (516) T protein:vir:10 1 MKFLD-LFKFWDRVDQNEYDERLKQGHESIATPKKDDGATEIEAREGESSYNALMQQFFGIDNNISGTKDLINTYRQLTN 79 (516) T ss_pred CCchH-hcccccchhhHHHHhhhcCCCCcccCCCCccCceeeecCcccccccceeeeeecccCccccHHHHHHHHHHhhh Confidence 54433 22221 1111111110 01111111112211 111 23346 Q ss_pred cHHHHHHHHHHHHhhccC-----ceeecch--------------hHHHHHhccCcccCCHHHHHHHHHHHHHHhCCeEEE Q lcl|NC_011801. 44 NSDVYAVISRVSSDIAGC-----RFVTNAQ--------------PITDVLNAPLGNLMSGFSVWQAMIVQMMLTGNAFAI 104 (386) Q Consensus 44 ~~~v~~~v~~ia~~ia~~-----p~~~~~~--------------~~~~~l~~~PN~~~s~~~f~~~~~~~~~l~G~a~~~ 104 (386) +|.|..||+.|.+.+.-. |+.+.-. ...++|+. -+-...+++ .+..|...|..|.. T Consensus 80 ~pEvd~Av~eIvneaiv~d~~~~pV~l~l~~~e~s~sik~kI~eeF~~Il~l-l~F~~~~~~----~fR~WYVDgRi~fh 154 (516) T protein:vir:10 80 NPEVERAVANIVNEAVVYEKGHKVVSLDLDDTEFSSSIKDKILEEFDEICRL-LDASRKLDT----LFRRWYIDSRIFFH 154 (516) T ss_pred ccchhHHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHH-hccchhhhH----HHHhhhhcceEEEE Confidence 899999999998876532 3333211 11122221 233334444 44556677888887 Q ss_pred Eeec-CCCceEEEEEEcCcceEEeecC-----------CCceeEEEEeccC---------cccceeEEEcccceeeeccc Q lcl|NC_011801. 105 IDRD-TNGYPVRIEPVPNEKVTVALDD-----------YGKDLTYTVHFDD---------SKRSGDFLYDSSEVIHFRCT 163 (386) Q Consensus 105 ~~~~-~~g~~~~l~~l~~~~v~~~~~~-----------~~~~~~~~~~~~~---------~~~~~~~~~~~~~vih~~~~ 163 (386) ++.+ ......+|..|+|..++.++.- .+...+|.|...+ ...+..+.++.+-|. +-+. T Consensus 155 Kiid~~k~GI~elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~daI~-y~hS 233 (516) T protein:vir:10 155 KIMPNPKEGIVELRRLDPRHVEYYREIVTSDVGGTSVVKGYREFFVYTTGNEGYAYNGRLFEPNTRIKIPRSAIV-YAHS 233 (516) T ss_pred EEecCcccceeeeeeeCCcceeeEEeeecccCcchhhhhceeeeeeeecCccceeccccccCCCCceecchhhee-eeec Confidence 5544 3455789999999998765322 1111122222111 111223445554443 3322 Q ss_pred cccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHH-HHHHHHHHhc----ccccC Q lcl|NC_011801. 164 VSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKEN-TRQSFEEQTT----GENAG 238 (386) Q Consensus 164 ~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~-~k~~~~~~~~----~~~~g 238 (386) ...+..++. =+|-+..|.+.+.....++...-=+=-.-+.-+-++.+.-+.+.+..+++ +++...+... ..++| T Consensus 234 Gl~d~~~~~-i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~KNklvYDa~TG 312 (516) T protein:vir:10 234 GLQDCSDRG-IVGYLHNAVKPANQLKLLEDALVIYRITRAPERRVFYIDVGNMPNRKATEYVNGIMQSLKNRVVYDSNTG 312 (516) T ss_pred CcccCCCCc-eeceehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCC Confidence 211111111 13444444444444443333332222122223334433322333332222 3333322110 01122 Q ss_pred ------c-ceec----------CCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCcCcc----cHHHHHH Q lcl|NC_011801. 239 ------R-AVVL----------DQSADVETTNISPNVTEFLQNVSFSQDQIAKAFGIPADYLSGKQDAQ----SNITMIR 297 (386) Q Consensus 239 ------~-~~vl----------~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gvp~~~l~~~~~~~----~~~~~~~ 297 (386) + +..+ +.|.++++|.-..+ +.-++-.++.++.+..+++||.+=+...+..+ .+.+-.+ T Consensus 313 ev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqn-lgem~DV~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~EItR 391 (516) T protein:vir:10 313 TVKNQKRNLSMTEDYWLMRRDGKSVTEVTSLPGAQT-MGEMDDVRWFNKKLYEALRIPLSRMPRDDGGMVIGGQDMAITR 391 (516) T ss_pred eeccchhhhhhHhhhcccccCCCcccceeeccccCC-cChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhH Confidence 1 1122 23667777654322 22345667888999999999999986543321 2222221 Q ss_pred H--HHHHHHHHHHHHHHHHH----HHhhh--------------hhhhhc--chhhhc----c-CHHHHHHHHHHHH--hC Q lcl|NC_011801. 298 A--FYQSSLSIYIKPIESEL----SQKLG--------------TDVKLD--IASAID----S-DNSELINNVQKLA--SA 348 (386) Q Consensus 298 ~--~~~~~l~P~~~~ie~~l----~~~l~--------------~~~~fd--~~~~l~----~-d~~~~~~~~~~~~--~~ 348 (386) . =+..-|.-+...|...| ...|. ..++|+ .+.... . =+..|+.+++.+- -+ T Consensus 392 DEiKF~KFI~rLR~rFs~lF~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvG 471 (516) T protein:vir:10 392 DELDFRKFIVQLQHNFEEIFLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELKDIETLRQRVDALSQIEPYVG 471 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhc Confidence 1 12233444444444444 33331 112222 222110 0 1233444443332 23 Q ss_pred CCcCHHHHHHHhccCCcCCCCCCC--------cccc-ccccCCCCCC Q lcl|NC_011801. 349 GVLAPIQAQKLLKNRGVFPELDLD--------EGTN-LLDNTKNIND 386 (386) Q Consensus 349 g~~t~nE~R~~lg~~p~~p~~~~~--------~~~~-~~~~~~~~~~ 386 (386) ++++.+=+|+.+=.. .+++.. |..+ .++.+..-.| T Consensus 472 ky~s~~yi~k~ILr~---tDeei~~~~k~I~~E~~~~~~~~p~~e~~ 515 (516) T protein:vir:10 472 KYVSHDYVMKNILQM---TDEQIAQEEKQIEKEANVKRFQNPENEDD 515 (516) T ss_pred cccchHHHHHHHhcC---CHhHHHHHHHHHHHhhhCCCCCCCCcccc Confidence 444444443322100 000000 0111 1111111112 No 244 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=21.17 E-value=2.6 Score=18.27 Aligned_cols=346 Identities=12% Similarity=0.051 Sum_probs=142.4 Q ss_pred CchhhhhccccccCCccchhhhhhcccccccCcccccHHHHhccHHHHHHHHHHHHhhcc------Cce---eecc---- Q lcl|NC_011801. 1 MAFLSNLFKRQKMLSGSSPVWILNQGQPVSIKPKAITSAIALKNSDVYAVISRVSSDIAG------CRF---VTNA---- 67 (386) Q Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~a~~~~~v~~~v~~ia~~ia~------~p~---~~~~---- 67 (386) +.-..+.+....... . .+...+.+. =+++--.|++.+|..+.+ -|| .+.+ T Consensus 38 lP~~~~~~~~~~~~~--~-------------~~~~~~~~~--~dstg~~a~~~LAs~l~~~ltpp~~~wF~l~~~~~~~~ 100 (549) T protein:vir:10 38 MPRLDKFGQLPRPDS--E-------------KGRERSQKM--FDSTAPLALRNFVAAMDSMITPATQLWHRLKTGNDALN 100 (549) T ss_pred ccccccccccCCCCC--C-------------ccccccccc--ccchHHHHHHHHHHHHHhhccCCCCccccccCCccchh Confidence 332222211110000 0 000000000 022222444444443321 233 1111 Q ss_pred -------------hhHHHHHh-ccCcccCCHHHHHHHHHHHHHHhCCeEEEEeecCCCceEEEEEEcCcceEEeecCCCc Q lcl|NC_011801. 68 -------------QPITDVLN-APLGNLMSGFSVWQAMIVQMMLTGNAFAIIDRDTNGYPVRIEPVPNEKVTVALDDYGK 133 (386) Q Consensus 68 -------------~~~~~~l~-~~PN~~~s~~~f~~~~~~~~~l~G~a~~~~~~~~~g~~~~l~~l~~~~v~~~~~~~~~ 133 (386) +.+...++ .+-| .+.-+..+..+++.+|++.+++..+. ++...+..+|...+-+..|..+. T Consensus 101 e~~~v~~~l~~ve~~~~~~~~~~~sn----f~~~~~~~~~~L~~~Gta~l~~~~~~-~~~~~f~~~pl~~~~v~~d~~G~ 175 (549) T protein:vir:10 101 EIASVKAYLQGVVRTLFAARYRWQGG----FVTQMGATYQSIGLFGPGALMIEHDV-GKGIVYRNVPMQRLWFAENNSGL 175 (549) T ss_pred hhhHHHHHHHHHHHHHHHHHhhhhcC----hHHHHHHHHHHHHhhcceeeEEeecC-CCeeEEEEEEcCeEEEeeCCCCC Confidence 01111121 1223 23345567788999999999887654 34556666666777776666665 Q ss_pred eeEEEEec-------------cCc----------cccee---------------------------E--------EE--- Q lcl|NC_011801. 134 DLTYTVHF-------------DDS----------KRSGD---------------------------F--------LY--- 152 (386) Q Consensus 134 ~~~~~~~~-------------~~~----------~~~~~---------------------------~--------~~--- 152 (386) ....+..+ +.- ..... + .+ T Consensus 176 vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~v~~~V~pr~~~~~~~~~~~~~pf~sv~~e~~~~~il~es 255 (549) T protein:vir:10 176 IDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYHAVEPRADRDPRKLDGRNMQFASYWLDEGRDRIVQNS 255 (549) T ss_pred eEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCCCceEEEEEEeecCCCCCccccccccCceEEEEEEecCCEeeccC Confidence 43321000 000 00000 0 00 Q ss_pred --cccceeeeccccccCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEeeCCCCCCHHHHHHHHHHHHH Q lcl|NC_011801. 153 --DSSEVIHFRCTVSGESDTQYMGIPPIDSLLNEIEVQDLSSKLAISTLRHAIKPSIFIKVPNATLGKEAKENTRQSFEE 230 (386) Q Consensus 153 --~~~~vih~~~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~~l~~~~~~~~~~~~~~~k~~~~~ 230 (386) ..-+.+-.||. ...+..||.||...+...+...+...+.......-...|...+. ++...++... T Consensus 256 g~~e~P~~~~Rw~---~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~-~~g~~~~~~l--------- 322 (549) T protein:vir:10 256 GFRTFPFAIGRFY---VGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGAQKLVDPPLLAN-EDGVLDGFDL--------- 322 (549) T ss_pred CcccCCcceeeee---ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec-ccccccccee--------- Confidence 00111112222 12344799999999999999999999999998888888877653 2333333211 Q ss_pred HhcccccCcceecCCCceeeeccCChhhHHH-HHHHHHHHHHHHHHhCCCHH-HhcCCcCcccHH-HHHHHHHHHHHHHH Q lcl|NC_011801. 231 QTTGENAGRAVVLDQSADVETTNISPNVTEF-LQNVSFSQDQIAKAFGIPAD-YLSGKQDAQSNI-TMIRAFYQSSLSIY 307 (386) Q Consensus 231 ~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~-~e~~~~~~~~Ia~~~gvp~~-~l~~~~~~~~~~-~~~~~~~~~~l~P~ 307 (386) ..|........-.+...++++... .+.+. .+..+..+..|-.+|-.... ++-..+.-+..| .++..-....|.|. T Consensus 323 -~pgg~~~~~~~~~~~~~~~pl~~~-~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~~TAtEV~~r~~E~~~~LGpv 400 (549) T protein:vir:10 323 -RSGALNWGGLNDKGEEMVKPLLTG-KQAQIGIEFAQDTRQTINQWFYVTLFQILVDSGDMTATEVLQRAQEKGVLLAPT 400 (549) T ss_pred -ccCCccccccCCCCccceeeeccc-cchhHHHHHHHHHHHHHHHHHhhhhhhhhcCCCCccHHHHHHHHHHHHHHhhHH Confidence 112111111111233456776554 34443 45578888899999987763 332222222222 23344556778888 Q ss_pred HHHHHHHHHHhhhhhh-h-hcchhhhccCHHHHHHHH-HHHHh-CCCcCH----HHHHHHhccCCcCCCCCCCccccccc Q lcl|NC_011801. 308 IKPIESELSQKLGTDV-K-LDIASAIDSDNSELINNV-QKLAS-AGVLAP----IQAQKLLKNRGVFPELDLDEGTNLLD 379 (386) Q Consensus 308 ~~~ie~~l~~~l~~~~-~-fd~~~~l~~d~~~~~~~~-~~~~~-~g~~t~----nE~R~~lg~~p~~p~~~~~~~~~~~~ 379 (386) ...+.++|-.-|...+ . ....+.+..-+++..+-. ..-+. .+.+.. .++-..+...-.. +.....+-.+. T Consensus 401 ~~rl~~E~l~Pli~R~~~il~r~g~lP~~p~~l~~~~~~~~i~yis~La~aq~~~~~~~i~~~~~~~--~~laq~~Pe~l 478 (549) T protein:vir:10 401 LGRTQSELLGPMIAREVDILAEAGQLPDMPQELIDAGADVDVEYDSPLNKAMRAGEGAAILQWLQQL--GIVSQFDPAAA 478 (549) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhhcCCceeEEEeecHHHHHHHHHHHHHHHHHHHHH--HHHhccChhHH Confidence 8888877644332211 0 011111111111111000 00000 000010 1111111110000 00000000000 Q ss_pred cCCCCCC Q lcl|NC_011801. 380 NTKNIND 386 (386) Q Consensus 380 ~~~~~~~ 386 (386) ..-+-| T Consensus 479 -d~id~d 484 (549) T protein:vir:10 479 -KVPNGA 484 (549) T ss_pred -hcCCHH Confidence 011112 Done!