Query lcl|NC_010576.1_cdsid_YP_001828657.1 [gene=LaP1706_gp09] [protein=putative portal protein] [protein_id=YP_001828657.1] [location=5627..6970]
Match_columns 447
No_of_seqs 170 out of 826
Neff 9.4
Searched_HMMs 1612
Date Thu Nov 7 13:29:41 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_9 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_9_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:105002 Length: 432 100.0 5E-88 3.1E-91 499.2 41.1 414 1-428 1-432 (432)
2 protein:vir:102855 Length: 432 100.0 5E-88 3.1E-91 499.2 41.1 414 1-428 1-432 (432)
3 protein:vir:107605 Length: 432 100.0 5E-88 3.1E-91 499.2 41.1 414 1-428 1-432 (432)
4 protein:vir:81072 Length: 432 100.0 8.2E-87 5.1E-90 492.5 40.1 410 1-435 7-432 (432)
5 protein:vir:97060 Length: 432 100.0 1.8E-86 1.1E-89 490.6 40.1 410 1-435 7-432 (432)
6 protein:vir:10362 Length: 432 100.0 2.6E-86 1.6E-89 489.8 40.1 410 1-435 7-432 (432)
7 protein:vir:102080 Length: 429 100.0 9.5E-86 5.9E-89 486.7 40.6 411 1-428 1-429 (429)
8 protein:vir:100249 Length: 431 100.0 1.8E-85 1.1E-88 485.2 38.8 399 1-425 1-431 (431)
9 protein:vir:6240 Length: 457 # 100.0 4.7E-85 2.9E-88 482.9 40.7 431 1-447 1-456 (457)
10 protein:vir:81152 Length: 411 100.0 7E-85 4.4E-88 481.9 38.7 391 1-416 1-411 (411)
11 protein:vir:1380 Length: 422 # 100.0 9E-85 5.6E-88 481.4 39.2 402 1-419 1-422 (422)
12 protein:vir:1884 Length: 424 # 100.0 1.7E-84 1.1E-87 479.8 40.6 393 1-417 14-424 (424)
13 protein:vir:4454 Length: 414 # 100.0 2.6E-84 1.6E-87 478.8 39.6 399 1-434 1-414 (414)
14 protein:vir:189 Length: 424 # 100.0 5.8E-84 3.6E-87 476.9 40.3 393 1-417 14-424 (424)
15 protein:vir:4509 Length: 424 # 100.0 8.5E-84 5.3E-87 476.0 40.2 396 1-431 16-424 (424)
16 protein:vir:100150 Length: 437 100.0 2.7E-83 1.7E-86 473.2 39.9 415 1-438 1-437 (437)
17 protein:vir:1326 Length: 457 # 100.0 2.8E-83 1.8E-86 473.1 39.3 424
1-440 1-457 (457) 18 protein:vir:5737 Length: 419 # 100.0 3E-83 1.9E-86 473.0 38.6 404 1-438 1-419 (419) 19 protein:vir:105064 Length: 421 100.0 2.4E-83 1.5E-86 473.5 38.1 403 1-440 1-421 (421) 20 protein:vir:1431 Length: 419 # 100.0 2.6E-83 1.6E-86 473.3 38.2 403 2-434 1-419 (419) 21 protein:vir:81218 Length: 423 100.0 4.8E-83 3E-86 471.9 39.3 401 1-433 1-423 (423) 22 protein:vir:4337 Length: 434 # 100.0 6.9E-83 4.3E-86 471.0 39.3 414 1-431 1-434 (434) 23 protein:vir:101648 Length: 518 100.0 4.8E-82 3E-85 466.4 40.8 427 1-447 1-454 (518) 24 protein:vir:7853 Length: 518 # 100.0 5.9E-82 3.6E-85 465.9 40.7 430 1-447 1-454 (518) 25 protein:vir:483 Length: 413 # 100.0 3.6E-82 2.3E-85 467.1 38.9 398 8-435 1-413 (413) 26 protein:vir:93610 Length: 454 100.0 5.6E-82 3.5E-85 466.0 39.9 423 10-447 1-452 (454) 27 protein:vir:9408 Length: 441 # 100.0 1.6E-81 1E-84 463.5 38.6 412 1-431 11-441 (441) 28 protein:vir:79984 Length: 441 100.0 1.6E-81 1E-84 463.5 38.6 412 1-431 11-441 (441) 29 protein:vir:94002 Length: 378 100.0 7.3E-82 4.5E-85 465.4 36.3 366 1-431 1-378 (378) 30 protein:vir:1266 Length: 416 # 100.0 3.3E-81 2E-84 461.8 38.9 402 2-432 1-416 (416) 31 protein:vir:102118 Length: 409 100.0 2E-81 1.2E-84 463.1 37.6 391 1-416 1-409 (409) 32 protein:vir:80333 Length: 419 100.0 1.1E-81 7.1E-85 464.3 36.2 404 1-438 1-419 (419) 33 protein:vir:8418 Length: 409 # 100.0 1.4E-80 8.6E-84 458.4 40.4 394 1-428 1-409 (409) 34 protein:vir:98396 Length: 441 100.0 6.4E-81 4E-84 460.2 37.9 412 1-431 11-441 (441) 35 protein:vir:858 Length: 378 # 100.0 3.5E-81 2.2E-84 461.7 36.4 366 1-431 1-378 (378) 36 protein:vir:93867 Length: 378 100.0 4.3E-81 2.7E-84 461.2 36.6 366 1-431 1-378 (378) 37 protein:vir:3868 Length: 417 # 100.0 1E-80 6.3E-84 459.1 38.3 397 1-439 1-417 (417) 38 protein:vir:94869 Length: 378 100.0 8.1E-81 5E-84 459.7 37.5 366 1-431 1-378 (378) 39 protein:vir:1661 Length: 378 # 100.0 1E-80 6.3E-84 459.1 36.6 366 1-431 1-378 (378) 40 protein:vir:96980 Length: 409 100.0 3.1E-80 
2E-83 456.4 38.7 391 1-421 4-409 (409) 41 protein:vir:4598 Length: 416 # 100.0 2.5E-80 1.6E-83 457.0 38.0 402 1-431 1-416 (416) 42 protein:vir:81095 Length: 416 100.0 2.5E-80 1.6E-83 457.0 38.0 402 1-431 1-416 (416) 43 protein:vir:4089 Length: 395 # 100.0 2.3E-80 1.4E-83 457.2 37.1 388 1-426 1-395 (395) 44 protein:vir:2683 Length: 412 # 100.0 1.1E-79 7E-83 453.4 38.9 394 1-421 1-412 (412) 45 protein:vir:93943 Length: 409 100.0 1.9E-79 1.2E-82 452.2 38.1 391 1-421 4-409 (409) 46 protein:vir:94426 Length: 409 100.0 4.8E-78 3E-81 444.5 37.9 391 1-421 4-409 (409) 47 protein:vir:9702 Length: 406 # 100.0 7.9E-77 4.9E-80 437.8 38.3 394 1-435 1-406 (406) 48 protein:vir:9641 Length: 395 # 100.0 3.3E-77 2.1E-80 439.9 36.0 374 1-422 1-395 (395) 49 protein:vir:100650 Length: 395 100.0 6.9E-77 4.3E-80 438.1 35.8 383 1-429 1-395 (395) 50 protein:vir:101289 Length: 395 100.0 6.9E-77 4.3E-80 438.1 35.8 383 1-429 1-395 (395) 51 protein:vir:9507 Length: 395 # 100.0 6.9E-77 4.3E-80 438.1 35.8 383 1-429 1-395 (395) 52 protein:vir:98643 Length: 395 100.0 8.6E-77 5.3E-80 437.6 35.6 378 1-422 1-395 (395) 53 protein:vir:101647 Length: 460 100.0 1E-76 6.4E-80 437.2 35.8 399 1-419 2-460 (460) 54 protein:vir:78310 Length: 376 100.0 6.7E-77 4.2E-80 438.2 33.9 364 1-413 1-376 (376) 55 protein:vir:95965 Length: 385 100.0 2E-76 1.2E-79 435.6 35.7 372 1-419 1-385 (385) 56 protein:vir:80134 Length: 403 100.0 3.2E-76 2E-79 434.5 36.1 388 1-431 1-403 (403) 57 protein:vir:8317 Length: 409 # 100.0 2.3E-76 1.4E-79 435.3 35.4 368 1-400 1-409 (409) 58 protein:vir:94666 Length: 723 100.0 6.1E-76 3.8E-79 432.9 37.4 408 1-447 1-447 (723) 59 protein:vir:960 Length: 413 # 100.0 7.8E-76 4.8E-79 432.4 36.9 392 1-416 1-413 (413) 60 protein:vir:6210 Length: 394 # 100.0 1.3E-75 7.9E-79 431.2 35.1 380 1-427 1-394 (394) 61 protein:vir:8100 Length: 466 # 100.0 1.1E-74 6.9E-78 426.0 39.1 413 1-429 1-466 (466) 62 protein:vir:95378 Length: 406 100.0 6E-75 3.7E-78 427.5 37.5 390 1-433 1-406 (406) 63 protein:vir:104259 
Length: 403 100.0 7.6E-75 4.7E-78 426.9 37.7 381 1-425 1-403 (403) 64 protein:vir:102727 Length: 945 100.0 4.6E-72 2.9E-75 411.7 38.9 436 1-447 52-548 (945) 65 protein:vir:3843 Length: 397 # 100.0 4E-72 2.5E-75 412.0 37.1 385 1-431 1-397 (397) 66 protein:vir:7407 Length: 392 # 100.0 8.4E-71 5.2E-74 404.8 34.9 375 1-421 3-392 (392) 67 protein:vir:9359 Length: 348 # 100.0 1.5E-70 9.2E-74 403.4 35.0 333 71-421 1-348 (348) 68 protein:vir:3989 Length: 392 # 100.0 1.8E-70 1.1E-73 402.9 35.0 375 1-421 3-392 (392) 69 protein:vir:1023 Length: 392 # 100.0 1.8E-70 1.1E-73 402.9 35.0 375 1-421 3-392 (392) 70 protein:vir:100187 Length: 385 100.0 2.7E-70 1.7E-73 402.0 34.7 367 1-418 1-385 (385) 71 protein:vir:100882 Length: 383 100.0 2.3E-68 1.4E-71 391.4 34.6 365 1-417 1-383 (383) 72 protein:vir:4995 Length: 384 # 100.0 9.6E-69 6E-72 393.5 31.1 365 1-395 1-384 (384) 73 protein:vir:4854 Length: 386 # 100.0 6.3E-68 3.9E-71 389.0 35.1 375 1-420 1-386 (386) 74 protein:vir:100691 Length: 535 100.0 1.7E-66 1.1E-69 381.1 37.6 441 1-447 1-535 (535) 75 protein:vir:4952 Length: 386 # 100.0 1.6E-65 9.9E-69 375.8 35.1 375 1-420 1-386 (386) 76 protein:vir:4828 Length: 382 # 100.0 3.9E-66 2.4E-69 379.2 30.7 368 1-420 1-382 (382) 77 protein:vir:1082 Length: 359 # 100.0 2.6E-64 1.6E-67 369.2 32.7 342 1-388 1-359 (359) 78 protein:vir:80644 Length: 551 100.0 1.6E-63 1E-66 364.8 36.5 431 1-447 5-539 (551) 79 protein:vir:80796 Length: 574 100.0 5.4E-63 3.3E-66 362.0 36.9 432 1-447 27-545 (574) 80 protein:vir:3153 Length: 467 # 100.0 2.1E-62 1.3E-65 358.8 35.7 387 51-444 1-467 (467) 81 protein:vir:63755 Length: 547 100.0 4.6E-62 2.8E-65 356.9 36.5 432 1-447 1-535 (547) 82 protein:vir:4156 Length: 542 # 100.0 6E-62 3.7E-65 356.2 35.1 419 3-447 1-467 (542) 83 protein:vir:4194 Length: 540 # 100.0 1.4E-61 8.7E-65 354.2 35.8 412 1-447 6-469 (540) 84 protein:vir:96579 Length: 576 100.0 6.7E-60 4.2E-63 345.0 38.4 433 1-447 6-544 (576) 85 protein:vir:95599 Length: 563 100.0 1.7E-59 1.1E-62 342.7 37.6 423 
1-447 43-549 (563) 86 protein:vir:99312 Length: 563 100.0 1.7E-59 1.1E-62 342.7 37.6 423 1-447 43-549 (563) 87 protein:vir:79772 Length: 648 100.0 1.5E-57 9.1E-61 332.2 36.4 432 1-447 8-515 (648) 88 protein:vir:99452 Length: 651 100.0 2.9E-55 1.8E-58 319.6 31.1 426 1-447 78-589 (651) 89 protein:vir:78641 Length: 278 100.0 2.8E-52 1.8E-55 303.2 29.3 268 71-352 1-278 (278) 90 protein:vir:79150 Length: 368 100.0 3.7E-41 2.3E-44 242.3 24.4 333 1-368 1-368 (368) 91 protein:vir:103971 Length: 376 100.0 8.5E-39 5.3E-42 229.3 24.7 320 1-359 26-376 (376) 92 protein:vir:4698 Length: 251 # 100.0 2.3E-39 1.5E-42 232.4 21.5 243 1-268 1-251 (251) 93 protein:vir:100328 Length: 346 100.0 3.2E-38 2E-41 226.1 27.6 316 1-357 1-346 (346) 94 protein:vir:267 Length: 348 # 100.0 2E-38 1.2E-41 227.3 24.7 316 14-360 1-348 (348) 95 protein:vir:79207 Length: 351 100.0 3E-38 1.9E-41 226.3 25.1 322 1-359 1-351 (351) 96 protein:vir:78191 Length: 351 100.0 5.6E-38 3.5E-41 224.8 24.7 322 1-359 1-351 (351) 97 protein:vir:3780 Length: 345 # 100.0 5.2E-37 3.2E-40 219.5 28.0 318 8-354 1-345 (345) 98 protein:vir:3743 Length: 345 # 100.0 4.5E-37 2.8E-40 219.9 27.4 319 1-354 1-345 (345) 99 protein:vir:98567 Length: 340 100.0 1.1E-37 6.6E-41 223.3 23.6 312 1-356 1-340 (340) 100 protein:vir:78749 Length: 337 100.0 3.7E-37 2.3E-40 220.3 24.6 308 1-353 1-337 (337) 101 protein:vir:6058 Length: 344 # 100.0 6E-37 3.7E-40 219.2 23.8 318 1-357 1-344 (344) 102 protein:vir:5691 Length: 344 # 100.0 2E-36 1.2E-39 216.3 24.9 309 1-357 1-344 (344) 103 protein:vir:2013 Length: 344 # 100.0 5.7E-36 3.5E-39 213.8 24.5 316 1-357 1-344 (344) 104 protein:vir:1150 Length: 350 # 100.0 3E-36 1.9E-39 215.3 22.1 313 1-352 1-350 (350) 105 protein:vir:98853 Length: 219 100.0 4.2E-31 2.6E-34 187.1 20.2 195 147-356 1-219 (219) 106 protein:vir:5249 Length: 437 # 99.8 5.5E-19 3.4E-22 120.7 29.5 385 1-437 1-437 (437) 107 protein:vir:107742 Length: 537 99.8 1.3E-18 7.8E-22 118.7 29.4 418 1-444 25-537 (537) 108 protein:vir:94049 Length: 
532 99.7 3.1E-17 1.9E-20 111.1 29.5 427 1-447 17-525 (532) 109 protein:vir:99563 Length: 862 99.7 2.3E-16 1.4E-19 106.3 29.6 416 1-447 66-598 (862) 110 protein:vir:79647 Length: 435 99.6 1.7E-16 1.1E-19 107.0 23.8 379 1-431 1-435 (435) 111 protein:vir:96068 Length: 765 99.6 3.5E-15 2.2E-18 99.9 29.6 420 1-447 37-553 (765) 112 protein:vir:80040 Length: 461 99.5 4.3E-15 2.7E-18 99.3 22.4 393 1-434 1-461 (461) 113 protein:vir:104338 Length: 422 99.5 1.6E-14 1E-17 96.2 24.0 369 1-429 1-422 (422) 114 protein:vir:107662 Length: 427 99.5 5.5E-14 3.4E-17 93.3 22.1 367 1-438 1-427 (427) 115 protein:vir:103860 Length: 528 99.4 9.5E-13 5.9E-16 86.5 28.3 413 1-447 1-484 (528) 116 protein:vir:79538 Length: 502 99.4 1.2E-13 7.4E-17 91.4 22.1 422 1-432 1-502 (502) 117 protein:vir:108215 Length: 469 99.4 6.4E-12 4E-15 82.0 31.2 405 1-444 14-469 (469) 118 protein:vir:99232 Length: 526 99.3 5.3E-11 3.3E-14 76.9 31.2 420 1-447 1-482 (526) 119 protein:vir:96738 Length: 505 99.3 1.1E-12 6.6E-16 86.2 21.7 421 1-439 8-505 (505) 120 protein:vir:95542 Length: 548 99.3 6.4E-12 4E-15 81.9 22.9 442 1-447 1-540 (548) 121 protein:vir:4073 Length: 279 # 99.2 1.4E-13 8.7E-17 91.1 12.1 266 58-390 1-279 (279) 122 protein:vir:79233 Length: 526 99.2 4.8E-10 3E-13 71.7 31.0 418 1-447 1-482 (526) 123 protein:vir:389 Length: 530 # 99.2 5.4E-11 3.3E-14 76.9 25.0 422 1-446 1-530 (530) 124 protein:vir:1986 Length: 512 # 99.1 4.3E-10 2.7E-13 71.9 26.6 422 1-447 1-470 (512) 125 protein:vir:106716 Length: 698 99.1 4.9E-10 3E-13 71.6 26.7 431 1-447 46-578 (698) 126 protein:vir:99853 Length: 488 99.1 2.6E-10 1.6E-13 73.2 24.5 395 14-447 1-440 (488) 127 protein:vir:3420 Length: 533 # 99.1 5.4E-10 3.4E-13 71.4 25.0 429 1-440 3-533 (533) 128 protein:vir:78589 Length: 695 99.1 1.5E-09 9.3E-13 69.0 26.3 428 1-447 46-588 (695) 129 protein:vir:101541 Length: 694 99.0 4.2E-09 2.6E-12 66.5 27.6 427 1-447 48-587 (694) 130 protein:vir:95254 Length: 488 99.0 5.1E-09 3.2E-12 66.0 28.8 411 1-442 1-488 (488) 131 
protein:vir:3648 Length: 695 # 99.0 4.1E-09 2.5E-12 66.6 27.1 431 1-447 49-588 (695) 132 protein:vir:6382 Length: 553 # 99.0 2.6E-09 1.6E-12 67.7 25.3 426 1-435 2-553 (553) 133 protein:vir:105782 Length: 449 98.9 3.8E-10 2.3E-13 72.3 18.6 381 1-435 7-449 (449) 134 protein:vir:10321 Length: 495 98.9 1.1E-09 7.1E-13 69.6 21.0 420 1-431 1-495 (495) 135 protein:vir:79063 Length: 491 98.9 6.1E-09 3.8E-12 65.6 24.5 416 1-447 1-453 (491) 136 protein:vir:106491 Length: 646 98.9 6.1E-09 3.8E-12 65.6 24.0 425 1-447 1-517 (646) 137 protein:vir:107880 Length: 491 98.9 1.9E-08 1.2E-11 62.9 26.3 406 1-447 1-453 (491) 138 protein:vir:98816 Length: 446 98.8 2.8E-08 1.7E-11 62.0 27.1 366 1-392 3-446 (446) 139 protein:vir:97376 Length: 320 98.8 3.7E-10 2.3E-13 72.3 14.0 310 1-415 1-320 (320) 140 protein:vir:102426 Length: 631 98.6 2.7E-07 1.7E-10 56.6 23.3 435 1-447 1-534 (631) 141 protein:vir:103219 Length: 201 98.5 5.9E-09 3.7E-12 65.7 13.3 177 225-430 1-201 (201) 142 protein:vir:107517 Length: 639 98.5 1.9E-07 1.2E-10 57.4 20.8 429 1-447 1-533 (639) 143 protein:vir:97900 Length: 639 98.5 1.9E-07 1.2E-10 57.4 20.8 429 1-447 1-533 (639) 144 protein:vir:79511 Length: 448 98.4 8.9E-07 5.5E-10 53.8 28.0 403 1-435 1-448 (448) 145 protein:vir:99088 Length: 629 98.4 9.4E-07 5.9E-10 53.6 22.7 424 1-447 1-531 (629) 146 protein:vir:8654 Length: 629 # 98.4 1E-06 6.2E-10 53.5 23.4 425 1-447 1-531 (629) 147 protein:vir:77981 Length: 448 98.3 1.8E-06 1.1E-09 52.1 27.9 402 1-442 1-448 (448) 148 protein:vir:78161 Length: 355 97.7 2.5E-05 1.5E-08 45.8 26.6 311 127-444 1-355 (355) 149 protein:vir:106027 Length: 629 97.6 3.7E-05 2.3E-08 44.9 23.5 425 1-447 1-521 (629) 150 protein:vir:4782 Length: 522 # 97.4 6.9E-05 4.3E-08 43.4 20.1 407 1-439 1-522 (522) 151 protein:vir:7987 Length: 456 # 96.8 0.00035 2.2E-07 39.5 20.7 384 1-427 7-456 (456) 152 protein:vir:2500 Length: 501 # 96.7 0.0004 2.5E-07 39.2 22.9 400 1-446 29-501 (501) 153 protein:vir:7768 Length: 484 # 96.7 0.00043 2.6E-07 39.1 21.4 401 
1-444 14-484 (484) 154 protein:vir:99072 Length: 479 96.1 0.001 6.3E-07 37.0 24.3 406 1-445 15-479 (479) 155 protein:vir:104082 Length: 485 95.9 0.0013 7.8E-07 36.5 24.6 402 1-435 8-485 (485) 156 protein:vir:98444 Length: 434 95.9 0.0013 8.2E-07 36.4 24.6 367 14-444 1-434 (434) 157 protein:vir:2427 Length: 485 # 95.5 0.0019 1.2E-06 35.5 24.8 391 1-435 26-485 (485) 158 protein:vir:105819 Length: 456 95.4 0.0021 1.3E-06 35.3 21.7 389 1-433 7-456 (456) 159 protein:vir:102602 Length: 456 95.4 0.0021 1.3E-06 35.3 21.7 389 1-433 7-456 (456) 160 protein:vir:79703 Length: 505 95.4 0.0022 1.4E-06 35.1 22.8 393 1-426 1-505 (505) 161 protein:vir:99916 Length: 504 95.3 0.0024 1.5E-06 35.0 23.6 420 1-445 23-504 (504) 162 protein:vir:1587 Length: 508 # 95.2 0.0025 1.5E-06 34.9 21.6 396 1-431 1-508 (508) 163 protein:vir:99452 Length: 651 95.1 0.00058 3.6E-07 38.3 8.9 428 1-447 1-543 (651) 164 protein:vir:9815 Length: 500 # 94.8 0.0034 2.1E-06 34.1 21.3 394 1-437 1-500 (500) 165 protein:vir:3028 Length: 500 # 94.8 0.0034 2.1E-06 34.1 21.3 394 1-437 1-500 (500) 166 protein:vir:2341 Length: 488 # 94.8 0.0034 2.1E-06 34.1 22.9 406 1-435 10-488 (488) 167 protein:vir:78907 Length: 518 94.7 0.0037 2.3E-06 33.9 26.0 407 1-445 1-518 (518) 168 protein:vir:98883 Length: 517 94.5 0.0041 2.6E-06 33.7 21.7 374 1-431 1-517 (517) 169 protein:vir:78537 Length: 480 92.9 0.0094 5.9E-06 31.7 25.7 395 1-446 1-480 (480) 170 protein:vir:94742 Length: 409 92.5 0.011 7E-06 31.3 23.8 343 1-388 3-409 (409) 171 protein:vir:98265 Length: 524 92.1 0.013 8E-06 31.0 16.0 405 1-434 4-524 (524) 172 protein:vir:78227 Length: 480 91.2 0.017 1.1E-05 30.3 26.3 398 1-443 1-480 (480) 173 protein:vir:9751 Length: 422 # 90.3 0.021 1.3E-05 29.7 21.9 358 1-410 1-422 (422) 174 protein:vir:8184 Length: 474 # 90.2 0.022 1.4E-05 29.7 21.6 394 1-417 17-474 (474) 175 protein:vir:1634 Length: 409 # 87.6 0.038 2.4E-05 28.4 23.7 344 1-388 3-409 (409) 176 protein:vir:5839 Length: 533 # 86.8 0.043 2.7E-05 28.1 23.6 427 1-447 1-529 
(533) 177 protein:vir:4223 Length: 486 # 85.8 0.05 3.1E-05 27.7 24.9 407 1-442 15-486 (486) 178 protein:vir:97171 Length: 512 85.6 0.052 3.2E-05 27.6 25.1 394 1-433 54-512 (512) 179 protein:vir:105889 Length: 474 83.6 0.067 4.1E-05 27.0 26.7 394 1-433 1-474 (474) 180 protein:vir:94101 Length: 474 83.6 0.067 4.1E-05 27.0 26.7 394 1-433 1-474 (474) 181 protein:vir:4898 Length: 502 # 81.5 0.085 5.3E-05 26.4 26.3 399 1-443 31-502 (502) 182 protein:vir:96240 Length: 511 79.3 0.11 6.6E-05 25.9 25.2 393 1-432 54-511 (511) 183 protein:vir:5961 Length: 503 # 77.6 0.12 7.6E-05 25.6 30.2 391 1-435 13-503 (503) 184 protein:vir:9306 Length: 511 # 77.2 0.13 7.9E-05 25.5 24.4 393 1-432 54-511 (511) 185 protein:vir:6596 Length: 521 # 75.3 0.15 9.2E-05 25.1 15.0 403 3-425 1-521 (521) 186 protein:vir:103951 Length: 511 73.4 0.17 0.00011 24.8 26.0 407 1-432 39-511 (511) 187 protein:vir:100598 Length: 516 68.5 0.24 0.00015 24.0 16.9 402 1-426 1-516 (516) 188 protein:vir:9568 Length: 410 # 66.9 0.26 0.00016 23.8 25.4 348 11-414 1-410 (410) 189 protein:vir:81017 Length: 521 63.3 0.32 0.0002 23.3 15.4 403 3-434 1-521 (521) 190 protein:vir:108049 Length: 524 62.7 0.33 0.0002 23.2 20.2 404 1-426 1-524 (524) 191 protein:vir:96366 Length: 511 60.2 0.38 0.00023 22.9 24.9 407 1-439 39-511 (511) 192 protein:vir:78805 Length: 511 60.2 0.38 0.00023 22.9 24.9 407 1-439 39-511 (511) 193 protein:vir:99522 Length: 470 59.5 0.39 0.00024 22.8 26.7 374 1-431 39-470 (470) 194 protein:vir:106639 Length: 481 56.9 0.45 0.00028 22.5 26.3 383 1-432 44-481 (481) 195 protein:vir:97336 Length: 492 55.6 0.47 0.00029 22.4 27.3 376 1-431 57-492 (492) 196 protein:vir:38 Length: 496 # N 55.4 0.48 0.0003 22.3 22.7 395 3-431 1-496 (496) 197 protein:vir:1236 Length: 483 # 51.3 0.59 0.00036 21.9 26.2 376 1-435 48-483 (483) 198 protein:vir:80959 Length: 499 51.0 0.59 0.00037 21.8 25.4 397 1-431 3-499 (499) 199 protein:vir:6896 Length: 523 # 48.7 0.66 0.00041 21.6 13.2 403 1-434 1-523 (523) 200 protein:vir:99781 Length: 
511 46.0 0.75 0.00047 21.3 24.5 406 1-431 39-511 (511) 201 protein:vir:93747 Length: 472 43.9 0.83 0.00051 21.0 26.7 377 1-431 37-472 (472) 202 protein:vir:2732 Length: 501 # 41.9 0.91 0.00056 20.8 26.6 407 1-443 37-501 (501) 203 protein:vir:94805 Length: 492 37.3 1.1 0.0007 20.3 28.2 375 1-431 57-492 (492) 204 protein:vir:3964 Length: 453 # 36.8 1.2 0.00072 20.2 26.3 370 1-432 30-453 (453) 205 protein:vir:80680 Length: 441 33.4 1.4 0.00084 19.9 24.8 363 1-423 6-441 (441) 206 protein:vir:106282 Length: 521 29.1 1.7 0.001 19.3 15.6 403 1-434 1-521 (521) 207 protein:vir:106571 Length: 499 27.0 1.9 0.0012 19.1 29.4 398 1-439 1-499 (499) 208 protein:vir:95806 Length: 440 26.0 2 0.0012 18.9 23.9 379 3-431 1-440 (440) 209 protein:vir:105154 Length: 525 26.0 2 0.0012 18.9 15.3 410 1-446 47-525 (525) 210 protein:vir:103458 Length: 524 24.8 2.1 0.0013 18.8 14.0 402 1-434 1-524 (524) 211 protein:vir:95113 Length: 474 23.3 2.3 0.0014 18.6 26.9 375 1-439 40-474 (474) 212 protein:vir:96494 Length: 501 22.8 2.4 0.0015 18.5 26.4 407 1-437 38-501 (501) No 1 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=100.00 E-value=5e-88 Score=499.22 Aligned_cols=414 Identities=14% Similarity=0.107 Sum_probs=304.3 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) |||||||+++|+++++........+. ...++....+ .+..+.. 
++...++++++||+||++||++||+|||++|| T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~---~~~~~~~~~g-~~~~~~~-v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~ 75 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNK---DDEKLLEWLG-ISPSTIS-VKGKNALKVATVFACIKILSESVSKLPLKIYQ 75 (432) T ss_pred CChHHHHHHhcCccccCcccccccCC---chHHHHHHhC-CCcCccc-cchhhhhccHHHHHHHHHHHHhhccCceEEEE Confidence 99999999999987765433222221 1112222222 2222223 45567889999999999999999999999999 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCc- Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQ- 159 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 159 (447) +++++. .+..+|++++||+.+||++||+++||+.++.+++++||||+++.++..+....++++.+ ....+.....+. T Consensus 76 ~~~~~~-~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~-~~v~v~~d~~~~~ 153 (432) T protein:vir:10 76 EDEYGI-QRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDA-SKVTVYIDDVGLL 153 (432) T ss_pred ecCCce-eeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcC-ceeEEEEcCcccc Confidence 876664 45789999999999999999999999999999999999999999988776554444444 333333221111 Q ss_pred -eEEEEeeecccccceeeeccccccccccc--ccccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCCc Q lcl|NC_010576. 160 -VMVRVWNDNTGLEQDLLVSKENCIIIESP--FYAILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPYS 231 (447) Q Consensus 160 -~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~ 231 (447) .....++.....+....+++++|+|++.+ ..+.. +.+.+..+...+....+ ...+.||++|+|+|++++. 
T Consensus 154 ~~~~~~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~--G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~ 231 (432) T protein:vir:10 154 NSKTKMWYVVNTGGQQRVLKPEEILHFKNGITLDGLV--GVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGD 231 (432) T ss_pred cccceEEEEEecCCeEEEEccccEEEecCCCCCCCcc--cccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCC Confidence 11122222223344567889999999853 22221 12333333344333322 2345789999999999998 Q ss_pred CChHHHHHHHHHHHHHHHHHhc--cCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhc----CC Q lcl|NC_010576. 232 TKSTARAAQAARRKQEIENEMA--NNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILN----GT 304 (447) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~--~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~----g~ 304 (447) +++++.+ ++++.|.+.+. .|+++++||++|++|+++++++++++ ++.+++++++||++|||||++|+ ++ T Consensus 232 l~~e~~~----~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~ 307 (432) T protein:vir:10 232 LNEDAKK----VFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT 307 (432) T ss_pred CCHHHHH----HHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC Confidence 8876544 45555555554 47899999999999999999988865 68899999999999999999996 23 Q ss_pred --cHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh Q lcl|NC_010576. 
305 --ANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELT 382 (447) Q Consensus 305 --~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~ 382 (447) +.|++.++|+++||.||+++||++||+|||++.++..|++|+||++.|+++|.+++++++.+++++|++|+||+|+++ T Consensus 308 ~s~~e~~~~~~~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~ 387 (432) T protein:vir:10 308 LNNIEQQQQQFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKE 387 (432) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHh Confidence 348999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCC Q lcl|NC_010576. 383 GKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPL 428 (447) Q Consensus 383 gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (447) ||||+||+ ...++++|+.+....++...++++.+++...+++.+. T Consensus 388 g~~pi~gg-D~~~~~~n~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 388 DLPPEAGG-DRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred CCCCCCCC-CeEeecccccchhhccccccCCCCCCCCCCCCCCCCC Confidence 99999884 2346788888877655443333332222222221111 No 2 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=100.00 E-value=5e-88 Score=499.22 Aligned_cols=414 Identities=14% Similarity=0.107 Sum_probs=304.3 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) |||||||+++|+++++........+. ...++....+ .+..+.. 
++...++++++||+||++||++||+|||++|| T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~---~~~~~~~~~g-~~~~~~~-v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~ 75 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNK---DDEKLLEWLG-ISPSTIS-VKGKNALKVATVFACIKILSESVSKLPLKIYQ 75 (432) T ss_pred CChHHHHHHhcCccccCcccccccCC---chHHHHHHhC-CCcCccc-cchhhhhccHHHHHHHHHHHHhhccCceEEEE Confidence 99999999999987765433222221 1112222222 2222223 45567889999999999999999999999999 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCc- Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQ- 159 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 159 (447) +++++. .+..+|++++||+.+||++||+++||+.++.+++++||||+++.++..+....++++.+ ....+.....+. T Consensus 76 ~~~~~~-~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~-~~v~v~~d~~~~~ 153 (432) T protein:vir:10 76 EDEYGI-QRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDA-SKVTVYIDDVGLL 153 (432) T ss_pred ecCCce-eeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcC-ceeEEEEcCcccc Confidence 876664 45789999999999999999999999999999999999999999988776554444444 333333221111 Q ss_pred -eEEEEeeecccccceeeeccccccccccc--ccccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCCc Q lcl|NC_010576. 160 -VMVRVWNDNTGLEQDLLVSKENCIIIESP--FYAILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPYS 231 (447) Q Consensus 160 -~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~ 231 (447) .....++.....+....+++++|+|++.+ ..+.. +.+.+..+...+....+ ...+.||++|+|+|++++. 
T Consensus 154 ~~~~~~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~--G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~ 231 (432) T protein:vir:10 154 NSKTKMWYVVNTGGQQRVLKPEEILHFKNGITLDGLV--GVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGD 231 (432) T ss_pred cccceEEEEEecCCeEEEEccccEEEecCCCCCCCcc--cccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCC Confidence 11122222223344567889999999853 22221 12333333344333322 2345789999999999998 Q ss_pred CChHHHHHHHHHHHHHHHHHhc--cCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhc----CC Q lcl|NC_010576. 232 TKSTARAAQAARRKQEIENEMA--NNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILN----GT 304 (447) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~--~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~----g~ 304 (447) +++++.+ ++++.|.+.+. .|+++++||++|++|+++++++++++ ++.+++++++||++|||||++|+ ++ T Consensus 232 l~~e~~~----~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~ 307 (432) T protein:vir:10 232 LNEDAKK----VFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT 307 (432) T ss_pred CCHHHHH----HHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC Confidence 8876544 45555555554 47899999999999999999988865 68899999999999999999996 23 Q ss_pred --cHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh Q lcl|NC_010576. 
305 --ANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELT 382 (447) Q Consensus 305 --~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~ 382 (447) +.|++.++|+++||.||+++||++||+|||++.++..|++|+||++.|+++|.+++++++.+++++|++|+||+|+++ T Consensus 308 ~s~~e~~~~~~~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~ 387 (432) T protein:vir:10 308 LNNIEQQQQQFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKE 387 (432) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHh Confidence 348999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCC Q lcl|NC_010576. 383 GKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPL 428 (447) Q Consensus 383 gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (447) ||||+||+ ...++++|+.+....++...++++.+++...+++.+. T Consensus 388 g~~pi~gg-D~~~~~~n~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 388 DLPPEAGG-DRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred CCCCCCCC-CeEeecccccchhhccccccCCCCCCCCCCCCCCCCC Confidence 99999884 2346788888877655443333332222222221111 No 3 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=100.00 E-value=5e-88 Score=499.22 Aligned_cols=414 Identities=14% Similarity=0.107 Sum_probs=304.3 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) |||||||+++|+++++........+. ...++....+ .+..+.. 
++...++++++||+||++||++||+|||++|| T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~---~~~~~~~~~g-~~~~~~~-v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~ 75 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNK---DDEKLLEWLG-ISPSTIS-VKGKNALKVATVFACIKILSESVSKLPLKIYQ 75 (432) T ss_pred CChHHHHHHhcCccccCcccccccCC---chHHHHHHhC-CCcCccc-cchhhhhccHHHHHHHHHHHHhhccCceEEEE Confidence 99999999999987765433222221 1112222222 2222223 45567889999999999999999999999999 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCc- Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQ- 159 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 159 (447) +++++. .+..+|++++||+.+||++||+++||+.++.+++++||||+++.++..+....++++.+ ....+.....+. T Consensus 76 ~~~~~~-~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~-~~v~v~~d~~~~~ 153 (432) T protein:vir:10 76 EDEYGI-QRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDA-SKVTVYIDDVGLL 153 (432) T ss_pred ecCCce-eeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcC-ceeEEEEcCcccc Confidence 876664 45789999999999999999999999999999999999999999988776554444444 333333221111 Q ss_pred -eEEEEeeecccccceeeeccccccccccc--ccccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCCc Q lcl|NC_010576. 160 -VMVRVWNDNTGLEQDLLVSKENCIIIESP--FYAILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPYS 231 (447) Q Consensus 160 -~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~ 231 (447) .....++.....+....+++++|+|++.+ ..+.. +.+.+..+...+....+ ...+.||++|+|+|++++. 
T Consensus 154 ~~~~~~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~--G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~ 231 (432) T protein:vir:10 154 NSKTKMWYVVNTGGQQRVLKPEEILHFKNGITLDGLV--GVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGD 231 (432) T ss_pred cccceEEEEEecCCeEEEEccccEEEecCCCCCCCcc--cccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCC Confidence 11122222223344567889999999853 22221 12333333344333322 2345789999999999998 Q ss_pred CChHHHHHHHHHHHHHHHHHhc--cCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhc----CC Q lcl|NC_010576. 232 TKSTARAAQAARRKQEIENEMA--NNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILN----GT 304 (447) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~--~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~----g~ 304 (447) +++++.+ ++++.|.+.+. .|+++++||++|++|+++++++++++ ++.+++++++||++|||||++|+ ++ T Consensus 232 l~~e~~~----~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~ 307 (432) T protein:vir:10 232 LNEDAKK----VFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT 307 (432) T ss_pred CCHHHHH----HHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC Confidence 8876544 45555555554 47899999999999999999988865 68899999999999999999996 23 Q ss_pred --cHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh Q lcl|NC_010576. 
305 --ANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELT 382 (447) Q Consensus 305 --~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~ 382 (447) +.|++.++|+++||.||+++||++||+|||++.++..|++|+||++.|+++|.+++++++.+++++|++|+||+|+++ T Consensus 308 ~s~~e~~~~~~~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~ 387 (432) T protein:vir:10 308 LNNIEQQQQQFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKE 387 (432) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHh Confidence 348999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCC Q lcl|NC_010576. 383 GKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPL 428 (447) Q Consensus 383 gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (447) ||||+||+ ...++++|+.+....++...++++.+++...+++.+. T Consensus 388 g~~pi~gg-D~~~~~~n~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 388 DLPPEAGG-DRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred CCCCCCCC-CeEeecccccchhhccccccCCCCCCCCCCCCCCCCC Confidence 99999884 2346788888877655443333332222222221111 No 4 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=100.00 E-value=8.2e-87 Score=492.54 Aligned_cols=410 Identities=12% Similarity=0.073 Sum_probs=296.9 Q ss_pred CchhHhhhhhcccccCCcccccccccccccccccc-ccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMT-SFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHL 79 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~ 79 (447) ||+|+|++.+|....+. ........++..|.. .++...+.++.. 
++..+++++++||+||++||++||+|||++| T Consensus 7 mg~f~r~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~g~~-v~~~~al~~~~V~~~i~~Ia~~ia~lp~~~y 82 (432) T protein:vir:81 7 LGLFGQLKAMFVPPDPV---DIGGGQTFTPVNATARDLGIIISDTGAA-VNADAIMRLDAVAACVKLVSQAIAAMPLTMY 82 (432) T ss_pred cchhhhhhhhccccccc---ccccccccccCccchhhhcccccccCcc-cchHhhhccHHHHHHHHHHHHhhhhCceeeE Confidence 99999987766433222 111111122222221 122333333333 4566799999999999999999999999999 Q ss_pred EEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCc Q lcl|NC_010576. 80 KIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQ 159 (447) Q Consensus 80 r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (447) +++++|. .+..+|++++||+.+||++||+++||+.++.+++++||||+++.+++ +.+ ..+++.+.....+.....+. T Consensus 83 ~~~~~g~-~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~~-g~~-~~L~~l~~~~v~v~~~~~g~ 159 (432) T protein:vir:81 83 MRTPDGR-KEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTD-GRI-ESLQYLANDRLTITTDPKGN 159 (432) T ss_pred EecCCcc-eecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecC-CcE-EEEEEEcCCceEEEECCCCc Confidence 9888765 44679999999999999999999999999999999999999998875 333 34444444444444433444 Q ss_pred eEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCCcCCh Q lcl|NC_010576. 160 VMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPYSTKS 234 (447) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~~~~ 234 (447) ..+++.. ..+....++.++|+|++.+..++... 
.+.+..+...+....+ ...+.||++++||+++++.+++ T Consensus 160 ~~y~~~~---~~g~~~~~~~~~iih~r~~~~dg~~G-~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~ 235 (432) T protein:vir:81 160 TAYRYRR---TDGQMIDIPKQQIWKIMGYSLDGENG-LSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTD 235 (432) T ss_pred EEEEEEe---cCceEEEEccccEEEecCCCCCCccc-ccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCH Confidence 4443322 23445678899999999754433222 2333444444433322 2335789999999999998876 Q ss_pred HHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC---------C Q lcl|NC_010576. 235 TARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG---------T 304 (447) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g---------~ 304 (447) + ++++++++|.. ..|+|++++|++|++|+++++++++++ ++.+++++++||++|||||++||. + T Consensus 236 e----~~~~~~~~~~~--~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~s 309 (432) T protein:vir:81 236 D----QYDSFAKKVSG--SVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGS 309 (432) T ss_pred H----HHHHHHHHHhh--hhcCCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccc Confidence 5 44556666643 357899999999999999999998865 688999999999999999999962 3 Q ss_pred cHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Q lcl|NC_010576. 
305 ANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGK 384 (447) Q Consensus 305 ~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl 384 (447) +.||+.++|+++||.||+++||++|++|||++.++ .+++|+||++.|+++|.++|++++.+++++|+||+||+|+++|+ T Consensus 310 n~eq~~~~f~~~tl~P~~~~ie~~l~~kLl~~~~~-~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~NE~R~~~gl 388 (432) T protein:vir:81 310 GIESQQLGFLTMTLSPWLRRIEQSIALNLLSPAER-RRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGL 388 (432) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcccc-CceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCC Confidence 35899999999999999999999999999999875 47999999999999999999999999999999999999999999 Q ss_pred CCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCcccccc Q lcl|NC_010576. 385 APHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTSA 435 (447) Q Consensus 385 ~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (447) ||++|+....+++.++.++...+....+. +..++.+..+.+.++ T Consensus 389 pp~~g~~~~~~~~~~~~pl~~~~~~~~~~-------~~~~~~n~~~~~~~~ 432 (432) T protein:vir:81 389 PKLGGNAAVLTVQSAMVPLDSIGLQASPE-------PASGLGNQQQDKVSK 432 (432) T ss_pred CCCCCCcceEeecCcccchhhhccCCCCC-------CCCCCCCcccccccC Confidence 99987633334677877776554332211 111111111112222 No 5 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=100.00 E-value=1.8e-86 Score=490.65 Aligned_cols=410 Identities=13% Similarity=0.071 Sum_probs=298.9 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccc-cccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGM-TSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHL 79 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~ 79 (447) ||+|+|++.+|..-.+......... ++..+. ..+++..+.++.. 
++..+++++++||+||++||++||+|||++| T Consensus 7 ~g~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~g~~-v~~~~a~~~~aV~~~v~~Ia~~ia~lp~~~y 82 (432) T protein:vir:97 7 LGLLGQLKAMFVPPDPVDIGGGQTF---TPVNATARDLGIIISDTGAA-VNADAIMRLDAVAACVKLVSQAVAAMPLMMY 82 (432) T ss_pred CchhhhhHhhcCCcccccccccccc---ccCchhhhhhcccccccCcc-cchHhhhcchHHHHHHHHHHHhhccCceEEE Confidence 9999998876644333221111111 222221 1123333344444 4556799999999999999999999999999 Q ss_pred EEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCc Q lcl|NC_010576. 80 KIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQ 159 (447) Q Consensus 80 r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (447) |++++|. .+..+||+++||+.+||++||+++||+.++.+++++||||+++.+++ +.+ ..+++++.....+.....+. T Consensus 83 ~~~~~g~-~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~~-g~~-~~L~~l~p~~v~v~~~~~g~ 159 (432) T protein:vir:97 83 MRTPDGR-KEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTD-GRI-ESLQYLANDRLTITTDTKGN 159 (432) T ss_pred EecCCCc-ccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecC-CcE-EEEEEEcCcceEEEEcCCCc Confidence 9887765 45689999999999999999999999999999999999999999875 333 34555554444444434445 Q ss_pred eEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHH-----HHHHHhhcCcccceeeeCCcCCh Q lcl|NC_010576. 160 VMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMN-----SQDNRASSGKLNGFIQFPYSTKS 234 (447) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~n~~~~~gvl~~~~~~~~ 234 (447) ..+++.. ..+..+.++.++|+|++.+..++... .+.+..+...+.... 
+...+.||++++|||++++.+++ T Consensus 160 ~~y~~~~---~~g~~~~~~~~~iih~r~~~~dg~~G-~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~l~~ 235 (432) T protein:vir:97 160 TAYRYRR---TDGQMIDIPRQQIWKIMGYSLDGENG-LSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTD 235 (432) T ss_pred EEEEEEe---cCceEEEEccccEEEecCcCCCCccc-ccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecCCCCCH Confidence 4444332 22344678899999999754433222 233344444443322 23345789999999999998875 Q ss_pred HHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC---------C Q lcl|NC_010576. 235 TARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG---------T 304 (447) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g---------~ 304 (447) + ++++|++.|.. ..|+++++||++|++|+++++++.+++ ++.+++++++||++|||||++||. + T Consensus 236 e----~~~~~~~~~~~--~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s 309 (432) T protein:vir:97 236 D----QYDSFSKKVSG--SVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGS 309 (432) T ss_pred H----HHHHHHHHHhh--hhcCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccch Confidence 5 45566666643 357899999999999999999998865 688999999999999999999962 3 Q ss_pred cHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Q lcl|NC_010576. 
305 ANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGK 384 (447) Q Consensus 305 ~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl 384 (447) +.|++.++|+++||.||+++||++|+++||++.++ .+++|+||++.|+++|.++|++++.+++++||||+||+|+++|+ T Consensus 310 ~~e~~~~~f~~~tl~P~~~~ie~~ln~kLl~~~e~-~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl 388 (432) T protein:vir:97 310 GIESQQLGFLTMTLSPWLRRIEQSIALNLLTPAER-RRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGL 388 (432) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhhhccCcccc-CceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCC Confidence 35899999999999999999999999999999875 47899999999999999999999999999999999999999999 Q ss_pred CCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCcccccc Q lcl|NC_010576. 385 APHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTSA 435 (447) Q Consensus 385 ~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (447) ||++|+....+++.++.++...+.+..+. +..++.+..+.+.+. T Consensus 389 pp~~g~~~~~~~~~~~~pl~~~~~~~~~~-------~~~~~~~~~~~~~~~ 432 (432) T protein:vir:97 389 PKLGGNAAVLTVQSAMVPLDSIGLQASPE-------PASGLGNQQQDKVSK 432 (432) T ss_pred CCCCCCcceEeecccccchhhhcccCCCC-------CCCCCCCcccccccC Confidence 99987533234677777776654432211 112222222222222 No 6 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=100.00 E-value=2.6e-86 Score=489.79 Aligned_cols=410 Identities=12% Similarity=0.072 Sum_probs=298.3 Q ss_pred CchhHhhhhhcccccCCcccccccccccccccccc-ccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMT-SFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHL 79 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~ 79 (447) ||+|+|++.+|....+......... ++..+.. .+++..+.++.. 
++..+++++++||+||++||++||+|||++| T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~s~~g~~-v~~~~al~~~~V~~~i~~Ia~~ia~lp~~~y 82 (432) T protein:vir:10 7 LGLLGQLKAMFVPPDPVDIGGGQTF---TPVNATARDLGIIISDTGAA-VNADAIMRLDAVAACVKLVSQAIAAMPLTMY 82 (432) T ss_pred cchhhhhHhhcCCcccccccccccc---ccCcchhhhhcccccccCcc-cchhhhhcchHHHHHHHHHHHhhhhCceeEE Confidence 9999998877654333222111111 2222211 123333334444 4556799999999999999999999999999 Q ss_pred EEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCc Q lcl|NC_010576. 80 KIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQ 159 (447) Q Consensus 80 r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (447) |++++|. .+..+||+++||+.+||++||+++||+.++.+++++||||+++.++. +... .+++++.....+.....+. T Consensus 83 ~~~~~g~-~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~~-g~~~-~L~~l~~~~v~v~~~~~g~ 159 (432) T protein:vir:10 83 MRTPDGR-KEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTD-GRIE-SLQYLANDRLTITTDTKGN 159 (432) T ss_pred EecCCCc-ccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecC-CcEE-EEEEEcCCceEEEEcCCCc Confidence 9887765 45679999999999999999999999999999999999999998874 3333 4444444444444434444 Q ss_pred eEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHH-----HHHHHhhcCcccceeeeCCcCCh Q lcl|NC_010576. 160 VMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMN-----SQDNRASSGKLNGFIQFPYSTKS 234 (447) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~n~~~~~gvl~~~~~~~~ 234 (447) ..++++. ..+..+.++.++|+|++.+..++.... +.+..+...+.... 
+...+.||++++||+++++.+++ T Consensus 160 ~~y~~~~---~~g~~~~~~~~~iih~~~~~~dg~~G~-spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~ 235 (432) T protein:vir:10 160 TAYRYRR---TDGQMIDIPKQQIWKIMGYSLDGENGL-SAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTD 235 (432) T ss_pred EEEEEEe---cCceEEEEcCccEEEecCCCCCCcccc-cHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCH Confidence 4444332 234456788999999997544333222 33333444443322 23346789999999999998876 Q ss_pred HHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC---------C Q lcl|NC_010576. 235 TARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG---------T 304 (447) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g---------~ 304 (447) ++ +++++++|.. ..|+++++||++|++|+++++++++++ ++.+++++++||++|||||++||. + T Consensus 236 e~----~~~~~~~~~~--~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~s 309 (432) T protein:vir:10 236 DQ----YDSFAKKVSG--SVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGS 309 (432) T ss_pred HH----HHHHHHHHhh--hhhCCCceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccc Confidence 54 5566676643 457899999999999999999998865 688999999999999999999962 3 Q ss_pred cHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Q lcl|NC_010576. 
305 ANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGK 384 (447) Q Consensus 305 ~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl 384 (447) +.|++.++|+++||.||+++||++|+++||++.++ .+++|+||++.|+++|.++|++++.+++++|+||+||+|+++|| T Consensus 310 n~e~~~~~f~~~tl~P~~~~ie~~ln~kL~~~~~~-~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl 388 (432) T protein:vir:10 310 GIESQQLGFLSMTLSPWLRRIEQSIALNLLSPAER-RRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGL 388 (432) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcccc-CceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCC Confidence 35899999999999999999999999999998875 46899999999999999999999999999999999999999999 Q ss_pred CCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCcccccc Q lcl|NC_010576. 385 APHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTSA 435 (447) Q Consensus 385 ~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (447) ||++|+....+++.++.++...+.+..+. +..++.+..+.+.+. T Consensus 389 ppi~g~~~~~~~~~~~~pl~~~~~~~~~~-------~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 389 PKLGGNAAVLTVQSAMVPLDSIGLQASPE-------PASGLGNQQQDKVSK 432 (432) T ss_pred CCCCCCcceEeecCcccchhhhcccCCCC-------CCCCCCCcccccccC Confidence 99997633334677777776554332211 111111111111112 No 7 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=100.00 E-value=9.5e-86 Score=486.71 Aligned_cols=411 Identities=14% Similarity=0.110 Sum_probs=298.4 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||+|.| +|++++++.......+ ....++..++|. +.++.. 
++...++++++|++||++||++||+|||++|| T Consensus 1 M~~~~~---~f~~~~r~~~~~~~~~---~~~~~~~~~~g~-~~~~~~-v~~~~al~~~~v~~~i~~ia~~ia~l~~~~~~ 72 (429) T protein:vir:10 1 MDSVKK---FFNFEKRQTSQVIELN---KDDEKLLEWLGI-SPSTIS-VKGKNALKVATVFACIKILSESVSKLPLKIYQ 72 (429) T ss_pred Cchhhh---hhcccccCcccccccC---CChHHHHHHhcC-CCCcce-echhhhhccHHHHHHHHHHHHhhccCceEEEE Confidence 999877 4555555433222221 122223333332 223333 44567889999999999999999999999999 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) +++++. ++..+|++++||+.+||++||+++||+.++.+++++||||+++.++..+.+..++++ +.....+.....+.. T Consensus 73 ~~~~~~-~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i-~~~~v~v~~~~~~~~ 150 (429) T protein:vir:10 73 EDEYGI-QRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPI-DASKVTVYIDDVGLL 150 (429) T ss_pred ecCCce-eeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEE-cCceeEEEEcCcccc Confidence 876664 467899999999999999999999999999999999999999999887765444444 444333332211111 Q ss_pred --EEEEeeecccccceeeeccccccccccccc-ccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCCcC Q lcl|NC_010576. 161 --MVRVWNDNTGLEQDLLVSKENCIIIESPFY-AILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPYST 232 (447) Q Consensus 161 --~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~~ 232 (447) ....++.....+....+++++|+|++.+.. +.. 
.+.+.+..+...+....+ ...+.||++++|+|++++.+ T Consensus 151 ~~~~~~~~~~~~~g~~~~~~~~evih~~~~~~~~~~-~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l 229 (429) T protein:vir:10 151 NSKTKMWYVVNTGGQQRVLKPEEILHFKNGITLDGL-VGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDL 229 (429) T ss_pred cccceEEEEEccCCeEEEEccccEEEecCCCCCCCc-ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCC Confidence 112222222334456788999999985421 111 112333333333333322 23357899999999999988 Q ss_pred ChHHHHHHHHHHHHHHHHHhc--cCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC----C- Q lcl|NC_010576. 233 KSTARAAQAARRKQEIENEMA--NNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG----T- 304 (447) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~--~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g----~- 304 (447) ++++.+ ++++.|++.+. .|+++++||++|++|+++++++.+++ ++.+++.+++||++|||||++|++ + T Consensus 230 ~~e~~~----~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~ 305 (429) T protein:vir:10 230 NEDAKK----VFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATL 305 (429) T ss_pred CHHHHH----HHHHHHHHHhccccccCceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCc Confidence 876544 44555555544 47899999999999999999998865 688999999999999999999962 2 Q ss_pred -cHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhC Q lcl|NC_010576. 
305 -ANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTG 383 (447) Q Consensus 305 -~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g 383 (447) +.|++.++|+++||.||+++||++||+|||++.++..|++|+||++.|+++|++++++++.+++++|+||+||+|+++| T Consensus 306 sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl~~~~~~~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g 385 (429) T protein:vir:10 306 NNIEQQQQQFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKED 385 (429) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhC Confidence 3489999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCCC Q lcl|NC_010576. 384 KAPHPNPLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDPL 428 (447) Q Consensus 384 l~p~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (447) +||+|| +|. ++++|+.+....++...++++.+++...++++.. T Consensus 386 l~p~~g--gD~~~~~~n~~~~d~~~~~~~k~g~~~~~~~~~~~e~~ 429 (429) T protein:vir:10 386 LPPEAG--GDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKEGNEGN 429 (429) T ss_pred CCCCCC--cCeeeecccccchhhccccccCCCCCCCCCCCCCCCCC Confidence 999987 454 6788888876654433333333222222221111 No 8 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=100.00 E-value=1.8e-85 Score=485.21 Aligned_cols=399 Identities=12% Similarity=0.059 Sum_probs=287.9 Q ss_pred CchhHhhhhhcccccCCccccc---ccccc----cccccc------ccccccccccCCcccccchhhhhhHHHHHHHHHH Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQ---NTNDF----LTPSNG------MTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRI 67 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~---~~~~~----~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~i 67 (447) ||+|||+++.....+...+..+ ....+ .+..++ +..+.+....++.. 
++...++++++|++||++| T Consensus 1 Mgl~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~-v~~~~al~~~~V~~ci~~I 79 (431) T protein:vir:10 1 MGLFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRGELNGGT-GRETRALRNMAVLRCVTLI 79 (431) T ss_pred CcchhhhhcCcccccccccccccccccccccccccccccccccchHHHHhhccCccCcce-echhhhhccHHHHHHHHHH Confidence 9999998875444332222111 01111 111111 11112222333334 4556788999999999999 Q ss_pred HHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccC Q lcl|NC_010576. 68 ALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTA 147 (447) Q Consensus 68 a~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~ 147 (447) |++||+|||++||.+ ++.+...+|++++||+.+||++||+++||+.++.+++++||||+++.++. +.+.. +++.+. T Consensus 80 a~~iA~lp~~v~~~~--~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~-g~~~~-L~pl~~ 155 (431) T protein:vir:10 80 SGTIGMLPMNLISSD--DSKQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWSG-NRPIR-LIPMDR 155 (431) T ss_pred HHhhccCceEEEEec--CceeeeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC-CceEE-EEEEcC Confidence 999999999999864 33345689999999999999999999999999999999999999999985 33333 444444 Q ss_pred CCcceeeecCCceEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHH-----HHHhhcCcc Q lcl|NC_010576. 148 RVGKIMQFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQ-----DNRASSGKL 222 (447) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~n~~~~ 222 (447) ....+.....+.+.+.+. ...+..+.++.++|+|++++..++... .+.+..+...+.+..+. 
..+.||+++ T Consensus 156 ~~v~~~~~~~~~~~y~~~---~~~g~~~~~~~~dViHir~~~~dg~~G-~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p 231 (431) T protein:vir:10 156 GSAKGRLTSTWQIVYDYT---TPTGDKIELPAREVFHLRDLSIDGVSG-VSRVKLSGNALELAEQAERAASRTFRTGVMA 231 (431) T ss_pred ceeEEEEcCCCeEEEEEE---eCCceEEEEchhhEEEecCcCCCCccc-ccHHHHHHHHHHHHHHHHHHHHHHHhccCCc Confidence 433333333333333322 223445678899999999765443322 23344444444443332 335789999 Q ss_pred cceeeeCCcCChHHHHHHHHHHHHHHHHHhc--cCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHH Q lcl|NC_010576. 223 NGFIQFPYSTKSTARAAQAARRKQEIENEMA--NNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEA 299 (447) Q Consensus 223 ~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~--~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~ 299 (447) +|||++++.+++++.+ ++++.|.+.++ +|+|+++||++|++|+++++++++++ ++.+++++++||++|||||+ T Consensus 232 ~gil~~~~~ls~e~~~----~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~ 307 (431) T protein:vir:10 232 GGAIEVPKELSDNAYG----RMKASVQENHTGSENAGSWMLLEEGATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRP 307 (431) T ss_pred cEEEecCCCCCHHHHH----HHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHH Confidence 9999999988876544 45555555554 48899999999999999999998865 68899999999999999999 Q ss_pred HhcC------CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCC-- Q lcl|NC_010576. 
300 ILNG------TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNA-- 371 (447) Q Consensus 300 ~l~g------~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G-- 371 (447) +|++ ++.||+.++|+++||.||+++||++||++||++.++ .+++|+||++.|+|+|.++|+++|.+++..| T Consensus 308 ~lg~~~~~t~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~-~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~ 386 (431) T protein:vir:10 308 LLMMDDTSWGSGIEQLAIFFIQYGLSHWFVSWEQAAARAFLPEKML-GQRQFKFNEGALLRGTLNDQAAFFSKALGAGGQ 386 (431) T ss_pred HhCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhc-CCceEEEechhhhccCHHHHHHHHHHHHhcccc Confidence 9974 345899999999999999999999999999998765 5789999999999999999999999998765 Q ss_pred --CcCHHHHHHHhCCCCCCCcccccccc-ccccchhhcccccCCCCCCCCCCCcCCC Q lcl|NC_010576. 372 --IYTPNEIRELTGKAPHPNPLANELFN-RNIADGNQVGGINTPGQITSDQPATAST 425 (447) Q Consensus 372 --~~t~NE~R~~~gl~p~~g~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 425 (447) |||+||+|+++||||++|+++|.+.. .+... ....+++| .++ T Consensus 387 ~g~lT~NE~R~~~gl~p~~~~~gD~~~~p~n~~~-----------~~~~~~~p-~~~ 431 (431) T protein:vir:10 387 SPWMKQNEVREMLDLPRADDPVADQLRNPMTQKQ-----------KGSGDEPP-ATT 431 (431) T ss_pred cCccCHHHHHHHhCCCCCCCccccceeccccccc-----------CCCCCCCC-CCC Confidence 59999999999999999999998653 33211 11111121 111 No 9 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=100.00 E-value=4.7e-85 Score=482.91 Aligned_cols=431 Identities=10% Similarity=0.070 Sum_probs=288.3 Q ss_pred CchhHhhhhhcccccCCcccc--ccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEE Q lcl|NC_010576. 
1 MASSDRLLHSWNAFQSNQNQN--QNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKH 78 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~ 78 (447) ||||++|++ ++.... ......+.+..+....+++.+.++.. ++...++++++||+||++||++||+|||++ T Consensus 1 Mg~~~~l~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~-v~~~~al~~~~v~~~i~~ia~~iA~lp~~~ 73 (457) T protein:vir:62 1 MGFWSALFG------RGHSPALDAAEGRAWEPYDPSIYNLGATASSGER-VTPHDALQVSAVFASVRLLSETIATLPLST 73 (457) T ss_pred Cchhhhhhc------cccccccccccccccccchhhhhhccccccCCce-echHHhhccHHHHHHHHHHHHhHhhCceEE Confidence 999998653 221111 11111112211111123333344444 445678999999999999999999999999 Q ss_pred EEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCC Q lcl|NC_010576. 79 LKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPR 158 (447) Q Consensus 79 ~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (447) ||.++ ++.+.+.+|.+..|| .+||++||+++||+.++.+++++||||+++.++. +.+.. +++++.....+.....+ T Consensus 74 ~~~~~-~~~~~~~~~~~~~ll-~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~~-g~~~~-l~~l~p~~v~v~~~~~~ 149 (457) T protein:vir:62 74 YSKRG-GTRKEIDTPEWLDFP-NAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWAG-PNIAG-LDVLDPTKIHVHMVMVD 149 (457) T ss_pred EEecC-CccccccchHHHHhc-cccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeCC-CcEEE-EEEEcCcceEEEEeccC Confidence 99764 444444444455555 5899999999999999999999999999987664 33333 34433333333322222 Q ss_pred ceE---EEEeeecccccc--eeeecccccccccccccccccchhHHHHHHHHHHHHHHHH-----HHHhhcCcccceeee Q lcl|NC_010576. 159 QVM---VRVWNDNTGLEQ--DLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQ-----DNRASSGKLNGFIQF 228 (447) Q Consensus 159 ~~~---~~~~~~~~~~~~--~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~n~~~~~gvl~~ 228 (447) ... +..|........ 
...+++++|||++.+.......+.+.+..+...+....++ ..+.||++|+|||++ T Consensus 150 ~~~~~~~~~y~~~~~g~~~~~~~~~~~eiih~r~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~ 229 (457) T protein:vir:62 150 GLRRKVFEAYDIDADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEV 229 (457) T ss_pred CccceeEEEEEEccCCceeEEEeeCccceEEecCCCCCCceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEc Confidence 211 112222211111 1246789999999765433222223344444444443332 335789999999999 Q ss_pred CCcCChHHHHHHHHHHHHHHHHHhc--cCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC-- Q lcl|NC_010576. 229 PYSTKSTARAAQAARRKQEIENEMA--NNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG-- 303 (447) Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~~~--~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g-- 303 (447) ++.+++++. +++++.|.+.++ .|+++++||++|++|+++++++.+++ ++.+++++++||++|||||++||. T Consensus 230 ~~~ls~e~~----~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~ 305 (457) T protein:vir:62 230 PGTMSEEGL----ARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDAT 305 (457) T ss_pred CCCCCHHHH----HHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC Confidence 999887654 445555655555 47899999999999999999998865 688999999999999999999962 Q ss_pred ------CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHH Q lcl|NC_010576. 304 ------TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNE 377 (447) Q Consensus 304 ------~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE 377 (447) ++.|++.++|+++||.||+++||++||++||++.+. 
.+++|+||++.|+++|.++|++++.+++++|+||+|| T Consensus 306 ~~~~~~sn~eq~~~~f~~~~l~P~~~~ie~~ln~~L~~~~~~-~~~~i~fd~~~l~~~d~~~r~~~~~~~~~~G~~T~NE 384 (457) T protein:vir:62 306 NSTSWGSGLAEQNIAFTMFSLRPWLERIEAGFNRLLFAETAD-RFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDE 384 (457) T ss_pred CcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcccc-CceEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHH Confidence 345899999999999999999999999999998775 4789999999999999999999999999999999999 Q ss_pred HHHHhCCCCCCCccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCC-CCcccccccCCccCcCCCCC Q lcl|NC_010576. 378 IRELTGKAPHPNPLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDP-LNNVSTSAIENGSLTDGGSY 447 (447) Q Consensus 378 ~R~~~gl~p~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 447 (447) +|+++||||++||++|. +++.|+.+..........++.++.+++.+...+ .+.+.+...+++..++.|.= T Consensus 385 ~R~~~gl~pi~~g~~D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~ 456 (457) T protein:vir:62 385 VRAAEDMTPLPDGLGEKYRVPLNLGEIGEEPEPEPAPAPPAIDPPAEEPADDEEPDNAEGDPDEGETEDDDD 456 (457) T ss_pred HHHHhCCCCCCCCCcceeeeccccccccccccccccCCCccCCCCccCCCCCCCCCCCCCCCcccccccccc Confidence 99999999999998887 457777765543221111111111111111111 11111122222333333222 No 10 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=100.00 E-value=7e-85 Score=481.94 Aligned_cols=391 Identities=13% Similarity=0.079 Sum_probs=292.0 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||+|+.+ +|+++.... 
....+ .+ ...+| ...++...++++++|++||++||++||++||++|| T Consensus 1 MG~~~~~~~---~~~~~~~~~-~~~~~----~~-~~~~g------~~~~~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~ 65 (411) T protein:vir:81 1 MGWWSRLTR---FFRPRNETV-DMTNP----LL-LQWLG------VDPDTPRNQLSEATYFACLKILSESLGKLPLKMYQ 65 (411) T ss_pred CchHHHHHh---hccCccccc-ccchH----HH-HHHhc------CcccChhhhhccHHHHHHHHHHHHhHhhCceeEEE Confidence 999999765 444443221 11111 11 11111 12234567889999999999999999999999999 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCc- Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQ- 159 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 159 (447) ++++|.. ++.+|++++||+.+||++||+++||+.++.+++++||||+++.++. +. ...+++.+.....+.....+. T Consensus 66 ~~~~~~~-~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~-g~-~~~l~~l~~~~v~~~~~~~~~~ 142 (411) T protein:vir:81 66 KTERGIV-KSDREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYSG-PQ-LQALWILPSQYVTIVVDDRGLL 142 (411) T ss_pred ecCCcee-eecccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC-Cc-eEEEEEECCceEEEEEcCcccc Confidence 8877654 5678999999999999999999999999999999999999999884 33 334555555554444321111 Q ss_pred ---eEEEEeeecccccceeeeccccccccccc-ccccccchhHHHHHHHHHHHHHHHH-----HHHhhcCcccceeeeCC Q lcl|NC_010576. 160 ---VMVRVWNDNTGLEQDLLVSKENCIIIESP-FYAILNDTNQTLRMLEQKIKLMNSQ-----DNRASSGKLNGFIQFPY 230 (447) Q Consensus 160 ---~~~~~~~~~~~~~~~~~~~~~~v~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~n~~~~~gvl~~~~ 230 (447) ..+.+.......+..+.++.++|+|++.+ ..+.. 
.+.+.+..+...+....++ ..+.||+.|+|+|++++ T Consensus 143 ~~~~~~~~~~~~~~~g~~~~~~~~eiih~k~~~~~~~~-~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~ 221 (411) T protein:vir:81 143 GEKNAIWYRYNDPYDGKMYVFRNDEILHFKTSVTFDGI-TGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTG 221 (411) T ss_pred cccceEEEEEEecCCceEEEEccccEEEEcCCCCCCCc-ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC Confidence 11111111222344557889999999843 22222 1223334444444443332 23478999999999999 Q ss_pred cCChHHHHHHHHHHHHHHHHHhc--cCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC---- Q lcl|NC_010576. 231 STKSTARAAQAARRKQEIENEMA--NNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG---- 303 (447) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~--~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g---- 303 (447) .+++++.+ ++++.|.+.+. +|+|+++||++|++|+++++++++++ ++.+++++++||++|||||++||. T Consensus 222 ~l~~e~~~----~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~ 297 (411) T protein:vir:81 222 DLNQEARD----RLVKGFEQFANGSKNAGKIIPVPLGMKLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKS 297 (411) T ss_pred CCCHHHHH----HHHHHHHHHhcCccccCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC Confidence 88876554 44555555544 47899999999999999999998765 688999999999999999999963 Q ss_pred C--cHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHH Q lcl|NC_010576. 
304 T--ANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIREL 381 (447) Q Consensus 304 ~--~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~ 381 (447) + +.|++.++|+++||.||+++||++|+++||++.++..|++|+||++.++++|.+++++++.+++++|+||+||+|++ T Consensus 298 t~~n~e~~~~~f~~~~l~P~~~~ie~~l~~~ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R~~ 377 (411) T protein:vir:81 298 SYASAEAQNLAFYVDTLLYVLKQYEEEITYKILSNDLISQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEARDY 377 (411) T ss_pred CchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhcCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 2 34889999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hCCCCCCCccccc-cccccccchhhcccccCCCCCC Q lcl|NC_010576. 382 TGKAPHPNPLANE-LFNRNIADGNQVGGINTPGQIT 416 (447) Q Consensus 382 ~gl~p~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 416 (447) +|+||+|| ||. +++.+++++...+++..+|+++ T Consensus 378 ~gl~p~~g--gD~~~~~~n~~pl~~~~~~~~kgGd~ 411 (411) T protein:vir:81 378 LDMPADDY--GNNLMANGNYIPLSMLGANYGKGGDS 411 (411) T ss_pred hCCCCCCC--CCeeeeccCccchhhhhhhhccCCCC Confidence 99999987 455 5788888886655443333333 No 11 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=100.00 E-value=9e-85 Score=481.36 Aligned_cols=402 Identities=13% Similarity=0.057 Sum_probs=287.7 Q ss_pred CchhHhhhhhcccc-cCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAF-QSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHL 79 (447) Q Consensus 1 Mg~~~~l~~~~~~f-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~ 79 (447) ||||+||.+..+.- +.+...........+. .++...+|. 
.....++...++++++|++||++||++||++||++| T Consensus 1 MG~f~~lf~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~g~---~~~~~v~~~~al~~~~v~~ci~~ia~~iA~lp~~~~ 76 (422) T protein:vir:13 1 MGFLRGLFNKKNNNDEKRSNYDEDIGIDISD-SNFWEKFGI---KLNFSVRGKRALKENTVYVCTKIRAESIGKLSLKIY 76 (422) T ss_pred CchhhhhhhccCCccchhhhhhhccccccCc-chhhhhccc---cCCcccchhhhhccHHHHHHHHHHHHhhhhCceEEE Confidence 99999874321110 0000000111111111 111122221 122234566788999999999999999999999999 Q ss_pred EEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCc Q lcl|NC_010576. 80 KIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQ 159 (447) Q Consensus 80 r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (447) |.+. +..+|+++++|+.+||++||+++||+.++.+++++||||+++.++..+.+..++++.+..+ .+.....+. T Consensus 77 ~~~~-----~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v-~~~~~~~~~ 150 (422) T protein:vir:13 77 KDKE-----EYKEHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDRKGKIIGLYPINSDNV-TKIIDDDNF 150 (422) T ss_pred ecCc-----ccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcce-EEEEcCCcc Confidence 8542 3468999999999999999999999999999999999999999988776555555544443 333222221 Q ss_pred e---EEEEeeecccccceeeeccccccccccccc-ccccchhHHHHHHHHHHHHHHHH-----HHHhhcCcccceeeeCC Q lcl|NC_010576. 160 V---MVRVWNDNTGLEQDLLVSKENCIIIESPFY-AILNDTNQTLRMLEQKIKLMNSQ-----DNRASSGKLNGFIQFPY 230 (447) Q Consensus 160 ~---~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~-----~~~~n~~~~~gvl~~~~ 230 (447) . ....|......+....+++++|+|++.+.. +.. .+.+.+..+...+....+. 
..+.||++|+|+|++++ T Consensus 151 ~~~~~~~~y~~~~~~g~~~~~~~~eiih~~~~~~~~~~-~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~ 229 (422) T protein:vir:13 151 LSSLSKVWYVVTDKNGKEHKLLPDEMLHFIGDITLDGL-IGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYVG 229 (422) T ss_pred eeccceEEEEEEeCCCeEEEEcccceEEEcCCCCCCCc-ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC Confidence 1 111222222334456788999999986422 211 1223334444444333322 33578899999999999 Q ss_pred cCChHHHHHHHHHHHHHHHHHhc--cCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC---- Q lcl|NC_010576. 231 STKSTARAAQAARRKQEIENEMA--NNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG---- 303 (447) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~--~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g---- 303 (447) .+++++.++. ++.|.+.+. .|+++++||++|++|+++++++.+++ ++.+++++++||++|||||++|++ T Consensus 230 ~l~~e~~~~~----~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~ 305 (422) T protein:vir:13 230 DLDEKAKKIF----KKEFESMSNGLENAHSISLLPFGYQFQPISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERA 305 (422) T ss_pred CCCHHHHHHH----HHHHHHHhcCccccCCceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC Confidence 8887655544 444544444 47899999999999999999998865 688999999999999999999974 Q ss_pred --CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHH Q lcl|NC_010576. 
304 --TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIREL 381 (447) Q Consensus 304 --~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~ 381 (447) ++.|++..+|+++||.||+++||++|+++||++.++..|++|+||++.|+|+|.+++++++.+++++|+||+||+|++ T Consensus 306 ~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~ 385 (422) T protein:vir:13 306 TFNNLTEQQKDFYVTTLQSSLTVYEQEIQDKLFSQYETLQDVKAEFNVDTILRSDIKTRYEAYRIGIQGGFIEANEARRR 385 (422) T ss_pred CcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCChhhhcCCceEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 245899999999999999999999999999999999889999999999999999999999999999999999999999 Q ss_pred hCCCCCCCccccc-cccccccchhhcccccCCCCCCCCC Q lcl|NC_010576. 382 TGKAPHPNPLANE-LFNRNIADGNQVGGINTPGQITSDQ 419 (447) Q Consensus 382 ~gl~p~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 419 (447) +|+||+|| ||. +++.|++++...+....++++.++. T Consensus 386 ~gl~p~~g--gD~~~~~~n~~~l~~~~~~~~~~g~~~g~ 422 (422) T protein:vir:13 386 ENLPPVEG--GDRLLVNGNMIPIEMAGEQYKKGGEKGGK 422 (422) T ss_pred hCCCCCCC--cCeeeeccCccchhhcccccccCCCcCCC Confidence 99999988 455 6788888776554332222222222 No 12 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=100.00 E-value=1.7e-84 Score=479.81 Aligned_cols=393 Identities=11% Similarity=0.069 Sum_probs=291.7 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) =||++||..+|.-.++..+...+. ..+..+. .... 
...++...++++++||+||++||++||+|||++|| T Consensus 14 ~g~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~------~~~~-~~~v~~~~al~~~~v~~cv~~Ia~~iA~lp~~~~~ 83 (424) T protein:vir:18 14 NGWWARLQSWFVGGRLVTPNQGSQ---TGPVSAH------GHLG-DSSINDERILQISTVWRCVSLISTLTACLPLDVFE 83 (424) T ss_pred CchHHHHHhhhccccccccccccc---ccccccc------cccc-cccccHHHhhccHHHHHHHHHHHHhhccCceEEEE Confidence 577777776553322221111111 1111111 1111 22345678899999999999999999999999999 Q ss_pred EcCCCceecc-ccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCc Q lcl|NC_010576. 81 IDPISGNQTP-MPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQ 159 (447) Q Consensus 81 ~~~~~~~~~~-~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (447) .+++|+..++ .+|+++++|+.+||++||+++||+.++.+++++||||+++.++..+.+..++++.+..+ .+.. ..+. T Consensus 84 ~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~pl~~~~V-~v~~-~~~~ 161 (424) T protein:vir:18 84 TDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM-DVKL-VGKK 161 (424) T ss_pred eecCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcce-EEEE-cCCe Confidence 9888766543 68999999999999999999999999999999999999999988776555555544443 3322 2333 Q ss_pred eEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHH-----HHHhhcCcccceeeeCCc-CC Q lcl|NC_010576. 160 VMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQ-----DNRASSGKLNGFIQFPYS-TK 233 (447) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~n~~~~~gvl~~~~~-~~ 233 (447) ..+.+. ..+....+++++|+|++.+..++... .+.+..+...+....+. ..+.||++|+|||+++.. 
++ T Consensus 162 ~~y~~~----~~g~~~~~~~~eIih~r~~~~dg~~G-~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~l~ 236 (424) T protein:vir:18 162 VVYRYQ----RDSEYADFSQKEIFHLKGFGFTGLVG-LSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT 236 (424) T ss_pred EEEEEE----eCCeEEEeccccEEEecCcCCCCccc-ccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCcCCC Confidence 333322 22345578899999999765443322 23334444444443322 235789999999999865 44 Q ss_pred hHHHHHHHHHHHHHHHHHhcc-CCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC-------- Q lcl|NC_010576. 234 STARAAQAARRKQEIENEMAN-NKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG-------- 303 (447) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~-n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g-------- 303 (447) + ++++++++.|++..++ |+++++||++|++|+++++++++++ ++.+++++++||++|||||++||. T Consensus 237 ~----e~~~~~~~~~~~~~~g~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~ 312 (424) T protein:vir:18 237 E----QQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWG 312 (424) T ss_pred H----HHHHHHHHHHHHHhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccccc Confidence 3 3445556666655544 7899999999999999999998865 788999999999999999999962 Q ss_pred CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhC Q lcl|NC_010576. 
304 TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTG 383 (447) Q Consensus 304 ~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g 383 (447) ++.||+.++|+++||.||+++||++|+++||++.++ .+++|+||++.|+++|.++|++++.+++++|+||+||+|+++| T Consensus 313 sn~eq~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~-~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~g 391 (424) T protein:vir:18 313 SGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDV-GRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDN 391 (424) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccc-CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhC Confidence 345899999999999999999999999999999876 4799999999999999999999999999999999999999999 Q ss_pred CCCCCCccccc-cccccccchhhcccccCCCCCCC Q lcl|NC_010576. 384 KAPHPNPLANE-LFNRNIADGNQVGGINTPGQITS 417 (447) Q Consensus 384 l~p~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 417 (447) +||+||+ |. +++.++.++...+....+.++.. T Consensus 392 l~pi~gG--D~~~~~~n~~~l~~~~~~~~p~~~ga 424 (424) T protein:vir:18 392 LPPLPGG--DVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) T ss_pred CCCCCCc--CeeeeccCccchHhhhccCCCccCCC Confidence 9999884 55 67888888766543221111111 No 13 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=100.00 E-value=2.6e-84 Score=478.84 Aligned_cols=399 Identities=13% Similarity=0.089 Sum_probs=284.8 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||+||. ++++..... 
.+..+...++.......+..++...++++++|++||++||++||++||++|| T Consensus 1 Mg~f~~lf------~r~~~~~~~-----~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~Ia~~ia~~p~~~~~ 69 (414) T protein:vir:44 1 MVFFSGLF------QRKSDAPVT-----TPAELADAIGLSYDTYTGKQISSQRAMRLTAVFSCVRVLAESVGMLPCNLYH 69 (414) T ss_pred Cchhhhhh------ccCccCccc-----chhhHhHhhccCccccCCceechhhhhccHHHHHHHHHHHHHhccCceEEEE Confidence 99999853 444322111 1111222222222222334445567899999999999999999999999999 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) .++++. ....+|++++||+.+||++||+++||+.++.+++++||||+++.++. +.+.. +++++.....+.....+.. T Consensus 70 ~~~~~~-~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~~-g~~~~-L~~l~~~~v~~~~~~~~~~ 146 (414) T protein:vir:44 70 LNGSLK-QRATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKAF-GEVAE-LLPVDPGCVVPKLNSSWEP 146 (414) T ss_pred ecCCce-eecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeCC-CcEEE-EEEEcCceEEEEECCCCcE Confidence 876664 45689999999999999999999999999999999999999998764 43333 4444443333333223333 Q ss_pred EEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCCcCChH Q lcl|NC_010576. 161 MVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPYSTKST 235 (447) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~~~~~ 235 (447) .+.+ ....+....++.++|+|++.+..+... 
+.+.+..+...+....+ ...+.||++++|+|++++.++++ T Consensus 147 ~y~~---~~~~g~~~~~~~~evih~~~~~~d~~~-G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e 222 (414) T protein:vir:44 147 VYQV---TFPDGSTDVLSQEDIWHVRTLTLDGLV-GLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQTLSDQ 222 (414) T ss_pred EEEE---EecCceEEEEccccEEEecCCCCCCcc-cccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHH Confidence 3322 222334456889999999975433322 22333444444433322 23457899999999999998876 Q ss_pred HHHHHHHHHHHHHHHHhc--cCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC------CcH Q lcl|NC_010576. 236 ARAAQAARRKQEIENEMA--NNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG------TAN 306 (447) Q Consensus 236 ~~~~~~~~~~~~~~~~~~--~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g------~~~ 306 (447) +.++ ++++|.+.++ +|++++++|++|++|+++++++.+++ ++.+++++++||++|||||++|++ ++. T Consensus 223 ~~~~----~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~ 298 (414) T protein:vir:44 223 AYER----LKKDFEERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNI 298 (414) T ss_pred HHHH----HHHHHHHHhcCccccCcceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccH Confidence 5544 4445555544 47899999999999999999998865 688999999999999999999973 234 Q ss_pred HHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Q lcl|NC_010576. 
307 EQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAP 386 (447) Q Consensus 307 e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p 386 (447) |++.+.|+++||.||+++||++||++||++.++ .+++|+||++.|+++|.+++++++++++++|+||+||+|+++|+|| T Consensus 299 e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~-~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p 377 (414) T protein:vir:44 299 EELGLGFINYSLVPYLTRIEQRINTGLVRKSKQ-GVFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNP 377 (414) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccc-CceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Confidence 899999999999999999999999999999876 4889999999999999999999999999999999999999999999 Q ss_pred CCCccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCCCCccccc Q lcl|NC_010576. 387 HPNPLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTS 434 (447) Q Consensus 387 ~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (447) +|| ||. +++.+++.....+.. ...++++...+++++ T Consensus 378 ~~g--gD~~~~~~n~~~~~~~~~~----------~~~~~~~~~~d~~~~ 414 (414) T protein:vir:44 378 RPG--GDVYLTPMNMTTKPSDGSK----------AGKQKDNANADETTS 414 (414) T ss_pred CCC--cceecccccccccCCcccc----------CCCCCCCCCCCCCCC Confidence 988 455 455555432211110 001111111111111 No 14 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=100.00 E-value=5.8e-84 Score=476.90 Aligned_cols=393 Identities=12% Similarity=0.082 Sum_probs=289.5 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) =||++|++++| +.+............+..+ . 
.......++...++++++|++||++||++||+|||+||| T Consensus 14 ~g~~~~~~~~f---~~~~~~~~~~~~~~~~~~~------~-~~~~~~~v~~~~al~~~~v~~cv~~Ia~~iA~lp~~vy~ 83 (424) T protein:vir:18 14 NGWWARLKSWF---VGGRLVTPNQGSQTGPVSA------H-GYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFE 83 (424) T ss_pred CchHHHHHhhc---cccccccccchhhcccccc------c-cccccccccHHHhhccHHHHHHHHHHHHhhccCceEEEE Confidence 46666655543 3322111111111111111 1 111122345667999999999999999999999999999 Q ss_pred EcCCCceecc-ccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCc Q lcl|NC_010576. 81 IDPISGNQTP-MPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQ 159 (447) Q Consensus 81 ~~~~~~~~~~-~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (447) ..++|+..++ .+|++++||+.+||++||+++||+.++.+++++||||+++.++..+.+..++++.+..+ .+.. ..+. T Consensus 84 ~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v-~v~~-~~~~ 161 (424) T protein:vir:18 84 TDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM-DVKL-VGKK 161 (424) T ss_pred eccCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcce-EEEE-cCCe Confidence 9888776554 68999999999999999999999999999999999999999988776555555444433 3332 2334 Q ss_pred eEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHH-----HHHhhcCcccceeeeCCc-CC Q lcl|NC_010576. 160 VMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQ-----DNRASSGKLNGFIQFPYS-TK 233 (447) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~n~~~~~gvl~~~~~-~~ 233 (447) ..+.+. ..+....+++++|+|++++..++.... +.+..+...+....+. ..+.||++++|+|+++.. 
++ T Consensus 162 ~~y~~~----~~g~~~~~~~~eVihir~~~~dg~~G~-spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~ 236 (424) T protein:vir:18 162 VVYRYQ----RDSEYADFSQKEIFHLKGFGFTGLVGL-SPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT 236 (424) T ss_pred EEEEEE----eCCeEEEeccccEEEecCcCCCCcccc-cHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCcCCC Confidence 433322 223455788999999997654433222 3334444444443222 335789999999999875 44 Q ss_pred hHHHHHHHHHHHHHHHHHhcc-CCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC-------- Q lcl|NC_010576. 234 STARAAQAARRKQEIENEMAN-NKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG-------- 303 (447) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~-n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g-------- 303 (447) ++ +++++++.|.+...+ |+++++||++|++|+++++++++++ ++.+++++++||++|||||++||. T Consensus 237 ~e----~~~~~~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~ 312 (424) T protein:vir:18 237 EQ----QRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWG 312 (424) T ss_pred HH----HHHHHHHHHHHHhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCccccc Confidence 43 445556666554443 7889999999999999999998865 688999999999999999999962 Q ss_pred CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhC Q lcl|NC_010576. 304 TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTG 383 (447) Q Consensus 304 ~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g 383 (447) ++.||+.++|+++||.||+++||++||++||++.++. 
+++|+||++.|+++|.++|++++.+++++|+||+||+|+++| T Consensus 313 sn~eq~~~~f~~~tl~P~~~~ie~~ln~~L~~~~~~~-~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~g 391 (424) T protein:vir:18 313 SGIEQQNLGFLQYTLQPYISRWENSIQRWLIPSKDVG-RLHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDN 391 (424) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccC-CeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhC Confidence 3458999999999999999999999999999998874 799999999999999999999999999999999999999999 Q ss_pred CCCCCCccccc-cccccccchhhcccccCCCCCCC Q lcl|NC_010576. 384 KAPHPNPLANE-LFNRNIADGNQVGGINTPGQITS 417 (447) Q Consensus 384 l~p~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 417 (447) +||+|| +|. ++++|+.++...++...+..... T Consensus 392 l~pi~g--gD~~~~~~n~~~l~~~~~~~~~~~n~a 424 (424) T protein:vir:18 392 MPPLPG--GDVAMRQAQYVPITDLGTNKEPRNNGA 424 (424) T ss_pred CCCCCC--cCeeeeccCccchhhhhccCCccccCC Confidence 999988 455 67888888766543221111100 No 15 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=100.00 E-value=8.5e-84 Score=476.01 Aligned_cols=396 Identities=11% Similarity=0.038 Sum_probs=287.5 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) +.+|++ +|++++. ++++.+.+..++. .++ ...++. 
+++...++++++|++||++||++||+|||++|| T Consensus 16 ~~~~~~------lf~~~~~--~~~~~~~~~~~~~--~~~-~~~~~~-~vs~~~al~~~~v~~cv~~Ia~~iA~lp~~v~~ 83 (424) T protein:vir:45 16 RVLLDA------LFRSKSL--ENPSTPITGDAVD--TDG-LFRADV-YVSPETAMKLAAVYSCIYVLSSSLAQMPLHVMR 83 (424) T ss_pred hHHHHh------hccccCC--CCCccccchhhhh--hhc-cccCCc-eechHHhhccHHHHHHHHHHHHHHhhCceEEEE Confidence 666665 3444432 2233333332221 111 222223 355677999999999999999999999999999 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) ++ +|+.+++.+|++++||+.+|||+||+++||+.++.+++++||+|+++.++..+.+..++++.+..+ .+.. ..+.. T Consensus 84 ~~-~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v-~i~~-~~~~~ 160 (424) T protein:vir:45 84 RH-KGKVEPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRRGEVISLDCCMPWET-TLMN-TGGRY 160 (424) T ss_pred ec-CCceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceE-EEEE-cCCeE Confidence 75 455566789999999999999999999999999999999999999999988777665555555443 2222 23333 Q ss_pred EEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCCcCChH Q lcl|NC_010576. 161 MVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPYSTKST 235 (447) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~~~~~ 235 (447) .+.+.. ......+++++|+|++.+..+.... 
.+.+..+...+....+ ...+.||++++|||++++.++++ T Consensus 161 ~y~~~~----~~~~~~~~~~eVih~r~~~~d~~~G-~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e 235 (424) T protein:vir:45 161 TYGLYN----EYGAFAISPDDMIHIRALGNNQKMG-LSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKSGLNKE 235 (424) T ss_pred EEEEEe----cCceEEECcccEEEecCcCCCCccc-ccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHH Confidence 333322 1223467889999999765443222 2333444444443332 23357899999999999988876 Q ss_pred HHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC----C--cHHH Q lcl|NC_010576. 236 ARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG----T--ANEQ 308 (447) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g----~--~~e~ 308 (447) +.++.+++|.+.+. ...+|+|+++||++|++|+++++++.+++ ++.+++++++||++|||||++|+. + +.|| T Consensus 236 ~~~~~~~~~~~~~~-g~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq 314 (424) T protein:vir:45 236 SWGWLKDQWQKASQ-ALRRQENKTMLLPADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNISA 314 (424) T ss_pred HHHHHHHHHHHHhc-cccccCCceeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHH Confidence 55555444444332 12247899999999999999999998754 788999999999999999999973 2 3489 Q ss_pred HHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Q lcl|NC_010576. 
309 QTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHP 388 (447) Q Consensus 309 ~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ 388 (447) +.++|+++||.||+++||++||++||++.++..|++|+||++.|+|+|.++|++++.+++++|+||+||+|+++|+||++ T Consensus 315 ~~~~f~~~tL~P~~~~ie~~ln~kLl~~~e~~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g~~T~NE~R~~~gl~pi~ 394 (424) T protein:vir:45 315 QAIQFVRYTMMPWVTNWEQELNRRLFTRAELAAGYYVRFNLTGLLRGTPQERAQFYHFAITDGWMSRNEARAFEDMNPVE 394 (424) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCChhhhcCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred Cccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCCCCcc Q lcl|NC_010576. 389 NPLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNV 431 (447) Q Consensus 389 g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (447) | +|. +++.|+.+.... ..+...+...+++ T Consensus 395 g--gD~~~~~~n~~~~~~~------------~~~~~~~~~~~~~ 424 (424) T protein:vir:45 395 G--LDEMLVSVNAANPAGD------------FKPPKNDEGKTNE 424 (424) T ss_pred C--cceeeecccccccccc------------cCCCCCCCCCCCC Confidence 8 465 456555431110 0000111111111 No 16 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=100.00 E-value=2.7e-83 Score=473.25 Aligned_cols=415 Identities=12% Similarity=0.059 Sum_probs=292.1 Q ss_pred Cc-----hhHhhhh-hcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccC Q lcl|NC_010576. 1 MA-----SSDRLLH-SWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMV 74 (447) Q Consensus 1 Mg-----~~~~l~~-~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~l 74 (447) |. +|+|+.. +.+.|... .+.. 
.+.+| ..+++..+..+ ..++..+++++++||+||++||++||+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~g~~----~s~~---~~~~~-~~~~~~~~~~g-~~v~~~~al~~~~v~~ci~~Ia~~ia~l 71 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLKWLGVP----ISLT---DGSFW-SAWGGMGSSSG-ETVTADSALQLSAVWSCVRLIAETIATL 71 (437) T ss_pred CCcchhhhhhhhHHhhhhhcCCc----ccCC---chhHH-HhhcccccCCC-ceechHhhhccHHHHHHHHHHHHHHhhC Confidence 65 3333322 11122110 0110 11111 12223333333 3344567889999999999999999999 Q ss_pred ceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceee Q lcl|NC_010576. 75 DFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQ 154 (447) Q Consensus 75 p~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (447) ||++||++++|......+|++++||+.+||++||+++||+.++.+++++||||+++.++. +... .+++.+.....+.. T Consensus 72 p~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-g~~~-~L~~l~p~~v~i~~ 149 (437) T protein:vir:10 72 PLNLYQTKPDGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRSA-GVLI-GLELMLPQRTTVKR 149 (437) T ss_pred ceeEEEEcCCCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEecC-CcEE-EEEEEcCcceEEEE Confidence 999999999998888899999999999999999999999999999999999999999885 4433 34444444444444 Q ss_pred ecCCceEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeC Q lcl|NC_010576. 155 FFPRQVMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFP 229 (447) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~ 229 (447) ...+...+.+. .. .+....++.++|+|++++..++... 
.+.+..+...+....+ ...+.||++++|||+++ T Consensus 150 ~~~g~~~y~~~-~~--~g~~~~~~~~dIih~r~~~~d~~~G-~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~ 225 (437) T protein:vir:10 150 LTSGALQYTYR-NV--DGTVSTLAEDDVFHVRGFSLDGLMG-LTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTD 225 (437) T ss_pred CCCCeEEEEEE-ec--CceEEEEccccEEEecCcCCCCccc-ccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC Confidence 33344333322 22 2334578899999999765443222 2334444444443322 23357899999999999 Q ss_pred CcCChHHHHHHHHHHHHHHHHHhc--cCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC--- Q lcl|NC_010576. 230 YSTKSTARAAQAARRKQEIENEMA--NNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG--- 303 (447) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~--~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g--- 303 (447) +.+++++.++.+ +.|.+.+. .|+|+++||++|++|+++++++++++ ++.+++++++||++|||||++||. T Consensus 226 ~~l~~e~~~~~~----~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~ 301 (437) T protein:vir:10 226 QILQKEKRAEIR----TDLAEQFGGAMQAGKTMVLEAGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEK 301 (437) T ss_pred CCCCHHHHHHHH----HHHHHHhcCccccCcceeccCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC Confidence 988876555444 44554444 47899999999999999999998865 788999999999999999999962 Q ss_pred -----CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHH Q lcl|NC_010576. 304 -----TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEI 378 (447) Q Consensus 304 -----~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~ 378 (447) ++.|++.++|+++||.||+.+||++|++|||++.++. 
+++|+||++.|+++|.++|++++.+++++|+||+||+ T Consensus 302 ~t~~~sn~e~~~~~f~~~tl~P~~~~ie~~l~~kll~~~e~~-~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~ 380 (437) T protein:vir:10 302 STSWGTGIEQQTLGFLTFTLRPWLTRIEQAARRSLLRPGERD-QFYAEFSVEGLLRADSAGRAAFYSTMTQNGLMTRDEC 380 (437) T ss_pred cccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccC-ceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHH Confidence 3358999999999999999999999999999998875 5789999999999999999999999999999999999 Q ss_pred HHHhCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCcccccccCC Q lcl|NC_010576. 379 RELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTSAIEN 438 (447) Q Consensus 379 R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 438 (447) |+++|+||++|+....+++.++.+....+....+.+. ++...+.+..+++..+..+. T Consensus 381 R~~~gl~pi~gg~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~e~ 437 (437) T protein:vir:10 381 RAKENLPPMGGNAAVLTVQSALLPIDKLGEHTTATAA---QDALKAWLYQEEKTRATQER 437 (437) T ss_pred HHHhCCCCCCCCcceEeecCcccchhhccCcCCCcch---hccccccCCCCCCCCccccC Confidence 9999999999863323457777776554333222211 11111111111111111111 No 17 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=100.00 E-value=2.8e-83 Score=473.13 Aligned_cols=424 Identities=11% Similarity=0.081 Sum_probs=290.5 Q ss_pred CchhHhhhhhcccccCCc--cccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQ--NQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKH 78 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~ 78 (447) |||++||.+.. +... 
..+.....+..+.++ .+++...++..++ ...++++++||+||++||++||+|||++ T Consensus 1 Mg~~~~l~~r~---~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~g~~V~-~~~al~~~~V~~~v~~Ia~~iA~lp~~~ 73 (457) T protein:vir:13 1 MGFWSALFGRG---HSPALDGIEARAWEPYDPSIY---NLGAVAASGETVT-PHDALQVSAVFASVRLLSETIATLPLST 73 (457) T ss_pred Cchhhhhhccc---ccccccccccccccccchHHH---hhcccccCCceec-hHHhhccHHHHHHHHHHHHhhccCceEE Confidence 99999875422 2211 111111111122111 1233344444444 4678999999999999999999999999 Q ss_pred EEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCC Q lcl|NC_010576. 79 LKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPR 158 (447) Q Consensus 79 ~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (447) ||.+.+ +.+++.+|++.++|+..|| +||+++||+.++.+++++||||+++.++. +.+. .+++++.....+.....+ T Consensus 74 ~~~~~~-~~~~~~~~~l~~~ln~~~n-~~t~~~f~~~~~~~lll~Gna~~~i~~~~-g~~~-~l~~l~p~~v~v~~~~~~ 149 (457) T protein:vir:13 74 YSKRGG-SRKEIVTPEWLDYPNAEPG-GMGRIDILSQTVLSLLLQGNAFLAVRWQG-PNIV-GLDVLDPTKIHVHMVMVD 149 (457) T ss_pred EEecCC-cccccccchHHHhccccCC-CCCHHHHHHHHHHHHhhcCCeEEEEEecC-CcEE-EEEEEccCceEEEEecCC Confidence 997654 4556789999999986555 79999999999999999999999997764 3333 344444433333332222 Q ss_pred ceE---EEEeeecccccc--eeeecccccccccccccccccchhHHHHHHHHHHHHHHHH-----HHHhhcCcccceeee Q lcl|NC_010576. 159 QVM---VRVWNDNTGLEQ--DLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQ-----DNRASSGKLNGFIQF 228 (447) Q Consensus 159 ~~~---~~~~~~~~~~~~--~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~n~~~~~gvl~~ 228 (447) ... +..|........ ...++.++|||++.+.......+.+.+..+...+....+. 
..+.||++|+|||++ T Consensus 150 ~~~~~~~~~y~~~~~~~~~~~~~~~~~diih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~ 229 (457) T protein:vir:13 150 GLRRKVFEAYDIDADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEV 229 (457) T ss_pred CccceeEEEEEEecCCceeeEEeeCccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEc Confidence 221 222222222111 2246789999999765433222223344444444443332 334789999999999 Q ss_pred CCcCChHHHHHHHHHHHHHHHHHhc--cCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC-- Q lcl|NC_010576. 229 PYSTKSTARAAQAARRKQEIENEMA--NNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG-- 303 (447) Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~~~--~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g-- 303 (447) ++.+++++. +++++.|.+.++ .|+++++||++|++|+++++++.+++ ++.+++++++||++|||||++||. T Consensus 230 ~~~ls~e~~----~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~ 305 (457) T protein:vir:13 230 PGTMSEEGL----ARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDAT 305 (457) T ss_pred CCCCCHHHH----HHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC Confidence 998887654 445555555554 47899999999999999999998865 688999999999999999999962 Q ss_pred ------CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHH Q lcl|NC_010576. 
304 ------TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNE 377 (447) Q Consensus 304 ------~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE 377 (447) ++.||+.++|+++||.||+++||++||++||++.++ .+++|+||++.|+++|.++|++++.+++++|+||+|| T Consensus 306 ~~~~~~sn~eq~~~~f~~~tl~P~~~~ie~~ln~~L~~~~~~-~~~~i~fd~~~l~~~D~~~r~~~~~~~~~~G~~T~NE 384 (457) T protein:vir:13 306 NSTSWGSGLAEQNIAFTMFSLRPWLERIEAGFNRLLFAETAD-RFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDE 384 (457) T ss_pred CcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccc-CceeEEeechhhhccCHHHHHHHHHHHHhCCCcCHHH Confidence 335899999999999999999999999999998775 5789999999999999999999999999999999999 Q ss_pred HHHHhCCCCCCCccccc-cccccccchhhcccccC--------CCCCCCCCCCc-CCCCCCCcccccccCCcc Q lcl|NC_010576. 378 IRELTGKAPHPNPLANE-LFNRNIADGNQVGGINT--------PGQITSDQPAT-ASTDPLNNVSTSAIENGS 440 (447) Q Consensus 378 ~R~~~gl~p~~g~~~~~-~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 440 (447) +|+++||||++||.+|. +++.|+.+..+...... +..+..+.++. ++.++.++.-.+..++|+ T Consensus 385 ~R~~~gl~Pi~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~d~~~~~~~~~~~~~ 457 (457) T protein:vir:13 385 VRAAEDMTPLPDGLGEKYRVPLNLGEVGEEPEPEPAPAPPAIEPPAEEPDEEPEPEGKPDDEGATEEDDEDDA 457 (457) T ss_pred HHHHhCCCCCCCCcccceeeccccccccccccccccCCCCCCCCCccccCCCCCCCCCCccccCCCCcccccC Confidence 99999999999998887 46777766543222111 11111111111 111112222222222222 No 18 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=100.00 E-value=3e-83 Score=473.01 Aligned_cols=404 Identities=12% Similarity=0.087 Sum_probs=297.2 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 
1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) |+|+++ |+++.. +....+.....+ ..+....++.. ++..+++++++|++||++||++||+|||++|| T Consensus 1 m~~~~~-------~~~~~~--~~~~~~~~~~~~---~~~~~~~~g~~-v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~ 67 (419) T protein:vir:57 1 MFIPQF-------WKGRPS--ENRVNWQVVPGG---MRSSSSQAGVI-ITPETALALSAVRACVTLLAESVAQLPCVLYR 67 (419) T ss_pred Ccchhh-------hccCCc--cccccccccccc---cccccccCCce-echHHhhccHHHHHHHHHHHHhhccCceEEEE Confidence 888765 333322 222222111111 11222333333 45567889999999999999999999999999 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) .+++|+.+...+|++++||+.+||++||+++||+.++.+++++||||+++.++..+.+..++++.+..+ .+.....+.. T Consensus 68 ~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~pl~~~~v-~v~~~~~g~~ 146 (419) T protein:vir:57 68 RTENGGREIAFDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGDITELIPINPHKV-IVLKGPDGMP 146 (419) T ss_pred EcCCCceeccccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcce-EEEECCCceE Confidence 999998888889999999999999999999999999999999999999999988776655555544433 3433222222 Q ss_pred EEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHH-----HHHHHhhcCcccceeeeCCcCChH Q lcl|NC_010576. 161 MVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMN-----SQDNRASSGKLNGFIQFPYSTKST 235 (447) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~n~~~~~gvl~~~~~~~~~ 235 (447) .+ ...+.+ ..++.++|+|++.+..+.... .+.+..+...+.... +...+.||++|+|+|+++...... 
T Consensus 147 ~y----~~~~~~--~~~~~~~vih~r~~~~d~~~G-~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~ 219 (419) T protein:vir:57 147 YY----DIPSIG--EILPMRMVHHIKSFSLDGYIG-TSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFEAKAI 219 (419) T ss_pred EE----EEcCCc--eEEchhhEEEecCcCCCCccc-ccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEecCcCCcc Confidence 22 222222 246789999999754332211 222333333333322 223357899999999998877766 Q ss_pred HHHHHHHHHHHHHHHHhcc--CCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC------CcH Q lcl|NC_010576. 236 ARAAQAARRKQEIENEMAN--NKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG------TAN 306 (447) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~--n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g------~~~ 306 (447) ..++++++++++|.+.+.+ |+++++||++|++|+++++++++++ ++.+++++++||++|||||++|++ ++. T Consensus 220 ~~~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~ 299 (419) T protein:vir:57 220 ASQAAVDAILAKWTERYGGVRNAFSVGMLQEGMTYKQLSQDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKSTNNNI 299 (419) T ss_pred cCHHHHHHHHHHHHHHhccccccccceecCCCceEEEcCCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCccccH Confidence 6778888888888887765 7899999999999999999998865 688999999999999999999963 234 Q ss_pred HHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Q lcl|NC_010576. 
307 EQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAP 386 (447) Q Consensus 307 e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p 386 (447) |++.+.|+++||.||+++||++|+++||++.++ .|++|+||++.|+++|.++|++++++++++|+||+||+|+++|+|| T Consensus 300 e~~~~~f~~~~l~P~~~~ie~~l~~~ll~~~~~-~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p 378 (419) T protein:vir:57 300 EHQGLQYVIYTMLAILKRHESAMMRDLLLPSER-RDFYIEFNVSSLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLTP 378 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccCcccc-CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Confidence 899999999999999999999999999998776 4899999999999999999999999999999999999999999999 Q ss_pred CCCccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCCCCcccccccCC Q lcl|NC_010576. 387 HPNPLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTSAIEN 438 (447) Q Consensus 387 ~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 438 (447) +|| ||. +++.|+++..+..+.+.+. ++.. .+.++.-++++ T Consensus 379 ~~g--gD~~~~~~n~~~~~~~~~~~~~~--~~~~--------~~~~~~~~~~~ 419 (419) T protein:vir:57 379 IPG--GDKYLTPLNMVDSKALTGIGKAT--PQQL--------KDIEAILCTRN 419 (419) T ss_pred CCC--cCeeeeccccccccccccccCCC--cccC--------cchhhhhhccC Confidence 987 466 4677776654433322111 1111 11112222222 No 19 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=100.00 E-value=2.4e-83 Score=473.49 Aligned_cols=403 Identities=10% Similarity=0.041 Sum_probs=295.6 Q ss_pred CchhHhhhhhcccccCCcccccccccccccccccc---ccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMT---SFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFK 77 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~ 77 (447) |+++.. |+++.. +.+ ...+|.. ...+..+.++.. 
++...++++++||+||++||++||+|||+ T Consensus 1 m~~~~~-------~~~~~~-~~s-----~~~~w~~~~~~~~~~~~~~g~~-vt~~~al~~~~v~~~i~~Ia~~iA~lp~~ 66 (421) T protein:vir:10 1 MFIPQM-------FEGKKR-SVS-----GGGFWEAMLGGVRSSHSKAGVM-ITPETALALSAVRACVTLLAESVAQLPVE 66 (421) T ss_pred CCCcch-------hccccc-ccC-----cchhhHHHhhhhccCcccCCce-echHHhhccHHHHHHHHHHHHhhccCceE Confidence 776543 333322 111 1222321 122333334434 45567899999999999999999999999 Q ss_pred EEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecC Q lcl|NC_010576. 78 HLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFP 157 (447) Q Consensus 78 ~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (447) |||++++|+.++..+|++++||+.+||++||+++||+.++.+++++||||+++.++..+.+..++++ +.....+..... T Consensus 67 ~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~l-~~~~v~v~~~~~ 145 (421) T protein:vir:10 67 LYRRDKNGGRQRATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKGYPKELIPI-NPKKVIVLKGPD 145 (421) T ss_pred EEEEcCCCceeecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEe-cCceEEEEECCC Confidence 9999999988888999999999999999999999999999999999999999999887765544444 444434443333 Q ss_pred CceEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCCcC Q lcl|NC_010576. 158 RQVMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPYST 232 (447) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~~ 232 (447) +...+.+ ...+. .++.++|+|++.+..+... 
+.+.+..+...+....+ ...+.||++++|+|++++.+ T Consensus 146 g~~~y~~----~~~g~--~~~~~eiih~~~~~~d~~~-G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~ 218 (421) T protein:vir:10 146 GMPYYEI----PEIGE--TLPMRMMHHVKVFSLDGYI-GSSPIQTNADVLGLNLAVEEHASAVFRRGATMSGVIERPKEA 218 (421) T ss_pred ceEEEEE----cCCCc--EEchhhEEEecCcCCCCcc-cccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCcc Confidence 3332222 11222 4678999999976544322 22333444444443322 23357899999999999877 Q ss_pred ChHHHHHHHHHHHHHHHHHhcc--CCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC------ Q lcl|NC_010576. 233 KSTARAAQAARRKQEIENEMAN--NKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG------ 303 (447) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~--n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g------ 303 (447) .....+++++++++.|.+.+.+ |+++++||++|++|+++++++.+++ ++.+++++++||++|||||++|+. T Consensus 219 ~~~~~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~ 298 (421) T protein:vir:10 219 PAIKSQEKIDQLLAKWTDRYSGINNMFSVALLQEGMSYKQMSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKATN 298 (421) T ss_pred CccCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCcc Confidence 6655677778888888887765 7899999999999999999998865 688999999999999999999972 Q ss_pred CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhC Q lcl|NC_010576. 
304 TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTG 383 (447) Q Consensus 304 ~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g 383 (447) ++.|++.++|+++||.||+++||++||+|||++.++ .+++|+||++.|+++|.+++++++.+++++|+||+||+|+++| T Consensus 299 sn~e~~~~~f~~~tl~P~~~~ie~~ln~kL~~~~~~-~~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g 377 (421) T protein:vir:10 299 NNIEHQGLQFVMYTLLAWLKRHEGALQRDLLLPSER-RDLYIEFNVSGLLRGDQKSRYESYALGRQWGWLSVNDIRRMEN 377 (421) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhhhccCcccc-CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhC Confidence 235899999999999999999999999999999886 4889999999999999999999999999999999999999999 Q ss_pred CCCCCCccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCCCCcccccccCCcc Q lcl|NC_010576. 384 KAPHPNPLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTSAIENGS 440 (447) Q Consensus 384 l~p~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 440 (447) +||++| ||. +++.++++..+...... +. ..+.++ +.-.+.+++ T Consensus 378 l~p~~g--gD~~~~~~n~~~~~~~~~~~~--~~-~~~~~~---------e~d~~~~~~ 421 (421) T protein:vir:10 378 LPPIAG--GDKYLTPLNMVDSAQIIPGDK--KP-TAQQMA---------EIDTILSRT 421 (421) T ss_pred CCCCCC--cceeeeccccccccccccCCC--Cc-ccccCc---------ccccccccC Confidence 999987 455 56777765443321111 00 000000 011111222 No 20 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=100.00 E-value=2.6e-83 Score=473.31 Aligned_cols=403 Identities=13% Similarity=0.050 Sum_probs=288.8 Q ss_pred chhHhhhhhcccccCCcccccccccccccccccccccc-ccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 2 ASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGG-YYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 2 g~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) =||+| +........ 
..+...|...++| ..+.++.. ++...++++++|++||++||++||+|||++|| T Consensus 1 ~~~~r----------~~~~~~~~~-~~~~~~~~~~~~g~~~s~~~~~-vt~~~al~~~~v~~~v~~ia~~iA~lp~~~~~ 68 (419) T protein:vir:14 1 MFFSR----------QLLSNLGQT-QMSAGGWVSALLGSSRSDSGQV-VTPASALALTVLQNCVTLLAESIAQLPIELYE 68 (419) T ss_pred Ccccc----------ccccccccc-ccCcchhhHHhhcCCCccCCcc-cchHHhhccHHHHHHHHHHHHhhccCceEEEE Confidence 12232 211111111 1122334444443 33444444 44567889999999999999999999999999 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) +++++. .++.+|++++||+.+||++||+++||+.++.+++++||||+++.++..+.+...+++.+ ....+.....+.. T Consensus 69 ~~~~~~-~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~l~pl~~-~~v~v~~~~~~~~ 146 (419) T protein:vir:14 69 RSGEDR-KPATDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGVIQGLYPLDN-EAVTVMRGSDLKP 146 (419) T ss_pred ecCCcc-ccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecC-ceEEEEECCCceE Confidence 877664 56789999999999999999999999999999999999999999987776554444444 3333333222222 Q ss_pred EEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCCcCChH Q lcl|NC_010576. 161 MVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPYSTKST 235 (447) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~~~~~ 235 (447) .+.+. +. ..++.++|+|++.+..++... 
.+.+..+...+....+ ...+.||++++|+|++++.+..+ T Consensus 147 ~y~~~----~~---~~~~~~~i~h~~~~~~dg~~G-~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~ 218 (419) T protein:vir:14 147 VYRVR----GS---DPMPQRLVHHVRWMSINGYTG-LSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPAL 218 (419) T ss_pred EEEEc----cC---cccchhheeEecCcCCCCccc-ccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcc Confidence 22221 11 125678999999754333221 2333333444333222 23357899999999999888766 Q ss_pred HHHHHHHHHHHHHHHHhcc--CCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC----C--cH Q lcl|NC_010576. 236 ARAAQAARRKQEIENEMAN--NKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG----T--AN 306 (447) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~--n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g----~--~~ 306 (447) ..+++++++++.|++.+++ |++++++|++|++|+++++++.+++ ++.+++++++||++|||||++|+. + +. T Consensus 219 ~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~~ 298 (419) T protein:vir:14 219 KDQASVDRITDGWNAKFGGSGNAKKVALLQEGMTFRPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNI 298 (419) T ss_pred cCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccH Confidence 6677788888888887765 7799999999999999999998865 688999999999999999999963 2 34 Q ss_pred HHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Q lcl|NC_010576. 
307 EQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAP 386 (447) Q Consensus 307 e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p 386 (447) |++.++|+++||.||+++||++|++|||++.++ .+++|+||++.|+++|.++|++++.+++++|+||+||+|+++|+|| T Consensus 299 E~~~~~f~~~~L~P~~~~ie~~l~~kll~~~~~-~~~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p 377 (419) T protein:vir:14 299 EHQSLQFVIYTLLPWVKRHEQAKTRDLLLPSER-KQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPP 377 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhccCcccc-CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Confidence 899999999999999999999999999999886 5899999999999999999999999999999999999999999999 Q ss_pred CCCccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCCCCccccc Q lcl|NC_010576. 387 HPNPLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTS 434 (447) Q Consensus 387 ~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (447) +||+ |. +++.|+.+....+.. +. ...++..+....+..-.+ T Consensus 378 ~~gG--D~~~~~~n~~~~~~~~~~--~~---~~~~~~~~~~~e~~~~l~ 419 (419) T protein:vir:14 378 VKGG--DIYLSPMNMVDASKPQQL--PV---GKSEPTKAAIDEIGRILS 419 (419) T ss_pred CCCc--Ceeeeccccccccccccc--cC---CCCCCccccccchhcccC Confidence 9884 55 566666654332111 00 111111111111111111 No 21 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=100.00 E-value=4.8e-83 Score=471.87 Aligned_cols=401 Identities=11% Similarity=0.046 Sum_probs=285.9 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) |||++||.. +....+.. 
...++....+..........+....++++++|++||++||++||+|||++|| T Consensus 1 Mg~~~~~~~------~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~lp~~~~~ 69 (423) T protein:vir:81 1 MGFLQKLGL------APSVVATP-----EPIELVGPIFESLKLSTKNMTVEQIWEDQPHLRTVTTFIARNVASLQLQAFE 69 (423) T ss_pred CchhHhhcc------ccccccCc-----cccccccccccccccccchhhHHHHHHhhhHHHHHHHHHHHhHhhCceEEEE Confidence 999999632 11111111 1111222223333333334455667788999999999999999999999999 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCccc-ceeeeccCCCcceeeecC-- Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDS-GSFDINTARVGKIMQFFP-- 157 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~-- 157 (447) ++++|+.+++.+|++++||+ +||++||+++||+.++.+++++||||+++.++..+... ..+.+.+.....+..... T Consensus 70 ~~~dg~~~~~~~~~~~~ll~-~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~~~~~~l~p~~~~~v~~~~~~~~~ 148 (423) T protein:vir:81 70 RVEDGGRERVREGHLARVCK-LANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVDTPTLDIRPIPVSWVQRRAYKDGW 148 (423) T ss_pred EecCCceeeeccchHHHHhh-cCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcCcceEEEeecccceeeeeeccCCC Confidence 99999888889999999996 89999999999999999999999999999887543221 223333333333333222 Q ss_pred CceEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHH-----HHHhhcCcccceeeeCCcC Q lcl|NC_010576. 158 RQVMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQ-----DNRASSGKLNGFIQFPYST 232 (447) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~n~~~~~gvl~~~~~~ 232 (447) +...+.+.......+..+.+++++|||++.+.......+.+.+..+...+....+. ..+.||+.++|||+++... 
T Consensus 149 ~~~~Y~~~~~~~~~g~~~~~~~~evih~r~~~~~~~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~ 228 (423) T protein:vir:81 149 GSLDYIIIESGDNDGRSVKVPGERVIHRHGYNPKTMKRGKSPVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPES 228 (423) T ss_pred cceEEEEEEecCCCceEEEEcccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcc Confidence 23344443333334555678899999999764433222334444444444433222 3357899999999887543 Q ss_pred Ch-HHHHHHHHHHHHHHHHHh---ccCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhc----C Q lcl|NC_010576. 233 KS-TARAAQAARRKQEIENEM---ANNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILN----G 303 (447) Q Consensus 233 ~~-~~~~~~~~~~~~~~~~~~---~~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~----g 303 (447) .+ ...++++++++++|.+.+ .+|+|+++||++|++|+++++++++++ ++.+++++++||++|||||++|| + T Consensus 229 ~~~~l~~e~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~ 308 (423) T protein:vir:81 229 KAGKWDAESRTRFMANLRASFSPKSSDVGGTLLLEDGMKAENFHTTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNA 308 (423) T ss_pred cCccCCHHHHHHHHHHHHHHhccccccCCcceecCCCceEEeccCChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCC Confidence 22 123445556666666554 347799999999999999999998865 68899999999999999999996 2 Q ss_pred C--cHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHh-cCCceEEEecchhhhcCHHHHHHHHHHHHh-CCCcCHHHHH Q lcl|NC_010576. 304 T--ANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAV-SQGQVLVYYRNPFKLVPVEQLATVADVLTR-NAIYTPNEIR 379 (447) Q Consensus 304 ~--~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~-~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~-~G~~t~NE~R 379 (447) + +.|++.++|+++||.||+++||++|+++|+++.+. ..|++|+||.+.|+|+|+++|++++.+++. 
.||||+||+| T Consensus 309 t~sn~e~~~~~f~~~~L~P~~~~ie~~l~~~L~~~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~~T~NE~R 388 (423) T protein:vir:81 309 NYSNVREFRKALYGDNLGSWIRIIQDVMNLFLLPRVGIDNEKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAWMTINEVR 388 (423) T ss_pred CcccHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCccccccCccEEEecchhhhccCHHHHHHHHHHHHhCCCCcCHHHHH Confidence 2 34899999999999999999999999999998765 468999999999999999999999999885 6999999999 Q ss_pred HHhCCCCCCCcccccc-ccccccchhhcccccCCCCCCCCCCCcCCCCCCCcccc Q lcl|NC_010576. 380 ELTGKAPHPNPLANEL-FNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVST 433 (447) Q Consensus 380 ~~~gl~p~~g~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (447) +++|+||+||+ |.+ ++.|+.+..+... +++. +++ T Consensus 389 ~~~gl~p~~gG--D~~~~p~n~~~~~~~~~----~~~~--------------~~t 423 (423) T protein:vir:81 389 AMDNLPSIDGG--DDLARPLNTEFGDSEDA----PGEE--------------VET 423 (423) T ss_pred HHhCCCCCCCc--ceeecccccccCccCCC----CCCC--------------CCC Confidence 99999999984 554 4555443221111 0100 000 No 22 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=100.00 E-value=6.9e-83 Score=471.03 Aligned_cols=414 Identities=12% Similarity=0.015 Sum_probs=294.1 Q ss_pred CchhHhhhhhcccccCCcccc----ccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCce Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQN----QNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDF 76 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~ 76 (447) |. ..|.+.....+...+-. 
...+...+..+++..++|..+.++..+ +...++++++|++||++||++||+||| T Consensus 1 ~~--~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~v-~~~~al~~~~V~~~i~~ia~~ia~lp~ 77 (434) T protein:vir:43 1 MS--KSLGKVLSSATSAPRSSLFGWGGKTIRLTDGAFWSQFLGRESSSGKKV-TVDKAMKLSAVWACVRLISTSVAGLPL 77 (434) T ss_pred Cc--cchhhhhhhcccccchhhhcccccccccCchHHHHHHhcCCccCCcee-chhhhhccHHHHHHHHHHHHhhhhCce Confidence 32 11112111111111000 001111112222233445544454554 456789999999999999999999999 Q ss_pred EEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeec Q lcl|NC_010576. 77 KHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFF 156 (447) Q Consensus 77 ~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (447) ++||++.+|+..+..+|++++||+.+||++||+++||+.++.+++++||+|+++.++. +.+..+++ +++....+.... T Consensus 78 ~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~~-G~~~~L~~-l~p~~v~~~~~~ 155 (434) T protein:vir:43 78 GVYERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRAA-GRPAALDF-LLPSRVDLECDE 155 (434) T ss_pred EEEEEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeCC-CcEEEEEE-EcCcceEEEEcC Confidence 9999999998888899999999999999999999999999999999999999988764 44444444 444443444333 Q ss_pred CCceEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHH-----HHHHHhhcCcccceeeeCCc Q lcl|NC_010576. 157 PRQVMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMN-----SQDNRASSGKLNGFIQFPYS 231 (447) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~n~~~~~gvl~~~~~ 231 (447) .+...+.++ ...+....++.++|+|++++..++... .+.+..+...+.... +...+.||++++|+|++++. 
T Consensus 156 ~g~~~y~~~---~~~g~~~~~~~~eVih~~~~~~dg~~G-~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~ 231 (434) T protein:vir:43 156 NGRLKYFYT---TKKGARREIERTNMLHIPAFTLDGRIG-LSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDRI 231 (434) T ss_pred CCeEEEEEE---ecCceEEEEccccEEEecCcCCCCccc-cCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCCC Confidence 333333322 233445678899999999765443322 233344444443322 23335789999999999998 Q ss_pred CChHHHHHHHHHHHHHHHHHh-ccCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC------ Q lcl|NC_010576. 232 TKSTARAAQAARRKQEIENEM-ANNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG------ 303 (447) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~-~~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g------ 303 (447) +++++.+ ++++.|++.. ..|+|+++||++|++|+++++++.+++ ++.+++++++||++|||||++||. T Consensus 232 l~~e~~~----~~r~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~ 307 (434) T protein:vir:43 232 LQPAQRE----EFREYVKSVSGAMNSGRSPVLEQGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSN 307 (434) T ss_pred CCHHHHH----HHHHHHHHhcCccccCCccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCcc Confidence 8866543 4455554433 347899999999999999999998865 688999999999999999999962 Q ss_pred --CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHH Q lcl|NC_010576. 304 --TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIREL 381 (447) Q Consensus 304 --~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~ 381 (447) ++.|++.++|+++||.||+.+||++||++||++.++. 
+++|+||++.|+|+|.++|++++.+++++|+||+||+|++ T Consensus 308 ~~s~~e~~~~~f~~~~L~P~~~~ie~~ln~kL~~~~~~~-~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~ 386 (434) T protein:vir:43 308 WGTGLEQQMLAFLTFSISSITNQIQQCVNKRLLTAPERI-RYYAEFSLEGFLKADSAGRAAWYSTMAQNGFMTRNEGRRK 386 (434) T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhhc-CceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 3458999999999999999999999999999998864 7899999999999999999999999999999999999999 Q ss_pred hCCCCCCCccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCCCCcc Q lcl|NC_010576. 382 TGKAPHPNPLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNV 431 (447) Q Consensus 382 ~gl~p~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (447) +|+||+|| ||. +++.|+++....++...+..... ...+.++.|...| T Consensus 387 ~gl~p~~g--gD~~~~~~n~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 434 (434) T protein:vir:43 387 ENLPELPG--GDILTVQSNLVPIDQLGQSNKSQAVRA-ALMNWFSQPEPQE 434 (434) T ss_pred hCCCCCCC--CCeEeeccCccchhhhhccCCCcchhh-hhhccCCCCCCCC Confidence 99999988 455 67888888776554333222211 1111122222222 No 23 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=100.00 E-value=4.8e-82 Score=466.41 Aligned_cols=427 Identities=12% Similarity=0.064 Sum_probs=290.8 Q ss_pred CchhHhhhhhcccccCCccccccc---cccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNT---NDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFK 77 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~ 77 (447) |=+ -......+.+. ..++...+.+.+..+.. 
.+...-.....|+++++|++||++||++||+|||+ T Consensus 1 ~~~----------~~~~~~~~p~~~e~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~a~~~~~V~acV~~IA~~iA~lpl~ 69 (518) T protein:vir:10 1 MLL----------ANGQTLSAPAMAELSPQMQDSYYYAPAVGMQ-LERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVK 69 (518) T ss_pred Ccc----------cCceeecCchhhhhhhhhhccccccccccee-cccccchhhHHHhhhHHHHHHHHHHHHhhccCceE Confidence 211 11111111111 11111111111111111 11111223356889999999999999999999999 Q ss_pred EEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeee-c Q lcl|NC_010576. 78 HLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQF-F 156 (447) Q Consensus 78 ~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 156 (447) +||++.++.. +..+|+++. |+.+||++||+++||+.++.+++++||||+++.++..+.+..++++.+..+ .+... . T Consensus 70 l~~~~~~~~~-~~~~~~~~~-Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~v-~v~~~~~ 146 (518) T protein:vir:10 70 CMFTSGDTET-EESDTGYAK-LLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRV-AIKRNSR 146 (518) T ss_pred EEEEcCCCce-eccchHHHH-HHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCCce-EEEEcCC Confidence 9999887754 445666655 556999999999999999999999999999999988776555455444433 33222 2 Q ss_pred CCceEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCCc Q lcl|NC_010576. 157 PRQVMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPYS 231 (447) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~ 231 (447) .+...+.+..........+.++.++|||++.+.......+.+.+..+...+....+ ...+.||+.++|||++++. 
T Consensus 147 ~~~~~y~~~~~~~~~~~~~~~~~~eViHir~~s~dg~~~G~spi~~a~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~ 226 (518) T protein:vir:10 147 TGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKR 226 (518) T ss_pred CCEEEEEEEecCCccceEEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCC Confidence 23333333322222334467889999999976543322222333444444333322 2335789999999999998 Q ss_pred CChHHHHHHHHHHHHHHHHHhc--cCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhc----CC Q lcl|NC_010576. 232 TKSTARAAQAARRKQEIENEMA--NNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILN----GT 304 (447) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~--~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~----g~ 304 (447) +++++ ++++++.|++.++ .|+|+++||++|++|+++++++.+++ ++.+++++++||++|||||++|| ++ T Consensus 227 ls~e~----~~~~k~~~~~~~~G~~nag~v~vL~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t 302 (518) T protein:vir:10 227 LSEAA----QQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRAT 302 (518) T ss_pred CCHHH----HHHHHHHHHHHhcCccccCcceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCC Confidence 87654 4456666666665 47899999999999999999988864 78899999999999999999996 33 Q ss_pred --cHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh Q lcl|NC_010576. 
305 --ANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELT 382 (447) Q Consensus 305 --~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~ 382 (447) +.|++.++||++||.||+.+||++||++|++..+ .+++|+||++.|+++|.+++++++.+++++|+||+||+|+++ T Consensus 303 ~sn~eq~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~--~~~~~~fd~~~llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~ 380 (518) T protein:vir:10 303 FSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWV--RKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIM 380 (518) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--CCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh Confidence 3489999999999999999999999999998754 478999999999999999999999999999999999999999 Q ss_pred CCCCCCCccccc-cccccccchhhcccccCCCCCCCCC-CCcCC-----CC--CCCcccccccCCccCcCCCCC Q lcl|NC_010576. 383 GKAPHPNPLANE-LFNRNIADGNQVGGINTPGQITSDQ-PATAS-----TD--PLNNVSTSAIENGSLTDGGSY 447 (447) Q Consensus 383 gl~p~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-----~~--~~~~~~~~~~~~~~~~~~~~~ 447 (447) |+||++++++|. |++.|+.+.....+....+++.+.. ++... +. +.......++.++...++|.- T Consensus 381 Gl~pie~~~gD~~~~~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (518) T protein:vir:10 381 GLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKT 454 (518) T ss_pred CCCCCCCCCCCeeeecccceecccccccccCCCCCCCCCCCCccccccccccccccCCCCCccccccccccccc Confidence 999999888887 6788888776544433333322211 11111 11 111112223344444555555 No 24 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=100.00 E-value=5.9e-82 Score=465.93 Aligned_cols=430 Identities=12% Similarity=0.055 Sum_probs=287.8 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 
1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) |=+ -+-+..-.+.......++...+++.+..+. ..+...-.....|+++++||+||++||++||+|||++|| T Consensus 1 ~~~-------~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~-~~~~~~~~~~~~~~~~~~V~acV~~IA~~iA~lp~~l~~ 72 (518) T protein:vir:78 1 MLL-------ANGQTLSAPAMAELSPQMQDSYYYAPAVGM-QLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMF 72 (518) T ss_pred Ccc-------cCceeeccchhhhhhhhhhhcccccceece-ecccccchhhHHhhhhHHHHHHHHHHHHhhccCceEEEE Confidence 211 111111111111111222211111111111 111111223356889999999999999999999999999 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeee-cCCc Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQF-FPRQ 159 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 159 (447) +++++.. +..+|++ .+|+.+||++||+++||+.++.+++++||+|+++.++..+.+..++++.+..+ .+... ..+. T Consensus 73 ~~~~~~~-~~~~~~~-~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~V-tv~~~~~~~~ 149 (518) T protein:vir:78 73 TSGDTET-EEHDTGY-AKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRV-AIKRNSRTGR 149 (518) T ss_pred EcCCccc-cccchHH-HHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEECCCce-EEEEcCCCCE Confidence 8776654 3345554 55567999999999999999999999999999999988776555455444433 33322 2223 Q ss_pred eEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCCcCCh Q lcl|NC_010576. 
160 VMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPYSTKS 234 (447) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~~~~ 234 (447) ..+.+..........+.++.++|||++.+.......+.+.+..+...+....+ ...+.||++|+|||++++.+++ T Consensus 150 ~~y~~~~~~~~~~~~~~~~~~eIiHir~~~~dg~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~~ls~ 229 (518) T protein:vir:78 150 YEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSP 229 (518) T ss_pred EEEEEEecCCccceeEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCH Confidence 33333222222334566889999999976433322222333444444333322 2335789999999999998876 Q ss_pred HHHHHHHHHHHHHHHHHhcc--CCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhc----CC--c Q lcl|NC_010576. 235 TARAAQAARRKQEIENEMAN--NKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILN----GT--A 305 (447) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~--n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~----g~--~ 305 (447) ++. +++++.|++.+++ |+|+++||++|++|+++++++.+++ ++.+++++++||++|||||++|| ++ + T Consensus 230 e~~----~~~k~~~~~~~~G~~nag~~~vL~~G~~~~~l~~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~st~sn 305 (518) T protein:vir:78 230 EAQ----QRLREQFDRAHAGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSN 305 (518) T ss_pred HHH----HHHHHHHHHHhcCcccCCceeEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchh Confidence 544 4566666666554 7899999999999999999988765 68899999999999999999997 23 3 Q ss_pred HHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC Q lcl|NC_010576. 
306 NEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKA 385 (447) Q Consensus 306 ~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~ 385 (447) .|++.++||++||.||+.+||++||++|+++.+ .+++++||++.|+++|.++|++++.+++++|+||+||+|+++||| T Consensus 306 ~e~~~~~f~~~tL~P~~~~ie~eln~~L~~~~~--~~~~~~fd~~~Llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~gl~ 383 (518) T protein:vir:78 306 ISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWV--RKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLP 383 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--CcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC Confidence 489999999999999999999999999998644 478999999999999999999999999999999999999999999 Q ss_pred CCCCccccc-cccccccchhhcccccCCCCCCCCC-CCcCCCC----CCCccccc---ccCCccCcCCCCC Q lcl|NC_010576. 386 PHPNPLANE-LFNRNIADGNQVGGINTPGQITSDQ-PATASTD----PLNNVSTS---AIENGSLTDGGSY 447 (447) Q Consensus 386 p~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~----~~~~~~~~---~~~~~~~~~~~~~ 447 (447) |++++++|. |++.++.+.....+....+++.+.. ++..... ...++... ++.++...++|.- T Consensus 384 pie~~~gD~~~v~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (518) T protein:vir:78 384 RSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPASTPVASLDQSPPASVPGLSPTNSDRSTDSGKT 454 (518) T ss_pred CCCCCCCceeeecccceecccccccccCCCCCCCCCCCCcccccccccCccccCCCCCccccccccccccc Confidence 999888887 5788888876554433333322211 1111111 01111111 2222222333332 No 25 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=100.00 E-value=3.6e-82 Score=467.06 Aligned_cols=398 Identities=13% Similarity=0.092 Sum_probs=286.1 Q ss_pred hhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEEEcCCCce Q lcl|NC_010576. 
8 LHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLKIDPISGN 87 (447) Q Consensus 8 ~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r~~~~~~~ 87 (447) |-+.++|+++...... ++..+...++++.....+..++...++++++|++||++||++||++||++||.++++ . T Consensus 1 ~~f~~~f~r~~~~~~~-----~~~~~~~~~~~~~~~~~g~~v~~~~~l~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~-~ 74 (413) T protein:vir:48 1 MFFSGLFQRKSDAPVT-----TPAELAEAIGLSYDTYTGKRISSQRAMRLTAVYSCVRVLAESVGMLPCSLYKISGTL-K 74 (413) T ss_pred CccchhhccCccCCcc-----chHHHHHhhhcCcccccCceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCc-c Confidence 3344567765432211 122222222222222222334556788999999999999999999999999987655 4 Q ss_pred eccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCceEEEEeee Q lcl|NC_010576. 88 QTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQVMVRVWND 167 (447) Q Consensus 88 ~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 167 (447) .++.+|++++||+.+||++||+++||+.++.+++++||||+++.++. +.+. .+++.+.....+.....+...+. . T Consensus 75 ~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~-g~~~-~L~~l~~~~v~~~~~~~~~~~y~---~ 149 (413) T protein:vir:48 75 TRVVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKAL-GEVV-ELLPIDPGCVEPKLNSQWQPVYQ---V 149 (413) T ss_pred eeecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeCC-CcEE-EEEEEcCceEEEEEcCCceEEEE---E Confidence 56789999999999999999999999999999999999999998874 3333 34444444433333222222222 2 Q ss_pred cccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCCcCChHHHHHHHH Q lcl|NC_010576. 168 NTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPYSTKSTARAAQAA 242 (447) Q Consensus 168 ~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~ 242 (447) ....+....++.++|+|++.+..+... 
+.+.+..+...+....+ ...+.||+.|+|+|++++.+++++.+ T Consensus 150 ~~~~g~~~~~~~~evih~~~~~~d~~~-G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~~~~e~~~---- 224 (413) T protein:vir:48 150 TFPDGSVDVLTQDEIWHVRTLTLDGLV-GLNPIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTEQKLTPDAYE---- 224 (413) T ss_pred EecCceEEEEccccEEEecCcCCCCcc-cccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHH---- Confidence 222334456889999999975433322 12333444444443322 23356899999999999988876554 Q ss_pred HHHHHHHHHhc--cCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC----C--cHHHHHHHH Q lcl|NC_010576. 243 RRKQEIENEMA--NNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG----T--ANEQQTLGY 313 (447) Q Consensus 243 ~~~~~~~~~~~--~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g----~--~~e~~~~~f 313 (447) ++++.|.+.+. +|+|+++++++|++|+++++++.+++ ++.+++++++||++|||||++|++ + +.|++.+.| T Consensus 225 ~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f 304 (413) T protein:vir:48 225 RLKKDFEERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGF 304 (413) T ss_pred HHHHHHHHHhcCccccCcceecCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHHHHH Confidence 44555555544 47899999999999999999998865 688999999999999999999973 2 348999999 Q ss_pred HHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcccc Q lcl|NC_010576. 
314 YNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLAN 393 (447) Q Consensus 314 ~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~ 393 (447) +++||.||+++||++||++||++.++ .+++|+||++.|+++|.+++++++++++++|+||+||+|+++|+||+||+ | T Consensus 305 ~~~~i~P~~~~ie~~l~~~L~~~~~~-~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~gg--D 381 (413) T protein:vir:48 305 INYSLVPYLTRIEQRINTGLVRESKQ-GKFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGG--D 381 (413) T ss_pred HHHHHHHHHHHHHHHHHhhccCcccc-CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc--c Confidence 99999999999999999999998875 48999999999999999999999999999999999999999999999884 5 Q ss_pred c-cccccccchhhcccccCCCCCCCCCCCcCCCCCCCcccccc Q lcl|NC_010576. 394 E-LFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTSA 435 (447) Q Consensus 394 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (447) . +++.++.+..+.++...+..+..+.+ ++++ T Consensus 382 ~~~~~~n~~~~~~~~~~~~~~~~~~~~~-----------~~~~ 413 (413) T protein:vir:48 382 VYLTPMNMTTSPSAGDDNGKKKESGDAD-----------KTAS 413 (413) T ss_pred eeeccccccccccccccCCCCCCCCCcc-----------ccCC Confidence 5 56777665544333222111111111 1111 No 26 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=100.00 E-value=5.6e-82 Score=466.04 Aligned_cols=423 Identities=13% Similarity=0.118 Sum_probs=290.0 Q ss_pred hcccccCCccccccccccccccccc------cccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEEEcC Q lcl|NC_010576. 10 SWNAFQSNQNQNQNTNDFLTPSNGM------TSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLKIDP 83 (447) Q Consensus 10 ~~~~f~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r~~~ 83 (447) .|++|++... ++.+........|. ...+++.+.++.. 
++...++++++|++||++||++||+|||++||+++ T Consensus 1 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~g~~-v~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~ 78 (454) T protein:vir:93 1 MWNLLRRTRK-NQKSGRDVREAGWTSLFQAVAEPFAGAWQQGVK-ADPEAVLSFHAVFACISLISQDIAKMRLRLMQTDA 78 (454) T ss_pred CCCccccCcc-cccccccccchhhhhhhhhhhhhhcchhhcCcc-cChHHhhccHHHHHHHHHHHHhhccCceEEEEecc Confidence 4555544221 12222122222221 1123333444444 44567899999999999999999999999999988 Q ss_pred CCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCceEEE Q lcl|NC_010576. 84 ISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQVMVR 163 (447) Q Consensus 84 ~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 163 (447) +|...+..+|+++ +|+.+||++||+++||+.++.+++++||||+++.++..+.+..++++ +.....+.....+.+.++ T Consensus 79 ~g~~~~~~~~~~~-~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~~L~~i-~~~~v~v~~~~~g~~~y~ 156 (454) T protein:vir:93 79 QGIRRETRRGDIA-RLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKELRIL-DWNRVEPLVADDGEVFYR 156 (454) T ss_pred CCccchhhhHHHH-HHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEE-cCcceEEEEcCCCcEEEE Confidence 8877766666555 55579999999999999999999999999999999887765554444 444444544444555555 Q ss_pred Eeeec-ccccceeeeccccccccccc-ccccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCCcCChHH Q lcl|NC_010576. 164 VWNDN-TGLEQDLLVSKENCIIIESP-FYAILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPYSTKSTA 236 (447) Q Consensus 164 ~~~~~-~~~~~~~~~~~~~v~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~~~~~~ 236 (447) +.... .+......++.++|+|++.+ ..+... 
+.+.+..+...+....+ ...+.||++++|||++++.+++++ T Consensus 157 ~~~~~~~~~~~~~~~~~~eViH~k~~~~~~~~~-G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~ 235 (454) T protein:vir:93 157 ITPDRNCGITEAVTVPAREVIHDRFNCFFHPLI-GLPPVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIPGSITEEN 235 (454) T ss_pred EEeccccccceeEEecCcceEEeccCCCCCCce-eccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCCCCHHH Confidence 44332 23344567889999999843 222211 12233333333333322 233578999999999999887765 Q ss_pred HHHHHHHHHHHHHHHhcc-CCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC------CcHHH Q lcl|NC_010576. 237 RAAQAARRKQEIENEMAN-NKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG------TANEQ 308 (447) Q Consensus 237 ~~~~~~~~~~~~~~~~~~-n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g------~~~e~ 308 (447) .+ ++++.|++.+.+ |+|+++||++|++|+++++++++++ ++.+++++++||++|||||++||. ++.|+ T Consensus 236 ~~----~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~ 311 (454) T protein:vir:93 236 AK----KLKSNWDSGYTGENAGKTAILSNGAKYNPTTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEA 311 (454) T ss_pred HH----HHHHHHHHHhcccccCCceeccCCceEEEcccChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHH Confidence 44 455556555554 7899999999999999999998865 788999999999999999999973 23589 Q ss_pred HHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Q lcl|NC_010576. 309 QTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHP 388 (447) Q Consensus 309 ~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ 388 (447) +.++|+++||.||++.||++||++|++. 
.+++|+||++.|+++|.++|++++.+++++|+||+||+|+++|+||++ T Consensus 312 ~~~~f~~~~l~P~~~~ie~~ln~~L~~~----~~~~~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ 387 (454) T protein:vir:93 312 LEQQYYSQCLQTLIESIELLLDEALETG----ENESTEFDVTTLLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLA 387 (454) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCC----CCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 9999999999999999999999999974 467999999999999999999999999999999999999999999999 Q ss_pred Cccccc-cccccccchhhcccccCCCCC--CCCCCCcCCCC---CCCcccccccCCcc--CcCCCCC Q lcl|NC_010576. 389 NPLANE-LFNRNIADGNQVGGINTPGQI--TSDQPATASTD---PLNNVSTSAIENGS--LTDGGSY 447 (447) Q Consensus 389 g~~~~~-~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~---~~~~~~~~~~~~~~--~~~~~~~ 447 (447) |+ |. |++.++.+....++.....+. ..+++.++..+ ...+++.++.+.++ +.=.|-+ T Consensus 388 gg--D~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~e~~~d~~~~~~~~~~ 452 (454) T protein:vir:93 388 GG--DALYLQQQNYSLEALSRRDAREDPFASSGKTASVPQAVAASDGNKAITETEHDAVKAMFRGIL 452 (454) T ss_pred CC--CeeeeccCccchHhhhccCcccCCCCCCccCCCCCCCCCCCCCCCCccCCccchhhhhhhhhh Confidence 85 44 677777766554332221111 11111111111 11111111111111 1111222 No 27 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=100.00 E-value=1.6e-81 Score=463.49 Aligned_cols=412 Identities=12% Similarity=0.068 Sum_probs=287.6 Q ss_pred CchhHhh-----hhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCc Q lcl|NC_010576. 1 MASSDRL-----LHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVD 75 (447) Q Consensus 1 Mg~~~~l-----~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp 75 (447) |+|++|- |.++++|.+.+.+... .+......|...+.+........ 
++...++++++||+||++||++||+|| T Consensus 11 ~~~~~~~~~~~~~~~~~lf~~~e~R~~~-~~~~~~~~~~~~~~~~~~~~~~~-~~~~~al~~~~V~~cv~~Ia~~iA~lp 88 (441) T protein:vir:94 11 VDFKSRKQSRKELVVVGIFYKNEKRDLQ-YNEDDLQMMVQTLPGFQGTKLRQ-YKDIEAIRHSDIFTAVMMIASDLARMP 88 (441) T ss_pred ccccccccchhhhhcccccccccccccc-CCCcchHHHHHHhcccCcccccc-cchhhhhccHHHHHHHHHHHHhhccCc Confidence 8888873 4566677554443211 11112222222222322233333 444567899999999999999999999 Q ss_pred eEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeee Q lcl|NC_010576. 76 FKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQF 155 (447) Q Consensus 76 ~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (447) |++|+. + +...+|++++||+.+||++||+++||+.++.+++++||||+++.++..+.+..++++ +.....+... T Consensus 89 ~~~~~~---~--~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i-~~~~v~v~~d 162 (441) T protein:vir:94 89 IRVTVN---G--QINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFR-KTSEIELKSD 162 (441) T ss_pred eeeecC---c--cccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEE-cCceeEEEEC Confidence 999863 2 234689999999999999999999999999999999999999999877665544444 4444344433 Q ss_pred cCCceEEEEeee-cccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHH-----HHHhhcCcccceeeeC Q lcl|NC_010576. 156 FPRQVMVRVWND-NTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQ-----DNRASSGKLNGFIQFP 229 (447) Q Consensus 156 ~~~~~~~~~~~~-~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~n~~~~~gvl~~~ 229 (447) ..+.+.+.++.. ..+......++.++|+|++.+..++.. +.+.+..+..++....+. 
..+.||++|+|||+++ T Consensus 163 ~~g~~~~~~~~~~~~~~~~~~~~~~~dvih~k~~~~dg~~-G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~ 241 (441) T protein:vir:94 163 ARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYSLDGIN-GLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK 241 (441) T ss_pred CCccEEEEEEEeccCCceeEEEEccccEEEeccCCCCCcc-ccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcC Confidence 333443333221 122334456889999999975433221 223444445444443332 2357899999999999 Q ss_pred CcCChHHHHHHHHHHHHHHHHHhcc--CCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcCC-- Q lcl|NC_010576. 230 YSTKSTARAAQAARRKQEIENEMAN--NKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNGT-- 304 (447) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~--n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g~-- 304 (447) +.+.++ ++++++++.|.+.+.+ |+|+++||++|++|+++++++++++ ++.+++++++||++|||||++||.+ T Consensus 242 ~~~~~~---e~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~ 318 (441) T protein:vir:94 242 GVLDNK---KARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA 318 (441) T ss_pred CCCCCH---HHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCC Confidence 887653 3445566677776654 7899999999999999999998865 6889999999999999999999742 Q ss_pred --cHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh Q lcl|NC_010576. 305 --ANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELT 382 (447) Q Consensus 305 --~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~ 382 (447) +.+++.. +|.+||.||+++||++||++|+++. 
.+++|+||++.|+++|.++|++++.+++++|+||+||+|+++ T Consensus 319 ~~s~~q~~~-~~~~tl~P~~~~ie~eln~kl~~~~---~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~ 394 (441) T protein:vir:94 319 NMSITDANL-DYLSTLKPYITCVCAELNFKFNDEY---VNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRD 394 (441) T ss_pred CccHHHHHH-HHHHHHHHHHHHHHHHHhhhccccc---cCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh Confidence 2355544 4567999999999999999998753 478999999999999999999999999999999999999999 Q ss_pred CCCCCCCccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCCCCcc Q lcl|NC_010576. 383 GKAPHPNPLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNV 431 (447) Q Consensus 383 gl~p~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (447) |+||++||+.+. +++.++++....+....+ +......+..+. .++| T Consensus 395 gl~Pi~ggd~~~~~~~~n~~~~~~~~~~~~~-~~~~~~~~~kgG--e~~e 441 (441) T protein:vir:94 395 GLAPIPGGNGSIHRVDLNHVNIELVDEYQMN-KSRATDKKLKGG--EENE 441 (441) T ss_pred CCCCCCCCCcceEeecccccccccccccccc-cccccccccCCC--CCCC Confidence 999999987655 467777776544221100 000000111111 1111 No 28 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=100.00 E-value=1.6e-81 Score=463.49 Aligned_cols=412 Identities=12% Similarity=0.068 Sum_probs=287.6 Q ss_pred CchhHhh-----hhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCc Q lcl|NC_010576. 1 MASSDRL-----LHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVD 75 (447) Q Consensus 1 Mg~~~~l-----~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp 75 (447) |+|++|- |.++++|.+.+.+... .+......|...+.+........ 
++...++++++||+||++||++||+|| T Consensus 11 ~~~~~~~~~~~~~~~~~lf~~~e~R~~~-~~~~~~~~~~~~~~~~~~~~~~~-~~~~~al~~~~V~~cv~~Ia~~iA~lp 88 (441) T protein:vir:79 11 VDFKSRKQSRKELVVVGIFYKNEKRDLQ-YNEDDLQMMVQTLPGFQGTKLRQ-YKDIEAIRHSDIFTAVMMIASDLARMP 88 (441) T ss_pred ccccccccchhhhhcccccccccccccc-CCCcchHHHHHHhcccCcccccc-cchhhhhccHHHHHHHHHHHHhhccCc Confidence 8888873 4566677554443211 11112222222222322233333 444567899999999999999999999 Q ss_pred eEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeee Q lcl|NC_010576. 76 FKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQF 155 (447) Q Consensus 76 ~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (447) |++|+. + +...+|++++||+.+||++||+++||+.++.+++++||||+++.++..+.+..++++ +.....+... T Consensus 89 ~~~~~~---~--~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i-~~~~v~v~~d 162 (441) T protein:vir:79 89 IRVTVN---G--QINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFR-KTSEIELKSD 162 (441) T ss_pred eeeecC---c--cccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEE-cCceeEEEEC Confidence 999863 2 234689999999999999999999999999999999999999999877665544444 4444344433 Q ss_pred cCCceEEEEeee-cccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHH-----HHHhhcCcccceeeeC Q lcl|NC_010576. 156 FPRQVMVRVWND-NTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQ-----DNRASSGKLNGFIQFP 229 (447) Q Consensus 156 ~~~~~~~~~~~~-~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~n~~~~~gvl~~~ 229 (447) ..+.+.+.++.. ..+......++.++|+|++.+..++.. +.+.+..+..++....+. 
..+.||++|+|||+++ T Consensus 163 ~~g~~~~~~~~~~~~~~~~~~~~~~~dvih~k~~~~dg~~-G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~ 241 (441) T protein:vir:79 163 ARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYSLDGIN-GLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK 241 (441) T ss_pred CCccEEEEEEEeccCCceeEEEEccccEEEeccCCCCCcc-ccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcC Confidence 333443333221 122334456889999999975433221 223444445444443332 2357899999999999 Q ss_pred CcCChHHHHHHHHHHHHHHHHHhcc--CCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcCC-- Q lcl|NC_010576. 230 YSTKSTARAAQAARRKQEIENEMAN--NKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNGT-- 304 (447) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~--n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g~-- 304 (447) +.+.++ ++++++++.|.+.+.+ |+|+++||++|++|+++++++++++ ++.+++++++||++|||||++||.+ T Consensus 242 ~~~~~~---e~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~ 318 (441) T protein:vir:79 242 GVLDNK---KARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA 318 (441) T ss_pred CCCCCH---HHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCC Confidence 887653 3445566677776654 7899999999999999999998865 6889999999999999999999742 Q ss_pred --cHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh Q lcl|NC_010576. 305 --ANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELT 382 (447) Q Consensus 305 --~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~ 382 (447) +.+++.. +|.+||.||+++||++||++|+++. 
.+++|+||++.|+++|.++|++++.+++++|+||+||+|+++ T Consensus 319 ~~s~~q~~~-~~~~tl~P~~~~ie~eln~kl~~~~---~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~ 394 (441) T protein:vir:79 319 NMSITDANL-DYLSTLKPYITCVCAELNFKFNDEY---VNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRD 394 (441) T ss_pred CccHHHHHH-HHHHHHHHHHHHHHHHHhhhccccc---cCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh Confidence 2355544 4567999999999999999998753 478999999999999999999999999999999999999999 Q ss_pred CCCCCCCccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCCCCcc Q lcl|NC_010576. 383 GKAPHPNPLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNV 431 (447) Q Consensus 383 gl~p~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (447) |+||++||+.+. +++.++++....+....+ +......+..+. .++| T Consensus 395 gl~Pi~ggd~~~~~~~~n~~~~~~~~~~~~~-~~~~~~~~~kgG--e~~e 441 (441) T protein:vir:79 395 GLAPIPGGNGSIHRVDLNHVNIELVDEYQMN-KSRATDKKLKGG--EENE 441 (441) T ss_pred CCCCCCCCCcceEeecccccccccccccccc-cccccccccCCC--CCCC Confidence 999999987655 467777776544221100 000000111111 1111 No 29 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=100.00 E-value=7.3e-82 Score=465.40 Aligned_cols=366 Identities=16% Similarity=0.215 Sum_probs=281.4 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||+++..+.+.+.... ... ++. 
+.....++++++|++||++||++||+|||+||| T Consensus 1 Mg~f~~~~~~~~~~~~~~-----~~~-------~~~-----------~~~~~~~~~~~~v~~~v~~IA~~iA~lp~~~~~ 57 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNND-----TQR-------VTA-----------WQNEAVEYTSAFVTNIHNKIANEITKVEFNHVK 57 (378) T ss_pred CCccccchhcccccccCC-----cce-------eee-----------eccchhHHHHHHHHHHHHHHHhhhhhCceeeEE Confidence 999998765432222111 000 000 011234568889999999999999999999999 Q ss_pred EcCCCce----eccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeec Q lcl|NC_010576. 81 IDPISGN----QTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFF 156 (447) Q Consensus 81 ~~~~~~~----~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (447) .+++|+. ....+|++++||+.+||++||+++||+.++.+++++||||+++.+++... +++..+ T Consensus 58 ~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~g-------------~~~~l~ 124 (378) T protein:vir:94 58 YKKSDVGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNTG-------------ELLDLL 124 (378) T ss_pred EcccCcccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCCc-------------eEEEEE Confidence 8777643 24568999999999999999999999999999999999999977654321 122222 Q ss_pred CCceEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHH Q lcl|NC_010576. 157 PRQVMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTA 236 (447) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~ 236 (447) +... ...++++||||+++|++... +.+++..+..++. 
.+..++.++|+|++++.+++++ T Consensus 125 p~~~-------------~~~~~~~diiH~~~~~~~~~--g~s~l~~~~~~i~------~~~~~~~~~gil~~~~~l~~~~ 183 (378) T protein:vir:94 125 FADD-------------KKEYKPEELVRLTSPFYINE--DTSILDNALASIQ------TKLEQGKLRGLLKINAFLDIDN 183 (378) T ss_pred ecCC-------------eeEeeeeeeEEecCcCCccc--hhHHHHHHHHHHH------HHHhcccccceeeeCCcCCHHH Confidence 1111 12456789999998865432 2334443333332 2345678999999999999999 Q ss_pred HHHHHHHHHHHHHHHhcc-CCcceeecCCCceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCcHHHHHHHHHH Q lcl|NC_010576. 237 RAAQAARRKQEIENEMAN-NKYGVATLDTQEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNGTANEQQTLGYYN 315 (447) Q Consensus 237 ~~~~~~~~~~~~~~~~~~-n~~~~~vl~~g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~e~~~~~f~~ 315 (447) .++++++|++.|++...+ ++++++||++|++|+++++++.+++++++++++++||++|||||++|+|+++|++.++||+ T Consensus 184 ~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVP~~~l~~~~se~~~~~f~~ 263 (378) T protein:vir:94 184 TQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENILLGTASQEQQIYFYN 263 (378) T ss_pred HHHHHHHHHHHHHHhhcccccccceecCCCceEEEccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCChHHHHHHHHHH Confidence 999999999999876554 7789999999999999999999988889999999999999999999999999999999999 Q ss_pred HHHhHHHHHHHHHHHhhcCChhHhcCCc------eEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Q lcl|NC_010576. 
316 RCVDVLLQYVTDAISRIALTKTAVSQGQ------VLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPN 389 (447) Q Consensus 316 ~ti~P~~~~ie~~l~~kLl~~~e~~~g~------~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g 389 (447) +||.||+++||++|+++||++.|+..|+ .++||++.|+++|+++|++++.+++++||||+||+|+++||||+|| T Consensus 264 ~tL~P~~~~ie~~l~~~Ll~~~er~~g~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~g 343 (378) T protein:vir:94 264 STIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEG 343 (378) T ss_pred HHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Confidence 9999999999999999999998887765 3789999999999999999999999999999999999999999998 Q ss_pred ccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCCCCcc Q lcl|NC_010576. 390 PLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNV 431 (447) Q Consensus 390 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (447) ||. +++.|+++....+.... ..++ ...++..+|| T Consensus 344 --GD~~~~~~n~~~~~~~~~~~~--~~~~----~~~~~e~~n~ 378 (378) T protein:vir:94 344 --GDVYIANLNAVAVKNLSDLQG--SRKD----VTSTDETNNQ 378 (378) T ss_pred --CCeeeecccccccccchhhcC--CcCC----CCCCCCCCCC Confidence 455 67888887654432211 1111 1111222333 No 30 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=100.00 E-value=3.3e-81 Score=461.84 Aligned_cols=402 Identities=11% Similarity=0.067 Sum_probs=289.1 Q ss_pred chhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEEE Q lcl|NC_010576. 2 ASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLKI 81 (447) Q Consensus 2 g~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r~ 81 (447) =||+ ++|+++.......... ...+...+++..+.++ ..+....++++++||+||++||++||+|||++||. 
T Consensus 1 m~~~------~~f~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~-~~v~~~~al~~~~v~~~i~~Ia~~ia~l~~~~~~~ 71 (416) T protein:vir:12 1 MLLE------RMFEKRSGSSDHEDGF--NNILLNMFGGRKTASG-ERVSESNSLVQPDIFACVNVLSDDIAKLPIHTYKR 71 (416) T ss_pred Cccc------hhcccccCccccCccc--hhHHHHhhcCcccccC-ceechhhhhccHHHHHHHHHHHHhhhhCceEEEEe Confidence 1333 4566665443222211 1111222333333333 44456678899999999999999999999999997 Q ss_pred cCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecC-Cce Q lcl|NC_010576. 82 DPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFP-RQV 160 (447) Q Consensus 82 ~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 160 (447) +++|. .++.+|+++++|+.+||++||+++||+.++.+++++||||+++.++..+.+..++++.+ ....++.... +.. T Consensus 72 ~~~~~-~~~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~-~~v~v~~~~~~~~~ 149 (416) T protein:vir:12 72 TDGGI-ERKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYPEALFPLRP-DYTNAYVHPTTGML 149 (416) T ss_pred cCCcc-ccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECC-cceEEEEeCCCcEE Confidence 66554 45678999999999999999999999999999999999999999988776555554444 4434433222 222 Q ss_pred EEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHH-----HHHhhcCcccceeeeCCcCChH Q lcl|NC_010576. 161 MVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQ-----DNRASSGKLNGFIQFPYSTKST 235 (447) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~n~~~~~gvl~~~~~~~~~ 235 (447) .+ .....+..+.++.++|+|++++..+... 
+.+.+..+...+....++ ..+.|++.|+|||+++..++++ T Consensus 150 ~~----~~~~~g~~~~~~~~eiih~~~~~~~~~~-G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e 224 (416) T protein:vir:12 150 WY----QTVLNGKAIELYDYEVLHFKGLSTDGIH-GKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVPAFLDEK 224 (416) T ss_pred EE----EEecCCeEEEecCccEEEecCcCCCCcc-cccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCCCCCHH Confidence 22 2222344567889999999975444322 223344444444443322 3357899999999999888765 Q ss_pred HHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC----C--cHHH Q lcl|NC_010576. 236 ARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG----T--ANEQ 308 (447) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g----~--~~e~ 308 (447) + ++++++.|++. .++++++||++|++|+++++++++++ ++.+++++++||++|||||++|++ + +.|+ T Consensus 225 ~----~~~~~~~~~~~--~~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~ 298 (416) T protein:vir:12 225 P----KENVRKEWKRV--NKVENIAIIDYGLEYQSISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKATFSNIEH 298 (416) T ss_pred H----HHHHHHHHHHH--hcCCCeeecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHH Confidence 4 45556666543 35688999999999999999998865 788999999999999999999963 2 3489 Q ss_pred HHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Q lcl|NC_010576. 
309 QTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHP 388 (447) Q Consensus 309 ~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ 388 (447) +.++|+++||.||+++||++||++||++.++..|++|+||++.|+++|.+++++++.+++++|+||+||+|+++|+||+| T Consensus 299 ~~~~f~~~~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~Pi~ 378 (416) T protein:vir:12 299 QSIEYVRNTLQPWIVNFEQELNVKLFLDHDQKSGHYVKFNIDSELRGDSKTQAEYLKTLHETGVLNKDEIRELLERNPIE 378 (416) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCchhhcCCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred Cccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCCCCccc Q lcl|NC_010576. 389 NPLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVS 432 (447) Q Consensus 389 g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 432 (447) | +|. +++.|+.+....+....... +.. ..+.++ ++++ T Consensus 379 g--gd~~~~~~n~~~~~~~~~~~~~~~---~~~-~~gge~-~~~g 416 (416) T protein:vir:12 379 N--GDKYISSLNYVFLDFLEEYQRLKA---GGA-MKGGDN-KNEG 416 (416) T ss_pred C--cceeeeccccccccccchhhcccc---ccc-cCCCCC-cCCC Confidence 8 455 56777776554322211100 000 001110 1111 No 31 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=100.00 E-value=2e-81 Score=463.06 Aligned_cols=391 Identities=15% Similarity=0.096 Sum_probs=287.4 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) |-| .+.|+++.. ..+. ....+...+|. 
+.++ .+++...++++++|++||++||++||+|||+||| T Consensus 1 m~f-------~~~~~~~~~---~~~~---~~~~~~~~~g~-~~~~-~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~ 65 (409) T protein:vir:10 1 MLF-------RKGFKNQSQ---EISI---DDKKILEWLGI-NPSE-TYVNGKSCLKQATVFGCIRILSDNISKLPIKIYQ 65 (409) T ss_pred Ccc-------cccccCcCC---CCCC---ChHHHHHHhcC-CcCc-ceechhhhhccHHHHHHHHHHHHhhhhCceEEEE Confidence 543 234444322 1111 11111112222 2222 3445567899999999999999999999999998 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) .+ +|+ .++.+|++++||+.+||++||+++||+.++.+++++||||+++.++..+....++++.+ ....+.....+.. T Consensus 66 ~~-~~~-~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~-~~V~v~~~~~~~~ 142 (409) T protein:vir:10 66 KK-DGI-KRVPDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIKGLYPLKS-DGMKIFVDDTGLL 142 (409) T ss_pred ec-CCe-eeccCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcC-CceEEEEcCCccc Confidence 64 333 45679999999999999999999999999999999999999999988877655444444 4333332211111 Q ss_pred ---EEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCCcC Q lcl|NC_010576. 161 ---MVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPYST 232 (447) Q Consensus 161 ---~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~~ 232 (447) ....|......+....+++++|+|++.+..+... 
+.+.+..+...+....+ ...+.||++++|||++++.+ T Consensus 143 ~~~~~~~y~~~~~~g~~~~~~~~evih~r~~~~d~~~-G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l 221 (409) T protein:vir:10 143 NSENNVWYLYTDDLGQRHKFMSDEILHFKGLTADGLA-GLSVIELLNHLIENGKSSETYLNNFFKNGLQVKGLVQYAGDL 221 (409) T ss_pred cccceEEEEEEeCCceeEEeccccEEEecCcCCCCcc-cccHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCC Confidence 0111222222344456889999999976443321 22333444444433322 23357899999999999988 Q ss_pred ChHHHHHHHHHHHHHHHHHhc--cCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC------ Q lcl|NC_010576. 233 KSTARAAQAARRKQEIENEMA--NNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG------ 303 (447) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~--~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g------ 303 (447) ++++.+ ++++.|.+.+. .|+++++|+++|++|++++.++.+++ ++.+++++++||++|||||++|+. T Consensus 222 ~~e~~~----~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~ 297 (409) T protein:vir:10 222 NPEAEE----VFKENFERMSSGLKNAHRIAMLPIGYKFEPISQKLVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRATH 297 (409) T ss_pred CHHHHH----HHHHHHHHHhccccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcc Confidence 776544 45555555544 37899999999999999999988865 688999999999999999999962 Q ss_pred CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhC Q lcl|NC_010576. 
304 TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTG 383 (447) Q Consensus 304 ~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g 383 (447) ++.|++.++|+++||.||+++||++||++||++.++..|++|+||++.|+++|.+++++++.+++++|+||+||+|+++| T Consensus 298 ~~~e~~~~~f~~~~l~P~~~~ie~~ln~kL~~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~lg 377 (409) T protein:vir:10 298 SNITEQNREFYIDTLQSILNMYELEINYKLFLISEIKNGFYSKFNVDTILRADIKTRYESYKEAIQNGFKTPNEIRELEE 377 (409) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCchhccCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhC Confidence 23489999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCccccc-cccccccchhhcccccCCCCCC Q lcl|NC_010576. 384 KAPHPNPLANE-LFNRNIADGNQVGGINTPGQIT 416 (447) Q Consensus 384 l~p~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 416 (447) +||+||+ |. +++.|+.+....++...+|++. T Consensus 378 l~p~~gg--D~~~~~~n~~~~~~~~~~~~kgGe~ 409 (409) T protein:vir:10 378 DEPLEGG--DVLLINGNMIPVKMAGEQYSKGGEK 409 (409) T ss_pred CCCCCCc--CeeeeccCccchhhccccccccCCC Confidence 9999884 55 6788888776655444444333 No 32 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=100.00 E-value=1.1e-81 Score=464.33 Aligned_cols=404 Identities=15% Similarity=0.078 Sum_probs=288.2 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) |-|..+ +++ .....++.+.. 
|...+++...+.....++...++++++|++||++||++||+|||++|| T Consensus 1 m~~~~~-------~~~----~~~~~~~~~~~-~~~~~~g~~~s~~~~~v~~~~al~~~~v~~cv~~ia~~ia~lp~~~~~ 68 (419) T protein:vir:80 1 MFFSRQ-------LLS----NLGQTQPGSGG-WVSALLGSARSEAGQVVTPASALSLTVLQNCVTLLAESIAQLPVELYE 68 (419) T ss_pred CCcccc-------ccc----ccCcCCCCcch-hhHHhhcccccccCcccChHHhhccHHHHHHHHHHHHhhccCceEEEE Confidence 544221 111 12222222233 333333333333334445677889999999999999999999999999 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) ++++|. +++.+|++++||+.+||++||+++||+.++.+++++||||+++.++..+.+..++++.+ ....+.....+.. T Consensus 69 ~~~~~~-~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~i~~-~~v~i~~~~~~~~ 146 (419) T protein:vir:80 69 RSGDDR-KPATDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDGVIQGLYPLDN-EAVTVMKGPDLKP 146 (419) T ss_pred ecCCCc-ccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecC-ceEEEEECCCceE Confidence 888774 55689999999999999999999999999999999999999999988776544444444 4333333222222 Q ss_pred EEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHH-----HHHHHhhcCcccceeeeCCcCChH Q lcl|NC_010576. 161 MVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMN-----SQDNRASSGKLNGFIQFPYSTKST 235 (447) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~n~~~~~gvl~~~~~~~~~ 235 (447) .+.+ . +. ..++.++|+|++.+..++.. +.+.+..+...+.... +...+.||++++|+|++++..... 
T Consensus 147 ~y~~---~-~~---~~~~~~~i~h~~~~~~d~~~-G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~ 218 (419) T protein:vir:80 147 MYRV---A-GA---DPLPQRLVHHVRWMSINGYT-GLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPTDAPAL 218 (419) T ss_pred EEEE---c-Cc---cccchhheEEecCCCCCCcc-cccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCCCCcc Confidence 2221 1 11 13677899999964333222 1223333333333322 223357899999999998877655 Q ss_pred HHHHHHHHHHHHHHHHhcc--CCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC----C--cH Q lcl|NC_010576. 236 ARAAQAARRKQEIENEMAN--NKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG----T--AN 306 (447) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~--n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g----~--~~ 306 (447) ..+++.+++++.|++.+++ |+|++++|++|++|+++++++.+++ ++.+++++++||++|||||++|+. + +. T Consensus 219 ~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~ 298 (419) T protein:vir:80 219 KDQASVDRITDGWNAKFGGSGNAKKVALLQEGMKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNI 298 (419) T ss_pred cCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccH Confidence 5567778888888887765 6799999999999999999998865 688999999999999999999963 2 34 Q ss_pred HHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Q lcl|NC_010576. 
307 EQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAP 386 (447) Q Consensus 307 e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p 386 (447) |++.+.|+++||.||+++||++|+++||++.++ .+++|+||++.|+++|.++|++++++++++|+||+||+|+++|+|| T Consensus 299 e~~~~~f~~~~l~P~~~~ie~~l~~kll~~~~~-~~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p 377 (419) T protein:vir:80 299 EHQSLQFVIYTLLPWVKRHEQAKTRDLLLPSER-KQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPP 377 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhccCcccc-CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Confidence 899999999999999999999999999998876 4799999999999999999999999999999999999999999999 Q ss_pred CCCccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCCCCcccccccCC Q lcl|NC_010576. 387 HPNPLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTSAIEN 438 (447) Q Consensus 387 ~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 438 (447) +|| ||. +++.++++..+.... +.+ +.++. ..+.... ...-+ T Consensus 378 ~~g--GD~~~~~~n~~~~~~~~~~--~~~--~~~~~---~~~~~~~--~~~l~ 419 (419) T protein:vir:80 378 VKG--GDIYLSPMNMVDASKPQPI--PMG--KTEPT---KAALDEI--GRILS 419 (419) T ss_pred CCC--cceeeeccccccccccccc--cCC--CCCch---hhhHHHH--HhhcC Confidence 988 455 567776654433211 111 11111 1111000 11111 No 33 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=100.00 E-value=1.4e-80 Score=458.40 Aligned_cols=394 Identities=12% Similarity=0.032 Sum_probs=275.3 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||+|+.+.+. +........ .+.. .+......+. 
.+....++++++|++||++||++||++||++|| T Consensus 1 Mgl~~~~f~~~~-----~~~~~~~~~-----~~~~-~~~~~~~~g~-~v~~~~al~~~~v~~~v~~ia~~iA~lp~~~~~ 68 (409) T protein:vir:84 1 MSLFTRIFSGPS-----EERTLTKIS-----GIPS-PAEDWAMHGD-RPGANSAMTLGAFYACVTLLADTVASLSIDAYR 68 (409) T ss_pred CchhhhhhcCCC-----ccccccccc-----cccc-ccchhhccCc-ccchhhhhccHHHHHHHHHHHHhhhhCceEEEE Confidence 999998644221 111111111 1111 1111112222 344567889999999999999999999999999 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) .++++ ++..|++++||+.+||++||+++||+.++.+++++||+|+++.+++.+.....+++++.....+........ T Consensus 69 ~~~~~---~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~p~~v~v~~~~~~~~ 145 (409) T protein:vir:84 69 KKDNV---RIPVSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMPIHPDCIHVTDAKDEDG 145 (409) T ss_pred ecCCc---ccccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEEEcCceeEEEEcCCCcc Confidence 76543 356899999999999999999999999999999999999988754444344445554444434433222221 Q ss_pred EEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCCcCChH Q lcl|NC_010576. 161 MVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPYSTKST 235 (447) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~~~~~ 235 (447) .. 
++......+ ..++.++|+|++.+.......+.+.+..+...+....+ ...+.||++++|+|++++.++++ T Consensus 146 ~~-~~~~~~~~g--~~~~~~dvih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e 222 (409) T protein:vir:84 146 DW-IEPVYRIDG--KVVPNHRIMHIKRYPVAGCALGMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSDADLTPD 222 (409) T ss_pred eE-EEEEecCCc--eEEchhhEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHH Confidence 11 111112222 24678999999864333221222333444444433322 23357899999999999988876 Q ss_pred HHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC--------CcH Q lcl|NC_010576. 236 ARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG--------TAN 306 (447) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g--------~~~ 306 (447) +.+ +++++|.+.. .|+++++||++|++|+++++++.+++ ++.+++++++||++|||||++||. ++. T Consensus 223 ~~~----~~~~~~~~~~-~n~g~~~vl~~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~ 297 (409) T protein:vir:84 223 QVK----QTQKQWIQSH-HNRRLPAVMSAGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGI 297 (409) T ss_pred HHH----HHHHHHHHHh-ccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchH Confidence 544 4555555443 46789999999999999999998865 688999999999999999999962 335 Q ss_pred HHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Q lcl|NC_010576. 307 EQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAP 386 (447) Q Consensus 307 e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p 386 (447) |++.++|+++||.||+++||++|+++|. 
.|++|+||++.|+++|.+++++++.+++++|+||+||+|+++|+|| T Consensus 298 e~~~~~f~~~~l~P~~~~ie~~l~~~L~------~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p 371 (409) T protein:vir:84 298 EEQGINFVRHTLLPWLRCIEQALDTFLP------RGQFVKFNVDGLMRGDVTARFTAYQMGLQNGIWSVNEVRAWEDAPP 371 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcc------CCCeEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Confidence 8999999999999999999999999983 4789999999999999999999999999999999999999999999 Q ss_pred CCCccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCCC Q lcl|NC_010576. 387 HPNPLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDPL 428 (447) Q Consensus 387 ~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (447) +|| ||. +++.++.+..+... .+++++++ +.++.+.+. T Consensus 372 ~~g--gD~~~~~~n~~~~~~~~~-~~~~~~~~--~~~~~~gn~ 409 (409) T protein:vir:84 372 IPE--GDIHLQPMNFVPLGYVPP-EEPAQEPQ--PNSATEGNK 409 (409) T ss_pred CCC--cceeeecccccccccCCc-cccCcCCC--CCCccCCCC Confidence 988 455 56777776554321 11111111 111111111 No 34 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=100.00 E-value=6.4e-81 Score=460.23 Aligned_cols=412 Identities=12% Similarity=0.078 Sum_probs=282.2 Q ss_pred CchhHh-----hhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCc Q lcl|NC_010576. 1 MASSDR-----LLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVD 75 (447) Q Consensus 1 Mg~~~~-----l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp 75 (447) .-|++| -|.++++|++.+.+... ........|...+.+....+... 
++...++++++|++||++||++||+|| T Consensus 11 ~~~~~~~~~~~~~~~~~~f~~~e~r~~~-~~~~~~~~~~~~~~~~~~~~~~~-~~~~~al~~~~V~acv~~Ia~~iA~lp 88 (441) T protein:vir:98 11 VDFKSRKQSRKELVVVGIFYKNEKRDLQ-YNEDDLQMMVQTLPGFQGTKLRQ-YKDIEAIRHSDIFTAVMMIASDLARMP 88 (441) T ss_pred eccccccchhhhhhcccccccccccccc-CCCcchHHHHHHhhcccccCccc-cchhhhhccHHHHHHHHHHHHhhccCc Confidence 111221 12233444443332211 11111222222222332333333 445568899999999999999999999 Q ss_pred eEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeee Q lcl|NC_010576. 76 FKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQF 155 (447) Q Consensus 76 ~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (447) |++|+. +. ...+|++++||+.+||++||+++||+.++.+++++||||+++.++..+.+..++ +.+.....+... T Consensus 89 l~~~~~---~~--~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~-~i~~~~v~v~~~ 162 (441) T protein:vir:98 89 IRVTVN---GQ--INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLT-FRKTSEIELKLD 162 (441) T ss_pred eEEecC---Cc--ccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEE-EEcCceeEEEEC Confidence 999863 22 346899999999999999999999999999999999999999998776654444 444444444433 Q ss_pred cCCceEEEEeee-cccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHH-----HHHhhcCcccceeeeC Q lcl|NC_010576. 156 FPRQVMVRVWND-NTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQ-----DNRASSGKLNGFIQFP 229 (447) Q Consensus 156 ~~~~~~~~~~~~-~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~n~~~~~gvl~~~ 229 (447) ..+.+.+.++.. ..+......+++++|+|++.+..++.. +.+.+..+...+....+. 
..+.||++++|||+++ T Consensus 163 ~~g~~~~~~~~~~~~~~~~~~~~~~~dviHir~~~~dg~~-G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~ 241 (441) T protein:vir:98 163 ARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYSLDGIN-GLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK 241 (441) T ss_pred CCCcEEEEEEEeccCcceeeEEEccccEEEeccCCCCCcc-ccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC Confidence 344444433322 122334456889999999975433322 223344444444443332 3357899999999999 Q ss_pred CcCChHHHHHHHHHHHHHHHHHhcc--CCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcCC-- Q lcl|NC_010576. 230 YSTKSTARAAQAARRKQEIENEMAN--NKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNGT-- 304 (447) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~--n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g~-- 304 (447) +.+.++ ++++++++.|.+.+.+ |+|+++||++|++|+++++++++++ ++.+++++++||++|||||++|+.+ T Consensus 242 ~~~~~~---e~~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~ 318 (441) T protein:vir:98 242 GVLDNK---KARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA 318 (441) T ss_pred CCCCCH---HHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCC Confidence 887643 3445566667666654 7899999999999999999998865 6889999999999999999999743 Q ss_pred --cHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh Q lcl|NC_010576. 305 --ANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELT 382 (447) Q Consensus 305 --~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~ 382 (447) +.+|+...|. +||+||+++||++||++|+++. 
.+++|+||++.|+++|.+++++++.+++++|+||+||+|+++ T Consensus 319 ~~s~~q~~~~y~-~tl~P~~~~ie~~ln~~L~~~~---~~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~~T~NE~R~~~ 394 (441) T protein:vir:98 319 NMSITDANLDYL-STLKPYITCVCAELNFKFNDEY---VNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRD 394 (441) T ss_pred CccHHHHHHHHH-HHHHHHHHHHHHHHHhhccccc---cCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh Confidence 2356655555 6999999999999999999753 478999999999999999999999999999999999999999 Q ss_pred CCCCCCCccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCCCCcc Q lcl|NC_010576. 383 GKAPHPNPLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNV 431 (447) Q Consensus 383 gl~p~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (447) ||||++||+.+. +++.++++....+... ..+...+.....+.+ +|| T Consensus 395 gl~pi~gGd~~~~~~~~n~~~~~~~~~~q-~~~~~~~~~~~kgGe--~ne 441 (441) T protein:vir:98 395 GLAPIPGGNGSIHRVDLNHVNIELVDEYQ-MNKSRATDKKLKGGE--ENE 441 (441) T ss_pred CCCCCCCCCcceEeecccccccccccccc-cccccccccccCCCC--CCC Confidence 999999987655 4677777655432211 111000011111111 111 No 35 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=100.00 E-value=3.5e-81 Score=461.66 Aligned_cols=366 Identities=17% Similarity=0.228 Sum_probs=279.8 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||+|+|+..++..+... .. ++ .++. 
.....++++++|++||++||++||+|||++|| T Consensus 1 M~~f~k~~~~~~~~~~~-----~~----~~------~~~~--------~~~~~~~~~~~v~~~v~~ia~~iA~lp~~~~~ 57 (378) T protein:vir:85 1 MNLFGKVVSFSRGKLNN-----DT----QR------VTAW--------QNEAVEYTSAFVTNIHNKIANEITKVEFNHVK 57 (378) T ss_pred Cchhhhhhhhhhccccc-----CC----cc------eeee--------eccchhhhhHHHHHHHHHHHHhHhhCceeEEE Confidence 99999987766432211 00 01 0110 11235678889999999999999999999999 Q ss_pred EcCCCce----eccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeec Q lcl|NC_010576. 81 IDPISGN----QTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFF 156 (447) Q Consensus 81 ~~~~~~~----~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (447) +++++.. .+..+|++++||+.+||++||+++||+.++.+++++||||+++..++. .+.+ T Consensus 58 ~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~-------------~g~~---- 120 (378) T protein:vir:85 58 YKKSDVGSDTLISMAGSDLDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDSE-------------TGEL---- 120 (378) T ss_pred EeccccccccccccccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecCC-------------CceE---- Confidence 8876553 245799999999999999999999999999999999999998654321 1111 Q ss_pred CCceEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHH Q lcl|NC_010576. 157 PRQVMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTA 236 (447) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~ 236 (447) .+.++.. ....+..++++|++++++.... ... +..++.. ...+.+++.++|+|+.++.++++. 
T Consensus 121 ----~~~~~~~-----~~~~~~~~dvih~~~~~~~~~~--~~~---~~~a~~~---~~~~~~~~~~~g~l~~~~~l~~~~ 183 (378) T protein:vir:85 121 ----LDLLFAN-----DKKEYKPEELVRLVSPFYINED--TSI---LDNALAS---IQTKLEQGKLRGLLKINAFLDIDN 183 (378) T ss_pred ----EEEEecC-----CCEEEcccceEEEecCcCccch--hhH---HHHHHHH---HHHHHhcCCcceEEEeCCcCCHHH Confidence 1111111 1134567899999987653221 112 2222222 223356678999999999999998 Q ss_pred HHHHHHHHHHHHHHHhcc-CCcceeecCCCceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCcHHHHHHHHHH Q lcl|NC_010576. 237 RAAQAARRKQEIENEMAN-NKYGVATLDTQEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNGTANEQQTLGYYN 315 (447) Q Consensus 237 ~~~~~~~~~~~~~~~~~~-n~~~~~vl~~g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~e~~~~~f~~ 315 (447) .++++++|++.|.+...+ ++++++||++|++|+++++++.+.+++.+++++++||++|||||++|+|+++|++.++|++ T Consensus 184 ~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l~~s~~e~~~~~f~~ 263 (378) T protein:vir:85 184 TQEYREKALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIELIKSELLTGYFMNENILLGTATQEQQIYFYN 263 (378) T ss_pred HHHHHHHHHHHHHHhhcccccccceecCCCceEEeccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCchHHHHHHHHH Confidence 888999999888776554 7889999999999999999999888888999999999999999999999999999999999 Q ss_pred HHHhHHHHHHHHHHHhhcCChhHhcCCc------eEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Q lcl|NC_010576. 
316 RCVDVLLQYVTDAISRIALTKTAVSQGQ------VLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPN 389 (447) Q Consensus 316 ~ti~P~~~~ie~~l~~kLl~~~e~~~g~------~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g 389 (447) +||.||+++||++|++|||++.|+..++ +++|+++.|+++|.++++++|.+++++|+||+||+|+++|+||++| T Consensus 264 ~tL~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~g 343 (378) T protein:vir:85 264 STIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEG 343 (378) T ss_pred HHHHHHHHHHHHHHHhhcCChhhhhhhhhccccceeeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Confidence 9999999999999999999998887765 3789999999999999999999999999999999999999999998 Q ss_pred ccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCCCCcc Q lcl|NC_010576. 390 PLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNV 431 (447) Q Consensus 390 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (447) | |. +++.|+++....+...... .....++..+|+ T Consensus 344 G--D~~~~~~N~~~~~~~~~~~~~~------~~~~~~~e~~n~ 378 (378) T protein:vir:85 344 G--DIYIANLNAVAVKNLSDLQGSR------KDVASTDETNNQ 378 (378) T ss_pred C--CeEeecccccccccchhhcCcc------CCCCCCCCCCCC Confidence 4 55 6788888765443321111 111112222333 No 36 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=100.00 E-value=4.3e-81 Score=461.20 Aligned_cols=366 Identities=16% Similarity=0.195 Sum_probs=280.0 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||+++..+...+.. ..... 
...| .....+++.++|++||++||++||+|||+||| T Consensus 1 Mg~f~~~~~f~~~~~~-----~~~~~---~~~~---------------~~~~~~~~~~~v~~~i~~Ia~~iA~lp~~~~~ 57 (378) T protein:vir:93 1 MNLFGKVVSFSRGKLN-----NDTQR---VTAW---------------QNEAVEYTSAFVTNIHNKIANEITKVEFNHVK 57 (378) T ss_pred CccchhhhhhhccccC-----CCcce---eeec---------------ccchhHHHHHHHHHHHHHHHhhhhhCceeeEE Confidence 9999987653322111 01100 0001 11224568889999999999999999999999 Q ss_pred EcCCCcee----ccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeec Q lcl|NC_010576. 81 IDPISGNQ----TPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFF 156 (447) Q Consensus 81 ~~~~~~~~----~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (447) .+++++.. ...+|++++||+.+||++||+++||+.++.+++++||||+++.+++... +++..+ T Consensus 58 ~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~~g-------------~~~~l~ 124 (378) T protein:vir:93 58 YKKSDVGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTG-------------ELLDLL 124 (378) T ss_pred EcccccccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCc-------------eEEEEE Confidence 87766432 3468999999999999999999999999999999999999987654321 111111 Q ss_pred CCceEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHH Q lcl|NC_010576. 157 PRQVMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTA 236 (447) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~ 236 (447) +.. ....++.++|+|+++|++... ..+++..+... 
...+..++.++|+|++++.+++++ T Consensus 125 ~~~-------------~~~~~~~~diih~r~~~~~~~--~~s~l~~~~~~------i~~~~~~~~~~g~l~~~~~l~~~~ 183 (378) T protein:vir:93 125 FAD-------------DKKEYKTEELVRLTSPFYINE--DTSILDNALAS------IQTKLEQGKLRGLLKINAFLDIDN 183 (378) T ss_pred ecC-------------CeeEeccceeEEecCccccch--hhHHHHHHHHH------HHHHHhcCcccceeeeCCcCCHHH Confidence 111 123567899999998865432 22233322222 223456678999999999999999 Q ss_pred HHHHHHHHHHHHHHHhcc-CCcceeecCCCceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCcHHHHHHHHHH Q lcl|NC_010576. 237 RAAQAARRKQEIENEMAN-NKYGVATLDTQEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNGTANEQQTLGYYN 315 (447) Q Consensus 237 ~~~~~~~~~~~~~~~~~~-n~~~~~vl~~g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~e~~~~~f~~ 315 (447) .++++++|++.|++...+ +++++++|++|++|+++++++.+++++++++++++||++|||||++|+|+++|++..+|++ T Consensus 184 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l~g~~~e~~~~~f~~ 263 (378) T protein:vir:93 184 TQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENILLGTATQEQQIYFYN 263 (378) T ss_pred HHHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCcHHHHHHHHHH Confidence 999999999999876554 6789999999999999999999988899999999999999999999999999999999999 Q ss_pred HHHhHHHHHHHHHHHhhcCChhHhcCCc------eEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Q lcl|NC_010576. 
316 RCVDVLLQYVTDAISRIALTKTAVSQGQ------VLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPN 389 (447) Q Consensus 316 ~ti~P~~~~ie~~l~~kLl~~~e~~~g~------~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g 389 (447) +||.||+++||++|+++||++.|+..|+ +++||++.|+++|+++|++++.+++++|+||+||+|+++|+||++| T Consensus 264 ~tl~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g 343 (378) T protein:vir:93 264 STIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEG 343 (378) T ss_pred HHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Confidence 9999999999999999999998887665 4889999999999999999999999999999999999999999998 Q ss_pred ccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCCCCcc Q lcl|NC_010576. 390 PLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNV 431 (447) Q Consensus 390 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (447) | |. +++.|+.+....+.... ...++. .++..+|+ T Consensus 344 g--D~~~~~~n~~~~~~~~~~~~--~~~~~~----~~~e~~n~ 378 (378) T protein:vir:93 344 G--DVYIANLNAVAVKNLSDLQG--SRKDVT----STDETNNQ 378 (378) T ss_pred C--CeeeeccccccccchhhhcC--ccCCCC----CCCCCCCC Confidence 4 55 66778877654432211 001111 11112222 No 37 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=100.00 E-value=1e-80 Score=459.13 Aligned_cols=397 Identities=12% Similarity=0.084 Sum_probs=281.0 Q ss_pred CchhHhhhhhcccccCCccccccccccccc--cccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTP--SNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKH 78 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~ 78 (447) |++|.++.. .....|... 
..+..+ ...+.++ ...++++++||+||++||++||+|||++ T Consensus 1 m~~~~~~~~------------~~~~~~~~~~~~~~~~~------~~~g~~~-~~~Al~~~~V~~cv~~ia~~iA~lp~~~ 61 (417) T protein:vir:38 1 MKLFRGLAT------------EVDPHWADHLLDSGVIP------SFRGGYL-GISALRNSDVLTAVSIVSGDVSRFPLVI 61 (417) T ss_pred Ccccccccc------------CCCccchhhhccccccc------ccCCcee-chhhcccHHHHHHHHHHHHhhccCeeEE Confidence 888743111 011111110 111111 1112223 2457899999999999999999999999 Q ss_pred EEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCC Q lcl|NC_010576. 79 LKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPR 158 (447) Q Consensus 79 ~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (447) ||.+.++. ...|++++||+.+|||+||+++||+.++.+++++||||+++.++..+..+..+++.+.....+.....+ T Consensus 62 ~~~~~~~~---~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~l~~l~p~~v~v~~~~~~ 138 (417) T protein:vir:38 62 TDSSTDEV---IDLANIEYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEPAMFEFYAPSQTQVDTSDPD 138 (417) T ss_pred EEcCCcce---eccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEEeCCceEEEEEcCCC Confidence 98766543 357899999999999999999999999999999999999999988776666777776665555444444 Q ss_pred ceEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCCcCC Q lcl|NC_010576. 159 QVMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPYSTK 233 (447) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~~~ 233 (447) ...+++.. ........++.++|+|++.+..++.. 
+.+.+..+...+....+ ...+.||+.++||++.++.++ T Consensus 139 ~~~y~~~~--~~~~~~~~~~~~dviH~r~~~~d~~~-G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~l~ 215 (417) T protein:vir:38 139 NIIYRFTP--YNSSMQKVCGFEDVIHWKFFSYDTIM-GRSPLLSLGDEIGLQESGVSTLQKFFKSGLKGSIIKAKESRLS 215 (417) T ss_pred eEEEEEEE--cCCcEEEEecCcceEEecCCCCCCcc-ccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCC Confidence 44443322 22233446788999999975444322 22333444444433322 233578999999999999988 Q ss_pred hHHHHHHHHHHHHHHHHHhcc-CCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcCC----cHH Q lcl|NC_010576. 234 STARAAQAARRKQEIENEMAN-NKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNGT----ANE 307 (447) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~-n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g~----~~e 307 (447) +++.++ +++.|.+.+++ |+|+++||++|++|+++++++++++ ++.+++++++||++|||||++|+++ +.| T Consensus 216 ~e~~~~----~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~s~~e 291 (417) T protein:vir:38 216 AEARQK----IREDFERAQAGADAGSPIIVDATMDYQPLEVDTNVLNLINSNNYSTAQIAKALRVPAYRLAQNSPNQSVK 291 (417) T ss_pred HHHHHH----HHHHHHHHhcccccCCceeccCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHhCCCCcchhHH Confidence 765544 44555544443 7899999999999999999998865 6889999999999999999999753 248 Q ss_pred HHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Q lcl|NC_010576. 308 QQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPH 387 (447) Q Consensus 308 ~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~ 387 (447) ++.++|+++||.||+++||++|+++||++.++. +++|+||++.+++.+. 
+.+++++++|+||+||+|+++|+||+ T Consensus 292 ~~~~~~~~~tl~P~~~~ie~~l~~~Ll~~~~~~-~~~~~fd~~~l~~~~~----~~~~~~~~~G~~T~NE~R~~~gl~pi 366 (417) T protein:vir:38 292 QLADDYIRNDLPFYFEPITSEFELKLLDDAQRH-QYCIGFDTKSVNGLPI----ADVNTAVNGGLWTGNEGRAELGKKPL 366 (417) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcChhhcc-cceEEechhhhhHHHH----HHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 999999999999999999999999999998864 6899999998876543 34678899999999999999999999 Q ss_pred CCccccc-cccccccchhhcccccC------CCCCCCCCCCcCCCCCCCcccccccCCc Q lcl|NC_010576. 388 PNPLANE-LFNRNIADGNQVGGINT------PGQITSDQPATASTDPLNNVSTSAIENG 439 (447) Q Consensus 388 ~g~~~~~-~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 439 (447) +||++|. +++.|+++....+.... +|++++.+..+.+++ ..++. T Consensus 367 ~~g~~d~~~~~~n~~~~d~~~~~~~~~~~~~kgg~~~~~~~~~~~~--------~~~~~ 417 (417) T protein:vir:38 367 KDPNMDRIQSTLNTVFLDQKEAYQAEHAAELKGGDTNAKGNQNGSG--------TNANS 417 (417) T ss_pred CCCCCCeeeecccccccccccccccccccccCCCCCCCCCCCcCCC--------CcCCC Confidence 9998876 56778777665433221 111111111111111 11111 No 38 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=100.00 E-value=8.1e-81 Score=459.68 Aligned_cols=366 Identities=17% Similarity=0.236 Sum_probs=280.8 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||+|+|++.++.++..... 
++ ....+ ....+++.++|++||++||++||+|||++|| T Consensus 1 M~if~~~~~~~~~~~~~~~---------~~---~~~~~-----------~~~~~~~~~~v~~~v~~Ia~~iA~lp~~~~~ 57 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNNDT---------QR---VTAWQ-----------NEAVEYTSAFVTNIHNKIANEITKVEFNHVK 57 (378) T ss_pred CchhHHhHhhhhcccccCc---------ce---eeeee-----------cchhhhhhHHHHHHHHHHHHhHhhCceeeee Confidence 9999998887755433210 01 10011 1234567789999999999999999999999 Q ss_pred EcCCCce----eccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeec Q lcl|NC_010576. 81 IDPISGN----QTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFF 156 (447) Q Consensus 81 ~~~~~~~----~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (447) .++.++. ....+|++++||+.+||++||+++||+.++.+++++||||+++..++.. +.++ T Consensus 58 ~~~~~~~~~~~~~~~~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~-------------g~~~--- 121 (378) T protein:vir:94 58 YKKSDVGSDTLISMAGSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSET-------------GELL--- 121 (378) T ss_pred ecccccccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCCC-------------CcEE--- Confidence 8776543 3457899999999999999999999999999999999999986543221 1111 Q ss_pred CCceEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHH Q lcl|NC_010576. 157 PRQVMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTA 236 (447) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~ 236 (447) ..++... ...++.++|+|+++|.+.. .....+..+... 
...+..++.++|+|+.+..+++++ T Consensus 122 -----~~~~~~~-----~~~~~~~dvih~~~~~~~~--~~~~~~~~~~~~------~~~~~~~~~~~g~l~~~~~l~~~~ 183 (378) T protein:vir:94 122 -----DLLFAND-----KKEYKPEELVRLTSPFYIN--EDTSILDNALAS------IQTKLEQGKLRGLLKINAFLDIDN 183 (378) T ss_pred -----EEEEecC-----cEEechhceeeecCcCCcc--cchhHHHHHHHH------HHHHHhhCCcccceeeCCcCCHHH Confidence 1111111 1346789999999876432 122222322222 223345678899999999999999 Q ss_pred HHHHHHHHHHHHHHHhcc-CCcceeecCCCceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCcHHHHHHHHHH Q lcl|NC_010576. 237 RAAQAARRKQEIENEMAN-NKYGVATLDTQEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNGTANEQQTLGYYN 315 (447) Q Consensus 237 ~~~~~~~~~~~~~~~~~~-n~~~~~vl~~g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~e~~~~~f~~ 315 (447) .++++++|++.|++...+ ++++++||++|++|+++++++.+.+++++++++++||++|||||++|+|+++|++.++|++ T Consensus 184 ~~~~~e~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgvPp~~l~g~~~e~~~~~f~~ 263 (378) T protein:vir:94 184 TQEYREKALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENILLGTATQEQQIYFYN 263 (378) T ss_pred HHHHHHHHHHHHHHhhcccccccceeccCCceEEEccCChHHhhHHHHHHHHHHHHHHhCCCHHHhcCCchHHHHHHHHH Confidence 899999999999876654 7788999999999999999999988999999999999999999999999999999999999 Q ss_pred HHHhHHHHHHHHHHHhhcCChhHhcCCc------eEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Q lcl|NC_010576. 
316 RCVDVLLQYVTDAISRIALTKTAVSQGQ------VLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPN 389 (447) Q Consensus 316 ~ti~P~~~~ie~~l~~kLl~~~e~~~g~------~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g 389 (447) +||.||+++||++|+++||++.++..|+ +++|+++.|+++|.+++++++.+++++|+||+||+|+++|+||++| T Consensus 264 ~tl~P~~~~ie~~l~~~Ll~~~e~~~g~~~~~~~~~~f~~~~l~~~d~~~~~e~~~~~~~~G~~t~NE~R~~~g~~p~~g 343 (378) T protein:vir:94 264 STIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEG 343 (378) T ss_pred HHHHHHHHHHHHHHHhhcCChhHhhhhhhhcccceeEeecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Confidence 9999999999999999999998877664 4789999999999999999999999999999999999999999998 Q ss_pred ccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCCCCcc Q lcl|NC_010576. 390 PLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNV 431 (447) Q Consensus 390 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (447) + |. +++.|++++...+..... .++ ...++..+|| T Consensus 344 g--d~~~~~~n~~~~~~~~~~~~~---~~~---~~~~~e~~n~ 378 (378) T protein:vir:94 344 G--DVYIANLNAVAVKNLSDLQGN---RKD---VTSTDETNNQ 378 (378) T ss_pred C--Ceeeecccccchhcchhcccc---cCC---CCCCCCCCCC Confidence 4 55 678888776544332111 111 1112222333 No 39 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=100.00 E-value=1e-80 Score=459.14 Aligned_cols=366 Identities=16% Similarity=0.209 Sum_probs=280.7 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||+++..+...+... .... 
...| .....++++++|++||++||++||+|||++|| T Consensus 1 Mg~f~~~~~~~~~~~~~-----~~~~---~~~~---------------~~~~~~~~~~~v~~~i~~Ia~~iA~l~~~~~~ 57 (378) T protein:vir:16 1 MNLFGKVVSFSRGKLNN-----DTQR---VTAW---------------QNEAVEYTSAFVTNIHNKIANEITKVEFNHVK 57 (378) T ss_pred CccchhhhhhhcccccC-----Ccce---eeec---------------ccchhhHHHHHHHHHHHHHHhhhhhCceeEEE Confidence 99999876543332111 0000 0001 11224568889999999999999999999999 Q ss_pred EcCCCcee----ccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeec Q lcl|NC_010576. 81 IDPISGNQ----TPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFF 156 (447) Q Consensus 81 ~~~~~~~~----~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (447) .+++|+.. ...+|++++||+.+||++||+++||+.++.+++++||||+++.+++... +++..+ T Consensus 58 ~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~g-------------~~~~l~ 124 (378) T protein:vir:16 58 YKKSDVGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTG-------------ELLDLL 124 (378) T ss_pred EcccccccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCc-------------eEEEEE Confidence 87765432 3468999999999999999999999999999999999999988764321 111111 Q ss_pred CCceEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHH Q lcl|NC_010576. 157 PRQVMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTA 236 (447) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~ 236 (447) +... ...++.+||||+|+|++... 
..+.+..+...+ ..+..++.++|+|+.+..+++++ T Consensus 125 ~~~~-------------~~~~~~~diih~r~~~~~~~--~~s~l~~~~~~i------~~~~~~~~~~g~l~~~~~l~~~~ 183 (378) T protein:vir:16 125 FADD-------------KKEYKPEELVRLTSPFYINE--DTSILDNALASI------QTKLEQGKLRGLLKINAFLDIDN 183 (378) T ss_pred ecCC-------------eeEecccceEEecCccCccc--hhHHHHHHHHHH------HHHHhcCccceeeEeCCcCCHHH Confidence 1111 12456799999998865432 222333222222 23455778999999999999999 Q ss_pred HHHHHHHHHHHHHHHhcc-CCcceeecCCCceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCcHHHHHHHHHH Q lcl|NC_010576. 237 RAAQAARRKQEIENEMAN-NKYGVATLDTQEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNGTANEQQTLGYYN 315 (447) Q Consensus 237 ~~~~~~~~~~~~~~~~~~-n~~~~~vl~~g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~e~~~~~f~~ 315 (447) .++++++|++.|++...+ ++|+++||++|++|+++++++++++++++++++++||++|||||++|+|+++|++.++|++ T Consensus 184 ~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l~g~~~e~~~~~f~~ 263 (378) T protein:vir:16 184 TQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENILLGTASQEQQIYFYN 263 (378) T ss_pred HHHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCchHHHHHHHHH Confidence 999999999999876554 7789999999999999999999988888999999999999999999999999999999999 Q ss_pred HHHhHHHHHHHHHHHhhcCChhHhcCCc------eEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Q lcl|NC_010576. 
316 RCVDVLLQYVTDAISRIALTKTAVSQGQ------VLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPN 389 (447) Q Consensus 316 ~ti~P~~~~ie~~l~~kLl~~~e~~~g~------~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g 389 (447) +||.||+++||++|++|||++.++..++ .++|+++.++++|++++++++.+++++|+||+||+|+++|+||+|| T Consensus 264 ~tl~P~~~~ie~~l~~kLl~~~e~~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~g 343 (378) T protein:vir:16 264 STIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEG 343 (378) T ss_pred HHHHHHHHHHHHHHHhhcCChhhhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Confidence 9999999999999999999998876664 4889999999999999999999999999999999999999999988 Q ss_pred ccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCCCCcc Q lcl|NC_010576. 390 PLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNV 431 (447) Q Consensus 390 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (447) ||. +++.|++++...++.. +...++ ..++..+|| T Consensus 344 --gD~~~~~~n~~~~~~~~~~~--~~~~~~----~~~~e~~ne 378 (378) T protein:vir:16 344 --GDVYIANLNAVAVKNLSDLQ--GSRKDV----TSTDETNNQ 378 (378) T ss_pred --CCeEeeccccccccchhhhc--CccCCC----CCCCCCCCC Confidence 455 6788888765543321 111111 111222333 No 40 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=100.00 E-value=3.1e-80 Score=456.44 Aligned_cols=391 Identities=13% Similarity=0.110 Sum_probs=279.8 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) |.+++|+++.+ +.+-.. .+.+.......|.. .+... 
++...++++++|++||++||++||+|||++|| T Consensus 4 ~~~~~~~k~~~--~~~~~~--~~~~~~~~~~~~~~-------~~~~~-v~~~~a~~~~~V~~ci~~ia~~ia~lp~~~~~ 71 (409) T protein:vir:96 4 ENIVTRIKKKL--IDNWID--QSASKLYDFSPWKN-------KSFWG-VINNTLETNETIFSAITKLSNSMASLPLKMYE 71 (409) T ss_pred ccchhhhhhHH--hhhhhc--cccccccccccccC-------ccccc-cchhhHhhhHHHHHHHHHHHHhhhhCceEEee Confidence 88999877632 111111 11111111111211 11111 23456889999999999999999999999998 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeec-CCc Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFF-PRQ 159 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 159 (447) +++ ..+|+++++|+.+||++||+++||+.++.+++++||||+++.++..+.+..++++.+ ....+.... ... T Consensus 72 ~~~------~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~-~~v~v~~~~~~~~ 144 (409) T protein:vir:96 72 DYK------VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNP-DVVEMLIENQSRE 144 (409) T ss_pred ccc------ccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcC-ceeEEEEeCCCcE Confidence 543 357999999999999999999999999999999999999999988776555454444 433333222 222 Q ss_pred eEEEEeeecccccceeeecccccccccccc-cccccchhHHHHHHHHHHHHHHHH--HHHhhcCcc-cceeeeCCcCChH Q lcl|NC_010576. 160 VMVRVWNDNTGLEQDLLVSKENCIIIESPF-YAILNDTNQTLRMLEQKIKLMNSQ--DNRASSGKL-NGFIQFPYSTKST 235 (447) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~--~~~~n~~~~-~gvl~~~~~~~~~ 235 (447) +.+. .....+..+.++.++|+|++.+. .+... +.+.+..+...+....+. 
..+.+++.+ +++++.+..++++ T Consensus 145 ~~y~---~~~~~g~~~~~~~~evih~r~~~~~~~~~-G~s~l~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~e 220 (409) T protein:vir:96 145 LYYS---IHAATGNKLIVHNMDMLHFKHIVASNMVQ-GISPIDVLKNTTDFDNAVRTFNLTEMQKPDSFMLKYGSNVSTE 220 (409) T ss_pred EEEE---EEcCCceEEEEccccEEEeCCCCCCCccc-cccHHHHHHHHHHHHHHHHHHHHHhcCCCceeEEecCCCCCHH Confidence 2222 22333455678899999998642 22211 123344444444433222 223444444 4577888888876 Q ss_pred HHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcCC------cHHH Q lcl|NC_010576. 236 ARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNGT------ANEQ 308 (447) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g~------~~e~ 308 (447) +.++ ++++|.+.+. |++++++|++|++|+++++++.+++ ++.+++++++||++|||||++|++. +.|+ T Consensus 221 ~~~~----~~~~~~~~~~-n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e~ 295 (409) T protein:vir:96 221 KRQQ----VLEDFKQYYE-ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEE 295 (409) T ss_pred HHHH----HHHHHHHHhh-cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHH Confidence 5544 4555544443 5678999999999999999998865 6889999999999999999999742 3489 Q ss_pred HHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Q lcl|NC_010576. 
309 QTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHP 388 (447) Q Consensus 309 ~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ 388 (447) +.+.|+++||.||+++||++|+++||++.++..|++|+||++.|+++|.++|++++++++++|+||+||+|+++|+||+| T Consensus 296 ~~~~f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~NE~R~~~g~~pi~ 375 (409) T protein:vir:96 296 LNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVE 375 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred Cccccc-cccccccchhhccc--ccCCCCCCCCCCC Q lcl|NC_010576. 389 NPLANE-LFNRNIADGNQVGG--INTPGQITSDQPA 421 (447) Q Consensus 389 g~~~~~-~~~~~~~~~~~~~~--~~~~~~~~~~~~~ 421 (447) | ||. +++.|+++...... ...+|++.++++. T Consensus 376 g--gD~~~~~~n~~~~~~~~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:96 376 G--GDKPLISGDLYPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred C--cceeeecccccccccchhhcccccCCCCCcCCC Confidence 7 566 66788877643221 1122222221111 No 41 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=100.00 E-value=2.5e-80 Score=456.98 Aligned_cols=402 Identities=11% Similarity=0.048 Sum_probs=279.4 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||+| +++++. .........+.....+........ 
++...++++++||+||++||++||++||++|+ T Consensus 1 Mg~f~~-------~~~r~~----~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~ 68 (416) T protein:vir:45 1 MGIFYK-------NEKRDL----QYNEDDLQMMVQTLPGFQGTKLRQ-YKDIEAIRHSDIFTAVMMIASDLARMPIRVTV 68 (416) T ss_pred CCcccc-------cccccc----cCCCcchhHHHHHhccccccCccc-cchhhhhcchHHHHHHHHHHHhhccCceEEec Confidence 999865 111111 111111111222222322233333 34467889999999999999999999999986 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) . +. ...+|++++||+.+||++||+++||+.++.+++++||||+++.++..+.+..++ +.+.....+.....+.+ T Consensus 69 ~---~~--~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~-~i~~~~v~v~~~~~g~~ 142 (416) T protein:vir:45 69 N---GQ--INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLT-FRKTSEIELKSDARGRL 142 (416) T ss_pred C---cc--ccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEE-EEcCceeEEEECCCccE Confidence 3 22 346899999999999999999999999999999999999999998776654444 44444444444333444 Q ss_pred EEEEeee-cccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHH-----HHHhhcCcccceeeeCCcCCh Q lcl|NC_010576. 161 MVRVWND-NTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQ-----DNRASSGKLNGFIQFPYSTKS 234 (447) Q Consensus 161 ~~~~~~~-~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~n~~~~~gvl~~~~~~~~ 234 (447) .+.+... ..+......++.++|||++....+... +.+.+..+...+....+. 
..+.||++++|||++++.+.+ T Consensus 143 ~~~~~~~~~~~~~~~~~~~~~evihir~~~~d~~~-G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~ 221 (416) T protein:vir:45 143 YYFHQRIDSNGNNIERNVKFEDMLDIKFYSLDGIN-GLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDN 221 (416) T ss_pred EEEEEEecCCCceeEEEEccccEEEeccCCCCCcc-ccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCC Confidence 4333221 223334456889999999964433221 223344444444443322 235789999999999988765 Q ss_pred HHHHHHHHHHHHHHHHHhcc--CCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcCC----cHH Q lcl|NC_010576. 235 TARAAQAARRKQEIENEMAN--NKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNGT----ANE 307 (447) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~--n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g~----~~e 307 (447) + ++++++++.|.+.+.+ |+|+++||++|++|+++++++++++ ++.+++.+++||++|||||++|+.. +.+ T Consensus 222 ~---~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~ 298 (416) T protein:vir:45 222 K---KARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSIT 298 (416) T ss_pred H---HHHHHHHHHHHHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccHH Confidence 3 3445556666666554 7899999999999999999998865 6889999999999999999999732 234 Q ss_pred HHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Q lcl|NC_010576. 308 QQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPH 387 (447) Q Consensus 308 ~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~ 387 (447) ++ ..+|.+||.||+++||++||++|+++. 
.+++|+||++.|+++|.+++++++.+++++|+||+||+|+++|+||+ T Consensus 299 ~~-~~~~~~~l~P~~~~ie~~ln~~l~~~~---~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~ 374 (416) T protein:vir:45 299 DA-NLDYLSTLKPYITCVCAELNFKFNDEY---VNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPI 374 (416) T ss_pred HH-HHHHHHHHHHHHHHHHHHHhhhccccc---cCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 54 445567999999999999999998753 47899999999999999999999999999999999999999999999 Q ss_pred CCccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCCCCcc Q lcl|NC_010576. 388 PNPLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNV 431 (447) Q Consensus 388 ~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (447) +||+.+. +++.+++++...... ...+......+..+. .+|| T Consensus 375 ~~gd~~~~~~~~n~~~~~~~~~~-~~~~~~~~~~~~kgG--e~n~ 416 (416) T protein:vir:45 375 PGGNGSIHRVDLNHVNIELVDEY-QMNKSRATDKKLKGG--EENE 416 (416) T ss_pred CCCCcceEeeccccccccccccc-CcccccccccccCCC--CCCC Confidence 9997765 467777665543211 111111111111111 1222 No 42 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=100.00 E-value=2.5e-80 Score=456.98 Aligned_cols=402 Identities=11% Similarity=0.048 Sum_probs=279.4 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||+| +++++. .........+.....+........ 
++...++++++||+||++||++||++||++|+ T Consensus 1 Mg~f~~-------~~~r~~----~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~ 68 (416) T protein:vir:81 1 MGIFYK-------NEKRDL----QYNEDDLQMMVQTLPGFQGTKLRQ-YKDIEAIRHSDIFTAVMMIASDLARMPIRVTV 68 (416) T ss_pred CCcccc-------cccccc----cCCCcchhHHHHHhccccccCccc-cchhhhhcchHHHHHHHHHHHhhccCceEEec Confidence 999865 111111 111111111222222322233333 34467889999999999999999999999986 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) . +. ...+|++++||+.+||++||+++||+.++.+++++||||+++.++..+.+..++ +.+.....+.....+.+ T Consensus 69 ~---~~--~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~-~i~~~~v~v~~~~~g~~ 142 (416) T protein:vir:81 69 N---GQ--INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLT-FRKTSEIELKSDARGRL 142 (416) T ss_pred C---cc--ccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEE-EEcCceeEEEECCCccE Confidence 3 22 346899999999999999999999999999999999999999998776654444 44444444444333444 Q ss_pred EEEEeee-cccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHH-----HHHhhcCcccceeeeCCcCCh Q lcl|NC_010576. 161 MVRVWND-NTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQ-----DNRASSGKLNGFIQFPYSTKS 234 (447) Q Consensus 161 ~~~~~~~-~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~n~~~~~gvl~~~~~~~~ 234 (447) .+.+... ..+......++.++|||++....+... +.+.+..+...+....+. 
..+.||++++|||++++.+.+ T Consensus 143 ~~~~~~~~~~~~~~~~~~~~~evihir~~~~d~~~-G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~ 221 (416) T protein:vir:81 143 YYFHQRIDSNGNNIERNVKFEDMLDIKFYSLDGIN-GLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDN 221 (416) T ss_pred EEEEEEecCCCceeEEEEccccEEEeccCCCCCcc-ccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCC Confidence 4333221 223334456889999999964433221 223344444444443322 235789999999999988765 Q ss_pred HHHHHHHHHHHHHHHHHhcc--CCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcCC----cHH Q lcl|NC_010576. 235 TARAAQAARRKQEIENEMAN--NKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNGT----ANE 307 (447) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~--n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g~----~~e 307 (447) + ++++++++.|.+.+.+ |+|+++||++|++|+++++++++++ ++.+++.+++||++|||||++|+.. +.+ T Consensus 222 ~---~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~ 298 (416) T protein:vir:81 222 K---KARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSIT 298 (416) T ss_pred H---HHHHHHHHHHHHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccHH Confidence 3 3445556666666554 7899999999999999999998865 6889999999999999999999732 234 Q ss_pred HHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Q lcl|NC_010576. 308 QQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPH 387 (447) Q Consensus 308 ~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~ 387 (447) ++ ..+|.+||.||+++||++||++|+++. 
.+++|+||++.|+++|.+++++++.+++++|+||+||+|+++|+||+ T Consensus 299 ~~-~~~~~~~l~P~~~~ie~~ln~~l~~~~---~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~ 374 (416) T protein:vir:81 299 DA-NLDYLSTLKPYITCVCAELNFKFNDEY---VNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPI 374 (416) T ss_pred HH-HHHHHHHHHHHHHHHHHHHhhhccccc---cCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 54 445567999999999999999998753 47899999999999999999999999999999999999999999999 Q ss_pred CCccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCCCCcc Q lcl|NC_010576. 388 PNPLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNV 431 (447) Q Consensus 388 ~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (447) +||+.+. +++.+++++...... ...+......+..+. .+|| T Consensus 375 ~~gd~~~~~~~~n~~~~~~~~~~-~~~~~~~~~~~~kgG--e~n~ 416 (416) T protein:vir:81 375 PGGNGSIHRVDLNHVNIELVDEY-QMNKSRATDKKLKGG--EENE 416 (416) T ss_pred CCCCcceEeeccccccccccccc-CcccccccccccCCC--CCCC Confidence 9997765 467777665543211 111111111111111 1222 No 43 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=100.00 E-value=2.3e-80 Score=457.17 Aligned_cols=388 Identities=11% Similarity=0.070 Sum_probs=272.0 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) |||+||+..+++- .... .. .....+|.. 
+ .......++++++|++||++||++||++||++++ T Consensus 1 Mg~~~~~~~~~~~---~~~~---~~-~~~~~~~~~--~--------~~~~~~~~l~~~~v~~~v~~Ia~~ia~~p~~~~~ 63 (395) T protein:vir:40 1 MGFKSWVSGFFNE---EQRT---LN-LTDTVWCSI--P--------SEKLKELSIKKWAIDSCANKIANTLSCAEVLTYE 63 (395) T ss_pred CchHHHHHhhhcc---cccc---cc-cccchhhcc--c--------cccchhhhhhhHHHHHHHHHHHHHHhhCceeecc Confidence 9999998876532 1111 11 111111211 0 1123456889999999999999999999999987 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) ++ +...|+++++|+.+||++||+++||+.++.+++|+||||+++.++.......+ .... ...++... T Consensus 64 ~~------~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~~~~~~~~~~~-~~~~------~~~~~~~~ 130 (395) T protein:vir:40 64 KG------EEVRKKNWYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFMQDEYIYVADSF-TKND------KSLYENTY 130 (395) T ss_pred CC------ccccchHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEecCceeecCCc-cccc------ccccccee Confidence 42 13579999999999999999999999999999999999999887654332211 1110 01111111 Q ss_pred EEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHH Q lcl|NC_010576. 161 MVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQ 240 (447) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~ 240 (447) ..... + +......+++++|+|++.+...+...+..........+....+...+.++.++.++++.+..+++++.++. 
T Consensus 131 ~~v~~-~--~~~~~~~~~~~evih~r~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~ 207 (395) T protein:vir:40 131 TEVTL-K--DLTLKKEFKESEVLHLTLNNESIKSIIDGFYLLYGDLLTAAVNKYKKLNSRKIIVKLKAMFGQTPEAEEKL 207 (395) T ss_pred eeeee-c--CceeeeeeccccEEEeecCCCCccccchhHHHHHHHHHHHHHHHHHhcCCCCceEEEecccCCCHHHHHHH Confidence 10000 1 11112357889999998654444444433333333333333333333344444444555666776666666 Q ss_pred HHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhh-HHHHHHH---HHHHHHHhCCCHHHhcCCcH--HHHHHHHH Q lcl|NC_010576. 241 AARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQL---QQDFYNQMGITEAILNGTAN--EQQTLGYY 314 (447) Q Consensus 241 ~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~---~~~Ia~~fgVP~~~l~g~~~--e~~~~~f~ 314 (447) ++.|.+.+.. ..+++++++|+++|++|+++++++.+++ ++.+++. .++||++|||||++|+|+++ |++.++|+ T Consensus 208 ~~~~~~~~~~-~~~~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~Ia~~fgVPp~~l~~~~sn~e~~~~~f~ 286 (395) T protein:vir:40 208 RLMLSERMKK-FLAEGDSALPVEDGMEIDELAGDSKIAESRDIKKMIDDVFEMVANSFNIPLGLAKGDTVGLSEQVNSFL 286 (395) T ss_pred HHHHHHHHHH-hhccCCceeecCCCceEEeccCChhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCcCHHHHHHHHH Confidence 6555555432 2357889999999999999999998865 5667776 47999999999999998765 89999999 Q ss_pred HHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccc Q lcl|NC_010576. 315 NRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANE 394 (447) Q Consensus 315 ~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~ 394 (447) ++||.||+++||++|++|||++.++..|++|+||++.++++|.+++++++.+++++|+||+||+|+++|+||++||++|. 
T Consensus 287 ~~~L~P~~~~ie~~l~~kLl~~~~~~~g~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~~~~gD~ 366 (395) T protein:vir:40 287 MFSINPIAEMFTDEGNRKFYGRDSVLERTYMKLDTTRIKVQDIQEIASSMDVLFHIGVNTIDDNLRMIGREPVMSPETQE 366 (395) T ss_pred HHHHHHHHHHHHHHHHHhcCChhhhcCCceEEEechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCCce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999988 Q ss_pred -cccccccchhhcccccCCCCCCCCCCCcCCCC Q lcl|NC_010576. 395 -LFNRNIADGNQVGGINTPGQITSDQPATASTD 426 (447) Q Consensus 395 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 426 (447) +++.|+.+....... .++++.++++ +|+ T Consensus 367 ~~~~~n~~~~~~~~~~-~kgge~~~~~---~~~ 395 (395) T protein:vir:40 367 RFVTKNYAPLGENEED-LKGGDINENK---GDS 395 (395) T ss_pred eeeccccccccccccc-cCCCCCCCCc---CCC Confidence 467777766433221 1222221111 111 No 44 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=100.00 E-value=1.1e-79 Score=453.41 Aligned_cols=394 Identities=13% Similarity=0.112 Sum_probs=277.7 Q ss_pred CchhHh--hhhh-cccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceE Q lcl|NC_010576. 1 MASSDR--LLHS-WNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFK 77 (447) Q Consensus 1 Mg~~~~--l~~~-~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~ 77 (447) |.||.| +... .+.+..+-. .........+..| ...+... 
++...++++++|++||++||++||+|||+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~-------~~~~~~~-v~~~~a~~~~~v~~~i~~ia~~iA~lp~~ 71 (412) T protein:vir:26 1 MNVIAKENIVTRIKKKLIDNWI-DQSTSKLYDFSPW-------KNRSFWG-VINNTLETNETIFSAITKLSNSMASLPLK 71 (412) T ss_pred CccchhhhhhhhhhhhHhhhhh-ccccccccccccc-------CCccccc-cchhhhhccHHHHHHHHHHHHhHhhCcee Confidence 988865 2211 111100000 0111111111111 1112222 23567899999999999999999999999 Q ss_pred EEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecC Q lcl|NC_010576. 78 HLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFP 157 (447) Q Consensus 78 ~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (447) +||+++ ..+|++++||+.+||++||+++||+.++.+|+++||||+++.++..+.... +++.+.....+..... T Consensus 72 ~~~~~~------~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~-L~~l~~~~v~v~~~~~ 144 (412) T protein:vir:26 72 MYEDYK------VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSK-LFLLNPDVVEMLIENQ 144 (412) T ss_pred Eeeccc------cccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEE-EEEEcCceeEEEEeCC Confidence 998543 357999999999999999999999999999999999999999988776544 4444444434433222 Q ss_pred -CceEEEEeeecccccceeeecccccccccccc-cccccchhHHHHHHHHHHHHHHHHH--HHhh-cCcccceeeeCCcC Q lcl|NC_010576. 158 -RQVMVRVWNDNTGLEQDLLVSKENCIIIESPF-YAILNDTNQTLRMLEQKIKLMNSQD--NRAS-SGKLNGFIQFPYST 232 (447) Q Consensus 158 -~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~--~~~n-~~~~~gvl~~~~~~ 232 (447) ..+. |......+..+.+++++|+|++.+. .+... +.+.+..+...+....+.. 
.+.+ +..++++++.+..+ T Consensus 145 ~~~~~---y~~~~~~g~~~~~~~~evih~~~~~~~~~~~-G~s~i~~~~~~i~~~~a~~~~~~~~~~~~~~~i~~~~~~l 220 (412) T protein:vir:26 145 SRELY---YSIHAATGNKLIVHNMDMLHFKHIVASNMVQ-GISPIDVLKNTTDFDNAVRTFNLTEMQKPDSFMLKYGSNV 220 (412) T ss_pred CcEEE---EEEEcCCceEEEEccccEEEeCCCCCCCCcc-cccHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCCCC Confidence 2222 2233333455678999999999742 22221 2233444444444433332 2333 34445677888888 Q ss_pred ChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcCC------c Q lcl|NC_010576. 233 KSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNGT------A 305 (447) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g~------~ 305 (447) ++++.++.++ .|.+.. +++++++||++|++|+++++++.+++ ++.+++++++||++|||||++|++. + T Consensus 221 ~~e~~~~~~~----~~~~~~-~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn 295 (412) T protein:vir:26 221 GKEKRQQVLE----DFKQYY-EENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAK 295 (412) T ss_pred CHHHHHHHHH----HHHHHh-hcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc Confidence 7765555544 444433 35778999999999999999998865 6889999999999999999999752 3 Q ss_pred HHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC Q lcl|NC_010576. 
306 NEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKA 385 (447) Q Consensus 306 ~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~ 385 (447) .|++.++|+++||.||+++||++||++||++.++..|++|+||++.|+++|.+++++++++++++|++|+||+|+++|+| T Consensus 296 ~e~~~~~f~~~~l~P~~~~ie~~ln~kLl~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~ 375 (412) T protein:vir:26 296 NEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLP 375 (412) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC Confidence 48999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCccccc-cccccccchhhccc--ccCCCCCCCCCCC Q lcl|NC_010576. 386 PHPNPLANE-LFNRNIADGNQVGG--INTPGQITSDQPA 421 (447) Q Consensus 386 p~~g~~~~~-~~~~~~~~~~~~~~--~~~~~~~~~~~~~ 421 (447) |+|| ||. +++.|+.+...... ...+|++.++++. T Consensus 376 p~~g--gD~~~~~~n~~~~~~~~~~~~~~~gG~~n~~e~ 412 (412) T protein:vir:26 376 PVEG--GDKPLISGDLYPIDTPLELRKSLKGGDKNVNES 412 (412) T ss_pred CCCC--cCeeeecccccccccchhhcccccCCCCCcCCC Confidence 9988 465 66788777643221 1122222222221 No 45 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=100.00 E-value=1.9e-79 Score=452.17 Aligned_cols=391 Identities=13% Similarity=0.113 Sum_probs=278.5 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) =++++|+++.+ ..+-. .+.......+..|.. .+... 
++..+++++++|++||++||++||+|||++|| T Consensus 4 ~~~~~~~~~~~---~~~~~-~~~~~~~~~~~~~~~-------~~~~~-v~~~~~~~~~~V~~ci~~Ia~~ia~lp~~~~~ 71 (409) T protein:vir:93 4 ENIVTRIKKKL---IDNWI-DQSTSKLYDFSPWKN-------RSFWG-VINNTLETNETIFSAITKLSNSMASLPLKMYE 71 (409) T ss_pred cchhhhhhhhh---hhhhh-ccccccccccccccC-------ccccc-cchhhhhccHHHHHHHHHHHHhhhhCceeEee Confidence 35666654422 11110 011111112222221 11111 23457899999999999999999999999998 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecC-Cc Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFP-RQ 159 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 159 (447) +++ ..+|++++||+.+||++||+++||+.++.+++++||||+++.++..+.... +++++.....+..... .. T Consensus 72 ~~~------~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~-L~~l~~~~v~~~~~~~~~~ 144 (409) T protein:vir:93 72 DYK------VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSK-LFLLNPDVVEMLIENQSRE 144 (409) T ss_pred ccc------cccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEE-EEEEcCceeEEEEeCCCcE Confidence 543 357999999999999999999999999999999999999999987766544 4444444444433222 22 Q ss_pred eEEEEeeecccccceeeecccccccccccc-cccccchhHHHHHHHHHHHHHHHHH--HHhhc-CcccceeeeCCcCChH Q lcl|NC_010576. 160 VMVRVWNDNTGLEQDLLVSKENCIIIESPF-YAILNDTNQTLRMLEQKIKLMNSQD--NRASS-GKLNGFIQFPYSTKST 235 (447) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~--~~~n~-~~~~gvl~~~~~~~~~ 235 (447) +. |......+..+.++.++|+|++.+. .+... +.+.+..+...+....+.. 
.+.++ ..++++++.+..++++ T Consensus 145 ~~---y~~~~~~g~~~~~~~~eVih~r~~~~~~~~~-G~s~i~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~e 220 (409) T protein:vir:93 145 LY---YSIHAATGNKLIVHNMDMLHFKHIVASNMVQ-GISPIDVLKNTTDFDNAVRTFNLTEMQKPDSFMLKYGSNVGKE 220 (409) T ss_pred EE---EEEEcCCceEEEEccccEEEeCCCCCCCccc-cccHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCCCCCHH Confidence 22 2233334455678999999999642 22221 2233444455544443322 23343 3445677888888776 Q ss_pred HHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcCC------cHHH Q lcl|NC_010576. 236 ARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNGT------ANEQ 308 (447) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g~------~~e~ 308 (447) +.++.+ +.|.+.+ +++++++|+++|++|+++++++.+++ ++.+++++++||++|||||++|++. +.|+ T Consensus 221 ~~~~~~----~~~~~~~-~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~ 295 (409) T protein:vir:93 221 KRQQVL----EDFKQYY-EENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEE 295 (409) T ss_pred HHHHHH----HHHHHHh-hcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHH Confidence 555544 4444433 35678999999999999999998865 6889999999999999999999742 3489 Q ss_pred HHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Q lcl|NC_010576. 
309 QTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHP 388 (447) Q Consensus 309 ~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ 388 (447) +.++|+++||.||+++||++|+++||++.++..|++|+||++.|+++|.+++++++++++++|++|+||+|+++|+||+| T Consensus 296 ~~~~f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ 375 (409) T protein:vir:93 296 LNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVE 375 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred Cccccc-cccccccchhhcccc--cCCCCCCCCCCC Q lcl|NC_010576. 389 NPLANE-LFNRNIADGNQVGGI--NTPGQITSDQPA 421 (447) Q Consensus 389 g~~~~~-~~~~~~~~~~~~~~~--~~~~~~~~~~~~ 421 (447) | ||. +++.|+++....... ..+|++.+.++. T Consensus 376 g--gD~~~~~~n~~~~~~~~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:93 376 G--GDKPLISGDLYPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred C--cCeeeecccccccccchhhcccccCCCCCcCCC Confidence 7 465 668888776543221 122222222221 No 46 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=100.00 E-value=4.8e-78 Score=444.45 Aligned_cols=391 Identities=12% Similarity=0.115 Sum_probs=276.6 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) =.+++|+++.+ +.+-.. .+.+.......|. ..+... 
++...|+++++|++||++||++||+|||++|| T Consensus 4 ~~~~~~~k~~~--~~~~~~--~~~~~~~~~~~~~-------~~~~~~-v~~~~a~~~~~v~~~i~~Ia~~ia~lp~~~~~ 71 (409) T protein:vir:94 4 ENIVTRIKKKL--IDNWID--QSASKLYDFSPWK-------NKSFWG-VINNTLETNETIFSAITKLSNSMASLPLKMYE 71 (409) T ss_pred cccchhhhhHH--hhhhhc--CCccccccccccc-------Cccccc-cchhhhhccHHHHHHHHHHHHhhhhCceeEee Confidence 24566655422 111111 1111111111111 111111 23457899999999999999999999999998 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecC-Cc Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFP-RQ 159 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 159 (447) +++ ..+|++++||+.+||++||+++||+.++.+++++||||+++.++..+.+..++++ ++....+..... +. T Consensus 72 ~~~------~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l-~~~~v~v~~~~~~~~ 144 (409) T protein:vir:94 72 DYK------VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLL-NPDVVEMLIENQSRE 144 (409) T ss_pred ccc------ccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEE-cCceeEEEEeCCCcE Confidence 543 3579999999999999999999999999999999999999999877765554444 444333332222 22 Q ss_pred eEEEEeeecccccceeeecccccccccccc-cccccchhHHHHHHHHHHHHHHHHH--HHhh-cCcccceeeeCCcCChH Q lcl|NC_010576. 160 VMVRVWNDNTGLEQDLLVSKENCIIIESPF-YAILNDTNQTLRMLEQKIKLMNSQD--NRAS-SGKLNGFIQFPYSTKST 235 (447) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~--~~~n-~~~~~gvl~~~~~~~~~ 235 (447) +. |......+..+.++.+||+|++.+. .+... +.+.+..+...+....+.. 
.+.+ +..++++++.+..++++ T Consensus 145 ~~---y~~~~~~g~~~~~~~~dvih~r~~~~~~~~~-G~s~l~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~e 220 (409) T protein:vir:94 145 LY---YSIHAATGNKLIVHNMDMLHFKHIVASNMVQ-GISPIDVLKNTTDFDNAVRTFNLTEMQKPDSFMLKYGSNVGKE 220 (409) T ss_pred EE---EEEEcCCceEEEEccccEEEecCCCCCCccc-cccHHHHHHHHHHHHHHHHHHHHHhcCCCCeeEEecCCCCCHH Confidence 22 2233334456678899999999642 22221 2233444444544433332 2233 34445678888888776 Q ss_pred HHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcCC------cHHH Q lcl|NC_010576. 236 ARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNGT------ANEQ 308 (447) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g~------~~e~ 308 (447) +.++ +++.|.+.+ ++++++++|++|++|+++++++++++ ++.+++++++||++|||||++|++. +.|+ T Consensus 221 ~~~~----~~~~~~~~~-~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~ 295 (409) T protein:vir:94 221 KRQQ----VLEDFKQYY-EENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEE 295 (409) T ss_pred HHHH----HHHHHHHHh-hcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHH Confidence 5544 445554444 36778999999999999999998865 6889999999999999999999742 3489 Q ss_pred HHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Q lcl|NC_010576. 
309 QTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHP 388 (447) Q Consensus 309 ~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ 388 (447) +.+.|+++||.||+++||++|+++||++.++..|++|+||++.|+++|.+++++++.+++++|+||+||+|+++|+||+| T Consensus 296 ~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ 375 (409) T protein:vir:94 296 LNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVE 375 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred Cccccc-cccccccchhhcccc--cCCCCCCCCCCC Q lcl|NC_010576. 389 NPLANE-LFNRNIADGNQVGGI--NTPGQITSDQPA 421 (447) Q Consensus 389 g~~~~~-~~~~~~~~~~~~~~~--~~~~~~~~~~~~ 421 (447) | ||. +++.|+.+....... ..+|++.+..+. T Consensus 376 g--gD~~~~~~n~~~~~~~~~~~~~~kGG~~n~~e~ 409 (409) T protein:vir:94 376 G--GDKPLISGDLYPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred C--cCeEeecccccccccchhhcccccCCCCCcCCC Confidence 7 465 667888776443211 112222111111 No 47 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=100.00 E-value=7.9e-77 Score=437.82 Aligned_cols=394 Identities=11% Similarity=0.023 Sum_probs=271.4 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||.. +.....+..++ | ..++++. ....++.. 
.++++++||+||++||++||+|||++++ T Consensus 1 m~~f~~----------~~~~~~~~~~~-----~-~~~~~~~--~~~~~~~~-~Al~~~~V~~~i~~Ia~~iA~lp~~~~~ 61 (406) T protein:vir:97 1 MSFFQP----------LGTSKVSYDDY-----I-SSVLAGD--VSQKYLGV-SALKNSDILTATSIIAGDIARFPLVKKD 61 (406) T ss_pred Cccccc----------cCCCCCCcchH-----H-HHHhcCC--CCcccccc-hhhccHHHHHHHHHHHHhhhhCeeEEEe Confidence 777642 11111121111 1 1112211 11233433 4789999999999999999999998765 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) .+ |. .+.+|++++||+.+||++||+++||+.++.+|+++||||+++.++........+++.+.....+.....+.+ T Consensus 62 ~~--g~--~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~L~~i~p~~v~v~~~~~~~~ 137 (406) T protein:vir:97 62 VN--GD--IIHDEDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQALQFQFYRPSETTVEETDNHEI 137 (406) T ss_pred cC--cc--ccccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEEEEEECCCeeEEEEcCCceE Confidence 43 32 356899999999999999999999999999999999999999998654444455555444444443333344 Q ss_pred EEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCCcCChH Q lcl|NC_010576. 161 MVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPYSTKST 235 (447) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~~~~~ 235 (447) .+.+. ....+..+.++.++|||++....++.. 
+.+.+..+...+....+ ...+.||+.+++++..+..++++ T Consensus 138 ~y~~~--~~~~~~~~~~~~~evih~r~~~~dg~~-G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~~i~~~~~~l~~e 214 (406) T protein:vir:97 138 VYTFT--DMLTAKQVKCFAHDVIHWKFFSHDTIL-GRSPLLSLGDEIDLQTGGINTLIKFFKDGFSSGILTMKGAQLSGD 214 (406) T ss_pred EEEEE--ecCCceEEEEccccEEEecCCCCCCcc-cccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEecCCCCCHH Confidence 44332 222344567889999999975433221 12233333344333222 22346888888888888878766 Q ss_pred HHHHHHHHHHHHHHHHhc-cCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcCC----cHHHH Q lcl|NC_010576. 236 ARAAQAARRKQEIENEMA-NNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNGT----ANEQQ 309 (447) Q Consensus 236 ~~~~~~~~~~~~~~~~~~-~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g~----~~e~~ 309 (447) +.++.+ +.|++... .|+|+++||++|++|++|++++++++ ++.+++++++||++|||||++||++ +.|++ T Consensus 215 ~~~~~~----~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~~~~e~~ 290 (406) T protein:vir:97 215 ARQRAR----QEFEKMREGSVGGSPLVFDSTMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKLGVNSPNQSVAQL 290 (406) T ss_pred HHHHHH----HHHHHHhcccccCceeecCCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCCCCcchHHHH Confidence 555444 44444444 47899999999999999999998864 7889999999999999999999743 45899 Q ss_pred HHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Q lcl|NC_010576. 310 TLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPN 389 (447) Q Consensus 310 ~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g 389 (447) .++|+++||.||+++||++|++|||++.++. 
+++|+||++.+ .+.+++.+.+++++|+||+||+|+++|+||+++ T Consensus 291 ~~~f~~~~l~P~~~~ie~~l~~kll~~~~~~-~~~i~fd~~~~----~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~~ 365 (406) T protein:vir:97 291 MEDYVTNDLPFYFDAITSELGLKTLNDKDRR-LYHIEFDTRSV----TGRNVDEIVKLVNNQILTPNQGLVELGKQKSTD 365 (406) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhcChhhcc-ceeEEEecCcc----chhhHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Confidence 9999999999999999999999999987754 68999997654 556677888999999999999999999999999 Q ss_pred ccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCCCCcccccc Q lcl|NC_010576. 390 PLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTSA 435 (447) Q Consensus 390 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (447) +++|. +++.|+++........++... .... .+.+++++.+ T Consensus 366 ~~gD~~~~~~n~~~~~~~~~~~~~~~~--~~~g----g~~~~~~~~~ 406 (406) T protein:vir:97 366 PNMDRYQSSLNYVFLDKKEEYQDKVGI--KGKG----GEVNAEEDKS 406 (406) T ss_pred CCCCeEeeccCccchhccccccccccc--ccCC----CCCCCCCCCC Confidence 98887 467788776543211110110 0011 1111111111 No 48 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=100.00 E-value=3.3e-77 Score=439.87 Aligned_cols=374 Identities=13% Similarity=0.120 Sum_probs=260.0 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) |||||++.. ++.... +...+... 
...+....++++++|++||++||++||+|||++|+ T Consensus 1 Mgl~d~~~~------~~~~~~-------~~~~~~~~---------~~~~~~~~~l~~~~v~~~i~~Ia~~ia~lp~~v~~ 58 (395) T protein:vir:96 1 MGILDFFSF------KKSGTL-------SDDDSGST---------TSEKLTNVVLKEDALYKCVNYLARIISKSTFRIKA 58 (395) T ss_pred CcchhhhcC------CCCccc-------cccccccc---------hhhhcchhhhhhHHHHHHHHHHHHhhccceeEEEe Confidence 999998543 211111 11111111 11233467889999999999999999999999987 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) +++ ....+|++++||+.+||++||+++||+.++.+++++||||+++.++....+...+..... .++... T Consensus 59 ~~~----~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~~~~~~~~~~~~~~~~~~~-------~~~~~~ 127 (395) T protein:vir:96 59 PEK----LTENQKDWLYWINTKANPNQSASQFWVEVVQKLLVDGETLIFVIPGKGIYVADAFTQDKK-------LSGNKF 127 (395) T ss_pred CCc----cccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEcCCceecCCccccccc-------ccccee Confidence 532 234689999999999999999999999999999999999999988765433332222111 111111 Q ss_pred EEEEeeecccccceeeeccccccccccccc-------ccccchhHHHHHHHHHHHH----HHHHHHHhhcCcccceeeeC Q lcl|NC_010576. 161 MVRVWNDNTGLEQDLLVSKENCIIIESPFY-------AILNDTNQTLRMLEQKIKL----MNSQDNRASSGKLNGFIQFP 229 (447) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~-------~~~~~~~~~~~~~~~~~~~----~~~~~~~~n~~~~~gvl~~~ 229 (447) ...... +......++.++|+|++.+.. ++.......+......... 
......+.+++.+.++++.+ T Consensus 128 ~~v~~~---~~~~~~~~~~~dvih~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 204 (395) T protein:vir:96 128 KVSRVQ---GQTYEKIFTFDQVIYLKNDNSDLMLKVESLWEEYGELLGHVINNQKIANQIRFTMTPPKDKVRERAQENSD 204 (395) T ss_pred eeeeec---cceeeeEeccCceEEecccCCccccccccccchHHHHHHHHHHHHHHHHHHHHHhhhcccccccceeeccC Confidence 111111 111123578899999985432 2222222222221111100 11122245677777787766 Q ss_pred CcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhh-HHHHHHH------HHHHHHHhCCCHHHhc Q lcl|NC_010576. 230 YSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQL------QQDFYNQMGITEAILN 302 (447) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~------~~~Ia~~fgVP~~~l~ 302 (447) ..... +..+++.+++.+....++++++++++|++|+++++++.+++ ++.+++. .++||++|||||++|+ T Consensus 205 ~~~~~----~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp~~l~ 280 (395) T protein:vir:96 205 GGRQP----KSDKDFFKRTIEKIRTESVVGIPVTANTNYEEYGSKNTGSVKSYVDDIKKLKDQYMAEFAEMLGIPISLLH 280 (395) T ss_pred chhhH----HHHHHHHHHHHHHhhcCCcceEEccCCceeEecccChhhhhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhc Confidence 54433 34445555555555567788999999999999999987755 5666554 5799999999999998 Q ss_pred CCc--HHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHH Q lcl|NC_010576. 
303 GTA--NEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRE 380 (447) Q Consensus 303 g~~--~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~ 380 (447) |++ .|++.++||++||.||+++||++|+++||++.++..|.+ |+++.++++|.+++++++.+++++||||+||+|+ T Consensus 281 ~~~sn~e~~~~~f~~~~L~P~~~~ie~~l~~~Ll~~~e~~~~~~--f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~ 358 (395) T protein:vir:96 281 GDIADNQKNYELLLEGPIESLITNIVDGLEYAIFDKSETLEGSF--IKVTGLKNYDLFSISSQADKLISSGFVFIDEVRE 358 (395) T ss_pred CCCccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhhcCcee--EeecchhccCHHHHHHHHHHHHhCCCcCHHHHHH Confidence 864 589999999999999999999999999999988877654 6778999999999999999999999999999999 Q ss_pred HhCCCCCCCccccc-cccccccchhhcccccCCCCCCCCCCCc Q lcl|NC_010576. 381 LTGKAPHPNPLANE-LFNRNIADGNQVGGINTPGQITSDQPAT 422 (447) Q Consensus 381 ~~gl~p~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 422 (447) ++|+||++|+++|. |+++|+++..+.++ +.++...+ T Consensus 359 ~~gl~pi~~~~gD~~~~~~N~~~~~~~gg------e~~~~~~~ 395 (395) T protein:vir:96 359 EIGLPELPDGLGKVLYMTKNYESVLERGG------EVDEEVET 395 (395) T ss_pred HhCCCCCCCCCCceeeecccceechhccC------CCCCCCCC Confidence 99999999999987 56888887655322 21111111 No 49 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=100.00 E-value=6.9e-77 Score=438.11 Aligned_cols=383 Identities=15% Similarity=0.161 Sum_probs=269.3 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||+||. +++.. +. +|.. . .. 
...+..+.++++++|++||++||++||++||++|+ T Consensus 1 Mg~f~~lf------~~~~~----~~------~~~~-~-----~~-~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~ 57 (395) T protein:vir:10 1 MSILEKIF------KTRKD----IT------YMLD-L-----DM-IEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLE 57 (395) T ss_pred Cchhhhhh------ccCcc----cc------cccc-c-----hh-ccccchhhhhhhHHHHHHHHHHHHhhccceeEecc Confidence 99999854 33321 11 1111 0 00 11233467889999999999999999999999987 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) .+ ++.+|+++++|+.+||++||+++||+.++.+|++.|++|+++.++... ++..+. .......++... T Consensus 58 ~~------~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~~~-----~~~~~~-~~~~~~~~~~~~ 125 (395) T protein:vir:10 58 GN------RIQKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSKEL-----LIADSF-YREEYALYDDIF 125 (395) T ss_pred CC------ccccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCCCe-----EecCCc-cceeEeecCcce Confidence 42 356899999999999999999999999999999999999987655332 222111 112222223222 Q ss_pred EEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHH Q lcl|NC_010576. 161 MVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQ 240 (447) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~ 240 (447) ...... 
.......+++++|+|++.+.......+.+++..+..+++.+.+ .+++++.++|+|+.+....++ ++ T Consensus 126 ~~~~~~---~~~~~~~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~~~~~--~~~~~~~~~gii~~~~~~~~~---e~ 197 (395) T protein:vir:10 126 KDVTVK---DYTYQRTFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFGRMIG--AQLKNYQIRGILKSASSAYDE---KN 197 (395) T ss_pred eEEEEc---CceeeeeeccccEEEEccCCCCcccccchHHHHHHHHHHHHHH--HHHhcCCCceEEEeCCCCCCH---HH Confidence 211111 1122245789999999865433333344556666665555433 467899999999988765443 33 Q ss_pred HHHHHHHHHHHhcc-C--CcceeecCCCceeeecCCChhh------hhHHHHHHHHHHHHHHhCCCHHHhcCCcH--HHH Q lcl|NC_010576. 241 AARRKQEIENEMAN-N--KYGVATLDTQEKFVSAGMGLQN------NLLSDVRQLQQDFYNQMGITEAILNGTAN--EQQ 309 (447) Q Consensus 241 ~~~~~~~~~~~~~~-n--~~~~~vl~~g~~~~~l~~~~~~------~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~--e~~ 309 (447) ++++++.|.+..++ + +.+++++++|++|+++++++.+ ++++.+++++++||++|||||++|+|+++ |++ T Consensus 198 ~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~~~sn~e~~ 277 (395) T protein:vir:10 198 IEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYGETADLEKN 277 (395) T ss_pred HHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcCcccCHHHH Confidence 33444555444433 3 3446778999999999988754 34678899999999999999999998765 999 Q ss_pred HHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Q lcl|NC_010576. 
310 TLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPN 389 (447) Q Consensus 310 ~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g 389 (447) .++|+++||.||+++||++||+|||++.++..+ ++|+++.++++|.+++++++.+++++||||+||+|+++|+||++| T Consensus 278 ~~~~~~~~l~P~~~~ie~~l~~kL~~~~~~~~~--~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~ 355 (395) T protein:vir:10 278 TLVFEKFCLTPLLKKIQNELNAKLITQSMYLKD--TRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDN 355 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcChhhhccc--ceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Confidence 999999999999999999999999998877664 578999999999999999999999999999999999999999999 Q ss_pred ccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCCCC Q lcl|NC_010576. 390 PLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDPLN 429 (447) Q Consensus 390 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 429 (447) |++|. +++.++.+.........+++. ...+.++.+.+.+ T Consensus 356 g~~d~~~~~~n~~~~~~~~~~~~~~~~-~~~kgg~~~~~g~ 395 (395) T protein:vir:10 356 PELDEYLITKNYEKANSGENDEKEKDE-NTLKGGDEDESGD 395 (395) T ss_pred CCCceeeeccccccccccccccCcccc-cccCCCCCCCCCC Confidence 98887 567777765443222111111 1111111111111 No 50 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=100.00 E-value=6.9e-77 Score=438.11 Aligned_cols=383 Identities=15% Similarity=0.161 Sum_probs=269.3 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||+||. +++.. +. +|.. . .. 
...+..+.++++++|++||++||++||++||++|+ T Consensus 1 Mg~f~~lf------~~~~~----~~------~~~~-~-----~~-~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~ 57 (395) T protein:vir:10 1 MSILEKIF------KTRKD----IT------YMLD-L-----DM-IEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLE 57 (395) T ss_pred Cchhhhhh------ccCcc----cc------cccc-c-----hh-ccccchhhhhhhHHHHHHHHHHHHhhccceeEecc Confidence 99999854 33321 11 1111 0 00 11233467889999999999999999999999987 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) .+ ++.+|+++++|+.+||++||+++||+.++.+|++.|++|+++.++... ++..+. .......++... T Consensus 58 ~~------~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~~~-----~~~~~~-~~~~~~~~~~~~ 125 (395) T protein:vir:10 58 GN------RIQKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSKEL-----LIADSF-YREEYALYDDIF 125 (395) T ss_pred CC------ccccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCCCe-----EecCCc-cceeEeecCcce Confidence 42 356899999999999999999999999999999999999987655332 222111 112222223222 Q ss_pred EEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHH Q lcl|NC_010576. 161 MVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQ 240 (447) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~ 240 (447) ...... 
.......+++++|+|++.+.......+.+++..+..+++.+.+ .+++++.++|+|+.+....++ ++ T Consensus 126 ~~~~~~---~~~~~~~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~~~~~--~~~~~~~~~gii~~~~~~~~~---e~ 197 (395) T protein:vir:10 126 KDVTVK---DYTYQRTFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFGRMIG--AQLKNYQIRGILKSASSAYDE---KN 197 (395) T ss_pred eEEEEc---CceeeeeeccccEEEEccCCCCcccccchHHHHHHHHHHHHHH--HHHhcCCCceEEEeCCCCCCH---HH Confidence 211111 1122245789999999865433333344556666665555433 467899999999988765443 33 Q ss_pred HHHHHHHHHHHhcc-C--CcceeecCCCceeeecCCChhh------hhHHHHHHHHHHHHHHhCCCHHHhcCCcH--HHH Q lcl|NC_010576. 241 AARRKQEIENEMAN-N--KYGVATLDTQEKFVSAGMGLQN------NLLSDVRQLQQDFYNQMGITEAILNGTAN--EQQ 309 (447) Q Consensus 241 ~~~~~~~~~~~~~~-n--~~~~~vl~~g~~~~~l~~~~~~------~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~--e~~ 309 (447) ++++++.|.+..++ + +.+++++++|++|+++++++.+ ++++.+++++++||++|||||++|+|+++ |++ T Consensus 198 ~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~~~sn~e~~ 277 (395) T protein:vir:10 198 IEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYGETADLEKN 277 (395) T ss_pred HHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcCcccCHHHH Confidence 33444555444433 3 3446778999999999988754 34678899999999999999999998765 999 Q ss_pred HHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Q lcl|NC_010576. 
310 TLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPN 389 (447) Q Consensus 310 ~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g 389 (447) .++|+++||.||+++||++||+|||++.++..+ ++|+++.++++|.+++++++.+++++||||+||+|+++|+||++| T Consensus 278 ~~~~~~~~l~P~~~~ie~~l~~kL~~~~~~~~~--~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~ 355 (395) T protein:vir:10 278 TLVFEKFCLTPLLKKIQNELNAKLITQSMYLKD--TRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDN 355 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcChhhhccc--ceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Confidence 999999999999999999999999998877664 578999999999999999999999999999999999999999999 Q ss_pred ccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCCCC Q lcl|NC_010576. 390 PLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDPLN 429 (447) Q Consensus 390 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 429 (447) |++|. +++.++.+.........+++. ...+.++.+.+.+ T Consensus 356 g~~d~~~~~~n~~~~~~~~~~~~~~~~-~~~kgg~~~~~g~ 395 (395) T protein:vir:10 356 PELDEYLITKNYEKANSGENDEKEKDE-NTLKGGDEDESGD 395 (395) T ss_pred CCCceeeeccccccccccccccCcccc-cccCCCCCCCCCC Confidence 98887 567777765443222111111 1111111111111 No 51 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=100.00 E-value=6.9e-77 Score=438.11 Aligned_cols=383 Identities=15% Similarity=0.161 Sum_probs=269.3 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||+||. +++.. +. +|.. . .. 
...+..+.++++++|++||++||++||++||++|+ T Consensus 1 Mg~f~~lf------~~~~~----~~------~~~~-~-----~~-~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~ 57 (395) T protein:vir:95 1 MSILEKIF------KTRKD----IT------YMLD-L-----DM-IEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLE 57 (395) T ss_pred Cchhhhhh------ccCcc----cc------cccc-c-----hh-ccccchhhhhhhHHHHHHHHHHHHhhccceeEecc Confidence 99999854 33321 11 1111 0 00 11233467889999999999999999999999987 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) .+ ++.+|+++++|+.+||++||+++||+.++.+|++.|++|+++.++... ++..+. .......++... T Consensus 58 ~~------~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~~~-----~~~~~~-~~~~~~~~~~~~ 125 (395) T protein:vir:95 58 GN------RIQKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSKEL-----LIADSF-YREEYALYDDIF 125 (395) T ss_pred CC------ccccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCCCe-----EecCCc-cceeEeecCcce Confidence 42 356899999999999999999999999999999999999987655332 222111 112222223222 Q ss_pred EEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHH Q lcl|NC_010576. 161 MVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQ 240 (447) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~ 240 (447) ...... 
.......+++++|+|++.+.......+.+++..+..+++.+.+ .+++++.++|+|+.+....++ ++ T Consensus 126 ~~~~~~---~~~~~~~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~~~~~--~~~~~~~~~gii~~~~~~~~~---e~ 197 (395) T protein:vir:95 126 KDVTVK---DYTYQRTFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFGRMIG--AQLKNYQIRGILKSASSAYDE---KN 197 (395) T ss_pred eEEEEc---CceeeeeeccccEEEEccCCCCcccccchHHHHHHHHHHHHHH--HHHhcCCCceEEEeCCCCCCH---HH Confidence 211111 1122245789999999865433333344556666665555433 467899999999988765443 33 Q ss_pred HHHHHHHHHHHhcc-C--CcceeecCCCceeeecCCChhh------hhHHHHHHHHHHHHHHhCCCHHHhcCCcH--HHH Q lcl|NC_010576. 241 AARRKQEIENEMAN-N--KYGVATLDTQEKFVSAGMGLQN------NLLSDVRQLQQDFYNQMGITEAILNGTAN--EQQ 309 (447) Q Consensus 241 ~~~~~~~~~~~~~~-n--~~~~~vl~~g~~~~~l~~~~~~------~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~--e~~ 309 (447) ++++++.|.+..++ + +.+++++++|++|+++++++.+ ++++.+++++++||++|||||++|+|+++ |++ T Consensus 198 ~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~~~sn~e~~ 277 (395) T protein:vir:95 198 IEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYGETADLEKN 277 (395) T ss_pred HHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcCcccCHHHH Confidence 33444555444433 3 3446778999999999988754 34678899999999999999999998765 999 Q ss_pred HHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Q lcl|NC_010576. 
310 TLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPN 389 (447) Q Consensus 310 ~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g 389 (447) .++|+++||.||+++||++||+|||++.++..+ ++|+++.++++|.+++++++.+++++||||+||+|+++|+||++| T Consensus 278 ~~~~~~~~l~P~~~~ie~~l~~kL~~~~~~~~~--~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~ 355 (395) T protein:vir:95 278 TLVFEKFCLTPLLKKIQNELNAKLITQSMYLKD--TRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDN 355 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcChhhhccc--ceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Confidence 999999999999999999999999998877664 578999999999999999999999999999999999999999999 Q ss_pred ccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCCCC Q lcl|NC_010576. 390 PLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDPLN 429 (447) Q Consensus 390 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 429 (447) |++|. +++.++.+.........+++. ...+.++.+.+.+ T Consensus 356 g~~d~~~~~~n~~~~~~~~~~~~~~~~-~~~kgg~~~~~g~ 395 (395) T protein:vir:95 356 PELDEYLITKNYEKANSGENDEKEKDE-NTLKGGDEDESGD 395 (395) T ss_pred CCCceeeeccccccccccccccCcccc-cccCCCCCCCCCC Confidence 98887 567777765443222111111 1111111111111 No 52 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=100.00 E-value=8.6e-77 Score=437.61 Aligned_cols=378 Identities=13% Similarity=0.105 Sum_probs=260.5 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||||+.+.+. . .. ....+.. 
.........++++++|++||++||++||++||++|+ T Consensus 1 MGlf~~~~~~~~------~-~~------~~~~~~~---------~~~~~~~~~~~~~~~v~~~I~~ia~~iA~lp~~~~~ 58 (395) T protein:vir:98 1 MGILDFFSFKKS------G-TL------SDDDSGS---------TTSEKLTNVVLKEDALYKCVNYLARIISKSTFRLKT 58 (395) T ss_pred CcchhhhcCCCc------c-cc------cccccch---------hhhhhcchhhhhhHHHHHHHHHHHHHHhhCceeEEe Confidence 999998643211 1 00 1111110 011123456789999999999999999999999998 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) .+++ ...+|++++||+.+|||+||+++||+.++.+++++||||+++.++....++..+.... .+.......+ T Consensus 59 ~~~~----~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~~~~~~~~~~~~~~----~~~~~~~~~~ 130 (395) T protein:vir:98 59 PEKL----TENQKDWLYWINTKANPNQSASQFWVEVIQKLLVDGETLIFVIPGKGIYVADSFTQDK----KISGSQFKVS 130 (395) T ss_pred cCCc----ccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeCCceecCCcccccc----cccCccccee Confidence 5332 2457999999999999999999999999999999999999998876544333222211 1111111111 Q ss_pred EEEEeeecccccceeeecccccccccccccccccch----hHHHHHHHHHHHH---HHHHHHHhhcCcccceeeeCCcCC Q lcl|NC_010576. 161 MVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDT----NQTLRMLEQKIKL---MNSQDNRASSGKLNGFIQFPYSTK 233 (447) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~----~~~~~~~~~~~~~---~~~~~~~~n~~~~~gvl~~~~~~~ 233 (447) .+.. ......++.++|+|++.....+.... ......+...+.. ..+...+.++....+++....... 
T Consensus 131 ~~~~------~~~~~~~~~~evih~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 204 (395) T protein:vir:98 131 RVQG------QTYEKTFTFDQVIYLKNDNSDLMSKVESLWEEYGELLGHVINNQKIANQIRFTMIPPKDKVRERAQENSD 204 (395) T ss_pred eecC------ceeeeEecCccEEEecCCCCCccccccchhhhHHHHHHHHHHHHHHHHHHHHhhccccccccccccccCC Confidence 1111 11124577899999986432222111 1111112222221 122223345666666766655554 Q ss_pred hHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCC------hhhhh-HHHHHHHHHHHHHHhCCCHHHhcCCcH Q lcl|NC_010576. 234 STARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMG------LQNNL-LSDVRQLQQDFYNQMGITEAILNGTAN 306 (447) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~------~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g~~~ 306 (447) .+...+..+++.+.+.+....++++++++++|++|++++++ +++.+ ++.+++++++||++|||||++|+++++ T Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l~~~~s 284 (395) T protein:vir:98 205 GGRQSKSDKDFFKRTVEKIRTESVVGIPVTANTNYEEYGSKNTGAVKSYVDDIKKLKDQYMAEFAEMLGIPISLLHGDIA 284 (395) T ss_pred cHHHHHHHHHHHHHHHhhhhcCCcceeecCCCceeEecccccccccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCcc Confidence 44433444444444444444567788999999999999864 34444 567889999999999999999998764 Q ss_pred --HHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Q lcl|NC_010576. 
307 --EQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGK 384 (447) Q Consensus 307 --e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl 384 (447) |++.++|+++||.||+++||++|++|||++.++..|.+ |+++.|+++|.+++++++.+++++||+|+||+|+++|+ T Consensus 285 n~e~~~~~f~~~tl~P~~~~ie~~l~~kll~~~~~~~g~~--f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~ 362 (395) T protein:vir:98 285 DNQKNYELLLEGPIESLITNIVDGLEYAIFDKSETLQGSF--IKVTGLKNYDLFSISNQADKLISSGFVFIDEVREEIGL 362 (395) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhcCcce--eeehhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Confidence 89999999999999999999999999999988877665 66679999999999999999999999999999999999 Q ss_pred CCCCCccccc-cccccccchhhcccccCCCCCCCCCCCc Q lcl|NC_010576. 385 APHPNPLANE-LFNRNIADGNQVGGINTPGQITSDQPAT 422 (447) Q Consensus 385 ~p~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 422 (447) ||++|+++|. |+++|+++.... |+++++++.+ T Consensus 363 ~Pi~~~~gD~~~~~~n~~~~~~~------gge~~~~~~~ 395 (395) T protein:vir:98 363 PELPDGLGKVLYMTKNYESVLER------GGEVDEEVET 395 (395) T ss_pred CCCCCCCCceeeecccceecccc------cCCCCCCCCC Confidence 9999998887 678888876532 2222211111 No 53 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=100.00 E-value=1e-76 Score=437.15 Aligned_cols=399 Identities=12% Similarity=0.042 Sum_probs=277.6 Q ss_pred CchhHhhhhhcccccCCccccccccccccccc--cccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSN--GMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKH 78 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~ 78 (447) -.+++|+.+ +. 
....+.....| |+.+.++....++..+ ....++++++||+||++||++||+|||++ T Consensus 2 ~~~~~~~~~------~~----~~~~~~~~~~~~~~~g~~~~~~~~~~~~~-~~~~a~~~~~v~~~v~~ia~~iA~lp~~v 70 (460) T protein:vir:10 2 ANRIIRALR------EL----TGLDNKFNDAFIKYIGQTFTKYDNNGKTY-LEQGYNINPDVYSCISQMAAKTVAVPYTI 70 (460) T ss_pred chhHHHHHh------hh----hccCCCchHHHHHhhccccCCCccchhhh-hHHHHhcchHHHHHHHHHHHhhhhCceEE Confidence 233444332 11 11111111111 2223333333334333 34568999999999999999999999999 Q ss_pred EEEcCCCcee--------------------------ccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEee Q lcl|NC_010576. 79 LKIDPISGNQ--------------------------TPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPID 132 (447) Q Consensus 79 ~r~~~~~~~~--------------------------~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~ 132 (447) |+.+.+|... ....+++..+|+.+||++||+++||+.++.+++++||||+++.+ T Consensus 71 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r 150 (460) T protein:vir:10 71 KVVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLMS 150 (460) T ss_pred EeccCCccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEe Confidence 9988877532 22345556677789999999999999999999999999999998 Q ss_pred ccCCc---ccceeeeccCCCcceeeecCCceEE-----EEeeecccccceeeeccccccccccccccc-----ccchhHH Q lcl|NC_010576. 133 TTVDP---DSGSFDINTARVGKIMQFFPRQVMV-----RVWNDNTGLEQDLLVSKENCIIIESPFYAI-----LNDTNQT 199 (447) Q Consensus 133 ~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~-----~~~~~~~ 199 (447) +..+. ....+++++.....+.....+.... ..|.. ...+....++.++|||+|.+.... .-.+.+. 
T Consensus 151 ~~~~~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~~~~~-~~~g~~~~~~~~evih~r~~~~~~~~~~~~~~G~sp 229 (460) T protein:vir:10 151 PDDGINAGVPSQMYVLPAHLIKIVLKDDINLLSTDSPIKSYML-IQGDQFIEFNEDEVIHTKYANPNFDLQGSHLYGMSP 229 (460) T ss_pred cCCCccCceeEEEEEEcCceEEEEEcCCCceeeeeeeeeEEEE-ecCceeEEecccceEEEecCCCCcccccCccccccH Confidence 75432 2234555555554444333222111 11111 223445678999999998543221 0011123 Q ss_pred HHHHHHHHHHHH-----HHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhc--cCCcceeecCCCceeeecC Q lcl|NC_010576. 200 LRMLEQKIKLMN-----SQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMA--NNKYGVATLDTQEKFVSAG 272 (447) Q Consensus 200 ~~~~~~~~~~~~-----~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~--~n~~~~~vl~~g~~~~~l~ 272 (447) +..+...+.... +...+.||+.++++++.+..+++++.+ ++++.|++.+. +|+|++++|++|++|++++ T Consensus 230 ~~~~~~~i~~~~~~~~~~~~~f~ng~~~~~i~~~~~~l~~e~~~----~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~ 305 (460) T protein:vir:10 230 IRAILRNINSQNSTIDNNVKTMQNGGVFGFIHGGSTGLTQPQAD----SLKQRLTEMDKSPDRLSQIAGASGEIAFTKIS 305 (460) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCcceeeecCCCCCHHHHH----HHHHHHHHHhcCccccCCceecCCCceEEEcc Confidence 333333333322 233356889999999999888876544 45555555554 4789999999999999999 Q ss_pred CChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC--------CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCc Q lcl|NC_010576. 273 MGLQNNL-LSDVRQLQQDFYNQMGITEAILNG--------TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQ 343 (447) Q Consensus 273 ~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g--------~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~ 343 (447) +++.+++ ++.+++++++||++|||||++||. 
++.|++.++|+++||.||+++||++||++|+++.++..++ T Consensus 306 ~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl~~~~~~~~~~ 385 (460) T protein:vir:10 306 LNTDELKPFDYLKYDQKAICNALGWSDKLLNNNEGGGLNTGNLEEERKRVVTDNIQPDLVILKQAFDKKFIKRFKGYENA 385 (460) T ss_pred CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccCCc Confidence 9988865 788999999999999999999962 2358999999999999999999999999999998888899 Q ss_pred eEEEecchh--hhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccc-cccccccchhhcccccCCCCCCCCC Q lcl|NC_010576. 344 VLVYYRNPF--KLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANE-LFNRNIADGNQVGGINTPGQITSDQ 419 (447) Q Consensus 344 ~i~f~~~~l--~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 419 (447) +|+||++.+ ++.|.+++++ ++++|+||+||+|+++|+||++++.+|. ++++|+.+....++...++.+++++ T Consensus 386 ~i~~d~~~l~~l~~d~~~~~~----~~~~g~~T~NE~R~~~g~~pi~~~~gD~~~~~~n~~~~~~~~~~~~~~~~nq~~ 460 (460) T protein:vir:10 386 VIEWDISELPEMQTDMVAMAS----WLNTIPVTPNEIRIAMKYETLNQDGMDIVFMPSNKVRIDDVSNNLIDSAFNQNQ 460 (460) T ss_pred eEEeecchhhhHHHHHHHHHH----HHhCCCCCHHHHHHHhCCCCCCCCCCCeeeecccccchhhcccccCCCcccCCC Confidence 999999998 5666666654 5688999999999999999998777777 5688888776554433333333333 No 54 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=100.00 E-value=6.7e-77 Score=438.20 Aligned_cols=364 Identities=16% Similarity=0.152 Sum_probs=264.5 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||+||.+ ++.... ... . 
+ ..+ ..+....++++++|++||++||++||++||++|+ T Consensus 1 Mg~f~~l~~------~~~~~~-~~~-----~-~--~~~--------~~~~~~~~l~~~~v~~~i~~Ia~~ia~~p~~~~~ 57 (376) T protein:vir:78 1 MGFFSELFK------RNKEIE-WMW-----D-L--DFL--------EDKTTKVYLKKMALNTCVKHIARTIAKSDFRLKN 57 (376) T ss_pred Cchhhhhhc------cCCccc-ccc-----c-h--hhc--------cccchhhhhhhHHHHHHHHHHHHhhcccceeecc Confidence 999998543 322111 000 0 0 001 1133457889999999999999999999999986 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) +. ...+|+++++|+.+||++||+++||+.++.+++++||||+++.++..+.+...++..+..+.... ...+ T Consensus 58 ~~------~~~~~~l~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~~~~~~~~~~~~~~~~~~~~---~~~~ 128 (376) T protein:vir:78 58 GE------TSVRDKLYYKLNIRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTDDFLIADSYVRKEFAFFPDV---FEGV 128 (376) T ss_pred cc------ccccchHHHHHhhccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCCCeeeccceeecccceeeee---eeee Confidence 32 34689999999999999999999999999999999999999998887776666655544332111 1111 Q ss_pred EEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHH Q lcl|NC_010576. 161 MVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQ 240 (447) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~ 240 (447) .+..+ .....++.++|+|++.....+......+.......+..+.....+.++.++.++++.+..+++++. 
T Consensus 129 ~~~~~------~~~~~~~~~evih~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~--- 199 (376) T protein:vir:78 129 TVKDY------RYNRNFSMDDVIFLEYGNERLSAFTDGMFEDYGELFGKMIRAQMRNFQIRGAVNFKMAGVADKDKQ--- 199 (376) T ss_pred eeecc------eeeeeeccccEEEeccCCCCchhhhhHHHHHHHHHHHHHHHHHHhcCCCceeEEEccCCCCCHHHH--- Confidence 11111 112357789999999655554444333333333333333222222333333334555566665544 Q ss_pred HHHHHHHHHHHhcc---CCcceeecCCCceeeecCCChhh------hhHHHHHHHHHHHHHHhCCCHHHhcCCcH--HHH Q lcl|NC_010576. 241 AARRKQEIENEMAN---NKYGVATLDTQEKFVSAGMGLQN------NLLSDVRQLQQDFYNQMGITEAILNGTAN--EQQ 309 (447) Q Consensus 241 ~~~~~~~~~~~~~~---n~~~~~vl~~g~~~~~l~~~~~~------~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~--e~~ 309 (447) +++++.|.+.+++ ++++++++++|++|+++++++.+ ++++.+++++++||++|||||++|+|+++ |++ T Consensus 200 -~~~~~~~~~~~~g~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~~~~s~~e~~ 278 (376) T protein:vir:78 200 -TKLQEYIDKVYASFNNNEIAIVPQLEGFNYEEFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLLHGDMADLSNN 278 (376) T ss_pred -HHHHHHHHHHhccccccCcceEEcCCCceEEeeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCHHHH Confidence 4455555555443 55678899999999999987743 34688899999999999999999998765 899 Q ss_pred HHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Q lcl|NC_010576. 
310 TLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPN 389 (447) Q Consensus 310 ~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g 389 (447) .++|+++||.||+++||++||+|||++.+ ++++|+++.+++.|.+++++++.+++++||+|+||+|+++|+||++| T Consensus 279 ~~~f~~~~l~P~~~~ie~~l~~kll~~~~----~~~~~~~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~lg~~p~~~ 354 (376) T protein:vir:78 279 MKAYMEYCIDPLTKKLEDELNAKLFTFSE----FLAGEHIKIIHKKDIIENAEAVDKLVASGSFNRNEVRELLGAERVDN 354 (376) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhCCccc----ceecccchhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Confidence 99999999999999999999999999754 56788889999999999999999999999999999999999999999 Q ss_pred ccccc-cccccccchhhcccccCCC Q lcl|NC_010576. 390 PLANE-LFNRNIADGNQVGGINTPG 413 (447) Q Consensus 390 ~~~~~-~~~~~~~~~~~~~~~~~~~ 413 (447) |++|. ++++|+.+.... +++| T Consensus 355 g~~d~~~~~~n~~~~~~~---~e~g 376 (376) T protein:vir:78 355 PELDKYLITKNYQSADEG---GEDG 376 (376) T ss_pred CCCceeeeccCceehhcc---ccCC Confidence 99887 568888776433 2223 No 55 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=100.00 E-value=2e-76 Score=435.62 Aligned_cols=372 Identities=13% Similarity=0.186 Sum_probs=269.4 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||++| |+++.. ...... 
...+ ..+....|+++++|++||++||++||++||++|| T Consensus 1 Mg~f~~~------f~~~~~----~~~~~~-----~~~~--------~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~ 57 (385) T protein:vir:95 1 MGLFDSV------FKRHSE----LSWMYD-----LEFL--------QDKSKKAYLKQIALNTVVEMVARTISQSEFRVMK 57 (385) T ss_pred Cchhhhh------hccCcc----cccccc-----hhhh--------hccchhhhhhhHHHHHHHHHHHHHHcccceeeee Confidence 9999984 343321 111100 0011 1123467889999999999999999999999997 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) ++. ..+|++++||+.+||++||+++||+.++.+++++||||+++.++........+ ..+... ...+... T Consensus 58 ~~~------~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~~~~~~~~~~~~~-~~~~~~----~~~~~~~ 126 (385) T protein:vir:95 58 NNT------KEKGTLYYLLNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVKNDEGHFFVADDF-EKEDEL----GLYSHRF 126 (385) T ss_pred cCc------cccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCceEEEEecCCCeeecccc-cccccc----ccccccc Confidence 432 25799999999999999999999999999999999999998766543332221 111111 1111111 Q ss_pred EEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHH Q lcl|NC_010576. 161 MVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQ 240 (447) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~ 240 (447) ..... . .......++.++|+|++.+...+...+.+++..+...+....+.. +.++.++|+++++..... 
.+++ T Consensus 127 ~~~~~-~--~~~~~~~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~--~~~~~~~g~l~~~~~~~~--~~e~ 199 (385) T protein:vir:95 127 TNVLV-N--DFEFKRVFTMDDVIYLKYNNQKLDAFSLGLFEDYGEIFGRMIDLQ--MLNNQIRGILKVDATKFY--NKEK 199 (385) T ss_pred eeeee-c--ccceeeeeccccEEEecCCCCCcccccchHHHHHHHHHHHHHHHH--HhcCCCceEEEeCCccCC--CHHH Confidence 11111 1 111224578899999998765544445566666666666555443 345668899988753221 1334 Q ss_pred HHHHHHHHHHHhc---cCCcceeecCCCceeeecCC------Chhhhh-HHHHHHHHHHHHHHhCCCHHHhcCCc--HHH Q lcl|NC_010576. 241 AARRKQEIENEMA---NNKYGVATLDTQEKFVSAGM------GLQNNL-LSDVRQLQQDFYNQMGITEAILNGTA--NEQ 308 (447) Q Consensus 241 ~~~~~~~~~~~~~---~n~~~~~vl~~g~~~~~l~~------~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g~~--~e~ 308 (447) .+++++.|.+.++ +++++++++++|++|+++++ ++++++ ++.+++++++||++|||||++|+|++ .|+ T Consensus 200 ~~~~~~~~~~~~~g~~~~~~~i~~l~~g~~~~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~sn~e~ 279 (385) T protein:vir:95 200 QKELQAYIDTLFDAFQNNTIAVVPLTEGLAYEEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLVLGEMADLEK 279 (385) T ss_pred HHHHHHHHHHHhhhhhhcCCceEEcCCCceeEeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCcCHHH Confidence 4455555655544 35677999999999999985 445654 68899999999999999999998865 489 Q ss_pred HHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Q lcl|NC_010576. 309 QTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHP 388 (447) Q Consensus 309 ~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ 388 (447) +.++|+++||.||+++||++||++||++.++. 
+.+|+||++.|+++|.+++++++.+++++|+||+||+|+++|+||++ T Consensus 280 ~~~~~~~~~l~P~~~~ie~~l~~~L~~~~~~~-~~~~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~NE~R~~~g~~p~~ 358 (385) T protein:vir:95 280 TIESYLQFCINPLLRKIEAELNSKFFYQDEYL-NDDMHIKVVGIDKRDPLKLSEAIDKLVASGTFTRNQVRIMTGEEPAD 358 (385) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCChhhcc-cceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 99999999999999999999999999998875 45899999999999999999999999999999999999999999999 Q ss_pred Cccccc-cccccccchhhcccccCCCCCCCCC Q lcl|NC_010576. 389 NPLANE-LFNRNIADGNQVGGINTPGQITSDQ 419 (447) Q Consensus 389 g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 419 (447) +++||. |++.|+.+.... ++++.+++ T Consensus 359 ~~~gd~~~~~~n~~~~~~~-----kgge~~~e 385 (385) T protein:vir:95 359 DPELDKFIITKNLQSADAF-----KGGESNEE 385 (385) T ss_pred CCCCceeeecccceecccc-----cCCCCCCC Confidence 888887 567787765432 22222222 No 56 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=100.00 E-value=3.2e-76 Score=434.51 Aligned_cols=388 Identities=11% Similarity=0.109 Sum_probs=269.1 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||+|+ +|+++.... +..+..|+....+. 
....+.....+.++++||+||++||++||++|+++|| T Consensus 1 Mg~~~-------~f~~k~~~~-----~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~V~~~I~~ia~~iA~~p~~~~~ 65 (403) T protein:vir:80 1 MGLFN-------FFRRKTRSE-----PTNAISWFLTQEAY---DTLAIPGYTRLSDNPEVRMAVHKIAELISSMTIHLMQ 65 (403) T ss_pred Ccccc-------ccccccccc-----ccchhhhhcccccc---cccccchhhhhhhhHHHHHHHHHHHHhhhhCceEEEE Confidence 88865 455543221 11122222111111 1111112234567899999999999999999999999 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHh--cCCeeEEEeeccCCcccceeeeccCCCcceeeecCC Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLD--EGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPR 158 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll--~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (447) ..++|.. ..+|+++++|+.+||++||+++||+.+++++++ +||||+++.++..+.+..++++.+ ....+.....+ T Consensus 66 ~~~~g~~--~~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~g~~~~L~~l~p-~~v~~~~~~~g 142 (403) T protein:vir:80 66 NTDNGDI--RIKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTSGLIDELIPLAP-SKVSFVDTDTG 142 (403) T ss_pred ecCCcee--ecCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCCCcEEEEEEEcC-CeeEEEEcCCc Confidence 8776643 368999999999999999999999999999998 488999999887766544444444 33333222111 Q ss_pred ceEEEEeeecccccceeeecccccccccc-c-ccccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCCc Q lcl|NC_010576. 159 QVMVRVWNDNTGLEQDLLVSKENCIIIES-P-FYAILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPYS 231 (447) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~ 231 (447) +.+++. ...++.++|+|++. + ..+... +.+.+..+...+....+ ...+.||++++|||+++.. 
T Consensus 143 ---~~~~y~------~~~~~~~eiih~~~~~~~~~~~~-G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~ 212 (403) T protein:vir:80 143 ---YQIWYQ------GKAYNYDEVLHFIVNPDPEKPYM-GRGYRVVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDAA 212 (403) T ss_pred ---eEEEEe------ecccchhhEEEEeccCCCcCccc-cccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCC Confidence 122211 12466789999983 2 112111 12233334434333322 2345788999999999999 Q ss_pred CChHHHHHHHHHHHHHHHHHhccCCcceeecCCCc-eeeecC-CChhhhh-HHHHHHHHHHHHHHhCCCHHHhc-CCcHH Q lcl|NC_010576. 232 TKSTARAAQAARRKQEIENEMANNKYGVATLDTQE-KFVSAG-MGLQNNL-LSDVRQLQQDFYNQMGITEAILN-GTANE 307 (447) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~-~~~~l~-~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~-g~~~e 307 (447) +++++.++.++++.+.+.. ..++|++++++.+. +++++. +++++++ ++.+++++++||++|||||++|| ++.++ T Consensus 213 ~~~~~~~~~~~~~~~~~~~--~~~~g~~~~~~~~~~~~~~~~~l~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~ 290 (403) T protein:vir:80 213 TAELSSEEGRNAVFKKYLE--ASEAGQPWIIPAELLDVEQVKPLSLKDLAIHETVELDKRTVAGIFGVPAFLLGVGKYDK 290 (403) T ss_pred CChHHHHHHHHHHHHHHhh--hhhcCCeeeecccccccceeccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCccH Confidence 8887776666666555432 23678888886654 555554 5677654 68899999999999999999997 66778 Q ss_pred HHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Q lcl|NC_010576. 308 QQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPH 387 (447) Q Consensus 308 ~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~ 387 (447) ++..+|+++||.||+++||++|++|||++. 
+++++||++.|+++|.++|++++.+++++||||+||+|+++||||+ T Consensus 291 ~~~~~f~~~~l~P~~~~ie~~l~~kll~~~----~~~~~f~~~~ll~~d~~~~~~~~~~~~~~Gi~t~NE~R~~~gl~p~ 366 (403) T protein:vir:80 291 DEYNNFINSTILPIAKGIEQELTRKLLISP----DLYFKFNPRSLYAYDLKELAEVGSNMYVRGLMEGNEVRDWLGLSPK 366 (403) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCCC----CcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 888899999999999999999999999864 4789999999999999999999999999999999999999999999 Q ss_pred CCccccc-cccccccchhhcccccC-CCCCCCCCCCcCCCCCCCcc Q lcl|NC_010576. 388 PNPLANE-LFNRNIADGNQVGGINT-PGQITSDQPATASTDPLNNV 431 (447) Q Consensus 388 ~g~~~~~-~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 431 (447) +|+ |. +++.+++|+...+.... ++++.++.+ ++. + T Consensus 367 ~gg--d~~~~~~n~~pl~~~~~~~~~k~ge~~~~~-~~~------~ 403 (403) T protein:vir:80 367 EGL--SELVILENYIPLDKIGDQNKLKGGEKGGAD-GQT------D 403 (403) T ss_pred CCC--CeEeecccccchhhccchhhccCCCCCCCC-CCC------C Confidence 884 44 66888888765443321 111111111 111 1 No 57 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=100.00 E-value=2.3e-76 Score=435.28 Aligned_cols=368 Identities=11% Similarity=0.052 Sum_probs=263.3 Q ss_pred CchhHhhhhh------------cc---------cccCCc-cccccccccccccccccccccccccCCcccccchhhhhhH Q lcl|NC_010576. 1 MASSDRLLHS------------WN---------AFQSNQ-NQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKAD 58 (447) Q Consensus 1 Mg~~~~l~~~------------~~---------~f~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 58 (447) |||-|+|..+ ++ -|++.. ..+.....+..+..|.. 
............++...+++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~~~~~~~~~~~~~t~~~~~~~~ 79 (409) T protein:vir:83 1 MGFWSNLFGIPSIPDLPNDNGPVDYNPGDPDMVEFRGPEEEPEARALPWIRPTAWSG-YPESWATPSWGSAQDKLRTLID 79 (409) T ss_pred CchhhhhcccccCCCcccccccccccCCCCceeeccCCCcchhhhhccccccccccc-ccccccccCccccchhhHhhhH Confidence 9999998764 11 122222 22222233333333321 1111222223345567789999 Q ss_pred HHHHHHHHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEe-eccCCc Q lcl|NC_010576. 59 LIKSVITRIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPI-DTTVDP 137 (447) Q Consensus 59 ~v~~cv~~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~-~~~~~~ 137 (447) +||+||++||++||+|||++||... ..+.+.++|+.+||++||+++||+.++.+|++ ||+|+++. ++..+. T Consensus 80 ~v~acV~~Ia~~iA~lpl~~~~~~~-------~~~~~~~ll~~~PN~~~t~~~f~~~l~~~lll-Gnay~~~i~r~~~G~ 151 (409) T protein:vir:83 80 VAWACIDLNASVLSSMPIYRMRNGR-------IIDSVAWMSNPDPEVYTSWQEFAKQLFWDFQL-GEAFVLPMAHGSDGY 151 (409) T ss_pred HHHHHHHHHHHhhccCceEEeeCCc-------cccchhhhcccCCCCCCCHHHHHHHHHHHHhh-CCcEEEEEEECCCCc Confidence 9999999999999999999997421 23456678999999999999999999999988 99999865 555554 Q ss_pred ccceeeeccCCCcceeeecCCceEEEEeeecccccceeeecccccccccccc--cccccchhHHHHHHHHHHHHHH---- Q lcl|NC_010576. 138 DSGSFDINTARVGKIMQFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIESPF--YAILNDTNQTLRMLEQKIKLMN---- 211 (447) Q Consensus 138 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~---- 211 (447) +..++++.+ ....+.....+...+++.. ....++|+|++.+. .++. +.+.+..++..+.... 
T Consensus 152 ~~~L~pl~p-~~v~v~~~~~g~~~y~~~~---------~~~~~eiiHir~~~~~~~~~--G~spi~~~~~~i~~~~a~~~ 219 (409) T protein:vir:83 152 PIRFRVVPP-WLVNVELKKGARREYRIGG---------LNVTDEILHIRYQGNTADAH--GHGPLESAAPRQVVIGLLQK 219 (409) T ss_pred EEEEEEECC-cceEEEEcCCceEEEEEcc---------ccCccceEEeCCCCCCCCcc--cccHHHHHHHHHHHHHHHHH Confidence 444444444 3333433333332222211 12357899998542 2221 2233333344433322 Q ss_pred -HHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCcee-eecCCChhhhh-HHHHHHHHH Q lcl|NC_010576. 212 -SQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKF-VSAGMGLQNNL-LSDVRQLQQ 288 (447) Q Consensus 212 -~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~-~~l~~~~~~~~-l~~~~~~~~ 288 (447) +...+.||++|+|+|++++.+++++ +++++++|++.+.+|+|++++|++|+++ +++++++++++ ++.++++++ T Consensus 220 ~~~~~f~nga~p~gil~~~~~ls~e~----~~~~~~~~~~~~~~nag~~~il~~g~~~~~~~~~s~~d~q~le~r~~~~~ 295 (409) T protein:vir:83 220 YVQNLAETGGVPLYWLGVERRLSETE----AVDLMDRWIESRSKYAGHPALVTGGATLNQAKSMSAQDLSLMELTQFNEA 295 (409) T ss_pred HHHHHHhcCCCcceEeecCCCCCHHH----HHHHHHHHHHhhCCccCccceecCCcccccccCCCHHHHHHHHHHHhhHH Confidence 2333568999999999999887654 4556666666777899999999999997 56899998865 688999999 Q ss_pred HHHHHhCCCHHHhcC---------CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHH Q lcl|NC_010576. 289 DFYNQMGITEAILNG---------TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQ 359 (447) Q Consensus 289 ~Ia~~fgVP~~~l~g---------~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~ 359 (447) +||++|||||++||. 
++.||+.++|+++||.||+++||++|+++||++ +.+|+||++.|+|+|+++ T Consensus 296 eIa~~fgVPp~llg~~~~~~~~tysn~eq~~~~f~~~tL~P~~~~ie~~l~~~Ll~~-----~~~~~f~~~~llr~d~~~ 370 (409) T protein:vir:83 296 RIAILLGVPPFLVGLPGATGSLTYSNIEQLFSFHDRSSLRPKATAVMAALDRWALPS-----PQHLELNRDDYTRPSLVE 370 (409) T ss_pred HHHHHhCCCHHHccCCCCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCC-----CcEEEeehhhhhccCHHH Confidence 999999999999962 234999999999999999999999999999974 568999999999999999 Q ss_pred HHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccccccc Q lcl|NC_010576. 360 LATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFNRNI 400 (447) Q Consensus 360 ~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~~~~ 400 (447) |+++|++++++|+||+||+|+++||||++|+ |.+....+ T Consensus 371 r~~~~~~~~~~G~lT~NE~R~~~glpp~~gg--d~l~~~gv 409 (409) T protein:vir:83 371 RATAYKIMIEAGVMEPNEARAMERLHSEAAA--VRLSGGGV 409 (409) T ss_pred HHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC--cccCCCCC Confidence 9999999999999999999999999999985 44432222 No 58 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=100.00 E-value=6.1e-76 Score=432.95 Aligned_cols=408 Identities=11% Similarity=0.038 Sum_probs=266.1 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) |.. .+.....| +.|+.. .... 
.....|+++++||+||++||++||+|||++|+ T Consensus 1 ~~~----------------------~~~~~g~~--~~~~~~--~~~~-~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~ 53 (723) T protein:vir:94 1 MTT----------------------FPSGAGGW--NAWSAD--SVFG-NGAKGWSNSAVAYRCISMLANNAASVDLVVRG 53 (723) T ss_pred Ccc----------------------cccCCCcc--cccccc--cccc-ccHHHHhhhHHHHHHHHHHHHhhccceeEEEc Confidence 111 11111111 111111 1111 22357899999999999999999999999986 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCC--cccceeeeccCCCcceeeecCC Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVD--PDSGSFDINTARVGKIMQFFPR 158 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~ 158 (447) .+ +. ....|++++||+.+||++||+++||+.++.+|+++||+|+++.++... ..+..+++++.++..+...... T Consensus 54 ~~--~~--~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~ 129 (723) T protein:vir:94 54 PD--GE--LDELHPLSQLWNVMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAA 129 (723) T ss_pred CC--Cc--cchhhHHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCC Confidence 32 22 235799999999999999999999999999999999999999876532 2334556655554443322222 Q ss_pred ceE----EEEeeecccccceeeeccccccccccc--ccccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceee Q lcl|NC_010576. 159 QVM----VRVWNDNTGLEQDLLVSKENCIIIESP--FYAILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQ 227 (447) Q Consensus 159 ~~~----~~~~~~~~~~~~~~~~~~~~v~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~ 227 (447) ... ...|......+..+.++.++|||++.+ ..+.. 
+.+.+..+...+....+ ...|.||++|+|||+ T Consensus 130 ~~~~~~~~~~y~~~~~~G~~~~~~~~dIiHir~~~~~dg~~--G~Spi~~a~~~i~~~~aa~~~~~~~f~NG~~p~giL~ 207 (723) T protein:vir:94 130 DAVPQAQIIGYVIERTDGVRVPVLADEMLWLRFSDPYDPLA--VMAPWKAARAAVDADFYAATWQRQSFKNGARPGGVVN 207 (723) T ss_pred ccceeeeeeEEEEEecCceeEEecccceEEecCCCCCCCcc--cccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEE Confidence 111 111212222345567889999999854 23322 22333444444433322 233578999999999 Q ss_pred eCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecC----------CCceeeecCCChhhhh-HHHHHHHHHHHHHHhCC Q lcl|NC_010576. 228 FPYSTKSTARAAQAARRKQEIENEMANNKYGVATLD----------TQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGI 296 (447) Q Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~----------~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgV 296 (447) .+. +++++.++.+++|++.+. +..|+|+++||+ .|++|+++++++++++ ++.+++++++||++||| T Consensus 208 ~~~-l~~e~~~~~~~~~~~~~~--G~~Nagk~~vL~g~~~~~~vl~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgV 284 (723) T protein:vir:94 208 LGD-MDEQTFTKTVAAFRSQVE--GVQNAGRHLLIAGQGSDGGAAGKGATFTSLSMSPAEMDYINSRMHSAEEVMLAFGI 284 (723) T ss_pred cCC-CCHHHHHHHHHHHHHHhh--chhhcCcceeecccccccccccCCceEEEccCCHHHHHHHHHHHHhHHHHHHHhCC Confidence 874 666555544444444332 234788888885 6999999999998865 68899999999999999 Q ss_pred CHHHhcCC----cHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCC Q lcl|NC_010576. 297 TEAILNGT----ANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAI 372 (447) Q Consensus 297 P~~~l~g~----~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~ 372 (447) ||++|++. 
+.+++.++||++||.||+++||++||++||+..+ ...+++||...++++|.+++++++.+++++|+ T Consensus 285 Pp~~i~~~st~sN~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~g--~~~~~~f~~~~lLr~D~~~r~~~~~~~v~~G~ 362 (723) T protein:vir:94 285 RKDALLGGSTYENQAEAKAAVWTETLIPQMEVMASITDLQLLPDIG--WTVEWDFNSVPALQEDLEAQAGRNQGYLVNDV 362 (723) T ss_pred ChhHcCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHhHhhccccc--CceEEeecchhhhhcCHHHHHHHHHHHHhCCC Confidence 99999753 3488999999999999999999999999997532 34567788888999999999999999999999 Q ss_pred cCHHHHHHHhCCCCCCCccccccccc---cccchhhcccccCCCCCC--CCCCCcCCCCCCCcccccccCCccCcCCC-- Q lcl|NC_010576. 373 YTPNEIRELTGKAPHPNPLANELFNR---NIADGNQVGGINTPGQIT--SDQPATASTDPLNNVSTSAIENGSLTDGG-- 445 (447) Q Consensus 373 ~t~NE~R~~~gl~p~~g~~~~~~~~~---~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 445 (447) ||+||+|+++||||+|||+++.++.+ +.++..........+... .....-.++.|... ..+....-..++.| T Consensus 363 ~T~NE~R~~lglpPi~gGd~~~~~~p~~~~~a~~~~~~p~~~e~~~~~~~~~~~~~~~~p~~~-~~~~~~~~~~~~~~~~ 441 (723) T protein:vir:94 363 LMVDEVRATIGLDPLPGGIGQMTLTPYRAQFAPAPAPAPAVEEGAARMLALLERVAADRPLPE-LPVRATTVLHHDPGPD 441 (723) T ss_pred cCHHHHHHHhCCCCCCCCcccceeccccccccCCCCCCccchhhhHhhhhhccccccccCcCC-CCCCCCCCCCCCcccC Confidence 99999999999999999887765432 232222211111111100 11111122222111 11111111122333 Q ss_pred ----CC Q lcl|NC_010576. 446 ----SY 447 (447) Q Consensus 446 ----~~ 447 (447) +| T Consensus 442 ~~~~~~ 447 (723) T protein:vir:94 442 PQQTLY 447 (723) T ss_pred CchhHH Confidence 22 No 59 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=100.00 E-value=7.8e-76 Score=432.36 Aligned_cols=392 Identities=11% Similarity=0.076 Sum_probs=276.9 Q ss_pred CchhHhhhhh--cccccCCcccccccc----ccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccC Q lcl|NC_010576. 
1 MASSDRLLHS--WNAFQSNQNQNQNTN----DFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMV 74 (447) Q Consensus 1 Mg~~~~l~~~--~~~f~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~l 74 (447) |..++=+++. +++|+++....+... .+..+...... ..... .........++++++|++||++||++||++ T Consensus 1 ~~~~~~~~~~~~m~~F~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~--~~~~~~~~~~~~~~~v~~cI~~ia~~ia~~ 77 (413) T protein:vir:96 1 MPGVSEIRKDKNLKFFNNKRSPTEESKAKDEIPKAPQVVMTL-PNFFK--ELISDGYTKLSDSPEVRMAVDCIADLVSNM 77 (413) T ss_pred CCccchhhhhhcCCccccCCCcchhhhhhccccccccccccc-hhhHh--hhccchhHHHhhchHHHHHHHHHHHhhccC Confidence 7777666654 346655533221111 11111111100 00000 001111234778999999999999999999 Q ss_pred ceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceee Q lcl|NC_010576. 75 DFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQ 154 (447) Q Consensus 75 p~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (447) ||++||.+.++.. ..+|+++++|+.+||++||+++||+.++.+++++||||+++.++..+.....+++.+.....+.. T Consensus 78 ~~~~~~~~~~~~~--~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~~~~~L~~l~~~~v~~~~ 155 (413) T protein:vir:96 78 TIQLMQNGETGDK--RIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGDKIIGLTPISPYKVTFNV 155 (413) T ss_pred ceEEEEecCCCcc--ccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCceEEEEEecCceeEEEE Confidence 9999998776653 46899999999999999999999999999999999999999998877655556665554444433 Q ss_pred ecCCceEEEEeeecccccceeeecccccccccc-ccc-ccccchhHHHHHHHHHHHHHH-----HHHHHhhcCcccceee Q lcl|NC_010576. 155 FFPRQVMVRVWNDNTGLEQDLLVSKENCIIIES-PFY-AILNDTNQTLRMLEQKIKLMN-----SQDNRASSGKLNGFIQ 227 (447) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~-~~~-~~~~~~~~~~~~~~~~~~~~~-----~~~~~~n~~~~~gvl~ 227 (447) . .+.+.+.+... . ..+++++|+|++. +.. +.. .+.+.+..+...+.... 
+...+.||+.|+|+|+ T Consensus 156 ~-~~~~~y~~~~~----~--~~~~~~evih~k~~~~~~~~~-~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~ 227 (413) T protein:vir:96 156 S-DDDLDYSITFD----N--KEYDPSTLLHFVLNPSIERPF-IGTGYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVS 227 (413) T ss_pred c-CCeEEEEEeec----C--cEEchhhEEEEeccCCCCCcc-ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEE Confidence 2 33333333221 1 2467899999984 221 111 12223333333333322 2334578999999999 Q ss_pred eCCcCChHHHHHHHHHHHHHHHHHhcc--CCcceeecCCCc-eeeecC-CChhhhh-HHHHHHHHHHHHHHhCCCHHHhc Q lcl|NC_010576. 228 FPYSTKSTARAAQAARRKQEIENEMAN--NKYGVATLDTQE-KFVSAG-MGLQNNL-LSDVRQLQQDFYNQMGITEAILN 302 (447) Q Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--n~~~~~vl~~g~-~~~~l~-~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~ 302 (447) +++.+++++. ++++++|++.+.+ |+|++++++.|. +++++. +++++++ ++.+++++++||++|||||++|| T Consensus 228 ~~~~l~~e~~----~~~~~~~~~~~~g~~n~g~~~vl~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg 303 (413) T protein:vir:96 228 VDSDSDELSD----EEGRENFEEMYLKRKEAGKPWIIPEGMVNVQQIKPLTLNDLAINDAVTLDKKTVAGIFGVPAFLLG 303 (413) T ss_pred eCCCCCHHHH----HHHHHHHHHHhcCccccCceeeecCCcccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcC Confidence 9998876654 4555566555544 688999987665 556664 6777754 68899999999999999999997 Q ss_pred -CCcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHH Q lcl|NC_010576. 
303 -GTANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIREL 381 (447) Q Consensus 303 -g~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~ 381 (447) +++++++...|+++||.||+++||++||++||++ +++|+||++.++++|.+++++++++++++|+||+||+|++ T Consensus 304 ~~~~~~~~~~~~~~~~l~P~~~~ie~~ln~~ll~~-----~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~ 378 (413) T protein:vir:96 304 VGTYNKDEFNNFINTKIMSIAQVIQQTYNKLIVEE-----DMYFSLNPRSLYNYSLTEMVSAGAQMTQLNALRRNEFRNW 378 (413) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhCCC-----CcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 6778999999999999999999999999999874 7899999999999999999999999999999999999999 Q ss_pred hCCCCCCCccccc-cccccccchhhcccccC-CCCCC Q lcl|NC_010576. 382 TGKAPHPNPLANE-LFNRNIADGNQVGGINT-PGQIT 416 (447) Q Consensus 382 ~gl~p~~g~~~~~-~~~~~~~~~~~~~~~~~-~~~~~ 416 (447) +|+||+|| +|. +++.|+.++...++... +++++ T Consensus 379 ~g~~p~~~--gd~~~~~~n~~~~~~~~~~~~~~~~dt 413 (413) T protein:vir:96 379 VGMPPDAE--MDDLLVLENYLQQKDLVNQKKLIQDET 413 (413) T ss_pred hCCCCCCC--cceeeecccccchhhcccccCCCCCCC Confidence 99999988 455 56888877655433211 11111 No 60 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=100.00 E-value=1.3e-75 Score=431.18 Aligned_cols=380 Identities=11% Similarity=0.116 Sum_probs=262.3 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||||+++.+ ++..+. ..++.. .+++....++.. 
++...++++++|++||++||++||+|||++|+ T Consensus 1 MGl~~~~~~~~--~~~~~~-----~~~~~~-----~~~~~~~~~~~~-vt~~~al~~~~v~~~i~~Ia~~iA~lp~~v~~ 67 (394) T protein:vir:62 1 MGLRDRFSNYL--FKKAEK-----RGYLDN-----VLGKSIRYSGVY-VTDSNILQSSDVYELLQDISNQMVLADIVVED 67 (394) T ss_pred Cchhhhhhhhc--cCCCCc-----hhhhhh-----hhhcccccCccc-cChhhhhccHHHHHHHHHHHHhhcccceEEEc Confidence 99999976532 222111 111111 112222223333 44567889999999999999999999999987 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) ++ |. ++.+|+++.|| .+||++||+++||+.++.+++++||+|+++.++..+.... +....+... T Consensus 68 ~~--g~--~~~~~~~~~Ll-~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~~~~~~~-----------~~~~~~~~~ 131 (394) T protein:vir:62 68 EF--GN--EIKDDIALQIL-RNPNNYLTQSEFIKLMTNTYLLEGETFPILNGAQIHLASN-----------VFTELDDNL 131 (394) T ss_pred CC--Cc--ccchhhHHHHh-ccCCCCCCHHHHHHHHHHHHHhcCCeEEEEecceeecccc-----------ceEEECCce Confidence 43 32 35688888777 5899999999999999999999999999987665443221 111122222 Q ss_pred EEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCCcCChH Q lcl|NC_010576. 161 MVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPYSTKST 235 (447) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~~~~~ 235 (447) .+ .+.. ....++.++|+|+|.+..+... 
+.+.+..+...+....+ ...+.||++++|+|+++..+.++ T Consensus 132 ~~-~~~~-----~~~~~~~~eiih~r~~~~d~~~-G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~ 204 (394) T protein:vir:62 132 VE-HFNI-----GGHEIPPCMIRHVKNIGADHLR-GKGILDLGRDTLEGVMSAEKTLTDKYKKGGLLTFLLNLDAHINPQ 204 (394) T ss_pred EE-EEee-----CCEEechhheEEecCcCCCCcc-ccChHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCCCCcC Confidence 11 1111 1135778999999965433321 12333444444433222 23357899999999999887654 Q ss_pred HHHHHHHHHHHHHHHHhc--cCCcceeecCCCc--eeeecCCChhhh-hHHHHHHHHHHHHHHhCCCHHHhcC---CcHH Q lcl|NC_010576. 236 ARAAQAARRKQEIENEMA--NNKYGVATLDTQE--KFVSAGMGLQNN-LLSDVRQLQQDFYNQMGITEAILNG---TANE 307 (447) Q Consensus 236 ~~~~~~~~~~~~~~~~~~--~n~~~~~vl~~g~--~~~~l~~~~~~~-~l~~~~~~~~~Ia~~fgVP~~~l~g---~~~e 307 (447) + ++++++++.|.+.++ .|+|+++|++.|. ++++++.++.++ +++.+++++++||++|||||++|++ ++.| T Consensus 205 ~--~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~sn~e 282 (394) T protein:vir:62 205 N--GAQSKLINAILDQLESIDEARSVKMIPLGKGYSIDTLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTELIKEDIE 282 (394) T ss_pred H--HHHHHHHHHHHHHhccccccCceeEeeCCCceeEEecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcCHH Confidence 2 334556666666655 4788999988777 555788888775 5688999999999999999999976 4468 Q ss_pred HHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Q lcl|NC_010576. 308 QQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPH 387 (447) Q Consensus 308 ~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~ 387 (447) ++.++|+++||.||+++||++|+++||++.++ .+++|+||...+++. 
.++++++.+++++|+||+||+|+++|+||+ T Consensus 283 ~~~~~~~~~~l~P~~~~ie~~l~~kll~~~~~-~~~~~~fd~~~~~~~--~~~~~~~~~~~~~g~~T~NE~R~~~gl~p~ 359 (394) T protein:vir:62 283 KAMMYIHNKAVRPIMKNFEDHLSLLFYAQNSG-KRIKFKINILDFVTY--SNKTNIGYNLVRTAITSPDNVADMLGFPKQ 359 (394) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhcCcccc-CceEEEechhhhcCH--HHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 99999999999999999999999999998764 356777777777655 578899999999999999999999999999 Q ss_pred CCccccc-cccccccchhhcccccCCCCCCCCCCCcCCCCC Q lcl|NC_010576. 388 PNPLANE-LFNRNIADGNQVGGINTPGQITSDQPATASTDP 427 (447) Q Consensus 388 ~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 427 (447) ++++++. |++.++.+........ +..+.++.+.+ T Consensus 360 ~~~~gd~~~~~~n~~~~~~~~~~~------~~~kgge~~en 394 (394) T protein:vir:62 360 NTKESQAIYISNDVTEIGKKEATD------GSLGGGEENEN 394 (394) T ss_pred CCCCCCeeeccccccccccccccc------ccCCCCCCCCC Confidence 9888876 5677776554321111 11111111111 No 61 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=100.00 E-value=1.1e-74 Score=426.01 Aligned_cols=413 Identities=11% Similarity=0.015 Sum_probs=266.2 Q ss_pred CchhHhhhhhcccccCCccc-ccccccccccccc----------cccc-ccc-c--ccCCcccccchhhhhhHHHHHHHH Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQ-NQNTNDFLTPSNG----------MTSF-GGY-Y--GRGQSNYSRSYSYNKADLIKSVIT 65 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~-~~~~~~~~~~~~~----------~~~~-~~~-~--~~~~~~~~~~~~~~~~~~v~~cv~ 65 (447) ||+||||++....-.+.... .....+.....++ ...+ +|. . 
+......++..+|+++++|++||+ T Consensus 1 M~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~g~~v~~~~a~~~~~v~~~i~ 80 (466) T protein:vir:81 1 MRLIDRLLSTRGAAPRMSIDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAGPSTELAPDTFVGLATQAYQANGPVFACML 80 (466) T ss_pred CchhHHHhhccCcccccchhhhhhhhhhhhccccccccccccHHHHHhhccccccccCccccccchhhhhccHHHHHHHH Confidence 99999998755431111111 1111111000000 0000 111 1 111122244667999999999999 Q ss_pred HHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcc------- Q lcl|NC_010576. 66 RIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPD------- 138 (447) Q Consensus 66 ~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~------- 138 (447) +||++||+|||++||.+ +++..+..+|+++.|| .+||++||+++||+.++.+++++||||+++.++..+.. T Consensus 81 ~Ia~~ia~lp~~~~~~~-~~~~~~~~~~~~~~L~-~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~g~l~~~~~g~ 158 (466) T protein:vir:81 81 VRQLVFSSVRFRWQRLR-DGKPSDTFGSRDLQIL-ETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGEFVRMRPDWVDV 158 (466) T ss_pred HHHHhhccCceEEEEec-CCceeeccccHHHHHh-hCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCccccccccCcc Confidence 99999999999999875 4555667788887766 59999999999999999999999999999999775432 Q ss_pred cceeeeccCCCcceeeecCCceEEEEeeecc---cccceeeeccccccccccc---ccccccchhHHHHHHHHHHHHHHH Q lcl|NC_010576. 139 SGSFDINTARVGKIMQFFPRQVMVRVWNDNT---GLEQDLLVSKENCIIIESP---FYAILNDTNQTLRMLEQKIKLMNS 212 (447) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~v~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~ 212 (447) ...+++++.....+.....+...+.+.+... .......++.+||+|++.. 
..+..+ .+.+..+...+....+ T Consensus 159 ~~~l~~l~~~~v~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~dviHir~~~~~~d~~~G--~s~i~~~~~~i~~~~a 236 (466) T protein:vir:81 159 VVEERMVRGGRGELGGGQLGWRKVGYLYTEGGRQSGNESVGFLAEDVVHFAPIPDPLASYRG--MSWLTPILREIRADQA 236 (466) T ss_pred eeEEEEecCcceEEEEcCCCceEEEEEEEecCcccccceeeeccccEEEEcCCCCccccccc--ccHHHHHHHHHHHHHH Confidence 2334444444434433332222222211111 1223456889999999853 222211 1233333333333322 Q ss_pred -----HHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhc--cCCcceeecCCCceeeecCCChhhhh-HHHHH Q lcl|NC_010576. 213 -----QDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMA--NNKYGVATLDTQEKFVSAGMGLQNNL-LSDVR 284 (447) Q Consensus 213 -----~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~--~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~ 284 (447) ...+.||+.++|||++++.+++++. +++++.|.+.+. +|+|+++||++|++|+++++++++++ ++.++ T Consensus 237 ~~~~~~~~f~ng~~p~gil~~~~~l~~e~~----~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~ 312 (466) T protein:vir:81 237 MSKHQAKFFDNGATVNLVIKHNPMADPAAV----KKWADEVNSKHAGVDNAWKNLNLYPGADADVVGSNLQEIDFKNVRG 312 (466) T ss_pred HHHHHHHHHhcCCCcceEEecCCCCCHHHH----HHHHHHHHHHhcCccccccceEcCCCceEEEccCChhHHHHHHHHH Confidence 2335789999999999998886654 455555555554 47899999999999999999998865 78899 Q ss_pred HHHHHHHHHhCCCHHHhcC---------CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhc Q lcl|NC_010576. 285 QLQQDFYNQMGITEAILNG---------TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLV 355 (447) Q Consensus 285 ~~~~~Ia~~fgVP~~~l~g---------~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~ 355 (447) +++++||++|||||++||. 
++.||+.++||++||.||+++||++|+++|++..++ .+++++||.++|+|+ T Consensus 313 ~~~~~Ia~~fgVPp~~lG~~~~~~~st~sn~eq~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~-~~~~~~f~~~~llr~ 391 (466) T protein:vir:81 313 GGETRIAAAAGVPPVIVGLSEGLAAATYSNYGQARRRLADGTAHPLWQNLSGCIGHVMPDMGPD-VRLWYDADDVPFLRE 391 (466) T ss_pred HHHHHHHHHhCCCHHHcccccCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccC-cceEEEecchhhhcc Confidence 9999999999999999962 134899999999999999999999999999987664 357899999999999 Q ss_pred CHHHHHHH-------HHHHHhCCCcCHHHHHHHhCCCCCCCcccccccc-ccccchhhcccccCCCCCCCCCCCcCCCCC Q lcl|NC_010576. 356 PVEQLATV-------ADVLTRNAIYTPNEIRELTGKAPHPNPLANELFN-RNIADGNQVGGINTPGQITSDQPATASTDP 427 (447) Q Consensus 356 d~~~~~~~-------~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 427 (447) |.++|+++ +..++++|+ |+||+|+.+ +++ ++.+.+ .++.+..........+...+......++++ T Consensus 392 d~~~r~~~~~~~~~~~~~~~~~g~-t~nE~r~~~-----~~g-d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Gg~~n 464 (466) T protein:vir:81 392 DEKDAADIQKVRAETINTLITAGY-EPESVVAAV-----NSG-DLRLLKHTGLTSVQLLPPGVSASASSDTPTSGGADDN 464 (466) T ss_pred CHHHHHHHHHHHHHHHHHHHHcCC-Chhhccccc-----cCC-ccccccCCCcchhhhcccccccccCCCCcccCCCCcC Confidence 99998876 677889995 999999643 332 122222 222222111100000011000000111111 Q ss_pred CC Q lcl|NC_010576. 428 LN 429 (447) Q Consensus 428 ~~ 429 (447) .| T Consensus 465 gn 466 (466) T protein:vir:81 465 GN 466 (466) T ss_pred CC Confidence 11 No 62 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=100.00 E-value=6e-75 Score=427.50 Aligned_cols=390 Identities=11% Similarity=0.068 Sum_probs=271.2 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 
1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||+++++ ++++.. ........ ..+++.... ....+....++++++|++||++||++||++||++|| T Consensus 1 Mg~f~~~~~----~~~~~~---~~~~~~~~----~~~~~~~~~-~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~ 68 (406) T protein:vir:95 1 MGLFDRWRR----TKRKSK---IRADTGYV----GLFMSGEDV-SFLVPGYVRLSDNPEVRMAVHKIADLISSMTIYLMQ 68 (406) T ss_pred Ccchhhhcc----cccccc---ccccchhh----hhhccCccc-CccccCHHHHhhcHHHHHHHHHHHHhhccCceEEEE Confidence 999998543 222211 11111111 111111111 122233456789999999999999999999999999 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCC--eeEEEeeccCCcccceeeeccCCCcceeeecCC Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQ--IAMVPIDTTVDPDSGSFDINTARVGKIMQFFPR 158 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn--a~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (447) .++++.. ..+|+++++|+.+||++||+++||+.++.+++++|+ +|+++.++..+.+...+++ ++....+..... T Consensus 69 ~~~~~~~--~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~g~~~~l~~i-~~~~v~~~~~~~- 144 (406) T protein:vir:95 69 NTEDGDI--RIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTADGLIDELVPL-TPSKVNFLDTPD- 144 (406) T ss_pred ecCCcce--eecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCCCcEEEEEEE-cCceeEEEEcCC- Confidence 8876643 468999999999999999999999999999999865 5666777766655444444 443333332211 Q ss_pred ceEEEEeeecccccceeeeccccccccccc---ccccccchhHHHHHHHHHHHHHH-----HHHHHhhcCcccceeeeCC Q lcl|NC_010576. 159 QVMVRVWNDNTGLEQDLLVSKENCIIIESP---FYAILNDTNQTLRMLEQKIKLMN-----SQDNRASSGKLNGFIQFPY 230 (447) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~n~~~~~gvl~~~~ 230 (447) .+.+.. .. ..++.++|+|++.. ..+.. +.+.+..+...+.... +...+.||+.++|+|+++. 
T Consensus 145 --~~~~~~----~~--~~~~~~evih~~~~~~~~~~~~--G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~ 214 (406) T protein:vir:95 145 --GYQVLY----GG--QTFNYDEVLHFIYNPDPERPYI--GRGYRVVLKDIADNLKQATATKKSFMSGKYMPSLIVKVDA 214 (406) T ss_pred --eEEEEe----cc--EEEchhHEEEeeccCCCCCCcc--ccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC Confidence 122211 11 24778999999842 22111 1233333333333332 2334578999999999999 Q ss_pred cCChHHHHHHHHHHHHHHHHHhccCCcceeecCC-CceeeecC-CChhhhh-HHHHHHHHHHHHHHhCCCHHHhc-CCcH Q lcl|NC_010576. 231 STKSTARAAQAARRKQEIENEMANNKYGVATLDT-QEKFVSAG-MGLQNNL-LSDVRQLQQDFYNQMGITEAILN-GTAN 306 (447) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~-g~~~~~l~-~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~-g~~~ 306 (447) .+++++.++.+++|.+.+. ...|+++++|++. |.+++++. +++++++ ++.+++++++||++|||||++|| ++++ T Consensus 215 ~l~~e~~~~~~~~~~~~~~--g~~n~~~~~v~~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~ 292 (406) T protein:vir:95 215 ATAELSSEEGRNAVFKKYL--QATEAGQPWIIPAELLEVEQVKPLSLKDIAINEAVELDKRTVAGMFGVPAFLLGIGEFN 292 (406) T ss_pred CCCHHHHHHHHHHHHHHhc--cccccCCceeecCCCccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCch Confidence 9888766655555544432 1247788877764 55777764 6887755 68899999999999999999997 6778 Q ss_pred HHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Q lcl|NC_010576. 
307 EQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAP 386 (447) Q Consensus 307 e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p 386 (447) +++..+||++||.||+++||++|+++||++ .+++|+||++.|+++|.+++++.+.+++++|+||+||+|+++|+|| T Consensus 293 ~~~~~~~~~~~l~P~~~~ie~~l~~~l~~~----~~~~~~fd~~~l~~~d~~~~~~~~~~l~~~G~~t~NE~R~~~gl~p 368 (406) T protein:vir:95 293 RDEYNNFINSTILPIAKGIEQELTRKLLIS----PDLYFKFNPRSLYAYDLKELAEVGSNMYVRGIMEGNEVRDWLGLSP 368 (406) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCC----CCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Confidence 999999999999999999999999999985 3579999999999999999999999999999999999999999999 Q ss_pred CCCccccc-cccccccchhhcccccC-CCCCCCCCCCcCCCCCCCcccc Q lcl|NC_010576. 387 HPNPLANE-LFNRNIADGNQVGGINT-PGQITSDQPATASTDPLNNVST 433 (447) Q Consensus 387 ~~g~~~~~-~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 433 (447) +|| +|. +++.++.+....++... ++++.+++ ++++. T Consensus 369 ~~~--gd~~~~~~n~~~~~~~~~~~~~k~g~~~~~---------~~~~~ 406 (406) T protein:vir:95 369 KEG--LSELVILENYIPLDKIGDQSKLKGGDNSGA---------DGQTD 406 (406) T ss_pred CCC--cceeeeccCccchhhcccccccCCCCCCCC---------CCCCC Confidence 988 455 56888887765543221 11111111 11111 No 63 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=100.00 E-value=7.6e-75 Score=426.95 Aligned_cols=381 Identities=14% Similarity=0.117 Sum_probs=265.2 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) |||+|+|...++.-++.. ....+.+... 
.....++...++++++|++||++||++||+|||++++ T Consensus 1 mg~~~~~~~~~~~~~~~~-------~~~~~~~~~~--------~~~~~~t~~~~~~~~~v~~cv~~Ia~~ia~~p~~v~~ 65 (403) T protein:vir:10 1 MGFKSWITEKLNPGQRII-------RDMEPVSHRT--------NRKPFTTGQAYSKIEILNRTANMVIDSAAECSYTVGD 65 (403) T ss_pred Ccchhhhhhccchhhhhh-------hccccccccc--------CCcccccHHHHHHHHHHHHHHHHHHHHHhhCceeEee Confidence 999999876555432211 1111111111 1111233467889999999999999999999999987 Q ss_pred EcCCCc-eeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCc Q lcl|NC_010576. 81 IDPISG-NQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQ 159 (447) Q Consensus 81 ~~~~~~-~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (447) +...+. ...+..|++++||+.+||++||+++||+.++.+++++||||+++... .+++.+.....+...... T Consensus 66 ~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~~~-------~l~~l~~~~~~v~~~~~~- 137 (403) T protein:vir:10 66 KYNIVTYANGVKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYWDGT-------SLYHVPAALMQVEADANK- 137 (403) T ss_pred cccccccccccccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEeCc-------eeEeecCcceEEEEcCCc- Confidence 654332 22346899999999999999999999999999999999999987543 234444444333332222 Q ss_pred eEEEEeeecccccceeeeccccccccccccc-----ccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeC Q lcl|NC_010576. 160 VMVRVWNDNTGLEQDLLVSKENCIIIESPFY-----AILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFP 229 (447) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~ 229 (447) ..+.++.. ....+..++|+|++.... +... 
+.+.+..+...+....+ ...+.||+.++|||+.+ T Consensus 138 ~~~~~~~~-----~~~~~~~~eiih~~~~~~~~~~~~~~~-G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~ 211 (403) T protein:vir:10 138 FIKKFIFN-----NQINYRVDEIIFIKDNSYVCGTNSQIS-GQSRVATVIDSLEKRSKMLNFKEKFLDNGTVIGLILETD 211 (403) T ss_pred eEEEEEec-----CceeecccceEEecccccccCCCCCcc-cccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeC Confidence 21211111 123456788999984321 1111 11233333333333222 23357899999999999 Q ss_pred CcCChHHHHHHHHHHHHHHHHHhc--cCCcceeecCCCceeeecCC--Chhhh-hHHHHHHHHHHHHHHhCCCHHHhcC- Q lcl|NC_010576. 230 YSTKSTARAAQAARRKQEIENEMA--NNKYGVATLDTQEKFVSAGM--GLQNN-LLSDVRQLQQDFYNQMGITEAILNG- 303 (447) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~--~n~~~~~vl~~g~~~~~l~~--~~~~~-~l~~~~~~~~~Ia~~fgVP~~~l~g- 303 (447) +.+++++.+ ++++.|.+.++ .|+|++++|++|++|+++++ ++.++ +++.+++++++||++|||||++|++ T Consensus 212 ~~l~~e~~~----~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~ 287 (403) T protein:vir:10 212 EILNKKLRE----RKQEELQLDYNPSTGQSSVLILDGGMKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDGG 287 (403) T ss_pred CCCCHHHHH----HHHHHHHHHhCCcccCcceeecCCCceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCC Confidence 998876554 45555555554 47899999999999999986 45564 5788999999999999999999973 Q ss_pred --CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchh--hhcCHHHHHHHHHHHHhCCCcCHHHHH Q lcl|NC_010576. 
304 --TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPF--KLVPVEQLATVADVLTRNAIYTPNEIR 379 (447) Q Consensus 304 --~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l--~~~d~~~~~~~~~~~~~~G~~t~NE~R 379 (447) ++.|++.+.|+++||.||+++||++|+++| +++++||++.+ ++.|.+++++++.+++++|+||+||+| T Consensus 288 ~~sn~e~~~~~f~~~tl~P~~~~ie~~l~~~L--------~~~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G~lT~NE~R 359 (403) T protein:vir:10 288 NNANIRPNIELFYYMTIIPMLNKLTSSLTFFF--------GYKITPNTKEVAALTPDKEAEAKHLTSLVNNGIITGNEAR 359 (403) T ss_pred CCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhc--------CceeeeccchhhhcccCHHHHHHHHHHHHhCCCcCHHHHH Confidence 345899999999999999999999999988 46788998866 899999999999999999999999999 Q ss_pred HHhCCCCCCCccccc-cccccccchhhcccccCCCCCCCCCCCcCCC Q lcl|NC_010576. 380 ELTGKAPHPNPLANE-LFNRNIADGNQVGGINTPGQITSDQPATAST 425 (447) Q Consensus 380 ~~~gl~p~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 425 (447) +++|+||++++++|. |++.|+.......+.++.++ .+.+.+++ T Consensus 360 ~~~gl~pi~~~~~d~~~~p~n~~~~~~~~~~~e~~~---~~~~~~g~ 403 (403) T protein:vir:10 360 SELNLEPLDDEQMNKIRIPANVAGSATGVSGQEGGR---PKGSTEGD 403 (403) T ss_pred HHhCCCCCCcccccccccccccccccccCCCCcCCC---CCCCcCCC Confidence 999999999888887 45666653322111111111 11111111 No 64 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=100.00 E-value=4.6e-72 Score=411.66 Aligned_cols=436 Identities=8% Similarity=-0.030 Sum_probs=268.2 Q ss_pred CchhHh-------------hh--hhcccccCCcccccccccccccccc-ccccccccc-cCCcccccchhhhhhHHHHHH Q lcl|NC_010576. 1 MASSDR-------------LL--HSWNAFQSNQNQNQNTNDFLTPSNG-MTSFGGYYG-RGQSNYSRSYSYNKADLIKSV 63 (447) Q Consensus 1 Mg~~~~-------------l~--~~~~~f~~~~~~~~~~~~~~~~~~~-~~~~~~~~~-~~~~~~~~~~~~~~~~~v~~c 63 (447) ..|.+- |+ +....|++++++.....-..++... +.+...... ..... 
+....++++++|++| T Consensus 52 ~~~~~~~~~~~~~~~~~~~~kk~~i~~pfkkk~~~~~~d~f~~s~es~s~vtsls~pdaf~~vn-Vs~~~AlknsaV~sc 130 (945) T protein:vir:10 52 LAWNSTVVYSIIIFRKNQVLKKEKIIVPYNHQEPPFKFNLFEYSPESLMYLPSISDPDAFFLIN-LFRKYRFNNDSKLIK 130 (945) T ss_pred hhccceeeeeeeeehhhhHHHhhcccccccccccchhhhhhhccCccceecccccCccceeeeh-hhhhhhhccHHHHHH Confidence 111111 11 2333444443321111000000000 000000000 00011 234567889999999 Q ss_pred HHHHHHhhccCceEEEEEcCCCc-----eeccccchHHHHHhhhcCcccCHHHHH----HHHHHHHHhcCCeeEEEeecc Q lcl|NC_010576. 64 ITRIALDASMVDFKHLKIDPISG-----NQTPMPSGLINVLTRSANIDQTGRSFV----FDLLYSLLDEGQIAMVPIDTT 134 (447) Q Consensus 64 v~~ia~~ia~lp~~~~r~~~~~~-----~~~~~~~~l~~lL~~~PN~~~t~~~f~----~~~~~~lll~Gna~i~~~~~~ 134 (447) |++||++||++||++||+.++|. .+...+|+++.||+ +||++||+++|| +.++.+++++||+|+++.++. T Consensus 131 I~~IA~sIAsLPlklYrr~edG~~~~~~kk~~~~hpL~~LL~-rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiRd~ 209 (945) T protein:vir:10 131 VSEIPKKLTSKELEIYKHIEDKHVNYYLKRIRDARNILEFLE-RPDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIRDE 209 (945) T ss_pred HHHHHhhhccCceEEEEecccCcccccccccccchHHHHHHh-CCCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEECC Confidence 99999999999999999877764 23446899999997 999999999854 567799999999999999987 Q ss_pred CCcccceeeeccCCCcceeeecCCceEEEEeeecccccceeeecccccc-ccccccccccc--chhHHHHHHHHHHHHHH Q lcl|NC_010576. 135 VDPDSGSFDINTARVGKIMQFFPRQVMVRVWNDNTGLEQDLLVSKENCI-IIESPFYAILN--DTNQTLRMLEQKIKLMN 211 (447) Q Consensus 135 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~-~~~~~~~~~~~--~~~~~~~~~~~~~~~~~ 211 (447) .+.+..++++.+ ....+.....+...+. |....+......+++.|++ |++.+...+.. .+.+.+..+..++.... 
T Consensus 210 ~G~ii~L~pLdP-s~Vti~~ddDG~~~y~-Yv~~idG~~~~~v~a~DvIlhirn~s~DG~~~GyGlSPIeaa~~aI~~al 287 (945) T protein:vir:10 210 QGNLVAITPVDG-TTIKPILSEDTGIVVG-YVQEVDGAIVAHFDKRDVVLFRQNLTPDVYMYGYSLPPIEILYKVILSDI 287 (945) T ss_pred CCcEEEEEEECC-cceEEEEcCCCcEEEE-EEEecCCceEEEecCCceEEEeccCCCCcccccCCchHHHHHHHHHHHHH Confidence 776554444444 4334433333333222 2222222333455666755 56665433221 12233444444433332 Q ss_pred HH----HH-H-hhcCcccceeeeCCcCC------hHHHHHHHHHHHHHHHHHhcc-CCcceeecCCCceeeecCCChhhh Q lcl|NC_010576. 212 SQ----DN-R-ASSGKLNGFIQFPYSTK------STARAAQAARRKQEIENEMAN-NKYGVATLDTQEKFVSAGMGLQNN 278 (447) Q Consensus 212 ~~----~~-~-~n~~~~~gvl~~~~~~~------~~~~~~~~~~~~~~~~~~~~~-n~~~~~vl~~g~~~~~l~~~~~~~ 278 (447) ++ .. + .||++|+|+|+++.... ....+++++++++.|.+.+++ ++++++++++|++|+++++++.++ T Consensus 288 Aaek~aar~FskNGa~PsGILsvkg~~~~d~k~~~~LseEq~erlKe~wee~~sG~NnG~piVLdeGmef~pLs~s~~Da 367 (945) T protein:vir:10 288 FIDKGNLDYYRKGGSIPEGILAIEPPSYKEGDIYPQLSREQLESIQRQLQAIMMGDYTQVPILSGGKFTWIDFKGKRRDM 367 (945) T ss_pred HHHHHHHHHHHhCCCccceEEEecCccccccccccccCHHHHHHHHHHHHHHhCCcccccceecCCCceEEEccCChhHH Confidence 22 22 3 36789999998764422 112355667788888877665 667788999999999999999886 Q ss_pred h-HHHHHHHHHHHHHHhCCCHHHhc------CCcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecch Q lcl|NC_010576. 279 L-LSDVRQLQQDFYNQMGITEAILN------GTANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNP 351 (447) Q Consensus 279 ~-l~~~~~~~~~Ia~~fgVP~~~l~------g~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~ 351 (447) + ++.+++++++||++|||||++|| +++.|++...|+++||.||+++||++||++|++.. .+.+++|+++. 
T Consensus 368 QfLEsrkfs~eeIArAFGVPP~lLG~~e~st~SNiEqq~~~Fv~~tL~Pil~~IEqeLNrkLl~~~---eg~~i~fdFd~ 444 (945) T protein:vir:10 368 QFKELAEFVARKICAVYQVSPQDVGILEGSNKATAEVMASLTKAKGLEPLMATISKGFDEVVSEFR---NEKDIKLWFKE 444 (945) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHcccCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc---cCceeEEEecc Confidence 5 68899999999999999999996 24468999999999999999999999999997543 35667777778 Q ss_pred hhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcccccc-ccc-cccchhhcccccCCCCCCCCCCCcCCCCCCC Q lcl|NC_010576. 352 FKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANEL-FNR-NIADGNQVGGINTPGQITSDQPATASTDPLN 429 (447) Q Consensus 352 l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~-~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 429 (447) +.+.|.+++++++.+++++|+||+||+|+++|+||++|| |.+ ++. ++.+..+..+.... ..++......++.|.. T Consensus 445 ldl~D~ksraEal~kli~sGiLTiNEvRe~lGLpPIeGG--D~lli~~nn~~P~d~~~ka~~g-a~p~q~aq~~~dqp~~ 521 (945) T protein:vir:10 445 DDLEKERDWWNIIQGQLNTGFRSINEARMEKGLEPVPWG--DVPFSGLRNWKPEDEQAKAQQG-AMPPQLAQAMADQPSQ 521 (945) T ss_pred hhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc--ceeeeccccccccccccccccC-CCCcccccCCCCCCCC Confidence 888999999999999999999999999999999999984 553 333 34444333222111 1111111011111111 Q ss_pred cccccccCCccC----cCCC------CC Q lcl|NC_010576. 430 NVSTSAIENGSL----TDGG------SY 447 (447) Q Consensus 430 ~~~~~~~~~~~~----~~~~------~~ 447 (447) +..+..+++.. 
.+.+ .+ T Consensus 522 -kGGe~dEns~~psE~kda~~e~~~~l~ 548 (945) T protein:vir:10 522 -QGGGVDENSSVPSEQKNAGLEVLRNLF 548 (945) T ss_pred -CCCCCCCCCCCCCcccchHHHHHHHHH Confidence 11111111111 1111 11 No 65 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=100.00 E-value=4e-72 Score=412.02 Aligned_cols=385 Identities=13% Similarity=0.062 Sum_probs=260.0 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||+|+.+ .+ ...+. ....|...++++ .. ...++..+++++++|++||++||++||++||++ T Consensus 1 M~~f~~~~~------~~--~~~~~----~~~~~~~~~~~~--~~-~~~v~~~~al~~~~V~~~v~~ia~~ia~~p~~~-- 63 (397) T protein:vir:38 1 MPLLKLNKS------HS--QGFSL----NDPDWVNFLTGG--EA-QKYVSADTALKNSDIFSLIMQLSGDLAMVRYTS-- 63 (397) T ss_pred Ccchhhhhc------cc--CcccC----CchhhhhhhcCC--cC-CceechHHhhccHHHHHHHHHHHHHHhhCcccc-- Confidence 999987432 11 11111 111232222222 12 233556678999999999999999999999853 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeee-cCCc Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQF-FPRQ 159 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 159 (447) .|+..++|+.+||++||+++||+.++.+++++||||+++.++..+.+..++++.+ ....+... ..+. 
T Consensus 64 -----------~~~~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~~-~~v~i~~~~~~~~ 131 (397) T protein:vir:38 64 -----------ESDRSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLRP-SQVQPMLLQDGSG 131 (397) T ss_pred -----------cccHHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcC-ceeEEEEcCCCce Confidence 3566777888999999999999999999999999999999988776554444444 33333332 2234 Q ss_pred eEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCCcCCh Q lcl|NC_010576. 160 VMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPYSTKS 234 (447) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~~~~ 234 (447) ..+.+.......+....++.++|+|++.+.......+.+.+..+...+....+ ...+.||++++|+|+++..+.+ T Consensus 132 ~~y~~~~~~~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~ 211 (397) T protein:vir:38 132 LIYNINFDEPAIGYMENVPAADVIHIRLLSKNGGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKGGLL 211 (397) T ss_pred EEEEEEeccccccceeEecCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCH Confidence 44444444444445567889999999976443322222334444444443322 2335789999999999998887 Q ss_pred HHHHHHHHHHHHHHHHHhc-cCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcCCc----HHH Q lcl|NC_010576. 235 TARAAQAARRKQEIENEMA-NNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNGTA----NEQ 308 (447) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~-~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g~~----~e~ 308 (447) ++.++.++++ ..... .|+++++||+.|++|+++++++.+++ ++.+++++++||++|||||++|+++. 
+.+ T Consensus 212 e~~~~~~~~~----~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~e 287 (397) T protein:vir:38 212 DAETRIARSK----EISKQIHNSDGPVVIDALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQQSSIT 287 (397) T ss_pred HHHHHHHHHH----HHHhcccccCCceecCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHH Confidence 7655444443 33333 47899999999999999999988865 68899999999999999999998632 335 Q ss_pred HHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Q lcl|NC_010576. 309 QTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHP 388 (447) Q Consensus 309 ~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ 388 (447) +...||.+||.||+.+||++||++|+++. +|++..++++|.+++++++++++++|++|+||+|+++|++|++ T Consensus 288 ~~~~~~~~~l~P~~~~ie~~ln~~l~~~~--------~~~~~~~~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~ 359 (397) T protein:vir:38 288 QISGQYAKSLNRYVQAIVGELNDKLHANI--------SANIRFAIDAMGDQYASTISSSVKGGTIAGNQARFILQNSGYL 359 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccChh--------cccccccccCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 56789999999999999999999999753 3455566788999999999999999999999999999999998 Q ss_pred CccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCcc Q lcl|NC_010576. 389 NPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNV 431 (447) Q Consensus 389 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (447) ++ +..... 
.............+++.++.+..+.+ .+.| T Consensus 360 ~~--d~~~~~-~~~~~~~~~~~~~~g~~~~~~~~e~~--~~~~ 397 (397) T protein:vir:38 360 AK--DLPDPE-KEPQQAIQLIQQEGGENDGNNSDERG--SDPE 397 (397) T ss_pred CC--cccccc-ccccccccccccccCCCCCCCCCCCC--CCCC Confidence 75 322111 11111111111111111111111111 1111 No 66 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=100.00 E-value=8.4e-71 Score=404.75 Aligned_cols=375 Identities=14% Similarity=0.132 Sum_probs=257.4 Q ss_pred CchhHhhhhhcccccCCccc-cccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQ-NQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHL 79 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~ 79 (447) ||||+++++ .+... ......+..... ...+++.........++..+++++++|++||++||++||+|||+++ T Consensus 3 m~~~~~~~~------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~ia~lp~~~~ 75 (392) T protein:vir:74 3 LPILNFINQ------TNDPPEAGSVQSYFPDGN-DAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAE 75 (392) T ss_pred chhhhhhhc------ccCcccccccccccccCc-hhhhhhhccCCCCcccchhhhhcchHHHHHHHHHHHhhccCceeec Confidence 999876443 11111 112222211111 1112222222233445667889999999999999999999999998 Q ss_pred EEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeee-cCC Q lcl|NC_010576. 80 KIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQF-FPR 158 (447) Q Consensus 80 r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 158 (447) ++.. ..|+.+||++||+++||+.++.+++++||||+++.++..+....++++.+..+ .+... 
..+ T Consensus 76 ~~~~-------------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v-~v~~~~~~~ 141 (392) T protein:vir:74 76 KKKN-------------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQV-NTYYFEYEN 141 (392) T ss_pred cchh-------------hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCcee-EEEEcCCCc Confidence 6432 23556999999999999999999999999999999988776555555544444 33332 234 Q ss_pred ceEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHH-----HHHhhcCcccceeeeCCcCC Q lcl|NC_010576. 159 QVMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQ-----DNRASSGKLNGFIQFPYSTK 233 (447) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~n~~~~~gvl~~~~~~~ 233 (447) ...+++............++.++|+|++.+.......+.+.+..+...+....++ ..+.||++|+|+|++++... T Consensus 142 ~~~y~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~ 221 (392) T protein:vir:74 142 GMYYNITFDDPKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGL 221 (392) T ss_pred eEEEEEEecCCccceeEEEcCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCC Confidence 4445544443334445678899999999765433222233444444444443332 23578999999999987654 Q ss_pred hHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcCC----cHHH Q lcl|NC_010576. 234 STARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNGT----ANEQ 308 (447) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g~----~~e~ 308 (447) .+ +++++++++.|. ...|+|+++||++|++|+++++++++++ ++.+++++++||++|||||++||+. 
++++ T Consensus 222 ~~--~~~~~~~~~~~~--~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~e 297 (392) T protein:vir:74 222 LS--DKDKASRSRSFM--KRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQ 297 (392) T ss_pred ch--HHHHHHHHHHHh--ccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHH Confidence 43 344556666654 3457899999999999999999988765 7889999999999999999999753 3467 Q ss_pred HHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCC Q lcl|NC_010576. 309 QTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELT---GKA 385 (447) Q Consensus 309 ~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~---gl~ 385 (447) +.++|+++||.||++.||++|+++|++ +++||...+++.|.+++++.+.+++++|++|+||+|+++ |+. T Consensus 298 ~~~~~~~~~l~p~~~~ie~~l~~~l~~--------~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~ 369 (392) T protein:vir:74 298 QISGMYASALNRYLRPAISELEYKLSD--------HISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYI 369 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccc--------hhcccchhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCC Confidence 889999999999999999999999975 368899999999999999999999999999999999987 333 Q ss_pred CCCCccccccccccccchhhcccccCCCCCCCCCCC Q lcl|NC_010576. 386 PHPNPLANELFNRNIADGNQVGGINTPGQITSDQPA 421 (447) Q Consensus 386 p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 421 (447) |.+-.. 
.-++.+ ...|++++ .-| T Consensus 370 pne~r~-----~enl~~-------~~~Gd~~~-p~p 392 (392) T protein:vir:74 370 PKDLPA-----PENTNK-------KTTGQSNE-PVP 392 (392) T ss_pred ccccch-----hcCCCC-------CCCCCCCC-CCC Confidence 311100 001110 00111111 111 No 67 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=100.00 E-value=1.5e-70 Score=403.41 Aligned_cols=333 Identities=12% Similarity=0.113 Sum_probs=247.7 Q ss_pred hccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCc Q lcl|NC_010576. 71 ASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVG 150 (447) Q Consensus 71 ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~ 150 (447) ||+|||++||.++ ..+|++++||+.+||++||+++||+.++.+++++||||+++.++..+.+.. +++++.... T Consensus 1 ia~lp~~~~~~~~------~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~-L~~l~~~~v 73 (348) T protein:vir:93 1 MASLPLKMYEDYK------VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSK-LFLLNPDVV 73 (348) T ss_pred CcccceEeEecCc------CcccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEE-EEEEcCCce Confidence 9999999998543 458999999999999999999999999999999999999999988776554 444444443 Q ss_pred ceeeecC-CceEEEEeeecccccceeeeccccccccccccc-ccccchhHHHHHHHHHHHHHHHHH--HHhhcC-cccce Q lcl|NC_010576. 151 KIMQFFP-RQVMVRVWNDNTGLEQDLLVSKENCIIIESPFY-AILNDTNQTLRMLEQKIKLMNSQD--NRASSG-KLNGF 225 (447) Q Consensus 151 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~--~~~n~~-~~~gv 225 (447) .+..... ..+.+. .....+..+.++.++|+|++.+.. +... +.+.+..+...+....++. 
.+.+++ .+.++ T Consensus 74 ~~~~~~~~~~~~y~---~~~~~g~~~~~~~~eiih~r~~~~~~~~~-G~s~~~~~~~~i~~~~~~~~~~~~~~~~~~~~i 149 (348) T protein:vir:93 74 EMLIENQSRELYYS---IHAATGNKLIVHNMDMLHFKHIVASNMVQ-GISPIDVLKNTTDFDNAVRTFNLTEMQKPDSFM 149 (348) T ss_pred EEEEeCCCcEEEEE---EEcCCCeEEEEccccEEEecCCCCCCcee-eccHHHHHHHHHHHHHHHHHHHHHhcCCCceeE Confidence 4333222 222222 233334556788999999986422 2211 1122333344433332222 233333 34567 Q ss_pred eeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcCC Q lcl|NC_010576. 226 IQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNGT 304 (447) Q Consensus 226 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g~ 304 (447) ++.+..+++++.+ +++++|.+.+ +|+++++++++|++|+++++++++++ ++.+++++++||++|||||++|++. T Consensus 150 ~~~~~~l~~e~~~----~~~~~~~~~~-~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~ 224 (348) T protein:vir:93 150 LKYGSNVSTEKRQ----QVLEDFKQYY-EENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNAR 224 (348) T ss_pred EecCCCCCHHHHH----HHHHHHHHHh-hcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC Confidence 7888888776554 4455555444 46789999999999999999998865 6889999999999999999999742 Q ss_pred ------cHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHH Q lcl|NC_010576. 
305 ------ANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEI 378 (447) Q Consensus 305 ------~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~ 378 (447) +.|++.++|+++||.||++.||++|+++||++.++..|++|+||.+.|+++|.+++++++.+++++|++|+||+ T Consensus 225 ~~~~~~~~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~ 304 (348) T protein:vir:93 225 SNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDI 304 (348) T ss_pred CCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHH Confidence 34899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhCCCCCCCccccc-cccccccchhhccccc--CCCCCCCCCCC Q lcl|NC_010576. 379 RELTGKAPHPNPLANE-LFNRNIADGNQVGGIN--TPGQITSDQPA 421 (447) Q Consensus 379 R~~~gl~p~~g~~~~~-~~~~~~~~~~~~~~~~--~~~~~~~~~~~ 421 (447) |+++|+||+|| ||. |+++|+.+........ .+|++.++++. T Consensus 305 R~~~g~~p~~g--gD~~~~~~n~~~~~~~~~~~~~~~gg~~n~~~~ 348 (348) T protein:vir:93 305 REWEDLPPVEG--GDKPLISGDLYPIDTPLELRKSLKGGDKNVNES 348 (348) T ss_pred HHHhCCCCCCC--cCeEeecccccccccchhhcccccCCCCCcCCC Confidence 99999999998 566 6788887754432221 11111111111 No 68 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=100.00 E-value=1.8e-70 Score=402.95 Aligned_cols=375 Identities=13% Similarity=0.125 Sum_probs=259.1 Q ss_pred CchhHhhhhhcccccCCccc-cccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQ-NQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHL 79 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~ 79 (447) ||||+++.+ .+... ......+..... 
...+.+.........++...++++++|++||++||++||++|++++ T Consensus 3 m~~f~~~~~------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~ 75 (392) T protein:vir:39 3 LPILNFINQ------TNDPPEVGSVQSYFPDGN-DAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAE 75 (392) T ss_pred chhhhhhhc------ccccccccccccccccCc-hhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeec Confidence 999987543 22222 222222222211 1222232232333345667889999999999999999999999988 Q ss_pred EEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeee-cCC Q lcl|NC_010576. 80 KIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQF-FPR 158 (447) Q Consensus 80 r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 158 (447) +... +.|+.+||++||+++||+.++.+++++||||+++.++..+....++++.+..+ .+... ..+ T Consensus 76 ~~~~-------------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v-~~~~~~~~~ 141 (392) T protein:vir:39 76 KKKN-------------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQV-NTYYFEYEN 141 (392) T ss_pred cchh-------------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCcee-EEEEcCCCc Confidence 6432 23556999999999999999999999999999999988776555555544433 33332 234 Q ss_pred ceEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHH-----HHHhhcCcccceeeeCCcCC Q lcl|NC_010576. 159 QVMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQ-----DNRASSGKLNGFIQFPYSTK 233 (447) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~n~~~~~gvl~~~~~~~ 233 (447) ...+++............++.+||+|++.+.......+.+.+..+...+....+. ..+.|++.++|+|++++... 
T Consensus 142 ~~~y~~~~~~~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~ 221 (392) T protein:vir:39 142 GMYYNITFDDPKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGL 221 (392) T ss_pred eEEEEEEecCcccceeEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCC Confidence 4445544444444445678899999999764433222233344444444333322 23578999999999987654 Q ss_pred hHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcCC----cHHH Q lcl|NC_010576. 234 STARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNGT----ANEQ 308 (447) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g~----~~e~ 308 (447) .+ +++++++++.+.. ..|+++++||++|++|+++++++++.+ ++.+++++++||++|||||++||++ ++++ T Consensus 222 ~~--~~~~~~~~~~~~~--~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~ 297 (392) T protein:vir:39 222 LS--DKDKASRSRSFMK--RSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQ 297 (392) T ss_pred ch--HHHHHHHHHHHhc--cccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHH Confidence 44 3445566665543 457899999999999999999988865 7889999999999999999999753 3467 Q ss_pred HHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCC Q lcl|NC_010576. 309 QTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELT---GKA 385 (447) Q Consensus 309 ~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~---gl~ 385 (447) +.++|+++||.||++.||++|+++|++ +++||...+++.|.+++++.+.+++++|++|+||+|+++ |+. 
T Consensus 298 ~~~~f~~~~l~P~~~~ie~~l~~~L~~--------~~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~ 369 (392) T protein:vir:39 298 QISGMYASALNRYLRPAISELEYKLSD--------HISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYI 369 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccc--------cccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCC Confidence 889999999999999999999999975 367889999999999999999999999999999999987 554 Q ss_pred CCCCccccccccccccchhhcccccCCCCCCCCCCC Q lcl|NC_010576. 386 PHPNPLANELFNRNIADGNQVGGINTPGQITSDQPA 421 (447) Q Consensus 386 p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 421 (447) |.+-.... ++. ..+++..++..| T Consensus 370 p~e~r~~e-----~l~--------~~~~Gd~~~p~p 392 (392) T protein:vir:39 370 PKDLPAPE-----NTN--------KKTTGQSNEPVP 392 (392) T ss_pred ccccchhc-----CCC--------CCCCCCCCCCCC Confidence 42211000 010 001111111111 No 69 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=100.00 E-value=1.8e-70 Score=402.95 Aligned_cols=375 Identities=13% Similarity=0.125 Sum_probs=259.1 Q ss_pred CchhHhhhhhcccccCCccc-cccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQ-NQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHL 79 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~ 79 (447) ||||+++.+ .+... ......+..... 
...+.+.........++...++++++|++||++||++||++|++++ T Consensus 3 m~~f~~~~~------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~ 75 (392) T protein:vir:10 3 LPILNFINQ------TNDPPEVGSVQSYFPDGN-DAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAE 75 (392) T ss_pred chhhhhhhc------ccccccccccccccccCc-hhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeec Confidence 999987543 22222 222222222211 1222232232333345667889999999999999999999999988 Q ss_pred EEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeee-cCC Q lcl|NC_010576. 80 KIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQF-FPR 158 (447) Q Consensus 80 r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 158 (447) +... +.|+.+||++||+++||+.++.+++++||||+++.++..+....++++.+..+ .+... ..+ T Consensus 76 ~~~~-------------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v-~~~~~~~~~ 141 (392) T protein:vir:10 76 KKKN-------------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQV-NTYYFEYEN 141 (392) T ss_pred cchh-------------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCcee-EEEEcCCCc Confidence 6432 23556999999999999999999999999999999988776555555544433 33332 234 Q ss_pred ceEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHH-----HHHhhcCcccceeeeCCcCC Q lcl|NC_010576. 159 QVMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQ-----DNRASSGKLNGFIQFPYSTK 233 (447) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~n~~~~~gvl~~~~~~~ 233 (447) ...+++............++.+||+|++.+.......+.+.+..+...+....+. ..+.|++.++|+|++++... 
T Consensus 142 ~~~y~~~~~~~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~ 221 (392) T protein:vir:10 142 GMYYNITFDDPKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGL 221 (392) T ss_pred eEEEEEEecCcccceeEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCC Confidence 4445544444444445678899999999764433222233344444444333322 23578999999999987654 Q ss_pred hHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcCC----cHHH Q lcl|NC_010576. 234 STARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNGT----ANEQ 308 (447) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g~----~~e~ 308 (447) .+ +++++++++.+.. ..|+++++||++|++|+++++++++.+ ++.+++++++||++|||||++||++ ++++ T Consensus 222 ~~--~~~~~~~~~~~~~--~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~ 297 (392) T protein:vir:10 222 LS--DKDKASRSRSFMK--RSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQ 297 (392) T ss_pred ch--HHHHHHHHHHHhc--cccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHH Confidence 44 3445566665543 457899999999999999999988865 7889999999999999999999753 3467 Q ss_pred HHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCC Q lcl|NC_010576. 309 QTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELT---GKA 385 (447) Q Consensus 309 ~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~---gl~ 385 (447) +.++|+++||.||++.||++|+++|++ +++||...+++.|.+++++.+.+++++|++|+||+|+++ |+. 
T Consensus 298 ~~~~f~~~~l~P~~~~ie~~l~~~L~~--------~~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~ 369 (392) T protein:vir:10 298 QISGMYASALNRYLRPAISELEYKLSD--------HISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYI 369 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccc--------cccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCC Confidence 889999999999999999999999975 367889999999999999999999999999999999987 554 Q ss_pred CCCCccccccccccccchhhcccccCCCCCCCCCCC Q lcl|NC_010576. 386 PHPNPLANELFNRNIADGNQVGGINTPGQITSDQPA 421 (447) Q Consensus 386 p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 421 (447) |.+-.... ++. ..+++..++..| T Consensus 370 p~e~r~~e-----~l~--------~~~~Gd~~~p~p 392 (392) T protein:vir:10 370 PKDLPAPE-----NTN--------KKTTGQSNEPVP 392 (392) T ss_pred ccccchhc-----CCC--------CCCCCCCCCCCC Confidence 42211000 010 001111111111 No 70 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=100.00 E-value=2.7e-70 Score=401.98 Aligned_cols=367 Identities=14% Similarity=0.096 Sum_probs=252.8 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||+|+. |.+++.+..... ... .++..+++.. 
....++...++++++|++||++||++||++||++++ T Consensus 1 Mg~~~~~~-----~~~~~~~~~~~~--~~~-~~~~~~~~~~---~~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~v~~ 69 (385) T protein:vir:10 1 MGLLTPRN-----FNKRKAKNMVYP--SNP-AFFTTTVGGM---QLSYVSALSALQNTNVYSVINRIASDVASAHFKTEN 69 (385) T ss_pred Cccccchh-----cccccccccccc--cch-hhhhhhcccc---CccccCHHHhhccHHHHHHHHHHHHHHhhCceeeec Confidence 99998732 222222211111 111 1222223222 123456678899999999999999999999998763 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) |+...||+ +||++||+++||+.++.+++++||||+++.++.. ..+++.+..+ .+... .... T Consensus 70 ------------~~~~~ll~-~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~~~----~~~p~~~~~v-~~~~~-~~~~ 130 (385) T protein:vir:10 70 ------------TATLNRLE-SPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQNL----EHIPNSDVQI-NYLPG-NMGI 130 (385) T ss_pred ------------cchhhhhh-cCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCce----eEeecCCceE-EEEEc-CCce Confidence 56677786 8999999999999999999999999999987632 2333333222 22221 1222 Q ss_pred EEEEeeecccccceeeeccccccccccccccccc--chhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCCcCC Q lcl|NC_010576. 161 MVRVWNDNTGLEQDLLVSKENCIIIESPFYAILN--DTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPYSTK 233 (447) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~~~ 233 (447) .+.+. .........+++++|+|++.+...+.. .+.+.+..+...+....+ ...+.||++++|+|++++.+. 
T Consensus 131 ~~~~~--~~~~~~~~~~~~~eiihik~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~ 208 (385) T protein:vir:10 131 VYTVL--ESNDRPQMVLRQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLS 208 (385) T ss_pred EEEEE--EcCCceEEEEccccEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCC Confidence 22222 122334567889999999964321111 111233333334333222 233578999999999997765 Q ss_pred hHHHHHHHHHHHHHHHHHhcc-CCcceeecCCCceeeecCCChhhhh-H-HHHHHHHHHHHHHhCCCHHHhcCC------ Q lcl|NC_010576. 234 STARAAQAARRKQEIENEMAN-NKYGVATLDTQEKFVSAGMGLQNNL-L-SDVRQLQQDFYNQMGITEAILNGT------ 304 (447) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~-n~~~~~vl~~g~~~~~l~~~~~~~~-l-~~~~~~~~~Ia~~fgVP~~~l~g~------ 304 (447) ++ ++++++++.|++.+++ |++++++|++|++|+++++++.+.+ + +.+++++++||++|||||++|++. T Consensus 209 ~~---e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~ 285 (385) T protein:vir:10 209 DG---KDLESAREEFEKANTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQ 285 (385) T ss_pred CH---HHHHHHHHHHHHHhCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcc Confidence 43 3445556666655554 7899999999999999999988755 5 778999999999999999999752 Q ss_pred --cHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh Q lcl|NC_010576. 
305 --ANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELT 382 (447) Q Consensus 305 --~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~ 382 (447) +.| |...||..||.||++.||++|+++|++ .+|+|+++.++++|.+++++++.+++++|++|+||+|+++ T Consensus 286 ~sn~e-q~~~~~~~~l~P~~~~ie~~l~~~l~~-------~~~~f~~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~ 357 (385) T protein:vir:10 286 HSNID-QIKATYLANLNSYVNPIVDELRLKMNA-------PDLELDIKDMLDVDDSALINQVSNLAKSGVLGAEQAQFIL 357 (385) T ss_pred cccHH-HHHHHHHHHHHHHHHHHHHHHHHhhCC-------ceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh Confidence 124 446667789999999999999999985 3699999999999999999999999999999999999999 Q ss_pred CCCCCCCccccccccccccchhhcccccCCCCCCCC Q lcl|NC_010576. 383 GKAPHPNPLANELFNRNIADGNQVGGINTPGQITSD 418 (447) Q Consensus 383 gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 418 (447) |++|+|++.++.+.... .+ ...|++.++ T Consensus 358 g~~p~p~~~~~~~~~~~----~~----~~~g~~~dn 385 (385) T protein:vir:10 358 TRSGFLPDNLPEFKPLT----TQ----VKGGDEGDN 385 (385) T ss_pred CCCccCCCCCccccCcc----cc----cCCCCCCCC Confidence 99999987666653211 01 111111111 No 71 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=100.00 E-value=2.3e-68 Score=391.44 Aligned_cols=365 Identities=14% Similarity=0.088 Sum_probs=249.8 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||+|+ .|++++.+... ..... 
.|....+++ ....+++...++++++|++||++||++||++||+++ T Consensus 1 Mg~~~~~-----~~~k~~~~~~~--~~~~~-~~~~~~~~~---~~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~- 68 (383) T protein:vir:10 1 MGLLTPK-----NFSKRNAKNMV--YPSNP-AFFTTTVGG---MQLSYVSALSALQNTNVYSVINRIASDVSSAHFKTE- 68 (383) T ss_pred CCccccc-----ccccccccccc--cccch-hhhhhhccC---ccccccchhHhhcchHHHHHHHHHHHhhccCceeec- Confidence 9999872 22333332211 11111 222222222 223445667789999999999999999999999875 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) +|+..+||+ +||++||+++||+.++.+++++||||+++.++... .+++.+..+ .+... .+.. T Consensus 69 -----------~~~~~~ll~-~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~~~~----~~p~~~~~v-~~~~~-~~~~ 130 (383) T protein:vir:10 69 -----------NTATLNRLE-SPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQNLE----HIPNSDVQI-NYLPG-NMGI 130 (383) T ss_pred -----------ccchhhhhh-CCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCcee----EeecCcceE-EEEEc-CCce Confidence 356677886 89999999999999999999999999998876432 233333222 11211 1222 Q ss_pred EEEEeeecccccceeeeccccccccccccccccc--chhHHHHHHHHHHHHHH-----HHHHHhhcCcccceeeeCCcCC Q lcl|NC_010576. 161 MVRVWNDNTGLEQDLLVSKENCIIIESPFYAILN--DTNQTLRMLEQKIKLMN-----SQDNRASSGKLNGFIQFPYSTK 233 (447) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~-----~~~~~~n~~~~~gvl~~~~~~~ 233 (447) .+.+.. ........+++++|+|+|.+...+.. .+.+.+..+...+.... +...+.||++++|+|++++.+. 
T Consensus 131 ~~~~~~--~~~~~~~~~~~~evih~r~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~ 208 (383) T protein:vir:10 131 VYTVLE--SNDRPKMVLRQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLS 208 (383) T ss_pred EEEEEE--cCCceEEEEcccceEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCC Confidence 222222 22234466889999999854322111 11122333333333322 2233578999999999998775 Q ss_pred hHHHHHHHHHHHHHHHHHhc-cCCcceeecCCCceeeecCCChhhhh-H-HHHHHHHHHHHHHhCCCHHHhcCC------ Q lcl|NC_010576. 234 STARAAQAARRKQEIENEMA-NNKYGVATLDTQEKFVSAGMGLQNNL-L-SDVRQLQQDFYNQMGITEAILNGT------ 304 (447) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~-~n~~~~~vl~~g~~~~~l~~~~~~~~-l-~~~~~~~~~Ia~~fgVP~~~l~g~------ 304 (447) ++ ++++++++.|++... +|++++++|++|++|+++++++.+.+ + +.+++++++||++|||||++|++. T Consensus 209 ~~---e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~ 285 (383) T protein:vir:10 209 DG---KDLESAREEFEKANTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQ 285 (383) T ss_pred CH---HHHHHHHHHHHHHhCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCc Confidence 43 344555666665554 47899999999999999999988754 5 678999999999999999999742 Q ss_pred --cHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh Q lcl|NC_010576. 
305 --ANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELT 382 (447) Q Consensus 305 --~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~ 382 (447) +.||+ ..|+..||.||++.||++|+++|++ .+++||++.+++.|.+++++++.+++++|+||+||+|+++ T Consensus 286 ~sn~eq~-~~~~~~~l~P~~~~ie~~l~~~l~~-------~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~nE~R~~l 357 (383) T protein:vir:10 286 HSNIDQI-KATYLANLNSYVNPIVDELRLKMNA-------PDLELDIKDMLDVDDSILINQVSNLAKSGVLGAEQAQFIL 357 (383) T ss_pred cccHHHH-HHHHHHHHHHHHHHHHHHHHHhhCC-------ceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh Confidence 12444 4566689999999999999999974 4799999999999999999999999999999999999999 Q ss_pred CCCCCCCccccccccccccchhhcccccCCCCCCC Q lcl|NC_010576. 383 GKAPHPNPLANELFNRNIADGNQVGGINTPGQITS 417 (447) Q Consensus 383 gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 417 (447) |++|+++++...+.. +. ...+|++.+ T Consensus 358 g~~p~~~~d~~~~~~-~~--------~~~~gGd~e 383 (383) T protein:vir:10 358 TRSGFLPDNLPEFKP-LT--------NETKGGDDK 383 (383) T ss_pred CCCcccCCcccccCC-Cc--------ccCCCCCCC Confidence 999999865433211 00 011111111 No 72 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=100.00 E-value=9.6e-69 Score=393.48 Aligned_cols=365 Identities=12% Similarity=0.076 Sum_probs=261.6 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||+|+.. ++..+.. ....+.... ...+++. . 
....+++.+.++++++|++||++||++||++||++++ T Consensus 1 Mglf~~~~~-----~~~~~~~-~~~~~~~~~--~~~~~~~-~-~~~~~v~~~~al~~~~V~~~i~~Ia~~ia~l~~~~~~ 70 (384) T protein:vir:49 1 MPIFNITNL-----ATESPPS-NQDSFFDIT--DPEFLDA-L-NGSEWVSAETALKNSDLFSIISQLSNDLATAKITTSR 70 (384) T ss_pred Ccccccccc-----Ccccccc-cchhhcccc--chhhccc-c-cCCceechhhhhccHHHHHHHHHHHHHHhhCceeeec Confidence 999997421 1111111 111111100 0011221 1 1233456678899999999999999999999999986 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeee-cCCc Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQF-FPRQ 159 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 159 (447) +.. ..|+.+||++||+++||+.++.+++++||||+++.++..+.+..++++.+..+ .+... ..+. T Consensus 71 ~~~-------------~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v-~v~~~~~~~~ 136 (384) T protein:vir:49 71 KQL-------------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQV-SFNRLDNQNG 136 (384) T ss_pred chh-------------hhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEcCcee-EEEEcCCCce Confidence 432 23567999999999999999999999999999999988776555555544433 33322 2234 Q ss_pred eEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCCcCCh Q lcl|NC_010576. 
160 VMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPYSTKS 234 (447) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~~~~ 234 (447) +.+.+.......+....++.++|||++.+.....-.+.+.+..+...+....+ ...+.||++++|+|++++...+ T Consensus 137 ~~y~~~~~~~~~~~~~~~~~~eVih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~ 216 (384) T protein:vir:49 137 LYYNITFDDPRIPPKQHVPQGDILHFRLLSVDGGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLL 216 (384) T ss_pred EEEEEEecCccccceeEecCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCh Confidence 44444443333445567889999999975433211122333333333333222 2335789999999999988876 Q ss_pred HHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcCC--------c Q lcl|NC_010576. 235 TARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNGT--------A 305 (447) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g~--------~ 305 (447) ++.++ ++++.. ....|++++++|+.|++|+++++++++++ ++.+++++++||++|||||++|+++ . T Consensus 217 ~~~~~---~~~~~~--~~~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~ 291 (384) T protein:vir:49 217 DFKTK---QSRSRQ--AMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDKQSSLEM 291 (384) T ss_pred HHHHH---HHHHHH--hcccCCccceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccccHHH Confidence 54332 233222 33568899999999999999999998866 6889999999999999999999753 2 Q ss_pred HHHHHHHHHHHHHhHHHHHHHHHHHhhcC---ChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh Q lcl|NC_010576. 
306 NEQQTLGYYNRCVDVLLQYVTDAISRIAL---TKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELT 382 (447) Q Consensus 306 ~e~~~~~f~~~ti~P~~~~ie~~l~~kLl---~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~ 382 (447) .++....|++.+|.||+..|+++|+++|+ ....+..+++++|+++.|+++|++++.+++.++.+.|+++ ||+|+.+ T Consensus 292 ~~~~~~~~i~~~l~pi~~~i~~~l~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~-ne~r~~~ 370 (384) T protein:vir:49 292 IYNIYFKAVSRFLRPFVSELSKKLSCEVDADILPAVDPTGSNYIGLINSMVKTGTLAQNQGLYVLQQAEILP-KDLPEGE 370 (384) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhchhhhhhhhhhhhccchHHHHHHHHHhhcCcccHHHHHHHHhhCCCCC-hhHHHHc Confidence 36777889999999999999999999984 3333445678999999999999999999999999999986 9999999 Q ss_pred CCCCCCCccccc-c Q lcl|NC_010576. 383 GKAPHPNPLANE-L 395 (447) Q Consensus 383 gl~p~~g~~~~~-~ 395 (447) |++|++||+.++ | T Consensus 371 ~~~p~~gGd~~~~~ 384 (384) T protein:vir:49 371 TDSTLKGGETNEQY 384 (384) T ss_pred CCCCCCCCCCCCCC Confidence 999999987665 5 No 73 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=100.00 E-value=6.3e-68 Score=389.01 Aligned_cols=375 Identities=12% Similarity=0.057 Sum_probs=260.3 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||+++.+ .+.....+...+..... +.+.... 
....+++.+.++++++|++||++||++||+||+++|+ T Consensus 1 M~~f~~~~~------~~~~~~~~~~~~~~~~~---~~~~~~~-~~~~~v~~~~~~~~~~v~~~i~~ia~~ia~~p~~~~~ 70 (386) T protein:vir:48 1 MPIFNITNL------ATESPPISQGGFFDITD---PDFLSTL-NGSEWVSAESALRNSDLFSIINQLSNDLATVKLTASR 70 (386) T ss_pred Ccccccccc------ccccccccccccccccc---chhcccc-cCCceechhhhhcchHHHHHHHHHHHhhccCceeecc Confidence 999987443 12222222222211111 1111111 2233456677899999999999999999999999875 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecC-Cc Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFP-RQ 159 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 159 (447) . .. +.|+.+||++||+++||+.++.+++++||||+++.++..+.+..++++ +.....+..... +. T Consensus 71 ~------------~~-~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l-~~~~v~v~~~~~~~~ 136 (386) T protein:vir:48 71 K------------QL-QGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYL-RPSQVSFNRLDNKDG 136 (386) T ss_pred c------------hh-HHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEe-cCceeEEEEcCCCce Confidence 2 23 445679999999999999999999999999999999887765544444 444333333222 23 Q ss_pred eEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCCcCCh Q lcl|NC_010576. 
160 VMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPYSTKS 234 (447) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~~~~ 234 (447) ..+.+............++.++|+|++.+.......+.+.+..+...+....+ ...+.||++++|+|+.++.+++ T Consensus 137 ~~y~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~~~~ 216 (386) T protein:vir:48 137 IYYNITFDDPRIPPKQHVPQGDVLHFKLLSVDGGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKGGGLL 216 (386) T ss_pred EEEEEEecCccccceeEecCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCH Confidence 33333333333344567889999999965433211222334444444433322 2335789999999999998887 Q ss_pred HHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcCC----cHHHH Q lcl|NC_010576. 235 TARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNGT----ANEQQ 309 (447) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g~----~~e~~ 309 (447) ++.++ +++.|.. ...|+++++||++|++|+++++++++++ ++.+++++++||++|||||++||++ +.|++ T Consensus 217 e~~~~----~~~~~~~-~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~e~~ 291 (386) T protein:vir:48 217 DFKTK----LSRSRQA-MKQMQGGPLVLDDLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEM 291 (386) T ss_pred HHHHH----HHHHHHH-hhcCCCCceecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHH Confidence 65544 4444543 4567899999999999999999998865 7889999999999999999999753 45899 Q ss_pred HHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Q lcl|NC_010576. 
310 TLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPN 389 (447) Q Consensus 310 ~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g 389 (447) .++|+++||.||++.||++|+++|+++ +++++..+++.|...++..+.+++++|++|+||+|+.+|++|+++ T Consensus 292 ~~~~~~~~l~P~~~~ie~~l~~~l~~~--------~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~ 363 (386) T protein:vir:48 292 SLDLYNKAVSRYLRPFLSELSQKLSCD--------VDADILPAVDPTGSNSVSRINSMVKSGTLAQNQGLYILQQAEILP 363 (386) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcch--------hhcchhhhhccChHHHHHHHHHHHhCCCcCHHHHHHHhhcCCCCC Confidence 999999999999999999999999863 456677778889999999999999999999999999999999876 Q ss_pred ccccccccccccchhhcccccCCCCCCCCCC Q lcl|NC_010576. 390 PLANELFNRNIADGNQVGGINTPGQITSDQP 420 (447) Q Consensus 390 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 420 (447) ++...+-..+.. ..+|++.++++ T Consensus 364 ~~~~~~~~~~~~--------~~~gGd~~~~~ 386 (386) T protein:vir:48 364 KELPEGENPNKT--------TLKGGEINGED 386 (386) T ss_pred ccchhhcCCCCC--------ccCCCCCCCCC Confidence 432222111111 11122222222 No 74 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=100.00 E-value=1.7e-66 Score=381.14 Aligned_cols=441 Identities=10% Similarity=0.031 Sum_probs=269.6 Q ss_pred CchhHhhhhhcccccCCcccccccccccccc--------------c--cc---c-ccccccccC---------------- Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPS--------------N--GM---T-SFGGYYGRG---------------- 44 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~--------------~--~~---~-~~~~~~~~~---------------- 44 (447) |.++.-|++.|..|-+|..--...+++.... . +. . .-.|+.... 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~~~~ 80 (535) T protein:vir:10 1 MAILKDLRNAFSLSNKKSTSYIELGDYDKDIVNKAIRPGRASARDTVDGIDIADGNVAGQYSVASISDVLSTKKLLKAYA 80 (535) T ss_pred ChhhHHHHHHHHhhhhhhhhhHHHhhhhHHHHHhhhhhhhhhhhccccccccccCCcccccccCccccccCHHHHHHHhc Confidence 9999989998887766542211111111000 0 00 0 000011100 Q ss_pred Cccccc--chhhhhhHHHHHHHHHHHHhhccCceEEEEEcCCCce-eccccchHHHHHhhhcCcccCHHHHHH----HHH Q lcl|NC_010576. 45 QSNYSR--SYSYNKADLIKSVITRIALDASMVDFKHLKIDPISGN-QTPMPSGLINVLTRSANIDQTGRSFVF----DLL 117 (447) Q Consensus 45 ~~~~~~--~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r~~~~~~~-~~~~~~~l~~lL~~~PN~~~t~~~f~~----~~~ 117 (447) ...++. -.++....++|+||.+|+++++++|+++++.+..+.. .....|+++++|+.+||++|++++||+ .++ T Consensus 81 ~~~~~~~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~~~~~~~~~~~~~lv 160 (535) T protein:vir:10 81 DNDIVQAIIRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIKRAHEIEDFIYNTGSEYYEWRDTFPRLLTKII 160 (535) T ss_pred cChhHHHHHHHHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhhhhhHHHHHHHhCCCCCCChhHHHHHHHHHHH Confidence 000111 1233456678899999999999999999887665543 345679999999999999999887554 455 Q ss_pred HHHH-hcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCceEEEEeeecccccceeeecccccccccccc-cccc-- Q lcl|NC_010576. 118 YSLL-DEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIESPF-YAIL-- 193 (447) Q Consensus 118 ~~ll-l~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~-~~~~-- 193 (447) .+++ +.|++|+++.++..+.+..++++.+ ....+............+...........++.++|+|++... .+.. 
T Consensus 161 ~d~l~~~g~ay~~i~r~~~G~~~~L~~l~p-~~V~v~~d~~~~~~~~~~~~~~~~~~~~~~~~~eiih~~~~~~~~~~~~ 239 (535) T protein:vir:10 161 NDMYVQDQINIERIFKNDSNELDHFNAVDA-SKVVISYSPRSKDQPRKFEQFVSETKSVKFSERNLTFINYWNLSDTDRR 239 (535) T ss_pred HHHHhhCCceEEEEEECCCCcEEEEEEeCC-ceeEEEEcCccccCceEEEEEecCceeEEECcccEEEEeccCCCCcccc Confidence 5544 4568999999887776555454444 333333221111111122222233344568889999998522 1111 Q ss_pred cchhHHHHHHHHHHHHHH-----HHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhcc--CCcceeecC-CC Q lcl|NC_010576. 194 NDTNQTLRMLEQKIKLMN-----SQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMAN--NKYGVATLD-TQ 265 (447) Q Consensus 194 ~~~~~~~~~~~~~~~~~~-----~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~--n~~~~~vl~-~g 265 (447) ..+.+.+..+...+.... +...+.||++|+|||++++...+...+++++++++.|.+.++| |+++++||. .| T Consensus 240 ~~G~Spi~~~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~~~~ls~e~~e~lk~~~~~~~~G~~nag~~~vl~~~g 319 (535) T protein:vir:10 240 GYGYSPVEASIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQDGDAQANQMMLAGIRRQWTSQGSGLGGAWKIPILAAKD 319 (535) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCcccccccccccCCC Confidence 011223333333333322 2333578999999999988766666677888899999887765 778876665 79 Q ss_pred ceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC------------------CcHHHHHHHHHHHHHhHHHHHHH Q lcl|NC_010576. 
266 EKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG------------------TANEQQTLGYYNRCVDVLLQYVT 326 (447) Q Consensus 266 ~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g------------------~~~e~~~~~f~~~ti~P~~~~ie 326 (447) ++|+++++++++++ ++.+++++++||++|||||++||- +..|++...|++.||.||+++|| T Consensus 320 ~~~~~l~~~~~D~qfle~~~~~~~eIa~afgVPp~~lG~~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~~L~P~l~~ie 399 (535) T protein:vir:10 320 AKFVNMTQNSRDMEFDKFLNFMIYDTAAIFQMQPEEINFPNNGGSTGKSGTKSVNEGSTAKAKLESSKDKGLTPLLSFIE 399 (535) T ss_pred ceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccccCcccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHH Confidence 99999999998865 688999999999999999999962 23478888999999999999999 Q ss_pred HHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccc-cccc-cccchh Q lcl|NC_010576. 327 DAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANE-LFNR-NIADGN 404 (447) Q Consensus 327 ~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~-~~~~-~~~~~~ 404 (447) ++||++||+.. +.+++|+++.+++.|.++++++++.++ .|+||+||+|+++||||++||+-.. .++. ++.... T Consensus 400 ~~ln~~Ll~~~----~~~~~f~f~~l~~~d~~~r~~~~~~~~-~g~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~ 474 (535) T protein:vir:10 400 QVINDKIMRYV----DTDYRFSFTLGDAQDKLQEEQVWKLKL-ANGYFINEYRKDHGLKTVDGLDVPGFIGSAENFINAT 474 (535) T ss_pred HHHhhhccccc----CCeEEEEeccccccCHHHHHHHHHHHH-cCCCCHHHHHHHhCCCCCCCccccccccchhhccccc Confidence 99999999753 346788888999999999999987665 5779999999999999999864210 1111 111110 Q ss_pred hccccc---------CCCCCCCCCCC--------cCCCCCCCcccccccCCccC-cCCCCC Q lcl|NC_010576. 405 QVGGIN---------TPGQITSDQPA--------TASTDPLNNVSTSAIENGSL-TDGGSY 447 (447) Q Consensus 405 ~~~~~~---------~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~-~~~~~~ 447 (447) +.+... 
.+..+.+.+++ ...++|......++...+.+ .+.+-- T Consensus 475 ~~~~~~~p~~~~~~~~~~~~~~~q~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 535 (535) T protein:vir:10 475 GFGQPNVPDSSDDSGSTLGERERQERIQHSKDYEKGKDDPKSPLPKPSESDDVSNNEDADT 535 (535) T ss_pred ccccccCCCCCCCccccCCccccCcccccccccccCCCCCCCCCCcCCCCCccccccccCC Confidence 000000 00000000010 11112111111111111111 111111 No 75 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=100.00 E-value=1.6e-65 Score=375.81 Aligned_cols=375 Identities=12% Similarity=0.062 Sum_probs=257.7 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||+|+.+ .++....+...+..... +.+...... +..++...++++++|++||++||++||++|+++++ T Consensus 1 M~~f~~~~~------~~~~~~~~~~~~~~~~~---~~~~~~~~~-~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~~~~ 70 (386) T protein:vir:49 1 MPIFNITNL------ATESPPINQESFFDIAD---SDFLASLNS-SEWVSAENALKNSDLFSIISQLSNDLATAKITTSR 70 (386) T ss_pred Cchhhhhcc------CCCCcccchhhhhhhhh---ccccccccC-CceechhhhhccHHHHHHHHHHHHHhhhCceeecc Confidence 999988532 22222222211111110 111111122 23455667899999999999999999999999986 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecC-Cc Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFP-RQ 159 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 159 (447) ... +.|+.+||++||+++||+.++.+++++||||+++.++..+.+..++++ +.....+..... +. 
T Consensus 71 ~~~-------------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i-~~~~v~v~~~~~~~~ 136 (386) T protein:vir:49 71 KQL-------------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYL-RPSQVSFNRLDNQNG 136 (386) T ss_pred chh-------------hhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEe-cCceeEEEEcCCCce Confidence 432 235679999999999999999999999999999999887765544444 444333433322 23 Q ss_pred eEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHH-----HHHhhcCcccceeeeCCcCCh Q lcl|NC_010576. 160 VMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQ-----DNRASSGKLNGFIQFPYSTKS 234 (447) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~n~~~~~gvl~~~~~~~~ 234 (447) ..+.+............++.++|+|++.+.......+.+.+..+...+....++ ..+.||+.++|+|++++.+.+ T Consensus 137 ~~y~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~ 216 (386) T protein:vir:49 137 LYYNITFDDPHIAPKQHVPQNDILHFRLLSVDGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGLL 216 (386) T ss_pred EEEEEEEcCccccceeEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCCh Confidence 333333333334455678899999999764333222233444444444443332 335789999999999998887 Q ss_pred HHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcCC----cHHHH Q lcl|NC_010576. 235 TARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNGT----ANEQQ 309 (447) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g~----~~e~~ 309 (447) ++.++. ++.|. 
....|+|+++||++|++|+++++++++++ ++.+++++++||++|||||++|+++ .+.++ T Consensus 217 ~~~~~~----~~~~~-~~~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~ 291 (386) T protein:vir:49 217 DFKTKV----SRSRQ-AMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDGDQQSSLEM 291 (386) T ss_pred HHHHHH----HHHHH-HhccCCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccchHHH Confidence 655444 33443 34568899999999999999999998865 6889999999999999999999853 23466 Q ss_pred HHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Q lcl|NC_010576. 310 TLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPN 389 (447) Q Consensus 310 ~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g 389 (447) ..+|+..+|.|+++.|+++|+++|++ +++|+.+.+++.|.++++..+.+++++|++|+||+|++++..++.. T Consensus 292 ~~~~~~~~i~~~l~~i~~~~~~~l~~--------~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~~~~~ 363 (386) T protein:vir:49 292 IYNIYFKSVSRYLRPFVSEMSKKLSC--------EVDVDISPAVDPTGSNYISLINSMVKSGTLAQNQGLYILQQAEILP 363 (386) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcc--------hhcccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHhhCCCCC Confidence 78899999999999999999999863 5789999999999999999999999999999999999987665532 Q ss_pred ccccccccccccchhhcccccCCCCCCCCCC Q lcl|NC_010576. 390 PLANELFNRNIADGNQVGGINTPGQITSDQP 420 (447) Q Consensus 390 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 420 (447) . 
++...+ .......+|++.++++ T Consensus 364 ~---~~~~~~-----~~~~~~~~gGd~~~~~ 386 (386) T protein:vir:49 364 K---ELPDGK-----NPNRTSLKGGEINEQD 386 (386) T ss_pred C---cCcchh-----ccCCCCCCCCCCCCCC Confidence 1 111110 0001111222222222 No 76 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=100.00 E-value=3.9e-66 Score=379.19 Aligned_cols=368 Identities=11% Similarity=0.075 Sum_probs=243.8 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||+|+.+ ++. ++...+..... +.+...... ...++...++++++|++||++||++||++||++|+ T Consensus 1 Mg~f~~~~~------~~~---~~~~~~~~~~~---~~~~~~~~~-~~~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~ 67 (382) T protein:vir:48 1 MPIFNLATE------SPP---DNQGGFFDVVD---SDFLASLKG-NEWVSAETALRNSDLFSIINQLSNDLATVKLITSR 67 (382) T ss_pred Ccccccccc------CCc---ccccccccchh---hhccccccC-CcccchHhhhccHHHHHHHHHHHHhhccCceeeec Confidence 999998543 211 11111111100 111111112 23345567899999999999999999999999987 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeee-cCCc Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQF-FPRQ 159 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 159 (447) .. .+.|+.+||++||+++||+.++.+|+++||||+++.++..+....++++.+ ....+... ..+. 
T Consensus 68 ~~-------------~~~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~-~~v~v~~~~~~~~ 133 (382) T protein:vir:48 68 KK-------------LQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRP-SQVSFNRLDNKDG 133 (382) T ss_pred ch-------------hhhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcC-ceeEEEEcCCCCe Confidence 43 234678999999999999999999999999999999988776555444444 33333332 2233 Q ss_pred eEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCCcCCh Q lcl|NC_010576. 160 VMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPYSTKS 234 (447) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~~~~ 234 (447) ..+.+.......+....++.++|+|++.+.......+.+.+..+...+....+ ...+.||+.|+|+|++++.+.+ T Consensus 134 ~~y~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~ 213 (382) T protein:vir:48 134 IYYNITFDDPRIPPKQHVPQNDVLHFRLLSVDGGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKGGGLL 213 (382) T ss_pred EEEEEEecCccccceeEEcCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCh Confidence 44444433333345567889999999976543322223344444444443332 2335789999999999998887 Q ss_pred HHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC----CcHHHH Q lcl|NC_010576. 235 TARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG----TANEQQ 309 (447) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g----~~~e~~ 309 (447) ++.++. 
++.|.+ ...|+|+++||++|++|+++++++.+++ ++.+++++++||++|||||.+||+ ++++++ T Consensus 214 e~~~~~----~~~~~~-~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~~~ 288 (382) T protein:vir:48 214 DFKTKL----SRSRQA-MKQMQGGPLVLDDLEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGDQQSSLEM 288 (382) T ss_pred HHHHHH----HHHHHh-hccCCCCeeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHH Confidence 654443 444443 3457899999999999999999998865 688999999999999999999974 345788 Q ss_pred HHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-- Q lcl|NC_010576. 310 TLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPH-- 387 (447) Q Consensus 310 ~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~-- 387 (447) .++|++.||.|++++||++|+++|+++.+++. ...+ +.|........++++++|++|+||+|+.++..++ T Consensus 289 ~~~~~~~~l~p~~~~i~~~l~~~l~~~~~~~~--~~~~------~~~~~~~~~~~~~l~~~g~~t~~e~r~~l~~~g~~~ 360 (382) T protein:vir:48 289 SSDLYSKAVSRYLRPFLSELSQKLSCDVDADI--FPAV------DPTGSNYISRINSLVKTGTLAQNQGLYILQQAEILP 360 (382) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcChhhhhh--hhhh------ccchhHHHHHHHHHhhcCccCHHHHHHHHhhCCCCC Confidence 89999999999999999999999998755432 1222 2233344455667888889999999988743332 Q ss_pred -CCccccccccccccchhhcccccCCCCCCCCCC Q lcl|NC_010576. 388 -PNPLANELFNRNIADGNQVGGINTPGQITSDQP 420 (447) Q Consensus 388 -~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 420 (447) +-+.+++.. 
...+|++.++++ T Consensus 361 ~~~~~~~~~~------------~~~~GGd~~~~~ 382 (382) T protein:vir:48 361 KELPNGENPN------------STLKGGEEDGQD 382 (382) T ss_pred cchhhhhcCC------------CCCCCCCCCCCC Confidence 111111110 011122111111 No 77 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=100.00 E-value=2.6e-64 Score=369.19 Aligned_cols=342 Identities=13% Similarity=0.096 Sum_probs=226.0 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||+ .|++++... . ..+|.....++... ...+++...++++++|++||++||++||++|+. T Consensus 1 M~~~~-------~f~~r~~~~--~-----~~~~~~~~~~~~~~-~~~~v~~~~al~~~av~~cv~~ia~~ia~~p~~--- 62 (359) T protein:vir:10 1 MSILN-------PFERRSSIT--P-----NNYYPFMVQNGSIV-PNSLVDATEALKNSDLYAVTSLISSDIAGTRFI--- 62 (359) T ss_pred Ccccc-------hhhccccCC--C-----Ccchhhhhcccccc-CCcccCHHHhhcchHHHHHHHHHHHhhhcCccc--- Confidence 77765 455443211 1 11221111122222 233455667899999999999999999999983 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) +|+++++|+.+||++||+++||+.++.+++++||||+++.++..+.+... ++.+.....+.. ..+.. 
T Consensus 63 -----------~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~g~~~~l-~~l~~~~v~i~~-~~~~~ 129 (359) T protein:vir:10 63 -----------GNQVFTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGDNSLMKEL-RLIPSNAITIDL-TDDTL 129 (359) T ss_pred -----------cchHHHHHhhcccccCCHHHHHHHHHHhccccCceEEEEEECCCCeEEEE-EEeCCceEEEEE-cCCeE Confidence 46677777889999999999999999999999999999998887765544 444444433322 23333 Q ss_pred EEEEeeecccccceeeecccccccccccccc-----cccchhHHHHHHHHHHHHHHH-----HHHHhhcCcccceeeeCC Q lcl|NC_010576. 161 MVRVWNDNTGLEQDLLVSKENCIIIESPFYA-----ILNDTNQTLRMLEQKIKLMNS-----QDNRASSGKLNGFIQFPY 230 (447) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~-----~~~~~n~~~~~gvl~~~~ 230 (447) .+.+.. ........++.+||+|++.+..+ +.. +.+.+..+...+....+ ...++||++++|+|++++ T Consensus 130 ~y~~~~--~~~~~~~~~~~~evih~~~~~~~~~~~dg~~-G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~ 206 (359) T protein:vir:10 130 TYEVNQ--FDDYPSAKYNASEMIHVKIMAYGVDTLHNLV-GHSPLESLTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQ 206 (359) T ss_pred EEEEEe--cCCceEEEEcccceEEeccCCCCCCccCccc-cccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC Confidence 333322 22234557889999999965321 111 12333444444444332 233578999999999976 Q ss_pred -cCChHHHHHHHHHHHHHHHHHhc-cCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcCCc-- Q lcl|NC_010576. 231 -STKSTARAAQAARRKQEIENEMA-NNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNGTA-- 305 (447) Q Consensus 231 -~~~~~~~~~~~~~~~~~~~~~~~-~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g~~-- 305 (447) .+++++ ++++++.|++..+ .|+|+++||++|++|+++++++++++ ++.+++++++||++|||||++||+.. 
T Consensus 207 ~~l~~e~----~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~ 282 (359) T protein:vir:10 207 GTLSSEA----KDSIRKEFEKANGGNNSGRVMVLDQSADFSTVSINADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQ 282 (359) T ss_pred CCCCHHH----HHHHHHHHHHHhCccccCCceecCCCcceeeecCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCcc Confidence 455444 4455556654444 47899999999999999999998865 68899999999999999999997632 Q ss_pred --HHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhC Q lcl|NC_010576. 306 --NEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTG 383 (447) Q Consensus 306 --~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g 383 (447) +.++..+++..+|.|++..|+++|+.+|....+.+.+..++| |...+...+.+++++|++|+||+|+++| T Consensus 283 ~~~~~~~e~~~~~~l~~~l~p~~~~l~~~l~~~~~~~~~~~~~~--------d~~~~~~~~~~~~~~G~~t~NE~R~~l~ 354 (359) T protein:vir:10 283 QSSLDQIKDLYVNALNRFIEPLISELRIKCDSSIGVDMSPITDY--------SNSVFKADILNWVKEGIIEPTEAKTLLE 354 (359) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccchhhhhc--------CHHHHHHHHHHHHhCCCcCHHHHHHHhC Confidence 223333444444444444445555554443222333334444 4444455567899999999999999999 Q ss_pred CCCCC Q lcl|NC_010576. 384 KAPHP 388 (447) Q Consensus 384 l~p~~ 388 (447) ++|+= T Consensus 355 ~~pv~ 359 (359) T protein:vir:10 355 SKGII 359 (359) T ss_pred CCCCC Confidence 99996 No 78 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=100.00 E-value=1.6e-63 Score=364.81 Aligned_cols=431 Identities=10% Similarity=0.058 Sum_probs=255.4 Q ss_pred CchhHhhhh-------hcc---------cccCCccc--------cccccccccccccccccccccccCCcccc------- Q lcl|NC_010576. 1 MASSDRLLH-------SWN---------AFQSNQNQ--------NQNTNDFLTPSNGMTSFGGYYGRGQSNYS------- 49 (447) Q Consensus 1 Mg~~~~l~~-------~~~---------~f~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------- 49 (447) ||+|+|++. 
+++ ++.++-.. .++.. ..++.-+...+..++.. .+.+. T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a-~~~~~~~~~~~~~~~~~-r~~~~~~~~l~~ 82 (551) T protein:vir:80 5 LGLFESIRLVGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVA-YSQPVIGSMSANPGFKT-KPSIRNNQDLHG 82 (551) T ss_pred hhhHHHhhhccCChhhcccccccccceeeecccccHHHHHHhhccCcce-eecccccceecCccccc-CccccChhHHHH Confidence 999999882 222 11111000 00111 01111111112222221 11111 Q ss_pred cchhhhhhHHHHHHHHHHHHhhccCc-----------eEEEEEcCCCcee----ccccchHHHHHhhhcCccc-----CH Q lcl|NC_010576. 50 RSYSYNKADLIKSVITRIALDASMVD-----------FKHLKIDPISGNQ----TPMPSGLINVLTRSANIDQ-----TG 109 (447) Q Consensus 50 ~~~~~~~~~~v~~cv~~ia~~ia~lp-----------~~~~r~~~~~~~~----~~~~~~l~~lL~~~PN~~~-----t~ 109 (447) ..+.|..+++|++||+.||+.||+++ |.+ |.++.+... ...-+.+..+|+ +||+.+ |+ T Consensus 83 ~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i-~~kd~~~~~~~~~~~~~~~i~~~l~-~pn~~~~p~~~s~ 160 (551) T protein:vir:80 83 VLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEV-RLKDLDKKPTSHDEATIKRIESFIE-KTGVDNDINRDSF 160 (551) T ss_pred HHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceE-EecccCcccChhHHHHHHHHHHHHH-hcCCCCCCccchH Confidence 12346678999999999999999854 442 222212111 112234566665 899874 88 Q ss_pred HHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCceE--EEEeeecccccceeeecccccccccc Q lcl|NC_010576. 110 RSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQVM--VRVWNDNTGLEQDLLVSKENCIIIES 187 (447) Q Consensus 110 ~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~v~~~~~ 187 (447) .+|++.++.+++++||||+++.++..+.+..++++ ++....+.....+... ...|...........++.++|+|++. 
T Consensus 161 ~~f~~~lv~dlll~Gnay~~i~rd~~G~~~~L~~l-~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~eiiH~~~ 239 (551) T protein:vir:80 161 SSFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAK-DPTTIFFATTADGKIPDNGNRFVQVIDQKIVATFNAREMAFAVR 239 (551) T ss_pred HHHHHHHHHHHHhcCCEEEEEEECCCCcEEEEEEe-CCceeEEEECCccccccCceEEEEEeCCcEEEEEcccceEEecc Confidence 89999999999999999999999887765544444 4444344332222110 11111122223334678899999974 Q ss_pred -ccccc--ccchhHHHHHHHHHHHHHH-----HHHHHhhcCcccceeeeCCc--CChHHHHHHHHHHHHHHHHHhcc--C Q lcl|NC_010576. 188 -PFYAI--LNDTNQTLRMLEQKIKLMN-----SQDNRASSGKLNGFIQFPYS--TKSTARAAQAARRKQEIENEMAN--N 255 (447) Q Consensus 188 -~~~~~--~~~~~~~~~~~~~~~~~~~-----~~~~~~n~~~~~gvl~~~~~--~~~~~~~~~~~~~~~~~~~~~~~--n 255 (447) ++... ...+.+.+..+...+.... +...+.||++|+|+|+++.. ++ +++++++++.|.+.++| | T Consensus 240 n~~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~lt----~e~~~~lk~~~~~~~~G~~n 315 (551) T protein:vir:80 240 NPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQS----QHALEIFKREWKNSLSGING 315 (551) T ss_pred cCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCCCC----HHHHHHHHHHHHHHhcCccc Confidence 22111 0111223333333333322 23335789999999988654 43 34566777777776654 7 Q ss_pred Ccceeec-CCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC----------------CcHHHHHHHHHHHH Q lcl|NC_010576. 256 KYGVATL-DTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG----------------TANEQQTLGYYNRC 317 (447) Q Consensus 256 ~~~~~vl-~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g----------------~~~e~~~~~f~~~t 317 (447) +|++++| ++|++|+++++++.+++ ++.+++++++||++|||||++||. 
++.|++..+|+++| T Consensus 316 ag~~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~t 395 (551) T protein:vir:80 316 SWQIPVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKG 395 (551) T ss_pred cCccccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccchhhHHHHHHHHHHHH Confidence 8887665 68999999999998865 688999999999999999999962 23478888999999 Q ss_pred HhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC-CCCccccccc Q lcl|NC_010576. 318 VDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAP-HPNPLANELF 396 (447) Q Consensus 318 i~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p-~~g~~~~~~~ 396 (447) |+||+++||++||++|++.. +.+++|+++.+++.+.+++++++. ++.+|+||+||+|+++|||| ++| ||.++ T Consensus 396 L~P~~~~ie~~ln~~L~~~~----~~~~~f~f~~~~~~~~~~~~~~~~-~~~~g~lT~NE~R~~~gl~P~~eg--GD~~~ 468 (551) T protein:vir:80 396 LQPLLGFIEDFINKHIVAEF----GDKYTFQFVGGDIKSELESVKILA-EKAKVAMTVNEVRKELNLPGDVIG--GDIPL 468 (551) T ss_pred HHHHHHHHHHHHHhhhcccc----CCceEEEeeccChhhHHHHHHHHH-HHhcCCcCHHHHHHHhCCCCCCCC--Cceee Confidence 99999999999999999752 345667777888889888888765 66789999999999999998 566 45543 Q ss_pred -cccccchhhcccccCCC-------------------CCCCCCCCcCCCCCCCcccccccCCccCcCCCCC Q lcl|NC_010576. 397 -NRNIADGNQVGGINTPG-------------------QITSDQPATASTDPLNNVSTSAIENGSLTDGGSY 447 (447) Q Consensus 397 -~~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (447) +.++.+..+......+. 
+...++++..+++..++++++..++...+..|.- T Consensus 469 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 539 (551) T protein:vir:80 469 NGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTTGDIGKDGQRKDKDNANAGKQ 539 (551) T ss_pred cccccccccccccccCcchhhhhhccccccCcCCCCCCCCCCCCCCccccCCCccccccccCccccchhhh Confidence 44444333221111100 0000111111111222223333333322222211 No 79 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=100.00 E-value=5.4e-63 Score=361.96 Aligned_cols=432 Identities=10% Similarity=0.017 Sum_probs=252.2 Q ss_pred CchhHhhhhhccc--ccCC------cccccccccccccccccccccccccc-----------------CCcccccchhhh Q lcl|NC_010576. 1 MASSDRLLHSWNA--FQSN------QNQNQNTNDFLTPSNGMTSFGGYYGR-----------------GQSNYSRSYSYN 55 (447) Q Consensus 1 Mg~~~~l~~~~~~--f~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~~~~~~~ 55 (447) |++-+=-.+..++ |.+. ........+.. ....++.+++.. ....++..-..+ T Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~iv~~~i~~ 103 (574) T protein:vir:80 27 MHLREIDTNVVNNEPYSMESIEKGMNGKTTAYMQPI---IGEMSVNPGYKTKPSIRNSQDLHKTLKKFGNNIILNAIINT 103 (574) T ss_pred cccchhhhhhhhccCCCHHHHHHhHhhhcccccchh---hhhccccccccCcCccCCcccHHHHHHhhccChhHHHHHHH Confidence 4433221111111 1100 00000110000 000011110000 011222222344 Q ss_pred hhHHHHHHHHHHHHhhccCceEEEEEcCCC---ceeccccchHHHHHhh---hcCccc-CHHHHHHHHHHHHHhcCCeeE Q lcl|NC_010576. 56 KADLIKSVITRIALDASMVDFKHLKIDPIS---GNQTPMPSGLINVLTR---SANIDQ-TGRSFVFDLLYSLLDEGQIAM 128 (447) Q Consensus 56 ~~~~v~~cv~~ia~~ia~lp~~~~r~~~~~---~~~~~~~~~l~~lL~~---~PN~~~-t~~~f~~~~~~~lll~Gna~i 128 (447) ....|++|+.+|+.++|+|||+|++++.++ .+.....|++..+|.. 
.|||++ |+.+||+.++.+++++||+|+ T Consensus 104 ~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv~~lll~Gnayi 183 (574) T protein:vir:80 104 RSNQVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQFRDPNRDNFTTFCKKLVRATYMYDQVNF 183 (574) T ss_pred HHHHHHHHHHHHHhhhccCceEEEEeccCCCccchhhhhhhHHHHHHhccCCCCCCccccHHHHHHHHHHHHHhcCCeEE Confidence 566789999999999999999999876553 2334578999998864 366765 788999999999999999999 Q ss_pred EEeeccCCcccceeeeccCCCcceeeecCCce--EEEEeeecccccceeeecccccccccc-cccccc--cchhHHHHHH Q lcl|NC_010576. 129 VPIDTTVDPDSGSFDINTARVGKIMQFFPRQV--MVRVWNDNTGLEQDLLVSKENCIIIES-PFYAIL--NDTNQTLRML 203 (447) Q Consensus 129 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~v~~~~~-~~~~~~--~~~~~~~~~~ 203 (447) ++.++..+.+..++++.+ ....+.....+.+ ....|...........++.++|+|++. +..... ..+.+.+..+ T Consensus 184 ~i~r~~~G~~~~L~pl~p-~~V~v~~d~~~~~~~~~~~y~~~~~g~~~~~~~~~eiih~~~~~~~~~~~~~~G~spi~~a 262 (574) T protein:vir:80 184 EKVFDKDGNFIKFDTVDP-TTIFLATNGEGKLIKNGERFVQVIDNRIVAKFNERELAFAVRNPRADIEVGQYGYPELEIA 262 (574) T ss_pred EEEECCCCcEEEEEEEcC-ceeEEEEcCccccccCceEEEEEeCCceEEEEccccEEEEeccCCCCcccccccccHHHHH Confidence 999987776554444444 3333332211111 011122222333445678899999974 221110 0111233333 Q ss_pred HHHHHHHHH-----HHHHhhcCcccceeeeCCc--CChHHHHHHHHHHHHHHHHHhcc--CCcce-eecCCCceeeecCC Q lcl|NC_010576. 204 EQKIKLMNS-----QDNRASSGKLNGFIQFPYS--TKSTARAAQAARRKQEIENEMAN--NKYGV-ATLDTQEKFVSAGM 273 (447) Q Consensus 204 ~~~~~~~~~-----~~~~~n~~~~~gvl~~~~~--~~~~~~~~~~~~~~~~~~~~~~~--n~~~~-~vl~~g~~~~~l~~ 273 (447) ...+..+.+ ...+.||+.++|||+++.. 
+++ ++++++++.|.+.+.+ |+|++ +++++|++|+++++ T Consensus 263 ~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~~~~~ls~----e~~~~lk~~~~~~~~G~~n~g~~~vl~~~G~~~~~l~~ 338 (574) T protein:vir:80 263 LKQFIAHENTEVFNDRFFSHGGTTRGILHVKTGQQQSQ----QALDIFRREWRSSLAGINGSWQIPVVSAEDVKFVNMTP 338 (574) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCH----HHHHHHHHHHHHHhccccccccceeecCCCceEEEccC Confidence 333333322 3335789999999988643 444 4555667777666654 77886 45578999999999 Q ss_pred Chhhhh-HHHHHHHHHHHHHHhCCCHHHhcC----------------CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCCh Q lcl|NC_010576. 274 GLQNNL-LSDVRQLQQDFYNQMGITEAILNG----------------TANEQQTLGYYNRCVDVLLQYVTDAISRIALTK 336 (447) Q Consensus 274 ~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g----------------~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~ 336 (447) ++.+++ ++.+++++++||++|||||++||. ++.|++.++|+++||.||+++||++||++||+. T Consensus 339 s~~D~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~ 418 (574) T protein:vir:80 339 SANDMQFEKWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSKEKMQASQNKGLQPLLRFIEDTVNTYIVAE 418 (574) T ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh Confidence 998865 688999999999999999999962 235889999999999999999999999999975 Q ss_pred hHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccc-cccccccchhhcccccCCC-- Q lcl|NC_010576. 337 TAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANE-LFNRNIADGNQVGGINTPG-- 413 (447) Q Consensus 337 ~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~-~~~~~~~~~~~~~~~~~~~-- 413 (447) . ..+++++|+..++++.+.+. . ...++.+||||+||+|+++||||++|+ |. +++.++.+..+........ 
T Consensus 419 ~--~~~~~~~f~~~d~~~~~~~~--~-~~~~~~~G~lT~NE~R~~lgl~Pi~gG--D~~~~~~n~~~~~~~~~~~~~~~~ 491 (574) T protein:vir:80 419 F--GEKYQFQFRGGDLSAQLDKL--K-IIEQEGKVFRTVNEIRHDKGLEPIKGG--DVILNGVHIQAIGQALQEEQLEYQ 491 (574) T ss_pred c--CCceEEEecccchhhHHHHH--H-HHHHHhCCccCHHHHHHHhCCCCCCCC--CEeeeccceeecccccccccCCcc Confidence 3 45678888866665433222 2 235788999999999999999999874 55 4566666554332111100 Q ss_pred CCCCCCCCcCCCCCCCcccccccC-CccCc-------------------CCCCC Q lcl|NC_010576. 414 QITSDQPATASTDPLNNVSTSAIE-NGSLT-------------------DGGSY 447 (447) Q Consensus 414 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-------------------~~~~~ 447 (447) ......+........+.++++.++ .++.. ..|+. T Consensus 492 ~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 545 (574) T protein:vir:80 492 RSQDRLNRLLELSGGDVEQPEPEEPKDSQNDTDVSFQDEQQGLNGKSKKVNGKV 545 (574) T ss_pred chhccccccccccCCCCCCCCCCCCCCccccccchhhhhhhhhccchhhhcCCc Confidence 000000000000001111111111 11111 12222 No 80 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=100.00 E-value=2.1e-62 Score=358.75 Aligned_cols=387 Identities=10% Similarity=0.024 Sum_probs=246.4 Q ss_pred chhhh-hhHHHHHHHHHHHHhhccCceEEEEEcCCC---ceeccccchHHHHHhhhcCccc--------CHHHHHHHHHH Q lcl|NC_010576. 51 SYSYN-KADLIKSVITRIALDASMVDFKHLKIDPIS---GNQTPMPSGLINVLTRSANIDQ--------TGRSFVFDLLY 118 (447) Q Consensus 51 ~~~~~-~~~~v~~cv~~ia~~ia~lp~~~~r~~~~~---~~~~~~~~~l~~lL~~~PN~~~--------t~~~f~~~~~~ 118 (447) .+.+. .+++|++||++||++||++||+++.+.... ......++....+++.+||+.| |..+||+.++. 
T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~~ 80 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTAWT 80 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHHHH Confidence 33333 479999999999999999999988654321 1222234445567778899876 55689999999 Q ss_pred HHHhcCCeeEEEeeccCCcccceeeeccCCCcceeee-------cCCc-e-----------------EEEEee-eccccc Q lcl|NC_010576. 119 SLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQF-------FPRQ-V-----------------MVRVWN-DNTGLE 172 (447) Q Consensus 119 ~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~-~-----------------~~~~~~-~~~~~~ 172 (447) +++++||||+++.++..+.+..+.++.+..+ .+..+ .... . ...++. ...... T Consensus 81 ~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v-~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (467) T protein:vir:31 81 DYEAIGWLTIEILTQTDGTPTGLAYVPGHTI-RKRMDERGFVQLLEEKEKYFGVAGDRYQTNGNGDLDPVFVDADDGSTG 159 (467) T ss_pred HHHhcCCeEEEEEECCCCcEEEEEEeCCcee-EeeeecceeEeecCCceeeEEeccccceeecccceeeeeeeecccccc Confidence 9999999999999988776554444433322 22111 0100 0 111111 111223 Q ss_pred ceeeecccccccccccc--cc--cccchhHHHHHHHHHHHHH-HHHHHHhhcCcccceeeeC-CcCChHHHHHHHHHHHH Q lcl|NC_010576. 173 QDLLVSKENCIIIESPF--YA--ILNDTNQTLRMLEQKIKLM-NSQDNRASSGKLNGFIQFP-YSTKSTARAAQAARRKQ 246 (447) Q Consensus 173 ~~~~~~~~~v~~~~~~~--~~--~~~~~~~~~~~~~~~~~~~-~~~~~~~n~~~~~gvl~~~-~~~~~~~~~~~~~~~~~ 246 (447) ..+.++.++|+|++.+. .. +.+.....+..+....... .+...+.||+.++|+|+++ ..+++++.++.++.|.. 
T Consensus 160 ~~~~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~ 239 (467) T protein:vir:31 160 TSVSNPANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFENDGVPRIAIIVKGAELTEKGREEMRNLIED 239 (467) T ss_pred ceeEeccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHHHHh Confidence 44568889999998653 11 1122222222222222222 2233457899999999875 45666655555554444 Q ss_pred HHHHH---------hccCCcceeecCCCceeeecCC--------Chhhhh-HHHHHHHHHHHHHHhCCCHHHhcC----- Q lcl|NC_010576. 247 EIENE---------MANNKYGVATLDTQEKFVSAGM--------GLQNNL-LSDVRQLQQDFYNQMGITEAILNG----- 303 (447) Q Consensus 247 ~~~~~---------~~~n~~~~~vl~~g~~~~~l~~--------~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g----- 303 (447) .+.+. ...|++++++++.|++++++++ ++++++ ++.+++.+++||++|||||++||. T Consensus 240 ~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~ 319 (467) T protein:vir:31 240 NNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLEFRGRNEHDILKVHDVPPVIAGVVESGA 319 (467) T ss_pred hhcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhhHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCC Confidence 33221 1236788889988876655543 455655 688899999999999999999962 Q ss_pred --CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHH Q lcl|NC_010576. 
304 --TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIREL 381 (447) Q Consensus 304 --~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~ 381 (447) ++.|++...|+++||.|+++.||++||++|++..+...+++|+|+++.+++.|.+++++++.+++++|++|+||+|++ T Consensus 320 ~~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~~~~~~~~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~ 399 (467) T protein:vir:31 320 FSTDAEEQRKEFAEETIQPKQHDFGELLYELVHKQGLDAPDWTIEFELAKPDTKLQDVEIASQRVQAMQGLLTVNELRDE 399 (467) T ss_pred cccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhccCCceEEEecchhhccCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 346899999999999999999999999999998777788999999999999999999999999999999999999999 Q ss_pred hCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCC-----------CCCcccccccCCccCcCC Q lcl|NC_010576. 382 TGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTD-----------PLNNVSTSAIENGSLTDG 444 (447) Q Consensus 382 ~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~ 444 (447) +||||++++. +.+..... ........+++..+++..+...+ ..+.++.- +-|.--|. T Consensus 400 ~Gl~pi~d~~---~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 467 (467) T protein:vir:31 400 FGFEPFPEEH---VYGGETLV-AEVTGGSGPGGGIGDQIEQLVEDRADEIIDSYQADLETEQLI--EIGANADS 467 (467) T ss_pred hCCCCCCccc---ccCCcccc-cccccccCCCCcccCcCCCCCCCcccchHhhhhhccccchhh--hhccccCC Confidence 9999996532 22211111 11111111111111111111111 11112211 11111111 No 81 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=100.00 E-value=4.6e-62 Score=356.87 Aligned_cols=432 Identities=10% Similarity=0.055 Sum_probs=255.2 Q ss_pred CchhHhhhhhcc---cccCCccc---------------c-----ccccccccccccccccccccccCCcccc------cc Q lcl|NC_010576. 
1 MASSDRLLHSWN---AFQSNQNQ---------------N-----QNTNDFLTPSNGMTSFGGYYGRGQSNYS------RS 51 (447) Q Consensus 1 Mg~~~~l~~~~~---~f~~~~~~---------------~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~ 51 (447) ||||+|+++.+. .|..+-.. . .......++..+..++..++..-..... .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~~l~ 80 (547) T protein:vir:63 1 MGLFESIRLAGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLHGVL 80 (547) T ss_pred CchhhhhhhhcCCccccccccccccccchhhhhhhHHHHHHhhcccchhhhchhhheeecccccccCCccCChhHHHHHH Confidence 999999987543 11111000 0 0000111222222233222221111111 12 Q ss_pred hhhhhhHHHHHHHHHHHHhhccCc-----------eEEEEEcCCC----ceeccccchHHHHHhhhcCccc-----CHHH Q lcl|NC_010576. 52 YSYNKADLIKSVITRIALDASMVD-----------FKHLKIDPIS----GNQTPMPSGLINVLTRSANIDQ-----TGRS 111 (447) Q Consensus 52 ~~~~~~~~v~~cv~~ia~~ia~lp-----------~~~~r~~~~~----~~~~~~~~~l~~lL~~~PN~~~-----t~~~ 111 (447) +.|..+++|++||+.||+.||.+. |. +|.+... ......-+.+..+|+ +||+++ |+++ T Consensus 81 ~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~-ir~k~~~~~~~~~~~~~~~~l~~~l~-~pn~~~~p~~~s~~~ 158 (547) T protein:vir:63 81 KKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFE-VRLKDLDKKPTSHDEATIKRIESFIE-KTGVDNDINRDSFSS 158 (547) T ss_pred HHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCce-eEecccccccChhhHHHHHHHHHHHH-hhCCCCCCccchHHH Confidence 346678999999999999999752 22 2221111 111122345667775 799874 8899 Q ss_pred HHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce---EEEEeeecccccceeeecccccccccc- Q lcl|NC_010576. 112 FVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV---MVRVWNDNTGLEQDLLVSKENCIIIES- 187 (447) Q Consensus 112 f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~v~~~~~- 187 (447) ||+.++.+++++||+|+++.++..+.+.. ++++++....+.....+.. ...++ ..........++.++|+|++. 
T Consensus 159 f~~~lv~d~ll~Gn~~~~i~rd~~G~~~~-L~~l~p~~V~~~~~~~g~~~~~~~~y~-~~~~~~~~~~~~~~eiih~r~n 236 (547) T protein:vir:63 159 FVKKIVRDTYMYDQVNFEKVFNRNQSMVR-FVAKDPTTIFFATTADGKIPDNGNRFV-QVIDQKIVATFNAREMAFAVRN 236 (547) T ss_pred HHHHHHHHHHhhCCEEEEEEECCCCcEEE-EEEecCceeEEEECCccccccCceEEE-EEcCCcEEEEeccccEEEeccc Confidence 99999999999999999999988776544 4444444434332222211 01111 112222334678899999984 Q ss_pred cccccc--cchhHHHHHHHHHHHHHH-----HHHHHhhcCcccceeeeCCc--CChHHHHHHHHHHHHHHHHHhcc--CC Q lcl|NC_010576. 188 PFYAIL--NDTNQTLRMLEQKIKLMN-----SQDNRASSGKLNGFIQFPYS--TKSTARAAQAARRKQEIENEMAN--NK 256 (447) Q Consensus 188 ~~~~~~--~~~~~~~~~~~~~~~~~~-----~~~~~~n~~~~~gvl~~~~~--~~~~~~~~~~~~~~~~~~~~~~~--n~ 256 (447) +..+.. ..+.+.+..+...+.... +...+.||+.|+|+|+++.. ++ +++++++++.|.+.+.| |+ T Consensus 237 ~~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~ls----~e~~~~lk~~~~~~~~G~~na 312 (547) T protein:vir:63 237 PRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQS----QHALEIFKREWKNSLSGINGS 312 (547) T ss_pred CCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCCCC----HHHHHHHHHHHHHHhcCcccc Confidence 222111 112233333333333322 23345789999999988654 44 34556677777776654 78 Q ss_pred cceeec-CCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC----------------CcHHHHHHHHHHHHH Q lcl|NC_010576. 257 YGVATL-DTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG----------------TANEQQTLGYYNRCV 318 (447) Q Consensus 257 ~~~~vl-~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g----------------~~~e~~~~~f~~~ti 318 (447) |++++| ++|++|+++++++++++ ++.+++++++||++|||||++||. 
++.|++..+|+++|| T Consensus 313 gk~~vl~~~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~~~tL 392 (547) T protein:vir:63 313 WQIPVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGL 392 (547) T ss_pred cccccccCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHH Confidence 887655 68999999999998865 688999999999999999999962 235888899999999 Q ss_pred hHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC-CCCccccccc- Q lcl|NC_010576. 319 DVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAP-HPNPLANELF- 396 (447) Q Consensus 319 ~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p-~~g~~~~~~~- 396 (447) .||++.||++||++|++.. +.+++|+++.+++.+..++++++. ++.+|+||+||+|+++|||| ++| ||.++ T Consensus 393 ~P~~~~ie~~ln~~L~~~~----~~~~~~~f~~~~~~~~~~~~~~~~-~~~~g~lT~NE~R~~~gl~P~~eg--GD~~~~ 465 (547) T protein:vir:63 393 QPLLGFIEDFINKHIVAEF----GDKYTFQFVGGDIKSELESVKILA-EKAKVAMTVNEVRKELNLPGDVIG--GDIPLN 465 (547) T ss_pred HHHHHHHHHHHHhhccccc----CCceEEEeeccccccHHHHHHHHH-HHhCCCcCHHHHHHHhCCCCCCCC--Cceeec Confidence 9999999999999999742 334566667888888888888764 67789999999999999998 566 45543 Q ss_pred cccccchhhcccccCCC-------------------CCCCCCCCcCCCCCCCcccccccCCccCcCCCCC Q lcl|NC_010576. 397 NRNIADGNQVGGINTPG-------------------QITSDQPATASTDPLNNVSTSAIENGSLTDGGSY 447 (447) Q Consensus 397 ~~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (447) +.++.+..+......+. 
...+++.+..++...+.+.++..++...++.|.- T Consensus 466 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 535 (547) T protein:vir:63 466 GVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTTGDIGKDGQRKDKDNANAGKQ 535 (547) T ss_pred ccccccccccccccCCccccchhhccccccccCCCCCCCCCCCCCCcccCCCcCccccccCccccchhhh Confidence 33443332211110000 0001111111111122222222222222222211 No 82 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=100.00 E-value=6e-62 Score=356.22 Aligned_cols=419 Identities=11% Similarity=-0.023 Sum_probs=253.9 Q ss_pred hhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEEEc Q lcl|NC_010576. 3 SSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLKID 82 (447) Q Consensus 3 ~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r~~ 82 (447) +|+..++...+.+++....+.... ...+...+.+.....-......+.+..+++|++||++||++||++||++++.+ T Consensus 1 ~~~~~~~i~s~~~~~~i~~~~~~s---~~~~~~~~~~~~~pp~~~~~la~l~~~n~~v~scI~~ia~~IA~l~~~~~~~~ 77 (542) T protein:vir:41 1 MFNYHLSIRSLEKYKAIKREEVES---QALGETRFEEYVEPKVNPLVLLSLLQVNPYHASACSIKANDIIRTGYILEGDD 77 (542) T ss_pred Cccccccccccccchhhhhccccc---cccccccCCccccCCCCHHHHHHHHhhcHHHHHHHHHHHHHHhhCceeeeccc Confidence 677666655555544333222221 11111111111111101111123456689999999999999999999976421 Q ss_pred CCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeee------- Q lcl|NC_010576. 83 PISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQF------- 155 (447) Q Consensus 83 ~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~------- 155 (447) . + .+++..||++||+++||+.++.+++++||||+++.++..+.+..++++ +.....+... 
T Consensus 78 ~---------~---~l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~~~~L~~l-~~~~v~v~~d~~~~~~~ 144 (542) T protein:vir:41 78 E---------G---VVDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGDPIRFEYI-PSHTIRVHKDGSRYRQT 144 (542) T ss_pred c---------h---hhhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEE-cCcceEEEEcCCeeEee Confidence 1 1 234456999999999999999999999999999999887755544444 4433333221 Q ss_pred cCC--ceEEEEeeecc-----cccceeeeccccccccccccc-ccccchhHHHHHHHHHHHHH-----HHHHHHhhcCcc Q lcl|NC_010576. 156 FPR--QVMVRVWNDNT-----GLEQDLLVSKENCIIIESPFY-AILNDTNQTLRMLEQKIKLM-----NSQDNRASSGKL 222 (447) Q Consensus 156 ~~~--~~~~~~~~~~~-----~~~~~~~~~~~~v~~~~~~~~-~~~~~~~~~~~~~~~~~~~~-----~~~~~~~n~~~~ 222 (447) ... ......|.... .......++..+|+|++.+.. +... +.+.+..+..++... .+...|.||+.| T Consensus 145 ~~~~~~~~~~~y~~~~~~~~~~g~~~~~~~~~eIiHir~~~~~~~~~-Glspi~~~~~~i~~~~~~~~~~~~~f~Ng~~p 223 (542) T protein:vir:41 145 WDGVNITHFKDYRYEGEINPETGEDQDSVGANELVFIHIPSPVCSYY-GVPRYVSAAPAILAMQKIDEYNYAFFDNYTIP 223 (542) T ss_pred ecCCcceeEEeecccccccccccccccccCcccEEEecCCCCCCCcc-cccHHHHHHHHHHHHHHHHHHHHHHHhccCCc Confidence 001 01111111110 001122467789999996531 1111 112222233333222 233346789999 Q ss_pred cceeeeCCcCChHH------HHHHHHHHHHHHHHHhc---cCCcceeecC------CCceeeecCCChhhhh-HHHHHHH Q lcl|NC_010576. 223 NGFIQFPYSTKSTA------RAAQAARRKQEIENEMA---NNKYGVATLD------TQEKFVSAGMGLQNNL-LSDVRQL 286 (447) Q Consensus 223 ~gvl~~~~~~~~~~------~~~~~~~~~~~~~~~~~---~n~~~~~vl~------~g~~~~~l~~~~~~~~-l~~~~~~ 286 (447) +|||++++.+.++. .+++.+++++.|.+.+. 
+|+|+++||+ +|++|+++++++++++ ++.++++ T Consensus 224 ~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL~~~~~~~~g~~~~pl~~~~~d~qfle~~~~~ 303 (542) T protein:vir:41 224 SYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVFSIPGGDTVKVTFTPLNTSQKELSFREYAAEK 303 (542) T ss_pred cEEEEeCCccccccccccccCHHHHHHHHHHHHHHHhhhhcccCceeEeeccCCcccceeEEEcCCChhHHHHHHHHHHH Confidence 99999987654322 24556666777766543 4778899984 7999999999988865 6889999 Q ss_pred HHHHHHHhCCCHHHhcC--------CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHH Q lcl|NC_010576. 287 QQDFYNQMGITEAILNG--------TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVE 358 (447) Q Consensus 287 ~~~Ia~~fgVP~~~l~g--------~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~ 358 (447) +++||++|||||++||. ++.|++...|+++||.|++++||++||++|+++.+ .+++++|+.+.+++.|.+ T Consensus 304 ~~~Ia~afgVPp~~lG~~~~~t~n~sn~Eq~~~~f~~~tL~P~~~~ie~~ln~~L~~~~~--~~~~~~f~~~~ll~~d~~ 381 (542) T protein:vir:41 304 KYDIAAAHMIDPYRLGIADTGPLGGNFAEVTRRTYYESVVRPQQNIISSILTDFFQVKFN--PKTRFKFNDETLLESDSV 381 (542) T ss_pred HHHHHHHhCCCHHHhCcCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccC--CceEEEecchhhcchHHH Confidence 99999999999999962 34589999999999999999999999999988654 468999999999998754 Q ss_pred HHHHHHHHHHhCCCcCHHHHHHH-hCCCCCCCccccccccccccchhhcccccCCCC---CCCCCCCcCCCCCCCccccc Q lcl|NC_010576. 359 QLATVADVLTRNAIYTPNEIREL-TGKAPHPNPLANELFNRNIADGNQVGGINTPGQ---ITSDQPATASTDPLNNVSTS 434 (447) Q Consensus 359 ~~~~~~~~~~~~G~~t~NE~R~~-~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~ 434 (447) + .+.+++++|+||+||+|+. .|++|.+++ .+.+.++.. .+......+.. ..+.+.......|.-++... 
T Consensus 382 ~---~~~~~v~~GilT~NE~Re~L~g~~pgdd~---~l~p~~~~~-~~~~~~~~n~~~~~~~~~~k~~~k~~~~~~~~~~ 454 (542) T protein:vir:41 382 R---NCALLVQSGVLTPAEARERLFGLDGGPDI---FMVPSKGAA-KSVKRQERNYEKNQIREIRKIYAKYRPRFNEIIS 454 (542) T ss_pred H---HHHHHHhCCCCCHHHHHHhhCCCCCCCcc---ccccccccc-cccccCCcCCCCCchhhhhhcccccCcccccccc Confidence 4 4667899999999999985 366654332 122333321 11111111111 11111111111122222222 Q ss_pred ccCCccCcCCCCC Q lcl|NC_010576. 435 AIENGSLTDGGSY 447 (447) Q Consensus 435 ~~~~~~~~~~~~~ 447 (447) ...+..+.+++.+ T Consensus 455 ~~~~~~~~~~~~~ 467 (542) T protein:vir:41 455 SKLSAEEKKKKID 467 (542) T ss_pred ccccchhhccccc Confidence 3333344444444 No 83 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=100.00 E-value=1.4e-61 Score=354.21 Aligned_cols=412 Identities=11% Similarity=0.029 Sum_probs=249.9 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) |.+.+. -+ +.+.++... +... +...++......-....-...+..+++|++||++||++||++|++++. T Consensus 6 ~~~~~~-~~-~~~~~~~~~---~~~~------~~~~~~~~~~pp~~~~~La~~~~~n~~v~scI~~ia~~ia~~~~~i~~ 74 (540) T protein:vir:41 6 LSIKSL-EK-YRAIKGDTD---SQAL------KEDRFEEYVEPKVHPLVLLSLLQVNPYHASACSIKANDILRTGYLIDG 74 (540) T ss_pred cChhhc-cc-hhhhhcccc---cccc------ccCCCCccccCCCCHHHHHHHHHhcHHHHHHHHHHHHHHhcCCceEec Confidence 666552 11 222222111 1111 111111111111001111345667899999999999999999998753 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeee----- Q lcl|NC_010576. 
81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQF----- 155 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~----- 155 (447) + ++.+..+ .||++||+++||+.++.+++++||||+++.++..+.+..++++ ++...++... T Consensus 75 ~----------~~~~~~~---lpN~~~t~~~f~~~~v~dlll~Gnayv~i~r~~~G~~~~L~~i-~~~~V~v~~~~~~~~ 140 (540) T protein:vir:41 75 D----------DGGVEEL---LRACRPSFEFILLQALEDLQVFNYCTLEVVRDDQGEPVRLDYI-PAHTVRVHRDGSRYM 140 (540) T ss_pred C----------ccchhhh---ccCCCCCHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEe-CCcceEEeEcCceeE Confidence 2 2334443 4999999999999999999999999999999887655554444 4433333221 Q ss_pred --cCCceE--EEEeee-----cccccceeeecccccccccccc--ccc--ccchhHHHHHHHHHHHHH-HHHHHHhhcCc Q lcl|NC_010576. 156 --FPRQVM--VRVWND-----NTGLEQDLLVSKENCIIIESPF--YAI--LNDTNQTLRMLEQKIKLM-NSQDNRASSGK 221 (447) Q Consensus 156 --~~~~~~--~~~~~~-----~~~~~~~~~~~~~~v~~~~~~~--~~~--~~~~~~~~~~~~~~~~~~-~~~~~~~n~~~ 221 (447) .++... +..|.. .........++.++|||++.+. .+. .+.+......+....... .+...+.||+. T Consensus 141 ~~~d~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~eViHir~~~~~~~~~G~Spi~~~~~~i~~~~~~~~~~~~~f~Ng~~ 220 (540) T protein:vir:41 141 QTWDGIHVTYFKDYRYEGEVNPDNGEDQDGVGANEIIFIHLPSPICSYYGVPRYLSAAPSILAMQKIDEYNYAFFDNYTI 220 (540) T ss_pred eeecCceeeeeecccccceeeccccccceeecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 111111 011110 0111223457889999998653 222 222222222222222222 22334578999 Q ss_pred ccceeeeCCcCChHH------HHHHHHHHHHHHHHHhc---cCCcceeecC------CCceeeecCCChhhhh-HHHHHH Q lcl|NC_010576. 222 LNGFIQFPYSTKSTA------RAAQAARRKQEIENEMA---NNKYGVATLD------TQEKFVSAGMGLQNNL-LSDVRQ 285 (447) Q Consensus 222 ~~gvl~~~~~~~~~~------~~~~~~~~~~~~~~~~~---~n~~~~~vl~------~g~~~~~l~~~~~~~~-l~~~~~ 285 (447) |+|||++++.+.++. 
.++.++++++.|.+.++ +|+|+++||+ .|++|++|++++++++ ++.+++ T Consensus 221 p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~~~~~~g~~~~pl~~~~~d~qfle~~~~ 300 (540) T protein:vir:41 221 PSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIPGGDTVEVTFTPLNTSQKELSFREYAAE 300 (540) T ss_pred CceEEEeCcccCchhccchHHHHHHHHHHHHHHHHHhccccccccceEEEecCCCcccceeEEecccchhHHHHHHHHHH Confidence 999999987765543 34556777777777654 3789999984 7999999999988765 688999 Q ss_pred HHHHHHHHhCCCHHHhc----C----CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCH Q lcl|NC_010576. 286 LQQDFYNQMGITEAILN----G----TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPV 357 (447) Q Consensus 286 ~~~~Ia~~fgVP~~~l~----g----~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~ 357 (447) ++++||++|||||++|| + ++.|++.+.|+++||.||+++||++||++|+++ +..+++|+||.+.++++|. T Consensus 301 ~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~L~~~--~~~~~~i~f~~~~ll~~D~ 378 (540) T protein:vir:41 301 KKHDIAAAHMIDPYRLGITDVGPLGGNFAEVARRTYYESVVRPQQEIVSSVLTDFIQLK--LDPGARFVFNEEILMESEF 378 (540) T ss_pred HHHHHHHHhCCCHHHcCcccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc--cCCceEEEecchhhcchHH Confidence 99999999999999996 2 245899999999999999999999999999875 4568899999999999876 Q ss_pred HHHHHHHHHHHhCCCcCHHHHHHHh-CCCCCCCccccccc-cccccchhhcccccCCCCCCCC---CCCcCCCCCCCc-- Q lcl|NC_010576. 358 EQLATVADVLTRNAIYTPNEIRELT-GKAPHPNPLANELF-NRNIADGNQVGGINTPGQITSD---QPATASTDPLNN-- 430 (447) Q Consensus 358 ~~~~~~~~~~~~~G~~t~NE~R~~~-gl~p~~g~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~-- 430 (447) +++ +.+++++|++|+||+|+.+ |++|.+ +.+. +.++.. ....++..++..++. 
......+++..+ T Consensus 379 ~~~---~~~lv~~G~lT~NE~Re~L~g~e~gd----d~~l~p~n~~~-~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~ 450 (540) T protein:vir:41 379 VHN---YALLVQCGVLTPSEVREKLFGLDGGP----DMFMVPSSIGK-SAMKRQKRNYEKNQINEIKRTYAKYKPRIQEI 450 (540) T ss_pred HHH---HHHHHhCCCCCHHHHHHHhCcCcCCC----ccccccccccc-ccccccccccCCCCccccccccchhcccccCc Confidence 655 5678999999999999853 555533 3333 334332 222222111111111 111111122111 Q ss_pred --ccccccCCccCcCCCCC Q lcl|NC_010576. 431 --VSTSAIENGSLTDGGSY 447 (447) Q Consensus 431 --~~~~~~~~~~~~~~~~~ 447 (447) ++.++....+.-+-+-+ T Consensus 451 ~~~~~~~~~~~~~~~~~~~ 469 (540) T protein:vir:41 451 ISSESPLEDKKKKIDEVLS 469 (540) T ss_pred ccccccccccccccccccc Confidence 22222222222222111 No 84 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=100.00 E-value=6.7e-60 Score=344.99 Aligned_cols=433 Identities=12% Similarity=0.087 Sum_probs=240.0 Q ss_pred CchhHhhh-------------------hhcccccCCccccccccc-----cccccccccccccccccCCccc-------- Q lcl|NC_010576. 1 MASSDRLL-------------------HSWNAFQSNQNQNQNTND-----FLTPSNGMTSFGGYYGRGQSNY-------- 48 (447) Q Consensus 1 Mg~~~~l~-------------------~~~~~f~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~-------- 48 (447) -.+|+|++ ..|...+.++.+...... +..|.....+..-++.. .+.+ T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~p~~~~~~~~~~~~~-~p~~~~~~~~~~ 84 (576) T protein:vir:96 6 ADIFKRLRLGRDYEDIIDTVPIDDGLQANIRNIEEKSKELNKSLYGKQQAYAEPFLEVMDTNPEFRT-KRSYMKNSDNLH 84 (576) T ss_pred HHHHHHHhccCccccchhhhhcccChhHHHHHhhhhhhhhccccCCccchhhcceeeeeecCCCccc-cCcchhhhhhhH Confidence 23444433 001001011111111110 00110000000000110 0111 Q ss_pred ccchhhhhhHHHHHHHHHHHHhhccC-----------ceEEEEEcCCCc---eeccccch----HHHHHhhhcCcc-cCH Q lcl|NC_010576. 
49 SRSYSYNKADLIKSVITRIALDASMV-----------DFKHLKIDPISG---NQTPMPSG----LINVLTRSANID-QTG 109 (447) Q Consensus 49 ~~~~~~~~~~~v~~cv~~ia~~ia~l-----------p~~~~r~~~~~~---~~~~~~~~----l~~lL~~~PN~~-~t~ 109 (447) ...+.+..+++|++||++||++||++ +|.+..+..++. +.....|+ +..++ ..|||+ ||+ T Consensus 85 ~~l~~~~~npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~-~~~~p~~~t~ 163 (576) T protein:vir:96 85 DVLKQFGNNPILNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTG-RDKDIDRDSF 163 (576) T ss_pred HHHHHhhcCHHHHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhcc-CCCCCccccH Confidence 01234567889999999999999973 233222222221 11112233 33333 346665 589 Q ss_pred HHHHHHHHHHHHhcCCeeEEEeecc--CCcccceeeeccCCCcceeeecCCceEE--EEeeecccccceeeecccccccc Q lcl|NC_010576. 110 RSFVFDLLYSLLDEGQIAMVPIDTT--VDPDSGSFDINTARVGKIMQFFPRQVMV--RVWNDNTGLEQDLLVSKENCIII 185 (447) Q Consensus 110 ~~f~~~~~~~lll~Gna~i~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~v~~~ 185 (447) ++||+.++.+++++||||+++.+.. .+.+.. ++++++....+.....+.... ..|...........++..+++|+ T Consensus 164 ~~f~~~lv~dlll~Gna~~~i~~~rd~~g~~~~-L~pl~p~~V~v~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~dii~~ 242 (576) T protein:vir:96 164 QSFCRKIVRDTYTYDQVNFEKVFNKKNATTMDK-FIAVDPSTIFYATDKNGKIIKGGKRFVQVINKKVVASFTSREMAMG 242 (576) T ss_pred HHHHHHHHHHHHhcCCeEEEEEEecCCCCceEE-EEEeCCceeEEEECCCCceeeeeeEEEEecCCceEEEecccceEEE Confidence 9999999999999999999987543 333334 444444443443333332221 11222223334456777888765 Q ss_pred -cccccccc--cchhHHHHHHHHHHHHHH-----HHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhcc--C Q lcl|NC_010576. 186 -ESPFYAIL--NDTNQTLRMLEQKIKLMN-----SQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMAN--N 255 (447) Q Consensus 186 -~~~~~~~~--~~~~~~~~~~~~~~~~~~-----~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~--n 255 (447) +++..+.. ..+.+.+..+...+.... +...+.||+.++|||++++.... 
.+++++++++.|.+.++| | T Consensus 243 ~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~~~~l--s~e~~~~lr~~~~~~~~G~~n 320 (576) T protein:vir:96 243 IRNPRTELSSSGYGLSEVEIAMKQFIAYNNTETFNDRFFSHGGTTRGILQIKSEQQQ--SQRALENFKREWKSSFSGING 320 (576) T ss_pred eecCCCCcccCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCC--CHHHHHHHHHHHHHHhccccc Confidence 44443211 111233333344433332 23345789999999998754321 134566777777777665 7 Q ss_pred Ccc-eeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC-----------------CcHHHHHHHHHHH Q lcl|NC_010576. 256 KYG-VATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNG-----------------TANEQQTLGYYNR 316 (447) Q Consensus 256 ~~~-~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g-----------------~~~e~~~~~f~~~ 316 (447) +++ ++||++|++|+++++++.+++ ++.+++++++||++|||||++||. ++.|++.++||++ T Consensus 321 ag~~p~vl~~G~~~~~ls~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~t~sn~e~~~~~f~~~ 400 (576) T protein:vir:96 321 SWQVPVVMADDIKFVNMTPTANDMQFEKWLTYLINIISALYGIDPAEIGFPNRGGATGGKGGNTLNEADPGKKQQQSQNK 400 (576) T ss_pred cccceeecCCCceEEeccCChhhHHHHHHHHHhHHHHHHHhCCCHHHccccccccccccccccccccccHHHHHHHHHHH Confidence 788 489999999999999998865 788999999999999999999962 2458999999999 Q ss_pred HHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCccccc Q lcl|NC_010576. 317 CVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVL--TRNAIYTPNEIRELTGKAPHPNPLANE 394 (447) Q Consensus 317 ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~--~~~G~~t~NE~R~~~gl~p~~g~~~~~ 394 (447) ||.||+++||++||++|++.. +.++.|+ +++.|.+++++.+..+ +.+|+||+||+|+++||||++|| |. 
T Consensus 401 tL~P~~~~ie~~ln~~Ll~~~----~~~~~~~---f~r~d~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~piegG--D~ 471 (576) T protein:vir:96 401 GLQPLLRFIEDLINTHIISEY----SDKYVFQ---FVGGDTKSELDKIKILQEEVKTYKTVNEARKEKGLKPIEGG--DV 471 (576) T ss_pred HHHHHHHHHHHHHHhhhchhc----cCceEEE---eccCCHHHHHHHHHHHHHHhcCccCHHHHHHHhCCCCCCCc--ce Confidence 999999999999999999752 2334443 4678999999988755 55799999999999999999984 55 Q ss_pred c-ccccccchhhcccccC----CCCCC-----------CCCCC-cCCCCCCCcccccccCC---ccCcCCCCC Q lcl|NC_010576. 395 L-FNRNIADGNQVGGINT----PGQIT-----------SDQPA-TASTDPLNNVSTSAIEN---GSLTDGGSY 447 (447) Q Consensus 395 ~-~~~~~~~~~~~~~~~~----~~~~~-----------~~~~~-~~~~~~~~~~~~~~~~~---~~~~~~~~~ 447 (447) + .+.++.+..+...... ..+.. ....+ .+++...++...++... +.-+++|.- T Consensus 472 ~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~g~~~~~~~~~~~~~~~~~~~ 544 (576) T protein:vir:96 472 LLDGSFIQSMSLNTQKEQYEDTKQKERFDMIQQFLNSPDDEEPQQESTEDKVDGRESNDPTKIDSPVGTDGQL 544 (576) T ss_pred eccccccccccccccCCCCCCccccccccccccccCCCCCCCCCCCCCCCcccccccccCCCCCCcccccccc Confidence 3 3444433332211100 00000 00000 00001111111111111 111233433 No 85 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=100.00 E-value=1.7e-59 Score=342.72 Aligned_cols=423 Identities=11% Similarity=0.100 Sum_probs=244.1 Q ss_pred CchhHhhhhhcccccCCcccccccccccccccccccccccccc----CCccc---ccchhhhhhHHHHHHHHHHHHhhcc Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGR----GQSNY---SRSYSYNKADLIKSVITRIALDASM 73 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~---~~~~~~~~~~~v~~cv~~ia~~ia~ 73 (447) +..+++-+ .++++.+.. ++.. ....+.++.. ..+.+ ...+.+..+++|++||+.+++.||. 
T Consensus 43 ~~~~~~~~------~~~~~a~~~---~~~~---~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~ 110 (563) T protein:vir:95 43 YQDLTKSL------YGQQQAYAE---PFIE---MMDTNPEFRDKRSYMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAM 110 (563) T ss_pred HHHHHhhh------ccCCCcchh---hhHh---hhcccccccccccCCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHH Confidence 33333211 111111111 1110 0011111110 11111 1133455678888999988888885 Q ss_pred -------------CceEEEEEcCCCcee-ccccchHHHHHh-hh--cCcc-cCHHHHHHHHHHHHHhcCCeeEEEe--ec Q lcl|NC_010576. 74 -------------VDFKHLKIDPISGNQ-TPMPSGLINVLT-RS--ANID-QTGRSFVFDLLYSLLDEGQIAMVPI--DT 133 (447) Q Consensus 74 -------------lp~~~~r~~~~~~~~-~~~~~~l~~lL~-~~--PN~~-~t~~~f~~~~~~~lll~Gna~i~~~--~~ 133 (447) +|+++++.+..+... ....|++..+|. .. |||+ +|+++||+.++.+++++||+|+++. ++ T Consensus 111 ~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd 190 (563) T protein:vir:95 111 YCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKN 190 (563) T ss_pred HhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEec Confidence 688877766555433 234566655442 22 3343 5889999999999999999998765 45 Q ss_pred cCCcccceeeeccCCCcceeeecCCceE--EEEeeecccccceeeecccccc-cccccccccc--cchhHHHHHHHHHHH Q lcl|NC_010576. 134 TVDPDSGSFDINTARVGKIMQFFPRQVM--VRVWNDNTGLEQDLLVSKENCI-IIESPFYAIL--NDTNQTLRMLEQKIK 208 (447) Q Consensus 134 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~v~-~~~~~~~~~~--~~~~~~~~~~~~~~~ 208 (447) ..+.+..++++.+. ...+.....+... ...|...........++.++++ |++++..... ..+.+.+..+...+. 
T Consensus 191 ~~G~~~~L~pl~p~-~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~ 269 (563) T protein:vir:95 191 NKTKLEKFIAVDPS-TIFYATDKKGKIIKGGKRFVQVVDKRVVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFI 269 (563) T ss_pred CCCceEEEEEeCCc-eeEEEECCCCceeccceeEEEEeCCceeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHH Confidence 44544444444443 3333332222211 0111111122223456677766 5556543211 112233333444433 Q ss_pred HHH-----HHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhcc--CCcce-eecCCCceeeecCCChhhhh- Q lcl|NC_010576. 209 LMN-----SQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMAN--NKYGV-ATLDTQEKFVSAGMGLQNNL- 279 (447) Q Consensus 209 ~~~-----~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~--n~~~~-~vl~~g~~~~~l~~~~~~~~- 279 (447) ... +...|.||+.|+|||++++.... .+++++++++.|.+.+++ |+|++ +|+++|++|+++++++++++ T Consensus 270 ~~~~~~~~~~~~f~ng~~p~giL~~~~~~~l--s~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qf 347 (563) T protein:vir:95 270 AYNNTESFNDRFFSHGGTTRGILQIRSDQQQ--SQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQF 347 (563) T ss_pred HHHHHHHHHHHHHHccCCCceEEEeCCCCCC--CHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHHH Confidence 332 23345789999999998754211 134566677777776664 67775 78999999999999998865 Q ss_pred HHHHHHHHHHHHHHhCCCHHHhcC-----------------CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCC Q lcl|NC_010576. 280 LSDVRQLQQDFYNQMGITEAILNG-----------------TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQG 342 (447) Q Consensus 280 l~~~~~~~~~Ia~~fgVP~~~l~g-----------------~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g 342 (447) ++.+++++++||++|||||++||. ++.|++.+.|+++||.||++.||++||++|+++. . 
T Consensus 348 le~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~--~-- 423 (563) T protein:vir:95 348 EKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISEY--G-- 423 (563) T ss_pred HHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhc--c-- Confidence 788999999999999999999962 2347888999999999999999999999999753 2 Q ss_pred ceEEEecchhhhcCHHHHHHHHHH--HHhCCCcCHHHHHHHhCCCCCCCcccccc-ccccccchhhcccccCC------- Q lcl|NC_010576. 343 QVLVYYRNPFKLVPVEQLATVADV--LTRNAIYTPNEIRELTGKAPHPNPLANEL-FNRNIADGNQVGGINTP------- 412 (447) Q Consensus 343 ~~i~f~~~~l~~~d~~~~~~~~~~--~~~~G~~t~NE~R~~~gl~p~~g~~~~~~-~~~~~~~~~~~~~~~~~------- 412 (447) .++.|+ ++++|.+++++.+.. ++++||||+||+|+++||||++|| |.+ .+.++.+..+....... T Consensus 424 ~~~~~~---f~r~D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~gG--D~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (563) T protein:vir:95 424 DKYTFQ---FVGGDTKSATDKLNILKLETQIFKTVNEAREEQGKKPIEGG--DIILDASFLQGTAQLQQDKQYNDGKQKE 498 (563) T ss_pred cccEEE---eccCCHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCCCCc--ceeecccccccccccccccCCCccccch Confidence 234443 478899999998764 588999999999999999999984 553 34444433322111000 Q ss_pred -------C-CCCCCCCCcCCCCCCCcc-----ccccc---CCccCcCCCCC Q lcl|NC_010576. 413 -------G-QITSDQPATASTDPLNNV-----STSAI---ENGSLTDGGSY 447 (447) Q Consensus 413 -------~-~~~~~~~~~~~~~~~~~~-----~~~~~---~~~~~~~~~~~ 447 (447) + ....+.++.+.++..+++ +++.. ++-.+..+|+. 
T Consensus 499 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 549 (563) T protein:vir:95 499 RLQMMMSLLEGDNDDSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQTSNK 549 (563) T ss_pred hhhhcccccCCCCCCCCCCCCCCCCCCccccccccccccccccccccCccc Confidence 0 000111111111111111 11111 22223344444 No 86 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=100.00 E-value=1.7e-59 Score=342.72 Aligned_cols=423 Identities=11% Similarity=0.100 Sum_probs=244.1 Q ss_pred CchhHhhhhhcccccCCcccccccccccccccccccccccccc----CCccc---ccchhhhhhHHHHHHHHHHHHhhcc Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGR----GQSNY---SRSYSYNKADLIKSVITRIALDASM 73 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~---~~~~~~~~~~~v~~cv~~ia~~ia~ 73 (447) +..+++-+ .++++.+.. ++.. ....+.++.. ..+.+ ...+.+..+++|++||+.+++.||. T Consensus 43 ~~~~~~~~------~~~~~a~~~---~~~~---~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~ 110 (563) T protein:vir:99 43 YQDLTKSL------YGQQQAYAE---PFIE---MMDTNPEFRDKRSYMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAM 110 (563) T ss_pred HHHHHhhh------ccCCCcchh---hhHh---hhcccccccccccCCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHH Confidence 33333211 111111111 1110 0011111110 11111 1133455678888999988888885 Q ss_pred -------------CceEEEEEcCCCcee-ccccchHHHHHh-hh--cCcc-cCHHHHHHHHHHHHHhcCCeeEEEe--ec Q lcl|NC_010576. 74 -------------VDFKHLKIDPISGNQ-TPMPSGLINVLT-RS--ANID-QTGRSFVFDLLYSLLDEGQIAMVPI--DT 133 (447) Q Consensus 74 -------------lp~~~~r~~~~~~~~-~~~~~~l~~lL~-~~--PN~~-~t~~~f~~~~~~~lll~Gna~i~~~--~~ 133 (447) +|+++++.+..+... ....|++..+|. .. |||+ +|+++||+.++.+++++||+|+++. 
++ T Consensus 111 ~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd 190 (563) T protein:vir:99 111 YCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKN 190 (563) T ss_pred HhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEec Confidence 688877766555433 234566655442 22 3343 5889999999999999999998765 45 Q ss_pred cCCcccceeeeccCCCcceeeecCCceE--EEEeeecccccceeeecccccc-cccccccccc--cchhHHHHHHHHHHH Q lcl|NC_010576. 134 TVDPDSGSFDINTARVGKIMQFFPRQVM--VRVWNDNTGLEQDLLVSKENCI-IIESPFYAIL--NDTNQTLRMLEQKIK 208 (447) Q Consensus 134 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~v~-~~~~~~~~~~--~~~~~~~~~~~~~~~ 208 (447) ..+.+..++++.+. ...+.....+... ...|...........++.++++ |++++..... ..+.+.+..+...+. T Consensus 191 ~~G~~~~L~pl~p~-~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~ 269 (563) T protein:vir:99 191 NKTKLEKFIAVDPS-TIFYATDKKGKIIKGGKRFVQVVDKRVVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFI 269 (563) T ss_pred CCCceEEEEEeCCc-eeEEEECCCCceeccceeEEEEeCCceeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHH Confidence 44544444444443 3333332222211 0111111122223456677766 5556543211 112233333444433 Q ss_pred HHH-----HHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhcc--CCcce-eecCCCceeeecCCChhhhh- Q lcl|NC_010576. 209 LMN-----SQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMAN--NKYGV-ATLDTQEKFVSAGMGLQNNL- 279 (447) Q Consensus 209 ~~~-----~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~--n~~~~-~vl~~g~~~~~l~~~~~~~~- 279 (447) ... +...|.||+.|+|||++++.... 
.+++++++++.|.+.+++ |+|++ +|+++|++|+++++++++++ T Consensus 270 ~~~~~~~~~~~~f~ng~~p~giL~~~~~~~l--s~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qf 347 (563) T protein:vir:99 270 AYNNTESFNDRFFSHGGTTRGILQIRSDQQQ--SQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQF 347 (563) T ss_pred HHHHHHHHHHHHHHccCCCceEEEeCCCCCC--CHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHHH Confidence 332 23345789999999998754211 134566677777776664 67775 78999999999999998865 Q ss_pred HHHHHHHHHHHHHHhCCCHHHhcC-----------------CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCC Q lcl|NC_010576. 280 LSDVRQLQQDFYNQMGITEAILNG-----------------TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQG 342 (447) Q Consensus 280 l~~~~~~~~~Ia~~fgVP~~~l~g-----------------~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g 342 (447) ++.+++++++||++|||||++||. ++.|++.+.|+++||.||++.||++||++|+++. . T Consensus 348 le~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~--~-- 423 (563) T protein:vir:99 348 EKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISEY--G-- 423 (563) T ss_pred HHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhc--c-- Confidence 788999999999999999999962 2347888999999999999999999999999753 2 Q ss_pred ceEEEecchhhhcCHHHHHHHHHH--HHhCCCcCHHHHHHHhCCCCCCCcccccc-ccccccchhhcccccCC------- Q lcl|NC_010576. 343 QVLVYYRNPFKLVPVEQLATVADV--LTRNAIYTPNEIRELTGKAPHPNPLANEL-FNRNIADGNQVGGINTP------- 412 (447) Q Consensus 343 ~~i~f~~~~l~~~d~~~~~~~~~~--~~~~G~~t~NE~R~~~gl~p~~g~~~~~~-~~~~~~~~~~~~~~~~~------- 412 (447) .++.|+ ++++|.+++++.+.. ++++||||+||+|+++||||++|| |.+ .+.++.+..+....... 
T Consensus 424 ~~~~~~---f~r~D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~gG--D~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (563) T protein:vir:99 424 DKYTFQ---FVGGDTKSATDKLNILKLETQIFKTVNEAREEQGKKPIEGG--DIILDASFLQGTAQLQQDKQYNDGKQKE 498 (563) T ss_pred cccEEE---eccCCHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCCCCc--ceeecccccccccccccccCCCccccch Confidence 234443 478899999998764 588999999999999999999984 553 34444433322111000 Q ss_pred -------C-CCCCCCCCcCCCCCCCcc-----ccccc---CCccCcCCCCC Q lcl|NC_010576. 413 -------G-QITSDQPATASTDPLNNV-----STSAI---ENGSLTDGGSY 447 (447) Q Consensus 413 -------~-~~~~~~~~~~~~~~~~~~-----~~~~~---~~~~~~~~~~~ 447 (447) + ....+.++.+.++..+++ +++.. ++-.+..+|+. T Consensus 499 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 549 (563) T protein:vir:99 499 RLQMMMSLLEGDNDDSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQTSNK 549 (563) T ss_pred hhhhcccccCCCCCCCCCCCCCCCCCCccccccccccccccccccccCccc Confidence 0 000111111111111111 11111 22223344444 No 87 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=100.00 E-value=1.5e-57 Score=332.16 Aligned_cols=432 Identities=10% Similarity=0.074 Sum_probs=237.0 Q ss_pred CchhHhhhhhcccccC-Ccccccc----------------------cccc----ccccc-c-ccccccccccCCccccc- Q lcl|NC_010576. 1 MASSDRLLHSWNAFQS-NQNQNQN----------------------TNDF----LTPSN-G-MTSFGGYYGRGQSNYSR- 50 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~-~~~~~~~----------------------~~~~----~~~~~-~-~~~~~~~~~~~~~~~~~- 50 (447) -||.||+..+|.-..- +++..-. ..++ +.+.. . 
....+|......+.+.- T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~d~~~~~~~r~g~~~~~~~~g~~~~~epp~d~~ 87 (648) T protein:vir:79 8 RGFWSRISLMWRDEDDDKEPLVLEESMQLGEAPGAMPKGGGGGGSAKRDPKMSLVKRIGLAIMDGGGGGRDFEEPEFDFN 87 (648) T ss_pred chhhhhhhhhccCccccccccccccccccCCCccccCCCCcccccccccchhHHHHHhHHHHHhhcCCccccccCCcCHH Confidence 6888887776641000 1110000 0000 00000 0 00001111111111111 Q ss_pred --chhhhhhHHHHHHHHHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeE Q lcl|NC_010576. 51 --SYSYNKADLIKSVITRIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAM 128 (447) Q Consensus 51 --~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i 128 (447) .+.+..++.|++||++||++||++||+++.++++ .+..++. .+|..+||++||+++||+.++.+++++||||+ T Consensus 88 ~l~~l~~~np~V~~aI~iia~~ia~l~~~i~~~~~~----~~~~~~~-~~ll~rPn~~~t~~~f~~~l~~~lll~GNAYv 162 (648) T protein:vir:79 88 EITSAYNTEGYVRQAVDKYIEMMFKADWDFVSKNPN----AVEYIRM-RFTLMAEATQIPTNQLFIEIAEDLVKYCNVVI 162 (648) T ss_pred HHHHHHhcChHHHHHHHHHHHHHhhCcceEEecCCc----cchhhHH-HHHhhccCCCCCHHHHHHHHHHHHHhcCCeEE Confidence 2445678999999999999999999986543321 1223343 34446999999999999999999999999999 Q ss_pred EEeeccCCcccce--------------eeeccCCCcceeeecCCceEEEEeeecccccceeeeccccccccccc--cccc Q lcl|NC_010576. 129 VPIDTTVDPDSGS--------------FDINTARVGKIMQFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIESP--FYAI 192 (447) Q Consensus 129 ~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~~~~ 192 (447) ++.++..+....+ ++++......+.....+.... +.+...+....+.+++++|+|++.. ..+. 
T Consensus 163 eiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~d~~g~~~~-Y~y~~~g~~~~~~~~~~dIIHik~~~~~d~~ 241 (648) T protein:vir:79 163 AKSRAKDALPFQGMNVMGVGDSMPVAGYFPLNLASMKVKRDKFGMIKG-WQQEQEGQDKPQKFKPEDIVHIYYKREKGRA 241 (648) T ss_pred EEEecCCCccchhhhhhhhccccceeeeEeecCceeEEEEcCCCceee-eEEEecCCceeEEecCccEEEEccCCCCCCc Confidence 9999877643221 222222221222111111111 1122233344566889999999843 2222 Q ss_pred ccchhHHHHHHHHHHHHH-----HHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCce Q lcl|NC_010576. 193 LNDTNQTLRMLEQKIKLM-----NSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEK 267 (447) Q Consensus 193 ~~~~~~~~~~~~~~~~~~-----~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~ 267 (447) . +.+.+..+..++... .+...+.||+.|+|+|+++..-. ..+..+++++.|.+.+.+ ..+.+.+.+ T Consensus 242 ~--GlSpi~~a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~~~~~~---~~e~~k~~~e~~~~~~~~----~~i~gg~v~ 312 (648) T protein:vir:79 242 F--GTPWLLPALDDIRALRQVEENVLRLVYRNLHPLWHVKVGLEQE---GFGAEEGEVDLVRGEVEN----MDVEGGMVT 312 (648) T ss_pred e--eccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCcc---chHHHHHHHHHHHHhccc----ccccccccc Confidence 1 122333333333332 23334578999999998753211 122333444444443322 223344444 Q ss_pred eeecCC----Chhhhh-HHHHHHHHHHHHHHhCCCHHHhcC-----CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcC--- Q lcl|NC_010576. 268 FVSAGM----GLQNNL-LSDVRQLQQDFYNQMGITEAILNG-----TANEQQTLGYYNRCVDVLLQYVTDAISRIAL--- 334 (447) Q Consensus 268 ~~~l~~----~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g-----~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl--- 334 (447) ++.+.+ ++++++ ++.+++++++||++|||||++||. 
.++.++...+|..+|.|+...++..++.+++ T Consensus 313 ~~~~~i~~~~s~~dlqfle~rk~~~~eIa~aFgVPP~lLG~~~~ss~stae~~~~~~~~~i~~l~~~i~~~le~~~~~~l 392 (648) T protein:vir:79 313 TERVNISSIASNQIIDAKEYLKHFEQRAFTVLGVSELMMGRGGTASRSTGDNLSSDFKDRIKALQKVMATFINEFMVKEI 392 (648) T ss_pred cceeeccccCCHHHHHHHHHHHHHHHHHHHHhCCCHhHcccCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 444443 345554 577899999999999999999962 1234445566778888887766665554433 Q ss_pred -ChhHh----cCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccccccccchhhcccc Q lcl|NC_010576. 335 -TKTAV----SQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFNRNIADGNQVGGI 409 (447) Q Consensus 335 -~~~e~----~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~ 409 (447) .+..+ ...++++|++++|++.|.+++++.+.+++++||||+||+|+++||||++++++..++..++.+..+.... T Consensus 393 l~e~~l~~~l~~d~~ieF~~~~Llr~D~~~~a~~~~~l~~~GilT~NEaR~~lGlpPi~~g~~~~~l~~~~~~~~~~~~~ 472 (648) T protein:vir:79 393 LMEGGFDPVLNPDDKVEFRFNEIDMDSKIKLENQAVFLYEHNAISEDEMRELIGRDPVDDGEGRAKMHLQMVTIAQATAL 472 (648) T ss_pred hhhhhccccccccceEEEeecccchhhHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCccccccccccchhcccc Confidence 22211 1235789999999999999999999999999999999999999999999987655555554443222111 Q ss_pred ----cCCCCCCCCCCCcCCCCC-CCcccccccCCccCcCCCCC Q lcl|NC_010576. 410 ----NTPGQITSDQPATASTDP-LNNVSTSAIENGSLTDGGSY 447 (447) Q Consensus 410 ----~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 447 (447) ..+.........+++... 
.++-..+...+|+..+...+ T Consensus 473 ~~~~~~~~~~~~~~a~~eg~~~e~~~~~~~~~~~g~~~~~~~~ 515 (648) T protein:vir:79 473 AALAPTPAGGSSASASGDKKKKATDNKTKPTNQHGTKTSPKKQ 515 (648) T ss_pred ccCCCCCCCCCCCCccccccccccCCCCCCCCCCCcCCCCccc Confidence 111111110110111000 00111111222222222223 No 88 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=100.00 E-value=2.9e-55 Score=319.60 Aligned_cols=426 Identities=8% Similarity=-0.015 Sum_probs=241.4 Q ss_pred CchhHhhhhhcccccCCccc-c--ccccccc-cccccccccccccccCCccc-ccchhh-hhhHHHHHHHHHHHHhhc-- Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQ-N--QNTNDFL-TPSNGMTSFGGYYGRGQSNY-SRSYSY-NKADLIKSVITRIALDAS-- 72 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~-~--~~~~~~~-~~~~~~~~~~~~~~~~~~~~-~~~~~~-~~~~~v~~cv~~ia~~ia-- 72 (447) +|+= |.... -+...... + .....+. ....-+..+...+....... ...+.. -...+.++||.+|+++++ T Consensus 78 ~g~~--~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiIrn~~g~p 154 (651) T protein:vir:99 78 FGFD--LVPAQ-GVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEMLTDIEGRP 154 (651) T ss_pred cCce--eeecc-cCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhhhcCccch Confidence 4431 11100 00000000 0 0000000 00000000001101000000 001111 113457899999998765 Q ss_pred ----cCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCC Q lcl|NC_010576. 73 ----MVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTAR 148 (447) Q Consensus 73 ----~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~ 148 (447) .+|.+.+|...++. ..+|++..+|+.+||.+|+...|.+.+ ++++.+++|+++..+..+... .+...... 
T Consensus 155 v~L~~lp~~~~Rv~~~~~---~~~~~~~~ll~~~pn~~~~~~~~~~~~--q~~~~~~~~~~~~g~~~~~~~-~~~~~~~~ 228 (651) T protein:vir:99 155 VGLAYVPARTVRVRRPQN---RFDQPRHPEEGRYVDGDVADIASRGYV--QIRNGNRRYFGEAGDRYRGQE-VVIDESGD 228 (651) T ss_pred hhhhhcChhheeeecccc---cccchhhhhhhcccccccchhHHHHHH--HHHhcCcceEEEeecccccee-eeeccCCc Confidence 47777777765543 357899999999999999998886543 455666666654433222111 11111111 Q ss_pred Cccee----------eecCCceEEEEeeecccccceeeeccccccccccccc-ccccchhHHHHHHHHHHHHHH-----H Q lcl|NC_010576. 149 VGKIM----------QFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIESPFY-AILNDTNQTLRMLEQKIKLMN-----S 212 (447) Q Consensus 149 ~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~-----~ 212 (447) ...+. ....+...+.+... .......++.++|||++.+.. +... +.+.+..+..++.... + T Consensus 229 ~v~~~~~~d~~~~~~~~~~~~~~g~~~~~--~~~~~~~~~~~eViHir~~~~~~g~~-G~spl~~a~~~i~~a~~a~~~~ 305 (651) T protein:vir:99 229 EPTIRYREDEESEREPIFVDRETGDVTTG--DANGLENRPANELIFIPNPSILEDDY-GVPDWVSAIRTISADEAAKDYN 305 (651) T ss_pred ceeEEeccCcceeeeeecccceeeeEEEc--CCCceeEecccceEEecCCCCCCCcc-cccHHHHHHHHHHHHHHHHHHH Confidence 11110 11111222222111 122334678899999996531 2111 1123333333333322 2 Q ss_pred HHHHhhcCcccceeeeCC-cCChHHHHHHHHHHHHHHHHHhccCCcceeecCC-----------CceeeecCCCh-hhh- Q lcl|NC_010576. 213 QDNRASSGKLNGFIQFPY-STKSTARAAQAARRKQEIENEMANNKYGVATLDT-----------QEKFVSAGMGL-QNN- 278 (447) Q Consensus 213 ~~~~~n~~~~~gvl~~~~-~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~-----------g~~~~~l~~~~-~~~- 278 (447) ...|.||+.++|||++++ .++++ +++++++.|++. .+|+|++++|+. 
|++|++|++++ +++ T Consensus 306 ~~~f~NG~~p~gil~~~~~~ls~e----~~~~lr~~~~~~-~~nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~q 380 (651) T protein:vir:99 306 RDFFDNDTIPRMVIKVTGGELSEE----SKRDLRQMLNGL-REESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMD 380 (651) T ss_pred HHHHhccCCCceEEEecCCCCCHH----HHHHHHHHHHHH-hccCCceEEeecccccccccccCCceEEEcCcCchhhHH Confidence 233578899999999875 45544 445566666553 457788888865 99999999876 465 Q ss_pred hHHHHHHHHHHHHHHhCCCHHHhcC------CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCc--eEEEecc Q lcl|NC_010576. 279 LLSDVRQLQQDFYNQMGITEAILNG------TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQ--VLVYYRN 350 (447) Q Consensus 279 ~l~~~~~~~~~Ia~~fgVP~~~l~g------~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~--~i~f~~~ 350 (447) +++.+++++++||++|||||++||. ++.|++.+.|+++||.||++.||++||++||++.++..++ +++|+.+ T Consensus 381 fle~r~~~~~eIa~afgVPp~~lG~~~~~~~sn~E~~~~~f~~~tL~P~~~~ie~eln~kLl~~~e~~~~~~i~~ef~~~ 460 (651) T protein:vir:99 381 FRQFREKNEHEIAKVLEVPPVKIGVTDSANRSNSDQQDKDFALEVIQPEQHTFAEWLYQIIHQQALGVTDWTIEYELRGA 460 (651) T ss_pred HHHHHHHHHHHHHHHhCCCHHHhccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCceEEEEeccc Confidence 5688999999999999999999962 3459999999999999999999999999999988776665 5678888 Q ss_pred hhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccc-cccccchhhcccccCCCCCCCCCCCcCCCCCCC Q lcl|NC_010576. 351 PFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELF-NRNIADGNQVGGINTPGQITSDQPATASTDPLN 429 (447) Q Consensus 351 ~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 429 (447) .++++|.+++++++.+++++||||+||+|+++||||+++++++.+. +.+.....+. ...++....+++.+.+.-.. 
T Consensus 461 ~llr~D~~~~~e~~~~~i~~G~~T~NE~R~~lglppi~~~~gd~~l~~~~~~~~g~~---~~gge~~~~~~~~~~~~~~~ 537 (651) T protein:vir:99 461 DQPKQEAQLAEQRVRAMRLAGVGLVDEAREELGLDPLGEPYGEMTLSEFEAEVAGDV---AGGGETEAVHEPPEENKIGE 537 (651) T ss_pred hhhhccHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcccccccccccccccccc---ccCCCCcccccCcccccccc Confidence 8999999999999999999999999999999999999988777643 3333322221 11111111111111111000 Q ss_pred cccccccCC--------------ccCc---------------------CCCCC Q lcl|NC_010576. 430 NVSTSAIEN--------------GSLT---------------------DGGSY 447 (447) Q Consensus 430 ~~~~~~~~~--------------~~~~---------------------~~~~~ 447 (447) +.-.++.+ .+.. +|+-| T Consensus 538 -~e~~~~~~~~~~~e~~~~~~v~ss~~~~~gyd~~~~~l~~~f~~~~~~~~~y 589 (651) T protein:vir:99 538 -REWDTVKSELTTKDPIEQMQFSSSNLDEGLYDFGENELYLSFLRDEGQSSLY 589 (651) T ss_pred -chhhhhhhhhcccchhhhhhHHHHHHHhhcCCCccceEEEEEeecCCCCcee Confidence 00001111 1111 22223 No 89 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=100.00 E-value=2.8e-52 Score=303.20 Aligned_cols=268 Identities=11% Similarity=0.081 Sum_probs=200.4 Q ss_pred hccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCc Q lcl|NC_010576. 71 ASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVG 150 (447) Q Consensus 71 ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~ 150 (447) ||+|||++|+.++ ..+|+++++|+.+||++||+++||+.++.+++++||||+++.++..+.+..++++.+.. . 
T Consensus 1 ia~l~~~~~~~~~------~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~~~~l~~l~~~~-v 73 (278) T protein:vir:78 1 MASLPLKMYEDYK------VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDV-V 73 (278) T ss_pred CccceeEEEecCc------ccccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCcEEEEEEECCce-e Confidence 9999999998543 34799999999999999999999999999999999999999998777655444444443 3 Q ss_pred ceeeecCCceEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHHH---HHhhcCcccceee Q lcl|NC_010576. 151 KIMQFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQD---NRASSGKLNGFIQ 227 (447) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~n~~~~~gvl~ 227 (447) .+.....+... .|......+..+.++.++|+|++.+.....-.+.+.+..+...+....++. ...++..++++++ T Consensus 74 ~v~~~~~~~~~--~y~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~ 151 (278) T protein:vir:78 74 EMLIENQSREL--YYSIHAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQKPDSFMLK 151 (278) T ss_pred EEEEcCCCceE--EEEEEcCCceEEEEccccEEEECCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEE Confidence 44333322222 222233445567788999999986532111111223333333333322221 1234556789999 Q ss_pred eCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcCC-- Q lcl|NC_010576. 228 FPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQLQQDFYNQMGITEAILNGT-- 304 (447) Q Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~~~~Ia~~fgVP~~~l~g~-- 304 (447) .++.+++++. ++++++|++.. +++|+++++++|++|+++++++.+++ ++.+++.+++||++|||||++||+. 
T Consensus 152 ~~~~l~~e~~----~~~~~~~~~~~-~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~ 226 (278) T protein:vir:78 152 YGSNVGKEKR----QQVLEDFKQYY-EENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSN 226 (278) T ss_pred eCCCCCHHHH----HHHHHHHHHHh-ccCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC Confidence 9998887654 44555555544 46788999999999999999988866 6778999999999999999999732 Q ss_pred ----cHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchh Q lcl|NC_010576. 305 ----ANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPF 352 (447) Q Consensus 305 ----~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l 352 (447) +.+++.++|++.||.|+++.||++||++||++.++..|++|+||++.| T Consensus 227 ~~~sn~~~~~~~~~~~~l~P~~~~i~~~ln~~L~~~~e~~~g~~~~f~~~~l 278 (278) T protein:vir:78 227 TNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKIGILNLTLNLI 278 (278) T ss_pred CCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhcCCceEEEecccC Confidence 348889999999999999999999999999999999999999999999 No 90 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=100.00 E-value=3.7e-41 Score=242.28 Aligned_cols=333 Identities=8% Similarity=-0.020 Sum_probs=195.2 Q ss_pred CchhHhhhhhcccccCCcccccc------cccccccccccccccccccc--CC------cccccchhhhhhHHHHHHHHH Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQN------TNDFLTPSNGMTSFGGYYGR--GQ------SNYSRSYSYNKADLIKSVITR 66 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~--~~------~~~~~~~~~~~~~~v~~cv~~ 66 (447) |+--.+ +..-.-.+.....+.. ......+.. ..+ ||.... +. ........+.++|+-+.|+.. 
T Consensus 1 m~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-fg~p~~~~~~~~~~~~~~~~~~~~~~~~pi~~~~la~ 77 (368) T protein:vir:79 1 MSRNKT-RRAARAASAHVRTANTDAPTEHHTDRAAQAE-VFS-FGDPVEVLDRRELLDYVECMRMGQWYEPPMPWDGLAR 77 (368) T ss_pred CCcccc-ccchhccCcccccccccCcchhhccccCceE-EEE-cCCceeecchhhHHHHHHHHhccchhccCcCHHHHHH Confidence 443211 0000000000000000 000000000 001 111100 00 000111113333333333332 Q ss_pred HHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeecc Q lcl|NC_010576. 67 IALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINT 146 (447) Q Consensus 67 ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~ 146 (447) + ++.....+.....+|++..++ .+||++||+++||+ ++.+++++||||+++.++..+.+..++++.+ T Consensus 78 ~-----------~~~~~~h~~~~~~~~n~l~l~-~~Pn~~~t~~~f~~-l~~d~ll~Gnay~~~~r~~~G~~~~L~~l~~ 144 (368) T protein:vir:79 78 S-----------FRAAAHHSSAVYVKRNILVST-FIPHPLLSRATFER-LVLDWQVFGNAYLERRENVLGGTIRLDTPLA 144 (368) T ss_pred H-----------Hhhccccchhhhhhcchhhhh-cCCCcCCCHHHHHH-HHHHHhhcCCeEEEEEEcCCCCEEEEEEeCc Confidence 2 222222233334578887666 58999999999976 6789999999999999988776665555544 Q ss_pred CCCcceeeecCCceEEEEeeecccccceeeecccccccccccc--ccc--ccchhHHHHHHHHHHHHH-HHHHHHhhcCc Q lcl|NC_010576. 147 ARVGKIMQFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIESPF--YAI--LNDTNQTLRMLEQKIKLM-NSQDNRASSGK 221 (447) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--~~~--~~~~~~~~~~~~~~~~~~-~~~~~~~n~~~ 221 (447) ..+ .+.. .... +++ .......+.++..+|+|++.+. .++ ...+...+..+....+.. .....+.||+. 
T Consensus 145 ~~v-~~~~--~~~~---~~~-~~~~~~~~~~~~~dIihir~~~~~~~~yGlsp~~~a~~si~l~~aa~~~~~~~~~NGa~ 217 (368) T protein:vir:79 145 KYV-RRGL--DLNT---YFF-VQNWQQPYTFAAGSVFHLQEPDINQEVYGLPEYLSALNATWLNESATLFRRRYYKNGSH 217 (368) T ss_pred ccc-eeec--cCCE---EEE-EecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 333 2221 1111 111 1223455678899999999653 222 222222233322222222 23334579999 Q ss_pred ccceeeeCC-cCChHHHHHHHHHHHHHHHHH-hccCCcceeec-----CCCceeeecCCChhhh-hHHHHHHHHHHHHHH Q lcl|NC_010576. 222 LNGFIQFPY-STKSTARAAQAARRKQEIENE-MANNKYGVATL-----DTQEKFVSAGMGLQNN-LLSDVRQLQQDFYNQ 293 (447) Q Consensus 222 ~~gvl~~~~-~~~~~~~~~~~~~~~~~~~~~-~~~n~~~~~vl-----~~g~~~~~l~~~~~~~-~l~~~~~~~~~Ia~~ 293 (447) ++|||.++. .+++++.+ +++++|++. +.+|+++++|+ ++|++|++++.++.++ +++.+++++++||++ T Consensus 218 ~~gil~~~~~~l~~e~~~----~lk~~~~~~~G~~N~g~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~a 293 (368) T protein:vir:79 218 AGFILYMTDAAQKQEDVD----TLREAMKSAKGPGNFRNLFMYAPNGKKDGIQLLPVSEVAAKDEFWNIKNVTRDDQLAA 293 (368) T ss_pred CceEEEeCCCCCCHHHHH----HHHHHHHHhcCCcccCceeEecCCCCccceeEEEcCCCHHHHHHHHHHHHhHHHHHHH Confidence 999998764 56655544 444455442 34688999998 6899999999998775 568889999999999 Q ss_pred hCCCHHHhcC--------CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHH Q lcl|NC_010576. 294 MGITEAILNG--------TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVAD 365 (447) Q Consensus 294 fgVP~~~l~g--------~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~ 365 (447) |||||.+||. ++.|++.+.|+++||.||++.|| ++|.+|.. .+++|+...|++.|.+++++... T Consensus 294 f~VPp~llGi~~~~t~~~sn~e~~~~~f~~~~l~Pl~~~ie-~ln~~l~~-------e~~rF~~~~l~~~D~~a~a~~~~ 365 (368) T protein:vir:79 294 HRVPPQLMGIIPNNTGGFGDVEKAAMVFARNEVKPLQDRLL-AINDWIGD-------EVVRFAPYALGGHDQPAAAPGGQ 365 (368) T ss_pred hCCCHHHccccCCCCCccccHHHHHHHHHHHHHHHHHHHHH-HHHhccCc-------ceeeechhHhhcccccccCCccc Confidence 9999999961 24589999999999999999998 68877632 36899999999999999987332 Q ss_pred HHH Q lcl|NC_010576. 
366 VLT 368 (447) Q Consensus 366 ~~~ 368 (447) +-- T Consensus 366 rsa 368 (368) T protein:vir:79 366 RSA 368 (368) T ss_pred ccC Confidence 211 No 91 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=100.00 E-value=8.5e-39 Score=229.30 Aligned_cols=320 Identities=7% Similarity=-0.046 Sum_probs=177.7 Q ss_pred CchhHhhhhhcccccCCccccccc--cccccccccccccccccccCCc-ccccchhhhhhHHHHHHHHHHHHhhccCceE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNT--NDFLTPSNGMTSFGGYYGRGQS-NYSRSYSYNKADLIKSVITRIALDASMVDFK 77 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~ 77 (447) |+--- + ..+....+... .....+.. ...|+. +.+ .+.+.+..+ .+..|+.- .+.-.-|+. T Consensus 26 ~~~~~---~----~~~~~~~~~~~~~~~~~~~~~--~~~f~f---g~p~~v~~~~~~~---~~~~~~~~--~~~~~pp~~ 88 (376) T protein:vir:10 26 MSKRR---S----RAPRTFAAAPNPSAGSAAPAR--AEVFTF---DDPTPVMNRAEIL---DYVECWSN--GEWFEPPVS 88 (376) T ss_pred chhcc---C----CCcccchhhhhHhhhccCcce--eEEEEc---CCceeccCcchhh---hhhhhhhc--CceecCCCC Confidence 33210 0 00000000000 00000000 000100 000 000111000 11112110 111111211 Q ss_pred ------EEEEcCCCceeccccchHHHHH-hhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCc Q lcl|NC_010576. 78 ------HLKIDPISGNQTPMPSGLINVL-TRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVG 150 (447) Q Consensus 78 ------~~r~~~~~~~~~~~~~~l~~lL-~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~ 150 (447) ++|.+. .......-....|. ..+|||+||+.+|++. +.+++++||||+++.++..+.+..+.++.+. .. 
T Consensus 89 ~~~La~~~~~~~--~h~s~l~~k~n~l~~~~~Pnp~lT~~~f~~~-v~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~-~v 164 (376) T protein:vir:10 89 FAGLAKSFRAST--HHSSALFFKANVLASTFRPHRWLSRHAFERW-ALDFLTFGNGYLERRRNMVGGTLRLEPALAK-YV 164 (376) T ss_pred HHHHHHHHhhhH--HhhhhHHHHhHHHHhccCCCCCCCHHHHHHH-HHHHHhcCCeEEEEEECCCCCEEEEEEeCCc-ce Confidence 010000 00000000001111 2479999999999855 5689999999999999887766655555443 32 Q ss_pred ceeeecCCceEEEEeeecccccceeeecccccccccccc--ccc--ccchhHHHHHHHHHHHHH-HHHHHHhhcCcccce Q lcl|NC_010576. 151 KIMQFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIESPF--YAI--LNDTNQTLRMLEQKIKLM-NSQDNRASSGKLNGF 225 (447) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--~~~--~~~~~~~~~~~~~~~~~~-~~~~~~~n~~~~~gv 225 (447) .+... ++. + ++ .......+.++.++|+|++.+. ... ...+...+..+..+.++. .+...+.||+.++|| T Consensus 165 r~~~d-~~~--~--~~-~~~~~~~~~~~~~eViHir~~~~~~~~yGls~~~~a~~si~l~~aa~~f~~~~f~NGa~pggI 238 (376) T protein:vir:10 165 RRKAD-FNG--F--VY-VNGWQERHEFEPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFI 238 (376) T ss_pred EEEee-CCe--E--EE-EEcCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceE Confidence 32221 111 1 11 1223445678899999999653 222 222223333333332222 233456799999999 Q ss_pred eeeCC-cCChHHHHHHHHHHHHHHHHH-hccCCcceeec-----CCCceeeecCCChhhh-hHHHHHHHHHHHHHHhCCC Q lcl|NC_010576. 226 IQFPY-STKSTARAAQAARRKQEIENE-MANNKYGVATL-----DTQEKFVSAGMGLQNN-LLSDVRQLQQDFYNQMGIT 297 (447) Q Consensus 226 l~~~~-~~~~~~~~~~~~~~~~~~~~~-~~~n~~~~~vl-----~~g~~~~~l~~~~~~~-~l~~~~~~~~~Ia~~fgVP 297 (447) |..+. .+++++. ++++++|++. +.+|.++++++ ++|++|+++++++.+. 
+++.+++++++||++|||| T Consensus 239 l~~~d~~l~~e~~----~~lr~~~~~~~G~~N~~~~~vl~~~g~~~Gi~~~pls~~~~d~qf~e~k~~~~~eIa~af~VP 314 (376) T protein:vir:10 239 LYMTDAAQKQDDV----DNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVP 314 (376) T ss_pred EEecCCCCCHHHH----HHHHHHHHHhcCccccCceeEecCCCCccceEEEEccCCHHHHHHHHHHHHhHHHHHHHhCCC Confidence 98764 5665544 4455555443 34688888888 5799999999998765 5688899999999999999 Q ss_pred HHHhcC--------CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHH Q lcl|NC_010576. 298 EAILNG--------TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQ 359 (447) Q Consensus 298 ~~~l~g--------~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~ 359 (447) |.++|- ++.|++.+.|+++||.|+++.|| ++|.+|.. ..|+|+...|+|+|.++ T Consensus 315 p~llGi~~~~t~~~sn~eq~~~~f~~~~L~Pl~~~ie-eln~~L~~-------~~~~F~~~~Llr~d~ka 376 (376) T protein:vir:10 315 PQLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARFA-ELNDWLGE-------EVVRFDDYEIPPAPVAA 376 (376) T ss_pred HHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHH-HHHhhccc-------cccccChhHhhcccccC Confidence 999951 33589999999999999999998 58877732 36899999999999988 No 92 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=100.00 E-value=2.3e-39 Score=232.37 Aligned_cols=243 Identities=11% Similarity=0.006 Sum_probs=156.1 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ||||.+ .+++. ... .......+...+.+........ 
++...++++++|++||++||++||+|||++|+ T Consensus 1 MglF~~------~~~r~-~~~----~~~~~~~~~~~~~~~~~~~~~~-v~~~~al~~~~v~~~i~~ia~~iA~lp~~~~~ 68 (251) T protein:vir:46 1 MGIFYK------NEKRD-LQY----NEDDLQMMVQTLPSFQGTKLRQ-YKDIEAIRHSDIFTAVMMIASDLARMPIRVTV 68 (251) T ss_pred CCcccc------ccccc-cCC----CccchhhhhhhhccccCcCcce-echhhhhccHHHHHHHHHHHHhHhhCceEEee Confidence 999864 12221 111 1111111111112222222333 44567889999999999999999999999997 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) ... ...+|++++||+.+||++||+++||+.++.+++++||||+++.++..+.+..++++.+. ...+.....+.. T Consensus 69 ~~~-----~~~~~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~-~v~v~~~~~g~~ 142 (251) T protein:vir:46 69 NGQ-----INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTS-EIELKSDARGRL 142 (251) T ss_pred Ccc-----ccccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCc-eEEEEECCCCcE Confidence 432 34689999999999999999999999999999999999999999887765555555444 334444334444 Q ss_pred EEEEeeec-ccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHH-----HHHhhcCcccceeeeCCcCCh Q lcl|NC_010576. 161 MVRVWNDN-TGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQ-----DNRASSGKLNGFIQFPYSTKS 234 (447) Q Consensus 161 ~~~~~~~~-~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~n~~~~~gvl~~~~~~~~ 234 (447) .+.++... ...+....++++||+|++.+..++.. +.+.+..+..++....+. 
..+.||++++|+|++++.+.+ T Consensus 143 ~~~~~~~~~~~~g~~~~~~~~diiH~r~~~~dg~~-G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~ 221 (251) T protein:vir:46 143 YYFHQRIDSNGNNIERNVKFEDMLDIKFYSLDGIN-GLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDN 221 (251) T ss_pred EEEEEEeccCCcceeEEECCccEEEecCcCCCCee-ecCHHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeCCCCCC Confidence 44433222 23344567899999999976443322 223344444444443332 335789999999999988765 Q ss_pred HHHHHHHHHHHHHHHHHhcc--CCcceeecCCCcee Q lcl|NC_010576. 235 TARAAQAARRKQEIENEMAN--NKYGVATLDTQEKF 268 (447) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~--n~~~~~vl~~g~~~ 268 (447) + ++++++++.|.+.+.+ |+|++++ |++= T Consensus 222 ~---e~~~~~~~~~~~~~~g~~n~g~~~~---gm~~ 251 (251) T protein:vir:46 222 K---KARDRAREEFPKVLVELNKLGKLSY---SMNQ 251 (251) T ss_pred H---HHHHHHHHHHHHHhcCccccccccc---ccCC Confidence 3 3445566667666664 6777654 3331 No 93 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=100.00 E-value=3.2e-38 Score=226.13 Aligned_cols=316 Identities=8% Similarity=-0.032 Sum_probs=185.1 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccC--C---ccc--c--cchhhhhhHHHHHHHHHHHHhh Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRG--Q---SNY--S--RSYSYNKADLIKSVITRIALDA 71 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~---~~~--~--~~~~~~~~~~v~~cv~~ia~~i 71 (447) |+-.. ++..+............ .+++ |..... + ..+ + ....+.+.|.-+.. 
||+.+ T Consensus 1 m~~~~----------~~~~~~~~~~~~~~~~~-~~~~-~~p~~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~---la~l~ 65 (346) T protein:vir:10 1 MKKQL----------RKNLTQNDRLQPQAQTE-IFSF-GDPIPVLDRADILNYLECSAMYEKWYNPPMSFDG---LAKSL 65 (346) T ss_pred CCccc----------CCCCCcccccccccCeE-EEec-CCcceecCchhHHHHHHHhhcCCceEecCCCHHH---HHHHH Confidence 33321 11111111111101100 0111 110000 0 000 0 00111122222222 23332 Q ss_pred ccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcc Q lcl|NC_010576. 72 SMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGK 151 (447) Q Consensus 72 a~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~ 151 (447) -..|.+ +.......|.+..++. +||++||+++|++ ++.+++++||||+++.++..+.+..++++.+..+ . T Consensus 66 ~~~~~h-------~~~i~~k~n~l~~l~~-~Pn~~~t~~~f~~-~~~d~ll~Gnay~~i~r~~~G~~~~L~pl~~~~v-~ 135 (346) T protein:vir:10 66 RSSTHH-------ESAIITKANILLSTCE-VDSRYLSRRDLSS-FVKDYLVFGNAYFEVVRNRLGQVQRIESPLAKYV-R 135 (346) T ss_pred Hhhhhc-------chhhhhhhhhHHHHHh-CCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCcEEEEEEecCCce-E Confidence 222322 1222345688888875 7999999999987 5678999999999999988776665555544333 3 Q ss_pred eeeecCCceEEEEeeecccccceeeeccccccccccccc-cc---ccchhHHHHHHHHHHHHH-HHHHHHhhcCccccee Q lcl|NC_010576. 152 IMQFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIESPFY-AI---LNDTNQTLRMLEQKIKLM-NSQDNRASSGKLNGFI 226 (447) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~-~~---~~~~~~~~~~~~~~~~~~-~~~~~~~n~~~~~gvl 226 (447) +.. ..+...+. .....+....++..+|+|++.+.. .. .+........+..+.++. 
.....+.||+.++||| T Consensus 136 ~~~-~~~~~~~~---~~~~~g~~~~~~~~dIih~r~~~~~~~~~G~~~~~~a~~si~l~~~a~~~~~~~~~NG~~~~~il 211 (346) T protein:vir:10 136 KGL-EAGQFYYV---PQRFDHQEHEFAKGSIYHLLEPDINQDIYGLPQYLSALQSAWLNESATLFRRKYFLNGAHAGFVF 211 (346) T ss_pred EEE-cCCeEEEE---EEccCCeEEEEecccEEEecCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEE Confidence 322 12222222 222234456788999999997642 11 222222223322222222 2223357899999999 Q ss_pred eeCC-cCChHHHHHHHHHHHHHHHHH-hccCCcceeecC-----CCceeeecCCChhhh-hHHHHHHHHHHHHHHhCCCH Q lcl|NC_010576. 227 QFPY-STKSTARAAQAARRKQEIENE-MANNKYGVATLD-----TQEKFVSAGMGLQNN-LLSDVRQLQQDFYNQMGITE 298 (447) Q Consensus 227 ~~~~-~~~~~~~~~~~~~~~~~~~~~-~~~n~~~~~vl~-----~g~~~~~l~~~~~~~-~l~~~~~~~~~Ia~~fgVP~ 298 (447) .++. .+++++.++ ++++|++. +.+|+++++++. .|++++|++.++.+. +++.+++++++||++||||| T Consensus 212 ~~~d~~l~~e~~~~----i~~~~~~~~g~~n~~~~~vl~~~~~~~gi~~~pis~~~~d~qf~e~k~~~~~~I~~af~VPp 287 (346) T protein:vir:10 212 YMSDASQKQEDVEN----IRQQLKQSKGVGNFKNLFVHAPNGKKDGIQIIPIADVSAKDEFFNIKNVSRDDVLAAHRVPP 287 (346) T ss_pred EeCCCCCCHHHHHH----HHHHHHHhcCccccCceeEecCCCCccceeEEecCCChhHHHHHHHHHHhHHHHHHHhCCCH Confidence 8754 566555444 44444433 346888999885 478899999988665 56788999999999999999 Q ss_pred HHhcC--------CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCH Q lcl|NC_010576. 299 AILNG--------TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPV 357 (447) Q Consensus 299 ~~l~g--------~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~ 357 (447) .+||. ++.|++.+.|++++|.||+++||+ ++.+|.. .+|+|+...|+++|. 
T Consensus 288 ~llG~~~~~~~~~s~~e~~~~~f~~~~l~P~~~~iee-~n~~L~~-------e~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 288 QLMGIIPNNTGGFGNVADAAEVFFITEIEPLQERLKE-FNQWLGQ-------EVIKFKPSKLLQRTQ 346 (346) T ss_pred HHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhccc-------ceeeechhhhcccCC Confidence 99961 235899999999999999999985 6666642 368999999999997 No 94 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=100.00 E-value=2e-38 Score=227.28 Aligned_cols=316 Identities=9% Similarity=-0.043 Sum_probs=182.7 Q ss_pred ccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccC---ceE------EEEEcCC Q lcl|NC_010576. 14 FQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMV---DFK------HLKIDPI 84 (447) Q Consensus 14 f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~l---p~~------~~r~~~~ 84 (447) ++.......+........ .+++++. +. .-+....+..|+..+.+..... |+. ++|.+.- T Consensus 1 ~~~~~~~~~~~~~~~~~~--~~~~~~~-----p~-----~~~~~~~~~~~~~~~~~~~~~~~epp~~~~~La~l~~~n~~ 68 (348) T protein:vir:26 1 MTEQLIHSHTTDGTESKS--VYSFDPN-----PE-----PVDTNSWMTRYCELFYNDFDDYWEPPISLKGLAEIANANGY 68 (348) T ss_pred CCccccchhhccccCCce--EEEecCC-----Ce-----eecCcchHHHHHHHHhcCCCccccCCCCHHHHHHHHhhhhh Confidence 111111111111110111 1111101 11 1112223555666665443322 321 1111100 Q ss_pred C-ceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCceEEE Q lcl|NC_010576. 85 S-GNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQVMVR 163 (447) Q Consensus 85 ~-~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 163 (447) - .......+.+.. ..+||++||+++|++. +.+++++||||+++.++..+.+..+.++ +....++. ..+. 
T Consensus 69 h~~~i~~k~N~l~~--~~~Pn~~~t~~~f~~~-~~d~ll~Gnay~~~~rn~~G~~~~L~~l-~~~~v~~~--~d~~---- 138 (348) T protein:vir:26 69 HGSLLKARANYVAG--RFMNGGGLPMYKMNSA-CWDYFGLGMSAFVKIRSYLKNVIALEPL-PMVHMRKR--KNGD---- 138 (348) T ss_pred hhhhHhhhhhHHhh--cccCCCCCCHHHHHHH-HHHHHhcCCeEEEEEEcCCCcEEEEEEe-cCceeEee--ecCc---- Confidence 0 000000011111 2379999999999775 4699999999999999887765544444 33332222 1111 Q ss_pred Eeeecccccceeeecccccccccccc--ccc--ccchhHHHHHHHHHHHHH-HHHHHHhhcCcccceeeeCC-cCChHHH Q lcl|NC_010576. 164 VWNDNTGLEQDLLVSKENCIIIESPF--YAI--LNDTNQTLRMLEQKIKLM-NSQDNRASSGKLNGFIQFPY-STKSTAR 237 (447) Q Consensus 164 ~~~~~~~~~~~~~~~~~~v~~~~~~~--~~~--~~~~~~~~~~~~~~~~~~-~~~~~~~n~~~~~gvl~~~~-~~~~~~~ 237 (447) ++.. ...+....+++++|+|++.+. ... ...+...++.+....++. .+...+.||+.++|||..+. .+++++ T Consensus 139 ~~~~-~~~g~~~~f~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~~~~ls~e~- 216 (348) T protein:vir:26 139 FVQL-LRNNEQKVFKAKDVIFIPQYDPQQQIYGLPDYLGSIQSSLLNRDATLFRRRYYLNGAHMGFIFYATDPNLSEAD- 216 (348) T ss_pred EEEE-EecCeEEEEcCccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHH- Confidence 1111 123445678899999999653 221 222223333333332222 23334678999999997654 465544 Q ss_pred HHHHHHHHHHHHHH-hccCCcceeec-----CCCceeeecCCChhh-hhHHHHHHHHHHHHHHhCCCHHHhc------C- Q lcl|NC_010576. 238 AAQAARRKQEIENE-MANNKYGVATL-----DTQEKFVSAGMGLQN-NLLSDVRQLQQDFYNQMGITEAILN------G- 303 (447) Q Consensus 238 ~~~~~~~~~~~~~~-~~~n~~~~~vl-----~~g~~~~~l~~~~~~-~~l~~~~~~~~~Ia~~fgVP~~~l~------g- 303 (447) +++++++|.+. 
+.+|.++++++ +.|++|+|++.++.+ ++++.+++++++||++|||||+++| + T Consensus 217 ---~~~lk~~~~~~~G~~n~~~~~vl~~~g~~~Gi~~~pis~~~~d~qf~e~k~~t~~dIa~af~VPp~llGi~~~~~~~ 293 (348) T protein:vir:26 217 ---EKALKEKIASSKGIGNFRSMFVNIPNGKEKGIQLIPVGDIATKDEFERIKNITAQDIFVGHRFPAGMGGMLPQQGAN 293 (348) T ss_pred ---HHHHHHHHHHhcCcccccceeEEcCCCCccceeEEEccCChhHHHHHHHHHhhHHHHHHHhCCCHHHccccCCCCCc Confidence 44555555544 33588889888 889999999998866 5578899999999999999999986 1 Q ss_pred -CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchh-hhcCHHHH Q lcl|NC_010576. 304 -TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPF-KLVPVEQL 360 (447) Q Consensus 304 -~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l-~~~d~~~~ 360 (447) ++.|++.+.|+++||.||++.||++||++|..+ .+.+++|+++.. ++.|..+. T Consensus 294 ~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~----~~~~~~fdl~~~~e~~~~~a~ 348 (348) T protein:vir:26 294 VPDPLKVSQVYDFYEVIPVCKRFMDAVNNDPEIP----DNLKLKFNLNPGVESANGSAV 348 (348) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHhhhhCCC----CccEEEEecCcccccchhhcC Confidence 235899999999999999999999999998643 457889988853 33332222 No 95 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=100.00 E-value=3e-38 Score=226.26 Aligned_cols=322 Identities=7% Similarity=-0.037 Sum_probs=178.1 Q ss_pred CchhHhhhhhcccccCCc-ccccccccccccccc--ccc--cccccc-cCCcccccchhhhhhHHHHHHHHHHHHhhc-- Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQ-NQNQNTNDFLTPSNG--MTS--FGGYYG-RGQSNYSRSYSYNKADLIKSVITRIALDAS-- 72 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~-~~~~~~~~~~~~~~~--~~~--~~~~~~-~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia-- 72 (447) |+-- ++....-..+. ........+.....+ ..+ ..+... 
.+.........+.+.|+=+..+..+.+.-+ T Consensus 1 ~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~h 77 (351) T protein:vir:79 1 MSKR---RSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHH 77 (351) T ss_pred CCCC---CCCCCCCCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhHhh Confidence 3220 10000000000 000000000000000 000 000000 000000001112222221211111111111 Q ss_pred cCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcce Q lcl|NC_010576. 73 MVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKI 152 (447) Q Consensus 73 ~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~ 152 (447) .-+++. ..+.+.. ..+|||+||+++|++ ++.+++++||||+++.++..+.+..+.++.+..+ ++ T Consensus 78 ~~~l~~------------k~n~l~~--~~~Pnp~~t~~~f~~-~v~d~ll~Gnay~~~~r~~~G~~~~L~~l~~~~v-~~ 141 (351) T protein:vir:79 78 SSALFF------------KANVLAS--TFRPHRWLSRHAFER-WALDFLTFGNGYLERRRNMVGGTLRLEPALAKYV-RR 141 (351) T ss_pred hhhhhh------------hhhHHhh--cccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCCEEEEEEeCCcce-ee Confidence 001110 0111111 247999999999975 6679999999999999988776665555544332 22 Q ss_pred eeecCCceEEEEeeecccccceeeeccccccccccccc--c--cccchhHHHHHHHHHHHHH-HHHHHHhhcCcccceee Q lcl|NC_010576. 153 MQFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIESPFY--A--ILNDTNQTLRMLEQKIKLM-NSQDNRASSGKLNGFIQ 227 (447) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~--~--~~~~~~~~~~~~~~~~~~~-~~~~~~~n~~~~~gvl~ 227 (447) .... + .+ + .....+..+.++.++|+|++.+.. . +......++..+..+.++. .....+.||+.++|||. 
T Consensus 142 ~~~~-~--~~--~-~~~~~g~~~~~~~~eIihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~il~ 215 (351) T protein:vir:79 142 KADF-S--GF--V-YVNGWQERHEFEPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILY 215 (351) T ss_pred eecC-C--eE--E-EEecCceEEEEcCccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEE Confidence 2111 1 11 1 112234456788999999986542 1 1222233333333333332 23334678999999998 Q ss_pred eCC-cCChHHHHHHHHHHHHHHHHH-hccCCcceeec-----CCCceeeecCCChhhh-hHHHHHHHHHHHHHHhCCCHH Q lcl|NC_010576. 228 FPY-STKSTARAAQAARRKQEIENE-MANNKYGVATL-----DTQEKFVSAGMGLQNN-LLSDVRQLQQDFYNQMGITEA 299 (447) Q Consensus 228 ~~~-~~~~~~~~~~~~~~~~~~~~~-~~~n~~~~~vl-----~~g~~~~~l~~~~~~~-~l~~~~~~~~~Ia~~fgVP~~ 299 (447) .+. .+++++.+. ++++|++. +.+|.++++++ +.|++|+|++.++++. +++.+++++++||++|||||. T Consensus 216 ~~~~~ls~e~~~~----lk~~~~~~~G~~N~~~~~v~~~~g~~~gi~~~pl~~~~~d~ef~e~k~~s~~eI~~a~~VPp~ 291 (351) T protein:vir:79 216 MTDAAQKQDDVDN----MRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQ 291 (351) T ss_pred ecCCCCCHHHHHH----HHHHHHHhcCccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHH Confidence 754 566555444 44444432 34688888887 6899999999998765 568889999999999999999 Q ss_pred Hhc----C----CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHH Q lcl|NC_010576. 
300 ILN----G----TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQ 359 (447) Q Consensus 300 ~l~----g----~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~ 359 (447) ++| + ++.|++.+.|+++||.||++.||+ +|.+|- ...++|+...|+++|.++ T Consensus 292 llGi~~~~t~~~~n~e~~~~~f~~~~l~Pl~~~ie~-ln~~lg-------~~~~~F~~~~llr~d~~a 351 (351) T protein:vir:79 292 LLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARFAE-LNDWLG-------DEVVTFDDYEIPPAPVAA 351 (351) T ss_pred HhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhcC-------cceeeeChhhhccccccC Confidence 995 1 235899999999999999999985 776652 236899999999999988 No 96 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=100.00 E-value=5.6e-38 Score=224.82 Aligned_cols=322 Identities=7% Similarity=-0.040 Sum_probs=177.5 Q ss_pred CchhHhhhhhcccccCCc-ccccccccccccccc--ccc--cccccc-cCCcccccchhhhhhHHHHHHHHHHHHhh--c Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQ-NQNQNTNDFLTPSNG--MTS--FGGYYG-RGQSNYSRSYSYNKADLIKSVITRIALDA--S 72 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~-~~~~~~~~~~~~~~~--~~~--~~~~~~-~~~~~~~~~~~~~~~~~v~~cv~~ia~~i--a 72 (447) |+-- ++....-..+. ........+.....+ ..+ ..+... .+.........+.+.|+=+..+..+.+.- . T Consensus 1 ~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~h 77 (351) T protein:vir:78 1 MSKR---RSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHH 77 (351) T ss_pred CCCC---CCCCCCCCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhccCceecCCCCHHHHHHHHhhhHhh Confidence 3220 10000000000 000000000000000 000 000000 00000000111222222222111111111 1 Q ss_pred cCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcce Q lcl|NC_010576. 
73 MVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKI 152 (447) Q Consensus 73 ~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~ 152 (447) ..+++. ..+-+.. ..+||++||+++|++ ++.+++++||||+++.++..+.+....++.+ ....+ T Consensus 78 ~~~l~~------------k~n~l~~--~~~Pn~~~t~~~f~~-~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~-~~v~~ 141 (351) T protein:vir:78 78 SSALFF------------KANVLAS--TFRPHRWLSRHAFER-WALDFLTFGNGYLERRRNMVGGTLRLEPALA-KYVRR 141 (351) T ss_pred hhhhhh------------hhhHHhh--cccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCCEEEEEEecC-cceEE Confidence 111111 0011111 247999999999975 5578899999999999988776655444443 33232 Q ss_pred eeecCCceEEEEeeecccccceeeecccccccccccc--ccc--ccchhHHHHHHHHHHHH-HHHHHHHhhcCcccceee Q lcl|NC_010576. 153 MQFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIESPF--YAI--LNDTNQTLRMLEQKIKL-MNSQDNRASSGKLNGFIQ 227 (447) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--~~~--~~~~~~~~~~~~~~~~~-~~~~~~~~n~~~~~gvl~ 227 (447) ... .+. +++ ....+....++..+|+|++.+. ... ...+..++..+..+.++ ..+...+.||+.++|||. T Consensus 142 ~~~-~~~----~~~-~~~~~~~~~~~~~eVihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pggIl~ 215 (351) T protein:vir:78 142 KAD-FSG----FVY-VNGWQERHEFAPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILY 215 (351) T ss_pred eee-CCe----EEE-EecCCeEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEE Confidence 221 111 111 1123445678899999999653 221 12222333333333222 233345679999999998 Q ss_pred eCC-cCChHHHHHHHHHHHHHHHHH-hccCCcceeec-----CCCceeeecCCChhhh-hHHHHHHHHHHHHHHhCCCHH Q lcl|NC_010576. 228 FPY-STKSTARAAQAARRKQEIENE-MANNKYGVATL-----DTQEKFVSAGMGLQNN-LLSDVRQLQQDFYNQMGITEA 299 (447) Q Consensus 228 ~~~-~~~~~~~~~~~~~~~~~~~~~-~~~n~~~~~vl-----~~g~~~~~l~~~~~~~-~l~~~~~~~~~Ia~~fgVP~~ 299 (447) .+. .+++++.+. ++++|++. +.+|+++++++ +.|++|+|++.++.+. +++.+++++++||++|||||. 
T Consensus 216 ~~~~~ls~e~~~~----lr~~~~~~~G~~N~~~~~v~~~~g~~~g~k~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~ 291 (351) T protein:vir:78 216 MTDAAQKQDDVDN----MRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQ 291 (351) T ss_pred ecCCCCCHHHHHH----HHHHHHHhcCcccccceeeecCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHH Confidence 754 566555444 44444432 34588889887 5789999999998765 568889999999999999999 Q ss_pred Hhc----C----CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHH Q lcl|NC_010576. 300 ILN----G----TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQ 359 (447) Q Consensus 300 ~l~----g----~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~ 359 (447) ++| + ++.|++.+.|+++||.||++.||+ ++.+|. ..+|+|+.+.|+++|.++ T Consensus 292 llGi~~~~t~~~sn~e~~~~~f~~~~l~P~~~~iee-~n~~l~-------~~~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 292 LLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARFAE-LNDWLG-------DEVVRFDDYEIPPAPVAA 351 (351) T ss_pred HhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhcC-------ccceecChhhhccccccC Confidence 995 1 345899999999999999999995 666663 236999999999999988 No 97 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=100.00 E-value=5.2e-37 Score=219.49 Aligned_cols=318 Identities=7% Similarity=-0.010 Sum_probs=181.5 Q ss_pred hhhcccccCCcccccccccccccccccccccccccc-CCccc-----ccchhhhhhHHHHHHHHHHHHhhccCceEEEEE Q lcl|NC_010576. 8 LHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGR-GQSNY-----SRSYSYNKADLIKSVITRIALDASMVDFKHLKI 81 (447) Q Consensus 8 ~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-----~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r~ 81 (447) |+ -.+.+...+.....+..... +++ +-... 
....| .....+.+.|.-+..+..+ +-.-+-+ T Consensus 1 ~~---~~~~~~~~~~~~~~~~~~~~--f~~-~~~~~~~~~~y~~~~~~~~~~~~epp~~~~~la~l---~~~~~~h---- 67 (345) T protein:vir:37 1 MK---TNVKTDNKKGIVIAPINDRT--FSL-NEISASPALDYVGIGFDENYNCYLPPVNRHALAKL---PHQNAQH---- 67 (345) T ss_pred CC---CCccccchhhcccCcceeEE--eec-CCcccccchhhhhhhhcCCccccCCCCCHHHHHHH---hhccccc---- Confidence 11 11111111111110001000 011 10000 00000 1122233444333332222 2111111 Q ss_pred cCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecCCceE Q lcl|NC_010576. 82 DPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFPRQVM 161 (447) Q Consensus 82 ~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 161 (447) +.....+.+-+.. ..+||++||+++|++ ++.+++++||||+++.++..+.+..+.++ +....++... .+... T Consensus 68 ---~~~i~~k~n~l~~--~~~Pn~~lt~~~f~~-~~~d~ll~Gnay~~~~rn~~G~~~~L~pl-~~~~vr~~~d-~~~~~ 139 (345) T protein:vir:37 68 ---GGILHSRANMVSS--LYEGGKALSRMDMRA-LCLNLIQFGDVGLLKVRNGFGQVVRLVPL-SSLYLRVRKD-GGYSY 139 (345) T ss_pred ---ccceeeechHHHh--hccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCcEEEEEEE-cCceeEEEEe-CCeeE Confidence 0001111122332 347999999999986 45789999999999999887766554444 4333333322 12221 Q ss_pred EEEeeecccccceeeecccccccccccc--ccc--ccchhHHHHHHHHHHHHH-HHHHHHhhcCcccceeeeC-CcCChH Q lcl|NC_010576. 162 VRVWNDNTGLEQDLLVSKENCIIIESPF--YAI--LNDTNQTLRMLEQKIKLM-NSQDNRASSGKLNGFIQFP-YSTKST 235 (447) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~--~~~--~~~~~~~~~~~~~~~~~~-~~~~~~~n~~~~~gvl~~~-~~~~~~ 235 (447) ...+......+....++.++|+|++.+. ... ...+...+..+....++. 
.+...+.||+.++|||..+ ..++++ T Consensus 140 ~~~~~~~~~~g~~~~~~~~dVihir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NG~~p~~Il~~~d~~l~~e 219 (345) T protein:vir:37 140 LMKKSLYDTAQEIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTDPDLTEE 219 (345) T ss_pred EEEEeEecCCceEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEecCCCCCHH Confidence 1112222334456678899999999653 221 222223333333222222 2334467899999999875 456555 Q ss_pred HHHHHHHHHHHHHHHH-hccCCcceeec-----CCCceeeecCCChhhh-hHHHHHHHHHHHHHHhCCCHHHhcC----- Q lcl|NC_010576. 236 ARAAQAARRKQEIENE-MANNKYGVATL-----DTQEKFVSAGMGLQNN-LLSDVRQLQQDFYNQMGITEAILNG----- 303 (447) Q Consensus 236 ~~~~~~~~~~~~~~~~-~~~n~~~~~vl-----~~g~~~~~l~~~~~~~-~l~~~~~~~~~Ia~~fgVP~~~l~g----- 303 (447) +.+ +++++|++. +.+|.++++++ +.|++|+|+++++.+. +++.+++++++||++|||||+++|- T Consensus 220 ~~~----~lk~~~~~~~g~~n~~~~~i~~p~g~~~G~~~~pls~~~~d~qf~e~k~~~~~dIa~a~~VPp~llGi~~~~~ 295 (345) T protein:vir:37 220 MEE----EIARKISESKGVGNFRSMFVNIANGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNT 295 (345) T ss_pred HHH----HHHHHHHHhcCcccccceEEEcCCCcccceEEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCccCCCC Confidence 444 445555443 34577888877 6899999999998765 5688899999999999999999961 Q ss_pred ---CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhh Q lcl|NC_010576. 
304 ---TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKL 354 (447) Q Consensus 304 ---~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~ 354 (447) ++.|++.+.|+++||.|++++||+++|+.+ +...+.+++|+..+|.+ T Consensus 296 ~~~~~~e~~~~~f~~~~l~P~~~~ie~~ln~~~----~~~~~~~i~F~~~~L~~ 345 (345) T protein:vir:37 296 GGLGDPLKYREVYHYDEVMPLQEIIAETINQDP----EIKNLLKIKFREQNFAK 345 (345) T ss_pred CCcccHHHHHHHHHHHHHHHHHHHHHHHhhhhc----cCCCcceEEecchhhcC Confidence 235899999999999999999999999743 24456789999888877 No 98 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=100.00 E-value=4.5e-37 Score=219.87 Aligned_cols=319 Identities=7% Similarity=-0.025 Sum_probs=178.3 Q ss_pred CchhHhhhhhcccccCCcccccccccccccccc--ccccccc-cccCCcccccchhhhhhHHHHHHHHHH--HHhhccCc Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNG--MTSFGGY-YGRGQSNYSRSYSYNKADLIKSVITRI--ALDASMVD 75 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~--~~~~~~~-~~~~~~~~~~~~~~~~~~~v~~cv~~i--a~~ia~lp 75 (447) |.-. +....++.....+...... ..+.... .......+.....+.+.|.=+..+..+ ++..-.-+ T Consensus 1 ~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~epp~~~~~la~~~~~~~~h~~~ 70 (345) T protein:vir:37 1 MKTN----------VKTDNKKGIVIAPINDRTFSLSEITASPALDYVGIGFDENYNCYLPPVNRHALAKLPHQNAQHGGI 70 (345) T ss_pred CCcc----------ccccchhhhcCCCceEEEeecCCcccchhhcccceeeecCCccccCCCCHHHHHHHhhcchhhcch Confidence 1111 1111000011100000000 0000000 000000001112233333322222222 11111112 Q ss_pred eEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeee Q lcl|NC_010576. 76 FKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQF 155 (447) Q Consensus 76 ~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (447) +++ +.+-+ +...+|||+||+++|++ ++.+++++||||+++.++..+.+..+.++ ++...++... 
T Consensus 71 i~~------------k~n~l--~~~~~Pn~~~t~~~f~~-~v~d~ll~Gnay~~i~rn~~G~~~~L~pl-~~~~vr~~~d 134 (345) T protein:vir:37 71 LHS------------RANMV--SATYEGGKALSKMEMRA-LCLNLIQFGDVGLLKVRNGFGQVVRLVPL-SSLYLRVHKD 134 (345) T ss_pred hhh------------hhhHH--hhccCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCCEEEEEEe-cCceeEEeec Confidence 211 11122 22458999999999975 55789999999999999887765544444 3333222211 Q ss_pred cCCceEEEEeeecccccceeeecccccccccccc--cccc--cchhHHHHHHHHHHHH-HHHHHHHhhcCcccceeeeCC Q lcl|NC_010576. 156 FPRQVMVRVWNDNTGLEQDLLVSKENCIIIESPF--YAIL--NDTNQTLRMLEQKIKL-MNSQDNRASSGKLNGFIQFPY 230 (447) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--~~~~--~~~~~~~~~~~~~~~~-~~~~~~~~n~~~~~gvl~~~~ 230 (447) .+......+......+....+++++|+|++.+. .... ......+..+..+.++ ..+...+.||+.++|||..+. T Consensus 135 -~~~~~~~~~~~~~~~g~~~~~~~~eViHir~~~~~~~~~Gl~~~~~a~~si~l~~~a~~~~~~~f~NGa~~~~Il~~t~ 213 (345) T protein:vir:37 135 -GGYSYLMKKSLYDTAQEIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTD 213 (345) T ss_pred -CCeeEEEeeeeeccCceEEEEccccEEEEcCCCCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC Confidence 111111112222333556678899999999653 2221 2222223332222222 223344678999999998654 Q ss_pred -cCChHHHHHHHHHHHHHHHHHhc-cCCcceeec-----CCCceeeecCCChhhh-hHHHHHHHHHHHHHHhCCCHHHhc Q lcl|NC_010576. 231 -STKSTARAAQAARRKQEIENEMA-NNKYGVATL-----DTQEKFVSAGMGLQNN-LLSDVRQLQQDFYNQMGITEAILN 302 (447) Q Consensus 231 -~~~~~~~~~~~~~~~~~~~~~~~-~n~~~~~vl-----~~g~~~~~l~~~~~~~-~l~~~~~~~~~Ia~~fgVP~~~l~ 302 (447) .+++++ .++++++|++... +|.+.++++ +.|++|++++.++.+. 
+++.+++++++||++|||||.++| T Consensus 214 ~~l~~e~----~~~lk~~~~~~~g~~n~~~~~i~~~~g~~~G~~~~pl~~~~~d~qf~e~k~~~~~dI~~a~~VPp~liG 289 (345) T protein:vir:37 214 PDLTEEM----EEEIARKISESKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSG 289 (345) T ss_pred CCCCHHH----HHHHHHHHHHhcCccccCceeEecCCCCccceeEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhc Confidence 565544 4455556655443 355555555 5789999999988664 568889999999999999999995 Q ss_pred C--------CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhh Q lcl|NC_010576. 303 G--------TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKL 354 (447) Q Consensus 303 g--------~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~ 354 (447) - ++.|++.+.|+++||.||+++||+++|+.+ +...+++|+|+..+|+| T Consensus 290 i~~~~t~~~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~~----e~~~~~~i~F~~~~l~k 345 (345) T protein:vir:37 290 IIPTNTGGLGDPLKYREVYHYDEVMPLQEIIAETINQDP----EIKNLLKIKFREQNFAK 345 (345) T ss_pred cccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHhhhhh----ccCCcceEEECchhhcC Confidence 1 235899999999999999999999999743 34567899999999999 No 99 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=100.00 E-value=1.1e-37 Score=223.29 Aligned_cols=312 Identities=8% Similarity=-0.013 Sum_probs=180.8 Q ss_pred CchhHhhhhhcccccCCcccccccccccc--ccccccc--cc-cccccCCcccccchhhhhhHHHHHHHHHHHHhhcc-- Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLT--PSNGMTS--FG-GYYGRGQSNYSRSYSYNKADLIKSVITRIALDASM-- 73 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~--~~~~~~~--~~-~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~-- 73 (447) |+- | |++...++....... ...+..| .. +..-.+.........+.++|+-+..+..+.++-+. 
T Consensus 1 m~~--~--------~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~~a~~~h~ 70 (340) T protein:vir:98 1 MSK--R--------KPRKAVAMTASAPQKMEAFTFGEPVPVLDKRDILDYVECISNGKWYEPPVSFSGLAKSLRSAVHHS 70 (340) T ss_pred CCC--C--------CCCccccccccCccceeEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHHhccccc Confidence 331 0 000111111100000 0000001 00 00000111111223344555544444333322221 Q ss_pred CceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCccee Q lcl|NC_010576. 74 VDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIM 153 (447) Q Consensus 74 lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (447) -+++. .-+.+... .+|||+||+++|++ ++.+++++||||+++.++..+.+..+.++ +....++. T Consensus 71 s~i~~------------k~n~l~~~--~~Pn~~lt~~~f~~-~~~d~ll~Gnay~~~~rn~~G~~~~L~pl-~~~~vr~~ 134 (340) T protein:vir:98 71 SPIYV------------KRNVLAST--YIPHPLLSRQDFSR-FALDYLVFGNAFLEQRHSVTGQLIKLLTS-PAKYTRRG 134 (340) T ss_pred hhhhh------------hhhHHhhc--cCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCcEEEEEEe-CCceEEEc Confidence 12221 11222222 47999999999975 55799999999999999877765444443 33322221 Q ss_pred eecCCceEEEEeeecccccceeeecccccccccccc--ccc--ccchhHHHHHHHHHHHH-HHHHHHHhhcCcccceeee Q lcl|NC_010576. 154 QFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIESPF--YAI--LNDTNQTLRMLEQKIKL-MNSQDNRASSGKLNGFIQF 228 (447) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--~~~--~~~~~~~~~~~~~~~~~-~~~~~~~~n~~~~~gvl~~ 228 (447) ..+.. +| .....+..+.+++++|+|++.+. ... 
...+..++..+....++ ..+...+.||+.++|||.+ T Consensus 135 --~~~~~---~~-~~~~~~~~~~~~~~eViHir~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~pg~il~~ 208 (340) T protein:vir:98 135 --VDDSV---FW-FVENFTQPHEFAPDTVFHLLEPDINQEIYGLPEYLSALNSAWLNESATLFRRKYYQNGAHAGYIMYV 208 (340) T ss_pred --ccCcE---EE-EEecCCeEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEe Confidence 11111 11 22233455678899999999643 222 12222223333222222 2233446789999999987 Q ss_pred CC-cCChHHHHHHHHHHHHHHHHH-hccCCcceeec-----CCCceeeecCCChhhh-hHHHHHHHHHHHHHHhCCCHHH Q lcl|NC_010576. 229 PY-STKSTARAAQAARRKQEIENE-MANNKYGVATL-----DTQEKFVSAGMGLQNN-LLSDVRQLQQDFYNQMGITEAI 300 (447) Q Consensus 229 ~~-~~~~~~~~~~~~~~~~~~~~~-~~~n~~~~~vl-----~~g~~~~~l~~~~~~~-~l~~~~~~~~~Ia~~fgVP~~~ 300 (447) +. .+++++. ++++++|++. +.+|.++++++ ++|++|+|++.++.+. +++.+++++++||++|||||++ T Consensus 209 ~~~~ls~e~~----~~lk~~~~~~~G~~n~~~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~l 284 (340) T protein:vir:98 209 TDPAQSATDV----ESLRDAMRNSKGLGNFKNLFFYSPNGKPDGIKIVPLSEVATKDDFFNIKKASAADLMDAHRVPFQL 284 (340) T ss_pred cCCCCCHHHH----HHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHH Confidence 64 4665544 4445555443 34588889888 6799999999998765 5688899999999999999999 Q ss_pred hcC--------CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcC Q lcl|NC_010576. 
301 LNG--------TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVP 356 (447) Q Consensus 301 l~g--------~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d 356 (447) +|- ++.|++.+.|+++||.||++.||+ +|.+|..+ .++|+...|++.| T Consensus 285 lGi~~~~t~~~sn~e~~~~~f~~~~l~Pl~~~iee-~n~~L~~e-------~~rF~~~~l~~~d 340 (340) T protein:vir:98 285 MGGKPENIGSLGDVEKVAKVFVRNELSPLQDRFRE-VNDWLGME-------VIRFKEYTLDNPE 340 (340) T ss_pred hcccCCCCCccccHHHHHHHHHHHHHHHHHHHHHH-HHhccccc-------ccccCccccccCC Confidence 961 235999999999999999999995 88887432 4789999999999 No 100 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=100.00 E-value=3.7e-37 Score=220.35 Aligned_cols=308 Identities=10% Similarity=0.009 Sum_probs=184.4 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhc---cCceE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDAS---MVDFK 77 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia---~lp~~ 77 (447) |+ |++..++... ..++.. ..+ || .+..+ +....+..|+....++++ ..|+. T Consensus 1 m~------------~~~~~~~~~~--~~~~~~-~~~-~~-----~p~~~-----~~~~~~~~~~~~~~~~~~~~~~pP~~ 54 (337) T protein:vir:78 1 MT------------KRQQQPAQAA--ASSPRP-SVV-FS-----MPEAI-----DPTAWMTDYTGVFYNPYGEYYQPPID 54 (337) T ss_pred CC------------CcccCccccc--ccCcee-EEE-ec-----Ccccc-----cCcchhHhhhhhhhccCcceecCCCC Confidence 22 1222222111 111111 111 11 11111 111124444444443333 23332 Q ss_pred EEEEcCCCceeccccchHHHHHhhhcCcccCHH----HHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCccee Q lcl|NC_010576. 78 HLKIDPISGNQTPMPSGLINVLTRSANIDQTGR----SFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIM 153 (447) Q Consensus 78 ~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~----~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (447) ...- ..... 
.+.....+|..+||+.|+.+ ++++.++.+++++||||+++.++..+.+..++++ +....++. T Consensus 55 ~~~L---a~l~~-~~~~h~~~L~~k~N~~~~~f~~~~~~~~~~~~d~ll~GNay~~~~rn~~G~~~~L~pl-~~~~v~~~ 129 (337) T protein:vir:78 55 RKGL---AKVAR-ANAHHGAILMARRNMVAGRFTNQRATITAFVHNYLQFGDGGLLKLRNSFGQVVGLHPL-SSVYLRRR 129 (337) T ss_pred HHHH---HHHhh-cchhhhhHHHhhhccccccCcCcHHHHHHHHHHHHhhCCeEEEEEECCCCcEEEEEEe-CCceeEee Confidence 1000 00000 11223557888999877654 6899999999999999999999887765554444 43332222 Q ss_pred eecCCceEEEEeeecccccceeeecccccccccccc--ccc--ccchhHHHHHHHHHHHHH-HHHHHHhhcCcccceeee Q lcl|NC_010576. 154 QFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIESPF--YAI--LNDTNQTLRMLEQKIKLM-NSQDNRASSGKLNGFIQF 228 (447) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--~~~--~~~~~~~~~~~~~~~~~~-~~~~~~~n~~~~~gvl~~ 228 (447) .++...+ .. .......++.++|+|++.+. .+. ...+...+..+..+.++. .+...+.||+.++|||.. T Consensus 130 --~d~~~~~----~~-~~~~~~~~~~~eIiHik~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~p~~il~~ 202 (337) T protein:vir:78 130 --EDGCFVY----LQ-QGKPNLIYRPDDVIWLAQYDPEQQVYGMPDYLGGLQSALLNQDATLFRRRYFLNGAHMGFIFYA 202 (337) T ss_pred --eCCeEEE----EE-cCCceEEECCccEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEc Confidence 2222211 11 12345678899999999754 222 222223333333333222 223346799999999987 Q ss_pred CC-cCChHHHHHHHHHHHHHHHHH-hccCCcceeec-----CCCceeeecCCChhhh-hHHHHHHHHHHHHHHhCCCHHH Q lcl|NC_010576. 229 PY-STKSTARAAQAARRKQEIENE-MANNKYGVATL-----DTQEKFVSAGMGLQNN-LLSDVRQLQQDFYNQMGITEAI 300 (447) Q Consensus 229 ~~-~~~~~~~~~~~~~~~~~~~~~-~~~n~~~~~vl-----~~g~~~~~l~~~~~~~-~l~~~~~~~~~Ia~~fgVP~~~ 300 (447) +. .+++++. +++++.|++. +.+|.++++++ +.|++|+++++++.+. 
+++.+++++++||++|||||++ T Consensus 203 ~~~~l~~e~~----~~lk~~~~~~~G~~n~~~~~v~~~~g~~~Gi~~~pis~~~~d~qfle~k~~s~~eIa~a~~VPp~l 278 (337) T protein:vir:78 203 TDPNMDDDTE----EEMKEMIANSKGVGNFRSMFVNIPDGKPDGIKLIPVGDIATKDEFAAIKGITAQDVLTAHRYPPAL 278 (337) T ss_pred CCCCCCHHHH----HHHHHHHHHhcCcccccceEEEcCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHH Confidence 64 4655444 4455555443 23577888887 6889999999998765 5688999999999999999999 Q ss_pred hc-------CC--cHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhh Q lcl|NC_010576. 301 LN-------GT--ANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFK 353 (447) Q Consensus 301 l~-------g~--~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~ 353 (447) || ++ +.|++.+.|+++||.||+++||+++|++|++..++ .+++|+...++ T Consensus 279 lGi~~~~~~~~~~n~e~~~~~f~~~~L~P~~~~ie~~~n~~ll~~~~~---~~f~~~~~~~~ 337 (337) T protein:vir:78 279 AGIIPTNGGGGLGDPEKYDATYARNEVLPLCELVQDAINSAGLPRALW---VTFRETIGAAV 337 (337) T ss_pred cccccCCCcCccccHHHHHHHHHHHHHHHHHHHHHHHHhhhcCChhhc---eeccccccccC Confidence 95 12 25899999999999999999999999999876432 35677777777 No 101 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=100.00 E-value=6e-37 Score=219.17 Aligned_cols=318 Identities=8% Similarity=-0.021 Sum_probs=179.3 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccc--ccccc-ccCCcccccchhhhhhHHHHHHHHHHHHhh--ccCc Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTS--FGGYY-GRGQSNYSRSYSYNKADLIKSVITRIALDA--SMVD 75 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~--~~~~~-~~~~~~~~~~~~~~~~~~v~~cv~~ia~~i--a~lp 75 (447) |.- |..+.. .+...+............+..| ..... 
-.+...+.....+.+.|.-+..+..+.+.- ...+ T Consensus 1 m~~--~~~~~~---~~~~~~~~~~~~~~~~~~f~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~~~ 75 (344) T protein:vir:60 1 MSK--KKGKTL---QPAAKKMTASAPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPISFTGLAKSLRAAVHHSSP 75 (344) T ss_pred CCc--ccCCCC---CchHHhhcCCcCcEEEEEcCCceeecCCcchhHHHHhhhcCccccCCCCHHHHHHHHHhhhhhccc Confidence 221 110000 0000000000000000000000 00000 000001111222333333233322221111 1223 Q ss_pred eEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeee Q lcl|NC_010576. 76 FKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQF 155 (447) Q Consensus 76 ~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (447) ++.. .+.+.. ..+||++||+++| +.++.+++++||||+++.++..+.+.. +++++....++.. T Consensus 76 i~~k------------~n~l~~--~~~Pn~~~t~~~f-~~~~~d~ll~Gnay~~i~rn~~G~~~~-L~~l~~~~vr~~~- 138 (344) T protein:vir:60 76 IYVK------------RNILAS--TFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKVIR-LETSPAKYTRRGV- 138 (344) T ss_pred hhhh------------hhHHHh--hccCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcEEE-EEEcCcceEEEee- Confidence 3211 122333 2479999999999 578899999999999999988776544 4444544333321 Q ss_pred cCCceEEEEeeecccccceeeecccccccccccc--ccc--ccchhHHHHHHHHHHHHH-HHHHHHhhcCcccceeeeCC Q lcl|NC_010576. 156 FPRQVMVRVWNDNTGLEQDLLVSKENCIIIESPF--YAI--LNDTNQTLRMLEQKIKLM-NSQDNRASSGKLNGFIQFPY 230 (447) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--~~~--~~~~~~~~~~~~~~~~~~-~~~~~~~n~~~~~gvl~~~~ 230 (447) .+.. |......+..+.++.++|+|++.+. .+. ...+..++..+..+.++. .+...+.||+.++|||..+. 
T Consensus 139 -~~~~----~~~v~~~~~~~~~~~~eIiHir~~~~~~~~yGlsp~~~a~~si~l~~~a~~~~~~~f~NG~~pg~il~~~~ 213 (344) T protein:vir:60 139 -EEDV----YWWVPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTD 213 (344) T ss_pred -cCCe----EEEEccCCeEEEEcCccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC Confidence 1111 1122334455678899999999653 221 222223333333332222 33444678999999998754 Q ss_pred -cCChHHHHHHHHHHHHHHHHHhccCCcceeec------CCCceeeecCCChhhh-hHHHHHHHHHHHHHHhCCCHHHhc Q lcl|NC_010576. 231 -STKSTARAAQAARRKQEIENEMANNKYGVATL------DTQEKFVSAGMGLQNN-LLSDVRQLQQDFYNQMGITEAILN 302 (447) Q Consensus 231 -~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl------~~g~~~~~l~~~~~~~-~l~~~~~~~~~Ia~~fgVP~~~l~ 302 (447) .++++ +.++++++|++....++++.++| ++|++|++++.++.+. +++.+++++++||++|||||++|| T Consensus 214 ~~ls~e----~~~~ik~~~~~~~g~~~~r~~~l~~p~g~~~g~~~~pis~~~~d~qf~e~k~~~~~eIa~af~VPp~llG 289 (344) T protein:vir:60 214 AVQDRN----DIEMLRENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMG 289 (344) T ss_pred cCCCHH----HHHHHHHHHHHhcCCCCCcceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhc Confidence 46554 44556666665555566777776 5799999999988664 578899999999999999999996 Q ss_pred C--------CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCH Q lcl|NC_010576. 303 G--------TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPV 357 (447) Q Consensus 303 g--------~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~ 357 (447) - .+.|++.+.|+++||.||+++|| +||.+|- ...++|+...|...|. 
T Consensus 290 i~~~~t~~~~n~e~~~~~f~~~~L~Pl~~~~e-~ln~~lg-------~~~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 290 GKPENVGSLGDIEKVAKVFVRNELIPLQDRIR-EINGWLG-------QEVIRFKNYSLDTDNG 344 (344) T ss_pred ccCCCCCccccHHHHHHHHHHHHHHHHHHHHH-HHHHhcC-------CcccccCccccCCCCC Confidence 1 23489999999999999999998 5888773 2347788888887776 No 102 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=100.00 E-value=2e-36 Score=216.28 Aligned_cols=309 Identities=8% Similarity=0.003 Sum_probs=179.0 Q ss_pred CchhHhhhhhcccccCCccccc----cccccccccccccccccccc--------cCCcccccchhhhhhHHHHHHHHHHH Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQ----NTNDFLTPSNGMTSFGGYYG--------RGQSNYSRSYSYNKADLIKSVITRIA 68 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~----~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~v~~cv~~ia 68 (447) |+- ++..+.. ........ .-..++ |... .+...+.....+.+.|+=+..+..+. T Consensus 1 ~~~------------~~~~~~~~~~~~~~~~~~~-~~~~~~-~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~ 66 (344) T protein:vir:56 1 MSK------------KKGKTPQPAAKTMTASAPK-MEAFTF-GEPVPVLDRRDILDYVECISNGRWYEPPVSFTGLAKSL 66 (344) T ss_pred CCC------------CCCCCCchhhHHhhcCCCc-eEEEEc-CCceeecCcchhhhHHHhhhcCccccCCCCHHHHHHHH Confidence 222 1111100 00000000 000111 1100 00000111122333333333333332 Q ss_pred Hhhc--cCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeecc Q lcl|NC_010576. 69 LDAS--MVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINT 146 (447) Q Consensus 69 ~~ia--~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~ 146 (447) +.-+ +-+++.. .+.+.. ..+|||+||+.+| +.++.+++++||||+++.++..+.+.. 
+++.+ T Consensus 67 ~a~~~h~s~i~~k------------~n~l~~--~~~Pnp~~t~~~f-~~~~~d~ll~Gnay~~~~rn~~G~~~~-L~pl~ 130 (344) T protein:vir:56 67 RAAVHHSSPIYVK------------RNILAS--TFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKVIR-LETSP 130 (344) T ss_pred hhhhhhCccceeh------------hhhHHh--hcCCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcEEE-EEEeC Confidence 2222 2234321 122333 2479999999999 567889999999999999988776554 44444 Q ss_pred CCCcceeeecCCceEEEEeeecccccceeeecccccccccccc--ccc--ccchhHHHHHHHHHHHHH-HHHHHHhhcCc Q lcl|NC_010576. 147 ARVGKIMQFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIESPF--YAI--LNDTNQTLRMLEQKIKLM-NSQDNRASSGK 221 (447) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--~~~--~~~~~~~~~~~~~~~~~~-~~~~~~~n~~~ 221 (447) ....++.. .+.. |......+..+.+++++|+|++.+. .+. ...+...+..+..+.++. .+...|.||+. T Consensus 131 ~~~v~~~~--~~~~----~~~~~~~g~~~~~~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~ 204 (344) T protein:vir:56 131 AKYTRRGV--EEDV----YWWVPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAH 204 (344) T ss_pred CceeEEee--cCCE----EEEEecCCeEEEEcCccEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 43333221 1111 1122334556678899999999653 222 222222333333332222 23345678999 Q ss_pred ccceeeeCC-cCChHHHHHHHHHHHHHHHHHhccCCcceeec------CCCceeeecCCChhhh-hHHHHHHHHHHHHHH Q lcl|NC_010576. 222 LNGFIQFPY-STKSTARAAQAARRKQEIENEMANNKYGVATL------DTQEKFVSAGMGLQNN-LLSDVRQLQQDFYNQ 293 (447) Q Consensus 222 ~~gvl~~~~-~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl------~~g~~~~~l~~~~~~~-~l~~~~~~~~~Ia~~ 293 (447) ++|||..+. .++++ ++++++++|.+....++++.++| ++|++|++++.++.+. 
+++.+++++++||++ T Consensus 205 pg~Il~~~d~~ls~e----~~~~lk~~~~~~~g~~~~r~l~l~~p~g~~~G~~~~pis~~~~d~qf~e~k~~s~~eIa~a 280 (344) T protein:vir:56 205 AGYIMYVTDAVQDRN----DIEMLRENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDA 280 (344) T ss_pred CceEEEecCCCCCHH----HHHHHHHHHHHhcCCCCccceEEecCCCCccceeEEEcCCChHHHHHHHHHHhhHHHHHHH Confidence 999998764 46554 45566666666555577888887 5799999999988764 578899999999999 Q ss_pred hCCCHHHhcC--------CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCH Q lcl|NC_010576. 294 MGITEAILNG--------TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPV 357 (447) Q Consensus 294 fgVP~~~l~g--------~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~ 357 (447) |||||+++|. .+.|++.+.|+++||.||++.||+ +|.+|..+ .++|+--.|...|. T Consensus 281 frVPp~llGi~~~~t~~~~n~eq~~~~f~~~tL~Pl~~~ie~-~n~~l~~~-------~~~F~~y~l~~~~~ 344 (344) T protein:vir:56 281 HRIPFQLMGGKPENVGSLGDIEKVAKVFVRNELIPLQDRIRE-INGWIGQE-------VIRFKNYSLDTDNG 344 (344) T ss_pred hCCCHHHhccCCCCCCccccHHHHHHHHHHHHHHHHHHHHHH-HHhhhccc-------cccCCCccccccCC Confidence 9999999961 235899999999999999999995 77787532 35666556655554 No 103 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=100.00 E-value=5.7e-36 Score=213.82 Aligned_cols=316 Identities=7% Similarity=-0.039 Sum_probs=178.2 Q ss_pred CchhHhhhhhcccccCCccccccccc-c-cccccccc--cccccc-ccCCcccccchhhhhhHHHHHHHHHHH--Hhhcc Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTND-F-LTPSNGMT--SFGGYY-GRGQSNYSRSYSYNKADLIKSVITRIA--LDASM 73 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~-~-~~~~~~~~--~~~~~~-~~~~~~~~~~~~~~~~~~v~~cv~~ia--~~ia~ 73 (447) |+--.. . .+.......... . .....+.. +..+.. -.+.........+.+.|+=+..+..+- +.... 
T Consensus 1 ~~~~~~----~---~~~~~~~~~~~~~~~~~~~~f~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~ 73 (344) T protein:vir:20 1 MSKKKG----K---TPQPAAKTMTASGPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPVSFTGLAKSLRAAVHHS 73 (344) T ss_pred CCcccC----C---CCcchhhhhhccCCceEEEEcCCceEecCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhhhhC Confidence 332100 0 000000000000 0 00000000 000000 000001111122333333333322221 11122 Q ss_pred CceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCccee Q lcl|NC_010576. 74 VDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIM 153 (447) Q Consensus 74 lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (447) .+++.. .+-+... .+||++||+++| +.++.+++++||||+++.++..+.+.. +++.+....++. T Consensus 74 ~~i~~k------------~n~l~~~--~~Pn~~lt~~~f-~~~~~d~ll~Gnay~~i~rn~~G~~~~-L~pl~~~~vr~~ 137 (344) T protein:vir:20 74 SPIYVK------------RNILAST--FIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKVIR-LETSPAKYTRRG 137 (344) T ss_pred ccceeh------------hhhHHHh--ccCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcEEE-EEEcCCceeEee Confidence 233321 1223332 479999999999 577899999999999999987775544 444444333322 Q ss_pred eecCCceEEEEeeecccccceeeecccccccccccc--cc--cccchhHHHHHHHHHHHHH-HHHHHHhhcCcccceeee Q lcl|NC_010576. 154 QFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIESPF--YA--ILNDTNQTLRMLEQKIKLM-NSQDNRASSGKLNGFIQF 228 (447) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--~~--~~~~~~~~~~~~~~~~~~~-~~~~~~~n~~~~~gvl~~ 228 (447) . .+.. |......+..+.++.++|+|++.+. .+ +...+..++..+....++. 
.+...+.||+.++|||.+ T Consensus 138 ~--~~~~----~~~~~~~~~~~~~~~~eIiHir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~p~~Il~~ 211 (344) T protein:vir:20 138 V--EEDV----YWWVPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYV 211 (344) T ss_pred e--cCCE----EEEEccCCeEEEEcCccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEe Confidence 1 1111 1122334455678899999999653 22 1222223333333332222 333446799999999987 Q ss_pred C-CcCChHHHHHHHHHHHHHHHHHhccCCcceeec------CCCceeeecCCChhhh-hHHHHHHHHHHHHHHhCCCHHH Q lcl|NC_010576. 229 P-YSTKSTARAAQAARRKQEIENEMANNKYGVATL------DTQEKFVSAGMGLQNN-LLSDVRQLQQDFYNQMGITEAI 300 (447) Q Consensus 229 ~-~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl------~~g~~~~~l~~~~~~~-~l~~~~~~~~~Ia~~fgVP~~~ 300 (447) + ..++++ ++++++++|.+....++++.++| ++|++|+|++.++.+. +++.+++++++||++|||||++ T Consensus 212 ~d~~l~~e----~~~~ik~~~~~~~g~~n~r~l~l~~p~g~~~gi~~~pis~~~~d~qf~e~k~~s~~eIa~af~VPp~l 287 (344) T protein:vir:20 212 TDAVQDRN----DIEMLRENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQL 287 (344) T ss_pred cCcCCCHH----HHHHHHHHHHHhcCCCCccceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHH Confidence 5 345554 44556666665555566777776 5799999999988664 5788999999999999999999 Q ss_pred hcC--------CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCH Q lcl|NC_010576. 301 LNG--------TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPV 357 (447) Q Consensus 301 l~g--------~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~ 357 (447) +|. .+.|++.+.|+++||.||++.|| ++|.+|-. ..++|+...|...|. 
T Consensus 288 lGi~~~~t~~~~n~e~~~~~f~~~~l~P~~~~~e-~in~~lg~-------~~i~F~~~~l~~~d~ 344 (344) T protein:vir:20 288 MGGKPENVGSLGDIEKVAKVFVRNELIPLQDRIR-EINGWLGQ-------EVIRFKNYSLDTDND 344 (344) T ss_pred hccCCCCCCccccHHHHHHHHHHHHHHHHHHHHH-HHHHhcCC-------cccccCccccccCCC Confidence 961 23489999999999999999998 57777632 347788888877775 No 104 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=100.00 E-value=3e-36 Score=215.33 Aligned_cols=313 Identities=7% Similarity=-0.048 Sum_probs=172.3 Q ss_pred CchhHhhhhhccc----ccCCccccccccccccccccccccccccccCCcccccch------------hhhhhHHHHHHH Q lcl|NC_010576. 1 MASSDRLLHSWNA----FQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSY------------SYNKADLIKSVI 64 (447) Q Consensus 1 Mg~~~~l~~~~~~----f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~v~~cv 64 (447) |+- .++.-.. ...... .+......+... ..+ ||.. ..+.+.+ .+.+.|+-+..+ T Consensus 1 m~~---~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-~~~-~~~p----~~v~~~~~~~~y~~~~~~~~~~~pp~~~~~l 70 (350) T protein:vir:11 1 MSK---RRSHRRQQPVTVQSAQE-GEFIPRQGGRAE-AFT-FGDP----MPVLDGRGILDYLECWPNGRWYEPPLSMEGL 70 (350) T ss_pred CCc---cccCCCcCccccCCcch-hhhccccccceE-EEE-eCCc----eeecCcchhhHHHHHhhcCccccCCCCHHHH Confidence 322 1110000 000000 000000000000 001 1110 0011111 122222222221 Q ss_pred HHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeee Q lcl|NC_010576. 65 TRIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDI 144 (447) Q Consensus 65 ~~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~ 144 (447) |+.+..-+-+ +.......+.+.. 
..+||++||+++|++ ++.+++++||||+++.++..+.+..+.++ T Consensus 71 ---a~~~~~~~~h-------~~~l~~k~n~l~~--~~~Pn~~~t~~~f~~-~v~d~ll~Gnay~~~~rn~~G~~~~L~~l 137 (350) T protein:vir:11 71 ---AKSVGSSVYL-------QSGLKFKRNMLAK--TFIPHRLLSRATFEQ-FSLDWLTFGSAYLEQPRSRLGTRMPLQAP 137 (350) T ss_pred ---HHHHhhhhhh-------ccchhhhhhhhhh--cccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCCEEEEEEe Confidence 2221111110 0000001111221 248999999999986 56799999999999999887755544444 Q ss_pred ccCCCcceeeecCCceEEEEeeecccccceeeecccccccccccc--ccc--ccchhHHHHHHHHHHHHH-HHHHHHhhc Q lcl|NC_010576. 145 NTARVGKIMQFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIESPF--YAI--LNDTNQTLRMLEQKIKLM-NSQDNRASS 219 (447) Q Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--~~~--~~~~~~~~~~~~~~~~~~-~~~~~~~n~ 219 (447) +....++.. .+... | .....+....++..+|+|++.+. ... .......+..+....++. .....+.|| T Consensus 138 -~~~~vr~~~--~~~~~---~-~~~~~~~~~~~~~~eVihir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NG 210 (350) T protein:vir:11 138 -LAKYMRRGT--DLETF---Y-QVRSWKDEHEFEKGSVIQLREADINQEIYGVPEWFCALQSALLNESATLFRRKYYNNG 210 (350) T ss_pred -CCceeEeee--cCCeE---E-EEeeCCeEEEECcccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 433322221 12211 1 11223445678899999999653 221 222223333333332222 233446789 Q ss_pred CcccceeeeCC-cCChHHHHHHHHHHHHHHHHH-hccCCcceeec-----CCCceeeecCCChhhh-hHHHHHHHHHHHH Q lcl|NC_010576. 220 GKLNGFIQFPY-STKSTARAAQAARRKQEIENE-MANNKYGVATL-----DTQEKFVSAGMGLQNN-LLSDVRQLQQDFY 291 (447) Q Consensus 220 ~~~~gvl~~~~-~~~~~~~~~~~~~~~~~~~~~-~~~n~~~~~vl-----~~g~~~~~l~~~~~~~-~l~~~~~~~~~Ia 291 (447) +.++|||..+. .+++++.++ +++.|++. +.+|+++++++ +.|++|+|++.++.+. 
+++.+++++++|| T Consensus 211 a~~~gil~~~~~~ls~e~~~~----l~~~~~~~~G~~N~~~~~v~~~~g~~~g~~~~pl~~~~~d~qf~e~k~~~~~eIa 286 (350) T protein:vir:11 211 SHAGFILYMTDAAQNEEDIDA----LRTALKTAKGPGNFRNLFVYAPNGKKEGIQLIPVSEVAAKDEFGSIKNISRDDQL 286 (350) T ss_pred CCCceEEEecCCCCCHHHHHH----HHHHHHHhcCccccCceeeecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHH Confidence 99999998864 566555444 44555443 34588888887 5689999999998765 5788899999999 Q ss_pred HHhCCCHHHhcC--------CcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchh Q lcl|NC_010576. 292 NQMGITEAILNG--------TANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPF 352 (447) Q Consensus 292 ~~fgVP~~~l~g--------~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l 352 (447) ++|||||++||. ++.|++.+.|+++||.||++.||+ +|.+|..+ ...+.+|++++| T Consensus 287 ~a~~VPp~llGi~~~~t~~~sn~e~~~~~f~~~~L~P~~~~ie~-ln~~l~~~----~~~F~~~~~~~l 350 (350) T protein:vir:11 287 AGLRVYPQLMGVVPQNAGGFGSISDAAAVWASLELAPMQTRLQQ-VNEMIGEE----VVRFAQFDAPGL 350 (350) T ss_pred HHhCCCHHHhcccCCCCCCcCCHHHHHHHHHHHHHHHHHHHHHH-HHhhcCcc----ccccCcccccCC Confidence 999999999961 234899999999999999999995 88888643 223557888888 No 105 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=99.96 E-value=4.2e-31 Score=187.12 Aligned_cols=195 Identities=9% Similarity=-0.056 Sum_probs=128.1 Q ss_pred CCCcceeeecCCceEEEEe-eecccccceeeecccccccccccc--cccccchhHHHHHHHHHHHHHH-----HHHHHhh Q lcl|NC_010576. 147 ARVGKIMQFFPRQVMVRVW-NDNTGLEQDLLVSKENCIIIESPF--YAILNDTNQTLRMLEQKIKLMN-----SQDNRAS 218 (447) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~v~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~n 218 (447) .+ .-.++...+... ......+....+++++|+|++.+. .+..+ . +.+..+..++.... 
+...|.| T Consensus 1 ~r-----~~~dg~~~y~~~~~~~~~~g~~~~~~~~eilH~r~~~~~~~~~G-l-spi~~a~~~i~~~~aa~~~~~~~f~N 73 (219) T protein:vir:98 1 MR-----VCKDGNYKYLMKKSLYDTKSEIYEYNKNDVIFIKLYDPMQQVYG-S-PDYVGGITSALLNSDATIFRRRYYSN 73 (219) T ss_pred Cc-----eeecCeEEEEEecceecCCceeEEeccccEEEecCCCCCCCcce-e-cHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 11 111122222211 111223445678899999999654 22211 1 12222223333222 2234679 Q ss_pred cCcccceeeeCC-cCChHHHHHHHHHHHHHHHHHh-ccCCcceeec-----CCCceeeecCCChhhh-hHHHHHHHHHHH Q lcl|NC_010576. 219 SGKLNGFIQFPY-STKSTARAAQAARRKQEIENEM-ANNKYGVATL-----DTQEKFVSAGMGLQNN-LLSDVRQLQQDF 290 (447) Q Consensus 219 ~~~~~gvl~~~~-~~~~~~~~~~~~~~~~~~~~~~-~~n~~~~~vl-----~~g~~~~~l~~~~~~~-~l~~~~~~~~~I 290 (447) |++|+|||+++. .+++++ ++++++.|++.. .+|+++++++ +.|++|++++++++++ +++.+++++++| T Consensus 74 g~~p~gil~~~~~~l~~e~----~~~~~~~~~~~~g~~n~~~~~l~~~gg~~~G~~~~~~~~~~~d~qfle~rk~~~~eI 149 (219) T protein:vir:98 74 GAHMGFILYSTDPDMTEEM----EDEIAERIRDSKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGQKDEFANIKNISAQDV 149 (219) T ss_pred CCCCceEEEeCCCCCCHHH----HHHHHHHHHHhcCcccccceeEecCCCCccceeEEEccCCHHHHHHHHHHHhhHHHH Confidence 999999998765 566554 445555565432 3466666665 5799999999999775 578899999999 Q ss_pred HHHhCCCHHHhc--------CCcHHHHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcC Q lcl|NC_010576. 
291 YNQMGITEAILN--------GTANEQQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVP 356 (447) Q Consensus 291 a~~fgVP~~~l~--------g~~~e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d 356 (447) |++|||||++|| +++.|++.+.|+++||.||+++||++||++++.+ .+.+++|+.+.+.-.+ T Consensus 150 a~~fgVPp~~lG~~~~~~~~~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~~~~~----~~~~~~F~~~~~~d~~ 219 (219) T protein:vir:98 150 LTSHRFPPGLSGIIPVNTAGLGDPLKIREAYQADEVLPLQEIIAESINSDYEIK----SALKVNFKQPEKRDKN 219 (219) T ss_pred HHHhCCCHHHcccccCCCCCccCHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCC----CccEEeecCcccccCC Confidence 999999999985 2346999999999999999999999999986543 4568888866655444 No 106 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.79 E-value=5.5e-19 Score=120.68 Aligned_cols=385 Identities=11% Similarity=0.050 Sum_probs=179.2 Q ss_pred CchhHhhhhhcc-cccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEE Q lcl|NC_010576. 1 MASSDRLLHSWN-AFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHL 79 (447) Q Consensus 1 Mg~~~~l~~~~~-~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~ 79 (447) |-+.|.+.++.. .-.++ ... |+.. +. ........-...|.+++.++.+|+++|+++-+-++++- T Consensus 1 ~~~~D~~~~~~~~~g~~~---~~~---------~~~~--~~-~~~~~~~~l~a~Y~~~~l~~~~vd~~a~d~~r~~~~i~ 65 (437) T protein:vir:52 1 MKFFDGIKSLALKLGSKQ---EQT---------YYSP--SL-SLTDDLVQLEALWRDNWIANKVCIKRPEDMVRNWREIY 65 (437) T ss_pred CchhhhhHhHHhcCCCcc---ccc---------eeec--Cc-cccccHHHHHHHHHhCchhhHHhhcchHHhhcCCceEe Confidence 888887666432 11111 000 1100 00 00011111124577899999999999999999998753 Q ss_pred EEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHH---HHHHHhcCCeeEEEeeccCCcc--------cceeeecc-C Q lcl|NC_010576. 
80 KIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDL---LYSLLDEGQIAMVPIDTTVDPD--------SGSFDINT-A 147 (447) Q Consensus 80 r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~---~~~lll~Gna~i~~~~~~~~~~--------~~~~~~~~-~ 147 (447) . ++... ..-..+...+. ...+|+.+ +.+..++|.|++++..++..+. ...+.+.+ + T Consensus 66 -~--~d~~~-~~~~~~~~~~~--------~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~ 133 (437) T protein:vir:52 66 -S--NDLNS-KQLDLFTKFER--------SLKLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPKW 133 (437) T ss_pred -c--CCCCH-HHHHHHHHHHH--------hhcHHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEechh Confidence 1 11100 00011222221 11234443 3444588999999877653211 11122222 1 Q ss_pred CCcceeee-----cCCceEEEEeeecccccceeeecccccccccc---cccccccchhHHHHHHHHHHHH----HHHHHH Q lcl|NC_010576. 148 RVGKIMQF-----FPRQVMVRVWNDNTGLEQDLLVSKENCIIIES---PFYAILNDTNQTLRMLEQKIKL----MNSQDN 215 (447) Q Consensus 148 ~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~---~~~~~~~~~~~~~~~~~~~~~~----~~~~~~ 215 (447) .+...... .++-.....|....+ ...+.++++.++|+.+ |...-...+.+.++.+...+.. ...... T Consensus 134 ~v~~~~~~~~dp~s~~fg~p~~y~v~~~-~~~~~iH~SRii~~~~~~~~~~~~~~~G~s~le~~~~~i~~~~~~~~~~~~ 212 (437) T protein:vir:52 134 KISPTGTKDDDVLSPNFGRYSEYSILGG-SQSITVHHSRLIILNANDAPLSDNDIWGVSDLEKIIDVLKRFDSASVNVGD 212 (437) T ss_pred hccccccccccccccccCcceEEEEecC-CcceeEccceeEEecCccCCCccccccCCchHHHHHHHHHHHHHHHHHHHH Confidence 11111111 111112223333322 2334566788999863 2111111122333333333222 221111 Q ss_pred HhhcCcccceeeeCC---cCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhhHHHHHHHHHHHHH Q lcl|NC_010576. 216 RASSGKLNGFIQFPY---STKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNLLSDVRQLQQDFYN 292 (447) Q Consensus 216 ~~n~~~~~gvl~~~~---~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~l~~~~~~~~~Ia~ 292 (447) ........ ++++++ .+.... ++...++.+.+ +.+. +.+++++++.+.+|++++.++.+.. +.+.+...+||. 
T Consensus 213 l~~~~~~~-v~k~~~l~~~l~~~~-~~~~~~~~~~~-~~~~-~~~~~~~~d~~~~~e~~~~~~sgl~-~~l~~~~~~iaa 287 (437) T protein:vir:52 213 LIFESKID-IFKIAGLSDKIAAGM-ENEVASVISAV-QEIK-SATNSLLLDAENEYDRKELTFTGLK-DLLTEFRNAVAG 287 (437) T ss_pred HHHHcCCC-ceecchHHHHhcCCc-HHHHHHHHHHH-HHhc-CCCceEEEcCCcceEEEecCcCCHH-HHHHHHHHHHHH Confidence 11111111 344432 122211 22222233332 2233 4578899999999999987755422 334566779999 Q ss_pred HhCCCHHHhcCC-----c-HHHHHHHHHH-------HHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHH Q lcl|NC_010576. 293 QMGITEAILNGT-----A-NEQQTLGYYN-------RCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQ 359 (447) Q Consensus 293 ~fgVP~~~l~g~-----~-~e~~~~~f~~-------~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~ 359 (447) +++||..+|.|. + .+.....||. .-|.|+++.+-+.|-+..+.. ...+ +.|.+++|...|.++ T Consensus 288 a~~iP~t~L~G~s~~Glasge~D~~~yyd~i~~~Qe~~l~p~le~l~~~i~~~~~g~--~~~~--~~~~f~pL~~~s~ke 363 (437) T protein:vir:52 288 AADMPVTILFGQSVSGLASGDEDIQNYHEAIRRLQETRLRPIFEIIDPLICNELFGG--LPAD--WWFEFVPLTTVKQEQ 363 (437) T ss_pred HhcCchhhhcCcCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CCCc--ceEEeCCcCCcCHHH Confidence 999999998653 2 2445566654 457788887777776666543 2333 444455787777554 Q ss_pred -------HHHHHHHHHhCCCcCHHHHHHHhC----CCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCC Q lcl|NC_010576. 360 -------LATVADVLTRNAIYTPNEIRELTG----KAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPL 428 (447) Q Consensus 360 -------~~~~~~~~~~~G~~t~NE~R~~~g----l~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (447) +++++.+++++|+++++|+|+++. ++.++... +. ......+.+ .+....++.++ T Consensus 364 kae~~~~~a~a~~~~~~~g~i~~~e~r~~L~~~g~~~~i~~~~----~~-~~~~~~~~~---~~~~~~~~~~~------- 428 (437) T protein:vir:52 364 QINMLNTFATAANTLIQNGVLNEYQIANELRESGLFANISAEH----IE-ELKNADEFA---GNFEEPEKMEG------- 428 (437) T ss_pred HHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCCCCccc----cc-cccCCCCCC---CccCCCCCCCC------- Confidence 445688899999999999999873 33332210 00 000000000 00000000000 Q ss_pred CcccccccC Q lcl|NC_010576. 
429 NNVSTSAIE 437 (447) Q Consensus 429 ~~~~~~~~~ 437 (447) ....+++.+ T Consensus 429 ~~~~~~~~~ 437 (437) T protein:vir:52 429 AQVQNSEDQ 437 (437) T ss_pred CCCCCCCCC Confidence 001111111 No 107 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.78 E-value=1.3e-18 Score=118.71 Aligned_cols=418 Identities=11% Similarity=0.039 Sum_probs=182.6 Q ss_pred CchhHh---hh------hhcc-cccCCcccccccc--cc---ccccccc---ccccccc--------------ccCCccc Q lcl|NC_010576. 1 MASSDR---LL------HSWN-AFQSNQNQNQNTN--DF---LTPSNGM---TSFGGYY--------------GRGQSNY 48 (447) Q Consensus 1 Mg~~~~---l~------~~~~-~f~~~~~~~~~~~--~~---~~~~~~~---~~~~~~~--------------~~~~~~~ 48 (447) ||+|.+ .+ ...+ ....+.+....+. +. ....... .+.++.. ...-..+ T Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 104 (537) T protein:vir:10 25 VGIFGAGDDEKPFTRAQLVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTFSAYANPNLSEGLVLWYAQQAFIGH 104 (537) T ss_pred cCCCcccchhhHHHHHHhhhhccCCCCCccCcccccccccccchhccccccchhhhhhhccccccchhhhhccccCCccH Confidence 888753 11 1111 0011111000000 00 0000000 0000000 0000111 Q ss_pred ccchhhhhhHHHHHHHHHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeE Q lcl|NC_010576. 49 SRSYSYNKADLIKSVITRIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAM 128 (447) Q Consensus 49 ~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i 128 (447) --...|.+++.++.+|+++|+++.+-++++-- .++.. .+......|...-+.+.....|.+.+.+.. 
|+|.+++ T Consensus 105 ~l~a~Y~~~~l~r~iVd~~A~d~~r~~~~i~~---~~~~~--~~~~~~~~l~~~~~~l~~~~~l~~a~~~~r-lyG~~~i 178 (537) T protein:vir:10 105 QMCALIATHWLVNKACSQMPRDAMRKGYKIIS---DDGNE--LDPKDAKFIDRYDRAFNIKKHAIQFVRKGR-IFGIRIA 178 (537) T ss_pred HHHHHHHhCchhhhhhhhhhHHhhcCCceeec---CCccc--ccHHHHHHHHHHHHHhhHHHHHHHHHHhcc-cccceEE Confidence 11234678899999999999999998887531 11111 122333444444444444445555544444 5688887 Q ss_pred EEeeccCCcc-------cc--------ee-eeccCCCccee-e-ecCCc-----eEEEEeeecccccceeeecccccccc Q lcl|NC_010576. 129 VPIDTTVDPD-------SG--------SF-DINTARVGKIM-Q-FFPRQ-----VMVRVWNDNTGLEQDLLVSKENCIII 185 (447) Q Consensus 129 ~~~~~~~~~~-------~~--------~~-~~~~~~~~~~~-~-~~~~~-----~~~~~~~~~~~~~~~~~~~~~~v~~~ 185 (447) ++.-....+. .. .+ .+.+..+.... . +..+. .....|... + ..++++.++|+ T Consensus 179 ~i~v~~~D~~~~~~Pl~~~~i~kg~~k~l~vidp~~~~~~~~~~~~~dp~sp~fg~P~~y~v~---g--~~iH~SRli~f 253 (537) T protein:vir:10 179 LFKVDSPDPYYYEKPFNIDGVMPGAYKGIVQIDPYWCAPLLDAQASSNPVSMHFYEPTYWLIN---G--KKYHRSHLAIY 253 (537) T ss_pred EEeecCcCCcccccccccccccccceeEEEEechhhcccccchhhhccCCccccCCceeeeec---C--eEecceeEEEe Confidence 7543211100 00 01 11111111000 0 00000 011122111 1 13345666776 Q ss_pred ccc--------ccccccchhHHHHHHHHHHHHH----HHHHHHhhcCcccceeeeCC--cCChHHHHHHHHHHHHHHHHH Q lcl|NC_010576. 186 ESP--------FYAILNDTNQTLRMLEQKIKLM----NSQDNRASSGKLNGFIQFPY--STKSTARAAQAARRKQEIENE 251 (447) Q Consensus 186 ~~~--------~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~n~~~~~gvl~~~~--~~~~~~~~~~~~~~~~~~~~~ 251 (447) .+. .++. .+.+.++.+...+... ............. +++++. 
.+..+ +...+..+.+ ++ T Consensus 254 ~g~~~p~~~~~~~~~--~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~-v~k~~~~~~l~~~---~~~~~r~~~~-~~ 326 (537) T protein:vir:10 254 INDEVVDFLKPSYIY--GGVPLPQQIMERVYAAERTANEGPMLAMTKRQT-VLKVDAAQVLANK---QQFDETMSWW-TA 326 (537) T ss_pred cCCCCchhhhcccCc--ccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCc-eeeechHHhhcCH---HHHHHHHHHH-Hh Confidence 431 1111 1123333333332222 1111111111111 333332 22221 2222222233 23 Q ss_pred hccCCcceeecCC-CceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCC------c-HHHHHHHHH------HHH Q lcl|NC_010576. 252 MANNKYGVATLDT-QEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNGT------A-NEQQTLGYY------NRC 317 (447) Q Consensus 252 ~~~n~~~~~vl~~-g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~------~-~e~~~~~f~------~~t 317 (447) +.+| .++++++. +.+|++++.+..... +.+....+.||.+.|||..+|-|. + .+.....|| +.. T Consensus 327 ~r~n-~g~~~id~e~e~~e~~~~~lsgl~-~~l~~~~~~iAa~~~IP~t~L~G~sp~GlnatGe~D~~~yyd~I~~~Qe~ 404 (537) T protein:vir:10 327 TRDN-YQVRVVDKDNEDVVQIDTTLNDLD-KVIMNQYQLVCAIARTPAPKMLGTVPTGFNSTGDYEEASYHEECESTQDD 404 (537) T ss_pred hcCC-cceeEecCCCceeEEEeccCCCHH-HHHHHHHHHHHhhhCCCceeeccCCccccccchhHHHHHHHHHHHHHHHH Confidence 4444 45566776 588988876655421 234566677999999999977432 1 133434443 335 Q ss_pred HhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHH-------HHHHHhCCCcCHHHHHHHhCCCCCCCc Q lcl|NC_010576. 318 VDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATV-------ADVLTRNAIYTPNEIRELTGKAPHPNP 390 (447) Q Consensus 318 i~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~-------~~~~~~~G~~t~NE~R~~~gl~p~~g~ 390 (447) |.|.++.+.+.+-+..+.+ -..|.|.+++|...|.+++++. +.+++++|++++||+|+.++..|..|. 
T Consensus 405 l~p~l~~l~~ll~~~~~~~-----~~~~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~G~i~~~Evr~~L~~~~~~g~ 479 (537) T protein:vir:10 405 MRPLIDRHHQLVCRSHLRK-----RIRVKVEFPPMDAPKESERADTFLKKMQAAKLAFEMGAVDGVDVNEYLRMDPTLGF 479 (537) T ss_pred HHHHHHHHHHHHHHhcCCC-----CcceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhccCcccc Confidence 7899999888887777653 2346667779998888887764 889999999999999999998876543 Q ss_pred cccc-cccccccchhhcccccCCCCCCCCCCCcCC----CCCCCcccccccCCccCcCC Q lcl|NC_010576. 391 LANE-LFNRNIADGNQVGGINTPGQITSDQPATAS----TDPLNNVSTSAIENGSLTDG 444 (447) Q Consensus 391 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~ 444 (447) .+.. -++..............+..... .++.++ ......+-...-.+|.++++ T Consensus 480 ~~l~~~~~~ed~e~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~ 537 (537) T protein:vir:10 480 TSITPAMRPTDAEDIDVDDEGKPVRIIE-DQPAPSEMFGATSSGESANDPRDSGAAFED 537 (537) T ss_pred ccccCCCChhhhhcccCCccCCcCCCCC-CCCCccccCCCCccccccCCCccCccccCC Confidence 2210 00111111101111111111111 111111 01111111222334555666 No 108 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.73 E-value=3.1e-17 Score=111.10 Aligned_cols=427 Identities=11% Similarity=0.041 Sum_probs=177.6 Q ss_pred CchhHhhhhhcccccC--------Ccccc------cccccccccc-cccccccc------ccccCCcccccchhhhhhHH Q lcl|NC_010576. 1 MASSDRLLHSWNAFQS--------NQNQN------QNTNDFLTPS-NGMTSFGG------YYGRGQSNYSRSYSYNKADL 59 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~--------~~~~~------~~~~~~~~~~-~~~~~~~~------~~~~~~~~~~~~~~~~~~~~ 59 (447) .|.-.|+++=.+.-+. .++.. ....+.+... .+.+.... .....-..+.....|.+++. 
T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l 96 (532) T protein:vir:94 17 LQQAQRVDAKRATHTSLGLATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRNALSFVEATSWPGFPTLALLAQLPE 96 (532) T ss_pred hhhHhhhhhhhhhhhhhhhhhhhhhcccccccccccccccccccccccCcccccccccccccccccchHHHHHHHHcCch Confidence 4444443320000000 00000 0000000000 00000000 00001111112235678899 Q ss_pred HHHHHHHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCccc Q lcl|NC_010576. 60 IKSVITRIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDS 139 (447) Q Consensus 60 v~~cv~~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~ 139 (447) ++.+|+++|+++-+-.+++.- +++. + ........|...-..+.- .+-+..++....++|.+++++.-...+... T Consensus 97 ~r~~Vd~~aed~~r~~~~i~~---~~~~-~-~~~~~~~~i~~~~~~l~v-~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~ 170 (532) T protein:vir:94 97 YRTMHETPADECVRAWGKITC---SSKD-E-LAADKATRITQKLEQYNV-RTLVRTVVIHDQAYGGAHVFPHLKMDGDSV 170 (532) T ss_pred hhhhhccchHHHhhCCceEee---CCcc-c-cchHHHHHHHHHHHhhhH-HHHHHHHHHhhhcccceEEEEEeccCCccc Confidence 999999999999998887532 1111 1 122333333322222211 122223334445788888876543222110 Q ss_pred ce---eeeccCCC-----cceeeecCCceE----------------EEEeeecccccceeeeccccccccccc-cccc-- Q lcl|NC_010576. 140 GS---FDINTARV-----GKIMQFFPRQVM----------------VRVWNDNTGLEQDLLVSKENCIIIESP-FYAI-- 192 (447) Q Consensus 140 ~~---~~~~~~~~-----~~~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~~~~v~~~~~~-~~~~-- 192 (447) .. +.+.+..+ ..+..+.+..+. ...|....+ ..++++.++|+.+. .... 
T Consensus 171 ~~~~p~~l~~~~I~~g~~~~l~vld~~~v~p~~~~~~dp~sp~fg~P~~y~v~~g----~~iH~SRli~f~g~~~p~~~~ 246 (532) T protein:vir:94 171 PADAPLLLSPSFVQRGCLIGFATIEPMWLSPNAYNATDPTLPSFYKPDSWIATSG----KKIHSSRIHTVVGRPVGDMLK 246 (532) T ss_pred cccccccccccccccceeeEEEeechheecccccccccccccccCCceeEEEccC----eeeccceEEEecCCCchhhhc Confidence 00 00000000 011111111111 111111111 13445667777531 1100 Q ss_pred ---ccchhHHHHHHHHHHHH----HHHHHHHhhcCcccceeeeC--CcCChHHHHHHHHHHHHHHHHHhccCCcceeecC Q lcl|NC_010576. 193 ---LNDTNQTLRMLEQKIKL----MNSQDNRASSGKLNGFIQFP--YSTKSTARAAQAARRKQEIENEMANNKYGVATLD 263 (447) Q Consensus 193 ---~~~~~~~~~~~~~~~~~----~~~~~~~~n~~~~~gvl~~~--~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~ 263 (447) ...+.+.++.+...+.. .............. +++++ ..+..+..+...+++ +.+ +.+.+| .++++++ T Consensus 247 ~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~-v~k~~~a~~ls~~~~~~~~~r~-~~~-~~~~~n-~g~~~id 322 (532) T protein:vir:94 247 AAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQFSMT-NLATDMAQLLAPGGAQSLDARL-QLF-NLYRDN-RNIGALD 322 (532) T ss_pred cccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCc-eeeechHHhhcchhHHHHHHHH-HHH-HhhcCC-ccceEEc Confidence 00112333333333222 22221111222222 23332 222222222222222 112 333444 4566677 Q ss_pred C-CceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCC------c-HHHHHHHHHH-------HHHhHHHHHHHHH Q lcl|NC_010576. 264 T-QEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNGT------A-NEQQTLGYYN-------RCVDVLLQYVTDA 328 (447) Q Consensus 264 ~-g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~------~-~e~~~~~f~~-------~ti~P~~~~ie~~ 328 (447) . ..+|++++.+..... +.......+||.+.|||..+|-|. + .+.....||. .-|.|+++.+-+. 
T Consensus 323 ~~~e~~e~~~~~lsgl~-~~l~~~~~~iAaa~~IP~t~LfG~sp~GlnstGe~D~~~yyd~I~s~Qe~~l~p~le~l~~~ 401 (532) T protein:vir:94 323 KGTEEIQQTNTPLSGLD-SLQAQSQEQMAAVSHIPLVKLLGITPNGLNASSDGEIRVWYDFIAGYQATNLTPLMEWIIDL 401 (532) T ss_pred CCCceeEEEecccCCHH-HHHHHHHHHHHhHhCCCeeeeecCCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5 578888876655421 234566778999999999987442 1 1333344443 3478999988888 Q ss_pred HHhhcCChhHhcCCceEEEecchhhhcCHHHHHH-------HHHHHHhCCCcCHHHHHHHhCCCCCCCcccccccccccc Q lcl|NC_010576. 329 ISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLAT-------VADVLTRNAIYTPNEIRELTGKAPHPNPLANELFNRNIA 401 (447) Q Consensus 329 l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~-------~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~ 401 (447) |-+..|.. .....+|+ +++|...|.+++++ ++.+++++|++++||+|++++..|..+..++......+. T Consensus 402 l~~s~~g~--~~~d~~~~--f~pL~~~s~kEkAei~~~~a~a~~~~~~~Gvi~~~Evr~~l~~~~~~~~~~~~~~~~~~~ 477 (532) T protein:vir:94 402 IQLSEYGQ--IDPGLAWE--WSPLMELDDKELAEVRQLNASTDSTLMELGVIDAKMVQQRLAADPTSGYAGALGERDELD 477 (532) T ss_pred HHHHhcCC--CCCCceEE--eCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhcCCccccccccccccccc Confidence 87766643 23444555 45787777776654 568899999999999999999999776543321111111 Q ss_pred ch--hhcccccCCCCCCCCCCCcCCCCCCCccccccc-CCccCcCCCCC Q lcl|NC_010576. 402 DG--NQVGGINTPGQITSDQPATASTDPLNNVSTSAI-ENGSLTDGGSY 447 (447) Q Consensus 402 ~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 447 (447) .. .+........ ......+...+++.+.+.+.+. 
.+....+.|+= T Consensus 478 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~ 525 (532) T protein:vir:94 478 DVEEIAKQLMAAAL-NPPATAPQTPNPQPDSEDDQTDNQPDAQADPAQN 525 (532) T ss_pred cccchhhhhccccc-CCCCCCCCCCCCCCCCCCCCCCCccCCCcccccc Confidence 11 1100000000 0000100000111111111111 12222223322 No 109 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.69 E-value=2.3e-16 Score=106.30 Aligned_cols=416 Identities=13% Similarity=0.027 Sum_probs=166.9 Q ss_pred CchhHhh-------------------------------hhhcccccCCcccccccc-ccccccccccccccccccCCccc Q lcl|NC_010576. 1 MASSDRL-------------------------------LHSWNAFQSNQNQNQNTN-DFLTPSNGMTSFGGYYGRGQSNY 48 (447) Q Consensus 1 Mg~~~~l-------------------------------~~~~~~f~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 48 (447) |...+|. .+........- ..+.. .......|+.. ..-..+ T Consensus 66 ~~~~~~~~~~~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~~--~~s~y~~~~~~~~~~~~------~~f~gy 137 (862) T protein:vir:99 66 VEISDSVNAKSVSGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIGAEG--KQSSYAVPEALQDWYLS------QGFIGH 137 (862) T ss_pred ccccccccchhhhhhhhcchhhcchhhhhhhhhhhhcchhhhhhccccc--cccccccchhccccccc------cCcccH Confidence 3332221 00000000000 00000 00000111100 000111 Q ss_pred ccchhhhhhHHHHHHHHHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeE Q lcl|NC_010576. 49 SRSYSYNKADLIKSVITRIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAM 128 (447) Q Consensus 49 ~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i 128 (447) --...|.+++.++.+|+++++++.+-.+.+- ...++... 
+......|...=..+--...|.+.+ ....|+|.+++ T Consensus 138 ql~alY~~~~larkiVd~pAeDatR~g~~I~-~~~d~~e~---~~e~~~~ie~~~~rL~v~~~l~eai-r~~RLyGga~i 212 (862) T protein:vir:99 138 QACALIAQHWLVDKACSLAGEDAIRNGWHLK-SLGEGEEI---DEESLEKFKAIDVEFKVKENLIEFN-RFKNVFGIRVA 212 (862) T ss_pred HHHHHHHhCchhhhhhhhhhHHHhhCCceEe-ecCccccc---CHHHHHHHHHHHHHhhHHHHHHHHH-HhcccccceEE Confidence 1124577899999999999999999888743 22222111 1122222221111111122333333 33345676666 Q ss_pred EEeeccCCccc---------------ceee-eccCCCccee--eecCCc-----eEEEEeeecccccceeeecccccccc Q lcl|NC_010576. 129 VPIDTTVDPDS---------------GSFD-INTARVGKIM--QFFPRQ-----VMVRVWNDNTGLEQDLLVSKENCIII 185 (447) Q Consensus 129 ~~~~~~~~~~~---------------~~~~-~~~~~~~~~~--~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~v~~~ 185 (447) ++.-....+.. ..+. +.+..+.... .+..+. .....|... +. .++++-++|+ T Consensus 213 lilv~~~D~~~LsqPLn~e~I~kG~lkgl~vlDp~w~~p~~v~~~~~Dp~sp~yGkP~~y~I~---g~--~IH~SRliif 287 (862) T protein:vir:99 213 IFVVDSEDPDYYEKPFNPDGITPGSYRGISQIDPYWMMPMLTAESTADPSSQFFYEPEFWIIS---GQ--KYHRSHLIIA 287 (862) T ss_pred EEEecCcCchhhhcCcCcccccccceeEEEEechhhhcccccccccccccccccCCceeeeec---Ce--eeccceeEEe Confidence 54321111100 0011 1111111100 000000 011111111 11 2233444554 Q ss_pred cc--------cccc--cccchhHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCc--CChHHHHHHHHHHHHHHHHHhc Q lcl|NC_010576. 186 ES--------PFYA--ILNDTNQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYS--TKSTARAAQAARRKQEIENEMA 253 (447) Q Consensus 186 ~~--------~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~--~~~~~~~~~~~~~~~~~~~~~~ 253 (447) .+ +.++ +.+.+..++..+.................... +++++.. +..+ +...+++ ...+.+. 
T Consensus 288 ~g~~vpd~lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka~l~-v~ktd~l~~l~~e--d~l~~r~--~~~~~~r 362 (862) T protein:vir:99 288 RGPQPADILKPTYIFGGIPLVQRIYERVYAAERTANEAPLLAMNKRTT-AIHTDTAKAIANE--DKFIQRL--MFWVRYR 362 (862) T ss_pred cCCCchhhhhccCCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccc-eeechhHhhhccH--HHHHHHH--HHHHhcc Confidence 32 1111 22222223333333222222222211122222 3333321 1111 1222222 2223444 Q ss_pred cCCcceeecCCCceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCC------c-HHHHHHHHHH-------HHHh Q lcl|NC_010576. 254 NNKYGVATLDTQEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNGT------A-NEQQTLGYYN-------RCVD 319 (447) Q Consensus 254 ~n~~~~~vl~~g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~------~-~e~~~~~f~~-------~ti~ 319 (447) +| .++++++.+.+|+.++.+..... +.+.....+||.+.+||..+|-|. + .|.....||. .-|. T Consensus 363 dN-~Gi~liD~eEe~e~ls~slSGL~-dll~~~~q~IAaas~IP~tiLfGqspaGlnATGE~D~~nYyD~I~s~QE~~L~ 440 (862) T protein:vir:99 363 DN-HAVKVLGTDETMEQFDTSLADFD-AVIMGQYQLVASIAKTPATKLLGTAPKGFNSTGEFETISYHEELESIQEHVYM 440 (862) T ss_pred Cc-ceeEEecCCCceeEEecccCChH-HHHHHHHHHHHhhhCCCceeecccCcccccCchHHHHHHHHHHHHHHHHHHHH Confidence 54 45889999999998887655322 223344558999999999987442 1 2444455554 4578 Q ss_pred HHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHH-------HHHHHhCCCcCHHHHHHHh------CCCC Q lcl|NC_010576. 320 VLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATV-------ADVLTRNAIYTPNEIRELT------GKAP 386 (447) Q Consensus 320 P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~-------~~~~~~~G~~t~NE~R~~~------gl~p 386 (447) |+++.+...+..++-. ... +.|.+++|...|.+++++. +.+++++|+++++|+|+++ |++. 
T Consensus 441 P~LerL~~li~~~lg~----~~d--~~ieFnpL~~~sekEkAEi~kk~Aea~~~lv~sGvispdEvR~~L~~~~~~g~~~ 514 (862) T protein:vir:99 441 PFLQRHYLISRLSLGI----QHE--IDVVMEPVASMTAQQQADLNKTKAEGGKVLIDGGVISPDEERNRIRDDKRSGYNR 514 (862) T ss_pred HHHHHHHHHHHHhcCC----CCc--ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCcCCCC Confidence 9999998877665532 223 4445569988888887755 6789999999999999976 4444 Q ss_pred CCCcccc--cc-ccccccchhhccccc-CC-CCC---CCCCCCcCCCCC---------------CCcccccccCCccCcC Q lcl|NC_010576. 387 HPNPLAN--EL-FNRNIADGNQVGGIN-TP-GQI---TSDQPATASTDP---------------LNNVSTSAIENGSLTD 443 (447) Q Consensus 387 ~~g~~~~--~~-~~~~~~~~~~~~~~~-~~-~~~---~~~~~~~~~~~~---------------~~~~~~~~~~~~~~~~ 443 (447) ++..... .. .+.++......++.. .. .++ .....+.+++.+ .....++..+.+.+-. T Consensus 515 l~ded~E~d~~~~~e~~~~~e~~g~a~~~ap~de~~aga~~~~~e~d~~~~p~~~~~~~g~~~~~t~~~~a~~p~~~~~~ 594 (862) T protein:vir:99 515 LTKEDAEETPGASPENLAAYQKAGAAQETASAKETQAGAAVTTAEGDQPNVQMVPSMKPGQMVGPEVGITAPMPEDDAPV 594 (862) T ss_pred CCcccccccCCCCcccccccccCCcccccccccccccccCCccccCCcccccccCCCCCCCccccccccccCCCcccccc Confidence 4432110 00 011111110000000 00 000 000000011111 0001111122222222 Q ss_pred CCCC Q lcl|NC_010576. 444 GGSY 447 (447) Q Consensus 444 ~~~~ 447 (447) +|-| T Consensus 595 ~~~~ 598 (862) T protein:vir:99 595 AGVV 598 (862) T ss_pred Cccc Confidence 3333 No 110 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.65 E-value=1.7e-16 Score=107.02 Aligned_cols=379 Identities=10% Similarity=0.069 Sum_probs=157.3 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccc-cchhhhhhHHHHHHHHHHHHhhccCceEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYS-RSYSYNKADLIKSVITRIALDASMVDFKHL 79 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~ 79 (447) ||+|-+-. +++..+...-.+.+....+.. ....+......+. 
-...|.+++.++.+|+.+|+++-+-.+++ T Consensus 1 ~~~~m~~~------~~~~~~~D~~~~~~~~~~g~~-~~~~~~~~~~~~~~l~~~Y~~~~l~~~~Vd~~aed~~r~g~~i- 72 (435) T protein:vir:79 1 MGVFMSDK------VKAITKEDGYNEIFGSKDGTF-RPNAFYMQRAAFKALSQFYEEDGMARRIVDVIPEEMVTPGFKV- 72 (435) T ss_pred CCcccccc------cccchhhcchhhhhccccccc-ccCcccCCcCCHHHHHHHHhcCchhhhhhccchHHhhcCCcee- Confidence 99985411 111111111111111100000 0000011111111 12346788999999999999999888764 Q ss_pred EEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccc---------eeee-ccCCC Q lcl|NC_010576. 80 KIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSG---------SFDI-NTARV 149 (447) Q Consensus 80 r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~---------~~~~-~~~~~ 149 (447) . .++. . ..+...+. + +- ..+-+...+....++|.|++++.-.+...... .+.+ .+..+ T Consensus 73 ~--g~~~-~----~~~~~~~~-~---l~-~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~~~g~i~~i~v~d~~~i 140 (435) T protein:vir:79 73 D--GVKN-E----KSFKSRWD-E---LR-LNAKIIDALSWSRLFGGSAILAVVADNKMLKSPVKPGAQLEDIRVYDRYQI 140 (435) T ss_pred c--CCCh-H----HHHHHHHH-H---hh-HHHHHHHHHHhhhccccEEEEEEecCCCCcccccccCCceeeEEeechhhc Confidence 2 1111 0 11222221 1 11 11233334444557788887765432211111 1111 11111 Q ss_pred cceeee-----cCCceEEEEeeecccc-cceeeeccccccccccc--------ccccccc--h-hHHHHHHHHHHHHHHH Q lcl|NC_010576. 150 GKIMQF-----FPRQVMVRVWNDNTGL-EQDLLVSKENCIIIESP--------FYAILND--T-NQTLRMLEQKIKLMNS 212 (447) Q Consensus 150 ~~~~~~-----~~~~~~~~~~~~~~~~-~~~~~~~~~~v~~~~~~--------~~~~~~~--~-~~~~~~~~~~~~~~~~ 212 (447) . +..+ .++-.....|...... .....++++.++|+.+. .++.++. + ..++..+......... 
T Consensus 141 ~-~~~~~~dp~sp~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~e~~~~~l~~~~~~~~~ 219 (435) T protein:vir:79 141 T-IHERETNARSVRYGEPKLYKISPGGDIPEFFVHYSRICIIDGERVSNEKRRQNDGWGASILNKRLIEAIVDYNYCQEL 219 (435) T ss_pred c-chhhccCCcccccCcceEEEEecCCCCCceEEcceeEEEecCCcchhhhccccCcccchHHHHHHHHHHHHHHHHHHH Confidence 0 0000 0111112223222111 11234556677777531 1111111 1 1222222222222211 Q ss_pred HHH--HhhcCcccceeeeCC---cCChHH-HHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhhHHHHHHH Q lcl|NC_010576. 213 QDN--RASSGKLNGFIQFPY---STKSTA-RAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNLLSDVRQL 286 (447) Q Consensus 213 ~~~--~~n~~~~~gvl~~~~---~~~~~~-~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~l~~~~~~ 286 (447) ... +.+... +++++. .+..+. .....+++ .......++.+.+++.....+|+.++.+..... +..... T Consensus 220 ~~~l~~~~~~~---v~~~~~l~~~~~~~~~~~~~~~r~--~~~~~~~~~~~~~~i~~~~e~~e~~~~~lsgl~-~~~~~~ 293 (435) T protein:vir:79 220 ATQLLRRKQQA---VWKARDLALMCDDEEGRYAARLRL--AQVDDESGVGKAIGIDATDEEYEVLNSDVSGVP-EFLQEK 293 (435) T ss_pred HHHHHHHhcCc---cccchhHHHhhcCccchHHHHHHH--HHHHHhcCCCCceeEecCCcceEEEecccCCHH-HHHHHH Confidence 111 122111 233321 111111 11222222 122334555566766666678998887655421 335566 Q ss_pred HHHHHHHhCCCHHHhcCC------cH-HHHHHHHHHH-------HHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchh Q lcl|NC_010576. 287 QQDFYNQMGITEAILNGT------AN-EQQTLGYYNR-------CVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPF 352 (447) Q Consensus 287 ~~~Ia~~fgVP~~~l~g~------~~-e~~~~~f~~~-------ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l 352 (447) ..+||.+.|||..+|.|. ++ +.....||.. 
-+.|.+..+=+.+ .+..+.+|+ +++| T Consensus 294 ~~~iaaa~~IP~t~L~G~s~~glnstgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~li--------~~s~d~~~~--f~pL 363 (435) T protein:vir:79 294 IDRIVALTGIHEIIIKNKNTGGVSASQNTALETFYKLIDRKRVEDYKPILEFLLPFM--------ISETEWSIE--FEPL 363 (435) T ss_pred HHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--------hcCCCCeEE--eCCC Confidence 789999999999887542 22 3333444432 2334433332222 122344555 4588 Q ss_pred hhcCHHHH-------HHHHHHHHhCCCcCHHHHHHHh-CCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCC Q lcl|NC_010576. 353 KLVPVEQL-------ATVADVLTRNAIYTPNEIRELT-GKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATAS 424 (447) Q Consensus 353 ~~~d~~~~-------~~~~~~~~~~G~~t~NE~R~~~-gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 424 (447) ...|.+++ ++++.+++++|+++++|+|+.+ ...|..|-.+..+. .+ .+.++.++.+. T Consensus 364 ~~~sekEkAei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~~--~~-------------~~~~d~~~~~~ 428 (435) T protein:vir:79 364 SVPSDKDKAEIMAKNVESVVKLKAEQAINLKETRDTLRSICPDLKIMDNDNI--EL-------------PEPEDLDPEPG 428 (435) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhccccCCCCcccc--cC-------------CccccCCCCCC Confidence 87777655 4557778999999999999987 33332221111110 00 01111122222 Q ss_pred CCCCCcc Q lcl|NC_010576. 425 TDPLNNV 431 (447) Q Consensus 425 ~~~~~~~ 431 (447) +...+|+ T Consensus 429 ~e~g~~~ 435 (435) T protein:vir:79 429 QEGGLNK 435 (435) T ss_pred CCCCCCC Confidence 2222222 No 111 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.63 E-value=3.5e-15 Score=99.85 Aligned_cols=420 Identities=15% Similarity=0.085 Sum_probs=163.4 Q ss_pred CchhHhhhhhcccccCCccccccc-----------cccccccccccccccccccCC------------------cccccc Q lcl|NC_010576. 
1 MASSDRLLHSWNAFQSNQNQNQNT-----------NDFLTPSNGMTSFGGYYGRGQ------------------SNYSRS 51 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~------------------~~~~~~ 51 (447) |.=+.+++.+..-+.+ .+..... .+..... .+...++.+..+. ..+--. T Consensus 37 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~a~ds~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~gyql~ 114 (765) T protein:vir:96 37 MIKLGKIRGWNVEPEK-APVIRSVKDFLEPGLSVAMDSAYGD-GPTPAAKAAAGGQNPYVVPTMLQDWYNSQGFIGYQAC 114 (765) T ss_pred chhHHHHhhccccccc-CCCCCCCCcccCcccceeccccccc-cccchHHHhhhccCccchhhHHHhhhcccCCccHHHH Confidence 6655554432211111 1100000 0110000 0000011000000 001112 Q ss_pred hhhhhhHHHHHHHHHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEe Q lcl|NC_010576. 52 YSYNKADLIKSVITRIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPI 131 (447) Q Consensus 52 ~~~~~~~~v~~cv~~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~ 131 (447) ..|.++..++.+|+++|+++-+-.+.+- .++. +..+ .....|+..=..+. ..+-+...+....++|.+|+++. T Consensus 115 alY~~~~l~rkiVd~pAeDa~R~g~~I~---~~~~--e~~~-~~~~~l~~~~~rl~-v~~~l~ea~~~~RlyGga~i~i~ 187 (765) T protein:vir:96 115 AIISQHWLVDKACSMSGEDAARNGWELK---SDGR--KLSD-EQSALIARRDMEFR-VKDNLVELNRFKNVFGVRIALFV 187 (765) T ss_pred HHHHhCchhhhhhhcchHHhhcCCceee---cCcc--ccCH-HHHHHHHHHHHHhh-HHHHHHHHHHHhhhceeeEEEEE Confidence 2377889999999999999988777652 1111 1111 11122211100111 12223333445567888887654 Q ss_pred eccCCccc-------c--------eee-eccCCCcce-e-eecCCce-----EEEEeeecccccceeeeccccccccccc Q lcl|NC_010576. 132 DTTVDPDS-------G--------SFD-INTARVGKI-M-QFFPRQV-----MVRVWNDNTGLEQDLLVSKENCIIIESP 188 (447) Q Consensus 132 ~~~~~~~~-------~--------~~~-~~~~~~~~~-~-~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~v~~~~~~ 188 (447) -....+.. . .+. +.+..+... + .+..+.. ....|... +. .++++.++|+.+. 
T Consensus 188 i~~~D~~~l~~PL~~~~I~kg~~kgl~vldp~~~~~~~v~e~~~Dp~sp~fg~P~~y~i~---g~--~IH~SRli~~~g~ 262 (765) T protein:vir:96 188 VESDDPDYYEKPFNPDGIAPGSYKGISQIDPYWAMPQLTAESTADPSAEHFYEPDFWIIS---GK--KYHRSHLVVVRGP 262 (765) T ss_pred ecccCcchhhccccccccccceeeEEEEechhhcccccchhccccccccccCcceeeeec---Cc--eeccceEEEecCC Confidence 32111000 0 011 111111000 0 0000100 11112111 11 2334556666432 Q ss_pred c--------c--ccccchhHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCC--cCChHHHHHHHHHHHHHHHHHhccCC Q lcl|NC_010576. 189 F--------Y--AILNDTNQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPY--STKSTARAAQAARRKQEIENEMANNK 256 (447) Q Consensus 189 ~--------~--~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~--~~~~~~~~~~~~~~~~~~~~~~~~n~ 256 (447) . + .+.+.+..++..+.................... +++++. .+..+ +...+++ +.+ +++.+| T Consensus 263 ~lpd~lk~~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~~-v~k~~~~~~l~~~--~~l~~r~-~~~-~~~r~n- 336 (765) T protein:vir:96 263 QPPDILKPTYIFGGIPLTQRIYERVYAAERTANEAPLLAMSKRTS-TIHVDVEKAIANE--DAFNARL-AFW-IANRDN- 336 (765) T ss_pred CchhhhccccCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccc-eeeechHhhhccH--HHHHHHH-HHH-HHhcCC- Confidence 1 1 111222222222222222221111111111111 333322 12211 2222222 222 234444 Q ss_pred cceeecCCCceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCC------cH-HHHHHHHHH-------HHHhHHH Q lcl|NC_010576. 257 YGVATLDTQEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNGT------AN-EQQTLGYYN-------RCVDVLL 322 (447) Q Consensus 257 ~~~~vl~~g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~------~~-e~~~~~f~~-------~ti~P~~ 322 (447) .++++++.+.+|+.++.+..... +.+....++||.+.+||..+|-|. ++ |.....||. 
.-|.|.+ T Consensus 337 ~g~~~id~ee~~e~~s~~lsgl~-d~l~~~~~~iAaas~IP~t~LfGqsp~GlnATGe~D~~nYyD~I~s~Qe~~l~p~l 415 (765) T protein:vir:96 337 HGVKVIGIDETMEQFDTNLSDFD-SVIMNQYQLVAAIAKTPATKLLGTSPKGFNATGEHETISYHEELESIQEHIFDPLL 415 (765) T ss_pred ceeEEecCCcceeEEecccCCHH-HHHHHHHHHHHhhhCCCeeeeccCCcccccCcchHHHHHHHHHHHHHHHHHHHHHH Confidence 46788999999999887655421 234556778999999999887442 12 444455554 3455665 Q ss_pred HHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHH-------HHHHHhCCCcCHHHHHHHhCCCCCCCcc--cc Q lcl|NC_010576. 323 QYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATV-------ADVLTRNAIYTPNEIRELTGKAPHPNPL--AN 393 (447) Q Consensus 323 ~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~-------~~~~~~~G~~t~NE~R~~~gl~p~~g~~--~~ 393 (447) +.+=+.|-+. ..+..+ +.|.+++|...|.+++++. +.+++++|+++++|+|+.+..+|.-|.. .+ T Consensus 416 e~L~~li~~s----~~i~~d--~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~Gvis~dEvR~~L~~~~~~g~~~l~d 489 (765) T protein:vir:96 416 ERHYLLLAKS----ESIDVQ--LEIVWNPVDSTTSQQQAELNNKKAATDEIYINSGVVSPDEVRERLRDDPRSGYNRLTD 489 (765) T ss_pred HHHHHHHHHh----cCCCCc--ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhccccCCCCCCCc Confidence 5555544332 122333 5555568888888777654 7889999999999999998766543221 11 Q ss_pred cccc--ccccchh--hcccc----cCCCCCCCCCCCcCCCCCCCcccccccCCcc--CcCCCCC Q lcl|NC_010576. 394 ELFN--RNIADGN--QVGGI----NTPGQITSDQPATASTDPLNNVSTSAIENGS--LTDGGSY 447 (447) Q Consensus 394 ~~~~--~~~~~~~--~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~ 447 (447) +-+. ..+.+.. +..+. .+..++.+..+++.+......+--++.+.++ ....|.. 
T Consensus 490 ~~~e~~~~~~pe~~~~~~~~~~~~~~~~~e~~~~~a~p~~~eg~~~~~~~~p~~~~p~~~~~~~ 553 (765) T protein:vir:96 490 DQAETEPGMSPENLAELEKAGAQSAKAKGEAERAEAQAGAVEGAGDPVPAAPRGTKPLAKAAEE 553 (765) T ss_pred cccccccCCCccccccccCCCcccccccCccccccCCCCccCCCCcccccCCcccCCccccccc Confidence 1110 0111100 00000 0000000000000000000000000000000 0001111 No 112 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.55 E-value=4.3e-15 Score=99.34 Aligned_cols=393 Identities=12% Similarity=0.078 Sum_probs=166.4 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccc---cc-cccccccC-Cccccc-chhhhhhHHHHHHHHHHHHhhccC Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGM---TS-FGGYYGRG-QSNYSR-SYSYNKADLIKSVITRIALDASMV 74 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~---~~-~~~~~~~~-~~~~~~-~~~~~~~~~v~~cv~~ia~~ia~l 74 (447) |+-.+--.... + .+ .+....+......+. .+ .++.+... ...+.. ...|..+..++.+|+.+++++-+- T Consensus 1 ~~~~~~a~~~~-~-~~---~a~~~~~~~~~~g~~~~~d~~~~~~~~~~~~~~~~~l~~lY~~~~l~r~iVd~~a~d~~r~ 75 (461) T protein:vir:80 1 MYSIDKAKQAK-I-DS---KIVNRNDFMVGHGKANSRDKLTRQTPGNGQKLDLKACENLYASNSIAMNIVDIISEDMVRA 75 (461) T ss_pred Cccchhhhhhh-h-hh---hhhhhhHHHhhcCCcchhhhhhccccCcccccCHHHHHHHHHhCCccchhhccchHHhhcC Confidence 55443211000 0 00 000000100000000 00 00100000 001111 134667788899999999999887 Q ss_pred ceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcc---------cc---ee Q lcl|NC_010576. 75 DFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPD---------SG---SF 142 (447) Q Consensus 75 p~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~---------~~---~~ 142 (447) .+++ ... + ......|...=+.+-.... +...+....++|.+++++.-.+.... .. 
.+ T Consensus 76 g~~i-~~~-~--------~~~~~~~~~~~~~l~~~~~-l~~~~~~~rl~G~a~i~i~v~d~~~~~~~~~~pl~~~~~~~~ 144 (461) T protein:vir:80 76 GWSL-KTD-N--------KEMKKNIESKWRKLKTKDR-FQKLYADKRLYGDGFLSIGVVSSNREQADLSTAIDPKTIKSI 144 (461) T ss_pred Ceee-ecC-C--------HHHHHHHHHHHHHhhHHHH-HHHHHHhhcccccEEEEEEeecCCccccCccCCcccccccce Confidence 7754 211 1 1111122111111112223 33344556688999988753221110 00 00 Q ss_pred -eeccCC-----Ccceeee--cCCceEEEEeeecc------------cccceeeecccccccccccccccccchhHHHHH Q lcl|NC_010576. 143 -DINTAR-----VGKIMQF--FPRQVMVRVWNDNT------------GLEQDLLVSKENCIIIESPFYAILNDTNQTLRM 202 (447) Q Consensus 143 -~~~~~~-----~~~~~~~--~~~~~~~~~~~~~~------------~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 202 (447) .+.+.. ....... .++-.....|.... .......++++.++|+.+....-...+.+.++. T Consensus 145 ~~l~~~~~~~i~~~~~~~dp~sp~fg~P~~y~i~~~~~~~~~~~~~~~~~~~~~iH~SRii~~~~~~~~~~~~G~S~le~ 224 (461) T protein:vir:80 145 PYINTFNTQKVTQLYLNQDMFSEHFGEVEFFEVNRVSQLGEEILSGTTASTSEQIHRSRIIHEQGLRFEGETKGRSIFES 224 (461) T ss_pred eEEEeccccccchhhhcccCcCcccccceEEEEeccccccccccccccCccceEEccccEEEecCCCCCccccCcchHHH Confidence 000000 0000000 00111111121110 111123456788899875322111112233443 Q ss_pred HHHHHHHHHHHHH----HhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhh Q lcl|NC_010576. 203 LEQKIKLMNSQDN----RASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNN 278 (447) Q Consensus 203 ~~~~~~~~~~~~~----~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~ 278 (447) +...+........ ....... .+++++.. .. ...+......+.+. ...+ ..++++++...+|+.++.+.... 
T Consensus 225 ~~~~l~~~~~~~~~~~~l~~~~~~-~v~k~~~l-~~-~~~~~~~~~~~~~~-~~~~-~~g~~~~d~~e~~e~~~~~lsgl 299 (461) T protein:vir:80 225 LYDIITVMDTSLWSVGQILYDFAF-KVYKTDDI-DA-LNKDDKANLTAMLD-FMFR-TEALAIIKGDEQLTKESTNVSGM 299 (461) T ss_pred HHHHHHHHHHHHHHHHHHHHHhCC-CceecchH-Hh-hhchHHHHHHHHHH-HhcC-CceEEEEcCCcceEEEecCcCCH Confidence 3333322221111 1101111 24454421 11 11111122222232 2233 45688899999999888765542 Q ss_pred hHHHHHHHHHHHHHHhCCCHHHhcCC-----cH-HHHHHHHHH-------HHHhHHHHHHHHHHHhhcCChhH-hcC-Cc Q lcl|NC_010576. 279 LLSDVRQLQQDFYNQMGITEAILNGT-----AN-EQQTLGYYN-------RCVDVLLQYVTDAISRIALTKTA-VSQ-GQ 343 (447) Q Consensus 279 ~l~~~~~~~~~Ia~~fgVP~~~l~g~-----~~-e~~~~~f~~-------~ti~P~~~~ie~~l~~kLl~~~e-~~~-g~ 343 (447) . +.++.....||.+-+||..+|.|. ++ +.....||. .-+.|+++.+-+.+-+..+.... .++ .+ T Consensus 300 ~-~~l~~~~~~iaa~s~iP~t~L~G~s~g~~asge~D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~~ 378 (461) T protein:vir:80 300 K-DLLDYGWDYLAGAVRMPKTVLKGQEAGTLTGAQYDVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPDSF 378 (461) T ss_pred H-HHHHHHHHHHhhhhcCCeeeeecccCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCcccc Confidence 2 345567779999999999887432 22 434444432 34667777777766554432111 111 24 Q ss_pred eEEEecchhhhcCHHHHHH-------HHHHHHhCCCcCHHHHHHHh----CCCCCCCccccccccccccchhhcccccCC Q lcl|NC_010576. 344 VLVYYRNPFKLVPVEQLAT-------VADVLTRNAIYTPNEIRELT----GKAPHPNPLANELFNRNIADGNQVGGINTP 412 (447) Q Consensus 344 ~i~f~~~~l~~~d~~~~~~-------~~~~~~~~G~~t~NE~R~~~----gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~ 412 (447) .+.|.+++|...|.+++++ ++.+++++|+++++|+|+.+ +++|..+..++. .+. ........ T Consensus 379 ~~~i~f~~L~~~s~kekAe~~~~~a~a~~~~~~~g~is~~e~r~~l~~~~~~~~~~~~~~~~---~~~---~~~~~~~~- 451 (461) T protein:vir:80 379 EWAIEFNPLWNLDSKTDAEVRKLTAEADQIYIVNGVLDPDEVKETRFGRFGLENSSKFSGDS---AEI---DKLAKLVY- 451 (461) T ss_pred ceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhcCCCCCccCCCCC---chh---hhhhhhcc- Confidence 6777778999888888765 48889999999999999855 333321111110 000 00000000 Q ss_pred CCCCCCCCCcCCCCCCCccccc Q lcl|NC_010576. 
413 GQITSDQPATASTDPLNNVSTS 434 (447) Q Consensus 413 ~~~~~~~~~~~~~~~~~~~~~~ 434 (447) +.+.+.+.++ T Consensus 452 ------------~~~~~e~~~g 461 (461) T protein:vir:80 452 ------------DAYAKKNADG 461 (461) T ss_pred ------------ccccccCCCC Confidence 0000000111 No 113 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.52 E-value=1.6e-14 Score=96.17 Aligned_cols=369 Identities=11% Similarity=0.040 Sum_probs=156.9 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) |=-.|.+.+++ ..-. +. ...++... ....+.-...|.++..++.+|+++|+++-+-.|++ . T Consensus 1 ~~~~D~~~n~~---~gg~----~~----------~~~~~~~~-~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i-~ 61 (422) T protein:vir:10 1 MVKTDSYANIF---LGGS----DG----------SEIYGSLQ-NQAPTILASLYADNALVRRIIDTIPETALAAGFHI-D 61 (422) T ss_pred CccchhhHHHH---cCCC----CC----------ccccCccc-ccCHHHHHHHHHhChhhHHHHhhhhHHHhcCCccc-c Confidence 55555444422 1100 00 00011111 01111223457789999999999999998877764 1 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCccc---------ceeee-ccCCCc Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDS---------GSFDI-NTARVG 150 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~---------~~~~~-~~~~~~ 150 (447) .++.. ......+. .|+. .+-+...+....++|.|++++.-.+..... ..+.+ .+..+. 
T Consensus 62 --~~~~~-~~~~~~~~-~l~~--------~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~ 129 (422) T protein:vir:10 62 --GIDDE-PAFWSRWD-DLEM--------TQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVK 129 (422) T ss_pred --CCCHH-HHHHHHHH-HhhH--------HHHHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEeecccccc Confidence 11111 11111111 1221 223333445556788888887542211100 01111 111111 Q ss_pred cee----eecCCceEEEEeeeccc-ccceeeeccccccccccc--------ccccc--cchh-HHHHHHHHHHHHHHHHH Q lcl|NC_010576. 151 KIM----QFFPRQVMVRVWNDNTG-LEQDLLVSKENCIIIESP--------FYAIL--NDTN-QTLRMLEQKIKLMNSQD 214 (447) Q Consensus 151 ~~~----~~~~~~~~~~~~~~~~~-~~~~~~~~~~~v~~~~~~--------~~~~~--~~~~-~~~~~~~~~~~~~~~~~ 214 (447) ... ...++-.....|..... ......++++.++|+.+. .+... +.+. .++..+........... T Consensus 130 ~~~~~~dp~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~ 209 (422) T protein:vir:10 130 VQTREENPRNARFGEPLTYRITTNESDMFYDVHYSRIHIIDGERIPNVMRRQNDGWGRSVLSSDILDSIKDYTNCERLAT 209 (422) T ss_pred chhcccCccccccCcceEEEEecCCCCcceeeccceeEEeCCCCchhhhcccCCcccchhHHHHHHHHHHHHHHHHHHHH Confidence 000 00011111122222111 112234455667777432 11111 1111 11222222222211111 Q ss_pred --HHhhcCcccceeeeCC---cCChH-HHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhhHHHHHHHHH Q lcl|NC_010576. 215 --NRASSGKLNGFIQFPY---STKST-ARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNLLSDVRQLQQ 288 (447) Q Consensus 215 --~~~n~~~~~gvl~~~~---~~~~~-~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~l~~~~~~~~ 288 (447) .+.+... +++++. .+... ......+++ .......++.+.+++...+.+|++++.+.... -+.+.+... 
T Consensus 210 ~l~~~~~~~---v~~~~~l~~~~~~~~~~~~~~~r~--~~~~~~~~~~~~~~l~~~~e~~e~~~~~lsgl-~~~~~~~~~ 283 (422) T protein:vir:10 210 QLLKRKQQA---VWKAKGLAELCDDSEGFGAARLRL--AQVDNNSGVGQAIGIDAESEEYSVLNSDIGGI-DAFLDKKFD 283 (422) T ss_pred HHHHHhccc---cccchhHHHhcCCccchHHHHHHH--HHHHHhcCCccceeEecCCcceEEEecccCCh-HHHHHHHHH Confidence 1122211 233332 11221 122223322 22234455666676666778999888776542 133566678 Q ss_pred HHHHHhCCCHHHhcCC------cH-HHHHHHHHH-------HHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhh Q lcl|NC_010576. 289 DFYNQMGITEAILNGT------AN-EQQTLGYYN-------RCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKL 354 (447) Q Consensus 289 ~Ia~~fgVP~~~l~g~------~~-e~~~~~f~~-------~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~ 354 (447) +||.+.|||..+|.|. ++ +.....||. .-|.|.++.+=+.+ .+..+.+|+| ++|.. T Consensus 284 ~iaaa~~IP~t~L~G~s~~Glnatgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~~i--------~~s~~~~~~f--~pL~~ 353 (422) T protein:vir:10 284 RIVALSGIHEIILKNKNVGGVSSSQNTALETFHKLVDRKRNAELLPILEFLIPFI--------VNAEEWSVEF--NPLAQ 353 (422) T ss_pred HHHhhhCCCeeeeccCCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--------cccCCcEEEe--CCCCC Confidence 9999999999988542 12 333344443 23445444433222 1223455555 47777 Q ss_pred cCHHHH-------HHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCC Q lcl|NC_010576. 355 VPVEQL-------ATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDP 427 (447) Q Consensus 355 ~d~~~~-------~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 427 (447) .|.+++ ++++.+++++|+++++|+|+.+--.....+..+. +.+... ...+.+++.+.+++ T Consensus 354 ~sekekaei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~-----~~~~~~--------~~~~~~~~~~~~~~ 420 (422) T protein:vir:10 354 ESSKDKAEILEKNVNSIAALIAAGAMDIDEARDTLRTIAPEVKINDG-----SVETEV--------TISETSNDPLEVPT 420 (422) T ss_pred CCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHhhhhcccccCCCC-----CCcccc--------chhhcCCCCCCCCC Confidence 777654 4557888999999999999987333222211111 111100 00000011111111 Q ss_pred CC Q lcl|NC_010576. 
428 LN 429 (447) Q Consensus 428 ~~ 429 (447) .+ T Consensus 421 ~d 422 (422) T protein:vir:10 421 DD 422 (422) T ss_pred CC Confidence 11 No 114 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.46 E-value=5.5e-14 Score=93.27 Aligned_cols=367 Identities=13% Similarity=0.098 Sum_probs=155.0 Q ss_pred CchhHh--hhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEE Q lcl|NC_010576. 1 MASSDR--LLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKH 78 (447) Q Consensus 1 Mg~~~~--l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~ 78 (447) |-.|.. +.++++.-.. .+..... .....+.-...|.+++.++.+|+++|+++-+-.+++ T Consensus 1 ~~~~~~d~~~~~~~~~~~-----------------~~~~~~~--~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i 61 (427) T protein:vir:10 1 MKIVKHDGYNDIFNGGAD-----------------GSPKPFF--MSDASYHVGSFYNDNATAKRIVDVIPEEMVTAGFKM 61 (427) T ss_pred CCccccchHHHHhhcCCC-----------------CcccCcc--ccCchHHHHHHHHcCchhhhhhccchHHhhcCCccc Confidence 444332 2222111000 0000000 011112223457789999999999999999877764 Q ss_pred EEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCc-cc--------ceeeec-cCC Q lcl|NC_010576. 79 LKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDP-DS--------GSFDIN-TAR 148 (447) Q Consensus 79 ~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~-~~--------~~~~~~-~~~ 148 (447) + .++.. ..+...+. +=+ ..+-+...+....++|.+++++.-.+..+ .. ..+.+. +.. 
T Consensus 62 -~--g~~~~-----~~~~~~~~-~l~----~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~d~~~ 128 (427) T protein:vir:10 62 -S--GVKDE-----KEFKSLWD-SYK----LDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVYDRFA 128 (427) T ss_pred -c--CccHH-----HHHHHHHH-Hhh----HHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEechhc Confidence 2 11111 11222221 111 11233344455567888888764322111 00 011111 111 Q ss_pred Ccceeee-----cCCceEEEEeeeccc-ccceeeeccccccccccc--------ccccccc--h-hHHHHHHHHHHHHHH Q lcl|NC_010576. 149 VGKIMQF-----FPRQVMVRVWNDNTG-LEQDLLVSKENCIIIESP--------FYAILND--T-NQTLRMLEQKIKLMN 211 (447) Q Consensus 149 ~~~~~~~-----~~~~~~~~~~~~~~~-~~~~~~~~~~~v~~~~~~--------~~~~~~~--~-~~~~~~~~~~~~~~~ 211 (447) + .+..+ .++-.....|..... ....+.++++.++|+.+. .++..+. + ..++..+........ T Consensus 129 ~-~~~~~~~dp~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~ 207 (427) T protein:vir:10 129 I-TVEKRVTNARSPRYGEPEIYKVSPGDNMQPYLIHHSRVFIADGERVAQQARKQNQGWGASVLNKSLIDAICDYDYCES 207 (427) T ss_pred c-cccccccCccccccCcceEEEEecCCCCcceEEccccEEEecCCCchhhhcccCCcccchhhhHHHHHHHHHHHHHHH Confidence 1 00000 011111222222211 112234556667777532 1111111 1 112222222222211 Q ss_pred HHHH--HhhcCcccceeeeCC---cCChHH-HHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhhHHHHHH Q lcl|NC_010576. 212 SQDN--RASSGKLNGFIQFPY---STKSTA-RAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNLLSDVRQ 285 (447) Q Consensus 212 ~~~~--~~n~~~~~gvl~~~~---~~~~~~-~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~l~~~~~ 285 (447) .... +++.. -+++++. .+.... .....+++ .......++.+.+++...+.+|++++.+..... +.... 
T Consensus 208 ~~~~l~~k~~~---~v~k~~~l~~~~~~~~~~~~~~~r~--~~~~~~~~~~~~~~l~~~~e~~e~~~~~lsgl~-~~~~~ 281 (427) T protein:vir:10 208 LATQILRRKQQ---AVWKVKGLAEMCDDDDAQYAARLRL--AQVDDNSGVGRAIGIDAETEEYDVLNSDISGVP-EFLSS 281 (427) T ss_pred HHHHHHHHhcc---ccccchhHHHHhcCccchHHHHHHH--HHHHHhcCcccceeeecCCCceeEEecccCChH-HHHHH Confidence 1111 11111 1333322 111111 11222222 223344566667777777789998887665421 23556 Q ss_pred HHHHHHHHhCCCHHHhcCC------cH-HHHHHHHHH-------HHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecch Q lcl|NC_010576. 286 LQQDFYNQMGITEAILNGT------AN-EQQTLGYYN-------RCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNP 351 (447) Q Consensus 286 ~~~~Ia~~fgVP~~~l~g~------~~-e~~~~~f~~-------~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~ 351 (447) ...+||.+.+||..+|-|. ++ +.....||. .-|.|.++.+=+.+- +..+.+++|+ + T Consensus 282 ~~~~iaaa~~IP~t~L~G~sp~Glnstgd~D~~nyyd~i~~~Qe~~l~p~l~~l~~~i~--------~s~~~~~~f~--p 351 (427) T protein:vir:10 282 KMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV--------DEEEWSIEFE--P 351 (427) T ss_pred HHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh--------cCCCcEEEeC--C Confidence 6778999999999988542 22 333344443 234555544433221 2234555554 7 Q ss_pred hhhcCHHHH-------HHHHHHHHhCCCcCHHHHHHHh----CCCCCCCccccccccccccchhhcccccCCCCCCCCCC Q lcl|NC_010576. 352 FKLVPVEQL-------ATVADVLTRNAIYTPNEIRELT----GKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQP 420 (447) Q Consensus 352 l~~~d~~~~-------~~~~~~~~~~G~~t~NE~R~~~----gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 420 (447) |...+.+++ ++++.+++++|+++++|+|+.+ +..++.++ .+. .... .+...+ .+ T Consensus 352 L~~~s~kEkaei~~~~a~a~~~~~~~gvi~~~e~r~~L~~~~~~~~~~~~--~~~------~~e~------~~~~~e-~~ 416 (427) T protein:vir:10 352 LSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDG--NNI------NIRE------PEETTE-PE 416 (427) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhhhccccCCCC--ccc------cccc------cchhcC-CC Confidence 777766655 4567888999999999999876 23333221 110 0000 001111 11 Q ss_pred CcCCCCCCCcccccccCC Q lcl|NC_010576. 
421 ATASTDPLNNVSTSAIEN 438 (447) Q Consensus 421 ~~~~~~~~~~~~~~~~~~ 438 (447) |. .+.+...+ + T Consensus 417 p~-----~~e~~~d~--~ 427 (427) T protein:vir:10 417 PG-----LGEKLEDE--N 427 (427) T ss_pred CC-----CCCCCCCC--C Confidence 11 11111000 0 No 115 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=99.45 E-value=9.5e-13 Score=86.50 Aligned_cols=413 Identities=11% Similarity=0.027 Sum_probs=173.9 Q ss_pred CchhHhhhhhcccccCCcccccccccccccccccccccccccc---------------CCcccccchhh-----hhhHHH Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGR---------------GQSNYSRSYSY-----NKADLI 60 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~-----~~~~~v 60 (447) |+- +.+.+++.........+-.....+....+..+.. .++.+.....+ .+.+.| T Consensus 1 ~~~------~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~~~m~e~D~~i 74 (528) T protein:vir:10 1 MAA------IVDIYGNPLRTQQLRKQQTAHLAGLAKEFANHPAKGLTPAKLAHILIEAEQGHLQAQAELFMDMEERDAHL 74 (528) T ss_pred CCe------eECCCCCccccccccchhhhhhhhhhhhhcccCCCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHH Confidence 432 2232222211111100000000010000111111 11111111111 146789 Q ss_pred HHHHHHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHH---HHHhcCCeeEEEeeccCCc Q lcl|NC_010576. 61 KSVITRIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLY---SLLDEGQIAMVPIDTTVDP 137 (447) Q Consensus 61 ~~cv~~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~---~lll~Gna~i~~~~~~~~~ 137 (447) .+|++.+...|.+++|.|.-...+....+....-+..+|..-| .|..++. +.+++|.+...+++...+. 
T Consensus 75 ~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~--------~f~~~i~~~lda~~~G~s~~Ei~w~~~~g 146 (528) T protein:vir:10 75 FAEMSKRKRAVLGLDWTIEPPRNASAAEKADAEYLHELLLDLE--------GIEDLMLDCMDGVGHGYSAIELDWSLQGR 146 (528) T ss_pred HHHHHHHHHHHhcCCceEecCCCCCHHHHHHHHHHHHHHhCCc--------cHHHHHHHHHhhhhhcceeEEEEEeecCC Confidence 9999999999999999875422221111111112333443222 2344443 3457999998887654321 Q ss_pred --ccceeeeccCCCcceeeecCCceEEEEeeecccccceeeecccc-cccccccccccccchhHHHHHHH-----HHHHH Q lcl|NC_010576. 138 --DSGSFDINTARVGKIMQFFPRQVMVRVWNDNTGLEQDLLVSKEN-CIIIESPFYAILNDTNQTLRMLE-----QKIKL 209 (447) Q Consensus 138 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-v~~~~~~~~~~~~~~~~~~~~~~-----~~~~~ 209 (447) ....+.+.+.+. ......+...++.. +.... .+.++... ++|..... ++.......+..+. ..... T Consensus 147 ~~~~~~~~~r~~~~--f~~~~~~~~~l~~~-~~~~~--g~~l~~~k~iv~~~~~~-~g~p~g~gLlr~~~w~~~fK~~~~ 220 (528) T protein:vir:10 147 EWLPQAFDHRPQSW--FQLNPDDQDELRLR-DNSIA--GEVLQPFGWIMHKPRSR-SGYVARSGLFRVLAWPYLFKHYST 220 (528) T ss_pred ceeEEEeeeecccc--eeeccCCCcEEecc-CCCCC--ceeecCCCeEEEeecCC-CCCccccchHHHHHHHHHHHHhhH Confidence 111222222111 11111112222221 11111 22333333 44432221 11111112222211 11222 Q ss_pred HHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCCh--hhhhHHHHHHHH Q lcl|NC_010576. 210 MNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGL--QNNLLSDVRQLQ 287 (447) Q Consensus 210 ~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~--~~~~l~~~~~~~ 287 (447) ..-......-|.|--+.+++...++++. +++.+.+.+.. .++ .++++.|++++-++.+. .+.+.+-.++.. 
T Consensus 221 ~~w~~f~E~yG~P~~igky~~~a~~~ek----~~L~~al~~i~-~~~--~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d 293 (528) T protein:vir:10 221 ADLAEMLEIYGLPIRLGKYPPGTPDEEK----VTLLRAVTGLG-HAA--AGIIPESMSIDFQEASKGSAEPFMAMMRWCD 293 (528) T ss_pred HHHHHHHHHcCCCeEEEecCCCCCHHHH----HHHHHHHHHHh-hCc--EEEecCCceeEEeecCCCChhHHHHHHHHHH Confidence 2222223445666556677765554433 33444443332 233 45567777666555432 233344567888 Q ss_pred HHHHHHhCCCHHHh-------cCCcH-HHHHHHHHHHHHhHHHHHHHHHHHhhcCChh-HhcCC------ceEEEecchh Q lcl|NC_010576. 288 QDFYNQMGITEAIL-------NGTAN-EQQTLGYYNRCVDVLLQYVTDAISRIALTKT-AVSQG------QVLVYYRNPF 352 (447) Q Consensus 288 ~~Ia~~fgVP~~~l-------~g~~~-e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~-e~~~g------~~i~f~~~~l 352 (447) ++|+++.-= ..+- +|+.. .+-......+-+.-.++.|++.||+.|+.+- .+..+ .+.+|.++.- T Consensus 294 ~~Isk~iLG-qtlTs~~~~g~~gS~Alg~vh~~v~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~~~~~ 372 (528) T protein:vir:10 294 DSMSKAILG-GTLTSQTSESGGGAYALGQVHNEVRHDLLAADARQLAATLSRDLLWPLLVLNRSGNLDARRAPRLVFDLK 372 (528) T ss_pred HHHHHHHhh-hhhhccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCC Confidence 888886511 1221 12221 1223444566777789999999998886542 12211 1234444455 Q ss_pred hhcCHHHHHHHHHHHHhCCC-cCHHHHHHHhCCCCCCCccccccc-cccccchhhcccccCCCCCCCCCCCcCCCCCCCc Q lcl|NC_010576. 353 KLVPVEQLATVADVLTRNAI-YTPNEIRELTGKAPHPNPLANELF-NRNIADGNQVGGINTPGQITSDQPATASTDPLNN 430 (447) Q Consensus 353 ~~~d~~~~~~~~~~~~~~G~-~t~NE~R~~~gl~p~~g~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 430 (447) ...|++++++.+.+++..|+ ++..++|+.+|+|.-..+ ..+. +....+..+.+.. ++. ......+...+... 
T Consensus 373 e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~gip~p~~~--e~~~~~~~~~~~~~~~~~--~~~--~~~~~~~~~~~~~~ 446 (528) T protein:vir:10 373 DRADLAAMATSLPPLVKLGVQVPVNWVQEQLGIPLPANG--EAVLGDQAGAGIAQLSRR--PGP--RIAALAQVIGPRYR 446 (528) T ss_pred CcccHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCC--cccccCCCcccccccCcc--ccc--cccccccccccccc Confidence 67899999999999999998 899999999999754332 2221 1111110000000 000 00000000000000 Q ss_pred cc---ccccCC------------------ccCcCCCCC Q lcl|NC_010576. 431 VS---TSAIEN------------------GSLTDGGSY 447 (447) Q Consensus 431 ~~---~~~~~~------------------~~~~~~~~~ 447 (447) .. +....+ ..--+++|| T Consensus 447 ~~~~~d~~~~~~~~~~~~~~~~~~l~~i~~~l~~~~s~ 484 (528) T protein:vir:10 447 DQEALDQVLASLPAQDMQNQADSLVAPLLDVISRGGSE 484 (528) T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCH Confidence 00 000000 011244555 No 116 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.43 E-value=1.2e-13 Score=91.44 Aligned_cols=422 Identities=9% Similarity=-0.008 Sum_probs=192.9 Q ss_pred CchhHhhhhhcccccCC---cccccc----cccccccccccccccccccc---CCccc-ccchh-hhhhHHHHHHHHHHH Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSN---QNQNQN----TNDFLTPSNGMTSFGGYYGR---GQSNY-SRSYS-YNKADLIKSVITRIA 68 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~---~~~~~~----~~~~~~~~~~~~~~~~~~~~---~~~~~-~~~~~-~~~~~~v~~cv~~ia 68 (447) |+++||+..++..-... ..+... .........|..+..+.... +.... ...+. +..++.+.++|+.+. 
T Consensus 1 mn~~dr~i~~~sP~~~~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 80 (502) T protein:vir:79 1 MAILDDVIGVFSPGWKAARLRSRAVIQAYEAVKTTRTHKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKLE 80 (502) T ss_pred CchHhhHHhhcChHHHHHHHhhHHHHhhccccCcccccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 99999987666531110 000000 00000011111111110000 00000 00112 345678889999776 Q ss_pred Hhhc-c--CceEEEEEcCCCceeccccchH---HHHHhhh--cCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCC---- Q lcl|NC_010576. 69 LDAS-M--VDFKHLKIDPISGNQTPMPSGL---INVLTRS--ANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVD---- 136 (447) Q Consensus 69 ~~ia-~--lp~~~~r~~~~~~~~~~~~~~l---~~lL~~~--PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~---- 136 (447) ..+- . +.+..--...+.+..+..+..+ ...+..+ .+..++.+.+...++..++..|++|+.+.+.... T Consensus 81 ~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~~~~ 160 (502) T protein:vir:79 81 ERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSLTP 160 (502) T ss_pred HhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCccCC Confidence 6664 2 2222110111111111111111 2222211 2234677888888889999999999998664321 Q ss_pred --cccceeeec-cCCCcce------------eeecCCceEEEEeeecccc---cceeeecccccccccccccc----ccc Q lcl|NC_010576. 137 --PDSGSFDIN-TARVGKI------------MQFFPRQVMVRVWNDNTGL---EQDLLVSKENCIIIESPFYA----ILN 194 (447) Q Consensus 137 --~~~~~~~~~-~~~~~~~------------~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~v~~~~~~~~~----~~~ 194 (447) +.+-.+.++ +.++... +..+...+.|.++....+. .....++..+|+|+..+... 
+.+ T Consensus 161 g~~~~l~lq~iepd~l~~~~~~~~~i~~GVe~d~~Gr~~aY~i~~~hPgd~~~~~~~rvpA~~vlH~f~~~r~gQ~RGis 240 (502) T protein:vir:79 161 SAGVHFWLEALEPDFIPMTSDESNRLNQGVFVDDWGRPEKYLVYKSRPVSGRQMETKEVDAERMLHLKFVRRLHQMRGTS 240 (502) T ss_pred CcccceEEEEecchhcCCCCCCCCeeEeeeEECCCCceEEEEEeecCCCCCcccceeEechhheEEeecccCCccccCCc Confidence 111122221 2111100 0111223334444333222 22356888999999865433 223 Q ss_pred chhHHHHHHHHHHHHHHHHHHH-hhcCcccceeeeCCcCChHH--HHHHHHHHHHHHHHHhccCCccee-ecCCCceeee Q lcl|NC_010576. 195 DTNQTLRMLEQKIKLMNSQDNR-ASSGKLNGFIQFPYSTKSTA--RAAQAARRKQEIENEMANNKYGVA-TLDTQEKFVS 270 (447) Q Consensus 195 ~~~~~~~~~~~~~~~~~~~~~~-~n~~~~~gvl~~~~~~~~~~--~~~~~~~~~~~~~~~~~~n~~~~~-vl~~g~~~~~ 270 (447) .+..++..+...-....+.... .-.+...++|+.+..-.... ....... ....=..|.++ .|..|.+++. T Consensus 241 ~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~------~~~~l~pG~i~~~L~pGe~i~~ 314 (502) T protein:vir:79 241 LLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEPDGNGSKENE------RELTIQPGIIYDDLKPGEEIGM 314 (502) T ss_pred hHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCcccccccCCCCCcc------ccccccCCccccccCCCceeee Confidence 3333333333332222222222 23455566777543211110 0000000 00011234444 5899999999 Q ss_pred cCCChh-hhhHHHHHHHHHHHHHHhCCCHHHhcCCcHH-----------------HHHHHHHHHHHhHHHHH-HHHHHHh Q lcl|NC_010576. 271 AGMGLQ-NNLLSDVRQLQQDFYNQMGITEAILNGTANE-----------------QQTLGYYNRCVDVLLQY-VTDAISR 331 (447) Q Consensus 271 l~~~~~-~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~e-----------------~~~~~f~~~ti~P~~~~-ie~~l~~ 331 (447) ++.+.. ....+..+.+.+.||..+|||.+.|.|+.+. ....-|...-++|+.+. +++++-. 
T Consensus 315 ~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~ 394 (502) T protein:vir:79 315 VKSDRPNPNLETFRNGQLRAVAAGSRLSFSSTARNYNGTYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVAS 394 (502) T ss_pred eCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc Confidence 886643 3445667888999999999999999765321 11112444556665554 5666555 Q ss_pred hcCChh---HhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccc-------ccccc Q lcl|NC_010576. 332 IALTKT---AVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELF-------NRNIA 401 (447) Q Consensus 332 kLl~~~---e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~-------~~~~~ 401 (447) ..++.- ++..-..+++-.-...-.|+..-+++...++++|+.|.-|+-+..|.+|-+- .++.. ...+. T Consensus 395 G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~Gl~t~~~~~a~~G~D~~~v--~~q~a~e~~~~~~~Gl~ 472 (502) T protein:vir:79 395 GVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGAATESDWVRAGGRNPDDV--KRRRKAEIDENRKLDLV 472 (502) T ss_pred CCCCCCCCCCchhhcceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHH--HHHHHHHHHHHHHcCCC Confidence 544311 1111224455555666679999999999999999999999999999998532 11110 00110 Q ss_pred chhhccc-ccCCCCCCCCCCCcCCCCCCCccc Q lcl|NC_010576. 402 DGNQVGG-INTPGQITSDQPATASTDPLNNVS 432 (447) Q Consensus 402 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 432 (447) ....... ........+.++++++++ ++|. T Consensus 473 ~~~~~~~~~~~~~~~~~~~e~~~~~~--~~e~ 502 (502) T protein:vir:79 473 FDTDPASDKGGSSAATKRQEPQHTDD--QSEE 502 (502) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCC--CCCC Confidence 0000000 000001111111111111 1111 No 117 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=99.43 E-value=6.4e-12 Score=81.96 Aligned_cols=405 Identities=8% Similarity=0.004 Sum_probs=176.5 Q ss_pred Cchh-Hh-hhhhcccccCCccccccccccccccccccccccccccCCcccccchhhh-hhHHHHHHHHHHHHhhccCceE Q lcl|NC_010576. 
1 MASS-DR-LLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYN-KADLIKSVITRIALDASMVDFK 77 (447) Q Consensus 1 Mg~~-~~-l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~v~~cv~~ia~~ia~lp~~ 77 (447) +|-. ++ +..++.++..- ...... . ++ ..+.--...+ +.+.|++|++.+...|.+++|. T Consensus 14 ~g~~~~~~~~~~~~~~~~~----e~~~~l-r--------~~------~~~~ly~~m~e~D~~i~s~l~~rk~av~~~~w~ 74 (469) T protein:vir:10 14 AGYVFGSGVVDGWTVWDPF----EQTPEL-Q--------WP------QSVAVYSRMDNEDSRVTSLLEAISLPIRSTPWR 74 (469) T ss_pred hhhhhhcccccchhhcccc----cccccc-c--------cc------cchHHHHHHHhhChHHHHHHHHHHHHHhcCCce Confidence 4442 11 11111111110 000000 0 00 0000011222 4678999999999999999999 Q ss_pred EEEEcCCCceeccccchHHHHHhhhcCc-------------ccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcc--c--- Q lcl|NC_010576. 78 HLKIDPISGNQTPMPSGLINVLTRSANI-------------DQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPD--S--- 139 (447) Q Consensus 78 ~~r~~~~~~~~~~~~~~l~~lL~~~PN~-------------~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~--~--- 139 (447) |....++. + ...-+...|. .+.. ..++++++..++...+.+|.+...+++...... + T Consensus 75 v~p~~~~~---e-~~~~~~~~L~-~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~dG~~~ 149 (469) T protein:vir:10 75 IRANGASD---E-VTEFVSRNLM-VPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSPDGRFW 149 (469) T ss_pred EecCCCCH---H-HHHHHHHHHH-hhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccCCCcee Confidence 75322211 1 1222334443 1211 235667777777777889999998887643211 1 Q ss_pred -ceeeeccCCCcceeeecCCceEEEEeeec----------ccccceeeecccccccccccccccccchhHHHHHHHHH-- Q lcl|NC_010576. 140 -GSFDINTARVGKIMQFFPRQVMVRVWNDN----------TGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQK-- 206 (447) Q Consensus 140 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~-- 206 (447) ..+.+.+.....-..+.++.....+.+.. ........++....++.+....++.....+.+..+.-. 
T Consensus 150 ~~~l~~rp~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lp~~k~i~~~~~~~~g~p~g~gLlr~~~~~~~ 229 (469) T protein:vir:10 150 LRKLAPRPQWTISKFNVAPDGGLESIEQIAPPARTRGSLYVANIAPPEIPVNRLVVYTRNKRPGQWQGKSILRSAYKHWL 229 (469) T ss_pred eeeeeecCcccceeeeeccCCceeeeeecCcccccccccccCCCCccccccCcEEEEEecCCCCCcccchhHHHHHHHHH Confidence 11112121110000111111111111000 00111223444443333322111111122222222211 Q ss_pred ---HHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhhH-HH Q lcl|NC_010576. 207 ---IKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNLL-SD 282 (447) Q Consensus 207 ---~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~l-~~ 282 (447) .....-......-|.|--+.+++...++++.+ .+.+...+...+ ....++++.|++++-++.+...... +. T Consensus 230 fK~~~~~~w~~f~EryG~P~~vgky~~~a~~~ek~----~l~~a~~~~~~g-~~a~~iip~~~~ie~~ea~g~~~~~~~l 304 (469) T protein:vir:10 230 LKDKLLRIEAATAERNGMGIPVGTASSATDEDEVR----KMAALARSVRGG-INAGVGLAQGQILELLGVSGNLPDIRRA 304 (469) T ss_pred HHHHHHHHHHHHHHHcCCcceEEecCCCCCHHHHH----HHHHHHHHHhcC-CceEEEccCCceEEEeecCCCchHHHHH Confidence 11111111223344443356766655544332 333333322222 2334568899988887765443333 44 Q ss_pred HHHHHHHHHHHhCCCH-HHh--cCCcH-HHHHHHHHHHHHhHHHHHHHHHHHhhcCChh-HhcCC---ceEEEecchhhh Q lcl|NC_010576. 283 VRQLQQDFYNQMGITE-AIL--NGTAN-EQQTLGYYNRCVDVLLQYVTDAISRIALTKT-AVSQG---QVLVYYRNPFKL 354 (447) Q Consensus 283 ~~~~~~~Ia~~fgVP~-~~l--~g~~~-e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~-e~~~g---~~i~f~~~~l~~ 354 (447) .++..++|+++.--.- ..= +|+.. .+.......+.+.-.++.|++.||+.|+.+- .+-.| .+.+|.++... 
T Consensus 305 i~~~d~~Isk~iLG~tlTs~~~gGS~a~~~vh~ev~~d~~~sDa~~i~~tln~~li~~l~~lN~g~~~~~P~~~~~~~e- 383 (469) T protein:vir:10 305 IEGHDRSIALSGLAHFLNLDGKGGSYALASVLEDPFTQAVHAYATSICRIANQHIIEDLVDINFGVDTPAPVLTFDPIG- 383 (469) T ss_pred HHHHHHHHHHHHhcccccccCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccEEEecCCC- Confidence 5778888887653211 110 12222 2334455667778889999999999887642 22222 12234334443 Q ss_pred cCHHHHHHHHHHHHhCCCc-----CHHHHHHHhCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCC Q lcl|NC_010576. 355 VPVEQLATVADVLTRNAIY-----TPNEIRELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLN 429 (447) Q Consensus 355 ~d~~~~~~~~~~~~~~G~~-----t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 429 (447) .+.+..++++.++++.|++ +.+.+|+.+|+|+-..+.. .... ..+. +..... ..++... .++.... T Consensus 384 ~~~~~~a~~i~~l~~~G~~~~~~~~~~~~~e~~gip~~~~~~~--~~~~-~~~~-~~~~~~---~~~~~~~--~~~~~~~ 454 (469) T protein:vir:10 384 SRQDLTAAAVKLLYDAGVFDDDPAVKRAIRQRFNLPSELNDTP--SAEP-EEPA-AVPNQS---AAPARTR--SSGNADA 454 (469) T ss_pred CcHHHHHHHHHHHHhcCCccCccccHHHHHHHhCCCCCCCCcc--cccc-hhcc-cCCCCC---ccccccC--CCCCccc Confidence 5667889999999999984 5677999999997655321 1111 0000 000000 0000000 0111111 Q ss_pred cccccccCCccCcCC Q lcl|NC_010576. 430 NVSTSAIENGSLTDG 444 (447) Q Consensus 430 ~~~~~~~~~~~~~~~ 444 (447) .+..++...+-..|- T Consensus 455 ~~~~~~~~~~~l~da 469 (469) T protein:vir:10 455 RARAPKADQGVLFDA 469 (469) T ss_pred ccccCCChHHhhccC Confidence 111122222222222 No 118 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=99.34 E-value=5.3e-11 Score=76.90 Aligned_cols=420 Identities=11% Similarity=0.031 Sum_probs=175.1 Q ss_pred Cch-hHhhhhhcccccCCcccccccccccccccccccccc--------cc-ccCCcccccchhh-----hhhHHHHHHHH Q lcl|NC_010576. 
1 MAS-SDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGG--------YY-GRGQSNYSRSYSY-----NKADLIKSVIT 65 (447) Q Consensus 1 Mg~-~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~--------~~-~~~~~~~~~~~~~-----~~~~~v~~cv~ 65 (447) |+- +|.--++...-.-++........ ....+...+..| .. .+.++.+...-.+ .+.+.|.+|++ T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~-~~~~~~~~~~~gltp~~l~~iLr~a~~gd~~~~~~L~e~m~e~D~~i~s~l~ 79 (526) T protein:vir:99 1 MAQIVDVYGNPIRTQQLREPQTSRLAG-LAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAEMS 79 (526) T ss_pred CCeeECCCCCccccccccchhhhhhhh-hhhhhcccCcCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHHHH Confidence 432 11111111100000111100000 000000000000 00 0011111110011 14678999999 Q ss_pred HHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCc--ccceee Q lcl|NC_010576. 66 RIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDP--DSGSFD 143 (447) Q Consensus 66 ~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~--~~~~~~ 143 (447) .+...|.+++|.|.-...+....+....-+..+|+..|+ ..+++..|+ +.+++|.+...+++...+. ....+. T Consensus 80 ~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~----~~~~i~~~l-da~~~G~s~~Eivw~~~~g~~~~~~l~ 154 (526) T protein:vir:99 80 KRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEG----LEDLLLDAL-DGIGHGYSCIELEWALQGREWMPLAFH 154 (526) T ss_pred HHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcccC----HHHHHHHHH-HhhhhcceeEEEEEeecCCceeEEEee Confidence 999999999998653222221111122224444433232 344444444 4667999988887655321 111122 Q ss_pred eccCCCcceeeec-CCceEEEEeeecccccceeeeccc-ccccccccccccccchhHHHHHHH-----HHHHHHHHHHHH Q lcl|NC_010576. 144 INTARVGKIMQFF-PRQVMVRVWNDNTGLEQDLLVSKE-NCIIIESPFYAILNDTNQTLRMLE-----QKIKLMNSQDNR 216 (447) Q Consensus 144 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~-~v~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~ 216 (447) +.+.+ -..+. .+...+++. .....+ +.++.. -++|...... +.......+..+. .......-.... 
T Consensus 155 ~r~~~---~f~~~~~~~~~l~~~-~~~~~g--~~l~~~k~i~~~~~~~~-g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~ 227 (526) T protein:vir:99 155 HRPQS---WFQLNPEDQNELRLR-DNSPAG--EALQPFGWIIHRPRARS-GYVARSGLFRVLAWPYLFRHYATSDLAEML 227 (526) T ss_pred eeccc---ceeeccCCCcEEEec-CCCCCc--eeecCCCeEEEeecCCc-CCccccchHHHHHHHHHHHHhhHHHHHHHH Confidence 22211 11111 112222221 211122 233333 3444432221 1111111222111 112222222222 Q ss_pred hhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCCh--hhhhHHHHHHHHHHHHHHh Q lcl|NC_010576. 217 ASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGL--QNNLLSDVRQLQQDFYNQM 294 (447) Q Consensus 217 ~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~--~~~~l~~~~~~~~~Ia~~f 294 (447) ..-|.|--+.+++...++++. +++.+.+.+. ..+ ..++++.|++++-++.+. .+.+..-.++..++|++++ T Consensus 228 E~yG~P~~igky~~~a~~~ek----~~L~~av~~i-~~d--~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i 300 (526) T protein:vir:99 228 EIYGLPIRLGKYPPGTADEEK----ATLLRAVTGL-GHA--AAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAV 300 (526) T ss_pred HHcCCceEEEecCCCCCHHHH----HHHHHHHHHH-hhC--cEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHH Confidence 345666556677655544433 3344444333 223 345667777766655432 2333444678889998874 Q ss_pred -CCCHHH-h--cCCcHH---HHHHHHHHHHHhHHHHHHHHHHHhhcCChh-HhcCC------ceEEEecchhhhcCHHHH Q lcl|NC_010576. 295 -GITEAI-L--NGTANE---QQTLGYYNRCVDVLLQYVTDAISRIALTKT-AVSQG------QVLVYYRNPFKLVPVEQL 360 (447) Q Consensus 295 -gVP~~~-l--~g~~~e---~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~-e~~~g------~~i~f~~~~l~~~d~~~~ 360 (447) |=.... . ++.++. 
+....-..+-+.-.++.|++.||+.|+.+- .+..+ .+.+|.++.....|++++ T Consensus 301 LGqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~ 380 (526) T protein:vir:99 301 LGGTLTSTTSQSGGGAFALGQVHNEVRHDLLASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSM 380 (526) T ss_pred hhhhhccccccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHH Confidence 322111 1 111221 222334456667788999999998886542 12111 122344445567799999 Q ss_pred HHHHHHHHhCCC-cCHHHHHHHhCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCcccc---ccc Q lcl|NC_010576. 361 ATVADVLTRNAI-YTPNEIRELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVST---SAI 436 (447) Q Consensus 361 ~~~~~~~~~~G~-~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~ 436 (447) ++.+.+++..|+ ++..++|+++|+|.-.++ ..+....-.+..+.. .++.... .......+...... ... T Consensus 381 a~~~~~L~~~G~~i~~~~i~e~~Gip~~~~~--e~~l~~~~~~~~~~~---~~~~~~~--~~~~~~~~~~~~~~~~d~~l 453 (526) T protein:vir:99 381 AQSIPALVNVGLEIPSAWVYDKLGIPQPAKN--EPVLRSAAQPAILSR---QHGQRVA--ALATIVGPRYGDQQALDKAL 453 (526) T ss_pred HHHHHHHHhCCCccCHHHHHHHhCCCCCCCc--ccccCCCCCCccccc---ccccccc--cccccccccCcchhhHHHHH Confidence 999999999997 899999999999754332 222111101000000 0000000 00000000000000 000 Q ss_pred C------------------CccCcCCCCC Q lcl|NC_010576. 437 E------------------NGSLTDGGSY 447 (447) Q Consensus 437 ~------------------~~~~~~~~~~ 447 (447) . 
-...-+++|| T Consensus 454 ~~~~~~~~~~~~~~~l~~i~~~l~~~~s~ 482 (526) T protein:vir:99 454 ADLPAKDMQNQANDLLAPLLEAVNRGDSE 482 (526) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCH Confidence 0 0001245555 No 119 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.34 E-value=1.1e-12 Score=86.22 Aligned_cols=421 Identities=10% Similarity=0.028 Sum_probs=198.2 Q ss_pred CchhHhhhhhcccccCCcccccc-------cccccccccccccccccccc----CCccc-ccchh-hhhhHHHHHHHHHH Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQN-------TNDFLTPSNGMTSFGGYYGR----GQSNY-SRSYS-YNKADLIKSVITRI 67 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~----~~~~~-~~~~~-~~~~~~v~~cv~~i 67 (447) |.++||+-.+.... ........ ........|...+...+... +.... ...+. +..++.+.++|+.+ T Consensus 8 ~~~~dr~i~~~~~~-~~~~~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~ 86 (505) T protein:vir:96 8 PSLAQRMVNWAWYR-YVEPQKNAARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSINNPYAKRFYQLL 86 (505) T ss_pred cchhhcccchhhhh-hHHHHHHhhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHH Confidence 88888865322110 00000000 00011111111111111000 00000 11122 34568888999977 Q ss_pred HHhhc-cCceEEEE-Ec-CCCceeccccc---hHHHHHhhhcCcc----cCHHHHHHHHHHHHHhcCCeeEEEeeccCCc Q lcl|NC_010576. 68 ALDAS-MVDFKHLK-ID-PISGNQTPMPS---GLINVLTRSANID----QTGRSFVFDLLYSLLDEGQIAMVPIDTTVDP 137 (447) Q Consensus 68 a~~ia-~lp~~~~r-~~-~~~~~~~~~~~---~l~~lL~~~PN~~----~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~ 137 (447) ...+- .-=|+..- .+ ..++..+.... .+...+..+||.. ++.+++-..++..++..|+||+...++.... 
T Consensus 87 ~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~ 166 (505) T protein:vir:96 87 KNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDGEVLVREHRGYPNK 166 (505) T ss_pred HHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCCceEEEEeecCCCC Confidence 66664 22232211 11 11111111111 2233333455543 5677788888999999999999887765544 Q ss_pred ccceeeec-cCCCccee----------------eecCCceEEEEeeecccc---------cceeeecccccccccccccc Q lcl|NC_010576. 138 DSGSFDIN-TARVGKIM----------------QFFPRQVMVRVWNDNTGL---------EQDLLVSKENCIIIESPFYA 191 (447) Q Consensus 138 ~~~~~~~~-~~~~~~~~----------------~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~v~~~~~~~~~ 191 (447) .+-.+.++ +.++.... .-+...+.|.++....+. .....++..+|+|+..+... T Consensus 167 ~~~~lqliepd~l~~~~n~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~hPgd~~~~~~~~~~~~~rvpa~~vlH~f~~~r~ 246 (505) T protein:vir:96 167 WGYALQILECDRLDLNYNADLQNGNRIRMSIELDAWERPVAYHLLVNHPGDNSYCYHYAGQTYERVPADEIIHTFVPWRP 246 (505) T ss_pred cceEEEEechhhcCCCCCcccCCcCeEEeceEECCCCceEEEEEeecCCCccccccccccccccccCHhHhhhhhcccCC Confidence 33233222 11111110 011223334443332221 12234788899999865433 Q ss_pred ----cccchhHHHHHHHHHHHHHHHHHHH-hhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCc Q lcl|NC_010576. 192 ----ILNDTNQTLRMLEQKIKLMNSQDNR-ASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQE 266 (447) Q Consensus 192 ----~~~~~~~~~~~~~~~~~~~~~~~~~-~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~ 266 (447) +.+.+..++..+........+.... .-.+...++|+.+.........+...... ..-..|.+..|..|. 
T Consensus 247 gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~------~~l~pG~i~~L~pGe 320 (505) T protein:vir:96 247 HQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEAYDQPPEDDQGEIV------EEVEAGTYQLLPYGI 320 (505) T ss_pred ccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCccCCCccccccCccc------cccCCceeeecCCCC Confidence 2233333333333333332222222 23455567787654322211111000000 011356788899999 Q ss_pred eeeecCCCh-hhhhHHHHHHHHHHHHHHhCCCHHHhcCCcH------------------HHHHHHHHHHHHhHHHH-HHH Q lcl|NC_010576. 267 KFVSAGMGL-QNNLLSDVRQLQQDFYNQMGITEAILNGTAN------------------EQQTLGYYNRCVDVLLQ-YVT 326 (447) Q Consensus 267 ~~~~l~~~~-~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~------------------e~~~~~f~~~ti~P~~~-~ie 326 (447) +++.++.+. .....+..+.+.++||..+|||.+.|.|+.+ +.....|....+.|+.+ .++ T Consensus 321 ~i~~~~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~ 400 (505) T protein:vir:96 321 RFKEHKIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEGVNFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLIS 400 (505) T ss_pred eeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 999998774 4455566888999999999999999965432 11112355566777555 466 Q ss_pred HHHHhhcCChhH--hcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcccccccc-ccccch Q lcl|NC_010576. 327 DAISRIALTKTA--VSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFN-RNIADG 403 (447) Q Consensus 327 ~~l~~kLl~~~e--~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~-~~~~~~ 403 (447) .++-...++-.. .+.-..+.+-.-...-.|+...+++...++++|+.|.-|+-+..|.+|-+- -++... ..... 
T Consensus 401 ~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~t~~~~~a~~G~D~~~v--~~q~a~e~~~~~- 477 (505) T protein:vir:96 401 MSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAHSESIKNRTRSRSSIIRAAGDDPEDV--FDEIAWEEQLMR- 477 (505) T ss_pred HHHHcCCcCCCCccchhhceeeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHH--HHHHHHHHHHHH- Confidence 666555543211 111124566556666679999999999999999999999988899998532 111110 00000 Q ss_pred hhcccccCCCCCCCCCCCcCCCCCCCcccccccCCc Q lcl|NC_010576. 404 NQVGGINTPGQITSDQPATASTDPLNNVSTSAIENG 439 (447) Q Consensus 404 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 439 (447) ..+-. ...++........++.++...++ T Consensus 478 -~~Gl~-------~~~~~~~~~~~~~~~~~~~~~d~ 505 (505) T protein:vir:96 478 -DKGVN-------PTPPEQESKDATTDEEDDSASDD 505 (505) T ss_pred -HcCCC-------CCCCCCCCCCCCCCCCCCCCCCC Confidence 00000 00000000000001100000000 No 120 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.28 E-value=6.4e-12 Score=81.94 Aligned_cols=442 Identities=11% Similarity=0.003 Sum_probs=193.4 Q ss_pred CchhHhhhhhcccccCCcc---cccc----cccccccccccccccccccc---CCccc-ccchh-hhhhHHHHHHHHHHH Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQN---QNQN----TNDFLTPSNGMTSFGGYYGR---GQSNY-SRSYS-YNKADLIKSVITRIA 68 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~---~~~~----~~~~~~~~~~~~~~~~~~~~---~~~~~-~~~~~-~~~~~~v~~cv~~ia 68 (447) |+|+||+..++..-..... +... .........|+.+..+.... +.... ...+. +..++.+..+|+.+. 
T Consensus 1 Mn~iDr~i~~~sP~~a~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~ 80 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARRLAAREAIQAYEAARPGRTHKAKRQPLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRLE 80 (548) T ss_pred CchHHhHhhhcchHHHHHHHHhHHHhccccccCccccccccCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 9999998877653211100 0000 00000011111111110000 00000 01111 335678889999876 Q ss_pred Hhhc-c--CceEEEEEcCCCceecccc---chHHHHHhhhc--CcccCHHHHHHHHHHHHHhcCCeeEEEeeccCC---- Q lcl|NC_010576. 69 LDAS-M--VDFKHLKIDPISGNQTPMP---SGLINVLTRSA--NIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVD---- 136 (447) Q Consensus 69 ~~ia-~--lp~~~~r~~~~~~~~~~~~---~~l~~lL~~~P--N~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~---- 136 (447) +.|- . +-+.---...++...+... ..+...+..++ ...++.+++...++..++..|++|+.+.+.... T Consensus 81 ~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~~~ 160 (548) T protein:vir:95 81 ERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPNYTF 160 (548) T ss_pred HhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeecccccccC Confidence 6554 2 2222111111111100011 11222332222 234678888888999999999999988764321 Q ss_pred --cccceeee-ccCCCc--------cee---e--ecCCceEEEEeeecccc-------cceeeecccccccccccccc-- Q lcl|NC_010576. 137 --PDSGSFDI-NTARVG--------KIM---Q--FFPRQVMVRVWNDNTGL-------EQDLLVSKENCIIIESPFYA-- 191 (447) Q Consensus 137 --~~~~~~~~-~~~~~~--------~~~---~--~~~~~~~~~~~~~~~~~-------~~~~~~~~~~v~~~~~~~~~-- 191 (447) ..+-.+.. .+.++. .+. . -+...+.|.++....+. .....++..+|+|+..+... 
T Consensus 161 g~~~~~~lqliepd~l~~~~~~~~~~i~~GIE~D~~Grp~aY~i~~~hPgd~~~~~~~~~~~rvpA~~VlHif~~~r~gQ 240 (548) T protein:vir:95 161 ATSVPFALELLEPDYLPFSYNNLSKGIVQGIERDTWRRKRAYHLLKDHPGNLQTLGGSLAVKRVEAERIIHIAYRKRIGQ 240 (548) T ss_pred CcccceEEEEechhhcCCCCCCCCCceeeeeEECCCCceEEEEEeecCCCcccccccccceeeechhHheecccccCCcc Confidence 11112221 111111 110 0 11223334444433321 22345889999999876533 Q ss_pred --cccchhHHHHHHHHHHHHHHHHHHH-hhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCccee-ecCCCce Q lcl|NC_010576. 192 --ILNDTNQTLRMLEQKIKLMNSQDNR-ASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVA-TLDTQEK 267 (447) Q Consensus 192 --~~~~~~~~~~~~~~~~~~~~~~~~~-~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~-vl~~g~~ 267 (447) +.+.+.+++..+...-....+.... .-.+...++|+.+..-....... ...-.....-..|.++ .|..|.+ T Consensus 241 ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~~~~~~~~-----~~~~~~~~~~~pG~iv~~L~pGe~ 315 (548) T protein:vir:95 241 NRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPDSYTVEPG-----KDRKNRTIPIAPGMVFDDLEPGED 315 (548) T ss_pred ccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccCCCC-----cccccccccccCCccccccCCCce Confidence 2233333333333332222222222 23455566777543221110000 0000000011234433 5899999 Q ss_pred eeecCCCh-hhhhHHHHHHHHHHHHHHhCCCHHHhcCCcH----H-------------HHHHHHHHHHHhHHHH-HHHHH Q lcl|NC_010576. 268 FVSAGMGL-QNNLLSDVRQLQQDFYNQMGITEAILNGTAN----E-------------QQTLGYYNRCVDVLLQ-YVTDA 328 (447) Q Consensus 268 ~~~l~~~~-~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~----e-------------~~~~~f~~~ti~P~~~-~ie~~ 328 (447) ++.++.+. .....+..+.+.+.||..+|||.+.|.|+.+ . 
+....|...-+.|+.+ .++.+ T Consensus 316 i~~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~s~nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a 395 (548) T protein:vir:95 316 VGMIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAYDGTYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMY 395 (548) T ss_pred eeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99888663 4445566788999999999999999976532 1 1111244555666444 35555 Q ss_pred HHhhcCChh---HhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc-----cc---ccc-c Q lcl|NC_010576. 329 ISRIALTKT---AVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNP-----LA---NEL-F 396 (447) Q Consensus 329 l~~kLl~~~---e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~-----~~---~~~-~ 396 (447) +-.-.++-- ++.....+++---...-.|+..-+++...++++|+.|.-|+-++.|.+|-+-- +. +++ + T Consensus 396 ~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl~T~~~~~a~~G~D~~ev~~q~a~E~~~~~~~GL 475 (548) T protein:vir:95 396 LLARKERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAGFADEAEVARARGRDPRELKKSRETEIKANRAAGL 475 (548) T ss_pred HHcCCcCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCC Confidence 544443210 11112334554455556699999999999999999999999888898885421 00 000 0 Q ss_pred cccccchhhccc-ccCCCCCCCCCCCcCCCCCCCcccccccCCcc-------------CcCCCCC Q lcl|NC_010576. 397 NRNIADGNQVGG-INTPGQITSDQPATASTDPLNNVSTSAIENGS-------------LTDGGSY 447 (447) Q Consensus 397 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~ 447 (447) +....+..+..+ ..++..+.+....+.+......|.+.++.... 
+..||.- T Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 540 (548) T protein:vir:95 476 VFSSDAYHQLVKSGMDPVEAVQKVYLGVGKMLTADEARELVNRYGAGLPVPGPDFPNESNNGGAD 540 (548) T ss_pred CCCCcccccccccccCCCCchhhhccccccccccchhHHhhccCCCCCcCCCCCCCcccccCCCC Confidence 000000011100 01111111101111000011111111111000 0011111 No 121 >protein:vir:4073 Length: 279 # NCBI annotation: minor structural protein # Family: family:all:11744 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043552;genbank:gi:9628686;genbank:GeneID:1261159 Probab=99.24 E-value=1.4e-13 Score=91.05 Aligned_cols=266 Identities=15% Similarity=0.149 Sum_probs=143.8 Q ss_pred HHHHHHHHHHHHhhccCceEEEEEcCCCceeccccchHHHHH---hhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeecc Q lcl|NC_010576. 58 DLIKSVITRIALDASMVDFKHLKIDPISGNQTPMPSGLINVL---TRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTT 134 (447) Q Consensus 58 ~~v~~cv~~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL---~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~ 134 (447) -.+| .+++.|++++-..|.|-. ...+--+-+|| ..--|...+-..=++.+.+..+-.-.+|-+ |-+ T Consensus 1 ~~~~-~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~ 69 (279) T protein:vir:40 1 MSLF-NLSRRAEDVSFSTFTVQD--------PTTDLLLGKLLGLVSYFDNVDYSEASKLEDLFYWALQGKEVYRV--WYG 69 (279) T ss_pred Cccc-ccchhhcccceeeeeecC--------cchhHHHHHHHHHHHHhhcccchhhhhhhhhhhhhhccceeehh--hhh Confidence 0011 123445555543333210 00111111222 222233333333333333322221122210 000 Q ss_pred CCcccceeeeccCCCcceeeecCCceEEEEee--ecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHH Q lcl|NC_010576. 135 VDPDSGSFDINTARVGKIMQFFPRQVMVRVWN--DNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNS 212 (447) Q Consensus 135 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (447) . . .+|..++...-++ ..++...++.+|.+|++.+.+|+++....-.. .++......... 
T Consensus 70 ~---------~--------~~~~~~~~~d~fn~~vr~~~~~~vtVP~~Dv~IieNPlv~v~~ee~~--kM~~la~nai~~ 130 (279) T protein:vir:40 70 G---------F--------KYYAQRVNADQFNIVVREPNRREVTIRTNDYEMLLNPFYGANPQRFG--VMFGMASNGIGR 130 (279) T ss_pred h---------H--------HHHHhhcCcchhhhheecCCcceeEeecchhhhhhcchheeccchhh--HHHHHHHhhhhh Confidence 0 0 0011111110000 23456678899999999999999765433211 222222222221 Q ss_pred HHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhcc--CCcceeecCCCceeeecCCChhhhhHHHHHHHHHHH Q lcl|NC_010576. 213 QDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMAN--NKYGVATLDTQEKFVSAGMGLQNNLLSDVRQLQQDF 290 (447) Q Consensus 213 ~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~--n~~~~~vl~~g~~~~~l~~~~~~~~l~~~~~~~~~I 290 (447) +.-+.+.++++++++..... ++++++.+..+.++... +=+|+.+++.|.++++|..+......++.++++++. T Consensus 131 --KLD~~~qIk~fIKTd~d~gl---ee~kekaR~rIk~mlalAk~~nGityid~~ddItQL~kDYStslk~die~lkS~l 205 (279) T protein:vir:40 131 --RLDSQAQIKIYWKTKVSSGL---KEVWDRIRERLTQQQQLAREFNGVSVIGSDDDIKQIQPDYSGSLQNDANLAIEIA 205 (279) T ss_pred --hhcccceeeeEEecCcchhH---HHHHHHHHHHHHHHHHHHHhcCCeeeecCCceeEeeccccccccHHHHHHHHHHH Confidence 22467788899998765443 44455555555555443 336899999999999999999999999999999999 Q ss_pred HHHhCCCHHHhcCCcHHHHHHHHHHHHHhHHHHHHHHHH------HhhcCChhHhcCCceEEEecchhhhcCHHHHHHHH Q lcl|NC_010576. 291 YNQMGITEAILNGTANEQQTLGYYNRCVDVLLQYVTDAI------SRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVA 364 (447) Q Consensus 291 a~~fgVP~~~l~g~~~e~~~~~f~~~ti~P~~~~ie~~l------~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~ 364 (447) +..+|||..+|.|+++|.+..+||..+|.|+++++|-.| +.+.++. T Consensus 206 ~Sq~GinekIL~GsAtE~q~iAyy~rtVePILkQyek~liY~~E~fv~y~tt---------------------------- 257 (279) T protein:vir:40 206 LSEYGMPRELLYGQSNEVTIIAFAIQKVLPLLKQHDKNIIFNQENFVAYIST---------------------------- 257 (279) T ss_pred HhhcCCchhhccccCchhhhhhHHHhhHHHHHHHhcccccchhhhhhhhhee---------------------------- Confidence 999999999999999999999999999999999987644 2222222 Q ss_pred HHHHhCCCcCHHHHHHHhCCCCCCCc Q lcl|NC_010576. 
365 DVLTRNAIYTPNEIRELTGKAPHPNP 390 (447) Q Consensus 365 ~~~~~~G~~t~NE~R~~~gl~p~~g~ 390 (447) -.++|.+.- .-...+-+|+... T Consensus 258 --ta~gg~~~s--~~~~~~~~~~~~~ 279 (279) T protein:vir:40 258 --TAKGGAIES--KSSKRDSEPVGND 279 (279) T ss_pred --cccCccccc--ccccccCCCCCCC Confidence 112232211 1122334444221 No 122 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=99.22 E-value=4.8e-10 Score=71.66 Aligned_cols=418 Identities=12% Similarity=0.058 Sum_probs=175.6 Q ss_pred Cc-hhHhhhhhcccccCCcccccccccc--ccc------cccccccc--ccc-ccCCcccccc-hhh---h-hhHHHHHH Q lcl|NC_010576. 1 MA-SSDRLLHSWNAFQSNQNQNQNTNDF--LTP------SNGMTSFG--GYY-GRGQSNYSRS-YSY---N-KADLIKSV 63 (447) Q Consensus 1 Mg-~~~~l~~~~~~f~~~~~~~~~~~~~--~~~------~~~~~~~~--~~~-~~~~~~~~~~-~~~---~-~~~~v~~c 63 (447) |+ |+|.--+++ +...-.+...... ... ..+.++.. .-. .+..+.+... .-| + +.+.|.+| T Consensus 1 ~~~~~d~~g~p~---~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~edm~e~D~~i~s~ 77 (526) T protein:vir:79 1 MAQIVDVYGNPI---RPQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAE 77 (526) T ss_pred CCeeeCCCCCcc---CccccchhhhhhhhhhhhhcccCCCCCcCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHHH Confidence 43 222222222 1111000000000 000 00111100 000 0011111110 011 1 46789999 Q ss_pred HHHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcc--cce Q lcl|NC_010576. 64 ITRIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPD--SGS 141 (447) Q Consensus 64 v~~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~--~~~ 141 (447) ++.+-..|.+++|.|.-...+....+....-+..+|+..|+ ..+++..|+. .+.+|.+...+++...+.. ... 
T Consensus 78 l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~----~~~~i~~~ld-A~~~G~s~~Ei~w~~~~g~~~~~~ 152 (526) T protein:vir:79 78 MSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEG----LEDLLLDALD-GIGHGYSCIELEWALQGREWMPLA 152 (526) T ss_pred HHHHHHHHhCCCceEecCCCCChHHHHHHHHHHHHHhcccC----HHHHHHHHHh-hhhhcceeEEEEEeecCCceeEEE Confidence 99999999999998753222221111112224444433232 3344444443 5578999888876553211 111 Q ss_pred eeeccCCCcceeeec-CCceEEEEeeecccccceeeecccc-cccccccccccccchhHHHHHHH-----HHHHHHHHHH Q lcl|NC_010576. 142 FDINTARVGKIMQFF-PRQVMVRVWNDNTGLEQDLLVSKEN-CIIIESPFYAILNDTNQTLRMLE-----QKIKLMNSQD 214 (447) Q Consensus 142 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~-v~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~ 214 (447) +.+.+.+ -..+. .+...+++. .....+ +.++... ++|..... ++.......+..+. .......-.. T Consensus 153 l~~r~~~---~F~~~~~~~~~l~~~-~~~~~g--~~l~~~k~iv~~~~~~-~g~p~g~gLlr~~~w~~~fK~~~~~~w~~ 225 (526) T protein:vir:79 153 FHHRPQS---WFQLNPEDQNELRLR-DNSPAG--EALQPFGWIIHRPRAR-SGYVARSGLFRVLAWPYLFRHYATSDLAE 225 (526) T ss_pred eeeeccc---ceEeccCCCcEEEec-CCCCCc--eeecCCceEEEeecCC-cCCccccchHHHHHHHHHHHHhhHHHHHH Confidence 1111111 11111 112222221 111112 2333343 34432221 11111111222111 1122222222 Q ss_pred HHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCCh--hhhhHHHHHHHHHHHHH Q lcl|NC_010576. 215 NRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGL--QNNLLSDVRQLQQDFYN 292 (447) Q Consensus 215 ~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~--~~~~l~~~~~~~~~Ia~ 292 (447) ....-|.|--+.+++...++++. +++.+.+.+. ..+ ..++++.|++++-++.+. 
.+.+..-.++..++|++ T Consensus 226 F~E~yG~P~~igky~~~a~~~ek----~~L~~av~~i-~~d--a~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk 298 (526) T protein:vir:79 226 MLEIYGLPIRLGKYPPGTADEEK----ATLLRAVTGL-GHA--AAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISK 298 (526) T ss_pred HHHHcCCceEEEecCCCCCHHHH----HHHHHHHHHH-hcC--cEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHH Confidence 22344565556677655544433 3344444333 223 355677777776665432 23333446788889988 Q ss_pred Hh-CCCHHHh---c--CCcH-HHHHHHHHHHHHhHHHHHHHHHHHhhcCChhH-hcCC------ceEEEecchhhhcCHH Q lcl|NC_010576. 293 QM-GITEAIL---N--GTAN-EQQTLGYYNRCVDVLLQYVTDAISRIALTKTA-VSQG------QVLVYYRNPFKLVPVE 358 (447) Q Consensus 293 ~f-gVP~~~l---~--g~~~-e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e-~~~g------~~i~f~~~~l~~~d~~ 358 (447) +. |=....- + |+.. .+....-..+-+.-.++.|++.||+.|+.+-- +..+ .+-+|.++.-...|++ T Consensus 299 ~iLGqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~ 378 (526) T protein:vir:79 299 AVLGGTLTSTTSQSGGGAFALGQVHNEVRHDILASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADIT 378 (526) T ss_pred HHhhhhhccccccCcchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCCcccHH Confidence 73 3222211 1 1211 12233445666777899999999998865421 2111 1223444455677999 Q ss_pred HHHHHHHHHHhCCC-cCHHHHHHHhCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCccccc--- Q lcl|NC_010576. 359 QLATVADVLTRNAI-YTPNEIRELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTS--- 434 (447) Q Consensus 359 ~~~~~~~~~~~~G~-~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 434 (447) ++++.+.+++..|+ ++..++|+.+|+|.-..+ ..+......+..... .++... ........+...+.+. 
T Consensus 379 ~~a~~~~~L~~~G~~i~~~~i~e~~gip~~~~~--e~~l~~~~~~~~~~~---~~~~~~--~~~~~~~~~~~~~~~~~d~ 451 (526) T protein:vir:79 379 SMAQSIPALVNVGLEIPSAWVYDKLGIPQPAKN--EPVLRPAAQPAILSR---QHGQRV--AALATIVGPRYGDQQALDK 451 (526) T ss_pred HHHHHHHHHHhCCCcCCHHHHHHHhCCCCCCCc--hhhccccCCcccccc---cccccc--ccccccccccCchhhHHHH Confidence 99999999999997 799999999999753322 222211111100000 000000 0000011111111100 Q ss_pred ccCC------------------ccCcCCCCC Q lcl|NC_010576. 435 AIEN------------------GSLTDGGSY 447 (447) Q Consensus 435 ~~~~------------------~~~~~~~~~ 447 (447) +... ...-+++|| T Consensus 452 ~l~~~~~~~~~~~~~~~~~~i~~~~~~~~s~ 482 (526) T protein:vir:79 452 ALADLPAKDMQNQANDLLAPLLDAVNRGDSE 482 (526) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCH Confidence 0000 111245666 No 123 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.22 E-value=5.4e-11 Score=76.90 Aligned_cols=422 Identities=10% Similarity=-0.032 Sum_probs=186.1 Q ss_pred CchhHhhhhhcccccCCcccc----------cccccccccccccccccccc---ccCCccc-ccchh-hhhhHHHHHHHH Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQN----------QNTNDFLTPSNGMTSFGGYY---GRGQSNY-SRSYS-YNKADLIKSVIT 65 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~----------~~~~~~~~~~~~~~~~~~~~---~~~~~~~-~~~~~-~~~~~~v~~cv~ 65 (447) |.. -.++....... ...... ....|.....+.- ..+.... ...+. +..++.+.+||+ T Consensus 1 ~~~-------~~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~ 72 (530) T protein:vir:38 1 MKI-------PSLVGPDGKTSLREYAGYHGGGGGFGG-QLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQ 72 (530) T ss_pred Ccc-------ceeecCccccchHHHhhhhcccCCCCC-cccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHH Confidence 221 11111110000 000000 0011111000000 0011111 11122 345688899999 Q ss_pred HHHHhhccCceEEEEEcCC-------Cceeccccch---HHHHHhhhcCc------ccCHHHHHHHHHHHHHhcCCeeEE Q lcl|NC_010576. 
66 RIALDASMVDFKHLKIDPI-------SGNQTPMPSG---LINVLTRSANI------DQTGRSFVFDLLYSLLDEGQIAMV 129 (447) Q Consensus 66 ~ia~~ia~lp~~~~r~~~~-------~~~~~~~~~~---l~~lL~~~PN~------~~t~~~f~~~~~~~lll~Gna~i~ 129 (447) .+...|-.--|.+. -+.+ +...+..... +...+...||. .+|.+++-+.++..++..|++|+. T Consensus 73 ~~~~nvVG~Gi~~~-~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~ 151 (530) T protein:vir:38 73 LHQDHIVGSFFRLS-YRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQ 151 (530) T ss_pred HHHHHhhCCCceee-eccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEE Confidence 88877754445432 1111 1111111112 23333345543 467888899999999999999999 Q ss_pred EeeccCC--cccceeee-ccCCCccee------------e--ecCCceEEEEeeecc-----ccc----ceeeecccccc Q lcl|NC_010576. 130 PIDTTVD--PDSGSFDI-NTARVGKIM------------Q--FFPRQVMVRVWNDNT-----GLE----QDLLVSKENCI 183 (447) Q Consensus 130 ~~~~~~~--~~~~~~~~-~~~~~~~~~------------~--~~~~~~~~~~~~~~~-----~~~----~~~~~~~~~v~ 183 (447) +.+.... +.+-.+.+ .+.++.... . -....+-|.++.... ..+ ....++..+|+ T Consensus 152 ~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~vl 231 (530) T protein:vir:38 152 ATWDSDSTRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGVKINDSGAALGYYVSDDGYPGWMAQNWTYIPRELPGGRPSFI 231 (530) T ss_pred eeeccCCCCccceEEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEEeeccCCCccccccceeeeeeccChhHeE Confidence 8765432 12212221 111111100 0 111223333433211 111 12345667999 Q ss_pred cccccccc----cccchhHHHHHHHHHHHHHHHHHHH-hhcCcccceeeeCCcCChHH-------HHHHHHHHH------ Q lcl|NC_010576. 184 IIESPFYA----ILNDTNQTLRMLEQKIKLMNSQDNR-ASSGKLNGFIQFPYSTKSTA-------RAAQAARRK------ 245 (447) Q Consensus 184 ~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~-~n~~~~~gvl~~~~~~~~~~-------~~~~~~~~~------ 245 (447) |+..+... +.+.+..++..+...-+...+.... .-.+...++|+.+....... .......+. 
T Consensus 232 H~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (530) T protein:vir:38 232 HVFEPMEDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGADNKEQQSKLTGWLGEM 311 (530) T ss_pred eeccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCccccccccccCCcccccccccccchhh Confidence 99876533 2233333333333322222222221 23444556666433211000 000000000 Q ss_pred HHHHH-H-hccCCcceeecCCCceeeecCCC-hhhhhHHHHHHHHHHHHHHhCCCHHHhcCCcHH--------------- Q lcl|NC_010576. 246 QEIEN-E-MANNKYGVATLDTQEKFVSAGMG-LQNNLLSDVRQLQQDFYNQMGITEAILNGTANE--------------- 307 (447) Q Consensus 246 ~~~~~-~-~~~n~~~~~vl~~g~~~~~l~~~-~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~e--------------- 307 (447) ..+.+ . ..=..|.|..|..|.+++.++.+ +.....+..+.+.+.||..+|||.++|.|+.+. T Consensus 312 ~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r 391 (530) T protein:vir:38 312 AAYYSAAPVRLGGARVPHLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRNYSQMSYSTARASANESWA 391 (530) T ss_pred hhcccccceeccCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHH Confidence 00000 0 01135667889999999998876 444556678899999999999999999664321 Q ss_pred ---HHHHHHHHHHHhHHHH-HHHHHHHhhcCChh---------HhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcC Q lcl|NC_010576. 308 ---QQTLGYYNRCVDVLLQ-YVTDAISRIALTKT---------AVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYT 374 (447) Q Consensus 308 ---~~~~~f~~~ti~P~~~-~ie~~l~~kLl~~~---------e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t 374 (447) +....|...-+.|+.. .+++++....++-. 
.+..-..+++-.-...-.|+...+++...++++|+.| T Consensus 392 ~~~~~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s 471 (530) T protein:vir:38 392 YFMGRRKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIEAGLST 471 (530) T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccccChHHHHHHHHHHHHcCCCC Confidence 1111233334455444 46666666554311 0001113455555556669999999999999999999 Q ss_pred HHHHHHHhCCCCCCCcccccccc-ccccchhhcccccCCCCCCCCCCCcCC-CCCCCcccccccCCccCcCCCC Q lcl|NC_010576. 375 PNEIRELTGKAPHPNPLANELFN-RNIADGNQVGGINTPGQITSDQPATAS-TDPLNNVSTSAIENGSLTDGGS 446 (447) Q Consensus 375 ~NE~R~~~gl~p~~g~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 446 (447) .-|+-+..|.+|-+- .++... ........+....... .++..+ .++..++.+++ +|+ T Consensus 472 ~~~~~a~~G~D~~~v--~~q~a~e~~~~~~~Gl~~~~~~~-----~~~~~~~~~~~~~~~d~~--------~~a 530 (530) T protein:vir:38 472 YEKECAKRGDDYQEI--FAQQVRESMERRAAGLNPPAWAA-----AAFEAGVKKSNEEEQDGA--------RAA 530 (530) T ss_pred HHHHHHHcCCCHHHH--HHHHHHHHHHHHHcCCCCCCCcc-----cccCCCCCCCCCCCCCCC--------CCC Confidence 999999999998532 111110 0000000000000000 000000 00111111111 111 No 124 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=99.14 E-value=4.3e-10 Score=71.94 Aligned_cols=422 Identities=10% Similarity=0.005 Sum_probs=170.0 Q ss_pred Cc-hhHhhhhhcccccCCcccccccccccccccccccccccc--------c-cCCcccccchh-----hhhhHHHHHHHH Q lcl|NC_010576. 1 MA-SSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYY--------G-RGQSNYSRSYS-----YNKADLIKSVIT 65 (447) Q Consensus 1 Mg-~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~-~~~~~~~~~~~-----~~~~~~v~~cv~ 65 (447) |+ |+|..-++...-.-.+.+..... .....+-..+..|-. . +.++.+..... 
.++.+.|.+|++ T Consensus 1 m~~~~d~~g~p~~~~~~~~~~~~~~~-~~~~~~~~~~~~gltp~~l~~iL~~a~~gd~~~~~~L~~dm~~~D~hi~s~l~ 79 (512) T protein:vir:19 1 MGRILDISGQPFDFDDEMQSRSDELA-MVMKRTQEHPSSGVTPNRAAQMLRDAERGDLTAQADLAFDMEEKDTHLFSELS 79 (512) T ss_pred CcceeCCCCCccccccccccccchhc-ccchhhccccccCCCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHHHHH Confidence 33 22221222211111111000000 000010011111110 0 01111111111 125678999999 Q ss_pred HHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCC--cccceee Q lcl|NC_010576. 66 RIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVD--PDSGSFD 143 (447) Q Consensus 66 ~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~--~~~~~~~ 143 (447) .+-..|.+++|.|.-..+.....+..-.-+...|...|+ ..+++..|+ +.+++|.+...+++...+ .....+. T Consensus 80 ~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~----f~~~~~~ll-dA~~~G~s~~Ei~w~~~~g~~~~~~~~ 154 (512) T protein:vir:19 80 KRRLAIQALEWRIAPARDASAQEKKDADMLNEYLHDAAW----FEDALFDAG-DAILKGYSMQEIEWGWLGKMRVPVALH 154 (512) T ss_pred HHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcCCC----HHHHHHHHH-hhhhhcceeeeeEeeeeCCceeeeeee Confidence 999999999998753222111111111223444543332 334444444 456789998887764322 1111222 Q ss_pred eccCCCcceeeecC-CceEEEEeeecccccceeeecccc-cccccccccccccchhHHHHHH-----HHHHHHHHHHHHH Q lcl|NC_010576. 144 INTARVGKIMQFFP-RQVMVRVWNDNTGLEQDLLVSKEN-CIIIESPFYAILNDTNQTLRML-----EQKIKLMNSQDNR 216 (447) Q Consensus 144 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~-v~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~ 216 (447) +.+.+ ...+.+ +...++.. .....+ +.+++.. ++|...+.. +.......+..+ ........-.... 
T Consensus 155 ~r~~~---~f~~~~~~~~~lr~~-~~~~~G--~~l~~~k~i~~~~~~~~-g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~ 227 (512) T protein:vir:19 155 HRDPA---LFCANPDNLNELRLR-DASYHG--LELQPFGWFMHRAKSRT-GYVGTNGLVRTLIWPFIFKNYSVRDFAEFL 227 (512) T ss_pred eeccc---cceeccCCCcEEEec-CCCCCc--eeecCCceEEEeccCCC-CCcccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 22221 111111 12222222 111122 2233333 333322221 111111122211 1112222122222 Q ss_pred hhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCCh--hhhhHHHHHHHHHHHHHHh Q lcl|NC_010576. 217 ASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGL--QNNLLSDVRQLQQDFYNQM 294 (447) Q Consensus 217 ~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~--~~~~l~~~~~~~~~Ia~~f 294 (447) ..-|.|--+.+++...++++. +++.+.+.+.. .+ ..++++.|++++-+.... .+.+..-.++..++|+++. T Consensus 228 E~yG~P~~igky~~~a~~~ek----~~L~~al~~~~-~~--a~~iiP~~~~ie~~ea~~~~~~~y~~li~~~d~~Isk~i 300 (512) T protein:vir:19 228 EIYGLPMRVGKYPTGSTNREK----ATLMQAVMDIG-RR--AGGIIPMGMTLDFQSAADGQSDPFMAMIGWAEKAISKAI 300 (512) T ss_pred HHcCCCeeEEecCCCCCHHHH----HHHHHHHHHHh-hC--cEEEecCCceEEEeecCCCCHHHHHHHHHHHHHHHHHHH Confidence 344555445666655544433 33444444332 23 345677777776555432 2333444678889998772 Q ss_pred -CCCHHHh-c--CCcH-HHHHHHHHHHHHhHHHHHHHHHHHhhcCChh-HhcCC--------ceEEEecchhhhcCHHHH Q lcl|NC_010576. 295 -GITEAIL-N--GTAN-EQQTLGYYNRCVDVLLQYVTDAISRIALTKT-AVSQG--------QVLVYYRNPFKLVPVEQL 360 (447) Q Consensus 295 -gVP~~~l-~--g~~~-e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~-e~~~g--------~~i~f~~~~l~~~d~~~~ 360 (447) |=-...- + |+.. .+.......+-+.-.++.|++.||+.|+.+- .+..+ .+++|+ .....|.+.. 
T Consensus 301 LGqtlTs~~g~~Gs~a~~~vh~ev~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~f~--~~e~eDl~~~ 378 (512) T protein:vir:19 301 LGGTLTTEAGDKGARSLGEVHDEVRREIRNADVGQLARSINRDLIYPLLALNSDSTIDINRLPGIVFD--TSEAGDITAL 378 (512) T ss_pred hhhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEec--CCChhhHHHH Confidence 2211111 1 1111 2223445667778899999999999887642 12221 134554 4456788999 Q ss_pred HHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCC-CCcccc--cccC Q lcl|NC_010576. 361 ATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDP-LNNVST--SAIE 437 (447) Q Consensus 361 ~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~--~~~~ 437 (447) ++.+.++..+--++..++|+.+|+|.-..+ .......-... +.+.........+..+....-+. ...... .+.. T Consensus 379 a~~~~~l~~G~~i~~~~i~e~~Gip~~~~~--e~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 455 (512) T protein:vir:19 379 SDAIPKLAAGMRIPVSWIQEKLHIPQPVGD--EAVFTIQPVVP-DNGSQKEAALSAEDIPQEDDIDRMGVSPEDWQRSVD 455 (512) T ss_pred HHHHHHHhcCCCCCHHHHHHHhCCCCCCCc--cccccCCCccc-cccccccccccccCCCchhhHhHHhhhHHHHHHHHH Confidence 999988874446799999999999743332 11211100000 00000000000000000000000 000000 0000 Q ss_pred CccC-----cCCCCC Q lcl|NC_010576. 438 NGSL-----TDGGSY 447 (447) Q Consensus 438 ~~~~-----~~~~~~ 447 (447) .-.+ ..-+|| T Consensus 456 ~~~~~i~~~~~~~s~ 470 (512) T protein:vir:19 456 PLLKPVIFSVLKDGP 470 (512) T ss_pred HHHHHHHHHHHhCCH Confidence 0000 001244 No 125 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=99.14 E-value=4.9e-10 Score=71.64 Aligned_cols=431 Identities=13% Similarity=0.099 Sum_probs=169.3 Q ss_pred CchhHhhhhhccccc-----------CCcccccccccc-------cccccccc---ccccccccCCcccccchhhhhhHH Q lcl|NC_010576. 
1 MASSDRLLHSWNAFQ-----------SNQNQNQNTNDF-------LTPSNGMT---SFGGYYGRGQSNYSRSYSYNKADL 59 (447) Q Consensus 1 Mg~~~~l~~~~~~f~-----------~~~~~~~~~~~~-------~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~ 59 (447) ||--.-| +.++.-- +......+.+.+ +...+... .+.-....+-..|.....+.+.+- T Consensus 46 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~e 124 (698) T protein:vir:10 46 MGRRGAL-NALDAAPVAEPSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPE 124 (698) T ss_pred hcccccc-cccccccccCCCccccccccceeccccCCccccchhhhhhcccccccccchhhhccCcchHHHHHHHhhccc Confidence 6542211 1111100 000000000111 11111000 000011112222333334567788 Q ss_pred HHHHHHHHHHhhccCceEEEEEc----------CCCcee-ccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeE Q lcl|NC_010576. 60 IKSVITRIALDASMVDFKHLKID----------PISGNQ-TPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAM 128 (447) Q Consensus 60 v~~cv~~ia~~ia~lp~~~~r~~----------~~~~~~-~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i 128 (447) +++|+..|++.+..- |...... ..++.. ...+..-...|..+=..+.-+..|.+.+.+.. ++|-+.+ T Consensus 125 yr~~~~~ia~e~~R~-w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eai~~aR-lfGGa~~ 202 (698) T protein:vir:10 125 YRAMHEVLADECIRT-WGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQ-AFGRAHP 202 (698) T ss_pred hhhHHHHHHHHhhcc-cceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcc-cccceEE Confidence 999999999998764 5322100 000000 11111223344433222333333444444444 4555544 Q ss_pred EEe-eccCC-ccc----c-----------eeeeccCCCcce-----eeecCCceEEEEeeecccccceeeeccccccccc Q lcl|NC_010576. 129 VPI-DTTVD-PDS----G-----------SFDINTARVGKI-----MQFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIE 186 (447) Q Consensus 129 ~~~-~~~~~-~~~----~-----------~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~ 186 (447) ++. ..+.. ... . ..++.++.+... .+..++-....+|... + .. +| .+-++.+. 
T Consensus 203 ~i~I~gdd~~l~~PL~~~~~~I~kGslKGL~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~-G--~~-IH-~SRL~~~v 277 (698) T protein:vir:10 203 YFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMI-G--SE-VH-ATRLHTIV 277 (698) T ss_pred EEEeecCccccccccccccccccCccceeeeeecccccccchhhhccchhhccCCCceEEEe-c--ce-ec-ceeEEEec Confidence 432 22110 000 0 111111111100 0000011111112111 0 11 11 11111111 Q ss_pred -c-------ccc--ccccchhHHHHHHHHHHHHHHHHHHHhhcCccccee-eeCCcCChHHHHHHHHHHHHHHHHHhccC Q lcl|NC_010576. 187 -S-------PFY--AILNDTNQTLRMLEQKIKLMNSQDNRASSGKLNGFI-QFPYSTKSTARAAQAARRKQEIENEMANN 255 (447) Q Consensus 187 -~-------~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl-~~~~~~~~~~~~~~~~~~~~~~~~~~~~n 255 (447) . |.+ .+.+.....+..+.............-......++. .+-..+.+....+... +-++.+++++| T Consensus 278 g~pvpd~LKp~y~f~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~~~~~l~~dla~aL~~g~~~~l~~--R~eli~~~Rsn 355 (698) T protein:vir:10 278 SRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGILMDLAQALTPGANVDLSM--RAELINRYRDN 355 (698) T ss_pred CCCchhhhcchhccCCccHHHHHHHHHHHHHHHhhhHHHHHHHhhHHHHHHHHHHhcCChhhHHHHH--HHHHHHHhcCc Confidence 1 111 111111222222222222111111100111111111 0111222222222322 22444566666 Q ss_pred CcceeecC-CCceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCC------cH-HHHHHHHH-------HHHHhH Q lcl|NC_010576. 256 KYGVATLD-TQEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNGT------AN-EQQTLGYY-------NRCVDV 320 (447) Q Consensus 256 ~~~~~vl~-~g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~------~~-e~~~~~f~-------~~ti~P 320 (447) - ++++|+ ...+|++.+.+..... 
+-+....+.||.+-+||...|-|+ ++ |...+.|| ..-|.| T Consensus 356 ~-G~~llDk~~Eefeq~st~lSGLd-dVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p 433 (698) T protein:vir:10 356 R-NILFLDKATEEFFQFNTPLSGLD-ALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQ 433 (698) T ss_pred c-ceEEEecCCcceEEEecCcCCHH-HHHHHHHHHHHhhhcCchhhhhccCCcccCccchhhHHHHHHHHHHHHHHHHHH Confidence 4 566778 5789999886554432 112234567888888888776332 11 33333443 456889 Q ss_pred HHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHH-------HHHHHhCCCcCHHHHHHHhCCCCCCCccc- Q lcl|NC_010576. 321 LLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATV-------ADVLTRNAIYTPNEIRELTGKAPHPNPLA- 392 (447) Q Consensus 321 ~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~-------~~~~~~~G~~t~NE~R~~~gl~p~~g~~~- 392 (447) .++.+=+.|-+..|.. ++. .|.|.+++|...+.++++++ ...++..|+++++|+|+++.-+|--+..+ T Consensus 434 ~L~rl~~ii~rS~~G~--idp--~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~ 509 (698) T protein:vir:10 434 LMNDVIVMIQLSLFGA--VDP--SIKWQWNALRELDDLEVAEARYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGK 509 (698) T ss_pred HHHHHHHHHHHHhcCC--CCC--cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhccCCCccccc Confidence 9998888888877764 333 46666778888888777765 45677889999999999997766332211 Q ss_pred ----cc-cccc-cccchh--hcccccCCCCCC-----CCCCCcCCCCC-CCcccccccCCccCcCCCCC Q lcl|NC_010576. 393 ----NE-LFNR-NIADGN--QVGGINTPGQIT-----SDQPATASTDP-LNNVSTSAIENGSLTDGGSY 447 (447) Q Consensus 393 ----~~-~~~~-~~~~~~--~~~~~~~~~~~~-----~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 447 (447) |+ +.+. +.++.. ++.....++... 
...-++.+.+| ..+.+.++..+-+..+..+- T Consensus 510 ~d~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 578 (698) T protein:vir:10 510 LDANDDPGAPADDDIDGVLTYVQRMAEGGDTGAPTAPGGARAGATAPPAAANVNANANPREAGAQDAAM 578 (698) T ss_pred cCCcccCCCCCCCcchHHHhhhcCCcCCCCcccccccccccCCCCCCcccccccCCCCccccCccccee Confidence 11 1111 111111 011111111111 11111222222 33344444444444444443 No 126 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=99.12 E-value=2.6e-10 Score=73.16 Aligned_cols=395 Identities=8% Similarity=-0.071 Sum_probs=170.1 Q ss_pred ccCCc-cccc-cccccccccccccccccccccCCccc---------ccchhhhhhHHHHHHHHHHHHhhccCceEEEEEc Q lcl|NC_010576. 14 FQSNQ-NQNQ-NTNDFLTPSNGMTSFGGYYGRGQSNY---------SRSYSYNKADLIKSVITRIALDASMVDFKHLKID 82 (447) Q Consensus 14 f~~~~-~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r~~ 82 (447) .++.. .++. +... .... +.++++......... ..-...++.+.|.+|++.+...|.+++|.+.-.. T Consensus 1 v~~~~l~~e~at~~~--~~d~-~~~~~~~l~~~~~~il~~a~~g~~~~y~~l~~D~~i~s~l~~rk~av~~~~w~i~p~~ 77 (488) T protein:vir:99 1 MEKPALGREIATSGD--GRDI-TRPFISGLQVPNDSILQRRGGNDLRVYEEILSDAQVKTVWGQRQLAVVSREWKVEAGG 77 (488) T ss_pred CCccchhHHHHHHHh--hhhh-hccccCCCCCCChHHHHhhccCCHHHHHHHhhChHHHHHHHHHHHHHhcCCceEEcCC Confidence 11111 0000 0000 0000 011111111111111 1112335678899999999999999999975321 Q ss_pred CCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCc--ccceeeeccCCCcceeeecCCce Q lcl|NC_010576. 83 PISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDP--DSGSFDINTARVGKIMQFFPRQV 160 (447) Q Consensus 83 ~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 160 (447) +....+....-+..+|. + +...++++.|+ +.+++|.+...+++...+. ....+.+.+.+ .......+.. 
T Consensus 78 -~~~~~~~~ae~v~~~l~-~----~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~--~f~~d~~~~l 148 (488) T protein:vir:99 78 -DRPIDQAAAEHLEQQLQ-R----VGWDRVTSKML-FGVFYGYAVSELIYGRDDRYITLEAIKVRNRR--RFRYDQDGGL 148 (488) T ss_pred -CChHHHHHHHHHHHHHh-C----CCHHHHHHHHH-hhhhhcceeEEEEEeecCCeeeEeeeeeeccc--ceeecCCCce Confidence 11111111122344443 2 34556666665 4577999998887754321 11112222221 1111111122 Q ss_pred EEEEeeecccccceeeecccccccccccccccccchhHHHHHHH-----HHHHHHHHHHHHhhcCcccceeeeCC-cCCh Q lcl|NC_010576. 161 MVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLE-----QKIKLMNSQDNRASSGKLNGFIQFPY-STKS 234 (447) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~n~~~~~gvl~~~~-~~~~ 234 (447) .++. ......+..+..+..=++|...+..+ .....+.+..+. .......-.......|.|--+.+++. ..++ T Consensus 149 ~~~~-~~~~~~g~~lp~~~~~i~~~~~~~~g-~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~~a~~ 226 (488) T protein:vir:99 149 RLLT-PNNMFEGEPCPAPYFWHFSTGADNDD-EPYGLGLAHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDKTATP 226 (488) T ss_pred EEec-cCCCCCccccccCceEEEEeecCCCC-CcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeeeecCCCCCCH Confidence 2221 11211222221122223443332221 111111222211 11122112222234566655666663 2333 Q ss_pred HHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCCh--hhhhHHHHHHHHHHHHHHh-CCCHHHh--cCCcH-HH Q lcl|NC_010576. 235 TARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGL--QNNLLSDVRQLQQDFYNQM-GITEAIL--NGTAN-EQ 308 (447) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~--~~~~l~~~~~~~~~Ia~~f-gVP~~~l--~g~~~-e~ 308 (447) ++ ++++.+.+.+.. .+ ..++++.|++++-++... .+.+.+..++..++|+++. |=-...= +|+.. 
.+ T Consensus 227 ~e----k~~l~~av~~~~-~~--~~~viP~~~~ie~~ea~~~~~~~~~~li~~~d~~Isk~iLGqtlts~~~~Gs~a~~~ 299 (488) T protein:vir:99 227 ED----KAKLLAALHAIQ-TD--SAIIMPAGMQAELLEAGRSGTADYKTLHDTMDATIAKVGLGQVASTQGTPGRLGNDD 299 (488) T ss_pred HH----HHHHHHHHHHHh-cC--cEEEecCCceeEEeecCCCChHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhHH Confidence 32 233334443332 23 345667777666555422 2233445678888888873 2110000 12221 22 Q ss_pred HHHHHHHHHHhHHHHHHHHHHHhhcCChh-HhcCC----ceEEEecchhhhcCHHHHHHHHHHHHhC-CC-cCHHHHHHH Q lcl|NC_010576. 309 QTLGYYNRCVDVLLQYVTDAISRIALTKT-AVSQG----QVLVYYRNPFKLVPVEQLATVADVLTRN-AI-YTPNEIREL 381 (447) Q Consensus 309 ~~~~f~~~ti~P~~~~ie~~l~~kLl~~~-e~~~g----~~i~f~~~~l~~~d~~~~~~~~~~~~~~-G~-~t~NE~R~~ 381 (447) .......+.+.-.++.|++.||+.|+.+- .+-.+ .++.|+ .....|.+++++++.++++. |+ ++..++|+. T Consensus 300 vh~~v~~d~~~aDa~~i~~tln~~li~~l~~~N~~~~~~p~~~~~--~~e~edl~~~a~~~~~l~~~~G~~i~~~~i~e~ 377 (488) T protein:vir:99 300 LQADVRLDLVKADADLICESFNLGPARWLTEWNFPGAQPPRVYRV--IEEPEDITAKAERDEKVFRMSGFRPTRGYVQET 377 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcCCcCCceeEec--CCCcccHHHHHHHHHHHHhhcCCCCCHHHHHHH Confidence 33345567778899999999998886542 12111 234454 44567999999999999996 64 788889999 Q ss_pred hCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCcccccccC-------------CccCcCCCCC Q lcl|NC_010576. 382 TGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTSAIE-------------NGSLTDGGSY 447 (447) Q Consensus 382 ~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~ 447 (447) +|+|+-..+. ....+. +...... +. 
...+ +.....+...+ -..--+.+|| T Consensus 378 ~Gip~~~~~~-~~~~~~------~~~~~~~-~~--~~~~------~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~s~ 440 (488) T protein:vir:99 378 YGVEVESTQA-EATAPT------PSTEFAE-GD--QPSD------PAAAMAPQLAEAMQPVVGNWTTQLRTLIEQASSL 440 (488) T ss_pred cCCCCccccc-ccccCC------CcccCCC-CC--CCCC------chHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCH Confidence 9999744321 111110 0000000 00 0000 00000000000 0011134555 No 127 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.09 E-value=5.4e-10 Score=71.37 Aligned_cols=429 Identities=10% Similarity=-0.051 Sum_probs=186.8 Q ss_pred CchhHhhhhhccc-----ccCCcccccccccccccccccccccccc---ccCCccc-ccchh-hhhhHHHHHHHHHHHHh Q lcl|NC_010576. 1 MASSDRLLHSWNA-----FQSNQNQNQNTNDFLTPSNGMTSFGGYY---GRGQSNY-SRSYS-YNKADLIKSVITRIALD 70 (447) Q Consensus 1 Mg~~~~l~~~~~~-----f~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~-~~~~~-~~~~~~v~~cv~~ia~~ 70 (447) |--+-+++.+-.. +..-.. ..+..... ...|.....+.- ..+.... ...+. +..++.+..||+.+... T Consensus 3 ~p~~~~~~~~~~~~~~~~~~~y~~-~a~~~~~~-~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~n 80 (533) T protein:vir:34 3 TPTIPTLLGPDGMTSLREYAGYHG-GGSGFGGQ-LRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQDH 80 (533) T ss_pred CchhhhhhcccccchHHHHHhhhh-ccCCCCCc-ccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHH Confidence 2111111110000 000000 00000000 011111111100 0011111 11122 33568889999988877 Q ss_pred hccCceEEEEEcCC-------Cceeccccch---HHHHHhhhcCc------ccCHHHHHHHHHHHHHhcCCeeEEEeecc Q lcl|NC_010576. 71 ASMVDFKHLKIDPI-------SGNQTPMPSG---LINVLTRSANI------DQTGRSFVFDLLYSLLDEGQIAMVPIDTT 134 (447) Q Consensus 71 ia~lp~~~~r~~~~-------~~~~~~~~~~---l~~lL~~~PN~------~~t~~~f~~~~~~~lll~Gna~i~~~~~~ 134 (447) +-.--|.+. -+.+ +...+..... +...+..+|+. .++.+++...++..++..|++|+.+.+.. 
T Consensus 81 vVG~Gi~~~-~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~ 159 (533) T protein:vir:34 81 IVGSFFRLS-HRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATWDT 159 (533) T ss_pred hhCCCceee-eccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEeeecc Confidence 743344322 1111 1111111112 23333334543 35778888889999999999999876654 Q ss_pred CC--cccceeee-ccCCCccee--------------eecCCceEEEEeeeccc-----cc----ceeeeccccccccccc Q lcl|NC_010576. 135 VD--PDSGSFDI-NTARVGKIM--------------QFFPRQVMVRVWNDNTG-----LE----QDLLVSKENCIIIESP 188 (447) Q Consensus 135 ~~--~~~~~~~~-~~~~~~~~~--------------~~~~~~~~~~~~~~~~~-----~~----~~~~~~~~~v~~~~~~ 188 (447) .. +.+-.+.+ .+.++.... .-+...+.|.++..... .+ ....++..+|+|+..+ T Consensus 160 ~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~VlH~f~~ 239 (533) T protein:vir:34 160 SSSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELPGGRASFIHVFEP 239 (533) T ss_pred CCCCccceEEEEechhhcCCCCCCCCCCceEeeeEECCCCCeEEEEEeecCCCCccccccceeeeeeccChhHeeeeccc Confidence 32 12212221 111111110 01112233434322111 11 1234667899999876 Q ss_pred ccc----cccchhHHHHHHHHHHHHHHHHHHH-hhcCcccceeeeCCcCCh----------HHHHHHHHHHHHHHHHHhc Q lcl|NC_010576. 189 FYA----ILNDTNQTLRMLEQKIKLMNSQDNR-ASSGKLNGFIQFPYSTKS----------TARAAQAARRKQEIENEMA 253 (447) Q Consensus 189 ~~~----~~~~~~~~~~~~~~~~~~~~~~~~~-~n~~~~~gvl~~~~~~~~----------~~~~~~~~~~~~~~~~~~~ 253 (447) ... +.+.+-+++..+...-....+.... .-.+...++|+.+..... +.................. 
T Consensus 240 ~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (533) T protein:vir:34 240 VEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGANSQEQRERLTGWIGEIAAYYA 319 (533) T ss_pred cCCCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeecCCCcccccccccCCCcccccccccccchhhhhccC Confidence 533 2233333333333322222222221 234455567764422100 0000000000000000001 Q ss_pred -----cCCcceeecCCCceeeecCCCh-hhhhHHHHHHHHHHHHHHhCCCHHHhcCCcH-----H-------------HH Q lcl|NC_010576. 254 -----NNKYGVATLDTQEKFVSAGMGL-QNNLLSDVRQLQQDFYNQMGITEAILNGTAN-----E-------------QQ 309 (447) Q Consensus 254 -----~n~~~~~vl~~g~~~~~l~~~~-~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~-----e-------------~~ 309 (447) =..|.+..|..|.+++.++.+- .....+..+.+.+.||..+|||.++|.|+.+ . +. T Consensus 320 ~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~ 399 (533) T protein:vir:34 320 AAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGR 399 (533) T ss_pred cceeeccCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHH Confidence 1346678899999999988764 4455566888999999999999999976432 1 11 Q ss_pred HHHHHHHHHhHHHHH-HHHHHHhhcCChh---------HhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHH Q lcl|NC_010576. 310 TLGYYNRCVDVLLQY-VTDAISRIALTKT---------AVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIR 379 (447) Q Consensus 310 ~~~f~~~ti~P~~~~-ie~~l~~kLl~~~---------e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R 379 (447) ...|....+.|+.+. +++++-...++-. 
.+..-..+++-.-...-.|+...+++...++++|+.|.-|+- T Consensus 400 q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~ 479 (533) T protein:vir:34 400 RKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKEC 479 (533) T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccChHHHHHHHHHHHHcCCCCHHHHH Confidence 112444455665554 5555555444310 001112345555666667999999999999999999999999 Q ss_pred HHhCCCCCCCcccccccc-ccccchhhcccccCCCCCCCCCCCcCCCCCCCcccccccCCcc Q lcl|NC_010576. 380 ELTGKAPHPNPLANELFN-RNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTSAIENGS 440 (447) Q Consensus 380 ~~~gl~p~~g~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 440 (447) +..|.+|-+- .++... ........+.-...+...+ .+..+.+.+..+++..++ T Consensus 480 a~~G~D~~ev--~~q~a~e~~~~~~~gl~~~~~~~~~~------~s~~~~~~~~~~~~~~~~ 533 (533) T protein:vir:34 480 AKRGDDYQEI--FAQQVRETMERRAAGLKPPAWAAAAF------ESGLRQSTEEEKSDSRAA 533 (533) T ss_pred HHcCCCHHHH--HHHHHHHHHHHHhcCCCCCCCCCcCc------cCCCCCCCCCCcccCCCC Confidence 9999998532 111110 0000000000000010000 000001111111111111 No 128 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=99.06 E-value=1.5e-09 Score=68.95 Aligned_cols=428 Identities=12% Similarity=0.079 Sum_probs=163.1 Q ss_pred CchhHhhhhhccccc-----------CCcccccccccc-------cccccccc---ccccccccCCcccccchhhhhhHH Q lcl|NC_010576. 1 MASSDRLLHSWNAFQ-----------SNQNQNQNTNDF-------LTPSNGMT---SFGGYYGRGQSNYSRSYSYNKADL 59 (447) Q Consensus 1 Mg~~~~l~~~~~~f~-----------~~~~~~~~~~~~-------~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~ 59 (447) ||--.-| +.++.-- +......+.+.+ +...+... 
.+.-....+-..|.....+.+.+- T Consensus 46 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~e 124 (695) T protein:vir:78 46 MGRRGAL-NALDAAPVAEPSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPE 124 (695) T ss_pred hcccccc-cccccccccCCCcccccceeceeccccCCccccchhhhhhcccccccccchhhhccCcchHHHHHHHhhccc Confidence 6542211 1111100 000000000111 11111000 000011112222333334567788 Q ss_pred HHHHHHHHHHhhccCceEEEEEc----------CCCcee-ccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeE Q lcl|NC_010576. 60 IKSVITRIALDASMVDFKHLKID----------PISGNQ-TPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAM 128 (447) Q Consensus 60 v~~cv~~ia~~ia~lp~~~~r~~----------~~~~~~-~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i 128 (447) +++|+..|++.+..- |...... ..++.. ...+..-...|..+=..+.-+..|.+.+.+.. ++|-+.+ T Consensus 125 yr~~~~~ia~e~~R~-w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erL~V~~~l~eaik~aR-lfGGa~~ 202 (695) T protein:vir:78 125 YRAMHEVLADECIRT-WGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQ-AFGRAHP 202 (695) T ss_pred hhhHHHHHHHHhhcc-cceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhc-cccceEE Confidence 999999999998764 5322100 000000 11111223344433222333333444444444 5565554 Q ss_pred EEee-ccCC-ccc----c-----------eeeeccCCCcce-----eeecCCceEEEEeeecccccceeeeccccccccc Q lcl|NC_010576. 129 VPID-TTVD-PDS----G-----------SFDINTARVGKI-----MQFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIE 186 (447) Q Consensus 129 ~~~~-~~~~-~~~----~-----------~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~ 186 (447) ++.- .+.. ... . ..++.++.+... .+..++-....+|... + .. +| .+-++.+. 
T Consensus 203 ~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~-G--~k-IH-~SRL~~f~ 277 (695) T protein:vir:78 203 YFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMI-G--TE-VH-ATRLHTIV 277 (695) T ss_pred EEEeccCccccccccccccccccCcceeeeEeecccccccchhhhccchhhccCCCceEEEe-c--eE-Ee-eeeEEEec Confidence 4422 2110 000 0 111111111100 0000110111111111 0 01 11 11111111 Q ss_pred --------cccc--ccccchhHHHHHHHHHHHHHHHHHHHhhcCcccceeeeC--CcCChHHHHHHHHHHHHHHHHHhcc Q lcl|NC_010576. 187 --------SPFY--AILNDTNQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFP--YSTKSTARAAQAARRKQEIENEMAN 254 (447) Q Consensus 187 --------~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 254 (447) .+.+ .+.+....++..+.............-......++ +.+ ..+.+....+... +-++.+++++ T Consensus 278 g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~~v~~l-k~dla~~L~~g~~~~l~~--R~eli~~~Rs 354 (695) T protein:vir:78 278 SRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGI-LMDLAQALMPGANVDLSM--RAELINRYRD 354 (695) T ss_pred CCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhhhhHHH-HHHHHHhhcChhHHHHHH--HHHHHHHhcC Confidence 1111 12222222222222222222211111111222222 111 1222222222222 2244456666 Q ss_pred CCcceeecC-CCceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCC------cH-HHHHHHHH-------HHHHh Q lcl|NC_010576. 255 NKYGVATLD-TQEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNGT------AN-EQQTLGYY-------NRCVD 319 (447) Q Consensus 255 n~~~~~vl~-~g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~------~~-e~~~~~f~-------~~ti~ 319 (447) |- ++++|+ ...+|++.+.+..... +-+....+.||.+-+||...|-|+ ++ |...+.|| +.-|. 
T Consensus 355 n~-G~~llDk~~Eefeq~stslSGLd-dVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~ 432 (695) T protein:vir:78 355 NR-NILFLDKATEEFFQFNTPLSGLD-ALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQ 432 (695) T ss_pred cc-ceEEEecCCcceEEEecccCCHH-HHHHHHHHHHHhhhcCchhhhhccCCccccccchhhHHHHHHHHHHHHHHHHH Confidence 64 566778 4789999886554332 112234567888888888776332 11 33333443 45688 Q ss_pred HHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHH-------HHHHHhCCCcCHHHHHHHhCCCCCCCccc Q lcl|NC_010576. 320 VLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATV-------ADVLTRNAIYTPNEIRELTGKAPHPNPLA 392 (447) Q Consensus 320 P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~-------~~~~~~~G~~t~NE~R~~~gl~p~~g~~~ 392 (447) |.++.+=+.|-+..|.. ++. .|.|.+++|...+.++++++ ...++..|+++++|+|.++.-+|--+..+ T Consensus 433 p~L~rl~~ii~rS~~G~--idp--di~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~ 508 (695) T protein:vir:78 433 QLMNDVIVMIQLSLFGA--VDP--SIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAG 508 (695) T ss_pred HHHHHHHHHHHHHhcCC--CCC--cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCccccc Confidence 99988888888877764 333 46666678888887777665 46678899999999999998877433211 Q ss_pred -----ccc-ccccc-----cchhhcccccCCCCCCCCCCCcCCC--CC-CCcccccccC-----CccC--------cCCC Q lcl|NC_010576. 393 -----NEL-FNRNI-----ADGNQVGGINTPGQITSDQPATAST--DP-LNNVSTSAIE-----NGSL--------TDGG 445 (447) Q Consensus 393 -----~~~-~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~--~~-~~~~~~~~~~-----~~~~--------~~~~ 445 (447) |++ ++.+. .+.++.. ...++..+...+.++. +| ..+......+ -+.+ ..+| T Consensus 509 ~~D~~d~p~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ag~~~~~~~aag~v~~~~g 586 (695) T protein:vir:78 509 KLDANDDPGVPADDDIDGVLTYVQRL--AEGGDTGAPGGARAGATAPPTVANVNANVKPREAGAQDAAMRAAGAVYVVDG 586 (695) T ss_pred ccccccCCCcCccchhhhhHhhhcCc--ccccccCCCCCCCCCCCCCCceeeeeccccccccCCCCcccceeEEEEEeCC Confidence 111 11110 1111110 0111111111111111 11 0011000000 0000 0112 Q ss_pred CC Q lcl|NC_010576. 446 SY 447 (447) Q Consensus 446 ~~ 447 (447) .. 
T Consensus 587 ~v 588 (695) T protein:vir:78 587 KV 588 (695) T ss_pred EE Confidence 22 No 129 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=99.03 E-value=4.2e-09 Score=66.48 Aligned_cols=427 Identities=12% Similarity=0.094 Sum_probs=161.2 Q ss_pred CchhHhh--------------hhhcc----cccCCcccccccccccccccccc---ccccccccCCcccccchhhhhhHH Q lcl|NC_010576. 1 MASSDRL--------------LHSWN----AFQSNQNQNQNTNDFLTPSNGMT---SFGGYYGRGQSNYSRSYSYNKADL 59 (447) Q Consensus 1 Mg~~~~l--------------~~~~~----~f~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~ 59 (447) -|-+.-+ -+.|- .+.+.+ .+.-.+...+... .+.-....+-..|.....+.+.+- T Consensus 48 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~e 123 (694) T protein:vir:10 48 RGALNALDAAPVAEPSPSLRLARQFEVDVSNYTPRE----RRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPE 123 (694) T ss_pred cccchhhcccccCCCCcchhhhhhccccccCCCccc----cchhhhhhccCcccccchhhhhccCcchHHHHHHHhhccc Confidence 1111110 00000 001110 0000001110000 000011112222333334567888 Q ss_pred HHHHHHHHHHhhccCceEEEEEc----------CCCcee-ccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeE Q lcl|NC_010576. 60 IKSVITRIALDASMVDFKHLKID----------PISGNQ-TPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAM 128 (447) Q Consensus 60 v~~cv~~ia~~ia~lp~~~~r~~----------~~~~~~-~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i 128 (447) +++|+..|++.+..- |...... ..++.. ...+..-...|..+=..+.-+..|.+.+.+.. 
++|-+.+ T Consensus 124 yr~~~~~ia~e~~R~-w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eaik~aR-lfGGa~~ 201 (694) T protein:vir:10 124 YRAMHEVLADECIRT-WGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQ-AFGRAHP 201 (694) T ss_pred hhhHHHHHHHHhhcc-cceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhc-cccceEE Confidence 999999999998764 5322100 000000 11111223344433222333333444444444 5565554 Q ss_pred EEee-ccCC-cccc---------------eeeeccCCCcce-----eeecCCceEEEEeeecccccceeeeccccccccc Q lcl|NC_010576. 129 VPID-TTVD-PDSG---------------SFDINTARVGKI-----MQFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIE 186 (447) Q Consensus 129 ~~~~-~~~~-~~~~---------------~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~ 186 (447) ++.- .+.. ...- ..++.++.+... .+..++-....+|... + .. +| .+-++.+. T Consensus 202 ~i~I~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~-G--~~-IH-~SRL~~f~ 276 (694) T protein:vir:10 202 YFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMI-G--TE-VH-ATRLHTIV 276 (694) T ss_pred EEEeecCccccccccccccccccCcceeeeEeecccccccchhhhccchhhccCCCceEEEe-c--eE-Ee-eeeEEEec Confidence 4432 2111 0000 111111111100 0000110111111111 0 01 11 11111111 Q ss_pred --------cccc--ccccchhHHHHHHHHHHHHHHHHHHHhhcCcccceeeeC--CcCChHHHHHHHHHHHHHHHHHhcc Q lcl|NC_010576. 187 --------SPFY--AILNDTNQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFP--YSTKSTARAAQAARRKQEIENEMAN 254 (447) Q Consensus 187 --------~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 254 (447) .+.+ .+.+....++..+.............-......++ +.+ ..+.+....+... 
+-++.+++++ T Consensus 277 g~plPd~LKp~y~~~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~~v~~l-k~dla~~L~~g~~~~l~~--R~eli~~~Rs 353 (694) T protein:vir:10 277 SRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGI-LMDLAQALMPGANVDLSM--RAELINRYRD 353 (694) T ss_pred CCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhhhhHHH-HHHHHHhhcChhHHHHHH--HHHHHHHhcC Confidence 1111 12222222222222222222111111111222222 111 1222222222222 2244456666 Q ss_pred CCcceeecC-CCceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCC------cH-HHHHHHHH-------HHHHh Q lcl|NC_010576. 255 NKYGVATLD-TQEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNGT------AN-EQQTLGYY-------NRCVD 319 (447) Q Consensus 255 n~~~~~vl~-~g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~------~~-e~~~~~f~-------~~ti~ 319 (447) |- ++++|+ ...+|++.+.+..... +-+....+.||.+-+||...|-|+ ++ |...+.|| +.-|. T Consensus 354 n~-G~~llDk~~Eefeq~stslSGLd-dVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~ 431 (694) T protein:vir:10 354 NR-NILFLDKATEEFFQFNTPLSGLD-ALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQ 431 (694) T ss_pred cc-ceEEEecCCcceEEEecccCCHH-HHHHHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHHHHHHHHHH Confidence 64 566778 4789999886554332 112234567888888888776332 11 33333443 45688 Q ss_pred HHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHH-------HHHHHhCCCcCHHHHHHHhCCCCCCCccc Q lcl|NC_010576. 320 VLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATV-------ADVLTRNAIYTPNEIRELTGKAPHPNPLA 392 (447) Q Consensus 320 P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~-------~~~~~~~G~~t~NE~R~~~gl~p~~g~~~ 392 (447) |.++.+=+.+-+..|.. ++. 
.|.|.+++|...+.++++++ ...++..|+++++|+|.++.-+|--+..+ T Consensus 432 p~L~rl~~ii~rS~~G~--idp--~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~ 507 (694) T protein:vir:10 432 QLMNDVIVMIQLSLFGA--VDP--SIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAG 507 (694) T ss_pred HHHHHHHHHHHHHhcCC--CCC--cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCccccc Confidence 99988888888877764 333 46666678888887777665 46678899999999999998877433211 Q ss_pred -----ccc-ccccc-----cchhhc----ccccCCCCCCCC--CCCcCCCCC--CCcccccccCCccCc------CCCCC Q lcl|NC_010576. 393 -----NEL-FNRNI-----ADGNQV----GGINTPGQITSD--QPATASTDP--LNNVSTSAIENGSLT------DGGSY 447 (447) Q Consensus 393 -----~~~-~~~~~-----~~~~~~----~~~~~~~~~~~~--~~~~~~~~~--~~~~~~~~~~~~~~~------~~~~~ 447 (447) |++ ++.+. ...++. +..+..++...+ .+++.+.-+ .+.++-++...--.. .+|.. T Consensus 508 ~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~~~~~~~~ag~~~~~~~~ag~v~~~~g~v 587 (694) T protein:vir:10 508 KLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARAGATAPPTVANVNANVNPREAGAQDAAMRAAGAVYVVDGKV 587 (694) T ss_pred ccccccCCCcCccchhhhhHhhhcCcccccccCCCCcccccccCCCcccccccccCccccCCCCccceeeEEEEEeCCEE Confidence 111 11110 111111 011111110000 011111100 111111110000000 11222 No 130 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=99.02 E-value=5.1e-09 Score=66.04 Aligned_cols=411 Identities=11% Similarity=0.023 Sum_probs=156.0 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccc-cccccCCccc--------------ccchhhhhhHHHHHHHH Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFG-GYYGRGQSNY--------------SRSYSYNKADLIKSVIT 65 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~--------------~~~~~~~~~~~v~~cv~ 65 (447) |.- . +.....++|.-. 
...+ .........+ .--...++.+.|.+|++ T Consensus 1 ~~~---------------~--~~~~~gl~p~rl-~~i~~~~~~~~~~~~~~~~~~~Lr~~~~~~ly~~m~~D~hi~s~l~ 62 (488) T protein:vir:95 1 MAD---------------I--TETQESLPPFRM-GEVGSLGLKVKNGRIYEEPRQALRFPESIKTFQLMMRDPAVAASVN 62 (488) T ss_pred CCC---------------c--cccCCCCCHHHH-HHHHHHhhccccchhhccchhhhcccchHHHHHHHhhChHHHHHHH Confidence 111 0 111111111100 0000 0000000000 00012235678999999 Q ss_pred HHHHhhccCceEEEEEcCCCceeccccchHHHHHhhh-cCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccc---- Q lcl|NC_010576. 66 RIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRS-ANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSG---- 140 (447) Q Consensus 66 ~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~-PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~---- 140 (447) .+...|.+++|.|.-.+.. .....+-..+..+... -|-..+..+++..|+ +.+++|.+...+++........ T Consensus 63 ~Rk~av~~~~w~v~p~~~~--~~d~~~~~~a~~v~~~l~~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~~~~~~~~~ 139 (488) T protein:vir:95 63 IIKMFVRKVNWRFVPPKGK--EQDPKMLERADFFNSLMDDMEHDWADFINSVM-SFCTYGFCVNEKVYKKRQGKKGKYQS 139 (488) T ss_pred HHHHHHhcCCceEecCCCC--chhHHHHHHHHHHHHHHhccCccHHHHHHHHH-Hhhcccceeeeeeeeccccccccccc Confidence 9999999999987532211 1111111122222211 122223445555554 4578898888877754321110 Q ss_pred -----e-----eeeccCCC-cceeeecCCceEEEEeeecc------------cccceeeeccccc-ccccccccccccch Q lcl|NC_010576. 141 -----S-----FDINTARV-GKIMQFFPRQVMVRVWNDNT------------GLEQDLLVSKENC-IIIESPFYAILNDT 196 (447) Q Consensus 141 -----~-----~~~~~~~~-~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~v-~~~~~~~~~~~~~~ 196 (447) . +.+.+... ........+.......+... .....+.++.... +|..... +..... 
T Consensus 140 ~~~dg~~~~~~i~~Rpq~~~~~f~~d~d~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~kfi~~~~~~~-~g~p~g 218 (488) T protein:vir:95 140 KFDDGLIGWAKLPIRNQSTLDKWYFDEDFRRVTGVRQNLRNVSHIAGAINLGERPLTRKLPRAKFMLFKYDDE-YGNPEG 218 (488) T ss_pred cccCCeeeeeeeeecCcccccceeeccCCCceeecccccccccccccccccccccccccccccceEEEeecCC-CCccch Confidence 0 11111000 00000000000000000000 0011122333333 3332221 111111 Q ss_pred hHHHHHHHHHHHHHHHHH-----HHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHh---ccCCcceeecCCCcee Q lcl|NC_010576. 197 NQTLRMLEQKIKLMNSQD-----NRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEM---ANNKYGVATLDTQEKF 268 (447) Q Consensus 197 ~~~~~~~~~~~~~~~~~~-----~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~---~~n~~~~~vl~~g~~~ 268 (447) ...+..+.-..-.-.... .....+.+--+++.+....+.+.++..+.+.+...+.. ..+...-++++.|++. T Consensus 219 ~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~~~i~~~~~~~~~ag~iiP~g~~~ 298 (488) T protein:vir:95 219 RSPLLNAYVPWKYKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYCKTVVNDMIANDRAGLIWPRYIDP 298 (488) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHHHHHHHHhhccchhheeecccccc Confidence 122222211111111111 11111111112333222222222222222333332222 1111122355666542 Q ss_pred e---------ecCCCh-hhhhH-HHHHHHHHHHHHHhCCC-HHHhc---CCcH-HHHHHHHHHHHHhHHHHHHHHHHHhh Q lcl|NC_010576. 269 V---------SAGMGL-QNNLL-SDVRQLQQDFYNQMGIT-EAILN---GTAN-EQQTLGYYNRCVDVLLQYVTDAISRI 332 (447) Q Consensus 269 ~---------~l~~~~-~~~~l-~~~~~~~~~Ia~~fgVP-~~~l~---g~~~-e~~~~~f~~~ti~P~~~~ie~~l~~k 332 (447) . -++... ..... ...++..++|+++.--. ..+-. |+.. .+-......+.+.-.++.|++.||+. 
T Consensus 299 ~~k~~~~e~~l~~~~~~~~~~~~~li~~~d~~Isk~iLGqtLT~~~~~~Gs~Al~~vh~ev~~~i~~aDa~~i~~tln~~ 378 (488) T protein:vir:95 299 DTKEDIFEFSLVSRQGAKAYDTGSIIDRYSKQIMMAFMSDVLAMGQSKYGSFSLADSKTSLLAMSVDILLKQIKNVINRD 378 (488) T ss_pred ccchhhhhhhccccccCCchhHHHHHHHHHHHHHHHHhccccccccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 222221 11223 33567778888765321 11111 2221 23334455677778899999999998 Q ss_pred cCChh-HhcCC---ceEEEecchhhhcCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCccccccccccccch Q lcl|NC_010576. 333 ALTKT-AVSQG---QVLVYYRNPFKLVPVEQLATVADVLTRNAIYTP-----NEIRELTGKAPHPNPLANELFNRNIADG 403 (447) Q Consensus 333 Ll~~~-e~~~g---~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~-----NE~R~~~gl~p~~g~~~~~~~~~~~~~~ 403 (447) |+.+- .+..| .+.+|-++.....|++++++++.++++.|+.-+ +.+|+.+|+|+-+++.. .... ..+. T Consensus 379 li~~l~~~Nfg~~~~~P~~~~~~~e~~Dl~~~ae~~~~L~~~G~~i~~~~~~~~i~e~~gip~~~~~e~--~~~~-~~~~ 455 (488) T protein:vir:95 379 LVAQTYALNMWDDEEHVQITYDDIETPDLEAIGSYIQKTVAVGALEVDKELSNKLREHIGLPPADESQP--VSEK-LSPN 455 (488) T ss_pred HHHHHHHhcCCCCCCccEEEecCcChhhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCcc--cccc-CCCC Confidence 87652 22111 223455556667899999999999999998765 56999999997543221 1110 1111 Q ss_pred hhcccccCCCCCCCCCCCcCCCCCCCcccccccCCccCc Q lcl|NC_010576. 404 NQVGGINTPGQITSDQPATASTDPLNNVSTSAIENGSLT 442 (447) Q Consensus 404 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 442 (447) .+. ... . ....++.++.......++++-+.... 
T Consensus 456 ~~~-~~~---~--~~~~~~~~~~~~~~~~~~~~a~~~~~ 488 (488) T protein:vir:95 456 SQS-RSG---D--GYKTAGEGTAKTPSAKDPSTANKANK 488 (488) T ss_pred CCC-CCC---c--ccCCCcccCCcccccccchhhhhccC Confidence 110 000 0 00011111111111111111111111 No 131 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=99.02 E-value=4.1e-09 Score=66.58 Aligned_cols=431 Identities=13% Similarity=0.093 Sum_probs=159.8 Q ss_pred CchhH--------------hhhhhcccccCCcccccccccccccccccc---ccccccccCCcccccchhhhhhHHHHHH Q lcl|NC_010576. 1 MASSD--------------RLLHSWNAFQSNQNQNQNTNDFLTPSNGMT---SFGGYYGRGQSNYSRSYSYNKADLIKSV 63 (447) Q Consensus 1 Mg~~~--------------~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~v~~c 63 (447) -|-++ ||-+.|-+--....+.+.+.-.+...+... .+.-....+-..|.....+.+.+-+++| T Consensus 49 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~ 128 (695) T protein:vir:36 49 RGALNALDAAPVVEPSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAM 128 (695) T ss_pred cccccccccccccCCCcccccceeceecccccCccccchhhhhhcccccccccchhhhccCcchHHHHHHHhhccchhhH Confidence 11111 111111000000000011110011111000 0000111122223333445678889999 Q ss_pred HHHHHHhhccCceEEEEEc------C----CCcee-ccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEee Q lcl|NC_010576. 64 ITRIALDASMVDFKHLKID------P----ISGNQ-TPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPID 132 (447) Q Consensus 64 v~~ia~~ia~lp~~~~r~~------~----~~~~~-~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~ 132 (447) +..|++.+..- |...... . .++.. ...+..-.+.|..+=..+.-+..|.+.+.+. 
.++|-+.+++.- T Consensus 129 ~~~ia~e~~R~-w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~erL~V~~~l~eaik~a-RlfGGa~~~i~i 206 (695) T protein:vir:36 129 HEVLADECIRT-WGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHD-QAFGRAHPYFKI 206 (695) T ss_pred HHHHHHHhhcc-cceecccchhhhhhccccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHhh-ccccceEEEEEe Confidence 99999998764 5322100 0 00000 1011122333432212222222333333344 455656544432 Q ss_pred -ccCC-ccc----c-----------eeeeccCCCcce-----eeecCCceEEEEeeecccccceeeeccccccccc---- Q lcl|NC_010576. 133 -TTVD-PDS----G-----------SFDINTARVGKI-----MQFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIE---- 186 (447) Q Consensus 133 -~~~~-~~~----~-----------~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~---- 186 (447) .+.. ... . ..++.++.+... .+..++-....+|... + .. +| .+-++.+. T Consensus 207 ~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~-G--~k-IH-~SRL~~f~g~pl 281 (695) T protein:vir:36 207 KGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMI-G--TE-VH-ATRLHTIVSRPV 281 (695) T ss_pred ccCccccccccccccccccCcceeeeEeecccccccchhhhccchhhccCCCceEEEe-c--eE-Ee-eeeEEEecCCCc Confidence 2110 000 0 111111111100 0000110111111111 0 01 11 11111111 Q ss_pred ----cccc--ccccchhHHHHHHHHHHHHHHHHHHHhhcCcccceeeeC--CcCChHHHHHHHHHHHHHHHHHhccCCcc Q lcl|NC_010576. 187 ----SPFY--AILNDTNQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFP--YSTKSTARAAQAARRKQEIENEMANNKYG 258 (447) Q Consensus 187 ----~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~--~~~~~~~~~~~~~~~~~~~~~~~~~n~~~ 258 (447) .+.+ .+.+.....+..+.............-......++ +.+ ..+.+....+... 
+-++.+++++|- + T Consensus 282 Pd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~~v~~l-k~dla~aL~~g~~~~l~~--R~eli~~~Rsn~-G 357 (695) T protein:vir:36 282 GDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGI-LMDLAQALMPGANVDLSM--RAELINRYRDNR-N 357 (695) T ss_pred hhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhhhHHHH-HHHHHHhhcChhHHHHHH--HHHHHHHhcCcc-c Confidence 1111 11221222222222222222111111111112222 111 1222222222222 224445666664 5 Q ss_pred eeecC-CCceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCC------cH-HHHHHHHH-------HHHHhHHHH Q lcl|NC_010576. 259 VATLD-TQEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNGT------AN-EQQTLGYY-------NRCVDVLLQ 323 (447) Q Consensus 259 ~~vl~-~g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~------~~-e~~~~~f~-------~~ti~P~~~ 323 (447) +++|+ ...+|++.+.+..... +-+....+.||.+-+||...|-|+ ++ |...+.|| +.-|.|.++ T Consensus 358 ~~llDk~~Eefeq~stslSGLd-dVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~ 436 (695) T protein:vir:36 358 ILFLDKATEEFFQFNTPLSGLD-ALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMN 436 (695) T ss_pred eEEEecCCcceEEEecccCCHH-HHHHHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHH Confidence 66778 4789999886554332 112234567888888888776332 11 33333443 456889998 Q ss_pred HHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHH-------HHHHHhCCCcCHHHHHHHhCCCCCCCccc---- Q lcl|NC_010576. 324 YVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATV-------ADVLTRNAIYTPNEIRELTGKAPHPNPLA---- 392 (447) Q Consensus 324 ~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~-------~~~~~~~G~~t~NE~R~~~gl~p~~g~~~---- 392 (447) .+=+.|-+..|.. ++. 
.|.|.+++|...+.++++++ ...++..|+++++|+|.++.-+|--+..+ T Consensus 437 rl~~ii~rS~~G~--idp--di~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~ 512 (695) T protein:vir:36 437 DVIVMIQLSLFGA--VDP--SIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDA 512 (695) T ss_pred HHHHHHHHHhcCC--CCC--cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCccccccccc Confidence 8888888877764 333 46666678888887777665 46678899999999999998877433211 Q ss_pred -ccc-ccccc-----cchhhc----ccccCCCCCCCC--CCCcCCCCC--CCcccccccCCccCc------CCCCC Q lcl|NC_010576. 393 -NEL-FNRNI-----ADGNQV----GGINTPGQITSD--QPATASTDP--LNNVSTSAIENGSLT------DGGSY 447 (447) Q Consensus 393 -~~~-~~~~~-----~~~~~~----~~~~~~~~~~~~--~~~~~~~~~--~~~~~~~~~~~~~~~------~~~~~ 447 (447) |++ ++.+. ...++. +..+..++...+ .+++.+.-+ .+.++-++...--.. .+|.+ T Consensus 513 ~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~~~~~~~~ag~~~~~~~aag~v~~~~g~v 588 (695) T protein:vir:36 513 NDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARAGATAPPTVANVNANVNPREAGAQDAAMRAAGAVYVVDGKV 588 (695) T ss_pred ccCCCcCccchhhhhHhhhcCcccccccCCCCcccccccCCCcccccccccCccccCCCCccceeeEEEEEeCCEE Confidence 111 11110 111111 111111110000 011111111 111111110000000 11222 No 132 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.00 E-value=2.6e-09 Score=67.66 Aligned_cols=426 Identities=9% Similarity=-0.012 Sum_probs=187.2 Q ss_pred CchhHhhhhhcccccCCccc--ccccccccc--------cccccccccccccc---CCccc-ccchh-hhhhHHHHHHHH Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQ--NQNTNDFLT--------PSNGMTSFGGYYGR---GQSNY-SRSYS-YNKADLIKSVIT 65 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~--~~~~~~~~~--------~~~~~~~~~~~~~~---~~~~~-~~~~~-~~~~~~v~~cv~ 65 (447) |-...|+.. ...+.-.. +......+. ...|..+..+.-.. +.... ...+. 
+..++.+.++|+ T Consensus 2 ~~~~~r~~~---~~a~~~~~~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~ 78 (553) T protein:vir:63 2 TKVTVRKLS---EVTSGRPEQSASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVG 78 (553) T ss_pred cchhhhhhc---ccccccchhhhhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHH Confidence 222222211 11111110 000000000 00111100000000 00000 00111 335677888888 Q ss_pred HHHHhhccCceEEEEEcCCC----ceeccccch-------HHHHHhhhcCc------ccCHHHHHHHHHHHHHhcCCeeE Q lcl|NC_010576. 66 RIALDASMVDFKHLKIDPIS----GNQTPMPSG-------LINVLTRSANI------DQTGRSFVFDLLYSLLDEGQIAM 128 (447) Q Consensus 66 ~ia~~ia~lp~~~~r~~~~~----~~~~~~~~~-------l~~lL~~~PN~------~~t~~~f~~~~~~~lll~Gna~i 128 (447) .+...+-.-=|.+- -+.+- +.....+.. +...+..+||. .++.+.+-..++..++..|++|+ T Consensus 79 ~~~~nvVG~Gi~~~-~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~ 157 (553) T protein:vir:63 79 YQRDSIVGAQYRLN-SMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLA 157 (553) T ss_pred HHHHhhccCCceee-eccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEE Confidence 77776643334321 11110 000011111 22444444543 35677888888899999999999 Q ss_pred EEeeccC--Ccccceeee-ccCCCccee------------e--ecCCceEEEEeeeccccc--------------ceeee Q lcl|NC_010576. 129 VPIDTTV--DPDSGSFDI-NTARVGKIM------------Q--FFPRQVMVRVWNDNTGLE--------------QDLLV 177 (447) Q Consensus 129 ~~~~~~~--~~~~~~~~~-~~~~~~~~~------------~--~~~~~~~~~~~~~~~~~~--------------~~~~~ 177 (447) .+.+... .+.+-.+.+ .+.++.... . -+...+.|.++....+.. 
....+ T Consensus 158 ~~~~~~~~~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GVE~d~~Gr~vaY~i~~~hPgd~~~~~~~~~~~~r~~~~~~v 237 (553) T protein:vir:63 158 TAEWDRAANRPYATCFQMVSTDRLSNPYQQLDTPTLRRGVQYDKRGRPQGYWIQVAHPGDLYQMAPDMYKWKFVQQSKPW 237 (553) T ss_pred EeeeccCCCCcccceEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEeeccCCCccccccccccceeeecccccc Confidence 8766432 222222222 222211111 0 111223333333222210 12246 Q ss_pred cccccccccccccc----cccchhHHHHHHHHHHHHHHHHHHH-hhcCcccceeeeCCcCChHHHHHHH----------- Q lcl|NC_010576. 178 SKENCIIIESPFYA----ILNDTNQTLRMLEQKIKLMNSQDNR-ASSGKLNGFIQFPYSTKSTARAAQA----------- 241 (447) Q Consensus 178 ~~~~v~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~-~n~~~~~gvl~~~~~~~~~~~~~~~----------- 241 (447) +..+|+|+..+... +...+..++..+...-....+.... .-.+...++|+.+.. ++...+.. T Consensus 238 ~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~a~fi~~~~~--~~~~~~~~~~~~~~~~~~~ 315 (553) T protein:vir:63 238 GRRQVIHILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASYAAAIESELP--PEFIHSQMSGGSPNADMVG 315 (553) T ss_pred ChhHheecccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCC--hhhhhhhcccccccccccc Confidence 78899999866433 2333333333333333333222222 234555667775431 11111000 Q ss_pred ------HHHHHHHHHH--hccCCcceeecCCCceeeecCCC-hhhhhHHHHHHHHHHHHHHhCCCHHHhcCCcHH----- Q lcl|NC_010576. 242 ------ARRKQEIENE--MANNKYGVATLDTQEKFVSAGMG-LQNNLLSDVRQLQQDFYNQMGITEAILNGTANE----- 307 (447) Q Consensus 242 ------~~~~~~~~~~--~~~n~~~~~vl~~g~~~~~l~~~-~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~e----- 307 (447) +......... ..=..|.|..|..|.+++.++.+ +...+.+-.+.+.+.||..+|||.+.|.|+.+. 
T Consensus 316 ~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~nYSS 395 (553) T protein:vir:63 316 IFGKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKPMGTPGGVGSEFEASLNRHLASAFGMSYEEFTRDFSKANYSS 395 (553) T ss_pred cccccccccccccccccceeecCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHH Confidence 0000000000 00135678889999999998877 455566778889999999999999999764321 Q ss_pred -------------HHHHHHHHHHHhHHHHH-HHHHHHhhcCChh------------HhcCCceEEEecchhhhcCHHHHH Q lcl|NC_010576. 308 -------------QQTLGYYNRCVDVLLQY-VTDAISRIALTKT------------AVSQGQVLVYYRNPFKLVPVEQLA 361 (447) Q Consensus 308 -------------~~~~~f~~~ti~P~~~~-ie~~l~~kLl~~~------------e~~~g~~i~f~~~~l~~~d~~~~~ 361 (447) ....-|....++|+.+. +++++-...++-. .+..-..+++-.-...-.|+..-+ T Consensus 396 ~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~ 475 (553) T protein:vir:63 396 IQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKET 475 (553) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecCCccccChHHHH Confidence 11112444455665444 4555544333210 001111244544555566999999 Q ss_pred HHHHHHHhCCCcCHHHHHHHhCCCCCCCcccccc------c-cccccchhhcccccCCCCCCCCCCCcCCCCCCCccccc Q lcl|NC_010576. 362 TVADVLTRNAIYTPNEIRELTGKAPHPNPLANEL------F-NRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTS 434 (447) Q Consensus 362 ~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~------~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (447) ++...++++|+.|.-|+-+..|.+|-+- ..+. . ...+............+.. ....+.+.....++++++ T Consensus 476 ~A~~~~i~~G~~t~~~~~a~~G~D~~~v--~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 552 (553) T protein:vir:63 476 QAAVMRIDAGLSTYEREIARLGGDFRKS--FAQRAREDALLKKYGLTFNLSAKRSLGDGRD-AATGIAEDPAAAQTSQQG 552 (553) T ss_pred HHHHHHHHcCCCCHHHHHHHhCCCHHHH--HHHHHHHHHHHHHcCCCCCCCCccccCCCcc-cCCCCCCCCCCCCccccc Confidence 9999999999999999999989998542 1111 0 0011100000000000010 111111111112223333 Q ss_pred c Q lcl|NC_010576. 
435 A 435 (447) Q Consensus 435 ~ 435 (447) + T Consensus 553 e 553 (553) T protein:vir:63 553 E 553 (553) T ss_pred C Confidence 3 No 133 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=98.94 E-value=3.8e-10 Score=72.26 Aligned_cols=381 Identities=9% Similarity=0.029 Sum_probs=144.9 Q ss_pred CchhHhhhhhcccccCCccccccccccc-cccccccccccccccC-Cccccc-chhhhhhHHHHHHHHHHHHhhc-cCce Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFL-TPSNGMTSFGGYYGRG-QSNYSR-SYSYNKADLIKSVITRIALDAS-MVDF 76 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~-~~~~~~-~~~~~~~~~v~~cv~~ia~~ia-~lp~ 76 (447) +++.+.+.+.--.|.+ ..-.+.. +........+..+... ...+.. ...|..+...+.||+++++++- +.|. T Consensus 7 ~~~~~~~~~~~~~~~r-----d~l~~~~~glg~~r~~~~~~~g~~~~~~~~~l~~~Yr~~~ia~~iVd~~~d~~~~~~~~ 81 (449) T protein:vir:10 7 LAVNHALNDARMARAR-----MGLMVPTMGLDNKRHSAWCEYGFPELVTYENLYSLYRRGGIAHGAVEKLVGKCWQTNPE 81 (449) T ss_pred HHHhhhcchhHHHHHH-----HHHHHHHhcCCcccchhhhhcCCcccCCHHHHHHHHhcCchhHHHHHhhhhhhhhcCcc Confidence 1111111000000000 0000000 0000000001000000 000000 2245567778899999998763 2222 Q ss_pred EEEEEcCCCcee-c-cccchHHHHHhhhcCcccCHHHHHHHHH---HHHHhcCCeeEEEe-eccCCccc------c--ee Q lcl|NC_010576. 77 KHLKIDPISGNQ-T-PMPSGLINVLTRSANIDQTGRSFVFDLL---YSLLDEGQIAMVPI-DTTVDPDS------G--SF 142 (447) Q Consensus 77 ~~~r~~~~~~~~-~-~~~~~l~~lL~~~PN~~~t~~~f~~~~~---~~lll~Gna~i~~~-~~~~~~~~------~--~~ 142 (447) +....+..... . .....+.+|+. ..+|..+. ..-.++|-|.+++. ++...+.. . 
.+ T Consensus 82 -i~~g~~~~~~~~~~~~e~~~~~l~~---------~~~~~~l~ea~~~~rl~Gga~i~i~v~d~~~l~~Pl~~~~~i~~i 151 (449) T protein:vir:10 82 -IIEGDDADDSEDETSWEKKSKQVFT---------NRLWRSFAEADRRRLVGRYAGILLHIRDEKDWNLPATKGRGLQKV 151 (449) T ss_pred -cccCccccchhhhHHHHHHHHHHHH---------HHHHHHHHHHHHhhhccCcEEEEEEecCCCCCCcccccCcceeeE Confidence 11111110000 0 00111222222 13455443 22346777776653 33222110 0 01 Q ss_pred eecc-CC--Cccee-----eecCCceEEEEeeeccc-ccceeeecccccccccccccccccchhHHHHHHHHHHH----- Q lcl|NC_010576. 143 DINT-AR--VGKIM-----QFFPRQVMVRVWNDNTG-LEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIK----- 208 (447) Q Consensus 143 ~~~~-~~--~~~~~-----~~~~~~~~~~~~~~~~~-~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~----- 208 (447) .+.. .. ..... +.|..+..+.+.....+ ......++++-++|+...-..+...+..++..+..... T Consensus 152 ~v~~~~~i~~~~~~~dp~sp~yg~P~~y~v~~~~~g~~~~~~~iH~SRl~~~~~~~~~g~~~L~~~yn~l~~~~~~~~~~ 231 (449) T protein:vir:10 152 SVSWAGSLKVAEWDTGINSKTYGQPKLWKYTERLPNGSSRRVDIHPDRVFILGDYSEDAIGFLEPAYNAFVSLEKVEGGS 231 (449) T ss_pred EeeccccCChhhhhcCCCCCCCCCceEEEEeeeccCCCccceeeccceeEeecCCCCCChhHHHHHHHHhhhHHHhhhhH Confidence 1100 00 00000 01111112221111111 11122344566676643212222222222221110000 Q ss_pred ---HHHHHHHHh-----hcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhhH Q lcl|NC_010576. 209 ---LMNSQDNRA-----SSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNLL 280 (447) Q Consensus 209 ---~~~~~~~~~-----n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~l 280 (447) .+.+..... ....+.++..... ...++..+++.+.......+. ..+.++.+.+|+.++.++.+.. 
T Consensus 232 a~~~l~~~~rq~~~~~~~~~~~~~l~~~~~----~~~e~~~~~~~~~~~~~~~~~--~~~~i~~~~d~~~~~~~~sgl~- 304 (449) T protein:vir:10 232 GESFLKNAARQLNVNFEKEIDFTNLASLYG----VSIDELQDKFNEVAGEINRGN--DVLMTTQGATVTPLVTSVADPT- 304 (449) T ss_pred HHHHHHHHHHHHhhhhhhhhhhhhhhHHhh----CCchHHHHHHHHHHHHHhccc--hheeecCCcceEEEecccCChh- Confidence 001110100 0011111111111 112233334443333222222 2345677778998887765432 Q ss_pred HHHHHHHHHHHHHhCCCHHHhcCC------cHHHHHHHHH------HHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEe Q lcl|NC_010576. 281 SDVRQLQQDFYNQMGITEAILNGT------ANEQQTLGYY------NRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYY 348 (447) Q Consensus 281 ~~~~~~~~~Ia~~fgVP~~~l~g~------~~e~~~~~f~------~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~ 348 (447) +.......+||.+-|||..+|-|+ +++. ...|| +.-|.|.++.+-+.|-+.-+.. .-..+.|. T Consensus 305 d~l~~~~q~iaaa~~IP~t~L~Gqsp~glnst~D-~~nyyd~i~~~Q~~l~p~le~l~~~l~~s~~g~----~~~d~~i~ 379 (449) T protein:vir:10 305 ATYNVNLQTAAAGVDIPTRILIGNQQAERSSTED-QKYFNARCQSRRVDLSFEIEDFCDKLIELKIID----AVAKKAVI 379 (449) T ss_pred HHHHHHHHHHHHHhCCCeeeeeccCccccccchh-HHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCC----CCCceeEE Confidence 223445566999999998887432 2232 23333 2236677766666664443322 11256777 Q ss_pred cchhhhcCHHHHHH-------HHHHHHhCC---CcCHHHHHHHhCCCCCCCccccccccccccchhhcccccCCCCCCCC Q lcl|NC_010576. 349 RNPFKLVPVEQLAT-------VADVLTRNA---IYTPNEIRELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSD 418 (447) Q Consensus 349 ~~~l~~~d~~~~~~-------~~~~~~~~G---~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 418 (447) +++|...+.+++++ ++++++++| +++++|+|+.+|++|..+. . . +....+. +..+ T Consensus 380 f~pL~~~t~kEkAei~k~~A~a~~~~~~ag~~~~~~~~EiR~~~~~~~~~~~-~--~-~~e~~de-----------~~~~ 444 (449) T protein:vir:10 380 WDDLNEQTGTEKLTNAKTMGEINQTMLGSGDNPAFSREEIRTAAGYDNDDEE-P--L-GEEDGDE-----------EDKA 444 (449) T ss_pred eCCCCCCCHHHHHHHHHHHHHHHHHHHHccccCCcCHHHHHHHhcccCCCCC-C--C-CCCCCcc-----------cccc Confidence 78999999998866 445666666 9999999999999985431 1 0 0000000 0000 Q ss_pred CCCcCCCCCCCcccccc Q lcl|NC_010576. 
419 QPATASTDPLNNVSTSA 435 (447) Q Consensus 419 ~~~~~~~~~~~~~~~~~ 435 (447) .++ +| T Consensus 445 ~d~------------~a 449 (449) T protein:vir:10 445 TDS------------AA 449 (449) T ss_pred CCc------------CC Confidence 000 00 No 134 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=98.93 E-value=1.1e-09 Score=69.59 Aligned_cols=420 Identities=10% Similarity=0.015 Sum_probs=181.4 Q ss_pred CchhHhhhhhcccccCCccccc--cccccccccccccccccccccC------Cccc-ccchh-hhhhHHHHHHHHHHHHh Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQ--NTNDFLTPSNGMTSFGGYYGRG------QSNY-SRSYS-YNKADLIKSVITRIALD 70 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~------~~~~-~~~~~-~~~~~~v~~cv~~ia~~ 70 (447) |-++++-.... .+...... ...+-.+....... ++..+.+ .... ...+. +..++.+..+|+.+.+. T Consensus 1 m~~~~~~~~a~---~~~~~~~~~~~~y~aa~~~~~~~~-~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~~ 76 (495) T protein:vir:10 1 MNMTPSGYQSL---ASGLLVPVGASAYEGASGGHRWQD-IGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWVAA 76 (495) T ss_pred CCccccccccc---chhhhhHHHhhhhhccccCcccCC-CCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHh Confidence 88887621100 00000000 00000000000000 0000000 0000 00111 34567888999988877 Q ss_pred hccCceEEEEEcCCCceeccccchHHHHHhhhc--CcccCHHHHHHHHHHHHHhcCCeeEEEeeccCC---cccceee-e Q lcl|NC_010576. 71 ASMVDFKHLKIDPISGNQTPMPSGLINVLTRSA--NIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVD---PDSGSFD-I 144 (447) Q Consensus 71 ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~P--N~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~---~~~~~~~-~ 144 (447) +-.--|.. +-...+......-..+...+..++ ...++.+.+...++..++..|+||+.+...... ..+-.+. 
+ T Consensus 77 vVG~Gi~p-~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~~~~~~lqli 155 (495) T protein:vir:10 77 AVGNGLTP-RWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEGLSVPLQLQII 155 (495) T ss_pred hcCCCccc-ccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccCCCCccceEEEEe Confidence 73222321 111111111001112222222222 224677888888899999999999987654321 1111121 1 Q ss_pred ccCCCccee-----------------eecCCceEEEEeeeccccc-------ceeeecccccccccccccc---cccchh Q lcl|NC_010576. 145 NTARVGKIM-----------------QFFPRQVMVRVWNDNTGLE-------QDLLVSKENCIIIESPFYA---ILNDTN 197 (447) Q Consensus 145 ~~~~~~~~~-----------------~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~v~~~~~~~~~---~~~~~~ 197 (447) .+.++...+ .-+...+.|.++....+.. ....++..+|+|+.....+ +...+. T Consensus 156 epd~l~~~~~~~~~~~g~~i~~GIe~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~~rvpA~~vlH~f~~r~gQ~RGis~la 235 (495) T protein:vir:10 156 EPDMLASDIPDETLPSGGYVKGGIRFSNGGKRKAYCFYRNHPAESSLIGDPVDTVWIKAEHVLHVTVLTVRSDAGAPWFQ 235 (495) T ss_pred chhhcCCCCCCCCCCCCCEEEeceEECCCCceEEEEEeecCCCcccccccccceeeechhheEeccccCCCcccCcchhH Confidence 111111100 0112233444443333221 2345888999999632211 112221 Q ss_pred HHHHHHHHHHHHHHHHHH-HhhcCcccceeeeCCcCChHHHHHHHH-HHHHHHHHHhc-cCCcceeecCCCceeeecCCC Q lcl|NC_010576. 198 QTLRMLEQKIKLMNSQDN-RASSGKLNGFIQFPYSTKSTARAAQAA-RRKQEIENEMA-NNKYGVATLDTQEKFVSAGMG 274 (447) Q Consensus 198 ~~~~~~~~~~~~~~~~~~-~~n~~~~~gvl~~~~~~~~~~~~~~~~-~~~~~~~~~~~-~n~~~~~vl~~g~~~~~l~~~ 274 (447) ++..+...-....+... ..-.+...++|+.+..-... ..... .-.+.-..... 
=..|.|..|..|.+++.++.+ T Consensus 236 -~i~~l~~l~~y~dael~~a~i~A~~~~fi~~~~~~~~~--~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~ 312 (495) T protein:vir:10 236 -LLLRLNELDQYEDAELVRKKTAALFAAFIQEATADSTG--GPTIGQPKRSKGGKRITGLNPGTLQYLQPGQEVKFSNPA 312 (495) T ss_pred -HHHHHHHhhHHHHHHHHHHHHhhhheeeeecCCCcccc--ccccCccccccCcccceecCCceeeecCCCCeeeeeCCC Confidence 11122211111111111 12344556677654221100 00000 00000000001 135678889999999998876 Q ss_pred h-hhhhHHHHHHHHHHHHHHhCCCHHHhcCCcH-----H-------------H-HHHHHHHHHHhHHHHH-HHHHHHhhc Q lcl|NC_010576. 275 L-QNNLLSDVRQLQQDFYNQMGITEAILNGTAN-----E-------------Q-QTLGYYNRCVDVLLQY-VTDAISRIA 333 (447) Q Consensus 275 ~-~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~-----e-------------~-~~~~f~~~ti~P~~~~-ie~~l~~kL 333 (447) . .....+..+.+.+.||..+|||.+.|.|+.+ . + |..-+....+.|+.+. ++.++-... T Consensus 313 ~p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~ 392 (495) T protein:vir:10 313 DVGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVNYSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGA 392 (495) T ss_pred CCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC Confidence 4 4445566788899999999999999976432 1 0 1111333445564443 555555443 Q ss_pred CChhH----hcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcccccccc-ccccchhhccc Q lcl|NC_010576. 334 LTKTA----VSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFN-RNIADGNQVGG 408 (447) Q Consensus 334 l~~~e----~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~-~~~~~~~~~~~ 408 (447) ++... +..-..+++-.-...-.|+...+++...++++|+.|.-|+-+..|++|-+- .++... 
..........- T Consensus 393 i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~~G~D~~~v--~~q~a~e~~~~~~~Gl~~ 470 (495) T protein:vir:10 393 VVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAGFAPISDKQAERGYDMEEL--FDMISDANQLIDEYDLRL 470 (495) T ss_pred CCCCCchhhhHhhhccccccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHH--HHHHHHHHHHHHHcCCCC Confidence 32100 111123445555556679999999999999999999999999999998532 111100 00000000000 Q ss_pred ccCCCC---CCCCCCCcCCCCCCCcc Q lcl|NC_010576. 409 INTPGQ---ITSDQPATASTDPLNNV 431 (447) Q Consensus 409 ~~~~~~---~~~~~~~~~~~~~~~~~ 431 (447) ...+.. ....+++.+ ++..++| T Consensus 471 ~~~p~~~~~~~~~~~~~~-~~~~~~e 495 (495) T protein:vir:10 471 DSDPRYVNGSGAEQKSVM-EAALNNE 495 (495) T ss_pred CCCCCcCCCccCCCCCCC-CCCCCCC Confidence 000000 000011100 1111111 No 135 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=98.92 E-value=6.1e-09 Score=65.63 Aligned_cols=416 Identities=12% Similarity=-0.014 Sum_probs=168.9 Q ss_pred Cc--hhHhhhhhcccccCCcccccccccccccccccccccccccc-------CCcccccchhhhhhHHHHHHHHHHHHhh Q lcl|NC_010576. 1 MA--SSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGR-------GQSNYSRSYSYNKADLIKSVITRIALDA 71 (447) Q Consensus 1 Mg--~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~v~~cv~~ia~~i 71 (447) |- +.+.-.++...-..+.......... ...+-..+..+-... .+..+..-...++.+.|.+|++.+...| T Consensus 1 ~~~~i~~~~g~~~~~~~~~~~~~~~ia~~-~~~~~~~~~~~~~p~~~~il~~~~~~~~~y~~m~~D~~i~s~l~~Rk~av 79 (491) T protein:vir:79 1 MSKGLWVSPTEFVKFGEPDKSLSSQIATR-ARSIDFFALGMYLPNPDPVLKALGKDIRVYRELRADAHVGGCVRRRKAAV 79 (491) T ss_pred CCCeeeCCCCCcccccccchhHHHHHhhh-ccccccccccccCcchhHHHhhccCCHHHHHHHhhChHHHHHHHHHHHHH Confidence 32 2221112221111111101111100 000000011110000 0011111123456788999999999999 Q ss_pred ccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcc--cceeeeccCCC Q lcl|NC_010576. 
72 SMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPD--SGSFDINTARV 149 (447) Q Consensus 72 a~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~--~~~~~~~~~~~ 149 (447) .+++|.|...+.+. .....+..+|. ++ ...++++.|+ +.+++|.+...+++...+.. ...+.+.+.+ T Consensus 80 ~~~~w~i~~~~~~~----~~a~~i~e~l~-~~----~~~~~i~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~- 148 (491) T protein:vir:79 80 KALEWGLDRGKAKS----RVAKSIADVFA-DL----DLSRIATEML-DAVLYGYQPMEITWGKVGNYIVPIDVVGKPAD- 148 (491) T ss_pred hCCCcEEecCCCCH----HHHHHHHHHHh-cC----CHHHHHHHHH-HhhhhcceeEEEEEeecCCeeeEEeeeeeccc- Confidence 99999875432211 12234556664 33 3444555543 46679999988877543221 1112222211 Q ss_pred cceeeecCCceEEEEeeecccccceeeecccccccccccccccccchhHHHHHHH-----HHHHHHHHHHHHhhcCcccc Q lcl|NC_010576. 150 GKIMQFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLE-----QKIKLMNSQDNRASSGKLNG 224 (447) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~n~~~~~g 224 (447) .+.....+...++.. ... ...+.++....++.++...++.......+..+. .......-......-|.|-- T Consensus 149 -~f~~d~~~~l~l~~~-~~~--~~g~~lp~~k~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~ 224 (491) T protein:vir:79 149 -WFVYDPENQLRFRSK-EHW--VQGEELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPML 224 (491) T ss_pred -ceeeccCCceEEeec-CCC--CCceeecCCCeEEEEecCCCCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeE Confidence 111111122222211 111 122344445444444322222111122222221 11222222222234566655 Q ss_pred eeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCC--h-h-hhhHHHHHHHHHHHHHHhC-CCHH Q lcl|NC_010576. 225 FIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMG--L-Q-NNLLSDVRQLQQDFYNQMG-ITEA 299 (447) Q Consensus 225 vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~--~-~-~~~l~~~~~~~~~Ia~~fg-VP~~ 299 (447) +.+++...++++.+. +.+.+.+. ..++ .++++.|++++-+... . . +.+.+-.++..++|+++.- =... 
T Consensus 225 igky~~~a~~~ek~~----l~~al~~~-~~~a--~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlT 297 (491) T protein:vir:79 225 VGKHPRSASDAETNL----LLDRLEDM-VQDA--VAVIPDDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIALLGQNQT 297 (491) T ss_pred EEecCCCCCHHHHHH----HHHHHHHH-hcCe--EEEecCCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHHhhhhhc Confidence 667776655543333 33333333 3333 4556677766655432 2 2 2233446778888887541 1100 Q ss_pred H-hcCCcH-HHHHHHHHHHHHhHHHHHHHHHHHhhcCCh-hHhcC--CceEEEecchhhhcCHHHHHHHHHHHHhCCC-c Q lcl|NC_010576. 300 I-LNGTAN-EQQTLGYYNRCVDVLLQYVTDAISRIALTK-TAVSQ--GQVLVYYRNPFKLVPVEQLATVADVLTRNAI-Y 373 (447) Q Consensus 300 ~-l~g~~~-e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~-~e~~~--g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~-~ 373 (447) - =+|+.. .+....-..+-+.-.++.|++.||+ |+.+ ..+-. ...++|.+.... .+.+.+++.+.++++.|+ + T Consensus 298 t~~~gs~a~~~vh~~v~~~i~~~D~~~i~~tln~-li~~l~~~N~~~~~~p~f~~~e~e-e~~~~~a~~~~~L~~~G~~i 375 (491) T protein:vir:79 298 TEATSTRASAQAGLEVTDDIRDGDKAIVVEAMNM-LIRWICDLNFDGAARPVFDMWEQE-QVDEIQAGRDEKLTRAGARF 375 (491) T ss_pred cCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhcCCCCCcceEeecCcC-chhHHHHHHHHHHHhCCCcc Confidence 0 012221 1222333455566677788888875 5433 12211 113445544432 223568999999999986 7 Q ss_pred CHHHHHHHhCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCccc----ccc------cCCccCcC Q lcl|NC_010576. 374 TPNEIRELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVS----TSA------IENGSLTD 443 (447) Q Consensus 374 t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~------~~~~~~~~ 443 (447) +..++|+.+|+|+-+.+ ....+.. .+. ... ..+.......+....+...+... ..+ ---..-.+ T Consensus 376 ~~~~~~e~~Gip~~~~~--e~~~~~~-~~~-~~~--~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~i~~~l~~ 449 (491) T protein:vir:79 376 TPAYFKRAYNLQDGDLD--ERPLPVS-AVD-AVG--AASFAEFEAPDQDALDAALNALSARDLNADAQALVAPLLKRIAN 449 (491) T ss_pred CHHHHHHHhCCCCCCCC--ccccCcC-ccc-ccc--cccccccCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 99999999999864432 1111110 000 000 00000000000000000000000 000 00011123 Q ss_pred CCCC Q lcl|NC_010576. 
444 GGSY 447 (447) Q Consensus 444 ~~~~ 447 (447) .+|| T Consensus 450 ~~s~ 453 (491) T protein:vir:79 450 GASA 453 (491) T ss_pred cCCH Confidence 4555 No 136 >protein:vir:106491 Length: 646 # NCBI annotation: Pas4 # Family: family:all:2798 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024790;genbank:gi:48697405;genbank:GeneID:2846148 Probab=98.91 E-value=6.1e-09 Score=65.61 Aligned_cols=425 Identities=11% Similarity=0.044 Sum_probs=184.9 Q ss_pred CchhHhhhhhcccccCCccccc-------------ccc-ccccccccccccccccccCCcccccchhhhh-hHHHHHHHH Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQ-------------NTN-DFLTPSNGMTSFGGYYGRGQSNYSRSYSYNK-ADLIKSVIT 65 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~-------------~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~v~~cv~ 65 (447) |.++ ++|..+.. ..+ ....+..-+...+.+...+|..--+ .++. -+-++..|. T Consensus 1 ~~~~----------rPk~~p~~p~~~~~arrr~LtaAsa~l~~~~~~~~kt~~~~~~~WQ~eAW--~~~d~vpELry~vg 68 (646) T protein:vir:10 1 MALL----------KPKSAPPEPFGAEVARRIALAGATAQVDLGASSSWKTWKFGNKDWQTEGW--RLYDIIPEHHFLAG 68 (646) T ss_pred Cccc----------CCCCCCCCcccccccchhhhhhccccccCCCcceeecCCCcchhhhHHHH--HHHhhhhhHhhHhh Confidence 5552 22211110 000 0000000011111121222211111 1223 256778899 Q ss_pred HHHHhhccCceEEEEEcCCCcee-ccccchHHHHHhhhcCcc-cCHHHHHHHHHHHHHhcCCeeEEEe---eccCCcccc Q lcl|NC_010576. 66 RIALDASMVDFKHLKIDPISGNQ-TPMPSGLINVLTRSANID-QTGRSFVFDLLYSLLDEGQIAMVPI---DTTVDPDSG 140 (447) Q Consensus 66 ~ia~~ia~lp~~~~r~~~~~~~~-~~~~~~l~~lL~~~PN~~-~t~~~f~~~~~~~lll~Gna~i~~~---~~~~~~~~~ 140 (447) .|++.++++.+..-+.++.|... .+.++++..+-. .|=.. .-..++++.+..+|-+-|++|++.. ......... 
T Consensus 69 W~~~a~SR~rL~aseiddtG~~tg~v~~~~v~~iv~-~~~Gg~~gQ~qlLkr~~~~ltV~GE~wiv~~~~~~~~~~~~~~ 147 (646) T protein:vir:10 69 RIGDSVAQARLYVTEVDDTGEETGEVQDERIKRLAA-VPLGTGSQRDDNLRLAGLDLAVGGECWIVGEGAATSPEAAEGS 147 (646) T ss_pred hhhhhhceeeeeeeeecCCCCCcCccchHHHHHHhh-hhccchhhHHHHHHHHHhheecccceEEeeccccCCCCCCccc Confidence 99999999999887777555432 334455554443 23222 2346889999999999999999852 222233344 Q ss_pred eeeeccCCCcceeeecCCceEEEEeeecccccceeeeccccccccc--cccccccc----chhHHHHHHHHHHHHHHHHH Q lcl|NC_010576. 141 SFDINTARVGKIMQFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIE--SPFYAILN----DTNQTLRMLEQKIKLMNSQD 214 (447) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~--~~~~~~~~----~~~~~~~~~~~~~~~~~~~~ 214 (447) |+.+....+.+ ..+...++.-.. .+...++++...|++ +| .|...-.. ...+.+..+..-..+-.... T Consensus 148 W~vvt~~Ev~~----tg~~~~i~~p~~-~~g~~~v~~~~~d~l-vRiW~P~Prr~~epDSpvra~l~~l~Ei~~lt~~I~ 221 (646) T protein:vir:10 148 WFVVTGSAISR----TGDEIAVRRPQQ-RGGSKLVLVDGQDIL-IRCWRPHPNDTDQADSFTRSAIVPLREIELLTKREF 221 (646) T ss_pred eeeecHHHhcc----CCCeeeeecCcc-CCCCCcceecCCceE-EEEecCCcccccCCcchhHHHHHHHHHHHHhhhHhH Confidence 55544333211 122222222211 112233334344442 23 22211111 22222222222211111110 Q ss_pred HH-hhcCcccceeeeCCcCC------h-HHHHHHHHHHHHHHHHHhcc----CCcceeecCC-Cc------eeeecCCC- Q lcl|NC_010576. 215 NR-ASSGKLNGFIQFPYSTK------S-TARAAQAARRKQEIENEMAN----NKYGVATLDT-QE------KFVSAGMG- 274 (447) Q Consensus 215 ~~-~n~~~~~gvl~~~~~~~------~-~~~~~~~~~~~~~~~~~~~~----n~~~~~vl~~-g~------~~~~l~~~- 274 (447) .. +.-..-+|||=+|..++ + -.......-+.+.-...+.+ .+.-++++.. |. +++.+... 
T Consensus 222 aaakSRL~GnGvLfvP~e~s~p~~~~~~a~~~~l~~~l~qaa~tAi~De~S~aA~vPiia~~P~E~i~~~~~ik~l~f~~ 301 (646) T protein:vir:10 222 AELDSRLTGAGIMFLPEGVDFPRGEEDPAGLAGFMAYLQRAAAASMADQSRASAMVPIMATIPNEMMEHLDKIKPLTFWS 301 (646) T ss_pred HHHHHHHhcCceeeeccccccCCCCCCCcchhHHHHHHHHHHHhhhcCCCCccceeeeEEeeChHHHhhhhcceeeccCc Confidence 00 11111245655443221 1 11111112222211122222 2222333322 11 33444443 Q ss_pred -hhhhhHHHHHHHHHHHHHHhCCCHHHhcCCcHHH------HHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCC----- Q lcl|NC_010576. 275 -LQNNLLSDVRQLQQDFYNQMGITEAILNGTANEQ------QTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQG----- 342 (447) Q Consensus 275 -~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~e~------~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g----- 342 (447) .+..-+..++..+..||...-|||+.|-|.++.+ ....=++ .|.|.+..|+++|++.+|.+.-...| T Consensus 302 eite~aiktR~daI~RlA~glDIppE~LLGlgd~NHWtAWqI~de~vr-HI~P~l~~ic~AlT~~~Lrp~Le~eGi~dp~ 380 (646) T protein:vir:10 302 ELSAEITPMKDKAIARLASSAEIPGEVLTGIGDANHWTAWLISDEGIR-WIRGYLGLIADALTRGFLRRALESMGVTNPE 380 (646) T ss_pred hhhHHHhhhHHHHHHHHHhccCCchhheeeccccceeeeeeeccccch-hhhhHHHHHHHHHHhhHHHHHHHHcCCCChh Confidence 2334567888999999999999999885543211 1111244 69999999999999999865322223 Q ss_pred -ceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccc----c------cccccc--chhhcc-- Q lcl|NC_010576. 343 -QVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANE----L------FNRNIA--DGNQVG-- 407 (447) Q Consensus 343 -~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~----~------~~~~~~--~~~~~~-- 407 (447) |-|=||.+.|.. |. ++.+-...+++.|.+|-...|+.+|+.--+++.-.+ + .+++++ |..|.. 
T Consensus 381 kyvvW~DaS~Lt~-~p-d~~deA~qa~drGAIt~eAlrk~~Gf~~dd~pt~~E~~~~~~~~~v~~~P~Lil~P~~qa~~~ 458 (646) T protein:vir:10 381 RYAFAFDTSTLAS-KP-NRLDEAIQLHERNLIKDEEVVKAGAFSVDQMPTVQERAVQILLGLVKTQPDLILDPAIQAALG 458 (646) T ss_pred HeEEeecCccccc-CC-CCcHHHHHHHHcCCccHHHHHHHhcccccccCChHHHHHHHHHHHhcCCccccccchhhcccc Confidence 345677777643 22 233333457889999999999999998766652111 0 011111 121110 Q ss_pred -----------cccCCCCCCCCCCCcCCCCCCCcccccccCCc-----cCcCCCC---C Q lcl|NC_010576. 408 -----------GINTPGQITSDQPATASTDPLNNVSTSAIENG-----SLTDGGS---Y 447 (447) Q Consensus 408 -----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~---~ 447 (447) ...+.++...+.+..++.++.+...+....+. ++.++|- | T Consensus 459 ~P~~~~~~lpp~~~~~~dg~~~~~e~~g~~~~~E~~~~pda~~~~a~~~~~~~r~~~~~ 517 (646) T protein:vir:10 459 LPAVQSVGLPPTAAQRTDGDLDDDESEGAPNGGEAPDQPDADEARAITAALDRRIALAA 517 (646) T ss_pred CCCcCccccCCcccccccCCCCChhhcCCCCCCccCCCCCCCccccccccccccchhhh Confidence 00111111111111111111111111111111 1222221 1 No 137 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=98.89 E-value=1.9e-08 Score=62.91 Aligned_cols=406 Identities=11% Similarity=-0.034 Sum_probs=167.5 Q ss_pred Cc--hhHhhhhhcccccCCccccccccccccccccc-cccccc--------cccCCcccccchhhhhhHHHHHHHHHHHH Q lcl|NC_010576. 1 MA--SSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGM-TSFGGY--------YGRGQSNYSRSYSYNKADLIKSVITRIAL 69 (447) Q Consensus 1 Mg--~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~-~~~~~~--------~~~~~~~~~~~~~~~~~~~v~~cv~~ia~ 69 (447) |- +.+---++...-..+......... ...+. .+.++. ....+..+..-...++.+.|.+|++.+.. 
T Consensus 1 m~~~i~~~~g~p~~~~~~~~~~~~~ia~---~~~~~~~~~~~~~~~~~~~iLr~~~~~~~~y~~m~~D~~i~s~l~~Rk~ 77 (491) T protein:vir:10 1 MSKGLWVSPTEFVTFGEPDKSLSSQIAT---RARSIDFFALGMYLPNPDPVLKALGKDIRVYRELRADAHVGGCVRRRKA 77 (491) T ss_pred CCCceeCCCCCccCcccCChHHHHHHHh---hhcccccccccCCccchHHHHHhcCCCHHHHHHHhhChHHHHHHHHHHH Confidence 32 111101111100000000000000 00000 000000 00000011111224567889999999999 Q ss_pred hhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcc--cceeeeccC Q lcl|NC_010576. 70 DASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPD--SGSFDINTA 147 (447) Q Consensus 70 ~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~--~~~~~~~~~ 147 (447) .|.+++|.|...+.+. .....+..+|. ++ ...++++.|+ +.+++|.+...+++...+.. ...+.+.+. T Consensus 78 av~~~~w~i~~~~~~~----~~~e~v~e~l~-~~----~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~ 147 (491) T protein:vir:10 78 AVKALEWGLDRGKAKS----RVAKSIADVFA-DL----DLSRIVTEML-DAVLYGYQPMEITWGKVGNYIVPIDVVGKPA 147 (491) T ss_pred HHhCCCcEEecCCCCH----HHHHHHHHHHh-cC----CHHHHHHHHH-HhhhhcceeEEEEEeecCCeeEEEEeeeecc Confidence 9999999875432211 12234556664 33 3456666665 56789999988877543221 111222222 Q ss_pred CCcceeeecCCceEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHH-----HHHHHHHHHHHhhcCcc Q lcl|NC_010576. 148 RVGKIMQFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQ-----KIKLMNSQDNRASSGKL 222 (447) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~n~~~~ 222 (447) +. +.....+...++. .+.. 
...+.++....++.++...++.......+..+.- ......-......-|.| T Consensus 148 ~~--f~~d~~~~l~~~~-~~~~--~~g~~l~~~k~i~~~~~~~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P 222 (491) T protein:vir:10 148 DW--FVYDPENQLRFRS-KDHW--MQGEELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSP 222 (491) T ss_pred cc--eeeccCCceEEec-CCCC--CCcceecCCCEEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCC Confidence 11 1111112222221 1111 1223344444444443222221112222222211 11111112222344555 Q ss_pred cceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCC--Chhh--hhHHHHHHHHHHHHHHh-CCC Q lcl|NC_010576. 223 NGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGM--GLQN--NLLSDVRQLQQDFYNQM-GIT 297 (447) Q Consensus 223 ~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~--~~~~--~~l~~~~~~~~~Ia~~f-gVP 297 (447) --+.+++...++++.+ ++.+.+.+. ..++ .++++.|++++-+.. +... .+.+-.++..++|+++. |=. T Consensus 223 ~~igky~~~a~~~ek~----~l~~al~~~-~~~a--~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqt 295 (491) T protein:vir:10 223 MLVGKHPRSASDGEKN----LLLDCLEDM-VQDA--VAVVPDDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIALLGQN 295 (491) T ss_pred eEEEecCCCCCHHHHH----HHHHHHHHH-hcCc--EEEecCCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHHhhhh Confidence 5566776655554333 334444333 2233 456677776665543 2222 23344678888887762 211 Q ss_pred HHH-hcCCcH-HHHHHHHHHHHHhHHHHHHHHHHHhhcCCh-hHhcCC--ceEEEecchhhhcCHHHHHHHHHHHHhCCC Q lcl|NC_010576. 298 EAI-LNGTAN-EQQTLGYYNRCVDVLLQYVTDAISRIALTK-TAVSQG--QVLVYYRNPFKLVPVEQLATVADVLTRNAI 372 (447) Q Consensus 298 ~~~-l~g~~~-e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~-~e~~~g--~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~ 372 (447) ..- =+|+.. .+.......+-+.-.++.|++.||+ |+.+ ..+-.+ .+.+|.++... 
.+.+.+++.+.++++.|+ T Consensus 296 lTt~~~gs~a~~~vh~~v~~di~~~D~~~i~~tln~-li~~l~~~N~~~~~~p~f~~~~~~-e~~~~~a~~~~~L~~~G~ 373 (491) T protein:vir:10 296 QTTEATSTRASAQAGLEVTDDIRDGDKAVVSEAMNM-LIRWICDLNFDGADRPVFDMWEQE-QVDEIQAGRDQKLTQAGA 373 (491) T ss_pred cccCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhcCCCCCcceEEecCcC-chhHHHHHHHHHHHhCCC Confidence 000 012221 1222333455556677777777774 5432 111111 12334443332 344789999999999997 Q ss_pred -cCHHHHHHHhCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCccccccc--------------- Q lcl|NC_010576. 373 -YTPNEIRELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTSAI--------------- 436 (447) Q Consensus 373 -~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------- 436 (447) ++..++|+.+|+|+-+.+. ...+.. .+ +.. ...... +.+.+...+.+... T Consensus 374 ~i~~~~i~e~~Gip~~~~~~--~~~~~~-~~--~~~-~~~~~~--------~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 439 (491) T protein:vir:10 374 RFTPAYFKRAYNLQDGDLDE--RPLPVS-AV--DTV-GAASFA--------EFEAPDQDALDAALNTLSARDLNADAQAL 439 (491) T ss_pred cCCHHHHHHHhCCCCCCcCc--cccccC-CC--CCc-cccccc--------ccCCCCCCchHHHHHHHHHHHHHHHHHHH Confidence 7999999999998644321 111100 00 000 000000 00000000000000 Q ss_pred ---CCccCcCCCCC Q lcl|NC_010576. 437 ---ENGSLTDGGSY 447 (447) Q Consensus 437 ---~~~~~~~~~~~ 447 (447) -...--+.+|| T Consensus 440 ~~~i~~~l~~~~s~ 453 (491) T protein:vir:10 440 VAPLLKRIANGASA 453 (491) T ss_pred HHHHHHHHHhcCCH Confidence 00111234555 No 138 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=98.85 E-value=2.8e-08 Score=62.02 Aligned_cols=366 Identities=10% Similarity=0.080 Sum_probs=168.7 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccc-cchhh---h-hhHHHHHHHHHHHHhhccCc Q lcl|NC_010576. 
1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYS-RSYSY---N-KADLIKSVITRIALDASMVD 75 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~---~-~~~~v~~cv~~ia~~ia~lp 75 (447) |.+-..-. ..-.++. .+..+......++..........+.... ....| + +.+.|++|++.+...|.+++ T Consensus 3 ~~~~~~p~----~~~~~~~--~~~~~~~~~~~g~~~~D~~lr~~gg~~~~~~~l~~~m~e~D~~v~s~l~~Rk~av~~~~ 76 (446) T protein:vir:98 3 MEVRNAPT----PAIRRRT--IYAMEHLGLATSYLSEDGGYKRAGKPTYQQLSAWDEAAQTEPIIAQGLDSIALSVLNKV 76 (446) T ss_pred ccccCCCc----hhhhhhh--hhccccchhhcccCCcchHhhhcCCChHHHHHHHHHHHhcchHHHHHHHHHHHHhhcCC Confidence 44422100 0000000 0001111111111111111111122111 11222 2 36889999999999999999 Q ss_pred eEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCc--ccc-----eeeeccCC Q lcl|NC_010576. 76 FKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDP--DSG-----SFDINTAR 148 (447) Q Consensus 76 ~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~--~~~-----~~~~~~~~ 148 (447) |.|- . + ..+ ...-+..+|...+ +++....+.+.+.+|.++..+++..... .+. .+++.+.. T Consensus 77 w~V~---p-~-~~~-~a~~v~~~l~~~~------~~~~~~~~ldai~~G~s~~Eivw~~~~g~~~p~~~~d~~~~~~~~~ 144 (446) T protein:vir:98 77 GPYQ---H-G-DKR-IKKFIDDQLRNRA------KTWISHCVKSIMTYGFSLSEQIYAHGARDNMPATVLDDIVNYHPLQ 144 (446) T ss_pred ceec---C-c-cHH-HHHHHHHHHhhcC------chhHHHHHHHHHhhCceeeeEEEeecccccccchhhcccccccccc Confidence 9863 2 1 111 1223556664321 2444455678889999998887753211 000 00110000 Q ss_pred CcceeeecCC---------ceEEEEee-------------e----cccccceeeecccccccccccccccccchhHHHHH Q lcl|NC_010576. 149 VGKIMQFFPR---------QVMVRVWN-------------D----NTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRM 202 (447) Q Consensus 149 ~~~~~~~~~~---------~~~~~~~~-------------~----~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 202 (447) . .|... ......+. + ....+.++.+|....++++....+......+.+.. 
T Consensus 145 ~----r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~iP~~kfi~~~~~~~~~~p~G~gLlr~ 220 (446) T protein:vir:98 145 V----MLIANDNGRIVDGDTVTASQYKSGYWVPLPPYRIGDPPKKVDVVGSHVRLPSHKRLFINYNTKGNNPWGTSCLTS 220 (446) T ss_pred c----eeeeccCCccccccccchhhcccccccCcccchhhhhhhhcccCcccccccccceEEEEecCCCCCccccchHHH Confidence 0 01000 00000000 0 00111223344455444443222221111222222 Q ss_pred HHHHH-----HHHHHHHHHhhcCcccceeeeCCcCChHHH---------HHHHHHHHHHHHHHhccCCccee---ecCCC Q lcl|NC_010576. 203 LEQKI-----KLMNSQDNRASSGKLNGFIQFPYSTKSTAR---------AAQAARRKQEIENEMANNKYGVA---TLDTQ 265 (447) Q Consensus 203 ~~~~~-----~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~n~~~~~---vl~~g 265 (447) +.-.. ....-......-|.|=-+.+++...++++. +..++++.+.+.+. ..+++.++ +++.| T Consensus 221 ~~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~av~~~-~~da~~ii~~~~~P~g 299 (446) T protein:vir:98 221 VLDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDALRRL-STDSGLVLTQLSKEQP 299 (446) T ss_pred HHHHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHHHHHhc-cccceeeeecccCCCC Confidence 22111 111111122334444445666544332211 11223344444322 22333333 34899 Q ss_pred ceeeecCCChhh-hhHH-HHHHHHHHHHHHhCCCHHHhc------CCcH-HHHHHHHHHHHHhHHHHHHHHHHHhhcCCh Q lcl|NC_010576. 266 EKFVSAGMGLQN-NLLS-DVRQLQQDFYNQMGITEAILN------GTAN-EQQTLGYYNRCVDVLLQYVTDAISRIALTK 336 (447) Q Consensus 266 ~~~~~l~~~~~~-~~l~-~~~~~~~~Ia~~fgVP~~~l~------g~~~-e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~ 336 (447) ++++-++..... ...+ -.++..++|+++....--.++ |++. 
.+.......+.+.-.+++|++.||+.|+.+ T Consensus 300 ~eie~~ea~~~~~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~ala~vh~~V~~d~~~aDa~~i~~tln~~Li~~ 379 (446) T protein:vir:98 300 VQVGALTTGNNFSDSFERAISLCDNNMLMGMGIPNLLVQNRETTFGTGRASEIQLELFDGKINSIFDTVIHAFTEQVIGN 379 (446) T ss_pred ceEEeeccccCChhhHHHHHHHHHHHHHHHHhcccccccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 998877654332 2344 458889999998866533332 3322 222334456677789999999999988654 Q ss_pred hH-hcCC----------ceEEEecchhhhcCHHHHHHHHHHHHhCCCcCH---HHHHHHhCCCCCCCccc Q lcl|NC_010576. 337 TA-VSQG----------QVLVYYRNPFKLVPVEQLATVADVLTRNAIYTP---NEIRELTGKAPHPNPLA 392 (447) Q Consensus 337 ~e-~~~g----------~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~---NE~R~~~gl~p~~g~~~ 392 (447) -- +-.| .+++|++. ...|++..++++.++++.|+.++ +.+|+++|+|+-+.. . T Consensus 380 l~~lNf~~~~~~~~~~~~~~~~~~~--e~eDl~~~a~~~~~L~~~G~~~p~~~~~ire~~giP~~~~~-~ 446 (446) T protein:vir:98 380 LIRLNFDPALYPLASNTGYITRLPG--RATDLAALVEAIKQMHDMGFLVDGDKDHIRSITGLPDAISS-T 446 (446) T ss_pred HHHhCCCccccccccccccceeccC--ChhhHHHHHHHHHHHHhCCccccccHHHHHHHhCcCCCCCC-C Confidence 21 1111 12344433 35689999999999999998765 459999999875431 1 No 139 >protein:vir:97376 Length: 320 # NCBI annotation: putative portal protein # Family: family:all:11744 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762589;genbank:gi:115304290;genbank:GeneID:5130579 Probab=98.80 E-value=3.7e-10 Score=72.28 Aligned_cols=310 Identities=15% Similarity=0.193 Sum_probs=159.2 Q ss_pred CchhHhhhhhcccccCCccccccccccccccc---cccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSN---GMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFK 77 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~ 77 (447) ||+|+. |+++.-..-.-+.+-+.. --.|+-| .+.+ .+-+-.|+.||.- +. 
T Consensus 1 ~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~-------~~~~~~~~~~~~~-~~ 53 (320) T protein:vir:97 1 MGIFNF--------KKRETLTPELKESIIRQVTIEDESPFTG-----------TTDF-------NVRNEVAESIATY-LG 53 (320) T ss_pred CCcccc--------ccccccChhHHhhhhheeeeccCCCccc-----------cccc-------chhhHHHHHHHHH-hh Confidence 999863 333211000000000000 0011111 1111 2223345555432 11 Q ss_pred EEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeecC Q lcl|NC_010576. 78 HLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFFP 157 (447) Q Consensus 78 ~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (447) .|+... .++ .||. -|| .|++.++.+.++.--.|+++.....-++.+.+.+...+. +..-..| T Consensus 54 ~~~~~~---------~~~-~~~~--~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 115 (320) T protein:vir:97 54 AYKTSA---------KRL-SLLT--NNP-----SFLRRLVKHALHNKTTYVYKSPTYGWLITDSMTIEGLRA-RLTFTLP 115 (320) T ss_pred hhcccc---------cee-eeee--CCH-----HHHHHHHHHhhcccceEEeeCCccceeeecceeeeeeee-eEEEecC Confidence 243211 111 2232 222 699999999999999999876543332222222222111 1111122 Q ss_pred CceEEEEeeecccccceeeecccccccccccccccccchhHHHHHHHHHHHHHHHHH--HHhhcCcccceeeeCCcCChH Q lcl|NC_010576. 158 RQVMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQKIKLMNSQD--NRASSGKLNGFIQFPYSTKST 235 (447) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~n~~~~~gvl~~~~~~~~~ 235 (447) +.+..-+ .++++-+|+-.+.+|+.+.-.. .....+.++.++. 
...+.+++..+++.+-..+- T Consensus 116 D~FN~~V---------~mtvpfyD~~ILdnpl~gv~tq------e~gkM~g~a~~~v~kkL~~~~~IKafi~Tdid~GL- 179 (320) T protein:vir:97 116 DPFNSAV---------TMTVPFYDVGIIDSPLVEVDTE------EANKMLEAAYSAVMKKLHNTGAIKAFISSDIDVGL- 179 (320) T ss_pred cccceeE---------EEEeeeechhhhhhhhcccChH------HhhHHHHHHhhhhhhhccccceeEEEEecccchhH- Confidence 3222221 3456677887788888764322 2222233332222 22466778888887654433 Q ss_pred HHHHHHHHHHHHHHHHh--ccCCcceeecCCCceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCcHHHHHHHH Q lcl|NC_010576. 236 ARAAQAARRKQEIENEM--ANNKYGVATLDTQEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNGTANEQQTLGY 313 (447) Q Consensus 236 ~~~~~~~~~~~~~~~~~--~~n~~~~~vl~~g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~e~~~~~f 313 (447) ++.+++.+..+.++. ..-=.|+-+++.+-+++++..+............+++.+.-|+||..+|-|++++.+..+| T Consensus 180 --ee~kD~~~~kIk~mq~~A~~~nG~T~i~~~dDI~Qi~pDYS~sn~~D~~l~~t~alS~y~m~~~IL~GsAte~~~Iaf 257 (320) T protein:vir:97 180 --EKMKEESDSKIKAMLATAELLSGYTYIQRGDDVTQMMPDYTTSNVTDFAAMRTFAASQLSVSDKILDGSATDGEKVAV 257 (320) T ss_pred --HHHHHHHHHHHHHHHHHHHHhcCcccccCCcceeeecccccccchhHHHHHHHHHHhhcCCchhhccccCCcceeeeh Confidence 333333333333322 2213567899999999999988777767777788889999999999999999999999999 Q ss_pred HHHHHhHHHHHH---HHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc Q lcl|NC_010576. 314 YNRCVDVLLQYV---TDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNP 390 (447) Q Consensus 314 ~~~ti~P~~~~i---e~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~ 390 (447) +...+.|+++++ |-.|..++- +..++.|-. .+|.+.-|.+ ++ T Consensus 258 ~~~~V~PLL~Q~~~~Ek~Lvy~m~------~E~FVs~mt-------------------TGG~l~S~~~----------~~ 302 (320) T protein:vir:97 258 MFRFVEPILEQFREYEPSLIYAMR------DEFFVSFMT-------------------TGGMLNSNRV----------DG 302 (320) T ss_pred hhHhHHHHHHHhhhcCcceeeeec------cceeeeeee-------------------cCceeecccc----------cc Confidence 999999999997 555554441 123343321 1454444332 33 Q ss_pred cccccccccccchhhcccccCCCCC Q lcl|NC_010576. 
391 LANELFNRNIADGNQVGGINTPGQI 415 (447) Q Consensus 391 ~~~~~~~~~~~~~~~~~~~~~~~~~ 415 (447) ||.+-.|.. .. .++. |+. T Consensus 303 ~~~~~~~~~-~~---~~~~---~~~ 320 (320) T protein:vir:97 303 WGKEKAPNE-SK---GGDV---GDV 320 (320) T ss_pred cccccCCcc-cc---CCcc---cCC Confidence 444322110 00 0000 111 No 140 >protein:vir:102426 Length: 631 # NCBI annotation: gp11 # Family: family:all:2798 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655288;genbank:gi:109521851;genbank:GeneID:4157741 Probab=98.57 E-value=2.7e-07 Score=56.60 Aligned_cols=435 Identities=12% Similarity=0.078 Sum_probs=185.2 Q ss_pred CchhHhhhhhcccccCCc--ccc-ccccccc-ccccccc-ccccccccCCcccccchhhhh-hHHHHHHHHHHHHhhccC Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQ--NQN-QNTNDFL-TPSNGMT-SFGGYYGRGQSNYSRSYSYNK-ADLIKSVITRIALDASMV 74 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~--~~~-~~~~~~~-~~~~~~~-~~~~~~~~~~~~~~~~~~~~~-~~~v~~cv~~ia~~ia~l 74 (447) |.--+-|+-+...|...- ..+ ..-+.++ .+.--+. +++.+....|..--+ .++. .+-++-.|..|++.++++ T Consensus 1 ~~a~~~lr~~rrpkg~~~a~~r~L~aAs~~~~dpg~~~~~~~g~~~~~~WQ~eAW--~~~d~v~Elry~vgW~~~s~sr~ 78 (631) T protein:vir:10 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAW--EAVDLVGELRYYVGWRASSCSRC 78 (631) T ss_pred CCcccceeeeecCCCCCccchhhhhhhhccccchhhhhhhhcCCcccchhhHHHH--HHHHhhhhHHHHhhhhhhhhcee Confidence 665555443333332210 000 0011111 1111111 111111111111111 1222 356777899999999999 Q ss_pred ceEEEEEcCCCc-----eec-ccc-chHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccC-Cc--------- Q lcl|NC_010576. 75 DFKHLKIDPISG-----NQT-PMP-SGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTV-DP--------- 137 (447) Q Consensus 75 p~~~~r~~~~~~-----~~~-~~~-~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~-~~--------- 137 (447) .+..-+.+.+.+ .++ ..+ ..+..+...=+-.-+...++++.+..+|-+-|++|+++..... +. 
T Consensus 79 rL~as~idpDtg~ptg~iee~~~~~~~v~~~~~~i~gG~lgQ~~llkrl~~~ltV~GE~wiv~l~~p~~~~~~~pd~~~r 158 (631) T protein:vir:10 79 RLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVR 158 (631) T ss_pred eeEeeeeccCCCCCccccccCCchhHHHHHHHHhcCCCcchHHHHHHHHHhheecccceEEEEEeccCcCCCCCcccccc Confidence 998888887732 221 011 2233333333556678899999999999999999998744322 11 Q ss_pred -ccceeeeccCCCcceeeecCCc-eEEEEeeecccccceeeecccccc-cccccccccc----cchhHHHHHHHHHHHHH Q lcl|NC_010576. 138 -DSGSFDINTARVGKIMQFFPRQ-VMVRVWNDNTGLEQDLLVSKENCI-IIESPFYAIL----NDTNQTLRMLEQKIKLM 210 (447) Q Consensus 138 -~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~v~-~~~~~~~~~~----~~~~~~~~~~~~~~~~~ 210 (447) ...|+.+....+. ..-.+. ..+..- .+. .+......|++ .+-.|...-. +...+.+..+..-..+- T Consensus 159 ~~~~W~~vt~~ei~---~~~~g~g~~v~lp---~g~-~h~~~~~~D~l~RiW~P~prr~~e~dSpvra~l~~l~Ei~~~t 231 (631) T protein:vir:10 159 TRQEWYAVSKEEIK---KSNKGSGTNIVLP---TGE-EHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTT 231 (631) T ss_pred cccceeeccHHHHh---cccCcccceeecC---CCC-ccceecCCceEEEeeCCCcccccCCcchhHHHHHHHHHHHHhh Confidence 2233333221110 000000 111111 111 11222222221 1112211111 11222222222221111 Q ss_pred HHHHHH-hhcCcccceeeeCCcCCh---------------------HHHHHHHHHHHHHHHHHhcc----CCcceeecC- Q lcl|NC_010576. 211 NSQDNR-ASSGKLNGFIQFPYSTKS---------------------TARAAQAARRKQEIENEMAN----NKYGVATLD- 263 (447) Q Consensus 211 ~~~~~~-~n~~~~~gvl~~~~~~~~---------------------~~~~~~~~~~~~~~~~~~~~----n~~~~~vl~- 263 (447) ...... +.-..-+|||=+|..++- ....+..+-+.+.-...+.+ .+.-++++. 
T Consensus 232 ~~i~aaakSRl~gnGvlflP~els~P~~~~~~~~~~g~~v~~~~g~pa~~~l~~~l~q~a~tai~De~S~aA~vPii~~~ 311 (631) T protein:vir:10 232 KTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGV 311 (631) T ss_pred hHHHHHHHHHHhhCceeEeccccccCCCCCCCCCcCCccCCccccchhHHHHHHHHHHHHhhhhcCCCCccceeeeeEee Confidence 111000 111111345444332211 11122222222111122222 222223322 Q ss_pred -----CCceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCc-HH------HHHHHHHHHHHhHHHHHHHHHHHh Q lcl|NC_010576. 264 -----TQEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNGTA-NE------QQTLGYYNRCVDVLLQYVTDAISR 331 (447) Q Consensus 264 -----~g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~-~e------~~~~~f~~~ti~P~~~~ie~~l~~ 331 (447) ++.+.-.+....+..-+..++..+..||....|||+.|-|.+ +. |....=++-.|.|.+..|+++|++ T Consensus 312 p~E~i~~i~hlkf~~ei~e~aiktR~daI~RlA~glDi~pE~LLGlGsd~NHWsAWqI~dedVrlHI~P~l~lic~AlT~ 391 (631) T protein:vir:10 312 PGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTD 391 (631) T ss_pred chHHhcCeeEEeecCchhHHHHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchHHHHHHHHHHh Confidence 222333333344455678889999999999999999885542 11 111223567799999999999999 Q ss_pred hcCChhH----hcCC-ceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccc-------------- Q lcl|NC_010576. 332 IALTKTA----VSQG-QVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLA-------------- 392 (447) Q Consensus 332 kLl~~~e----~~~g-~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~-------------- 392 (447) .+|.+.- .+.. |-+=||.+.|.. |. 
++.+-...+++.|.+|-...|+.+|+.--+++.- T Consensus 392 q~Lrp~Le~eGvDp~kYvvW~DaS~Lt~-dP-dr~deA~qa~drGAIt~eAlrk~lGf~eDd~yd~~t~e~~~~~a~~av 469 (631) T protein:vir:10 392 QILRVTLAREGIDPSKYVVWYDPSQLTI-DP-DKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAV 469 (631) T ss_pred hHHHHHHHHhCCCHHHhEeeecCccccc-CC-CCcHHHHHHHHcCCcCHHHHHHHhcCchhcccCcCchHHHHHHHHHHh Confidence 9876532 2222 445677777642 22 2333334578899999999999999987666420 Q ss_pred --cccccccccch--hhcccccCCCCCCCCCCCcC-------CCCCCCcccccccCCccCcCCCCC Q lcl|NC_010576. 393 --NELFNRNIADG--NQVGGINTPGQITSDQPATA-------STDPLNNVSTSAIENGSLTDGGSY 447 (447) Q Consensus 393 --~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~ 447 (447) |.-..+++++. .++....-+.......+.++ +++-...+.++.+.+..+ ..+.- T Consensus 470 ~~dpaLip~lApl~~~~~~~v~~P~~~a~~~~g~ed~~~~~~~~~g~~epdt~d~~p~~~-~a~~~ 534 (631) T protein:vir:10 470 SKDPTLIPMLAPLIAGVLKQIEFPQQQAIDSGGNEDTSDADDLDDGEQEPDTEDDDDGTQ-KAGLE 534 (631) T ss_pred hcccCcchhhHHHHHHHhhhccCCCCCCCCCCCCCccccccccccCCCCCCCCCCCCccc-cccch Confidence 10011222331 12222111111111111100 000011111111111111 11111 No 141 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=98.54 E-value=5.9e-09 Score=65.70 Aligned_cols=177 Identities=15% Similarity=0.143 Sum_probs=89.0 Q ss_pred eeeeCC---cCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHh Q lcl|NC_010576. 225 FIQFPY---STKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAIL 301 (447) Q Consensus 225 vl~~~~---~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l 301 (447) |++.++ .+..+ .+...+++ +.+ ...+++.+.+++...+-+|+.++.+..... 
+.+......||.+-|||..+| T Consensus 1 V~k~~~l~~~~~~~-~~~~~~r~-~~~-~~~~~~~~~~~ld~~~e~~e~~~~~lsGl~-d~l~~~~~~iaa~s~iP~t~L 76 (201) T protein:vir:10 1 MWKAKGLADLCDDS-DGAARLRL-AQV-DNNSGVGQAIGIDADSEEYNVLNSDIGGID-TFLSQKFDRIVALSGIHEIIL 76 (201) T ss_pred CccchHHHHHhcCC-hHHHHHHH-HHH-HHhhhhhhhheeecCCcceeeeecCcCChH-HHHHHHHHHHHhHhcCchhhh Confidence 333222 11111 12222222 222 344454455655666678888877654321 223456678999999999887 Q ss_pred cCC------cH-HHHHHHHHH-------HHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHH------ Q lcl|NC_010576. 302 NGT------AN-EQQTLGYYN-------RCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLA------ 361 (447) Q Consensus 302 ~g~------~~-e~~~~~f~~-------~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~------ 361 (447) -|. ++ +.....||. .-|.|.++++=+-+ .+.. .|.|.+++|...+.++++ T Consensus 77 fG~sp~Glnatge~d~~nyyd~i~~~Qe~~l~p~le~l~~~~--------~~~~--~~~~~f~pL~~~s~kekAei~~~~ 146 (201) T protein:vir:10 77 KGKNVGGVSASQNTALETFYGYVDRKRKAELLPLLEFLLPFI--------VTEQ--EWSVEFNPLSQVSDKDKSEILEKN 146 (201) T ss_pred cCCCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHhh--------cCCC--CceEeeCCCCCCCHHHHHHHHHHH Confidence 442 12 223334442 33556555544321 1223 345555688888877665 Q ss_pred -HHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCc Q lcl|NC_010576. 362 -TVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNN 430 (447) Q Consensus 362 -~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 430 (447) +++.+++++|+++++|+|+.+--.+..+..++.-+..... ..++.+|. 
+.|.++ T Consensus 147 a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~-------------~~e~~dp~--~~~~~~ 201 (201) T protein:vir:10 147 VNSVAALIAAGIIDADEARDTLRAISTEVKIGEGSIQTEVV-------------INESEDPL--DVSANN 201 (201) T ss_pred HHHHHHHHHcCCCCHHHHHHHHHhcCCcCCCCCCCCCcccc-------------ccccCCCC--CCCCCC Confidence 4567788999999999999886655444333222211110 00111110 011111 No 142 >protein:vir:107517 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943786;genbank:gi:38638411;genbank:GeneID:2657197 Probab=98.51 E-value=1.9e-07 Score=57.44 Aligned_cols=429 Identities=12% Similarity=0.075 Sum_probs=185.2 Q ss_pred CchhHhhhhhcccccCCccc-cccc------cccc--cccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhh Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQ-NQNT------NDFL--TPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDA 71 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~-~~~~------~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~i 71 (447) |.--| |+- +-++|..+ .... +.+. ....|.+..++.....|..--+ ..|-..+-++-.|..|++.+ T Consensus 1 ma~~~-lr~---~rrpk~~p~~~rr~~ltaAsq~~~~p~~~~kt~~~~~ar~~WQ~eAW-~~~d~v~Elry~vgW~~~s~ 75 (639) T protein:vir:10 1 MAATS-LRV---VRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAW-DFSESIGELSYYVSWRANSC 75 (639) T ss_pred CCccc-eee---eecCCCCCcchhhHHHhhhhhccCCcccchhhhccccchhhhhhhhh-hhhhhhhhHHHHhhhhhhhh Confidence 54432 211 11222111 1000 0111 1112223334443333332222 11222366778899999999 Q ss_pred ccCceEEEEEcCCCcee--------ccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeec-cCCc----- Q lcl|NC_010576. 72 SMVDFKHLKIDPISGNQ--------TPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDT-TVDP----- 137 (447) Q Consensus 72 a~lp~~~~r~~~~~~~~--------~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~-~~~~----- 137 (447) +++.+..-+.+.+.+.. ....+++....+.=--.-+...++++.+..+|-+-|++||+.... 
+.++ T Consensus 76 sr~rL~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~iagG~lGqa~llkr~~~~ltV~GE~wi~~l~r~~k~~~~~~~ 155 (639) T protein:vir:10 76 SRTTLIPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLA 155 (639) T ss_pred ceeeeEeeeeccccCCCCCccccccccCcchHHHHHHhhcCccchHHHHHHHHHhheecccceEEEEEEecCccccCccc Confidence 99999888888665421 112233333322112334566789999999999999999875442 2222 Q ss_pred --ccceeeeccCCCcceeeecCCceEEEEeeecccccceeeecccccccccccccccc----cchhHHHHHHHHHHHH-- Q lcl|NC_010576. 138 --DSGSFDINTARVGKIMQFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIESPFYAIL----NDTNQTLRMLEQKIKL-- 209 (447) Q Consensus 138 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~----~~~~~~~~~~~~~~~~-- 209 (447) ...|+.+....+ .....+...+..- + +..-++....+-++.+-.|...-. +...+.+..+..-..+ T Consensus 156 ~~~~~W~vvs~~Ei---~~~~~~~~~i~lP-d--G~~he~~~~~d~l~RvW~P~prr~~e~dSpvra~l~~l~Ei~~~t~ 229 (639) T protein:vir:10 156 APRARWYAVTREEI---KSKAGETAEISLP-D--GKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTR 229 (639) T ss_pred ccccceeeeeHHHh---cccCCCeeEeecC-C--CCCccccCCCceEEEEeCCCcccccCCcchhHHHHHHHHHHHHhhh Confidence 233443322211 1111111111111 0 000000000011111112211111 1112222222221111 Q ss_pred -HHHHHHHhhcCcccceeeeCCcCChHHH---------------------HHHHHHHHHHHHH----Hhcc----CCcce Q lcl|NC_010576. 210 -MNSQDNRASSGKLNGFIQFPYSTKSTAR---------------------AAQAARRKQEIEN----EMAN----NKYGV 259 (447) Q Consensus 210 -~~~~~~~~n~~~~~gvl~~~~~~~~~~~---------------------~~~~~~~~~~~~~----~~~~----n~~~~ 259 (447) ..++.+ .-..-+|||=+|..+.-... 
.-..+.|.+.+.+ .+.+ .+.-+ T Consensus 230 ~i~aaak--SRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA~vP 307 (639) T protein:vir:10 230 KIKNAAK--SRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIP 307 (639) T ss_pred HHHHHHH--HHHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhhcCCCCccceee Confidence 111111 00111244333322110000 0012223333222 2222 22222 Q ss_pred eecC----CCceeeecCCC--hhhhhHHHHHHHHHHHHHHhCCCHHHhcCCcHHH------HHHHHHHHHHhHHHHHHHH Q lcl|NC_010576. 260 ATLD----TQEKFVSAGMG--LQNNLLSDVRQLQQDFYNQMGITEAILNGTANEQ------QTLGYYNRCVDVLLQYVTD 327 (447) Q Consensus 260 ~vl~----~g~~~~~l~~~--~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~e~------~~~~f~~~ti~P~~~~ie~ 327 (447) +++. ..-+++.|.+. .+..-+..++..+..||....|||..|-|.++.+ ....=++-.|.|.+..|++ T Consensus 308 iia~~p~E~l~~ikhl~f~~ei~e~aiktR~daI~RlA~glDi~pE~LLGl~d~NHWsAWqI~dedvrlHI~P~l~~icd 387 (639) T protein:vir:10 308 LVASVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQ 387 (639) T ss_pred eeEeechHHhcCeeeeeecCchhHHHHhhHHHHHHHHHhccCCchhheeecccccceEEEEecccceeeecchhHHHHHH Confidence 3322 23345555554 3445578889999999999999999886543211 1112256779999999999 Q ss_pred HHHhhcCChhH----hcCC-ceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccc------c- Q lcl|NC_010576. 328 AISRIALTKTA----VSQG-QVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANE------L- 395 (447) Q Consensus 328 ~l~~kLl~~~e----~~~g-~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~------~- 395 (447) +|++.+|.+.- .+.. |-+=||.+.|.. |. ++.+-...+++.|.+|-.-.|+.+|+.--+++.-+. 
+ T Consensus 388 AlT~~~Lrp~Le~eGvDp~kYvvW~DaS~Lt~-dP-d~~deA~qa~drGAIt~eAlR~~lG~~edd~yd~~t~e~~~~~A 465 (639) T protein:vir:10 388 AIYNDILTPLLAREGIDPTKYILWYDASGLTS-DP-DLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFA 465 (639) T ss_pred HHHhhHHHHHHHHhCCCHHHhEeeecCccccc-CC-CCcHHHHHHHHcCCccHHHHHHHhccccccCCCCCCcHHHHHHH Confidence 99999876532 2222 445677777643 22 233333457889999999999999998765542110 0 Q ss_pred ---c--ccccc--------chhhc------ccccCCCCCCCCCCCcCCCCCCCcccccccCCccCcCCCCC Q lcl|NC_010576. 396 ---F--NRNIA--------DGNQV------GGINTPGQITSDQPATASTDPLNNVSTSAIENGSLTDGGSY 447 (447) Q Consensus 396 ---~--~~~~~--------~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (447) + ++.+. +..+. .....++.+..+.+..++ ..+...+.++++.++++--= T Consensus 466 ~~~V~~~P~li~~~apl~~P~lq~~e~ptp~~a~~~a~~~~~~de~~g---a~~~~ePdte~~~~~~~a~~ 533 (639) T protein:vir:10 466 ADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREDEEDDEDSG---ARQQREPQTEDERSTEEAAS 533 (639) T ss_pred HHHhcCCcchhhhhhhccCccceecccCCCCCCCCCCCCCCCcccccC---CCCCcCCCcccccCCccccC Confidence 0 01111 11110 111111122222221111 11111123333333322111 No 143 >protein:vir:97900 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655104;genbank:gi:109391854;genbank:GeneID:4157263 Probab=98.51 E-value=1.9e-07 Score=57.44 Aligned_cols=429 Identities=12% Similarity=0.075 Sum_probs=185.2 Q ss_pred CchhHhhhhhcccccCCccc-cccc------cccc--cccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhh Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQ-NQNT------NDFL--TPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDA 71 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~-~~~~------~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~i 71 (447) |.--| |+- +-++|..+ .... +.+. 
....|.+..++.....|..--+ ..|-..+-++-.|..|++.+ T Consensus 1 ma~~~-lr~---~rrpk~~p~~~rr~~ltaAsq~~~~p~~~~kt~~~~~ar~~WQ~eAW-~~~d~v~Elry~vgW~~~s~ 75 (639) T protein:vir:97 1 MAATS-LRV---VRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAW-DFSESIGELSYYVSWRANSC 75 (639) T ss_pred CCccc-eee---eecCCCCCcchhhHHHhhhhhccCCcccchhhhccccchhhhhhhhh-hhhhhhhhHHHHhhhhhhhh Confidence 54432 211 11222111 1000 0111 1112223334443333332222 11222366778899999999 Q ss_pred ccCceEEEEEcCCCcee--------ccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeec-cCCc----- Q lcl|NC_010576. 72 SMVDFKHLKIDPISGNQ--------TPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDT-TVDP----- 137 (447) Q Consensus 72 a~lp~~~~r~~~~~~~~--------~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~-~~~~----- 137 (447) +++.+..-+.+.+.+.. ....+++....+.=--.-+...++++.+..+|-+-|++||+.... +.++ T Consensus 76 sr~rL~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~iagG~lGqa~llkr~~~~ltV~GE~wi~~l~r~~k~~~~~~~ 155 (639) T protein:vir:97 76 SRTTLIPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLA 155 (639) T ss_pred ceeeeEeeeeccccCCCCCccccccccCcchHHHHHHhhcCccchHHHHHHHHHhheecccceEEEEEEecCccccCccc Confidence 99999888888665421 112233333322112334566789999999999999999875442 2222 Q ss_pred --ccceeeeccCCCcceeeecCCceEEEEeeecccccceeeecccccccccccccccc----cchhHHHHHHHHHHHH-- Q lcl|NC_010576. 138 --DSGSFDINTARVGKIMQFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIESPFYAIL----NDTNQTLRMLEQKIKL-- 209 (447) Q Consensus 138 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~----~~~~~~~~~~~~~~~~-- 209 (447) ...|+.+....+ .....+...+..- + +..-++....+-++.+-.|...-. 
+...+.+..+..-..+ T Consensus 156 ~~~~~W~vvs~~Ei---~~~~~~~~~i~lP-d--G~~he~~~~~d~l~RvW~P~prr~~e~dSpvra~l~~l~Ei~~~t~ 229 (639) T protein:vir:97 156 APRARWYAVTREEI---KSKAGETAEISLP-D--GKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTR 229 (639) T ss_pred ccccceeeeeHHHh---cccCCCeeEeecC-C--CCCccccCCCceEEEEeCCCcccccCCcchhHHHHHHHHHHHHhhh Confidence 233443322211 1111111111111 0 000000000011111112211111 1112222222221111 Q ss_pred -HHHHHHHhhcCcccceeeeCCcCChHHH---------------------HHHHHHHHHHHHH----Hhcc----CCcce Q lcl|NC_010576. 210 -MNSQDNRASSGKLNGFIQFPYSTKSTAR---------------------AAQAARRKQEIEN----EMAN----NKYGV 259 (447) Q Consensus 210 -~~~~~~~~n~~~~~gvl~~~~~~~~~~~---------------------~~~~~~~~~~~~~----~~~~----n~~~~ 259 (447) ..++.+ .-..-+|||=+|..+.-... .-..+.|.+.+.+ .+.+ .+.-+ T Consensus 230 ~i~aaak--SRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA~vP 307 (639) T protein:vir:97 230 KIKNAAK--SRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIP 307 (639) T ss_pred HHHHHHH--HHHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhhcCCCCccceee Confidence 111111 00111244333322110000 0012223333222 2222 22222 Q ss_pred eecC----CCceeeecCCC--hhhhhHHHHHHHHHHHHHHhCCCHHHhcCCcHHH------HHHHHHHHHHhHHHHHHHH Q lcl|NC_010576. 260 ATLD----TQEKFVSAGMG--LQNNLLSDVRQLQQDFYNQMGITEAILNGTANEQ------QTLGYYNRCVDVLLQYVTD 327 (447) Q Consensus 260 ~vl~----~g~~~~~l~~~--~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~e~------~~~~f~~~ti~P~~~~ie~ 327 (447) +++. ..-+++.|.+. 
.+..-+..++..+..||....|||..|-|.++.+ ....=++-.|.|.+..|++ T Consensus 308 iia~~p~E~l~~ikhl~f~~ei~e~aiktR~daI~RlA~glDi~pE~LLGl~d~NHWsAWqI~dedvrlHI~P~l~~icd 387 (639) T protein:vir:97 308 LVASVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQ 387 (639) T ss_pred eeEeechHHhcCeeeeeecCchhHHHHhhHHHHHHHHHhccCCchhheeecccccceEEEEecccceeeecchhHHHHHH Confidence 3322 23345555554 3445578889999999999999999886543211 1112256779999999999 Q ss_pred HHHhhcCChhH----hcCC-ceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccc------c- Q lcl|NC_010576. 328 AISRIALTKTA----VSQG-QVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANE------L- 395 (447) Q Consensus 328 ~l~~kLl~~~e----~~~g-~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~------~- 395 (447) +|++.+|.+.- .+.. |-+=||.+.|.. |. ++.+-...+++.|.+|-.-.|+.+|+.--+++.-+. + T Consensus 388 AlT~~~Lrp~Le~eGvDp~kYvvW~DaS~Lt~-dP-d~~deA~qa~drGAIt~eAlR~~lG~~edd~yd~~t~e~~~~~A 465 (639) T protein:vir:97 388 AIYNDILTPLLAREGIDPTKYILWYDASGLTS-DP-DLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFA 465 (639) T ss_pred HHHhhHHHHHHHHhCCCHHHhEeeecCccccc-CC-CCcHHHHHHHHcCCccHHHHHHHhccccccCCCCCCcHHHHHHH Confidence 99999876532 2222 445677777643 22 233333457889999999999999998765542110 0 Q ss_pred ---c--ccccc--------chhhc------ccccCCCCCCCCCCCcCCCCCCCcccccccCCccCcCCCCC Q lcl|NC_010576. 396 ---F--NRNIA--------DGNQV------GGINTPGQITSDQPATASTDPLNNVSTSAIENGSLTDGGSY 447 (447) Q Consensus 396 ---~--~~~~~--------~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (447) + ++.+. +..+. 
.....++.+..+.+..++ ..+...+.++++.++++--= T Consensus 466 ~~~V~~~P~li~~~apl~~P~lq~~e~ptp~~a~~~a~~~~~~de~~g---a~~~~ePdte~~~~~~~a~~ 533 (639) T protein:vir:97 466 ADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREDEEDDEDSG---ARQQREPQTEDERSTEEAAS 533 (639) T ss_pred HHHhcCCcchhhhhhhccCccceecccCCCCCCCCCCCCCCCcccccC---CCCCcCCCcccccCCccccC Confidence 0 01111 11110 111111122222221111 11111123333333322111 No 144 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=98.39 E-value=8.9e-07 Score=53.76 Aligned_cols=403 Identities=9% Similarity=-0.033 Sum_probs=164.1 Q ss_pred CchhHhhhhhcccccCCccccccccccccccc------c-ccccccccccCCc-------ccccchhhhhhHHHHHHHHH Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSN------G-MTSFGGYYGRGQS-------NYSRSYSYNKADLIKSVITR 66 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~------~-~~~~~~~~~~~~~-------~~~~~~~~~~~~~v~~cv~~ 66 (447) |.--.| ++...- +..........+.. + ..++.|....... ...--...++.+.|.+|++. T Consensus 1 m~k~~~--k~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~~~~ly~~m~~D~hi~s~l~~ 74 (448) T protein:vir:79 1 MAKRGR--KPKELV----PGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKDGLLVYHKMLSDGTVKNALNY 74 (448) T ss_pred CCCCCC--CCcccc----CcccccccccchhhhhhhhhhcccccccccccchhHhhccccchHHHHHHhhChHHHHHHHH Confidence 433211 111000 00000000000000 0 0111111100000 00001123456789999999 Q ss_pred HHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCccc---CHHHHHHHHHHHHHhcCCeeEEEeecc--CCc-ccc Q lcl|NC_010576. 67 IALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQ---TGRSFVFDLLYSLLDEGQIAMVPIDTT--VDP-DSG 140 (447) Q Consensus 67 ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~---t~~~f~~~~~~~lll~Gna~i~~~~~~--~~~-~~~ 140 (447) +...|.+++|.|....++. .....-.-+...|. .++... +..+++..+ .+.+++|.+++.+++.. .+. ... 
T Consensus 75 Rk~av~~~~w~v~p~~~~~-~~~~~ae~v~~~l~-~~~~~~~~~~f~~~~~~~-lda~~~G~s~~Eivw~~~~~g~~~~~ 151 (448) T protein:vir:79 75 IFGRIRSAKWYVEPASTDP-EDIAIAAFIHAQLG-IDDASVGKYPFGRLFAIY-ENAYIYGMAAGEIVLTLGADGKLILD 151 (448) T ss_pred HHHHHhcCCceEecCCCCH-HHHHHHHHHHHHhh-hhhhhhccCCHHHHHHHH-HHhhhhcceeEEEEeeecCCCceecc Confidence 9999999999974321111 11111112333332 333322 233333333 34568999998887642 111 111 Q ss_pred eeeeccCCCcceeee-cCCceEEEEeeeccc----ccceeeecccccccccccccccccchhHHHHHHHH-----HHHHH Q lcl|NC_010576. 141 SFDINTARVGKIMQF-FPRQVMVRVWNDNTG----LEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQ-----KIKLM 210 (447) Q Consensus 141 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~----~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~ 210 (447) .+.+.+.+...-..+ ..+...++....... ....+.+|..-++|.....++ .......+..+.- ..... T Consensus 152 ~l~~r~~~~~~~f~~~~d~~l~~~~~~~~~~~~~~~~~~~~lP~~~~i~~~~~~~g-~p~g~gLlr~~~w~~~fK~~~~~ 230 (448) T protein:vir:79 152 KIVPIHPFNIDEVLYDEEGGPKALKLSGEVKGGSQFVSGLEIPIWKTVVFLHNDDG-SFTGQSALRAAVPHWLAKRALIL 230 (448) T ss_pred cccccCCccccceeeecCCceEEeecCCcccccccCCCccccccceEEEEecCccC-CcccchhHHHHHHHHHHHHHHHH Confidence 121112111111111 112222221111111 011233455556665443221 1111122222211 11111 Q ss_pred HHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhh-hHHHHHHHHHH Q lcl|NC_010576. 211 NSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNN-LLSDVRQLQQD 289 (447) Q Consensus 211 ~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~-~l~~~~~~~~~ 289 (447) .-......-|.|--+.+++...+.+ ++.++.+.+...+...++. ..++++.|++++-++...... 
..+..++..++ T Consensus 231 ~w~~f~E~yG~P~~vgky~~ga~~~--~~~~~~l~~av~~i~~g~~-a~~iiP~~~~ie~~ea~~~~~~~~~~i~~~d~~ 307 (448) T protein:vir:79 231 LINHGLERFMIGVPTLTIPKSVRQG--TKQWEAAKEIVKNFVQKPR-HGIILPDDWKFDTVDLKSAMPDAIPYLTYHDAG 307 (448) T ss_pred HHHHHHHHcCCceEEEecCCCCCcC--HHHHHHHHHHHHHHhcCCc-eEEEecCCceEEEEecCCCcccHHHHHHHHHHH Confidence 1122223344443366666444322 1122233333322222322 235688888877776543322 23445777888 Q ss_pred HHHHhCCCHHHh-----cCCcHH--HHHHHHHHHHHhHHHHHHHHHHHhhcCChh-HhcCC-----ceEEEecchhhhcC Q lcl|NC_010576. 290 FYNQMGITEAIL-----NGTANE--QQTLGYYNRCVDVLLQYVTDAISRIALTKT-AVSQG-----QVLVYYRNPFKLVP 356 (447) Q Consensus 290 Ia~~fgVP~~~l-----~g~~~e--~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~-e~~~g-----~~i~f~~~~l~~~d 356 (447) |+.+.-= .-+ +|+++. +.......+.+.-.+++|++.||+.|+.+- .+-.| .+|.|+ .....| T Consensus 308 Isk~iLG--qtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~li~~l~~lNfg~~~~~P~~~f~--~~e~~D 383 (448) T protein:vir:79 308 IARALGI--DFNTVQLNMGVQAINIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPSATRFPRLTFE--MEERND 383 (448) T ss_pred HHHHHhh--hhhccccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcCCCcEEEec--CCChHH Confidence 8876432 111 122222 222334456677789999999999887642 22111 145554 456678 Q ss_pred HHHHHHHHHHHHhCCCcCHHHHHHHhCCC-CCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCcccccc Q lcl|NC_010576. 357 VEQLATVADVLTRNAIYTPNEIRELTGKA-PHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTSA 435 (447) Q Consensus 357 ~~~~~~~~~~~~~~G~~t~NE~R~~~gl~-p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (447) ++++++++.+++..+-..-+-+|+.+|+| |.++..- . + .+.... .. 
+..++.+++..-=+..-- T Consensus 384 l~~~a~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~~~------~-a----~~~~~~-~~---~~~~~~~~~~~~~~~~~~ 448 (448) T protein:vir:79 384 FSAAANLMGMLINAVKDSEDIPTELKALIDALPSKMR------R-A----LGVVDE-VR---EAVRQPADSRYLYTRRRR 448 (448) T ss_pred HHHHHHHhhhhhccchhhHHHHHHhhcCCCCCCCccc------c-c----cCCCCc-cc---ccccCCccccchhhcccC Confidence 99999999999987655555567777887 3332100 0 0 000000 00 011111111111010000 No 145 >protein:vir:99088 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655692;genbank:gi:109521770;genbank:GeneID:4157810 Probab=98.38 E-value=9.4e-07 Score=53.61 Aligned_cols=424 Identities=10% Similarity=0.051 Sum_probs=187.1 Q ss_pred CchhHhhhhhcccccCCccccccc-------cccc-ccc-ccccccccccccCCcccccchhhhh-hHHHHHHHHHHHHh Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNT-------NDFL-TPS-NGMTSFGGYYGRGQSNYSRSYSYNK-ADLIKSVITRIALD 70 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~-------~~~~-~~~-~~~~~~~~~~~~~~~~~~~~~~~~~-~~~v~~cv~~ia~~ 70 (447) |.--| |+- +-++|..+..++ +..+ .+. ....+++++....+..--+ .++. .+-++-.|..|++. T Consensus 1 ma~~~-lr~---~rrpk~~p~~~r~~al~aas~~i~~p~~~~~ks~~~~~~~~WQ~eAW--~~~d~v~Elry~vgW~~~s 74 (629) T protein:vir:99 1 MAPTS-LRI---VRRPKSEPVSTRQRALVAASQPVENPGKAFRKAMGSSTRTDWQDDAW--KAYDAVGELRYYVGWRSSS 74 (629) T ss_pred CCccc-eee---eecCCCCChhhhhhhhhhhhhcccccchhhhhhcCCCchhhhhHHHH--HHHHhhhhHHHHhhhhhhh Confidence 54432 111 112222111111 0111 000 0011111221111211111 1222 45677788999999 Q ss_pred hccCceEEEEEcCCCceecc-ccc--h----HHHHHhhhcC-cccCHHHHHHHHHHHHHhcCCeeEEEeeccCCccc--- Q lcl|NC_010576. 71 ASMVDFKHLKIDPISGNQTP-MPS--G----LINVLTRSAN-IDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDS--- 139 (447) Q Consensus 71 ia~lp~~~~r~~~~~~~~~~-~~~--~----l~~lL~~~PN-~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~--- 139 (447) ++++.+..-+.+.+++.... .++ + +.++.. ++- .-+-..++++.+..+|-+-|++|+++.....+... 
T Consensus 75 ~Sr~rL~as~idpDtg~ptg~i~e~~~~~~~v~~~v~-~i~gG~lgqa~lLkr~~~~ltV~GE~wiv~~~~~~~~~d~~~ 153 (629) T protein:vir:99 75 ASRVRLIASAIDPDTGLPTGSIDEDDRVGARVQQIVN-QIAGGALGQAQLIKRVVEQLTVAGETWVAILFTDKSRLDSNG 153 (629) T ss_pred hceeeeEeeeecCCCCCCccccCCCchhHHHHHHHHH-hhcCChhhHHHHHHHHHhheecccceEEEEeecCCCccCCCC Confidence 99999998888877653311 122 1 222222 222 23456789999999999999999998765443331 Q ss_pred ----ceeeeccCCCcceeeecCCceEEEEeeecccccceeeeccccccccc--ccccccc----cchhHHHHHHHHHHHH Q lcl|NC_010576. 140 ----GSFDINTARVGKIMQFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIE--SPFYAIL----NDTNQTLRMLEQKIKL 209 (447) Q Consensus 140 ----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~--~~~~~~~----~~~~~~~~~~~~~~~~ 209 (447) .|+.+.+..+.. .. +...+.. .....+.+....|++ +| .|...-. +...+.+..+..-..+ T Consensus 154 ~~~~eW~~vt~~ei~~---~~-~~~~i~l----P~g~~~e~~~~~d~l-~RiW~P~Prr~~e~DSpvra~l~~l~Ei~~l 224 (629) T protein:vir:99 154 NPVPEWLALTPEEVRA---SE-KKTIIEL----PTGDKHEFRDGLDGM-FRVWNPRARRAREPDSPVRANLDSLKEIVRT 224 (629) T ss_pred cchhhheeechHHhhh---cc-CceeEEc----CCCCccceeCCCceE-EEeeCCCcccccCCcchhHHHHHHHHHHHHh Confidence 233222221110 00 0011111 111222222223322 22 2211111 1122222222221111 Q ss_pred ---HHHHHHHhhcCcccceeeeCCcCC------hH-----HHH-------HHHHHHHHHHH----HHhcc----CCccee Q lcl|NC_010576. 210 ---MNSQDNRASSGKLNGFIQFPYSTK------ST-----ARA-------AQAARRKQEIE----NEMAN----NKYGVA 260 (447) Q Consensus 210 ---~~~~~~~~n~~~~~gvl~~~~~~~------~~-----~~~-------~~~~~~~~~~~----~~~~~----n~~~~~ 260 (447) ..++.+ .-..-+|||=++..++ +. ... -..+++.+.+. 
..+.+ .+.-++ T Consensus 225 t~~i~aaak--SRL~gnGvlflP~e~slP~~~~p~~~n~pg~~~p~~~~~pa~~~l~~~l~q~a~tAi~De~S~aA~vPi 302 (629) T protein:vir:99 225 TKTIANASK--SRLIGNGVVFVPHEMSLPSMNAPVASNKPGAPAPPILGTPAVQQLQELLFQVAQTAYDDEDSMAALIPM 302 (629) T ss_pred hhHHHHHHH--HHHhhCceeEeccCcccCccCCCCCCCCCCcccccccccchHHHHHHHHHHHHhhhhcCCCCccceeee Confidence 111111 0011124432221110 00 000 02223333332 22222 222223 Q ss_pred ecC------CCceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCc-HH------HHHHHHHHHHHhHHHHHHHH Q lcl|NC_010576. 261 TLD------TQEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNGTA-NE------QQTLGYYNRCVDVLLQYVTD 327 (447) Q Consensus 261 vl~------~g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~-~e------~~~~~f~~~ti~P~~~~ie~ 327 (447) ++. ++.+.-.+....+..-+..++..+..||...-|||+.|-|.+ +. |....=++-.|.|.+..|++ T Consensus 303 ia~~P~E~i~~i~hlkf~~ei~e~aiktR~daI~RlA~glDippE~LLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~ 382 (629) T protein:vir:99 303 FAAAPGELIKNVTHLKFDNQVTEVAIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVRLHILPPVEMLCE 382 (629) T ss_pred eEeechHHhcCeeEEeecCchhHHHHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchhHHHHHH Confidence 322 222333333344455678889999999999999999885542 11 11122356779999999999 Q ss_pred HHHhhcCChhH----hcCC-ceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc----------- Q lcl|NC_010576. 328 AISRIALTKTA----VSQG-QVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPL----------- 391 (447) Q Consensus 328 ~l~~kLl~~~e----~~~g-~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~----------- 391 (447) +|++.+|.+.- .+.. |-+=||.+.|.. |. 
++.+-...+++.|.+|-...|+.+|+.--+|++ T Consensus 383 AlT~~~Lrp~Le~eGiDp~kYvvW~DaS~Lt~-dP-d~~deA~~a~drGAIt~eAlrk~lGf~eD~~yd~tt~E~~~~~a 460 (629) T protein:vir:99 383 AITNQVLRTVLMREGIDPNAYVVWHDASQLTV-DP-DKTDEARDAFDRGAITAEAMVKMLGLADDTVYDFTTPEGWAQWA 460 (629) T ss_pred HHHhhHHHHHHHHhCCCHHHhEeeecCccccc-CC-CCcHHHHHHHHcCCccHHHHHHHhcCccccccCCCchHHHHHHH Confidence 99999876532 2222 345677777642 22 233333457889999999999999998765552 Q ss_pred cccc-ccccccc----h-hhcccccCC--------CCCCCCCCCcCCCCCCCcccccccCCccCcCCC----CC Q lcl|NC_010576. 392 ANEL-FNRNIAD----G-NQVGGINTP--------GQITSDQPATASTDPLNNVSTSAIENGSLTDGG----SY 447 (447) Q Consensus 392 ~~~~-~~~~~~~----~-~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~ 447 (447) .+-. ..+++++ + .+.+....+ +++++.++. .+...++..+.++++.++++- .. T Consensus 461 ~d~V~~~P~Li~~~a~l~~~~a~~~~P~~~~~~pp~~e~~~~dE---~sga~~~~ep~te~d~~~~~a~~aa~~ 531 (629) T protein:vir:99 461 RDRVGQDPNLLPTLAVLIPELADVEFPTPTVALPPAEEQDGDEE---ASGASRREEPDTEDDAGTDDSDQASLD 531 (629) T ss_pred HHhhhhCcchhhhhhhhhhhhcccccCccCCCCCccccCCCccc---ccCCCcCCCCCCCCCCcccccCCCCCC Confidence 0100 0112211 0 111111111 112221111 122334455556655554433 21 No 146 >protein:vir:8654 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817773;genbank:gi:29566205;genbank:GeneID:1259465 Probab=98.37 E-value=1e-06 Score=53.47 Aligned_cols=425 Identities=10% Similarity=0.044 Sum_probs=187.4 Q ss_pred CchhHhhhhhcccccCCccccccc-------cccc-ccc-ccccccccccccCCcccccchhhhh-hHHHHHHHHHHHHh Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNT-------NDFL-TPS-NGMTSFGGYYGRGQSNYSRSYSYNK-ADLIKSVITRIALD 70 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~-------~~~~-~~~-~~~~~~~~~~~~~~~~~~~~~~~~~-~~~v~~cv~~ia~~ 70 (447) |.--| |+- +-++|..+..++ +..+ .+. ....+++++....+..--+ .++. .+-++-.|..|++. 
T Consensus 1 ma~~~-lr~---~rrpk~~p~~~r~~al~aas~~i~~p~~~~~ks~~~~~~~~WQ~eAW--~~~d~v~Elry~vgW~~~s 74 (629) T protein:vir:86 1 MAPTS-LRI---VRRPKSEPVSTRQRALVAASQPVENPGKAFRKAMGSSTRTDWQEDAW--KAYDAVGELRYYVGWRSSS 74 (629) T ss_pred CCccc-eee---eecCCCCChhhhhhhhhhhhhccccccchhhhhcCCCchhhhhHHHH--HHHHhhhhHHHHhhhhhhh Confidence 54432 111 112222111111 0111 000 0011111221112211111 1222 45677788999999 Q ss_pred hccCceEEEEEcCCCceecc-ccc--h----HHHHHhhhcC-cccCHHHHHHHHHHHHHhcCCeeEEEeeccCCccc--- Q lcl|NC_010576. 71 ASMVDFKHLKIDPISGNQTP-MPS--G----LINVLTRSAN-IDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDS--- 139 (447) Q Consensus 71 ia~lp~~~~r~~~~~~~~~~-~~~--~----l~~lL~~~PN-~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~--- 139 (447) ++++.+..-+.+.+++.... .++ + +.++.. ++- .-+-..++++.+..+|-+-|++|+++.....+... T Consensus 75 ~Sr~rL~as~idpDtg~ptg~i~e~~~~~~~v~~~v~-~i~gG~lgqa~lLkr~~~~ltV~GE~wiv~~~~~~~~~d~~~ 153 (629) T protein:vir:86 75 ASRVRLIASAIDPDTGLPTGSIDEDDRVGARVQQIVN-QIAGGALGQAQLIKRVVEQLTVAGETWVAILFTDKSRLDSNG 153 (629) T ss_pred hceeeeEeeeecCCCCCCccccCCCchhHHHHHHHHH-hhcCChhhHHHHHHHHHhheecccceEEEEeecCCCccCCCC Confidence 99999998888877653311 122 1 222222 222 23456789999999999999999998765443331 Q ss_pred ----ceeeeccCCCcceeeecCCceEEEEeeecccccceeeecccccc-cccccccccc----cchhHHHHHHHHHHHH- Q lcl|NC_010576. 140 ----GSFDINTARVGKIMQFFPRQVMVRVWNDNTGLEQDLLVSKENCI-IIESPFYAIL----NDTNQTLRMLEQKIKL- 209 (447) Q Consensus 140 ----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~-~~~~~~~~~~----~~~~~~~~~~~~~~~~- 209 (447) .|+.+.+..+..- . +...+.. .....+.+....+++ .+-.|...-. 
+...+.+..+..-..+ T Consensus 154 ~~~~eW~~vt~~ei~~~---~-~~~~i~l----P~g~~~e~~~~~d~l~RiW~P~Prr~~e~DSpvra~l~~l~Ei~~lt 225 (629) T protein:vir:86 154 NPVPEWLALTPEEVRAS---E-KKTIIEL----PTGDKHEFRDGLDGMFRVWNPRARRAREPDSPVRANLDSLKEIVRTT 225 (629) T ss_pred cchhhheeechHHhhhc---c-CceeeEc----CCCCcceeeCCCceEEEeeCCCcccccCCcchhHHHHHHHHHHHHhh Confidence 2332222221100 0 0011111 111222222223332 1112211111 1122222222221111 Q ss_pred --HHHHHHHhhcCcccceeeeCCcCC------hH-----HHH-------HHHHHHHHHHH----HHhcc----CCcceee Q lcl|NC_010576. 210 --MNSQDNRASSGKLNGFIQFPYSTK------ST-----ARA-------AQAARRKQEIE----NEMAN----NKYGVAT 261 (447) Q Consensus 210 --~~~~~~~~n~~~~~gvl~~~~~~~------~~-----~~~-------~~~~~~~~~~~----~~~~~----n~~~~~v 261 (447) ..+..+ .-..-+|||=++..++ +. ... -..+++.+.+. ..+.+ .+.-+++ T Consensus 226 ~~i~aaak--SRL~gnGvlflP~e~slP~~~~p~~~n~pg~~~p~~~~~pa~~~l~~~l~q~a~tAi~De~S~aA~vPii 303 (629) T protein:vir:86 226 KTIANASK--SRLIGNGVVFVPHEMSLPSMNAPVASNKPGAPAPPILGTPAVQQLQELLFQVAQTAYDDEDSMAALIPMF 303 (629) T ss_pred hHHHHHHH--HHHhhCceeeeccCcccCccCCCCCCCCCCcccccccccchHHHHHHHHHHHHhhhhcCCCCccceeeee Confidence 111111 0011124432221110 00 000 02223333332 22222 2222233 Q ss_pred cC------CCceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCc-HH------HHHHHHHHHHHhHHHHHHHHH Q lcl|NC_010576. 262 LD------TQEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNGTA-NE------QQTLGYYNRCVDVLLQYVTDA 328 (447) Q Consensus 262 l~------~g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~-~e------~~~~~f~~~ti~P~~~~ie~~ 328 (447) +. ++.+.-.+....+..-+..++..+..||...-|||+.|-|.+ +. 
|....=++-.|.|.+..|+++ T Consensus 304 a~~P~E~i~~i~hlkf~~ei~e~aiktR~daI~RlA~glDippE~LLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~A 383 (629) T protein:vir:86 304 AAAPGELIKNVTHLKFDNQVTEVAIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVRLHILPPVEMLCEA 383 (629) T ss_pred EeechHHhcCeeEEeecCchhHHHHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchHHHHHHHH Confidence 22 222333333344455678889999999999999999885542 11 111223567799999999999 Q ss_pred HHhhcCChhH----hcCC-ceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccc--c-------- Q lcl|NC_010576. 329 ISRIALTKTA----VSQG-QVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLA--N-------- 393 (447) Q Consensus 329 l~~kLl~~~e----~~~g-~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~--~-------- 393 (447) |++.+|.+.- .+.. |-+=||.+.|.. |. ++.+-...+++.|.+|-...|+.+|+.--+|++- . T Consensus 384 lT~~~Lrp~Le~eGiDp~kYvvW~DaS~Lt~-dP-d~~deA~~a~drGAIt~eAlrk~lGf~eD~~yd~tt~E~~~~~a~ 461 (629) T protein:vir:86 384 ITNQVLRTVLMREGIDPNAYVVWHDASQLTV-DP-DKTDEARDAFDRGAITAEAMVKMLGLADDTVYDFTTPEGWAQWAR 461 (629) T ss_pred HHhhHHHHHHHHhCCCHHHhEeeecCccccc-CC-CCcHHHHHHHHcCCcCHHHHHHHhcCccccccCCCchHHHHHHHH Confidence 9999876532 2222 345677777642 22 2333334578899999999999999987655520 0 Q ss_pred ccc--cccccc----h-hhcccccCC--------CCCCCCCCCcCCCCCCCcccccccCCccCcCCC----CC Q lcl|NC_010576. 394 ELF--NRNIAD----G-NQVGGINTP--------GQITSDQPATASTDPLNNVSTSAIENGSLTDGG----SY 447 (447) Q Consensus 394 ~~~--~~~~~~----~-~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~ 447 (447) ..+ .+++++ + .+.+....+ +++++.++. .+...++..+.++++.++++- .. 
T Consensus 462 d~V~~~P~Li~~~a~l~~~~a~~~~P~~~~~~pp~~e~~~~dE---~sga~~~~ep~te~d~~~~~a~~aa~~ 531 (629) T protein:vir:86 462 DRVGQDPNLLPTLAVLIPELADVEFPTPTVALPPAEEQDGDEE---ASGASRREEPDTEDDAGTDDSDQASLD 531 (629) T ss_pred HhhhhCcchhhhhhhhhhhhcccccCccCCCCCccccCCCccc---ccCCCcCCCCCCCCCCcccccCCCCCC Confidence 011 112211 0 111111111 112221111 122334455556655554433 21 No 147 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=98.28 E-value=1.8e-06 Score=52.14 Aligned_cols=402 Identities=9% Similarity=-0.033 Sum_probs=162.6 Q ss_pred CchhHhhhhhcccccCCccccccccccc-----cc----ccc-ccccccccccC-------CcccccchhhhhhHHHHHH Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFL-----TP----SNG-MTSFGGYYGRG-------QSNYSRSYSYNKADLIKSV 63 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~-----~~----~~~-~~~~~~~~~~~-------~~~~~~~~~~~~~~~v~~c 63 (447) |.-=.+ |+++. ...+.... .. ..+ ..++.|..... ...+.--...++.+.|.+| T Consensus 1 m~kk~~--------k~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~~~~ly~~m~~D~hi~s~ 71 (448) T protein:vir:77 1 MAKRGR--------KPKEL-VPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKDGLLVYHKMLSDGTVKNA 71 (448) T ss_pred CCCCCC--------CCccc-CCcccccchhhhhhhccchhhhcccccccccccchhHhhccccchHHHHHHhhChHHHHH Confidence 333111 11110 00000000 00 000 00111111000 0000001123456789999 Q ss_pred HHHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcC---cccCHHHHHHHHHHHHHhcCCeeEEEeecc--CCc- Q lcl|NC_010576. 64 ITRIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSAN---IDQTGRSFVFDLLYSLLDEGQIAMVPIDTT--VDP- 137 (447) Q Consensus 64 v~~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN---~~~t~~~f~~~~~~~lll~Gna~i~~~~~~--~~~- 137 (447) ++.+...|.+++|.|....++. .......-+...|. .+. ...+..+++..| .+.+++|.+...+++.. .+. 
T Consensus 72 l~~Rk~av~~~~w~v~p~~~~~-~d~~~ae~v~~~l~-~~~~~~~~~~f~~~i~~~-lda~~~G~s~~Eivw~~~~dg~~ 148 (448) T protein:vir:77 72 LNYIFGRIRSAKWYVEPASTDP-EDIAIAAFIHAQLG-IDDASVGKYPFGRLFAIY-ENAYIYGMAAGEIVLTLGADGKL 148 (448) T ss_pred HHHHHHHHhcCCceEecCCCCH-HHHHHHHHHHHHhh-chhhhhccCCHHHHHHHH-HHhhhhcceeEEEEEeecCCCce Confidence 9999999999999874321111 11111112333332 222 123445566665 46778999988877642 111 Q ss_pred ccceeeeccCCCcceeeecC-CceEEEEeeecc-c---ccceeeecccccccccccccccccchhHHHHHHHHH-----H Q lcl|NC_010576. 138 DSGSFDINTARVGKIMQFFP-RQVMVRVWNDNT-G---LEQDLLVSKENCIIIESPFYAILNDTNQTLRMLEQK-----I 207 (447) Q Consensus 138 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-~---~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~-----~ 207 (447) ....+.+.+.....-..+.+ +...++...... + ....+.+|..-++|.....++ .......+..+.-. . T Consensus 149 ~~~~l~~r~~~~~~~f~~~~~~~l~~~~~~~~~~~~~~~~~~~~lP~~~~i~~~~~~~g-~p~g~gLlr~~~w~~~fK~~ 227 (448) T protein:vir:77 149 ILDKIVPIHPFNIDEVLYDEEGGPKALKLSGEVKGGSQFVNGLEIPIWKTVVFLHNDDG-SFTGQSALRAAVPHWLAKRA 227 (448) T ss_pred eeccccccCCCccceeeeecCCceEEEecCCcccccccCCCccccccceEEEEecCCcC-CcccchHHHHHHHHHHHHHh Confidence 11112121211111111111 122222111111 0 111233455556665543221 11111222221111 1 Q ss_pred HHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhh-HHHHHHH Q lcl|NC_010576. 208 KLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNL-LSDVRQL 286 (447) Q Consensus 208 ~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~-l~~~~~~ 286 (447) ....-......-|.|--+.+++...+.+. +.++.+.+...+...++. ..++++.|++++-++....... .+..++. 
T Consensus 228 ~~~~w~~f~E~yG~P~~vgky~~ga~~~~--~~~~~l~~av~~i~~g~~-a~~iiP~g~~ie~~ea~~~~~~~~~~i~~~ 304 (448) T protein:vir:77 228 LILLINHGLERFMIGVPTLTIPKSVRQGT--KQWEAAKEIVKNFVQKPR-HGIILPDDWKFDTVDLKSAMPDAIPYLTYH 304 (448) T ss_pred hHHHHHHHHHHcCCceeEEecCCCCCCCH--HHHHHHHHHHHHHhcCCc-eEEEecCCceEEEEecCCCccCHHHHHHHH Confidence 11111112233444444566664443221 122233333322212322 2456888888776665433222 3445777 Q ss_pred HHHHHHHhCCCHHH-h---cCCcH--HHHHHHHHHHHHhHHHHHHHHHHHhhcCChh-HhcCC-----ceEEEecchhhh Q lcl|NC_010576. 287 QQDFYNQMGITEAI-L---NGTAN--EQQTLGYYNRCVDVLLQYVTDAISRIALTKT-AVSQG-----QVLVYYRNPFKL 354 (447) Q Consensus 287 ~~~Ia~~fgVP~~~-l---~g~~~--e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~-e~~~g-----~~i~f~~~~l~~ 354 (447) .++|+++..-. .+ . +|+++ .+.......+.+.-.+++|++.||+.|+.+- .+-.| .+|.|+ .... T Consensus 305 d~~Isk~iLGq-tlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~Li~~l~~lNfg~~~~~P~~~f~--~~e~ 381 (448) T protein:vir:77 305 DAGIARALGID-FNTVQLNMGVQAVNIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPGATRFPRLTFE--MEER 381 (448) T ss_pred HHHHHHHHhcc-ccccccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCEEEec--CCCh Confidence 88888875332 11 1 12222 2222234566677789999999999887642 22211 245565 4456 Q ss_pred cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCccccc Q lcl|NC_010576. 355 VPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTS 434 (447) Q Consensus 355 ~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (447) .|++++++.+.+++ +-+|+.+|+|.-.++ + ... .+ .....+....+.+.+ ....|.++. T Consensus 382 eDl~~~a~~~~~l~-------~~~~~~~~ip~~~~~-~---~~~-----~~-~~~~~~~~~~~~~~~-~~~~~~~~~--- 440 (448) T protein:vir:77 382 NDFSAAANLMGMLI-------NAVKDSEDIPTELKA-L---IDA-----LP-SKMRRALGVVDEVRE-AVRQPADSR--- 440 (448) T ss_pred hhHHHHHHHhHHHH-------HHHHHHhcCCccCCc-C---CCC-----Cc-hhcccccCCCCCCCc-hhhcchhhH--- Confidence 78999999988886 458999999752221 0 100 00 000000001111111 111111111 Q ss_pred ccCCccCc Q lcl|NC_010576. 
435 AIENGSLT 442 (447) Q Consensus 435 ~~~~~~~~ 442 (447) ...+=... T Consensus 441 ~~~~r~~~ 448 (448) T protein:vir:77 441 YLYTRRRR 448 (448) T ss_pred HHHhhhcC Confidence 00000000 No 148 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=97.72 E-value=2.5e-05 Score=45.83 Aligned_cols=311 Identities=11% Similarity=0.036 Sum_probs=118.2 Q ss_pred eEEEeeccCCc--ccceeeeccCCCcceeeecCCceEEEEeeecccccceeeecccccccccccccccccchhHHHHHHH Q lcl|NC_010576. 127 AMVPIDTTVDP--DSGSFDINTARVGKIMQFFPRQVMVRVWNDNTGLEQDLLVSKENCIIIESPFYAILNDTNQTLRMLE 204 (447) Q Consensus 127 ~i~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~ 204 (447) +..+++...+. ....+.+.+.+.-.-..+.++.--..+..........+.++....++.+....++.......+..+. T Consensus 1 v~Eivw~~~~g~~~~~~l~~r~~~~~~~f~~~~~~~l~~~~~~~~~g~~~~~lp~~kfi~~~~~~~~g~p~G~gLlr~~~ 80 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLAWRPPRTISRFDVAPDGGLVAIEQWGVFGKATVRIPVDRLVVFVNEREGANWLGQSLLRQAY 80 (355) T ss_pred CeEEEEEeeCCeEEEeeeeecCccceeeeeeccCCceeEEEecCCCCCCcceeccCCEEEEEeCCCCCCccchhhHHHHH Confidence 33344432211 1111222221110000011111111111111111122334444444433221121111222222222 Q ss_pred HHHHHHHH-----HHHHhh--cCcccceeeeCCcCChHHH-------HHHHHHHHHHHHHHhccCCcceeecCCCceeee Q lcl|NC_010576. 205 QKIKLMNS-----QDNRAS--SGKLNGFIQFPYSTKSTAR-------AAQAARRKQEIENEMANNKYGVATLDTQEKFVS 270 (447) Q Consensus 205 ~~~~~~~~-----~~~~~n--~~~~~gvl~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~ 270 (447) -..-.-.. ...... .+.|-++...+....+.+. ++.++.+..... 
.........++++.|++++- T Consensus 81 w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~~~~-~i~~g~~a~~iip~g~~ie~ 159 (355) T protein:vir:78 81 KNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGLQLAK-EFRAGEAAGGYIPHGANFTL 159 (355) T ss_pred HHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHHHHHH-HhhCCcceeEeecCCceEEE Confidence 11111111 111111 2333333333222222211 112222222221 22222224567899998887 Q ss_pred cCCChhhhhH-HHHHHHHHHHHHHhCCCHHHhc-----CCcH-HHHHHHHHHHHHhHHHHHHHHHHHhhcCChh-HhcCC Q lcl|NC_010576. 271 AGMGLQNNLL-SDVRQLQQDFYNQMGITEAILN-----GTAN-EQQTLGYYNRCVDVLLQYVTDAISRIALTKT-AVSQG 342 (447) Q Consensus 271 l~~~~~~~~l-~~~~~~~~~Ia~~fgVP~~~l~-----g~~~-e~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~-e~~~g 342 (447) ++........ +..++..++|+.++.-.---.. |+.. .+.......+.+.-.++.|++.||+.|+..- .+-.| T Consensus 160 ~ea~g~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~~gGS~Alg~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~ 239 (355) T protein:vir:78 160 TGVQGKLPEMDGPIRYHDEQIARAVLAHFLTLGGDKSTGSYALGDTFASFFTGSLNAVMKHIADVTQQHVVEDLVDQNWG 239 (355) T ss_pred eecCCCcccHHHHHHHHHHHHHHHHhhhhhccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 7654433333 4457788889888754311111 2222 2333455567777888999999998886642 22111 Q ss_pred -----ceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHH-----HHHHHhCCCCCCCcccccccccc-ccchhhcccccC Q lcl|NC_010576. 343 -----QVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPN-----EIRELTGKAPHPNPLANELFNRN-IADGNQVGGINT 411 (447) Q Consensus 343 -----~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~N-----E~R~~~gl~p~~g~~~~~~~~~~-~~~~~~~~~~~~ 411 (447) .+|+|+ ... .+.+++++++.+++..|+..++ .+|+.+|+|.-..+ .+...+.. ..+..+.... . 
T Consensus 240 ~~~~~P~~~~~--~~~-~~~~~~a~~~~~l~~~G~~~~~~~~~~~~~e~~gip~p~~~-~~~~~~~~~~~~~~~~~~~-~ 314 (355) T protein:vir:78 240 PEEPAPRLVPA--QLG-KEQPVTAEAIRALVECGAFTADPELEKDLRARYGLPAPAER-DDGADAAAAKAAGRRRAKR-L 314 (355) T ss_pred CCCCCCEEEec--CcC-hhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCC-CcccCCccccccccccccc-c Confidence 134553 433 3556789999999999988664 47999999753222 11111110 0101111110 0 Q ss_pred CCCCCCCCCCcC-CCCCCCccc--------ccccCCccCcCC Q lcl|NC_010576. 412 PGQITSDQPATA-STDPLNNVS--------TSAIENGSLTDG 444 (447) Q Consensus 412 ~~~~~~~~~~~~-~~~~~~~~~--------~~~~~~~~~~~~ 444 (447) ++.......+.. ...+...+. .++. .=-+++| T Consensus 315 ~~~~~~~~~~a~~~~a~~~~~~~~~~~~~~~~~~-~~~~~~~ 355 (355) T protein:vir:78 315 PGQRQGAALPSRSPRADPPRRRGPLRRRPRHPAH-RRCAPDG 355 (355) T ss_pred CCccccccccccCCCCCChhhhHHHHHHhhcccc-CCCCCCC Confidence 110000000000 000000000 0111 1112233 No 149 >protein:vir:106027 Length: 629 # NCBI annotation: gp9 # Family: family:all:2798 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654906;genbank:gi:109392362;genbank:GeneID:4157055 Probab=97.61 E-value=3.7e-05 Score=44.85 Aligned_cols=425 Identities=13% Similarity=0.062 Sum_probs=178.9 Q ss_pred CchhHhhhhhccccc-CCcccc---cc-cccccccccccccccccc--ccCCcccccchhhhh-hHHHHHHHHHHHHhhc Q lcl|NC_010576. 1 MASSDRLLHSWNAFQ-SNQNQN---QN-TNDFLTPSNGMTSFGGYY--GRGQSNYSRSYSYNK-ADLIKSVITRIALDAS 72 (447) Q Consensus 1 Mg~~~~l~~~~~~f~-~~~~~~---~~-~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~-~~~v~~cv~~ia~~ia 72 (447) |.--+ |+ +.. +|-... -+ -+.+..|.--++....+. ..+|..--+ .++. 
.+-++-.|..|++.++ T Consensus 1 ma~~~-lr----v~rrpk~~p~~r~l~aasqp~~P~~~~~~~~~g~~~~~~WQ~eAW--~~~d~VgElryyvgW~~ss~S 73 (629) T protein:vir:10 1 MAAST-LR----VSRRPKGSPARRSLTAASQPMEPGRTPSRQVAGTVVRTSWQNEAW--ECMDLVGELRYYVGWRASSCS 73 (629) T ss_pred CCccc-ee----EEecCCCccceeeeccccCCCCcchhhchhhhhhhhhhhhhHHHH--HHHHhhhhHHHHhhhhhhhhe Confidence 44322 11 110 110011 00 111111111111111111 111111111 1222 2556778899999999 Q ss_pred cCceEEEEEcCCCceecc---ccchHH----HHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCccc-----c Q lcl|NC_010576. 73 MVDFKHLKIDPISGNQTP---MPSGLI----NVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDS-----G 140 (447) Q Consensus 73 ~lp~~~~r~~~~~~~~~~---~~~~l~----~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~-----~ 140 (447) ++.+..-+.+.+++.... .++|-. +..+.=--.-+...++++.+..+|-+-|+.|+++.....+... . T Consensus 74 r~rL~as~idpDtg~ptg~i~ed~p~~~~v~~~v~~iagG~lGqaqLlkr~~~~ltV~GE~~i~il~~~~~~pd~~~r~~ 153 (629) T protein:vir:10 74 RVELIASELDPDTGKPTGGIRDDDPDGLRFLEIVKTMAGGPLGQAQLQKRAAECLTVPGEHRICLLDQGDKNPDGSVRHN 153 (629) T ss_pred eeeEEEeeecCCCCCCccccccCchhHHHHHHHHHHhcCccchHHHHHHHHHhheeccCceEEEEeecCCCCCCcccccc Confidence 999988888876653211 233321 1122112234566788999999999999999987755443222 2 Q ss_pred eeeeccCCCcceeeecCCceEEEEeeecccccceeeeccccc-cccccccccccc----chhHHHHHHHHHHHH---HHH Q lcl|NC_010576. 141 SFDINTARVGKIMQFFPRQVMVRVWNDNTGLEQDLLVSKENC-IIIESPFYAILN----DTNQTLRMLEQKIKL---MNS 212 (447) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v-~~~~~~~~~~~~----~~~~~~~~~~~~~~~---~~~ 212 (447) |+.+....+ ...-.+...+..-. ...+......|+ +.+-.|...-.. 
...+.+..+..-..+ ..+ T Consensus 154 W~vVt~~Ei---~~kg~g~~~i~lpd----g~~he~~~~~D~l~RvW~P~Prr~~e~DSpvra~l~~lrEi~r~tk~i~~ 226 (629) T protein:vir:10 154 WYVVTNDEV---KNKGAGKTDIELPD----GTIHEYSKGRDVMFRVWNPRPRRAKEPDSPVRACLDSLREIIRTTKKIRN 226 (629) T ss_pred eeeecHHHh---ccccCceeEEEcCC----CceeeeeCCCeeEEEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHhHH Confidence 333322111 10000111111110 001111111111 111122111111 112222222221111 111 Q ss_pred HHHHhhcCcccceeeeCCcCC------------hHH------HHHHHHHHHHHHHH----Hhcc--C--CcceeecC--- Q lcl|NC_010576. 213 QDNRASSGKLNGFIQFPYSTK------------STA------RAAQAARRKQEIEN----EMAN--N--KYGVATLD--- 263 (447) Q Consensus 213 ~~~~~n~~~~~gvl~~~~~~~------------~~~------~~~~~~~~~~~~~~----~~~~--n--~~~~~vl~--- 263 (447) +.+ .-..-+|||=++..++ +.+ ..-..+.|.+.+.+ .+.+ + +.-++++. T Consensus 227 aak--SRL~gnGvlflP~e~slp~~~ap~~~~~Pg~~~p~~~g~aa~d~l~~~l~q~a~aAi~De~S~aA~vPiia~vP~ 304 (629) T protein:vir:10 227 ASK--SRLIGNGVVFLPQELSLPRATAPVADNQPGAPVPIVDGVAAADELSNLLFQTAAAAVDDEDSQAALIPLLATVPG 304 (629) T ss_pred HHH--hHHhhCceeEeccCcccccccCCCCCCCCcccccccCCCcchHHHHHHHHHHHHhhhcCCCCccceeeeEEeech Confidence 111 0011124432221110 000 00012222222222 2222 2 22222221 Q ss_pred -CCceeeecCCC--hhhhhHHHHHHHHHHHHHHhCCCHHHhcCCc-HH------HHHHHHHHHHHhHHHHHHHHHHHhhc Q lcl|NC_010576. 264 -TQEKFVSAGMG--LQNNLLSDVRQLQQDFYNQMGITEAILNGTA-NE------QQTLGYYNRCVDVLLQYVTDAISRIA 333 (447) Q Consensus 264 -~g~~~~~l~~~--~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~-~e------~~~~~f~~~ti~P~~~~ie~~l~~kL 333 (447) ..-+++.|.+. .+..-+..++..+..+|....|||..|-|.+ +. 
|....=++-.|.|.+..|+++|++.+ T Consensus 305 E~l~~ikhLkf~~eite~~iktR~daI~RlAmglDispErLLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~Ait~~~ 384 (629) T protein:vir:10 305 EHLQKIFHLKIGNEITEVEIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVQLHIKPVMEVLCAAIYREV 384 (629) T ss_pred HHhcCeeeeeecCchhHHHHhhHHHHHHHHHhccCCChhheeeccCCccceeeEEecccceeeecchHHHHHHHHHHhHH Confidence 22245555553 3445578889999999999999999885542 11 11122356779999999999999998 Q ss_pred CChhH----hcCC-ceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcccccc------------c Q lcl|NC_010576. 334 LTKTA----VSQG-QVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANEL------------F 396 (447) Q Consensus 334 l~~~e----~~~g-~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~------------~ 396 (447) |.+.- ++.. |-|=||.+.|. .|+ ++.+-...+.+.|.+|-...|+.+|+.--+++.-+.. - T Consensus 385 Lrp~L~~eGiDp~~Yvvw~DaS~Lt-~dP-d~~deA~~a~drGaIt~eAlRr~lG~~~dd~y~~~t~~~~q~~A~~~v~~ 462 (629) T protein:vir:10 385 LVATLRAEGIDPDRYVLWYDASGLT-VDP-DKTDEATAAKEQGAITHEAYRRYLGLADEDGYDLETLEGAQAWARDAIVA 462 (629) T ss_pred HHHHHHHhCCCHHHhEeeecCcccc-cCC-CCcHHHHHHHHcCCccHHHHHHHhccccccCCCcCCcHHHHHHHHHHhcC Confidence 76532 2222 44557777664 333 2333334578899999999999999988766421110 0 Q ss_pred cccccchhh-c-----ccccCCCC------CCCCCCCcCCCCCCCcccccccCCccCcCCCCC Q lcl|NC_010576. 397 NRNIADGNQ-V-----GGINTPGQ------ITSDQPATASTDPLNNVSTSAIENGSLTDGGSY 447 (447) Q Consensus 397 ~~~~~~~~~-~-----~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (447) ++.+.+..+ . .+..-+.. 
..+++++.+++.+.+ .+.++++ +....+- T Consensus 463 ~P~Li~~~apll~~~l~~i~~P~p~~a~~~~~~~~~~~E~~~~~~---e~~~e~d-A~~a~~~ 521 (629) T protein:vir:10 463 DPSLIKVLAPLLTDELAEIDWPEPPAALPPGEDDQADEEQDTTGS---EPSTEDD-AEAAARI 521 (629) T ss_pred CCchhhhhhhhcCCccccccccCCCCcCCCCCcccCccccCCCCC---CcCCCcc-hhhcccC Confidence 111111111 0 00000000 001111122222221 1222332 1222222 No 150 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=97.43 E-value=6.9e-05 Score=43.40 Aligned_cols=407 Identities=11% Similarity=0.011 Sum_probs=137.8 Q ss_pred CchhHhhhhhcccccCC-cccccc-cc---------ccccccccccccc-ccc------ccCCcccccchhhhhhHHHHH Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSN-QNQNQN-TN---------DFLTPSNGMTSFG-GYY------GRGQSNYSRSYSYNKADLIKS 62 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~-~~~~~~-~~---------~~~~~~~~~~~~~-~~~------~~~~~~~~~~~~~~~~~~v~~ 62 (447) ||+|+|++.++.-...+ ..+... .. ..+.........+ |.+ ...+... .+...+...-.. T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~~~~~--~~~~~slnl~~~ 78 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIK--SRPMNHLPIART 78 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccccCcchh--cccceecchHHH Confidence 99999988766522211 111000 00 0000000000000 100 0001000 111222222333 Q ss_pred HHHHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccc-- Q lcl|NC_010576. 63 VITRIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSG-- 140 (447) Q Consensus 63 cv~~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~-- 140 (447) +++.+|+-+..=|..+ . -++ +..+..+..+|. -|... ..|.+ .+...+..|..++.+..+....... 
T Consensus 79 i~~~~A~lv~~e~~~i-~--v~d---~~~~~~l~~~l~--~n~f~--~~~~~-~~e~a~a~G~~a~k~~~d~~~~~i~~v 147 (522) T protein:vir:47 79 ASKKIASLVYNEQATI-T--TKN---EILQKFLDDMLT--NDRFN--KNFER-YLESCLALGGLAMRPYIDGDKVRVAFI 147 (522) T ss_pred HHHHHhhhhcCCccee-e--cCC---hHHHHHHHHHHh--hcchH--HHHHH-HHHHhhccCCEEEEEEEcCCceEEEEE Confidence 4444444444333221 1 111 112333445553 23332 12333 3344444454444444443322111 Q ss_pred ---eeeeccCCCccee---ee---cC---Cce-EEEEeee-c--cccc-ceeeeccccccccc----------------- Q lcl|NC_010576. 141 ---SFDINTARVGKIM---QF---FP---RQV-MVRVWND-N--TGLE-QDLLVSKENCIIIE----------------- 186 (447) Q Consensus 141 ---~~~~~~~~~~~~~---~~---~~---~~~-~~~~~~~-~--~~~~-~~~~~~~~~v~~~~----------------- 186 (447) .+.|+......+. .+ .. ... .+..... . .+.. .......+.-.+|+ T Consensus 148 ~ad~~~P~~~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~ 227 (522) T protein:vir:47 148 QAPVFFPLESNTQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRVN 227 (522) T ss_pred cCCceEEEEEcCCceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCcccc Confidence 1111111100000 00 00 000 0000000 0 0000 00000000011111 Q ss_pred --------------------ccccc------------cccchhHHHHHHHHHHHHHHHHHHH----hhcCccc-----ce Q lcl|NC_010576. 187 --------------------SPFYA------------ILNDTNQTLRMLEQKIKLMNSQDNR----ASSGKLN-----GF 225 (447) Q Consensus 187 --------------------~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~----~n~~~~~-----gv 225 (447) .|+.. ....+-+.+..+...++.+...... ...++.+ .+ T Consensus 228 l~~~~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~ 307 (522) T protein:vir:47 228 LSELDKYKNLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQRRVIVPEHL 307 (522) T ss_pred ccccccccCCCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccceeecchHH Confidence 11100 0001112222222222222211111 0111111 11 Q ss_pred eeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCCh-hhhhHHHHHHHHHHHHHHhCCCHHHhcC- Q lcl|NC_010576. 
226 IQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGL-QNNLLSDVRQLQQDFYNQMGITEAILNG- 303 (447) Q Consensus 226 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~-~~~~l~~~~~~~~~Ia~~fgVP~~~l~g- 303 (447) ++......... ......+. .-...+..-. .-.+.+.+++.++... .+++....+...+.|+...|+++..++. T Consensus 308 l~~~~~~~~g~-~~~~~~fd-~~~~~f~~~~---~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~ 382 (522) T protein:vir:47 308 TQRQYQRPDGT-IDFRPRFD-VEQNVYMQIG---GSSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFD 382 (522) T ss_pred hccCCCCCCcc-cccccccC-cccceEeecC---CCCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCcc Confidence 11100000000 00000000 0000000000 0012233455555443 3345667788888999999999988852 Q ss_pred -----CcHHHH------------HHHHHHHHHhHHHHHHHHHHHh-hcCChhHhcCCceEEEecchhhhcCHHHHHHHHH Q lcl|NC_010576. 304 -----TANEQQ------------TLGYYNRCVDVLLQYVTDAISR-IALTKTAVSQGQVLVYYRNPFKLVPVEQLATVAD 365 (447) Q Consensus 304 -----~~~e~~------------~~~f~~~ti~P~~~~ie~~l~~-kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~ 365 (447) |++|.. ....++.+|..++..|-+..+. .++.. .....+.+.+++++-+..|..+.++... T Consensus 383 ~~~~kTAtEi~s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~-~~~~~~~i~v~f~D~i~~D~~~~~~~~~ 461 (522) T protein:vir:47 383 GQGMKTATEIVSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSG-EIPELDDISVNLDDGVFTDRHAELDYWA 461 (522) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccC-CCCCcceeEEEcCCCCCCCHHHHHHHHH Confidence 233321 1234556666665555544332 12111 1123457888999999999999999999 Q ss_pred HHHhCCCcCHHHHHHHhCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCcccccccCCc Q lcl|NC_010576. 366 VLTRNAIYTPNEIRELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTSAIENG 439 (447) Q Consensus 366 ~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 439 (447) +++.+|+|++-+++.+ ++++++.....-+ ........ ...+...+-..+.++.. 
..+++.| T Consensus 462 ~~v~aG~~s~e~~i~~--~~g~~eeea~~el----~ri~~E~~----~~~~~~~~~~~~~~~~~---~~~d~~~ 522 (522) T protein:vir:47 462 KMVAAGFSTKKRAIGK--TLNISGVEAEKEL----NAINSELL----PMNDAELAIYGMHDQNE---EKADDKG 522 (522) T ss_pred HHHhcCCCCHHHHHHh--cCCCChHHHHHHH----HHHHHhhc----cCCCCCCCCCCCCCccc---ccCCCCC Confidence 9999999999998765 3444443332211 11110000 00000000000001111 1111111 No 151 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=96.77 E-value=0.00035 Score=39.51 Aligned_cols=384 Identities=8% Similarity=-0.007 Sum_probs=143.7 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) +-+..+|.+.+.-..+ +......++........ .+. .....+.............-+|+..++-+-.-|+. +. T Consensus 7 ~~~~~~l~~~~~~~~~---r~~~l~~Yy~g~~~i~~-~~~--~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~-~~ 79 (456) T protein:vir:79 7 AEWLPVLTKRIDDGMS---RVRLLARYSNGDAPLPE-LTR--NTSAAWRSFQREARTNWGLMVRDSVADRIIPNGIT-VG 79 (456) T ss_pred HHHHHHHHHHHHHHHH---HHHHHHHHHhccCChhh-cCc--ccChhhchhhhhhhcchHHHHHHHHHhhhccCCee-cC Confidence 3344443332221111 01111111111000000 000 00000000000111224556777777766666765 22 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCccc-------ceeeeccCCCc-ce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDS-------GSFDINTARVG-KI 152 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~-------~~~~~~~~~~~-~~ 152 (447) ... +. ..+..+.+++.. |.. ..+...+..+++.+|.||+++..+..+... ..+++...... 
.+ T Consensus 80 ~~~-d~---~~~~~~~~~~~~--n~~---d~~~~~~~~~a~~~G~a~~~~~~~edg~~~i~~~~p~~~~~i~d~~~~~~~ 150 (456) T protein:vir:79 80 GSA-DS---DLALRARRIWRD--NRM---DSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRI 150 (456) T ss_pred CCC-Cc---cHHHHHHHHHHh--cCh---hHHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEeccceeEEEEcCCCCCce Confidence 111 11 122345666642 432 245667888999999999987665543221 01111000000 00 Q ss_pred ---eeec---CCce-EEEEeeec------------cccc-------ceeeeccccccc---------ccccccccccchh Q lcl|NC_010576. 153 ---MQFF---PRQV-MVRVWNDN------------TGLE-------QDLLVSKENCII---------IESPFYAILNDTN 197 (447) Q Consensus 153 ---~~~~---~~~~-~~~~~~~~------------~~~~-------~~~~~~~~~v~~---------~~~~~~~~~~~~~ 197 (447) ..++ .+.. ....|... .... .....+..++-| +.++ .+.+.+. T Consensus 151 ~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~--~~~gd~e 228 (456) T protein:vir:79 151 RSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNP--DGMGEVE 228 (456) T ss_pred EEEEEEEEecCCceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeecccccCCCCceeEEEecCC--CCCchhh Confidence 0000 0000 00000000 0000 000000011111 1111 0111111 Q ss_pred HH---HHHHHHHHHHHHHHHHHhhcCcccceeee---CCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeec Q lcl|NC_010576. 198 QT---LRMLEQKIKLMNSQDNRASSGKLNGFIQF---PYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSA 271 (447) Q Consensus 198 ~~---~~~~~~~~~~~~~~~~~~n~~~~~gvl~~---~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l 271 (447) .. ...+...+.-......+.. .+.-++.- .....++.- ... 
.....| ....++++.++.+.++.++ T Consensus 229 ~v~~liD~~~~~~s~~~~~~~~~a--~~~~~~~G~~~~~~~~d~~g-~~i-~~~~~~----~~~~~~~~~~~~~~~~~q~ 300 (456) T protein:vir:79 229 PHIDIINRINRAELQLLSTMAIQA--FRQRALKSSEHRLPKVDENG-NAI-DYASIF----EAAPGALWELPPGVDIWES 300 (456) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHh--hHHHHHhcCCcccccccccc-ccc-chhhhh----hhhccccccCCCCcceeee Confidence 11 1111111111111111111 11111100 000001000 000 011111 1223567778899998887 Q ss_pred CCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCc--HHHHHHH---------------HHHHHHhHHHHHHHHHHHhhcC Q lcl|NC_010576. 272 GMGLQNNLLSDVRQLQQDFYNQMGITEAILNGTA--NEQQTLG---------------YYNRCVDVLLQYVTDAISRIAL 334 (447) Q Consensus 272 ~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~--~e~~~~~---------------f~~~ti~P~~~~ie~~l~~kLl 334 (447) ....-..+.+.++.+..+|+..-++|++.+++.. ....... .+...|.-.++.+-. +. T Consensus 301 ~~~~~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~-----~~ 375 (456) T protein:vir:79 301 QTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQ-----IE 375 (456) T ss_pred cccChHHHHHHHHHHHHHHHhhcCCChhHhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hc Confidence 6554445667788899999999999999997631 1111222 222333332222211 11 Q ss_pred ChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccccccccchhhcccccCCCC Q lcl|NC_010576. 335 TKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQ 414 (447) Q Consensus 335 ~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 414 (447) ...+ ...++..+......+..+.++++.+++..|+++..-+++++|+.+-+-... +. +.... +.... .++ T Consensus 376 g~~~---~~~i~v~w~~~~~~s~~~~ada~~kl~~~G~~~~~~~~~~lg~~~~~i~~~-e~---~r~~~-e~~~~--~~~ 445 (456) T protein:vir:79 376 GESV---EDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQD-DL---DRARE-QITLF--AGN 445 (456) T ss_pred CCCc---cccceEEeCCCCCcCHHHHHHHHHHHHhcCCChHHHHHhcCCCCHHHHHHH-HH---HHHHH-HHHHH--hhh Confidence 1111 123444444556678899999999999999999888888888866321100 00 00000 00000 000 Q ss_pred CCCCCCCcCCCCC Q lcl|NC_010576. 
415 ITSDQPATASTDP 427 (447) Q Consensus 415 ~~~~~~~~~~~~~ 427 (447) -.+. +.+..+. T Consensus 446 ~~~~--~~~~~~~ 456 (456) T protein:vir:79 446 PVQR--PQEDGSR 456 (456) T ss_pred Hhhc--CCCCCCC Confidence 0000 0000000 No 152 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=96.70 E-value=0.0004 Score=39.19 Aligned_cols=400 Identities=9% Similarity=0.018 Sum_probs=143.1 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) .-+..+|.+ ....+..+......++........ .+... ...+.............-+|+.+++-+- +.-|+ T Consensus 29 ~~l~~~l~~---~~~~~~~rl~~l~~YY~G~~~~~~-~~~~~--~~~~~~~~~~~v~n~~~~ivd~~a~~l~---~~gf~ 99 (501) T protein:vir:25 29 GALVADMWR---LHISERQWLDRIYEYTKGLRGRPE-VPEGA--SDEVKELAKLSVKNVLSLVRDSFAQNLS---VVGYR 99 (501) T ss_pred HHHHHHHHH---HHHHHHHHHHHHHHHHhcCCCchh-ccccC--ChhhhhhHhhhhcChHHHHHHHHHhhhc---cccee Confidence 111111111 111111111111111111000000 00000 0000000000011123334444444331 22243 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeee--cCC Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQF--FPR 158 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~ 158 (447) .. ++. .+..+..++. =|... ..-..+..+++.+|.||+++..++.++....+. |..+.-++.. ... 
T Consensus 100 ~~-d~~----~~~~l~~i~~--~N~~d---~~~~~~~~~a~i~G~ay~~v~~de~~~~i~~~s--p~~~~~iy~D~~~~~ 167 (501) T protein:vir:25 100 NA-LAK----ENDPAWEMWQ--RNRMD---ARQAEVHRPALTYGASYVTVTPTDEGPVFRTRS--PRQILAVYADPSVDA 167 (501) T ss_pred cC-Ccc----chHHHHHHHH--hcChh---HHHHHHHHHHhhcCceEEEEecCCCCCeEEEec--cccEEEEEecCCCCc Confidence 22 111 2345555553 35433 445567788899999999887776553221111 1111000000 000 Q ss_pred ceE--EEEeeeccccc--c-------e-e-------------------------------------e--ecccccccccc Q lcl|NC_010576. 159 QVM--VRVWNDNTGLE--Q-------D-L-------------------------------------L--VSKENCIIIES 187 (447) Q Consensus 159 ~~~--~~~~~~~~~~~--~-------~-~-------------------------------------~--~~~~~v~~~~~ 187 (447) .+. ++++....... . . + . +..-.++++.+ T Consensus 168 ~~~~ai~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N 247 (501) T protein:vir:25 168 WPQYALETWVAQKDAKPHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDVIEHGATFEGKPVCPVVRFVN 247 (501) T ss_pred ceeEEEEEEeeccccCcceeEEEecCeeEEEEecCceeeeeccccccccccccccccccccccccccCCccceeeEeccC Confidence 000 00000000000 0 0 0 0 00001333332 Q ss_pred ccc---ccccchh---HHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceee Q lcl|NC_010576. 188 PFY---AILNDTN---QTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVAT 261 (447) Q Consensus 188 ~~~---~~~~~~~---~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~v 261 (447) -.. .+.+.+. +....+...+..+....++... ....++... .+.. +.++ -..++++. T Consensus 248 ~~~~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~-p~~~i~G~~----~~~~----~~~~--------~~~~~i~~ 310 (501) T protein:vir:25 248 GRDADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGAN-PQRVISGWT----GSKA----EVLK--------ASALRVWT 310 (501) T ss_pred ccccCccccchhhhhHHHHHHHHHHHHHHHHHHHhhcc-HHHHHhCCC----CCcc----chhh--------hcccceec Confidence 100 0111111 1112222222222222222211 111121111 1111 1111 11345666 Q ss_pred cC-CCceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCcH--HHHHHHHHHHHHhHHHHHHHHHHHhhc----- Q lcl|NC_010576. 
262 LD-TQEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNGTAN--EQQTLGYYNRCVDVLLQYVTDAISRIA----- 333 (447) Q Consensus 262 l~-~g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~--e~~~~~f~~~ti~P~~~~ie~~l~~kL----- 333 (447) ++ .+.+|.++....-..+++.++.+..+|+..=++|++.+++... ......+....|.-.+...+..|...| T Consensus 311 ~~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~r 390 (501) T protein:vir:25 311 FEDPEVKAQAFPPASVEPYNLILEEMLQHVAMVAQISPAQVTGKMINVSAEALAAAEANQQRKLAAKRESFGESWEQLLR 390 (501) T ss_pred cCCCCceEEEecccChHHHHHHHHHHHHHHHhhcCCChhhhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 66 4667766654333345677888899999999999999976422 222222222222222222222211111 Q ss_pred -----CChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHH-HhCCCCCCCccccccccccccchhhcc Q lcl|NC_010576. 334 -----LTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRE-LTGKAPHPNPLANELFNRNIADGNQVG 407 (447) Q Consensus 334 -----l~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~-~~gl~p~~g~~~~~~~~~~~~~~~~~~ 407 (447) ....+......+++.+......+.++.++++.++++.|+ +.-.+.. +.|+.+-+ -..... .....+.. T Consensus 391 l~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~gi-s~et~~~~~~g~~~~~---ie~~~~--~~~e~~~~ 464 (501) T protein:vir:25 391 LAAEMDDDPDTAADSGAEVLWRDTEARSFGAVVDGITKLASAGI-PIEHLLSMVPGMTQQT---IQAIKD--SLRGGEVK 464 (501) T ss_pred HHHHHhCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCC-CHHHHHHHcCCCCHHH---HHHHHH--HHHHHhHH Confidence 111111122356666677778899999999999998885 4433333 34665411 111100 00000000 Q ss_pred cccCCCCCCCCCCCcCCCCCCCcccccccCCccCcCCCC Q lcl|NC_010576. 408 GINTPGQITSDQPATASTDPLNNVSTSAIENGSLTDGGS 446 (447) Q Consensus 408 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 446 (447) .. 
-......++.+....+.+.+..++++.+...+||+ T Consensus 465 ~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 501 (501) T protein:vir:25 465 SL--VDKLLSNEPAPVPPPPPQAAAQALNEGGVNGNGGA 501 (501) T ss_pred HH--HHHhhccCcCCCCCCCCCCCccccccccCCCCCCC Confidence 00 00000111112223333444444555555556666 No 153 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=96.67 E-value=0.00043 Score=39.06 Aligned_cols=401 Identities=8% Similarity=0.018 Sum_probs=142.7 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) .-+..+|.+.+.- +..+......++........ .+... .... .+.........-+|+.+++.+--.. |+ T Consensus 14 ~~~~~~l~~~~~~---~~~rl~~l~~Yy~G~~~i~~-~~~~~--~~~~--~~~~~~~n~~~~ivd~~~~~l~~~g---~~ 82 (484) T protein:vir:77 14 EKAREEMLNLFTE---RTQDLGDNTAYYESERRPDA-VGVTV--PQQM--QKLLAHVGYPRLYIDAIAARQELEG---FR 82 (484) T ss_pred HHHHHHHHHHHHH---HHHHHHHHHHHHhccccchh-ccccc--chhH--HhhhhhcCcHHHHHHHHHhhhccCc---ee Confidence 2222222222211 00011111111111000000 00000 0000 0000111233344555544332222 33 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCccc------ceeeeccC------- Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDS------GSFDINTA------- 147 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~------~~~~~~~~------- 147 (447) ... + ...+..+..++. =|.. ......+..+.+.+|.||+++..+..+... 
..+.+.++ T Consensus 83 ~~~-~---~~~~~~l~~i~~--~N~~---d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~~~ 153 (484) T protein:vir:77 83 LGG-A---DKADEQLWDWWQ--ANDL---DIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVEPPTNLYAQI 153 (484) T ss_pred cCC-c---chhHHHHHHHHH--hcCH---hHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEEEeccceeEEEe Confidence 221 1 112344666554 2433 345677888999999999988776554221 01111111 Q ss_pred --CCcceee----ec---CCce-EEEEee--------ecccccce---eeecc--ccccccccccc----ccccch---- Q lcl|NC_010576. 148 --RVGKIMQ----FF---PRQV-MVRVWN--------DNTGLEQD---LLVSK--ENCIIIESPFY----AILNDT---- 196 (447) Q Consensus 148 --~~~~~~~----~~---~~~~-~~~~~~--------~~~~~~~~---~~~~~--~~v~~~~~~~~----~~~~~~---- 196 (447) ...++.. ++ .+.. .+.+|. ...+.+.. ..++- =.|+++.+... .+.+.+ T Consensus 154 D~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~N~~~~~~~~G~s~i~~~v 233 (484) T protein:vir:77 154 DPRTRQVMRAIRAIEDEEGNEVIGATLYLPNNTVIWNREDGQWVQVANVAHNLEMVPVIPIPNRTRLSDLYGTTEITPEL 233 (484) T ss_pred cCCCCceEEEEEEEEeecCCcEEEEEEEecCeEEEEEecCCceEeeccccCCCCCcceEEeccccccCccCCcccchHHH Confidence 1111100 00 0000 011110 00111100 01111 12344432111 011111 Q ss_pred hHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecC-CCceeeecCCCh Q lcl|NC_010576. 197 NQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLD-TQEKFVSAGMGL 275 (447) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~-~g~~~~~l~~~~ 275 (447) .+....+...+.-+....++.. .+.-++. +........+ ...-...| ....++++.++ .+.+|.++.... 
T Consensus 234 ~~L~Da~~~~~s~~~~~~~~~a--~p~~~i~-G~~~~~~~~~--~~~~~~~~----~~~~~~~~~~~~~~~~~~q~~~~~ 304 (484) T protein:vir:77 234 RSVTDAAARTLMLMQATAELMG--VPQRLLF-GVKGEELGVD--PETGQTLF----DAYLARILAFEDHESKAQQFSAAE 304 (484) T ss_pred HHHHHHHHHHHHHHHHHHHhhh--hhHHHHh-CCCcchhccc--ccccchhh----hhhhhhhcccCCCCceeEeecCCC Confidence 1111222222222222222221 1211221 1111111000 00001111 11224566665 467787776554 Q ss_pred hhhhHHHHHHHHHHHHHHhCCCHHHhcCCc---HHHHHHHH---------------HHHHHhHHHHHHHHHHHhhcCChh Q lcl|NC_010576. 276 QNNLLSDVRQLQQDFYNQMGITEAILNGTA---NEQQTLGY---------------YNRCVDVLLQYVTDAISRIALTKT 337 (447) Q Consensus 276 ~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~---~e~~~~~f---------------~~~ti~P~~~~ie~~l~~kLl~~~ 337 (447) -..+++.++....+|+.+=++|++.+++.. .....+.+ +...|.-.++.+....+..-.+ T Consensus 305 ~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~~~~~~~~-- 382 (484) T protein:vir:77 305 LRNFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEAIRSSESRLVKTVERKNKIFGGAWEQAMRVAYKVMNGGDIP-- 382 (484) T ss_pred hHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc-- Confidence 445667778888899999999999997642 11111112 2222222222221111110000 Q ss_pred HhcCCceEEEecchhhhcCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCccccccccccccc----hhhc-cccc Q lcl|NC_010576. 338 AVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNA--IYTPNEIRELTGKAPHPNPLANELFNRNIAD----GNQV-GGIN 410 (447) Q Consensus 338 e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G--~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~----~~~~-~~~~ 410 (447) .....+++.+......+.++.++.+.+++++| +++..-+++++|+-+-+-..........-.. ..+. +... T Consensus 383 --~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~s~et~~~~l~~~~~~~~e~~~~~~ee~~~~~~~~~~~~~~~~ 460 (484) T protein:vir:77 383 --PEYYRMESIWRDPSTPTYAAKADAATKLYNNGQGVIPKERARIDMGYSITEREEMRKWDEEEQAQGLGLMGTMFGTDP 460 (484) T ss_pred --cccccceEEecCCCCCCHHHHHHHHHHHHhccCCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHhhhccccc Confidence 11134555556666788999999999999876 8888888888877543211000010000000 0000 1111 Q ss_pred CCCCCCCCCCCcCCCCCCCcccccccCCccCcCC Q lcl|NC_010576. 
411 TPGQITSDQPATASTDPLNNVSTSAIENGSLTDG 444 (447) Q Consensus 411 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 444 (447) ..++.++.+++.++.++. ++...| T Consensus 461 ~~~~~~~~~~~~~~~~~~----------~~~~~~ 484 (484) T protein:vir:77 461 SGGGNPDNPETPEPQPNP----------AEEAAA 484 (484) T ss_pred cCCCCCCCCCcccccCCC----------ccccCC Confidence 111111111111111111 111111 No 154 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=96.09 E-value=0.001 Score=36.98 Aligned_cols=406 Identities=7% Similarity=0.025 Sum_probs=134.4 Q ss_pred CchhH-hhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEE Q lcl|NC_010576. 1 MASSD-RLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHL 79 (447) Q Consensus 1 Mg~~~-~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~ 79 (447) ...+. +|.+.+.- +.++......++............. .. ................-+|+.+++.+- +.-| T Consensus 15 ~~~~~~~l~~~~~~---~~~r~~~~~~YY~g~~~i~~~~~~~-~~-~~~~~~~~~~~~n~~~~iVd~~~~~l~---~~gf 86 (479) T protein:vir:99 15 AKYLETKVFPKMNT---ECERLDDFEAWTKNGQEVPDLATRH-KN-KEREVLQQLSRKPWMGLMVNSFAQQLI---VDGY 86 (479) T ss_pred HHHHHHHHHHHHHH---HhHHHHHHHHHHhcCCccccccccc-CC-hhHHHHHHHhhcCcHHHHHHHHHhhcc---cccc Confidence 11111 12111110 0000111111111000000000000 00 000000000111233445555554332 2223 Q ss_pred EEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccC----Ccccceeeec-cCCC----- Q lcl|NC_010576. 80 KIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTV----DPDSGSFDIN-TARV----- 149 (447) Q Consensus 80 r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~----~~~~~~~~~~-~~~~----- 149 (447) +.. +. ..+..+.+++.. |... .....+..+++.+|.||+++..... .+.. .+.+. 
+..+ T Consensus 87 ~~~--d~---~~~~~~~~i~~~--N~~d---~~~~~~~~~a~~~G~af~~v~~~~~~~d~~g~~-~i~~~~p~~~~~iyd 155 (479) T protein:vir:99 87 RKT--GT---NENAKGWDTWRL--NQMD---KQQFWLNRAVLTFGYAFIKVTSGISPLDGTTVA-RIKCIDPRDAFAIWE 155 (479) T ss_pred cCC--Cc---hhhHHHHHHHHh--cChh---HHHHHHHHHHhhcCceEEEEecCCCCcCCCCce-EEEEechhheEEEec Confidence 321 11 123456666652 4332 4556677888999999998654211 1111 11111 1100 Q ss_pred ----cc--eeeec-CCceEEEEe--------eecccccc---eeeec--ccccccccccc---cccccchhHHHH---HH Q lcl|NC_010576. 150 ----GK--IMQFF-PRQVMVRVW--------NDNTGLEQ---DLLVS--KENCIIIESPF---YAILNDTNQTLR---ML 203 (447) Q Consensus 150 ----~~--~~~~~-~~~~~~~~~--------~~~~~~~~---~~~~~--~~~v~~~~~~~---~~~~~~~~~~~~---~~ 203 (447) .. .+... .......+| ....+.+. ...|. .-.++++.+-- ..+.+.+..... .+ T Consensus 156 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~g~sd~e~v~~liDa~ 235 (479) T protein:vir:99 156 DPYWDEWPKYLLERQPNGQYWWWTEEDYSIFEFKQGKFIYRETVSHDYGHIPFVRYVNVMDLRGVCYGDVEPLVTVAKAI 235 (479) T ss_pred CCcccceeeEEEeecCceeEEEEecceEEEEEecCCceeeccccccCCCCcceEEeecCCCcCcCCcchhHHHHHHHHHH Confidence 00 00000 000011111 11111111 00110 01123333210 111222221111 11 Q ss_pred HHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceee-cCCCceeeecCCChhhhhHHH Q lcl|NC_010576. 204 EQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVAT-LDTQEKFVSAGMGLQNNLLSD 282 (447) Q Consensus 204 ~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~v-l~~g~~~~~l~~~~~~~~l~~ 282 (447) ...+.-+.....+. +.+.-++. +....++.. ..... + .-..++++. -+.+.++.++....-..+++. 
T Consensus 236 ~~~~s~~~~~~~~~--a~p~~~i~-G~~~~~~~~-~~~~~----~----~~~~~~i~~~~~~~~~~~q~~~~~~~~~~~~ 303 (479) T protein:vir:99 236 DKTGLDILLVQHHQ--SFQIRWAT-GLMLPEGAN-ADQEK----M----RFAQESMLISQNEKASFGAIPAAPLDGLLNA 303 (479) T ss_pred HHHHHHHHHHHHHh--hchhhhhc-CCCcccccc-cchhc----c----ccccccceeecCCCceEEEecccchHHHHHH Confidence 11222222222222 22222221 111111100 00000 1 011233433 355667766653333345666 Q ss_pred HHHHHHHHHHHhCCCHHHhcCCc-HHHHHHH---------------HHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEE Q lcl|NC_010576. 283 VRQLQQDFYNQMGITEAILNGTA-NEQQTLG---------------YYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLV 346 (447) Q Consensus 283 ~~~~~~~Ia~~fgVP~~~l~g~~-~e~~~~~---------------f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~ 346 (447) ++....+|+..=++|++.+|... ....... .+...|.-+++.+-...+. .+....+.++ T Consensus 304 l~~~i~~i~~~t~~p~~~~g~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~~~~-----~~~~~~~~i~ 378 (479) T protein:vir:99 304 YKESLLEFLALAQLPPHIAGQIVNVAADALAAGTRQTMQKLFEKQATWKASHNQTMRLVNKIEGR-----TEEATDLDFT 378 (479) T ss_pred HHHHHHHHhccCCCCHHHcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC-----Cccccceeee Confidence 77788889998999999986321 1111122 2233333333333221111 0001112345 Q ss_pred EecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh-CCCCCCCcccccccccc--ccc-hhhcccccCCCCCCCCCCC- Q lcl|NC_010576. 347 YYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELT-GKAPHPNPLANELFNRN--IAD-GNQVGGINTPGQITSDQPA- 421 (447) Q Consensus 347 f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~-gl~p~~g~~~~~~~~~~--~~~-~~~~~~~~~~~~~~~~~~~- 421 (447) +.+......+..+.+++..++++.|+++.-.+.+++ |+.+-+=..-....... ... ..+......+.+..+..++ T Consensus 379 ~~w~~~~~~s~~~~ad~~~kl~~ag~is~et~l~~l~gv~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (479) T protein:vir:99 379 ITWQDVTIQSLAQFADAWAKMVESLKIPAEGVWDMIPNLDQSTVNGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGPNGA 458 (479) T ss_pred EEecCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCCCCCC Confidence 544455566888999999999999999988787776 66542100000000000 000 0001000011110000000 Q ss_pred cCCCCCCCcccccccCCccCcCCC Q lcl|NC_010576. 
422 TASTDPLNNVSTSAIENGSLTDGG 445 (447) Q Consensus 422 ~~~~~~~~~~~~~~~~~~~~~~~~ 445 (447) .+.+...+.+..++- --..|| T Consensus 459 ~~~~~~~~~~~~~~~---~~~~~~ 479 (479) T protein:vir:99 459 TNMQQANNKTGEPAS---LNKSGA 479 (479) T ss_pred CCCCCCCCCCcchhc---cCCCCC Confidence 000111111111111 112233 No 155 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=95.92 E-value=0.0013 Score=36.48 Aligned_cols=402 Identities=9% Similarity=0.048 Sum_probs=142.6 Q ss_pred Cc-------hhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhcc Q lcl|NC_010576. 1 MA-------SSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASM 73 (447) Q Consensus 1 Mg-------~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~ 73 (447) |. ++++|.+-|.-.++ +......++..........-... ......+ .......-+|+.+++.+-- T Consensus 8 ~~~~~~~~~~~~~l~~~~~~~~~---r~~~~~~Yy~G~~~i~~~~~~~~---~~~~~~~--~~~n~~~~ivd~~~~~l~~ 79 (485) T protein:vir:10 8 QEEIEDPAIARDEMVSAFEDSTQ---NLKTNTSYYEAERRPEAIGVTVP---IQMQSLL--AHVGYPRLYVDSIAERQAV 79 (485) T ss_pred CCCCCCHHHHHHHHHHHHHHHHH---HHHHHHHHHhcCCcchhcCCCCC---hhhhhhh--hhcCcHHHHHHHHHhhhcc Confidence 21 23333322221111 01111111110000000000000 0000000 0111234455555444321 Q ss_pred CceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCccc------ceeeecc- Q lcl|NC_010576. 74 VDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDS------GSFDINT- 146 (447) Q Consensus 74 lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~------~~~~~~~- 146 (447) .. |+..+ + ...+..+.+++.. |. ...+...+...++.+|.||+++.++...... 
..+.+.+ T Consensus 80 ~g---~~~~~-~---~~~~~~~~~i~~~--N~---~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p 147 (485) T protein:vir:10 80 EG---FRFGD-A---DEADEELWQWWQA--NN---LDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRVEPP 147 (485) T ss_pred cc---eecCC-C---chhHHHHHHHHHh--cC---HhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEEEcc Confidence 12 33211 1 1123345555532 32 2356677888999999999987765432110 0111111 Q ss_pred --------CCCccee----eec---CCceE-EEEeeec--------ccccc---eeeec--cccccccccccc----ccc Q lcl|NC_010576. 147 --------ARVGKIM----QFF---PRQVM-VRVWNDN--------TGLEQ---DLLVS--KENCIIIESPFY----AIL 193 (447) Q Consensus 147 --------~~~~~~~----~~~---~~~~~-~~~~~~~--------~~~~~---~~~~~--~~~v~~~~~~~~----~~~ 193 (447) .....+. .++ .+... +.+|... .+.+. ...++ .-.|+++.+-.. .+. T Consensus 148 ~~~~~~~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~ 227 (485) T protein:vir:10 148 TRMYAEIDPRIGRVSKAIRVAYDAEGNEIQAATLYTPNDIFGWYRVENEWQEWFNNPHGLGVVPVVPIPNRTRLSDLYGT 227 (485) T ss_pred ceeEEEEcCCCCceeEEEEEEEeeCCCeEEEEEEEeCCeEEEEEEcCCceEEeccccCCCCcccEEEeccccccCCCCCc Confidence 1111110 000 01110 1111100 00110 00011 012233332100 011 Q ss_pred cch----hHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecC-CCcee Q lcl|NC_010576. 194 NDT----NQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLD-TQEKF 268 (447) Q Consensus 194 ~~~----~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~-~g~~~ 268 (447) +.+ ......+...+.-+....++.. .+.-++. +........+.. 
.-...| ....++++.++ .+.+| T Consensus 228 s~i~~~v~~liDa~~~~~s~~~~~~~~~a--~p~~~i~-G~~~~~~~~~~~--~~~~~~----~~~~~~i~~~~~~d~k~ 298 (485) T protein:vir:10 228 SEITPELRSMTDAAARILMLMQATAELMG--VPQRLIF-GIKPEEIGVDPE--TGQTLF----DAYLARILAFEDAEGKI 298 (485) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhhc--chHHHHh-cCCccccccccc--ccchhh----hhcccceeccCCCCceE Confidence 111 1111222222222222222221 1211221 001111000000 000011 11234566665 46677 Q ss_pred eecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCc---HHHHHH---------------HHHHHHHhHHHHHHHHHHH Q lcl|NC_010576. 269 VSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNGTA---NEQQTL---------------GYYNRCVDVLLQYVTDAIS 330 (447) Q Consensus 269 ~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~---~e~~~~---------------~f~~~ti~P~~~~ie~~l~ 330 (447) .++....-..+++.++....+|+..=++|++.+++.. .....+ ..+...|..+++.+-...+ T Consensus 299 ~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~~~~ 378 (485) T protein:vir:10 299 QQFSAAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYRMMK 378 (485) T ss_pred EeecccchHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Confidence 7776544444567777888889999999999997642 111111 1233333333332221111 Q ss_pred hhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCccccccccccccc-hhhcc Q lcl|NC_010576. 331 RIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNA--IYTPNEIRELTGKAPHPNPLANELFNRNIAD-GNQVG 407 (447) Q Consensus 331 ~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G--~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~-~~~~~ 407 (447) . .........+++.+..-+..+.++.++++.+++..| +++..-+++++|+.+-+-.....+....... ..... 
T Consensus 379 ~----~~~~~~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~~s~et~~~~lg~~~~~~~~~~~~~ee~~~~~~~~~~ 454 (485) T protein:vir:10 379 G----GDVPPDMLRMETVWRDPSTPTYAAKADAASKLYNGGTGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLIG 454 (485) T ss_pred C----CCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCCHhHHHHHHHHHHHHHHHHHHHHH Confidence 0 000011235666666677788999999999999866 8888888888888653211000011100000 00000 Q ss_pred cccCCCCCCCC--CCCcCCCCC-CCcccccc Q lcl|NC_010576. 408 GINTPGQITSD--QPATASTDP-LNNVSTSA 435 (447) Q Consensus 408 ~~~~~~~~~~~--~~~~~~~~~-~~~~~~~~ 435 (447) ....++...++ ++.++.+++ .++..++| T Consensus 455 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 485 (485) T protein:vir:10 455 TMVDPNPTVPGSPSPAPAPKPAALESGGDAA 485 (485) T ss_pred HhhccCCCCCCCCCccccccCcCCCCCCCCC Confidence 00111111111 111111111 12223333 No 156 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=95.87 E-value=0.0013 Score=36.35 Aligned_cols=367 Identities=9% Similarity=-0.038 Sum_probs=135.8 Q ss_pred ccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEEEcCCCceeccccc Q lcl|NC_010576. 14 FQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLKIDPISGNQTPMPS 93 (447) Q Consensus 14 f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r~~~~~~~~~~~~~ 93 (447) +-+++. ...+.............-||+.+++.+- +.-|+. +++ ..+. T Consensus 1 ~l~~~~-------------------------~~~~~~~~~~~v~n~~~~ivd~~~~~l~---~~gf~~-~d~----~~~~ 47 (434) T protein:vir:98 1 MLPKNA-------------------------EQAFLDFQRKARTNFCGLIANASVHRLL---ALGVTG-PDG----EPDT 47 (434) T ss_pred CCCCCc-------------------------cHHHHHhhhhhhccchHHHHHHHHhhhc---cCceec-CCC----chHH Confidence 001000 0000000000011123345555554332 222332 111 1234 Q ss_pred hHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCccc-----ceeeeccCC---------Ccce----eee Q lcl|NC_010576. 
94 GLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDS-----GSFDINTAR---------VGKI----MQF 155 (447) Q Consensus 94 ~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~-----~~~~~~~~~---------~~~~----~~~ 155 (447) .+.+++. =|... .....+..+.+.+|.||+++..+..+... ..+.+.++. ..++ ..+ T Consensus 48 ~~~~i~~--~N~~d---~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~ 122 (434) T protein:vir:98 48 RASRWWQ--ANRLD---SRQKLVWRMAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVW 122 (434) T ss_pred HHHHHHH--hcChh---HHHHHHHHHHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEE Confidence 4666664 25433 35556778889999999987665432111 011111111 0000 000 Q ss_pred c---CCceEEEEee---------e--cccccc---e-----------eee--ccccccccccc-cc--ccccchhHHH-- Q lcl|NC_010576. 156 F---PRQVMVRVWN---------D--NTGLEQ---D-----------LLV--SKENCIIIESP-FY--AILNDTNQTL-- 200 (447) Q Consensus 156 ~---~~~~~~~~~~---------~--~~~~~~---~-----------~~~--~~~~v~~~~~~-~~--~~~~~~~~~~-- 200 (447) . .+......+. . ...... . ..| ..-.++++.+. .. .+.+.+...+ T Consensus 123 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~~g~sd~e~vi~l 202 (434) T protein:vir:98 123 HNDIDGFGYARVFFDDTSFPYRTRERTGARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGEDPEPEFAGVLDI 202 (434) T ss_pred EeccCCceEEEEEEeCcEEEEEEeeccccccccccccceecccccccccCCCCccceEEeccCCCcCcCCcchhhhHHHH Confidence 0 0011100000 0 000000 0 000 00112333321 00 1111121111 Q ss_pred -HHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecC-CCceeeecCCChhhh Q lcl|NC_010576. 201 -RMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLD-TQEKFVSAGMGLQNN 278 (447) Q Consensus 201 -~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~-~g~~~~~l~~~~~~~ 278 (447) ..+...+..+....++. +.+.-++. +....+.. . ........+ +.+....++++.++ .+.++.++....-.. 
T Consensus 203 iDa~~~~~s~~~~~~~~~--a~p~~~i~-G~~~~~~~-~-~~~~~~~~~-~~~~~~~~~i~~~~~~~~~~~q~~~~~~~~ 276 (434) T protein:vir:98 203 QDRVNLGILNRMAASRFS--GFRQKWIK-GHKFAKRT-D-PATGMTVVD-QPFVPSPSAVWASEGENTQFGQLDATDLSG 276 (434) T ss_pred HHHHHHHHHHHHHHHHHh--cchhhhhc-CCCccccc-c-cccccchhh-hhhhccccccccCCCCCceEEEecCcchHH Confidence 11222222222222222 22221221 11111100 0 000000011 11222335566665 356777776544445 Q ss_pred hHHHHHHHHHHHHHHhCCCHHHhcCCc--HHHHHHHHHHHHHhHHHHHHHHHHHhhc----------CChhHhcCCceEE Q lcl|NC_010576. 279 LLSDVRQLQQDFYNQMGITEAILNGTA--NEQQTLGYYNRCVDVLLQYVTDAISRIA----------LTKTAVSQGQVLV 346 (447) Q Consensus 279 ~l~~~~~~~~~Ia~~fgVP~~~l~g~~--~e~~~~~f~~~ti~P~~~~ie~~l~~kL----------l~~~e~~~g~~i~ 346 (447) +++.++....+|+..=++|++.++++. .......|....|.-.+...+..|...| -.... ....++ T Consensus 277 ~~~~l~~~i~~~~~~~~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~g~~~--~~~~~~ 354 (434) T protein:vir:98 277 FLKEHASDVRDMLTISQTPTYLYATDLVNISADTIGALDILHVAKVREHIASFSEGLESVLALAAAQAGVPE--DYTEAE 354 (434) T ss_pred HHHHHHHHHHHHhcccCCCHHHhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCh--hheeee Confidence 667788889999999999999998642 2111222222222222222222211111 01000 113455 Q ss_pred EecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCC Q lcl|NC_010576. 347 YYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTD 426 (447) Q Consensus 347 f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 426 (447) +.+..-...+..+.++++.+++..|+ +..-+++++|+++-+ -..... . ..+. ...... ..+..++. T Consensus 355 v~w~~~~~~s~~~~ada~~kl~~~g~-~~e~~~~~lg~~~~e---~~r~~~-e---~~~~---~~~~~~---~~~~~~~~ 420 (434) T protein:vir:98 355 VRWANPAHVTMAVKADAATKLKSIGY-PLDVIAEELDESPAR---VRRIVA-G---AASQ---ALLAAS---LLPAPGAP 420 (434) T ss_pred EEecCCCCCCHHHHHHHHHHHHhcCC-cHHHHHHhCCCCHHH---HHHHHH-H---HHHH---HHHHHh---hhccCCCC Confidence 55566777899999999999998885 777778888876521 000100 0 0000 000000 00000011 Q ss_pred CCCcccccccCCccCcCC Q lcl|NC_010576. 
427 PLNNVSTSAIENGSLTDG 444 (447) Q Consensus 427 ~~~~~~~~~~~~~~~~~~ 444 (447) +..+ ....|.+++| T Consensus 421 ~~g~----~~~~~~~~dg 434 (434) T protein:vir:98 421 SAGN----VPDSGGAVDG 434 (434) T ss_pred CCCC----CCcccCCCCC Confidence 1111 1123445555 No 157 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=95.54 E-value=0.0019 Score=35.53 Aligned_cols=391 Identities=8% Similarity=0.012 Sum_probs=142.2 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) +.-..|++.....+..+..-.... . ... ......+ .......-+|+..++-+---+| + T Consensus 26 ~~~~~r~~~~~~YY~G~~~i~~~~-~-------------~~~---~~~~~~~--~~~n~~~~ivd~~~~~l~~~g~---~ 83 (485) T protein:vir:24 26 EDQNQNLRSNTSYYEAERRPEAIG-V-------------TVP---VQMQSLL--AHVGYPRLYVDSIAERQAVEGF---R 83 (485) T ss_pred HHHHHHHHHHHHHHhccCchhhcC-c-------------ccc---hhhhhhh--hccchHHHHHHHHhhhhccCce---e Confidence 222233333333333332110000 0 000 0000000 0111233344444444322233 2 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCccc------ceeeecc-------- Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDS------GSFDINT-------- 146 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~------~~~~~~~-------- 146 (447) .. ++ ...+..+.+++.. |. ...+...+..+++.+|.||+++..+...... 
..+.+.+ T Consensus 84 ~~-~~---~~~~~~l~~i~~~--N~---~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~i~ 154 (485) T protein:vir:24 84 LG-DA---DEADEELWQWWQA--NN---LDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIRVEPPTRMYAEI 154 (485) T ss_pred cC-CC---chhHHHHHHHHHh--cC---hhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceEEEeccceeEEEe Confidence 11 11 1123445666642 43 2356778888999999999987765432110 0111111 Q ss_pred -CCCccee----eec---CCce-EEEEeeec--------cccc---ceeeecc--cccccccccc-c---ccccch---- Q lcl|NC_010576. 147 -ARVGKIM----QFF---PRQV-MVRVWNDN--------TGLE---QDLLVSK--ENCIIIESPF-Y---AILNDT---- 196 (447) Q Consensus 147 -~~~~~~~----~~~---~~~~-~~~~~~~~--------~~~~---~~~~~~~--~~v~~~~~~~-~---~~~~~~---- 196 (447) ....++. .++ .+.+ .+.+|... .+.+ ....++- =.|+++++-- . .+.+.+ T Consensus 155 D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v 234 (485) T protein:vir:24 155 DPRIGRPAKAIRVAYDAEGNEIQAATLYTPNETFGWFRAEGEWVEWFSDPHGLGAVPVVPLPNRTRLSDLYGTSEITPEL 234 (485) T ss_pred eCCcCceeEEEEEEEeecCCeEEEEEEEcCCcEEEEEecCCceEeecccccCCCcccEEEeccCcccCCcCCcccchhhH Confidence 1111110 000 0011 11111110 0000 0011111 1223333210 0 011111 Q ss_pred hHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecC-CCceeeecCCCh Q lcl|NC_010576. 197 NQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLD-TQEKFVSAGMGL 275 (447) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~-~g~~~~~l~~~~ 275 (447) ......+...+.-+.....+. +.+.-++. +...... ....+.....| ....+.++.++ .+.++.++.... 
T Consensus 235 ~~liDa~~~~~s~~~~~~~~~--a~p~~~i~-G~~~~~~--~~~~~~~~~~~----~~~~~~i~~~~~~~~~~~q~~~~~ 305 (485) T protein:vir:24 235 RSMTDAAARILMLMQATAELM--GVPQRLIF-GIKPEEI--GVDPETGQTLF----DAYLARILAFEDAEGKIQQFSAAE 305 (485) T ss_pred HHHHHHHHHHHHHHHHHHHhh--cchhhhhc-cCCcccc--ccccccccchh----hhcccceeccCCCCceEEeecccc Confidence 111222222222222222222 12211221 1111110 00000001111 11234566664 566777776544 Q ss_pred hhhhHHHHHHHHHHHHHHhCCCHHHhcCCc---HHHHHHH---------------HHHHHHhHHHHHHHHHHHhhcCChh Q lcl|NC_010576. 276 QNNLLSDVRQLQQDFYNQMGITEAILNGTA---NEQQTLG---------------YYNRCVDVLLQYVTDAISRIALTKT 337 (447) Q Consensus 276 ~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~---~e~~~~~---------------f~~~ti~P~~~~ie~~l~~kLl~~~ 337 (447) -..+++.++....+++..=++|++.++++. .....+. .+...|.-+++.+....+..-.+ T Consensus 306 ~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~-- 383 (485) T protein:vir:24 306 LANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNAIFGGAWEEAMRLAYRLMKGGDVP-- 383 (485) T ss_pred hHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCc-- Confidence 445566677778888888899999997653 1222222 23333333333332221111000 Q ss_pred HhcCCceEEEecchhhhcCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCcccccccccccc----chhhcccccC Q lcl|NC_010576. 338 AVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNA--IYTPNEIRELTGKAPHPNPLANELFNRNIA----DGNQVGGINT 411 (447) Q Consensus 338 e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G--~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~----~~~~~~~~~~ 411 (447) .....+++.+..-...+..+.++.+.+++.+| +++..-+++++|+.+-+-.....+...... ......+... T Consensus 384 --~d~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l~~~~d~~~e~~~~~ee~~~~~~~~~~~~~~~~~ 461 (485) T protein:vir:24 384 --PDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLLGTMVDADP 461 (485) T ss_pred --cccceeeEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHhhCCCCHhHHHHHHHHHHHHhhhhhhHHHhhcccCC Confidence 11134555555556678889999999998866 777777788877754321111111100000 0000111000 Q ss_pred CCCCCCCCCCcCCCCCCCcccccc Q lcl|NC_010576. 
412 PGQITSDQPATASTDPLNNVSTSA 435 (447) Q Consensus 412 ~~~~~~~~~~~~~~~~~~~~~~~~ 435 (447) .....++..+..++.+..+..++| T Consensus 462 ~~~~~~~~~e~~~~~~~~~~~~~a 485 (485) T protein:vir:24 462 TVPGSPNPTPAPKPQPAIEGGDSA 485 (485) T ss_pred CCCCCCCCCCCCCCccCCCCCCCC Confidence 000000011111111122222222 No 158 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=95.43 E-value=0.0021 Score=35.28 Aligned_cols=389 Identities=7% Similarity=-0.055 Sum_probs=145.1 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ..+.++|.+.+.-..+ +......++...... ...+.. ..........-..+....-+|+..++-+-.-|+.+ . T Consensus 7 ~~~~~~l~~~~~~~~~---r~~~l~~Yy~g~~~i-~~~~~~--~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~-~ 79 (456) T protein:vir:10 7 AEWLPVLTKRIDDGMS---RVRLLARYSNGDAPL-PELTRN--TSAAWRSFQREARTNWGLMVRDSVADRIIPNGITV-G 79 (456) T ss_pred HHHHHHHHHHHHHHHH---HHHHHHHHHhcCCCc-hhcCcc--cChhhhhhhhhhhcchHHHHHHHHHhhhccCCeec-C Confidence 3444444332221111 111111111110000 000000 00000000000122345566777777666667652 1 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCccc-------ceeeeccCCCcc-e Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDS-------GSFDINTARVGK-I 152 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~-------~~~~~~~~~~~~-~ 152 (447) .. ++. ..+..+.+++. =|.. ..+...+..+++.+|.||+++..+..+... ..+++....... 
+ T Consensus 80 ~~-~d~---~~~~~~~~i~~--~N~~---d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~ 150 (456) T protein:vir:10 80 GS-ADS---DLALRARRIWR--DNRM---DSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRI 150 (456) T ss_pred CC-CCc---chHHHHHHHHH--hcCh---hhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeEEEEcCCCCcce Confidence 11 111 12234556654 2432 244566778899999999887665543221 011111101100 0 Q ss_pred ---eeec---CCceEE-EEeeec------------ccccceee-------eccccccc---------ccccccccccchh Q lcl|NC_010576. 153 ---MQFF---PRQVMV-RVWNDN------------TGLEQDLL-------VSKENCII---------IESPFYAILNDTN 197 (447) Q Consensus 153 ---~~~~---~~~~~~-~~~~~~------------~~~~~~~~-------~~~~~v~~---------~~~~~~~~~~~~~ 197 (447) ..++ .+...+ ..|... ........ ......-| +.++ .+.+.+. T Consensus 151 ~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~--~g~gd~e 228 (456) T protein:vir:10 151 RAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNP--DGMGEVE 228 (456) T ss_pred EEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecCC--CCCchhh Confidence 0000 000000 000000 00000000 00000011 1111 1112221 Q ss_pred HHHHHHH---HHHHHHHHHHHHhhcCcccceeee-C--CcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeec Q lcl|NC_010576. 198 QTLRMLE---QKIKLMNSQDNRASSGKLNGFIQF-P--YSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSA 271 (447) Q Consensus 198 ~~~~~~~---~~~~~~~~~~~~~n~~~~~gvl~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l 271 (447) .....+. ..+.-+.....+. +.+.-++.- . ....++. .... 
.....| ....++++.++.+.++.++ T Consensus 229 ~vi~liDa~~~~~s~~~~~~~~~--a~~~~~i~G~~~~~~~~d~~-g~~~-~~~~~~----~~~~~~~~~~~~~~~~~q~ 300 (456) T protein:vir:10 229 PHIDIINRINRAELQLLSTMAIQ--AFRQRALKSTEHGLPNVDEN-GNAI-DYASIF----EAAPGALWELPPGVDIWES 300 (456) T ss_pred hhHHHHHHHHHHHHHHHHHHHHh--hhHhHhhhccCccccccccc-cccc-chhhhh----hhhccccccCCCCcceEEe Confidence 1111111 1111111111111 111111110 0 0000110 0000 011112 1223567778899998887 Q ss_pred CCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCc--HHHHHHHHHHHHHhHHHHHHHHHHHhhc----------CChhHh Q lcl|NC_010576. 272 GMGLQNNLLSDVRQLQQDFYNQMGITEAILNGTA--NEQQTLGYYNRCVDVLLQYVTDAISRIA----------LTKTAV 339 (447) Q Consensus 272 ~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~--~e~~~~~f~~~ti~P~~~~ie~~l~~kL----------l~~~e~ 339 (447) ....-..+++.++.+..+|+..=++|++.+++.. .......|....+.-.+...+..|...| -... T Consensus 301 ~~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~-- 378 (456) T protein:vir:10 301 QANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES-- 378 (456) T ss_pred cccChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-- Confidence 6544444567788899999999999999997632 1222222222222222222222222111 1111 Q ss_pred cCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccccccccchhhcccccCCCCCCCCC Q lcl|NC_010576. 340 SQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQ 419 (447) Q Consensus 340 ~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 419 (447) ....+++.+...+..+..+.++++.++++.|+++..-+++++|+.+-+-.. 
.........+....+...+.+ T Consensus 379 -~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i~~----~e~er~~~e~~~~~~~~~~~~--- 450 (456) T protein:vir:10 379 -VEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQ----DDLDRAREQITLFAGNPVQRP--- 450 (456) T ss_pred -cccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHHHH----HHHHHHHHHHHHHhhhhhhcC--- Confidence 112344444566677889999999999999999988888888876531100 000001000000000000000 Q ss_pred CCcCCCCCCCcccc Q lcl|NC_010576. 420 PATASTDPLNNVST 433 (447) Q Consensus 420 ~~~~~~~~~~~~~~ 433 (447) ..+.++ T Consensus 451 --------~~~~~~ 456 (456) T protein:vir:10 451 --------QEDGSR 456 (456) T ss_pred --------CCCCCC Confidence 000000 No 159 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=95.43 E-value=0.0021 Score=35.28 Aligned_cols=389 Identities=7% Similarity=-0.055 Sum_probs=145.1 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ..+.++|.+.+.-..+ +......++...... ...+.. ..........-..+....-+|+..++-+-.-|+.+ . T Consensus 7 ~~~~~~l~~~~~~~~~---r~~~l~~Yy~g~~~i-~~~~~~--~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~-~ 79 (456) T protein:vir:10 7 AEWLPVLTKRIDDGMS---RVRLLARYSNGDAPL-PELTRN--TSAAWRSFQREARTNWGLMVRDSVADRIIPNGITV-G 79 (456) T ss_pred HHHHHHHHHHHHHHHH---HHHHHHHHHhcCCCc-hhcCcc--cChhhhhhhhhhhcchHHHHHHHHHhhhccCCeec-C Confidence 3444444332221111 111111111110000 000000 00000000000122345566777777666667652 1 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCccc-------ceeeeccCCCcc-e Q lcl|NC_010576. 
81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDS-------GSFDINTARVGK-I 152 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~-------~~~~~~~~~~~~-~ 152 (447) .. ++. ..+..+.+++. =|.. ..+...+..+++.+|.||+++..+..+... ..+++....... + T Consensus 80 ~~-~d~---~~~~~~~~i~~--~N~~---d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~ 150 (456) T protein:vir:10 80 GS-ADS---DLALRARRIWR--DNRM---DSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRI 150 (456) T ss_pred CC-CCc---chHHHHHHHHH--hcCh---hhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeEEEEcCCCCcce Confidence 11 111 12234556654 2432 244566778899999999887665543221 011111101100 0 Q ss_pred ---eeec---CCceEE-EEeeec------------ccccceee-------eccccccc---------ccccccccccchh Q lcl|NC_010576. 153 ---MQFF---PRQVMV-RVWNDN------------TGLEQDLL-------VSKENCII---------IESPFYAILNDTN 197 (447) Q Consensus 153 ---~~~~---~~~~~~-~~~~~~------------~~~~~~~~-------~~~~~v~~---------~~~~~~~~~~~~~ 197 (447) ..++ .+...+ ..|... ........ ......-| +.++ .+.+.+. T Consensus 151 ~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~--~g~gd~e 228 (456) T protein:vir:10 151 RAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNP--DGMGEVE 228 (456) T ss_pred EEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecCC--CCCchhh Confidence 0000 000000 000000 00000000 00000011 1111 1112221 Q ss_pred HHHHHHH---HHHHHHHHHHHHhhcCcccceeee-C--CcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeec Q lcl|NC_010576. 198 QTLRMLE---QKIKLMNSQDNRASSGKLNGFIQF-P--YSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSA 271 (447) Q Consensus 198 ~~~~~~~---~~~~~~~~~~~~~n~~~~~gvl~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l 271 (447) .....+. ..+.-+.....+. +.+.-++.- . ....++. .... 
.....| ....++++.++.+.++.++ T Consensus 229 ~vi~liDa~~~~~s~~~~~~~~~--a~~~~~i~G~~~~~~~~d~~-g~~~-~~~~~~----~~~~~~~~~~~~~~~~~q~ 300 (456) T protein:vir:10 229 PHIDIINRINRAELQLLSTMAIQ--AFRQRALKSTEHGLPNVDEN-GNAI-DYASIF----EAAPGALWELPPGVDIWES 300 (456) T ss_pred hhHHHHHHHHHHHHHHHHHHHHh--hhHhHhhhccCccccccccc-cccc-chhhhh----hhhccccccCCCCcceEEe Confidence 1111111 1111111111111 111111110 0 0000110 0000 011112 1223567778899998887 Q ss_pred CCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCc--HHHHHHHHHHHHHhHHHHHHHHHHHhhc----------CChhHh Q lcl|NC_010576. 272 GMGLQNNLLSDVRQLQQDFYNQMGITEAILNGTA--NEQQTLGYYNRCVDVLLQYVTDAISRIA----------LTKTAV 339 (447) Q Consensus 272 ~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~--~e~~~~~f~~~ti~P~~~~ie~~l~~kL----------l~~~e~ 339 (447) ....-..+++.++.+..+|+..=++|++.+++.. .......|....+.-.+...+..|...| -... T Consensus 301 ~~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~-- 378 (456) T protein:vir:10 301 QANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES-- 378 (456) T ss_pred cccChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-- Confidence 6544444567788899999999999999997632 1222222222222222222222222111 1111 Q ss_pred cCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccccccccchhhcccccCCCCCCCCC Q lcl|NC_010576. 340 SQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQ 419 (447) Q Consensus 340 ~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 419 (447) ....+++.+...+..+..+.++++.++++.|+++..-+++++|+.+-+-.. 
.........+....+...+.+ T Consensus 379 -~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i~~----~e~er~~~e~~~~~~~~~~~~--- 450 (456) T protein:vir:10 379 -VEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQ----DDLDRAREQITLFAGNPVQRP--- 450 (456) T ss_pred -cccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHHHH----HHHHHHHHHHHHHhhhhhhcC--- Confidence 112344444566677889999999999999999988888888876531100 000001000000000000000 Q ss_pred CCcCCCCCCCcccc Q lcl|NC_010576. 420 PATASTDPLNNVST 433 (447) Q Consensus 420 ~~~~~~~~~~~~~~ 433 (447) ..+.++ T Consensus 451 --------~~~~~~ 456 (456) T protein:vir:10 451 --------QEDGSR 456 (456) T ss_pred --------CCCCCC Confidence 000000 No 160 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=95.35 E-value=0.0022 Score=35.12 Aligned_cols=393 Identities=11% Similarity=0.024 Sum_probs=137.8 Q ss_pred CchhHhhhhhcccccCCccccccc-------cccccc-----cc-cccccccccc------cCCcccccchhhhhhHHHH Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNT-------NDFLTP-----SN-GMTSFGGYYG------RGQSNYSRSYSYNKADLIK 61 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~-------~~~~~~-----~~-~~~~~~~~~~------~~~~~~~~~~~~~~~~~v~ 61 (447) ||+|+|+++++.-..++....+.. ....++ .. |..-+-|... ..+... .+...+...-. T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~--~~~~~slnl~~ 78 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNSYGDTQ--KHELQSVNVTK 78 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccccCCCcc--ccceeecchHH Confidence 999999987765422221111110 011111 00 1000011100 000000 11111112234 Q ss_pred HHHHHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccce Q lcl|NC_010576. 
62 SVITRIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGS 141 (447) Q Consensus 62 ~cv~~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~ 141 (447) .+++.+|+-+..=|..+ . -++ +..+..|.++|.. |.. ..-.+..+...+..|..++.+..+...+... T Consensus 79 ~i~~~~A~ll~~e~~~i-~--~~d---~~~~e~l~~i~~~--n~f---~~~~~~~~e~a~a~G~~~~k~~~D~~~~~i~- 146 (505) T protein:vir:79 79 LASAKLASLIFNEQCQV-T--VSD---ETANDFLDDVFQQ--NDF---YTTFEEKLEEWIALGSGCVRPYVDSGKIKLA- 146 (505) T ss_pred HHHHHHHhhhcCCCcee-e--cCC---hHHHHHHHHHHHh--ccH---HHHHHHHHHHHhhcCCeEEEEEEeCCceEEE- Confidence 44455555554434332 1 111 1122334455531 222 2223344455555566665555443322111 Q ss_pred eeeccCCCcceeeecCCce-EEEEeee----cccccceee----ec-ccccccc-------------------------- Q lcl|NC_010576. 142 FDINTARVGKIMQFFPRQV-MVRVWND----NTGLEQDLL----VS-KENCIII-------------------------- 185 (447) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~-~~~~~~~----~~~~~~~~~----~~-~~~v~~~-------------------------- 185 (447) ..++...-+..+..+.+ .+.+... .......++ |. ...-.+| T Consensus 147 --~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~ 224 (505) T protein:vir:79 147 --WATADQVYPLQADTNQVNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQYE 224 (505) T ss_pred --EEcCCeeEEEEEcCCCeEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhccccc Confidence 11111100000001111 0000000 000000000 00 0000001 Q ss_pred -----------ccccccc------------ccchhHHHHHHHHHHHHHHHHHHH----hhcCccc-----ceeeeCCcCC Q lcl|NC_010576. 186 -----------ESPFYAI------------LNDTNQTLRMLEQKIKLMNSQDNR----ASSGKLN-----GFIQFPYSTK 233 (447) Q Consensus 186 -----------~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~----~n~~~~~-----gvl~~~~~~~ 233 (447) ..|+... ...+-+.+..+...++.+...... ...++.+ .+++...... 
T Consensus 225 ~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~ 304 (505) T protein:vir:79 225 GLEPQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQRRLIVPAEWLKTGSSYG 304 (505) T ss_pred ccCcceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcccCCCC Confidence 1111000 000112222222222222211111 0111111 1111110000 Q ss_pred hHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCCh-hhhhHHHHHHHHHHHHHHhCCCHHHhcC------CcH Q lcl|NC_010576. 234 STARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGL-QNNLLSDVRQLQQDFYNQMGITEAILNG------TAN 306 (447) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~-~~~~l~~~~~~~~~Ia~~fgVP~~~l~g------~~~ 306 (447) .+........+... ...+. .+..=+++..++.++... .++.++..+.+.++|+...|+++..++. |++ T Consensus 305 ~~~~~~~~~~fd~~-~~~y~----~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAt 379 (505) T protein:vir:79 305 GQASETHPPMFDPD-ETVYQ----AMYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQTAT 379 (505) T ss_pred cccccccccCCCcc-ceeee----eccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccchHH Confidence 00000000000000 00000 000001123455555543 3456777888889999999999998862 223 Q ss_pred HHH------------HHHHHHHHHhHHHHHHHHHHHhhcCChhH------hcCCceEEEecchhhhcCHHHHHHHHHHHH Q lcl|NC_010576. 307 EQQ------------TLGYYNRCVDVLLQYVTDAISRIALTKTA------VSQGQVLVYYRNPFKLVPVEQLATVADVLT 368 (447) Q Consensus 307 e~~------------~~~f~~~ti~P~~~~ie~~l~~kLl~~~e------~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~ 368 (447) |.. ....++.+|..++..|-.......+.... -...+.+.|++++-+..|..+.++.+.+++ T Consensus 380 ei~s~~~~l~~t~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~v 459 (505) T protein:vir:79 380 EVVTNNSQTYQTRSSYITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQESKRAADLQAV 459 (505) T ss_pred HHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHHHHHHHHHHH Confidence 321 11234555555555544433222211110 011246888888988999999999999999 Q ss_pred hCCCcCHHHHHHHhCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCC Q lcl|NC_010576. 
369 RNAIYTPNEIRELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTD 426 (447) Q Consensus 369 ~~G~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 426 (447) .+|+|+.-+++.. ++-+++.+....+ ........ .........+.+ T Consensus 460 ~~Gi~s~e~~l~~--~~~~~eeea~~el----~ri~~E~~------~~~p~~~~~gg~ 505 (505) T protein:vir:79 460 QAQVMPKKQFLMR--NYGLDEEEADEWL----AQIDAENS------TAEPEFNQFGGD 505 (505) T ss_pred HcCCCCHHHHHHh--cCCCChHHHHHHH----HHHHHhcc------ccCCCchhccCC Confidence 9999999887765 3434443332211 11100000 000001111111 No 161 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=95.28 E-value=0.0024 Score=34.96 Aligned_cols=420 Identities=9% Similarity=-0.028 Sum_probs=140.2 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) +-++.+|.+.|.-+.++ ......++....... ..+. ..+... .+.........-||+.+++.+- +.=|+ T Consensus 23 ~~~i~~L~~~~~~~~~r---~~~l~~YY~G~~~i~-~~~~---~~p~~~-~~~~~v~n~~~~iVd~~a~rl~---~~Gf~ 91 (504) T protein:vir:99 23 VDKVNGLYQQLVDRTPR---NLLRASFYDGKYAIR-QIGN---LIPPEY-LRTATVLGWSAKAVDTLARRCN---LESFV 91 (504) T ss_pred HHHHHHHHHHHHHHhHH---HHHHHHHHhccccch-hccc---cccHHH-HHHhhccCcHHHHHHHHHhhhc---cceee Confidence 55555555544332221 111111211111000 0000 000000 0000111122334455554332 22233 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccC---------CCcc Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTA---------RVGK 151 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~---------~~~~ 151 (447) .. ++ ...+..+.+++. =|... .....+..+.+.+|.||+++..+..+.....+...++ .... 
T Consensus 92 ~~-d~---~~~~~~l~~i~~--~N~ld---~~~~~~~~~a~iyG~af~~v~~~~d~~~~~~I~~~sP~~~~~iyD~~~~~ 162 (504) T protein:vir:99 92 WP-DG---DYGSIGGPDVWD--ENFFA---TKANNAMVSSLIHGPAFLINTEGGAGEPDSLIHVKSAMQATGEWNSRRNA 162 (504) T ss_pred CC-CC---ChhhHHHHHHHH--hcChh---hHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceeEEEEeCCCCc Confidence 21 11 112334555553 35543 3566778889999999998876654432111111111 1111 Q ss_pred ee---e-e-cC--CceE-EEEeee---------cccccce--eeec-ccccccccccc--c--cccc----chhHHHHHH Q lcl|NC_010576. 152 IM---Q-F-FP--RQVM-VRVWND---------NTGLEQD--LLVS-KENCIIIESPF--Y--AILN----DTNQTLRML 203 (447) Q Consensus 152 ~~---~-~-~~--~~~~-~~~~~~---------~~~~~~~--~~~~-~~~v~~~~~~~--~--~~~~----~~~~~~~~~ 203 (447) +. . + .. +... ..+|.. ..+.+.. ..++ .-.|+++.+.- . .+.. .+......+ T Consensus 163 ~~~a~~~~~~d~~g~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~gvPvV~~~n~~~~~~~~G~sei~~~v~~l~Da~ 242 (504) T protein:vir:99 163 MDSLLSITSRDAEGHPTGIALYEDGVTVTADMDDDGDWHADVRTHKLGVPVEVLPYKPREDRPLGSSRITRPVMSLQQRA 242 (504) T ss_pred eeEEEEEEEecCCCeEEEEEEEcCCcEEEEEEcCCceeeeccccCCCCcceEEecccccCccccCcccchhhHHHHHHHH Confidence 10 0 0 00 0000 111110 0000000 0000 01133332210 0 0111 111222222 Q ss_pred HHHHHHHHHHHHHhhcCcccceeeeCC-cCChHHHHHHHHHHHHHHHHH--hccCCcceeecCCCceeeecCCChhhhhH Q lcl|NC_010576. 204 EQKIKLMNSQDNRASSGKLNGFIQFPY-STKSTARAAQAARRKQEIENE--MANNKYGVATLDTQEKFVSAGMGLQNNLL 280 (447) Q Consensus 204 ~~~~~~~~~~~~~~n~~~~~gvl~~~~-~~~~~~~~~~~~~~~~~~~~~--~~~n~~~~~vl~~g~~~~~l~~~~~~~~l 280 (447) ...+..+....++.... .+.++-... ...+++ ......|+...... 
+..+..+...-....++.++....-..++ T Consensus 243 ~~~~~~~~~~~e~~a~p-~r~i~G~~~~~~~~~d-~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~q~~~~~l~~~~ 320 (504) T protein:vir:99 243 LKGCIRMDGHADVYSFP-QLILLGADAKNFRNKD-GSMKPAWQIALARVFALPDDEDEPDAARARADVKQFPASSPQPHI 320 (504) T ss_pred HHHHHHHHHHHHHhcch-hhhhccCCcccccccc-ccccchhhhhhhhhhcCCCccccccccCccceeeecCCCChHHHH Confidence 22222222233332211 111211100 000000 01111222211110 11111111111223556566544333456 Q ss_pred HHHHHHHHHHHHHhCCCHHHhc--CC--cHHHHHHHHHHHHHhHHHH----HHHHHHHh----h--cCCh--hHhcCCce Q lcl|NC_010576. 281 SDVRQLQQDFYNQMGITEAILN--GT--ANEQQTLGYYNRCVDVLLQ----YVTDAISR----I--ALTK--TAVSQGQV 344 (447) Q Consensus 281 ~~~~~~~~~Ia~~fgVP~~~l~--g~--~~e~~~~~f~~~ti~P~~~----~ie~~l~~----k--Ll~~--~e~~~g~~ 344 (447) +.++.+..+||..=++|++.|| +. ........+-...|.-.+. .+.+.+.+ . +... ........ T Consensus 321 ~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l~~~~rla~~~~~~~~~~~~~~~~ 400 (504) T protein:vir:99 321 EMLEQIAMMFSGETSIPVESLGFSNRANPTSADAYIASREDLIAEAEGATDDWSPAFRRSMIRALAIKNGLDRIPPEWKT 400 (504) T ss_pred HHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccc Confidence 7788899999999999999885 21 1111122222222222111 11122211 0 1000 00111234 Q ss_pred EEEecchhhhcCHHHHHHHHHHHHhCCCcC--H-HHHHHHhCCCCCCCcccccccccc--ccchhhccc-ccCCCCCCCC Q lcl|NC_010576. 345 LVYYRNPFKLVPVEQLATVADVLTRNAIYT--P-NEIRELTGKAPHPNPLANELFNRN--IADGNQVGG-INTPGQITSD 418 (447) Q Consensus 345 i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t--~-NE~R~~~gl~p~~g~~~~~~~~~~--~~~~~~~~~-~~~~~~~~~~ 418 (447) +++.+.+....+..++++++.++++.|... + .-+++++|+.|-+=..-....... .....+..+ ....+..... 
T Consensus 401 ~~v~w~d~~~~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg~~~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~~~~~ 480 (504) T protein:vir:99 401 IDSKFRSPLYLSKAAQADAGAKMLGAGPEWLKETEVGLELLGLTPQQAKRALAERRRASSVSIIEALNRRQQEAATAGED 480 (504) T ss_pred ceeEecCCCccCHHHHHHHHHHHHhhccccccchHHHHhhcCCCHHHHHHHHHHHHHHhhHHHHHHHhcccCCCCCCCCC Confidence 444455667778999999999999988532 2 335566777653210000000000 000011100 1111111111 Q ss_pred CCCcCCCCCCCcccccccCCccCcCCC Q lcl|NC_010576. 419 QPATASTDPLNNVSTSAIENGSLTDGG 445 (447) Q Consensus 419 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 445 (447) .+...+ .+..++. ..-.+.-+.-| T Consensus 481 ~~~~~~-e~a~~~~--~~~~~~p~~~~ 504 (504) T protein:vir:99 481 QDQGAG-EPPANEP--PAALGRPTLVG 504 (504) T ss_pred CCcCCC-CCCCCCC--CccCCCcccCC Confidence 111111 1111111 11122222333 No 162 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=95.23 E-value=0.0025 Score=34.87 Aligned_cols=396 Identities=11% Similarity=0.015 Sum_probs=144.3 Q ss_pred CchhHhhhhhcc-----cccCCcccc----ccccc---cccc---c-ccccc---cccccccCCcccccchhhhhhHHHH Q lcl|NC_010576. 1 MASSDRLLHSWN-----AFQSNQNQN----QNTND---FLTP---S-NGMTS---FGGYYGRGQSNYSRSYSYNKADLIK 61 (447) Q Consensus 1 Mg~~~~l~~~~~-----~f~~~~~~~----~~~~~---~~~~---~-~~~~~---~~~~~~~~~~~~~~~~~~~~~~~v~ 61 (447) ||+|+||++++. .|..+.-.+ ..... .+.. . .|+.. ........+... .+...+...-. 
T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~--~~~~~sln~~~ 78 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQASDGIKK--KRLKNTINMAK 78 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccCCCCcc--ccceeecchHH Confidence 999999987652 221111110 11110 0000 0 01110 000001111110 11111112233 Q ss_pred HHHHHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCccc-- Q lcl|NC_010576. 62 SVITRIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDS-- 139 (447) Q Consensus 62 ~cv~~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~-- 139 (447) .+++.+|+-+..=|..+ ....+. ..+..|..+|. -|... .-.+..+...+..|.+++.+..+...... T Consensus 79 ~i~~~~A~lv~~e~~~i-~v~~~~----~~~e~l~~il~--~n~f~---~~~~~~~e~a~a~G~~~~k~~~d~~~~~i~~ 148 (508) T protein:vir:15 79 TAARRIASVVFNEKAEI-HVKDNN----EADKFLNDVLE--DNDFK---NKFEEALEKGVALGGFAMRPYIDGNHIKIAW 148 (508) T ss_pred HHHHHHHhhhhCCCceE-EeCCch----HHHHHHHHHHH--hccHH---HHHHHHHHHHhhcCceEEEEEEeCCeeEEEE Confidence 44455555554334332 111111 12223455553 23321 22233344555556666555444332211 Q ss_pred ---ceeeeccCCCcce---eeec-------------------------CCceEEEEeeeccc--ccceee---ecc---- Q lcl|NC_010576. 140 ---GSFDINTARVGKI---MQFF-------------------------PRQVMVRVWNDNTG--LEQDLL---VSK---- 179 (447) Q Consensus 140 ---~~~~~~~~~~~~~---~~~~-------------------------~~~~~~~~~~~~~~--~~~~~~---~~~---- 179 (447) ..+.|.......+ ..+. +..+....|..... .+..+. ++. T Consensus 149 v~ad~~~P~~~d~~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l 228 (508) T protein:vir:15 149 VRADQFYPLQSNTNDISEAAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKEL 228 (508) T ss_pred EcCCeeEEEEEcCCCeEEEEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhcccccCC Confidence 1111111111111 0000 00111111111100 001010 000 Q ss_pred -----------cccccccccccc----cccchhHHHHHHHHHHHHHHHHHHH----hhcCccccee-----eeCCcCChH Q lcl|NC_010576. 
180 -----------ENCIIIESPFYA----ILNDTNQTLRMLEQKIKLMNSQDNR----ASSGKLNGFI-----QFPYSTKST 235 (447) Q Consensus 180 -----------~~v~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~----~n~~~~~gvl-----~~~~~~~~~ 235 (447) -...|++.|..+ ....+-+.+..+...++.+...... ...++.+-++ ..+....+ T Consensus 229 ~~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~d~~~~~- 307 (508) T protein:vir:15 229 APQVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEIRLGQKHIAVQPGMLRFDDEHKP- 307 (508) T ss_pred CcceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcCCCCCcc- Confidence 001223222111 0111223333333333333222211 1233333222 11111000 Q ss_pred HHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCCh-hhhhHHHHHHHHHHHHHHhCCCHHHhcC------CcHHH Q lcl|NC_010576. 236 ARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGL-QNNLLSDVRQLQQDFYNQMGITEAILNG------TANEQ 308 (447) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~-~~~~l~~~~~~~~~Ia~~fgVP~~~l~g------~~~e~ 308 (447) .+... ...+..-.+ --+.|..++.++... .+++.+..+...+.|....|+++..++. |++|. T Consensus 308 -------~~~~~-~~~~~~~~~---~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei 376 (508) T protein:vir:15 308 -------TFDTE-QNVYVGVLS---DDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVKTATEV 376 (508) T ss_pred -------ccCCC-CeeEEeccC---CCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCccccHHHH Confidence 00000 000000000 012344566666663 4566778888888999999999998852 23332 Q ss_pred HH------------HHHHHHHHhHHHHHHHHHHHhh-cCCh-------hHhcCCceEEEecchhhhcCHHHHHHHHHHHH Q lcl|NC_010576. 309 QT------------LGYYNRCVDVLLQYVTDAISRI-ALTK-------TAVSQGQVLVYYRNPFKLVPVEQLATVADVLT 368 (447) Q Consensus 309 ~~------------~~f~~~ti~P~~~~ie~~l~~k-Ll~~-------~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~ 368 (447) .. ...++.+|..++..|-.-.+.. ++.. 
.-....+.+.|++++-+..|..+.++.+.+++ T Consensus 377 ~s~~~~~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v 456 (508) T protein:vir:15 377 VSNNSMTYQTRSSYLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVL 456 (508) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHHHHHHHHHHH Confidence 11 1234444545444443332211 1110 00012346778888988899999999999999 Q ss_pred hCCCcCHHHHHHHhCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCcc Q lcl|NC_010576. 369 RNAIYTPNEIRELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNV 431 (447) Q Consensus 369 ~~G~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (447) .+|+++.-+++.. ++-+++++....+. ..........+.+ ....+..+ .++| T Consensus 457 ~aGi~s~e~~i~~--~~g~~deea~~el~----ri~~E~~~~~~~~--~~~~~~~g---~~ge 508 (508) T protein:vir:15 457 AIGALSKQTFLQR--NYGMTDEQAAEELA----KIQSEAPTDTFEG--GRSAILNG---GDGE 508 (508) T ss_pred hcCCCCHHHHHHh--cCCCChHHHHHHHH----HHHHhccccCccc--cccccCCC---CCCC Confidence 9999999888765 33343333322111 0000000000000 00111111 0111 No 163 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=95.08 E-value=0.00058 Score=38.32 Aligned_cols=428 Identities=7% Similarity=-0.076 Sum_probs=119.9 Q ss_pred CchhHhhhhhcccccCCcc--ccccccccccccccccccccccccCCccc--ccchhhhh-hHHHHHHHHHHHHhhccCc Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQN--QNQNTNDFLTPSNGMTSFGGYYGRGQSNY--SRSYSYNK-ADLIKSVITRIALDASMVD 75 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~-~~~v~~cv~~ia~~ia~lp 75 (447) |.=-.+--+.- +++-... .+.......+.......+......-.+.+ .....+.+ ++++++||++++++||.+. 
T Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~L~~~~e~~~~~~~~i~~~~~~iag~g 79 (651) T protein:vir:99 1 MTDTTGETQET-KVHVEGLGGEADLAKSPNSTQIPDHRIQSHNVGVNPPYNPDRLAAFLELNETLATGIRKKSRYEVGFG 79 (651) T ss_pred CCCccceeeee-EEEeecccccccccccccccccchhhhcccCCCCCCCCCHHHHHHHHhcChHHHHHHHHHhhhhhccC Confidence 32111000000 0000000 00000000011111111111111111211 22233444 8999999999999999999 Q ss_pred eEEEEEcC-CCc----e-ecc------ccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceee Q lcl|NC_010576. 76 FKHLKIDP-ISG----N-QTP------MPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFD 143 (447) Q Consensus 76 ~~~~r~~~-~~~----~-~~~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~ 143 (447) |.+.-... ++. . .+. ..|+....+...+|+.+|..+|++.++.+++.+||+|+.+.++..+.....+. T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiIrn~~g~pv~L~~ 159 (651) T protein:vir:99 80 FDLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEMLTDIEGRPVGLAY 159 (651) T ss_pred ceeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhhhcCccchhhhhh Confidence 98643211 111 1 001 12333334444679999999999999999999999999877776554332221 Q ss_pred eccCCCcceee--------------ecCCce------------------EEEEeeecccccceee-eccccc--cccccc Q lcl|NC_010576. 144 INTARVGKIMQ--------------FFPRQV------------------MVRVWNDNTGLEQDLL-VSKENC--IIIESP 188 (447) Q Consensus 144 ~~~~~~~~~~~--------------~~~~~~------------------~~~~~~~~~~~~~~~~-~~~~~v--~~~~~~ 188 (447) .+....++.. ..++.. ....+.........+. .....+ +|.... 
T Consensus 160 -lp~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~~~~~~d~ 238 (651) T protein:vir:99 160 -VPARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVIDESGDEPTIRYREDE 238 (651) T ss_pred -cChhheeeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccceeeeeccCCcceeEEeccCc Confidence 1111111110 001100 0001110000000000 000000 000000 Q ss_pred cccccc-chhHHHH------H-HHHHHHHHHHHHHHhhcCcccceee---eCCcCChHHHHHHHHHHHHHHHHHhccCCc Q lcl|NC_010576. 189 FYAILN-DTNQTLR------M-LEQKIKLMNSQDNRASSGKLNGFIQ---FPYSTKSTARAAQAARRKQEIENEMANNKY 257 (447) Q Consensus 189 ~~~~~~-~~~~~~~------~-~~~~~~~~~~~~~~~n~~~~~gvl~---~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~ 257 (447) ...... ....... . ....+. .....++.......|+.- +......-......+++...+.+.+ ...+ T Consensus 239 ~~~~~~~~~~~~~g~~~~~~~~~~~~~~-~~eViHir~~~~~~g~~G~spl~~a~~~i~~a~~a~~~~~~~f~NG-~~p~ 316 (651) T protein:vir:99 239 ESEREPIFVDRETGDVTTGDANGLENRP-ANELIFIPNPSILEDDYGVPDWVSAIRTISADEAAKDYNRDFFDND-TIPR 316 (651) T ss_pred ceeeeeecccceeeeEEEcCCCceeEec-ccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcc-CCCc Confidence 000000 0000000 0 000000 000000110000111110 0000001111122333333322111 1123 Q ss_pred ceeecCCCceeeecCCChhhhhHHH-HHHHHHHHHHHhCCCH---------HHhc-C--------Cc-HHHHHHHHHHHH Q lcl|NC_010576. 258 GVATLDTQEKFVSAGMGLQNNLLSD-VRQLQQDFYNQMGITE---------AILN-G--------TA-NEQQTLGYYNRC 317 (447) Q Consensus 258 ~~~vl~~g~~~~~l~~~~~~~~l~~-~~~~~~~Ia~~fgVP~---------~~l~-g--------~~-~e~~~~~f~~~t 317 (447) +++.++++. ++.+ +.+. ++......-+.+++.. ..++ | +. 
.+.|...+.+.+ T Consensus 317 gil~~~~~~------ls~e--~~~~lr~~~~~~~~nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~qfle~r~~~ 388 (651) T protein:vir:99 317 MVIKVTGGE------LSEE--SKRDLRQMLNGLREESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMDFRQFREKN 388 (651) T ss_pred eEEEecCCC------CCHH--HHHHHHHHHHHHhccCCceEEeecccccccccccCCceEEEcCcCchhhHHHHHHHHHH Confidence 444333221 1222 2222 1222222223332210 0000 1 11 133444443443 Q ss_pred HhHHHHHHHHHHHhhcCChhHhcC---Cce--EEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHH------HhCCCC Q lcl|NC_010576. 318 VDVLLQYVTDAISRIALTKTAVSQ---GQV--LVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRE------LTGKAP 386 (447) Q Consensus 318 i~P~~~~ie~~l~~kLl~~~e~~~---g~~--i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~------~~gl~p 386 (447) +.-+|. ++. +++.-... +.+ ++=....+.+..+.-.+..+...++.-+++..|... .+.... T Consensus 389 ~~eIa~----afg---VPp~~lG~~~~~~~sn~E~~~~~f~~~tL~P~~~~ie~eln~kLl~~~e~~~~~~i~~ef~~~~ 461 (651) T protein:vir:99 389 EHEIAK----VLE---VPPVKIGVTDSANRSNSDQQDKDFALEVIQPEQHTFAEWLYQIIHQQALGVTDWTIEYELRGAD 461 (651) T ss_pred HHHHHH----HhC---CCHHHhccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCceEEEEeccch Confidence 333332 222 22221110 000 000001111222222333344444444454433221 112222 Q ss_pred CCCcccc-------ccccccccchhh---------cc----c-ccCCCCCCCCCCCcCCCCCCCcccccccCCccCcCCC Q lcl|NC_010576. 387 HPNPLAN-------ELFNRNIADGNQ---------VG----G-INTPGQITSDQPATASTDPLNNVSTSAIENGSLTDGG 445 (447) Q Consensus 387 ~~g~~~~-------~~~~~~~~~~~~---------~~----~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 445 (447) +.-.+.. .++.......+. .+ + .-.+.......++..+.++..++..++....+..+.. T Consensus 462 llr~D~~~~~e~~~~~i~~G~~T~NE~R~~lglppi~~~~gd~~l~~~~~~~~g~~~~gge~~~~~~~~~~~~~~~~e~~ 541 (651) T protein:vir:99 462 QPKQEAQLAEQRVRAMRLAGVGLVDEAREELGLDPLGEPYGEMTLSEFEAEVAGDVAGGGETEAVHEPPEENKIGEREWD 541 (651) T ss_pred hhhccHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccccccccccccccccCCCCcccccCccccccccchhh Confidence 2111000 000000000000 00 0 0000000001111111111111112222222222222 Q ss_pred CC Q lcl|NC_010576. 
446 SY 447 (447) Q Consensus 446 ~~ 447 (447) +- T Consensus 542 ~~ 543 (651) T protein:vir:99 542 TV 543 (651) T ss_pred hh Confidence 22 No 164 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=94.83 E-value=0.0034 Score=34.12 Aligned_cols=394 Identities=9% Similarity=0.014 Sum_probs=141.9 Q ss_pred CchhHhhhhhccc-----ccCCcc--cc-cccccc---cccc----cccccccccc--ccCCcccccchhhhhhHHHHHH Q lcl|NC_010576. 1 MASSDRLLHSWNA-----FQSNQN--QN-QNTNDF---LTPS----NGMTSFGGYY--GRGQSNYSRSYSYNKADLIKSV 63 (447) Q Consensus 1 Mg~~~~l~~~~~~-----f~~~~~--~~-~~~~~~---~~~~----~~~~~~~~~~--~~~~~~~~~~~~~~~~~~v~~c 63 (447) ||+|+||++++.- |.+.-. .. .....+ +... .|+..-+... ....+. ...+...+...-..+ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~-~~~~~~~slnl~~~i 79 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGE-TKKRDLNHLPIARTA 79 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCC-cccCceeecchHHHH Confidence 9999999887632 211100 00 011100 0000 0111000000 000000 011111222223344 Q ss_pred HHHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccce-- Q lcl|NC_010576. 64 ITRIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGS-- 141 (447) Q Consensus 64 v~~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~-- 141 (447) ++.+|+-+..-|.. +.. ++ +..+..+..+|. -|... .-.+..+...+..|..++.+..+...+.... 
T Consensus 80 ~~~~A~lv~~e~~~-i~~--~d---~~~~~~l~~il~--~n~f~---~~~~~~~e~a~a~G~~~~k~~~d~~~~~I~~v~ 148 (500) T protein:vir:98 80 AKKIASLVFNEQAE-IKV--DD---DAANEFISETLK--NDRFN---KNFERYLESCLALGGLAMRPYVDGDKVRVAFVQ 148 (500) T ss_pred HHHHhhhhcCCcce-Eec--CC---hHHHHHHHHHHh--hccHH---HHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEc Confidence 45555555443432 111 11 112233444443 23322 2233334444445555555444433221111 Q ss_pred ---eeeccCCCcce---ee---ec----CCc------------------eEEEEeeeccc--ccceee---e-------- Q lcl|NC_010576. 142 ---FDINTARVGKI---MQ---FF----PRQ------------------VMVRVWNDNTG--LEQDLL---V-------- 177 (447) Q Consensus 142 ---~~~~~~~~~~~---~~---~~----~~~------------------~~~~~~~~~~~--~~~~~~---~-------- 177 (447) +.|.......+ .. ++ .+. +....|..... .+..+. + T Consensus 149 ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~ 228 (500) T protein:vir:98 149 APVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEA 228 (500) T ss_pred CCeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcce Confidence 11111110000 00 00 001 11111111000 000000 0 Q ss_pred -----ccccccccccccccc----ccchhHHHHHHHHHHHHHHHHHHH----hhcCcccc-----eeeeCCcCChHHHHH Q lcl|NC_010576. 178 -----SKENCIIIESPFYAI----LNDTNQTLRMLEQKIKLMNSQDNR----ASSGKLNG-----FIQFPYSTKSTARAA 239 (447) Q Consensus 178 -----~~~~v~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~----~n~~~~~g-----vl~~~~~~~~~~~~~ 239 (447) +.-...|++.|..+- ...+-+.+..+...++.+...... ...++.+- +++......... T Consensus 229 ~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~--- 305 (500) T protein:vir:98 229 KVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDGD--- 305 (500) T ss_pred EeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCcc--- Confidence 000112333221110 111123333333333333222211 11222221 121111000000 Q ss_pred HHHHHHHHHHHHhccCCccee-ec----CCCceeeecCCCh-hhhhHHHHHHHHHHHHHHhCCCHHHhcC------CcHH Q lcl|NC_010576. 
240 QAARRKQEIENEMANNKYGVA-TL----DTQEKFVSAGMGL-QNNLLSDVRQLQQDFYNQMGITEAILNG------TANE 307 (447) Q Consensus 240 ~~~~~~~~~~~~~~~n~~~~~-vl----~~g~~~~~l~~~~-~~~~l~~~~~~~~~Ia~~fgVP~~~l~g------~~~e 307 (447) ...... | +-...+. .+ +++..++.++... .++....++...++|+...|+++..++. |++| T Consensus 306 ~~~~~~--~-----d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAte 378 (500) T protein:vir:98 306 VVPRPR--F-----ESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATE 378 (500) T ss_pred ccCCcc--c-----CCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHH Confidence 000000 0 0000010 01 2233466666554 4566777888889999999999998862 2333 Q ss_pred HH------------HHHHHHHHHhHHHHHHHHHHHh-hcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcC Q lcl|NC_010576. 308 QQ------------TLGYYNRCVDVLLQYVTDAISR-IALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYT 374 (447) Q Consensus 308 ~~------------~~~f~~~ti~P~~~~ie~~l~~-kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t 374 (447) .. ....++.+|..++..|-+.... .++. ......+.+.+++++-...|..+.++...+++.+|+|+ T Consensus 379 i~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~-~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s 457 (500) T protein:vir:98 379 IVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQ-SEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGT 457 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC-CCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCC Confidence 21 1124455555555555433221 1211 11123456788888888889999999999999999999 Q ss_pred HHHHHHHhCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCcccccccC Q lcl|NC_010576. 375 PNEIRELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTSAIE 437 (447) Q Consensus 375 ~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (447) .-+++.+. ...++......+. .. +.... +.... 
-+++.+...+ T Consensus 458 ~~~~i~~~--~g~~eeea~~~l~----~i-~~E~~--~~~~~-----------~~~~~~~~g~ 500 (500) T protein:vir:98 458 REMAIQKV--LNVTEEKAQEIAA----EI-NTGIV--DEINQ-----------QRTDTHLYGE 500 (500) T ss_pred HHHHHHhc--CCCCHHHHHHHHH----HH-HHhcc--ccCCC-----------CCccccccCC Confidence 99987653 2223322222111 00 00000 00000 0001111111 No 165 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=94.83 E-value=0.0034 Score=34.12 Aligned_cols=394 Identities=9% Similarity=0.014 Sum_probs=141.9 Q ss_pred CchhHhhhhhccc-----ccCCcc--cc-cccccc---cccc----cccccccccc--ccCCcccccchhhhhhHHHHHH Q lcl|NC_010576. 1 MASSDRLLHSWNA-----FQSNQN--QN-QNTNDF---LTPS----NGMTSFGGYY--GRGQSNYSRSYSYNKADLIKSV 63 (447) Q Consensus 1 Mg~~~~l~~~~~~-----f~~~~~--~~-~~~~~~---~~~~----~~~~~~~~~~--~~~~~~~~~~~~~~~~~~v~~c 63 (447) ||+|+||++++.- |.+.-. .. .....+ +... .|+..-+... ....+. ...+...+...-..+ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~-~~~~~~~slnl~~~i 79 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGE-TKKRDLNHLPIARTA 79 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCC-cccCceeecchHHHH Confidence 9999999887632 211100 00 011100 0000 0111000000 000000 011111222223344 Q ss_pred HHHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccce-- Q lcl|NC_010576. 64 ITRIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGS-- 141 (447) Q Consensus 64 v~~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~-- 141 (447) ++.+|+-+..-|.. +.. ++ +..+..+..+|. -|... .-.+..+...+..|..++.+..+...+.... 
T Consensus 80 ~~~~A~lv~~e~~~-i~~--~d---~~~~~~l~~il~--~n~f~---~~~~~~~e~a~a~G~~~~k~~~d~~~~~I~~v~ 148 (500) T protein:vir:30 80 AKKIASLVFNEQAE-IKV--DD---DAANEFISETLK--NDRFN---KNFERYLESCLALGGLAMRPYVDGDKVRVAFVQ 148 (500) T ss_pred HHHHhhhhcCCcce-Eec--CC---hHHHHHHHHHHh--hccHH---HHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEc Confidence 45555555443432 111 11 112233444443 23322 2233334444445555555444433221111 Q ss_pred ---eeeccCCCcce---ee---ec----CCc------------------eEEEEeeeccc--ccceee---e-------- Q lcl|NC_010576. 142 ---FDINTARVGKI---MQ---FF----PRQ------------------VMVRVWNDNTG--LEQDLL---V-------- 177 (447) Q Consensus 142 ---~~~~~~~~~~~---~~---~~----~~~------------------~~~~~~~~~~~--~~~~~~---~-------- 177 (447) +.|.......+ .. ++ .+. +....|..... .+..+. + T Consensus 149 ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~ 228 (500) T protein:vir:30 149 APVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEA 228 (500) T ss_pred CCeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcce Confidence 11111110000 00 00 001 11111111000 000000 0 Q ss_pred -----ccccccccccccccc----ccchhHHHHHHHHHHHHHHHHHHH----hhcCcccc-----eeeeCCcCChHHHHH Q lcl|NC_010576. 178 -----SKENCIIIESPFYAI----LNDTNQTLRMLEQKIKLMNSQDNR----ASSGKLNG-----FIQFPYSTKSTARAA 239 (447) Q Consensus 178 -----~~~~v~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~----~n~~~~~g-----vl~~~~~~~~~~~~~ 239 (447) +.-...|++.|..+- ...+-+.+..+...++.+...... ...++.+- +++......... T Consensus 229 ~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~--- 305 (500) T protein:vir:30 229 KVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDGD--- 305 (500) T ss_pred EeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCcc--- Confidence 000112333221110 111123333333333333222211 11222221 121111000000 Q ss_pred HHHHHHHHHHHHhccCCccee-ec----CCCceeeecCCCh-hhhhHHHHHHHHHHHHHHhCCCHHHhcC------CcHH Q lcl|NC_010576. 
240 QAARRKQEIENEMANNKYGVA-TL----DTQEKFVSAGMGL-QNNLLSDVRQLQQDFYNQMGITEAILNG------TANE 307 (447) Q Consensus 240 ~~~~~~~~~~~~~~~n~~~~~-vl----~~g~~~~~l~~~~-~~~~l~~~~~~~~~Ia~~fgVP~~~l~g------~~~e 307 (447) ...... | +-...+. .+ +++..++.++... .++....++...++|+...|+++..++. |++| T Consensus 306 ~~~~~~--~-----d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAte 378 (500) T protein:vir:30 306 VVPRPR--F-----ESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATE 378 (500) T ss_pred ccCCcc--c-----CCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHH Confidence 000000 0 0000010 01 2233466666554 4566777888889999999999998862 2333 Q ss_pred HH------------HHHHHHHHHhHHHHHHHHHHHh-hcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcC Q lcl|NC_010576. 308 QQ------------TLGYYNRCVDVLLQYVTDAISR-IALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYT 374 (447) Q Consensus 308 ~~------------~~~f~~~ti~P~~~~ie~~l~~-kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t 374 (447) .. ....++.+|..++..|-+.... .++. ......+.+.+++++-...|..+.++...+++.+|+|+ T Consensus 379 i~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~-~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s 457 (500) T protein:vir:30 379 IVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQ-SEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGT 457 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC-CCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCC Confidence 21 1124455555555555433221 1211 11123456788888888889999999999999999999 Q ss_pred HHHHHHHhCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCcccccccC Q lcl|NC_010576. 375 PNEIRELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTSAIE 437 (447) Q Consensus 375 ~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (447) .-+++.+. ...++......+. .. +.... +.... 
-+++.+...+ T Consensus 458 ~~~~i~~~--~g~~eeea~~~l~----~i-~~E~~--~~~~~-----------~~~~~~~~g~ 500 (500) T protein:vir:30 458 REMAIQKV--LNVTEEKAQEIAA----EI-NTGIV--DEINQ-----------QRTDTHLYGE 500 (500) T ss_pred HHHHHHhc--CCCCHHHHHHHHH----HH-HHhcc--ccCCC-----------CCccccccCC Confidence 99987653 2223322222111 00 00000 00000 0001111111 No 166 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=94.82 E-value=0.0034 Score=34.11 Aligned_cols=406 Identities=10% Similarity=0.036 Sum_probs=142.7 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) +.+.++|.+.|.-..++ ......++....-.. ..+. ....... ..........-+|+.+++.+---.|.+-. T Consensus 10 ~~~i~~L~~~~~~~~~r---~~~~~~Yy~g~~~i~-~~~~--~~~~~~~--~~~~~~n~~~~ivd~~a~~l~~~Gf~~~~ 81 (488) T protein:vir:23 10 EKLRDQLLDAFENKQNE---LKSSKAYYDAERRPD-AIGL--AVPLDMR--KYLAHVGYPRTYVDAIAERQELEGFRIPS 81 (488) T ss_pred HHHHHHHHHHHHHHHHH---HHHHHHHHhcccchh-hcCc--ccchhhh--hhhhhcchHHHHHHHHHHhhhccceeccC Confidence 44555554444322211 111111111000000 0000 0000000 00011223344555555433222221110 Q ss_pred EcC---CCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcc------cceeeeccC---- Q lcl|NC_010576. 81 IDP---ISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPD------SGSFDINTA---- 147 (447) Q Consensus 81 ~~~---~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~------~~~~~~~~~---- 147 (447) ... ...........+.+++. -|. .......+...++.+|.||+++..+..... 
...+.+.++ T Consensus 82 ~~~~~~~~~~d~~~~~~l~~i~~--~N~---~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~ 156 (488) T protein:vir:23 82 ANGEEPESGGENDPASELWDWWQ--ANN---LDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVPLIRVEPPTALY 156 (488) T ss_pred CcccccccccchhHHHHHHHHHH--hcC---hhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcceEEEeccceeE Confidence 000 00001112234555553 233 235566678888999999988655431100 001111111 Q ss_pred -----CCccee----eecC---Cce-EEEEeee--------cccccce---eeec--cccccccccccc----ccccch- Q lcl|NC_010576. 148 -----RVGKIM----QFFP---RQV-MVRVWND--------NTGLEQD---LLVS--KENCIIIESPFY----AILNDT- 196 (447) Q Consensus 148 -----~~~~~~----~~~~---~~~-~~~~~~~--------~~~~~~~---~~~~--~~~v~~~~~~~~----~~~~~~- 196 (447) ....+. .++. +.+ .+.+|.. ..+.+.. ..+. .=.|+++.+... .+.+.+ T Consensus 157 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~ 236 (488) T protein:vir:23 157 AEVDPRTRKVLYAIRAIYGADGNEIVSATLYLPDTTMTWLRAEGEWEAPTSTPHGLEMVPVIPISNRTRLSDLYGTSEIS 236 (488) T ss_pred EEEecCCCceEEEEEEEEecCCCcEEEEEEEecCcEEEEEecCCceEeccccccCCCCcceEEeccccccCCcCCccchh Confidence 101110 0000 001 0111100 0000000 0000 012233332110 111111 Q ss_pred ---hHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCC--ceeeec Q lcl|NC_010576. 197 ---NQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQ--EKFVSA 271 (447) Q Consensus 197 ---~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g--~~~~~l 271 (447) ......+...+.-+....++.. .+.-++. +........+.. 
.-...| ....++++.++.| .+|.++ T Consensus 237 ~~v~~l~Da~~~~~s~~~~~~~~~a--~p~~~i~-G~~~~~~~~~~~--~~~~~~----~~~~~~v~~~~~g~~~~~~q~ 307 (488) T protein:vir:23 237 PELRSVTDAAAQILMNMQGTANLMA--IPQRLIF-GAKPEELGINAE--TGQRMF----DAYMARILAFEGGEGAHAEQF 307 (488) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHhh--hHHHHHh-CCCccccccccc--ccchhh----hhhhhhhccCCCCCCceeEec Confidence 1111222222222222222221 1111111 111111000000 000111 1112456677766 456666 Q ss_pred CCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCc---HHHHHH---------------HHHHHHHhHHHHHHHHHHHhhc Q lcl|NC_010576. 272 GMGLQNNLLSDVRQLQQDFYNQMGITEAILNGTA---NEQQTL---------------GYYNRCVDVLLQYVTDAISRIA 333 (447) Q Consensus 272 ~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~---~e~~~~---------------~f~~~ti~P~~~~ie~~l~~kL 333 (447) ....-..+++.++....+|+..=++|++.++++. .....+ ..+...|.-+++.+...++..- T Consensus 308 ~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~ 387 (488) T protein:vir:23 308 SAAELRNFVDALDALDRKAASYSGLPPQYLSSSSDNPASAEAIKAAESRLVKKVERKNKIFGGAWEQAMRLAYKMVKGGD 387 (488) T ss_pred CCCChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 5444445667778888899999999999997642 111111 2233344444443332222111 Q ss_pred CChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCcccccccccccc----chhhcc Q lcl|NC_010576. 334 LTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNA--IYTPNEIRELTGKAPHPNPLANELFNRNIA----DGNQVG 407 (447) Q Consensus 334 l~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G--~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~----~~~~~~ 407 (447) .+ .....+++.+..-...+..+.++++.+++++| +++..-+++++|+-+-+-..........-. ...+.. 
T Consensus 388 ~~----~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 463 (488) T protein:vir:23 388 IP----TEYYRMETVWRDPSTPTYAAKADAAAKLFANGAGLIPRERGWVDMGYTIVEREQMRQWLEQDQKQGLGLIGSLY 463 (488) T ss_pred cc----hhhccceEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHHhCCCCchHHHHHHHHHHHHHHHHHHHHHHHh Confidence 11 01123444444555668889999999999876 788888888887754321111111000000 000111 Q ss_pred cccCCCCCCCCCCCcCCCCCCCcccccc Q lcl|NC_010576. 408 GINTPGQITSDQPATASTDPLNNVSTSA 435 (447) Q Consensus 408 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (447) ....+.+... +... .+..+.|-++| T Consensus 464 ~~~~~~~~~~--~~~~-~~~~~~e~~~a 488 (488) T protein:vir:23 464 GASTPEGKPG--EAPV-GEPPAPEPDAA 488 (488) T ss_pred ccCCCcccCC--CCCC-CCCCCCCCCCC Confidence 1111111111 1111 12222222223 No 167 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=94.69 E-value=0.0037 Score=33.88 Aligned_cols=407 Identities=10% Similarity=-0.005 Sum_probs=134.6 Q ss_pred CchhHhhhhhcccccCCcccc---cccccc---ccccc--c-----ccccccccccCCcccccchhhhhhHHHHHHHHHH Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQN---QNTNDF---LTPSN--G-----MTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRI 67 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~---~~~~~~---~~~~~--~-----~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~i 67 (447) ||++++|..+.+.--+.++.. .....+ +.... | +..+|+..... ....+ -++.+.-..+++.+ T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~---~~~~~-~~~~~l~~~i~~~~ 76 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVP---TVHDK-LMNSGTGNEIVVVA 76 (518) T ss_pred CcchhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCC---ccccc-cccCChHHHHHHHH Confidence 999999865433222221110 000000 00000 0 00122211111 11111 12333445566777 Q ss_pred HHhhccCceEEEEEcCCCc-eeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeecc Q lcl|NC_010576. 
68 ALDASMVDFKHLKIDPISG-NQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINT 146 (447) Q Consensus 68 a~~ia~lp~~~~r~~~~~~-~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~ 146 (447) |+-+..-|.. +.....+. ..+..+..+.++|.. |... ..|.+.+.. .+-.|..++.+..+...+.. .... T Consensus 77 A~ll~~e~~~-i~v~~~~~~d~e~~~~~l~~il~~--n~f~--~~~~~~~e~-a~a~G~~~~k~~~d~~~~~i---~~v~ 147 (518) T protein:vir:78 77 AEYISGKPLS-IDVTGVNGSKDENLTKQLKEALRI--DNFD--SKSVKIVEL-AGGSGVSAVKINILNGRPSI---SVHS 147 (518) T ss_pred HHhhcCCCce-EEecCccccCcHHHHHHHHHHHHh--ccHH--HHHHHHHHH-hhccCceEEEEEEECCeeEE---EEEc Confidence 7777554543 22221111 111112234444431 1111 233333334 44445455444333322111 1111 Q ss_pred CCCcceeeecCCceE-EEEeee-cccccc-eee----eccccc---------cccccccc--ccccchhHHHHHHHHHHH Q lcl|NC_010576. 147 ARVGKIMQFFPRQVM-VRVWND-NTGLEQ-DLL----VSKENC---------IIIESPFY--AILNDTNQTLRMLEQKIK 208 (447) Q Consensus 147 ~~~~~~~~~~~~~~~-~~~~~~-~~~~~~-~~~----~~~~~v---------~~~~~~~~--~~~~~~~~~~~~~~~~~~ 208 (447) ....-+. +..+.+. +.++.. ..+... ..+ |..... .++++-++ ............+...+. T Consensus 148 ad~~~P~-~~~g~~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~ 226 (518) T protein:vir:78 148 SSQFWID-FKNNEPFRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGDKTTPISAERLPEQIT 226 (518) T ss_pred CCeeEEE-eecCcEEEEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEeeecCcccccccccccccccc Confidence 1110000 1111111 111110 000000 000 000000 00111000 000000000000000000 Q ss_pred HHH---H---HHHHhhcCccccee--eeCC----------cCChH-HHHH---HHHHHHHHHHHHhccCCcceee----- Q lcl|NC_010576. 209 LMN---S---QDNRASSGKLNGFI--QFPY----------STKST-ARAA---QAARRKQEIENEMANNKYGVAT----- 261 (447) Q Consensus 209 ~~~---~---~~~~~n~~~~~gvl--~~~~----------~~~~~-~~~~---~~~~~~~~~~~~~~~n~~~~~v----- 261 (447) ... . ......+..+..+. .... ..+.- ..+. 
.......+|...+.....++.| T Consensus 227 ~~~~~~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l 306 (518) T protein:vir:78 227 SYLHTNDIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEKTKTKIAASERMF 306 (518) T ss_pred cccccccCccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHhCCceeeechhHh Confidence 000 0 00001111111111 1000 00000 0011 1111111121111111111211 Q ss_pred --------------cC--------------CCce----eeecCCCh-hhhhHHHHHHHHHHHHHHhCCCHHHhcC----- Q lcl|NC_010576. 262 --------------LD--------------TQEK----FVSAGMGL-QNNLLSDVRQLQQDFYNQMGITEAILNG----- 303 (447) Q Consensus 262 --------------l~--------------~g~~----~~~l~~~~-~~~~l~~~~~~~~~Ia~~fgVP~~~l~g----- 303 (447) ++ .|.+ ++.++... .++.+...+...++|....|+++..++. T Consensus 307 ~~~~~~~~~~~~~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~~~ 386 (518) T protein:vir:78 307 RKKVNKSTDKEEWSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGNREV 386 (518) T ss_pred ccCCCCCCCccccccCCCCceEEEecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCcccccc Confidence 11 1111 33333332 2345566777888999999999998852 Q ss_pred CcHHHH----H--------HHHHHHHHhHHHHHHHHHHHhhcCCh--hHhcCCceEEEecchhhhcCHHHHHHHHHHHHh Q lcl|NC_010576. 304 TANEQQ----T--------LGYYNRCVDVLLQYVTDAISRIALTK--TAVSQGQVLVYYRNPFKLVPVEQLATVADVLTR 369 (447) Q Consensus 304 ~~~e~~----~--------~~f~~~ti~P~~~~ie~~l~~kLl~~--~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~ 369 (447) |++|-. . ...++.+|.-++..+.+.+....... ......+.+.|++++-+..|..+.++.+.+++. 
T Consensus 387 TATei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~~~~~~v~ 466 (518) T protein:vir:78 387 KATEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSSTLNNMNS 466 (518) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCCCCCCHHHHHHHHHHHHh Confidence 223211 1 12344444444444433332211100 001123468888999999999999999999999 Q ss_pred CCCcCHHHHHHHhCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCcccccccCCccCcCCC Q lcl|NC_010576. 370 NAIYTPNEIRELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTSAIENGSLTDGG 445 (447) Q Consensus 370 ~G~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 445 (447) +|+|++-++-+++... .++.+...- +.......... ..+ ++..-.|-.++|| T Consensus 467 aGimS~e~~i~~~~~~-~~deea~~e----~~ri~~E~~~~----~~~---------------~p~~~~g~~~~~g 518 (518) T protein:vir:78 467 ALAMSVEEKVKLIHPK-WEDEEIQAE----VKRIYLENAIG----EVP---------------DPEAIGGMETKGG 518 (518) T ss_pred cCCCCHHHHHHHhCCC-CCHHHHHHH----HHHHHHHhccc----CCC---------------CCccccCCCCCCC Confidence 9999998866554322 233222211 11111100000 000 0011112222333 No 168 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=94.54 E-value=0.0041 Score=33.66 Aligned_cols=374 Identities=11% Similarity=0.024 Sum_probs=130.7 Q ss_pred CchhHhhhhhcc-----cccCCcccc----cccccc---ccccc-cccccccccccCCcccc------cchhhhhhHHHH Q lcl|NC_010576. 1 MASSDRLLHSWN-----AFQSNQNQN----QNTNDF---LTPSN-GMTSFGGYYGRGQSNYS------RSYSYNKADLIK 61 (447) Q Consensus 1 Mg~~~~l~~~~~-----~f~~~~~~~----~~~~~~---~~~~~-~~~~~~~~~~~~~~~~~------~~~~~~~~~~v~ 61 (447) |++|+||+.+|. ++. +...+ .....+ ...+. |..-+-|.... ..+. 
..+..++...- T Consensus 1 m~~~~~ik~~~~~~~~~~~~-~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~--~~~~~~~~~~~~~~~~sl~~~- 76 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSG-QTLKSINDHEKINIDPNELARIERNLRQYEGDYPQ--VEYINSQGKIQERDYMTLNLR- 76 (517) T ss_pred CchHHHHHHHHHHHHHHhcc-cchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcc--cccccccccccccceeecCcH- Confidence 999999988774 221 11110 011111 11111 11001111110 0000 00111111112 Q ss_pred HHHHHHHHhhccC----ceEEEEEcCCCcee-----ccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEee Q lcl|NC_010576. 62 SVITRIALDASMV----DFKHLKIDPISGNQ-----TPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPID 132 (447) Q Consensus 62 ~cv~~ia~~ia~l----p~~~~r~~~~~~~~-----~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~ 132 (447) ..||+.+|.| |..+.-.+.++... ...+..|..+|. =|... ..|.+.+... +..|.+++.+.. T Consensus 77 ---~~i~~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~--~n~f~--~~~~~~~e~a-~a~G~~a~k~~~ 148 (517) T protein:vir:98 77 ---KLSADVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQ--HNKFI--KNLSDYLEPT-FALGGLTVRPYV 148 (517) T ss_pred ---HHHHHHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHH--hccHH--HHHHHHHHHH-hhhCCEEEEEEE Confidence 2455555554 22211111111100 111223445543 12211 1333344444 444555555554 Q ss_pred ccCCcccce-----eeeccCCCcceee----e--c----CCceEEEEeeecccccceeeecccccccccc---------- Q lcl|NC_010576. 133 TTVDPDSGS-----FDINTARVGKIMQ----F--F----PRQVMVRVWNDNTGLEQDLLVSKENCIIIES---------- 187 (447) Q Consensus 133 ~~~~~~~~~-----~~~~~~~~~~~~~----~--~----~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~---------- 187 (447) +........ +.|.......+.. + . .+...+............ 
..+.-.+|++ T Consensus 149 d~~~~~I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~---~~~~~y~I~n~ly~s~~~~~ 225 (517) T protein:vir:98 149 DNGEIEFSWALANAFYPLRSNSNGISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTE---EGESLYVITNELYKSDNEGE 225 (517) T ss_pred eCCeeEEEEEcCCeeEEEEecCCCeEEEEEEEEEEEeecCCceEEEEEEEEecCcee---ccCCcEEEEEEEEecCCCcc Confidence 443322111 1111111111100 0 0 000001000000000000 0000011111 Q ss_pred -------------------------cccccc-----c-------chhHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCC Q lcl|NC_010576. 188 -------------------------PFYAIL-----N-------DTNQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPY 230 (447) Q Consensus 188 -------------------------~~~~~~-----~-------~~~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~ 230 (447) |+.... + .+-+.+..+...++.+. T Consensus 226 lG~~v~L~~~~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD------------------- 286 (517) T protein:vir:98 226 IGKRIPLEELYEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKIN------------------- 286 (517) T ss_pred ccccccccccccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHH------------------- Confidence 111000 0 01111111111111111 Q ss_pred cCChHHHHHHHHHHHHHHHHHhccCCcceee-----------------------------c---CCCceeeecCCCh-hh Q lcl|NC_010576. 231 STKSTARAAQAARRKQEIENEMANNKYGVAT-----------------------------L---DTQEKFVSAGMGL-QN 277 (447) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~v-----------------------------l---~~g~~~~~l~~~~-~~ 277 (447) +...+|...+ .....++.| + +++-.++.++... .+ T Consensus 287 --------~~~s~~~~e~----~~g~~~i~vp~~~l~~~~~~~g~~~~~~~d~~~~~y~~~~~~~~~~~i~~~~~~iR~e 354 (517) T protein:vir:98 287 --------DTYDQFWWEI----KMGQRTVFVSDVMLRTVPDESGMPPPQVFDPDVNVYKSIRMGTDEEFVKDVTHDIRTE 354 (517) T ss_pred --------HHHHHHHHHH----HhCCcceecChhhhccccCCCCcccCCCCCcccceeeeccCCCCCCceeeeccccchH Confidence 1111111111 110111111 0 0111233333333 23 Q ss_pred hhHHHHHHHHHHHHHHhCCCHHHhcC------CcHHHHH------------HHHHHHHHhHHHHHHHHHHHh-hcCChhH Q lcl|NC_010576. 
278 NLLSDVRQLQQDFYNQMGITEAILNG------TANEQQT------------LGYYNRCVDVLLQYVTDAISR-IALTKTA 338 (447) Q Consensus 278 ~~l~~~~~~~~~Ia~~fgVP~~~l~g------~~~e~~~------------~~f~~~ti~P~~~~ie~~l~~-kLl~~~e 338 (447) +.+...+...++|+...|+++..++- |++|... ...+..+|.-++..+-..... .++.. . T Consensus 355 ~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~~kTATEi~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~-~ 433 (517) T protein:vir:98 355 QYKEAINQALRTLEMELKLSVGTFSFDGRSMKTATEIVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGG-E 433 (517) T ss_pred HHHHHHHHHHHHHHHHhCCCcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC-C Confidence 55677788889999999999999862 2333211 112333333333333221111 12211 1 Q ss_pred hcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh-CCCCCCCccccccccccccchhhcccccCCCCCCC Q lcl|NC_010576. 339 VSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELT-GKAPHPNPLANELFNRNIADGNQVGGINTPGQITS 417 (447) Q Consensus 339 ~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~-gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 417 (447) ....+.+.+++++-+..|..+.++.+.+++.+|+|++-+++.+. |+. +......++. .........+... T Consensus 434 ~~~~~~v~v~f~D~i~~D~~~~~~~~~~~v~aG~ms~~~~i~~~~g~~---eeeA~~e~~~----i~~E~~~~~~~~~-- 504 (517) T protein:vir:98 434 IPSAEHIGVDFDDGVFQDRSALLRFYGQAKTFGFIPTVEAIQRIFKVP---KKTAEQWLEE----IRKDQIELDPVTI-- 504 (517) T ss_pred CCCCcceEEEcCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHHhCCCC---hHHHHHHHHH----HHHhccccCCCCc-- Confidence 12345688899899999999999999999999999999987654 543 3323322211 0000000000000 Q ss_pred CCCCcCCCCCCCcc Q lcl|NC_010576. 
418 DQPATASTDPLNNV 431 (447) Q Consensus 418 ~~~~~~~~~~~~~~ 431 (447) ..+..+..+.+.| T Consensus 505 -~~~~~~~~~gd~e 517 (517) T protein:vir:98 505 -SQRAQKRMFGDEE 517 (517) T ss_pred -cccccCCCCCCCC Confidence 0011111111222 No 169 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=92.93 E-value=0.0094 Score=31.69 Aligned_cols=395 Identities=9% Similarity=0.028 Sum_probs=144.1 Q ss_pred Cc----hhHhh-----------hhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHH Q lcl|NC_010576. 1 MA----SSDRL-----------LHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVIT 65 (447) Q Consensus 1 Mg----~~~~l-----------~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~ 65 (447) |+ ++.+| .+....+..+..... .+... .......+ .......-+|+ T Consensus 1 ~~t~~d~i~~L~~~~~~~~~r~~~~~~Yy~G~~~i~~---------------~~~~~--~~~~~~~~--~~~n~~~~ivd 61 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKT---------------IGIGA--PPELAYLD--VQPGWVATYLR 61 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchh---------------ccccc--chhhhhhh--hhcchHHHHHH Confidence 33 12222 222222322211000 00000 00000000 01112334455 Q ss_pred HHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCc---c-cce Q lcl|NC_010576. 66 RIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDP---D-SGS 141 (447) Q Consensus 66 ~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~---~-~~~ 141 (447) ..++.+--.. |+... + ...+..+..+++. |.. ......+....+.+|.||+++.+..... . ... 
T Consensus 62 ~~~~~l~~~g---~~~~~-d---~~~~~~l~~i~~~--N~~---~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~~ 129 (480) T protein:vir:78 62 TLSDRLDIEG---FRISE-D---SEGLEELWNWWQA--NDL---DEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPL 129 (480) T ss_pred HHHhhhccCc---eecCC-C---chhHHHHHHHHHh--cCH---HHHHHHHHHHHhhcCceEEEeecCccccCCCCCeeE Confidence 5554442122 32221 1 1234556666642 432 3456677889999999998765432110 0 001 Q ss_pred eeecc-CCC---------ccee---eec---CC--ce-EEEEeee--------cccc-------cceeeec--ccccccc Q lcl|NC_010576. 142 FDINT-ARV---------GKIM---QFF---PR--QV-MVRVWND--------NTGL-------EQDLLVS--KENCIII 185 (447) Q Consensus 142 ~~~~~-~~~---------~~~~---~~~---~~--~~-~~~~~~~--------~~~~-------~~~~~~~--~~~v~~~ 185 (447) +.+.+ ..+ ..+. .++ .+ .. .+.+|.. ..+. .....+. .=.++|+ T Consensus 130 i~~~~p~~~~~i~D~~~~~~~~~~i~~~~~~d~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f 209 (480) T protein:vir:78 130 IRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPL 209 (480) T ss_pred EEEEcccceEEEEcCCCccceEEEEEEEEeecCCcceEEEEEEeCCeEEEEEecCCCcccccccccccccCCCCcceEEe Confidence 11111 110 0000 000 00 00 0111100 0000 0000000 0122333 Q ss_pred ccccc----ccccch----hHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCc Q lcl|NC_010576. 186 ESPFY----AILNDT----NQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKY 257 (447) Q Consensus 186 ~~~~~----~~~~~~----~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~ 257 (447) .+... .+.+.+ ......+...+.-+.....+.. .+.-++. +........+... ..|. 
-..+ T Consensus 210 ~n~~~~~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a--~p~~~i~-G~~~~~~~~~~~~----~~~~----~~~~ 278 (480) T protein:vir:78 210 TNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILG--TPLRVIS-GVTTDELTNDGEN----TTLD----IYYG 278 (480) T ss_pred ecccccCCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhc--chhhhhh-CCCcccccccccc----chhh----hhhh Confidence 32110 011111 1112222222222222222221 2211221 1111111000000 0111 1123 Q ss_pred ceeecC-CCceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCc---HHHHHHHH---------------HHHHH Q lcl|NC_010576. 258 GVATLD-TQEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNGTA---NEQQTLGY---------------YNRCV 318 (447) Q Consensus 258 ~~~vl~-~g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~---~e~~~~~f---------------~~~ti 318 (447) .++.++ .+.+|.++....-..+++..+....+|+..=++|++.+++.. +......| +...| T Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l 358 (480) T protein:vir:78 279 RILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAW 358 (480) T ss_pred hhccCCCCCceEEecCccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 444544 345676665544445667778888899999999999997642 11112222 22233 Q ss_pred hHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCccccccc Q lcl|NC_010576. 319 DVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNA--IYTPNEIRELTGKAPHPNPLANELF 396 (447) Q Consensus 319 ~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G--~~t~NE~R~~~gl~p~~g~~~~~~~ 396 (447) .-.++.+.... ..........+++.+..-...+..+.++.+.+++.+| +++..-+++++|+.+-+-..-.... 
T Consensus 359 ~~~~rl~~~~~-----~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~e~~~~~ 433 (480) T protein:vir:78 359 ERAMRIAMQIM-----GREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWD 433 (480) T ss_pred HHHHHHHHHHc-----CCCccccceeeeEEecCCCCCCHHHHHHHHHHHHHhcccCCCHHHHHhcCCCCHhHHHHHHHHH Confidence 33333222111 1111112234566655666678888999999998876 6676667888887653211111111 Q ss_pred c-ccccchhhcccccCCCCCCCCCCCcCCCCCCCcccccccCCccCcCCCC Q lcl|NC_010576. 397 N-RNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTSAIENGSLTDGGS 446 (447) Q Consensus 397 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 446 (447) . .......+.... ..++.....+++.++.+. .+.+..++...+++. T Consensus 434 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 434 KQETEDMIDTLYST-TKAQADATPKPTVTETKT---ETQTSPSGFNRTKTR 480 (480) T ss_pred HHHHHHHHHHhhcc-ccCCCccccCCCCCCCCC---ccCCCcccCCCcCCC Confidence 0 000111111111 111111222223332222 222334444444455 No 170 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=92.48 E-value=0.011 Score=31.27 Aligned_cols=343 Identities=9% Similarity=-0.032 Sum_probs=134.5 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccc-hhhhh--hHHHHHHHHHHHHhhccCceE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRS-YSYNK--ADLIKSVITRIALDASMVDFK 77 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~--~~~v~~cv~~ia~~ia~lp~~ 77 (447) .-.+++|...|.-.+++ ......++........ .+ ..+... +...+ .....-+|+.+++.+ -+. 
T Consensus 3 ~~~i~~L~~~~~~~~~r---~~~~~~yY~g~~~~~~-~~------~~~p~~~~~~~~~v~nw~~~iVds~a~rl---~~~ 69 (409) T protein:vir:94 3 EKGIGYLRFKLSVHKRR---AEMRYDQYAMKYVDRF-KG------ITIPQALSQQYRSILGWCAKGVDSLADRL---VFR 69 (409) T ss_pred HHHHHHHHHHHHHHhHH---HHHHHHHhcccCchhh-cC------hhhhHHHHHHHhhhcchhHHHHHHhHhhc---ccC Confidence 34455565544332222 2222222211110000 00 000000 00001 112233344333322 222 Q ss_pred EEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccC---------C Q lcl|NC_010576. 78 HLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTA---------R 148 (447) Q Consensus 78 ~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~---------~ 148 (447) =|+ ..+..+.+++. =|... .....+....+.+|.||+++..+..+.+. +.+.++ . T Consensus 70 Gf~---------~~d~~l~~i~~--~N~ld---~~~~~~~~~aliyG~sf~~v~~~~dg~~~--i~~~sp~~~~~i~D~~ 133 (409) T protein:vir:94 70 EFE---------NDDFTVNEIFE--ENNPD---IFFDSAVLSSLIASCSFTYISKGENDAVR--LQVIEAVNATGIIDPI 133 (409) T ss_pred ccc---------CCchHHHHHHH--hcChh---HHHHHHHHHHHHhcceeEEEecCCCCceE--EEEeccceEEEEEecC Confidence 222 12334666654 24443 34456778888999999988776654321 111111 1 Q ss_pred Cccee----eecC---Cce-EEEEee--------ecccccceeeeccc--cccccccc--cc--cccc----chhHHHHH Q lcl|NC_010576. 149 VGKIM----QFFP---RQV-MVRVWN--------DNTGLEQDLLVSKE--NCIIIESP--FY--AILN----DTNQTLRM 202 (447) Q Consensus 149 ~~~~~----~~~~---~~~-~~~~~~--------~~~~~~~~~~~~~~--~v~~~~~~--~~--~~~~----~~~~~~~~ 202 (447) ..++. .+.. +.. ...+|. ...+.+....++.. .++++.+. .. .+.+ ........ T Consensus 134 ~~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da 213 (409) T protein:vir:94 134 TGLLTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSN 213 (409) T ss_pred CCceeeeEEEEEecCCCceEEEEEEecCcEEEEEecCceeEeeeCCCCCcceEEeccccccccccCccccchhHHHHHHH Confidence 11110 0100 000 001110 01111111111111 12333210 00 0111 11122222 Q ss_pred HHHHHHHHHHHHHHhhcCcc-cceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecC-----CCceeeecCCChh Q lcl|NC_010576. 
203 LEQKIKLMNSQDNRASSGKL-NGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLD-----TQEKFVSAGMGLQ 276 (447) Q Consensus 203 ~~~~~~~~~~~~~~~n~~~~-~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~-----~g~~~~~l~~~~~ 276 (447) +...+..+....++.. .| +.++-.+... +..+.|+.. .++++.++ .+.++.++....- T Consensus 214 ~~r~~~~~~~~~e~~a--~pqr~i~G~d~d~------~~~~~~~~~--------~~~i~~~~~d~dg~~~~v~q~~~~~l 277 (409) T protein:vir:94 214 AKRTLERADVTAEFYS--FPQKYVTGLSDDA------EPMETWKAT--------VSSMLQFTKDEDGDKPTLGQFTQPSM 277 (409) T ss_pred HHHHHHHHHHHHHHhc--ChhheeEecCCCC------cccchhhhh--------HHHhhcCCCCCCCCCceEEecCCCCh Confidence 3333332333333322 22 2222222111 111223221 12344443 2346666654333 Q ss_pred hhhHHHHHHHHHHHHHHhCCCHHHhcCCcH---HHHHHHHHHHHHhHHHH--------HHHHHHHh--hcCChh--HhcC Q lcl|NC_010576. 277 NNLLSDVRQLQQDFYNQMGITEAILNGTAN---EQQTLGYYNRCVDVLLQ--------YVTDAISR--IALTKT--AVSQ 341 (447) Q Consensus 277 ~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~---e~~~~~f~~~ti~P~~~--------~ie~~l~~--kLl~~~--e~~~ 341 (447) ..+++.++.+..++|..-++|++.+++... ......+-...|.-.++ .+++.+-. .+.... .... T Consensus 278 ~~~~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~~~~~~~ 357 (409) T protein:vir:94 278 SPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDAPYLREQ 357 (409) T ss_pred hHHHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccc Confidence 345788889999999999999999987432 11111111111111111 11111110 111100 0111 Q ss_pred CceEEEecchhhhcC---HHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCC Q lcl|NC_010576. 
342 GQVLVYYRNPFKLVP---VEQLATVADVLTRNA--IYTPNEIRELTGKAPHP 388 (447) Q Consensus 342 g~~i~f~~~~l~~~d---~~~~~~~~~~~~~~G--~~t~NE~R~~~gl~p~~ 388 (447) ...+++...++...+ ....++++.|+++.| +..-+-+++++|+..-+ T Consensus 358 ~~~~~v~W~p~~~~~~~~~a~~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~~d 409 (409) T protein:vir:94 358 FRKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIEGGE 409 (409) T ss_pred cccceEEeccCCCcchHHHHHHHHHHHHHHHhcccccchhHHHHHcCCCCCC Confidence 223444444554444 456778899999998 55668899999999765 No 171 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=92.10 E-value=0.013 Score=30.96 Aligned_cols=405 Identities=9% Similarity=0.072 Sum_probs=147.4 Q ss_pred CchhHhh--hhhcccccCCccc------ccccccccccccc--------cccccccccc----CCccc-------ccchh Q lcl|NC_010576. 1 MASSDRL--LHSWNAFQSNQNQ------NQNTNDFLTPSNG--------MTSFGGYYGR----GQSNY-------SRSYS 53 (447) Q Consensus 1 Mg~~~~l--~~~~~~f~~~~~~------~~~~~~~~~~~~~--------~~~~~~~~~~----~~~~~-------~~~~~ 53 (447) ||||+.| +++|.-.-..+.. ..+++.+-....- ....+|.+.. ..+.. ..=+. T Consensus 4 ~~~~~~l~~~~~~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~y~~~e~~~~~~~eLI~~YR~ 83 (524) T protein:vir:98 4 LGFGNVLSFFKNFAREDEIELEQQLKNDTGSVAPPKNNDGAYEIETDLNNQKYAGVFQQFYSGQDPAIQNKEQLINTYRG 83 (524) T ss_pred cchhhHHHHhhhhhhhhhhhHhhhhcCCcccccCCCCCCCceeecCCCCcceecceeeeeccccccccchHHHHHHHHHH Confidence 8888864 4455422111111 1111111111000 0011121111 00000 11123 Q ss_pred hhhhHHHHHHHHHHHHhhccC-----ceEEEEEcCC--CceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCe Q lcl|NC_010576. 54 YNKADLIKSVITRIALDASMV-----DFKHLKIDPI--SGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQI 126 (447) Q Consensus 54 ~~~~~~v~~cv~~ia~~ia~l-----p~~~~r~~~~--~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna 126 (447) .+.+|-|.+||+-|.+++.-. |+.+-=.+.+ ...++.......++|+. -|-...+++ ++..+...|-. 
T Consensus 84 ma~~pEvd~Av~eIVneaIv~~~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~l-l~F~~~~~~----~fR~WYVDgRi 158 (524) T protein:vir:98 84 IMSYPEVENAVSEIIDDAIVNEQGKDIITMDLAKTNFSKAIQDKIVEEFDNVLNI-YDFDNMGAR----LFRDWYVDSRI 158 (524) T ss_pred HhhccchhhHHHhhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHH-hccchhhhH----HHhhhhhccee Confidence 456788999999998887422 2222111111 00111112223344432 122233333 44556678888 Q ss_pred eEEEeeccCCc--ccceeeeccCCCcceeee----cCCce-------EEEEee----------ecccccceeeecccccc Q lcl|NC_010576. 127 AMVPIDTTVDP--DSGSFDINTARVGKIMQF----FPRQV-------MVRVWN----------DNTGLEQDLLVSKENCI 183 (447) Q Consensus 127 ~i~~~~~~~~~--~~~~~~~~~~~~~~~~~~----~~~~~-------~~~~~~----------~~~~~~~~~~~~~~~v~ 183 (447) |..+.-+.... +.....+.|..+..+... .++.+ .+.+|. ..+...+.+.++.+.|. T Consensus 159 ~fhkiid~~~~kGI~ELr~lDPr~i~~vr~~~~~~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~dAIv 238 (524) T protein:vir:98 159 YFHKIMHKDESKGIRELRQLDPRCMELIRESITETLDGGVKVFRGYREFFVYSAPKAGYTYNGQIYQANQKIKIPRSAIV 238 (524) T ss_pred EEEEEEcCCCCcceeeeeeeCCccceeeeeccccccccchhhccceeeeeeeccCCCccccccceecCCCceeechhhee Confidence 87665433322 333333333333222100 00000 011111 11123345678888899 Q ss_pred cccccccccccchhHHHHHHHHHHHHHHHHHH----H--hhcCccc-ceeeeCCcCChHHHHHHHHHHHHHHHHHhcc-- Q lcl|NC_010576. 184 IIESPFYAILNDTNQTLRMLEQKIKLMNSQDN----R--ASSGKLN-GFIQFPYSTKSTARAAQAARRKQEIENEMAN-- 254 (447) Q Consensus 184 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~--~n~~~~~-gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 254 (447) |..+.+.+.....-+-+..+...+..+.-... + .+.---+ +.|.++ .+. 
+..+++....+...+++ T Consensus 239 y~hSGL~d~~~~iisyLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvG-nlP----k~KAeqYl~~im~k~kNkl 313 (524) T protein:vir:98 239 YAHSGLEDCSNNIIGYLHRAVKPANQLRLLEDAMVIYRITRAPERRVFYIDVG-QMG----GNKATQYVNNIAQGLKNRV 313 (524) T ss_pred eeccCcccCCCCeeeehhHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecC-CCC----chhHHHHHHHHHHhcCcee Confidence 98876544332221222322222222211111 1 1111111 223333 232 23344444444444431 Q ss_pred ----CCcce-------eec-------CC---CceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhc--------CCc Q lcl|NC_010576. 255 ----NKYGV-------ATL-------DT---QEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILN--------GTA 305 (447) Q Consensus 255 ----n~~~~-------~vl-------~~---g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~--------g~~ 305 (447) +.|.| ..| -+ |.+++.|.-.-.--+++..++..+...++++||.+-|. |.+ T Consensus 314 vYDa~TGevrddrk~msMlEDyWLpRReGgrgTEItTLpggqnlgem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~ 393 (524) T protein:vir:98 314 VYDARTGTVKNQQNNLSMTEDYWLMRRDGKAITEVSTLPGGQNFSDMDDIKWFNRKLYEALRVPLSRMPRDDGGMQIGGG 393 (524) T ss_pred EeeccCceeeccccccchhhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCceeccCCCCccccccc Confidence 11222 112 13 33444443333334688899999999999999998883 111 Q ss_pred H-----HHHHHHHHHHHHhHHHHHHHHHHHhhc-----CChhHhc-CCceEEEec--chhh----hcC-HHHHHHHHHHH Q lcl|NC_010576. 306 N-----EQQTLGYYNRCVDVLLQYVTDAISRIA-----LTKTAVS-QGQVLVYYR--NPFK----LVP-VEQLATVADVL 367 (447) Q Consensus 306 ~-----e~~~~~f~~~ti~P~~~~ie~~l~~kL-----l~~~e~~-~g~~i~f~~--~~l~----~~d-~~~~~~~~~~~ 367 (447) + |-....|+..-=.-+...+.+.|-..| +++.|+. ...+|+|++ |.-. ... 
+..|+.++..+ T Consensus 394 ~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~ 473 (524) T protein:vir:98 394 GEITRDELKFSKFIRTLQIQFSPVLSDPLKTNLIAKKIITEDEWEENVSKISFVFQQDSYYAEVKDIEILERRLNLMSQV 473 (524) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHh Confidence 2 222233433222233344444444443 4555552 233455443 2221 000 11222222221 Q ss_pred Hh-CC-CcCHHHHHHH-hCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCccccc Q lcl|NC_010576. 368 TR-NA-IYTPNEIREL-TGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTS 434 (447) Q Consensus 368 ~~-~G-~~t~NE~R~~-~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (447) -. .| +++.+=+|+. +.+.-. ++. .. ..+..+..+.+.-++ |.+++.+= T Consensus 474 dpyvGky~s~dyi~k~ILr~tDe------ei~-~~---~k~I~~E~k~~~~~~---------p~~e~~~f 524 (524) T protein:vir:98 474 EGVVGKYVSHKYIMKEILRMSDE------DID-EQ---AKLIEEESKEERFKN---------PEAEEENF 524 (524) T ss_pred ccccccccchHHHHHHHhccCHH------HHH-HH---HHHHHHHHhCCCCcC---------CccccccC Confidence 11 11 3333333321 121100 000 00 000000001111110 00000000 No 172 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=91.21 E-value=0.017 Score=30.29 Aligned_cols=398 Identities=9% Similarity=0.019 Sum_probs=138.2 Q ss_pred Cc----hhHhh-----------hhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHH Q lcl|NC_010576. 1 MA----SSDRL-----------LHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVIT 65 (447) Q Consensus 1 Mg----~~~~l-----------~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~ 65 (447) |+ ++.+| .+....+..+..-... +.. .. 
......+ .......-+|+ T Consensus 1 ~~t~~~~i~~L~~~~~~~~~r~~~l~~Yy~G~~~i~~~---------------~~~-~~-~~~~~~~--~~~n~~~~ivd 61 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTI---------------GIG-AP-PELAYLD--VQPGWVATYLR 61 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc---------------ccc-cc-hhHhhhh--hhcchHHHHHH Confidence 33 12222 2222233322110000 000 00 0000000 01112344555 Q ss_pred HHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcc--c--ce Q lcl|NC_010576. 66 RIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPD--S--GS 141 (447) Q Consensus 66 ~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~--~--~~ 141 (447) ..++.+--.. ++... + ...+..+..+++. |.. ......+....+.+|.||+++.+...... . .. T Consensus 62 ~~~~~l~~~g---~~~~~-d---~~~~~~l~~i~~~--N~~---d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~~ 129 (480) T protein:vir:78 62 TLSDRLDIEG---FRISE-D---SEGLEELWNWWQA--NDL---DEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIPL 129 (480) T ss_pred HHHhhhccCc---eecCC-C---chhHHHHHHHHHh--cCH---HHHHHHHHHHHhhcCceEEEEecCccccCCCCCeeE Confidence 5555442222 22221 1 1234456666642 433 34566778899999999987764321100 0 00 Q ss_pred eeeccC-CC---------ccee---eec---C--Cce-EEEEeee--------cccc-------cceeee--cccccccc Q lcl|NC_010576. 142 FDINTA-RV---------GKIM---QFF---P--RQV-MVRVWND--------NTGL-------EQDLLV--SKENCIII 185 (447) Q Consensus 142 ~~~~~~-~~---------~~~~---~~~---~--~~~-~~~~~~~--------~~~~-------~~~~~~--~~~~v~~~ 185 (447) +.+.++ .+ .++. .++ . +.. .+.+|.. ..+. 
.....+ ..-.++++ T Consensus 130 i~~~~p~~~~~~~D~~~~~~~~~~i~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f 209 (480) T protein:vir:78 130 IRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPL 209 (480) T ss_pred EEEEcccceEEEEcCCCccceEEEEEEEEeecCCCceEEEEEEeCCeEEEEEecCCCccccccccccccCCCCCcceEEe Confidence 111111 00 0110 000 0 000 0111100 0000 000000 00122333 Q ss_pred cccc-c---ccccch----hHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCc Q lcl|NC_010576. 186 ESPF-Y---AILNDT----NQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKY 257 (447) Q Consensus 186 ~~~~-~---~~~~~~----~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~ 257 (447) .+.. . .+.+.+ ......+...+..+....++.. .+.-++. +....+.. .+.. ...|.. -.+ T Consensus 210 ~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a--~p~~~i~-G~~~~~~~-~~~~---~~~~~~----~~~ 278 (480) T protein:vir:78 210 TNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILG--TPLRVIS-GVTTDELT-NDGE---NTTLDI----YYG 278 (480) T ss_pred ecccccCCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhc--chhhhhh-cCCccccc-cccc---cchhhh----hhh Confidence 3210 0 011111 1112222222222222222221 2211221 11111111 0100 001111 123 Q ss_pred ceeecC-CCceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCc---HHHHHHHHHHHHHhHHHHHHHHHHHhhc Q lcl|NC_010576. 258 GVATLD-TQEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNGTA---NEQQTLGYYNRCVDVLLQYVTDAISRIA 333 (447) Q Consensus 258 ~~~vl~-~g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~---~e~~~~~f~~~ti~P~~~~ie~~l~~kL 333 (447) .++.++ ...+|.++....-...++.++....+|+..=++|++.+++.. 
.....+.+....|.-.+...+..|...| T Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l 358 (480) T protein:vir:78 279 RILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAW 358 (480) T ss_pred hhccCCCCCceEEecCccCHHHHHHHHHHHHHHHhcccCCChHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 344443 445777766544444567778888889999999999997642 1111222222222222222222111111 Q ss_pred ----------CChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCcccccccccccc Q lcl|NC_010576. 334 ----------LTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNA--IYTPNEIRELTGKAPHPNPLANELFNRNIA 401 (447) Q Consensus 334 ----------l~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G--~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~ 401 (447) ...........+++.+..-...+..+.++.+.+++.+| +++.--+++.+|+.+-+-..-.+....... T Consensus 359 ~~~~~l~~~~~g~~~~~~~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~~~~~~~~e~~~ 438 (480) T protein:vir:78 359 ERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETE 438 (480) T ss_pred HHHHHHHHHHcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHHhccccCCHHHHHhcCCCCHhHHHHHHHHHHHHHH Confidence 11100011123455555555668889999999988876 677766788877765321111111000000 Q ss_pred -chhhcccccCCCCCCCCCCCcCCCC--CCCcccccccCCccCcC Q lcl|NC_010576. 402 -DGNQVGGINTPGQITSDQPATASTD--PLNNVSTSAIENGSLTD 443 (447) Q Consensus 402 -~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~ 443 (447) ...+.... .+++......++.++. +++++..+..++.+ . 
T Consensus 439 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~ 480 (480) T protein:vir:78 439 DMIDTLYST-TKAQADATPKPTVTETKTETQTSPSGFNRTKT--R 480 (480) T ss_pred HHHHHhhcc-ccccCCCCCCCCCCCCCCccccccCCCCcccC--C Confidence 00011110 0111111111122222 22222222333322 2 No 173 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=90.32 E-value=0.021 Score=29.73 Aligned_cols=358 Identities=9% Similarity=0.011 Sum_probs=133.0 Q ss_pred Cch--hHhhhhhcccccCCccccccccccccccccccccccccccCCcccccc-hhhhhh--HHHHHHHHHHHHhhccCc Q lcl|NC_010576. 1 MAS--SDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRS-YSYNKA--DLIKSVITRIALDASMVD 75 (447) Q Consensus 1 Mg~--~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~--~~v~~cv~~ia~~ia~lp 75 (447) |-. +++|...|.-.+++ ......++........ .+ ..+... +...+. ....-+|+.+++ .+- T Consensus 1 m~~~~i~~L~~~~~~~~~r---~~~~~~yy~g~~~~~~-~~------~~~p~~~~~~~~~v~nw~~~~Vd~~a~---rl~ 67 (422) T protein:vir:97 1 MNYMGMGYLRRKLALFKTG---VDKRYRYYAMDDRDDT-RS------IVMPNNVREMYRSVLEWTAKGVDSLAD---RII 67 (422) T ss_pred CChHHHHHHHHHHHHHHHH---HHHHHHHHhcCCChhh-cC------ccccHHHHHHHHhhcchhHHHHHHHHh---ccc Confidence 332 24555545443322 2222222211111000 00 000000 011110 112223333332 122 Q ss_pred eEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCC-cceee Q lcl|NC_010576. 76 FKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARV-GKIMQ 154 (447) Q Consensus 76 ~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~-~~~~~ 154 (447) +.=|+ ..+..+.+++. =|... .....+....+.+|.||+++..+.....+ .+.+.++.. .-++. 
T Consensus 68 ~~Gf~---------~~d~~l~~~w~--~N~ld---~~~~~~~~~al~~G~sf~~v~~~~~~~~p-~i~~~sp~~~~~i~D 132 (422) T protein:vir:97 68 FREFT---------NDDFNAWEIFK--ANNPD---IFFDTAIQSALIASCCFVYIMPGAEDGLP-KMQVIEASKATGILD 132 (422) T ss_pred cceee---------CCchhHHHHHH--hcChH---HHHHHHHHHHHHhcceeEEEeeCCCCCee-EEEEechhhEEEEEe Confidence 22222 12334666664 25543 34446678889999999998765422111 121111111 00000 Q ss_pred -----------ec----CCceEEEE-eee-------cccccceeeeccc--cccccccc-c-c--ccccc----hhHHHH Q lcl|NC_010576. 155 -----------FF----PRQVMVRV-WND-------NTGLEQDLLVSKE--NCIIIESP-F-Y--AILND----TNQTLR 201 (447) Q Consensus 155 -----------~~----~~~~~~~~-~~~-------~~~~~~~~~~~~~--~v~~~~~~-~-~--~~~~~----~~~~~~ 201 (447) .+ .+...... +.+ ..+.+....++.. .++++.+. . . .+.+. ...... T Consensus 133 ~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~d 212 (422) T protein:vir:97 133 PTTFLLTEGYAILESDSNGNPTLEAYFTDKDIWYYPKKGKPYNIKNPTGHPLLVPIIHRPDAVRPFGRSRITKAGMYHQK 212 (422) T ss_pred CCCCcceeeEEEEEecCCCcEEEEEEEcCceEEEEcCCCccccccCCCCCcceEEecccCCCccccCccccchhHHHHHH Confidence 00 01111111 000 0000000000000 12222210 0 0 01111 112222 Q ss_pred HHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCC-----CceeeecCCChh Q lcl|NC_010576. 202 MLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDT-----QEKFVSAGMGLQ 276 (447) Q Consensus 202 ~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~-----g~~~~~l~~~~~ 276 (447) .+...+..+....++.. ...+.++-.+ ++. ...+.|+.. -++++.++. 
+.++.++....- T Consensus 213 a~~r~~~~~~~~~e~~a-~pqr~i~G~d----~d~--~~~~~~~~~--------~~~i~~~~~de~~~~~~v~q~~~~~l 277 (422) T protein:vir:97 213 AAKRTLERAEVTAEFYS-FPQKYVLGMD----PDA--KPMEKWRAT--------VSTLLEISKDEDGDKPTVGQFTTASM 277 (422) T ss_pred HHHHHHHHHHHHHHHhc-chhhhhcccC----ccc--ccCchhhhh--------hhhhhccCCCCCCCcceeeecCCCCh Confidence 22333322223333322 1112222111 111 111222221 134444432 346666655444 Q ss_pred hhhHHHHHHHHHHHHHHhCCCHHHhcCCcH---HHHHHHHHHHHHhHHHH--------HHHHHHHhh--cCCh--hHhcC Q lcl|NC_010576. 277 NNLLSDVRQLQQDFYNQMGITEAILNGTAN---EQQTLGYYNRCVDVLLQ--------YVTDAISRI--ALTK--TAVSQ 341 (447) Q Consensus 277 ~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~---e~~~~~f~~~ti~P~~~--------~ie~~l~~k--Ll~~--~e~~~ 341 (447) ..+++.++.+..+||..=++|++.+++... ......+-...|.-.++ .+++.+-.. +... ..... T Consensus 278 ~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~ 357 (422) T protein:vir:97 278 APFMEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDEFPYLRNQ 357 (422) T ss_pred hHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccchh Confidence 456788899999999999999999987432 11111111111111111 111111110 1110 00111 Q ss_pred CceEEEecchhhhcC---HHHHHHHHHHHHhC--CCcCHHHHHHHhCCCCCCCccccccccccccchhhccccc Q lcl|NC_010576. 342 GQVLVYYRNPFKLVP---VEQLATVADVLTRN--AIYTPNEIRELTGKAPHPNPLANELFNRNIADGNQVGGIN 410 (447) Q Consensus 342 g~~i~f~~~~l~~~d---~~~~~~~~~~~~~~--G~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~ 410 (447) ...+++........| ....++++.|++++ |++...-+++++|+...+. +... 
..+....+ T Consensus 358 ~~~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~~~~~~----~~~~-----~~~~~~d~ 422 (422) T protein:vir:97 358 FMDTVIKWEPLFEADANMLTLVGDGAIKLNQAIPGFMDADVIRDLTGVKGADK----PIPA-----ITEVTTDG 422 (422) T ss_pred hccceEEEccCCCCChHHHHHHHHHHHHHHhhccccccHHHHHHHcCCCchhH----HHHH-----HHhhhccC Confidence 223344444444445 55667888888888 7888889999999965322 1110 00110000 No 174 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=90.22 E-value=0.022 Score=29.67 Aligned_cols=394 Identities=9% Similarity=-0.016 Sum_probs=143.5 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhh-hhhHHHHHHHHHHHHhhccCceEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSY-NKADLIKSVITRIALDASMVDFKHL 79 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~~cv~~ia~~ia~lp~~~~ 79 (447) +-+..+|.+.|.-.+++ ......++........ .+.. ....+ +.+ ....+..-||+.+++.+- +.=| T Consensus 17 ~~~~~~L~~~~~~~~~~---~~~~~~Yy~G~~~~~~-~~~~--~p~~~---r~~~~v~nw~~~~Vd~~a~rl~---~~Gf 84 (474) T protein:vir:81 17 NALINGLLAQIENLRWK---NLLRTSYYENKRTIQY-VGTL--IPPQY---FNLGLVLGWTGKAVDALARRCN---LEGF 84 (474) T ss_pred HHHHHHHHHHHHHHhhH---HHHHHHHhccCCChhh-cccc--ccHHH---HHHHhhcChHHHHHHHHHhhhc---ccce Confidence 56666666655443322 2222222211111000 0000 00000 000 111233445555555333 3334 Q ss_pred EEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCC---------Cc Q lcl|NC_010576. 80 KIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTAR---------VG 150 (447) Q Consensus 80 r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~---------~~ 150 (447) +. +++. ..+..+..++. =|... .....+....+.+|.||+++..++.+.....+.+.++. .. 
T Consensus 85 ~~-~d~~---~~~~~l~~iw~--~N~ld---~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp~~~~~~~D~~~~ 155 (474) T protein:vir:81 85 VW-PDGD---LDSLGGTEVVD--DNHLL---SEIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDASEATGEWNRRRR 155 (474) T ss_pred EC-CCCC---ccchHHHHHHH--hcChh---HHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceEEEEEeCCCC Confidence 42 2221 12345666663 35443 34556677888999999988765544322222111111 11 Q ss_pred ce---e-ee---cCCce-EEEEee---------ecc-cccce--eeecc-ccccccccc-c-c--cccc----chhHHHH Q lcl|NC_010576. 151 KI---M-QF---FPRQV-MVRVWN---------DNT-GLEQD--LLVSK-ENCIIIESP-F-Y--AILN----DTNQTLR 201 (447) Q Consensus 151 ~~---~-~~---~~~~~-~~~~~~---------~~~-~~~~~--~~~~~-~~v~~~~~~-~-~--~~~~----~~~~~~~ 201 (447) .+ + .+ ..+.. ...+|. ... +.+.. ..++- -.++++.+. . . .+.. ....... T Consensus 156 ~~~~al~~~~~~~~g~~~~~~ly~~~~~~~~~~~~~~~~w~~~~~~~~~gvPvV~~~n~~~~~~~~G~s~i~e~v~~l~d 235 (474) T protein:vir:81 156 GLNNLLSIIDKDKEGKVLSLALYLDNETVTAQRDKATLKWQVDRDEHVYGVPAQVLPYKPAPKRPFGQSRITKPMMGLQD 235 (474) T ss_pred cceeeeEEEEEcCCCcEEEEEEEeCCcEEEEEEcCccceeeeccCCCCCCcceEEecccccccCcCCccccchhHHHHHH Confidence 10 0 00 01111 011110 000 00000 00000 112333211 0 0 1111 1112222 Q ss_pred HHHHHHHHHHHHHHHhhcCcccceeeeCCc-CChHHHHHHHHHHHHHHHH--HhccCCcceeecCCCceeeecCCChhhh Q lcl|NC_010576. 202 MLEQKIKLMNSQDNRASSGKLNGFIQFPYS-TKSTARAAQAARRKQEIEN--EMANNKYGVATLDTQEKFVSAGMGLQNN 278 (447) Q Consensus 202 ~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~-~~~~~~~~~~~~~~~~~~~--~~~~n~~~~~vl~~g~~~~~l~~~~~~~ 278 (447) .+...+..+....++.. ..-+.++-.... ..+++ ......|+..... .+..+..+..+...+.++-++....-.. 
T Consensus 236 a~~r~~~~~~~~~e~~a-~pqr~i~G~~~~~~~d~d-~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~~a~l~~ 313 (474) T protein:vir:81 236 AGVRELARREGHMDVFS-YPEFWLLGADESALKNAD-GTIKSVWEARLGRIKGLPDDADADIPQLARADVKQFPAASPDA 313 (474) T ss_pred HHHHHHHHHHHHHHHhc-chhheeecCChhhccccc-ccccchhhhhHHHHhcCCCcccccccccccccccccCCCChhH Confidence 22233322222333321 111223221100 00000 0111223222211 1122222222233455666766554445 Q ss_pred hHHHHHHHHHHHHHHhCCCHHHhc-C--Cc-HHHHHHHHHHHHHhHHH--------HHHHHHHHhhcC--Ch---hHh-c Q lcl|NC_010576. 279 LLSDVRQLQQDFYNQMGITEAILN-G--TA-NEQQTLGYYNRCVDVLL--------QYVTDAISRIAL--TK---TAV-S 340 (447) Q Consensus 279 ~l~~~~~~~~~Ia~~fgVP~~~l~-g--~~-~e~~~~~f~~~ti~P~~--------~~ie~~l~~kLl--~~---~e~-~ 340 (447) +++.++.+..+||..=++|++.|| + .+ ............|.--+ ..+++.+-..+- .. .+. . T Consensus 314 ~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~ 393 (474) T protein:vir:81 314 HWSDINGLAKLFAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKVAIDEIPD 393 (474) T ss_pred HHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccch Confidence 678888889999999999999986 2 11 11111111111111111 122222111110 00 000 1 Q ss_pred CCceEEEecchhhhcCHHHHHHHHHHHHhCCC-c-CHHHHHHHhCCCCCCCc-cccccccc-cccchhhcccccCCCCCC Q lcl|NC_010576. 341 QGQVLVYYRNPFKLVPVEQLATVADVLTRNAI-Y-TPNEIRELTGKAPHPNP-LANELFNR-NIADGNQVGGINTPGQIT 416 (447) Q Consensus 341 ~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~-~-t~NE~R~~~gl~p~~g~-~~~~~~~~-~~~~~~~~~~~~~~~~~~ 416 (447) ..+.+++...+....+..++++++.|+++.|. + .-.=+++++|+.+-+=. +-.+.-.. .............+++.. T Consensus 394 ~~~~~~v~W~d~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~t~~~i~~~~~~~~~~~~~~~~~~l~~~~~~~~~a 473 (474) T protein:vir:81 394 EWKSIDAKWRDPRYLSKSAQADAGMKQLAAVPWLAETEVGLELIGLTPQQARRAMADKRRVQGRGTLQALIDRSNNGATA 473 (474) T ss_pred hhccceeEecCCCccCHHHHHHHHHHHHhcccCCCcHHHHHhhcCCCHHHHHHHHHHHHHHhHHHHHHHHHhcCCCCCCC Confidence 11234444456667788999999999999874 3 33446777888754210 00000000 000011111111111111 Q ss_pred C Q lcl|NC_010576. 
417 S 417 (447) Q Consensus 417 ~ 417 (447) + T Consensus 474 q 474 (474) T protein:vir:81 474 Q 474 (474) T ss_pred C Confidence 1 No 175 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=87.56 E-value=0.038 Score=28.37 Aligned_cols=344 Identities=8% Similarity=-0.031 Sum_probs=133.9 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccc-hhhhh--hHHHHHHHHHHHHhhccCceE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRS-YSYNK--ADLIKSVITRIALDASMVDFK 77 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~--~~~v~~cv~~ia~~ia~lp~~ 77 (447) .-.+++|.+.|...+++ ......++........ .+ ..+... +...+ .....-+|+.+++.+ -+. T Consensus 3 ~~~i~~L~~~~~~~~~r---~~~~~~yY~g~~~~~~-~~------~~~p~~~~~~~~~v~nw~~~iVds~a~rl---~~~ 69 (409) T protein:vir:16 3 EKGIGYLRFKLSVHKRR---AEMRYEQYAMKHVDRF-KG------ITIPQALSQQYRSILGWCAKGVDSLADRL---VFR 69 (409) T ss_pred HHHHHHHHHHHHHHhHH---HHHHHHHHhccCchhh-cc------hhhhHHHHHHHhhhcChhHHHHHHhHhhc---ccc Confidence 34455565554433222 1222222111110000 00 000000 00001 112223333333322 222 Q ss_pred EEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccC---------C Q lcl|NC_010576. 78 HLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTA---------R 148 (447) Q Consensus 78 ~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~---------~ 148 (447) =|+ ..+..+.+++. =|... .....+....+.+|.||+++..+..+.+. +.+.++ . 
T Consensus 70 Gf~---------~~d~~l~~i~~--~N~ld---~~~~~~~~~al~yG~sf~~v~~~~dg~~~--i~~~sP~~~~~i~D~~ 133 (409) T protein:vir:16 70 EFE---------NDDFTVNEIFE--ENNPD---IFFDSTVLSALIASCSFTYISKGENDAVR--LQVIEATNATGIIDPI 133 (409) T ss_pred ccc---------CcchHHHHHHH--hcChh---HHHHHHHHHHHHhCceeEEEecCCCCceE--EEEEcccceEEEeecc Confidence 222 12334666653 24433 34456777889999999998876654321 111111 1 Q ss_pred Ccce----eeecC----CceEEEEee--------ecccccceeeeccc--cccccccc--cc--cccc----chhHHHHH Q lcl|NC_010576. 149 VGKI----MQFFP----RQVMVRVWN--------DNTGLEQDLLVSKE--NCIIIESP--FY--AILN----DTNQTLRM 202 (447) Q Consensus 149 ~~~~----~~~~~----~~~~~~~~~--------~~~~~~~~~~~~~~--~v~~~~~~--~~--~~~~----~~~~~~~~ 202 (447) ..++ ..+.. ......+|. ...+.+....++.. .++++.+. +. .+.+ ........ T Consensus 134 ~~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da 213 (409) T protein:vir:16 134 TGLLTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSN 213 (409) T ss_pred cccceeeeEEEEecCCCceEEEEEEecCcEEEEEecCccccceecCCCCcceEEecccccccccCCccccchhHHHHHHH Confidence 1111 00110 001111110 00111111111111 12333211 00 1111 12222233 Q ss_pred HHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecC-----CCceeeecCCChhh Q lcl|NC_010576. 203 LEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLD-----TQEKFVSAGMGLQN 277 (447) Q Consensus 203 ~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~-----~g~~~~~l~~~~~~ 277 (447) +...+..+....++.. ...+.++-.+.... ..+.|+.. .++++.++ .+.++.++....-. 
T Consensus 214 ~~r~~~~~~~~~e~~a-~pqr~i~G~d~d~~------~~~~~~~~--------~~~i~~~~~d~~g~~~~v~q~~~~~l~ 278 (409) T protein:vir:16 214 AKRTLERADVTAEFYS-FPQKYVTGLSDDAE------PMETWKAT--------VSSMLQFTKDEDGDKPTLGQFTQPSMS 278 (409) T ss_pred HHHHHHHHHHHHHHhc-ChhheeEecCCCCC------ccchhhhh--------hhHhhccCCCCCCCCceEEecCCCChh Confidence 3333333333333332 11122322221111 11122211 13344443 23466666544434 Q ss_pred hhHHHHHHHHHHHHHHhCCCHHHhcCCcHH---HHHHHHHHHHHhHHHHHHHHHHHhhc----------CCh--hHhcCC Q lcl|NC_010576. 278 NLLSDVRQLQQDFYNQMGITEAILNGTANE---QQTLGYYNRCVDVLLQYVTDAISRIA----------LTK--TAVSQG 342 (447) Q Consensus 278 ~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~e---~~~~~f~~~ti~P~~~~ie~~l~~kL----------l~~--~e~~~g 342 (447) .+++.++.+..++|..=++|++.++++... ......-...|.-.++.-+..|...+ ... ...... T Consensus 279 ~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~ 358 (409) T protein:vir:16 279 PFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDVPYLREQF 358 (409) T ss_pred HHHHHHHHHHHHHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccchhh Confidence 567888999999999999999999875321 11111111111111111111111111 000 001111 Q ss_pred ceEEEecchhh---hcCHHHHHHHHHHHHhCC-Cc-CHHHHHHHhCCCCCC Q lcl|NC_010576. 343 QVLVYYRNPFK---LVPVEQLATVADVLTRNA-IY-TPNEIRELTGKAPHP 388 (447) Q Consensus 343 ~~i~f~~~~l~---~~d~~~~~~~~~~~~~~G-~~-t~NE~R~~~gl~p~~ 388 (447) ..+++...+.. 
..+....++++.|+++.| .+ .-+-+++++|+..-+ T Consensus 359 ~~~~v~W~~~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~v~~~~~g~~~~d 409 (409) T protein:vir:16 359 SKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIKGAE 409 (409) T ss_pred ccceEEecCCCCcchhhHHHHHHHHHHHHhhcccccchhHHHHhccCCCCC Confidence 22333333433 334678899999999997 33 346679999998755 No 176 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=86.80 E-value=0.043 Score=28.07 Aligned_cols=427 Identities=11% Similarity=0.097 Sum_probs=148.5 Q ss_pred Cchh---Hh---------hhh-hcccccCCccccccccccc--cc--cccc-cccccccccCCc-ccccchh-hhhhHHH Q lcl|NC_010576. 1 MASS---DR---------LLH-SWNAFQSNQNQNQNTNDFL--TP--SNGM-TSFGGYYGRGQS-NYSRSYS-YNKADLI 60 (447) Q Consensus 1 Mg~~---~~---------l~~-~~~~f~~~~~~~~~~~~~~--~~--~~~~-~~~~~~~~~~~~-~~~~~~~-~~~~~~v 60 (447) |-.| .| |+. .+++-.++...-.+..... .+ ..+. ..+.|+...+-- -+..=+. ++.+|-| T Consensus 1 ~~~~~~w~~~de~~~~~~~~~~~~~~~~p~~~dG~s~i~~~~~~~~~~~~~~~~~~gg~~~n~~eLI~~YR~ma~~~pEV 80 (533) T protein:vir:58 1 MPSLEKYKKLNEAVNFTNFLSPMYGMGAPHGAGGSSMIPINMYHPFATAGYASRFYGGIEFNRFFLYDMYDRMDYTDPLI 80 (533) T ss_pred CCCcchhhhhhHHHHHHHhhchhhcccCccCCCCCccccCCCCcchhhhhhhhhhhccccccHHHHHHHHHHhhccCcch Confidence 3322 11 111 1111111111111111100 00 0000 001111000000 0111112 2467889 Q ss_pred HHHHHHHHHhhccC-----ceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeec-c Q lcl|NC_010576. 61 KSVITRIALDASMV-----DFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDT-T 134 (447) Q Consensus 61 ~~cv~~ia~~ia~l-----p~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~-~ 134 (447) .+||+.|.+.+.-. |+.+...+.+ ..+.....+.+||+. ..+++ .++..+..+|..|..+.-. . 
T Consensus 81 d~AideIvneaiv~d~~~~pV~v~l~~~e--~s~~iK~kI~~lldf----~~~~~----~~fR~WYVDGriy~Hkiik~~ 150 (533) T protein:vir:58 81 STVLDIIADECTIPNENGNIVDVVTKDIE--LAKAILSYLDYVINI----EKNAY----PIIRNMIKYGDMFLHILEKGS 150 (533) T ss_pred hhHHHhhhceeeEecCCCceeEeeccccc--ccHHHHHHHHHHhcc----hhhhh----HHHHhhhhcceeEEEeccCCc Confidence 99999999887533 3333221111 111112223333332 22333 3456677889988776432 2 Q ss_pred CCcccceeeeccCCCcceeeecCCceEEEEeee----cccccceeeecccccccccccccccccc-hhHHHHHHHH---H Q lcl|NC_010576. 135 VDPDSGSFDINTARVGKIMQFFPRQVMVRVWND----NTGLEQDLLVSKENCIIIESPFYAILND-TNQTLRMLEQ---K 206 (447) Q Consensus 135 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~v~~~~~~~~~~~~~-~~~~~~~~~~---~ 206 (447) ...+.....+.|..+..+..... ...+.+|.. .......+.++.+.|.|+.+.+..+... +-+-+..+.. + T Consensus 151 k~GI~elr~lDPr~i~~vr~~~t-~~eyyvy~~~~~~~~s~~~~~kI~~daI~y~~SGl~d~~~~~iisyLhkAiKp~NQ 229 (533) T protein:vir:58 151 DGTIEKFQVVSPYIFSKRYNPET-DTWYYVITDVYRNVVSGYFNEDIPEEDVIHFSHKIDTNFFPYGRSYLESARAIWNQ 229 (533) T ss_pred ccchhhheecCCeeeEEEEeecc-ceEEEeecccccccccCccccccchhheeeeeeccccCCCCceehhhhHHHHHHHH Confidence 22222233333333322222111 122222211 1112233567788999998765432221 1122333322 2 Q ss_pred HHHHHHHHH-HhhcCcc--c-ceeeeCCcCChHHHHHHHHHHHHHHHHHhc-c-CCccee----------ec-------- Q lcl|NC_010576. 207 IKLMNSQDN-RASSGKL--N-GFIQFPYSTKSTARAAQAARRKQEIENEMA-N-NKYGVA----------TL-------- 262 (447) Q Consensus 207 ~~~~~~~~~-~~n~~~~--~-gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-n~~~~~----------vl-------- 262 (447) |..+..+.- +.-+-.| + +.+.++. +.....++-.+.+..++.+.+. + +.|.|. 
.| T Consensus 230 LkmiEDAlVIYRisRAPeRRvFYIDVGN-lpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRR 308 (533) T protein:vir:58 230 LRLMEDALMLYRVVRSVDRRVFYVDVGN-VPPDKINEYLTNIAMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRR 308 (533) T ss_pred HHHHHHHHHHHhhcCChhheEEEEeecC-CCccCHHHHHHHHHHhcccceEEeccCCeEeeccchhhhhhhHhhhccccc Confidence 222222111 1111111 1 2344433 2221112222223333332221 1 234331 11 Q ss_pred --CCCceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhc-----CCcHHH--HHHHHHHHHHhHHHHHHHHHHHhhc Q lcl|NC_010576. 263 --DTQEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILN-----GTANEQ--QTLGYYNRCVDVLLQYVTDAISRIA 333 (447) Q Consensus 263 --~~g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~-----g~~~e~--~~~~f~~~ti~P~~~~ie~~l~~kL 333 (447) ..|.+++.|.- ..--+++..++..+.+.++++||.+-|. |.++|= ....| ...|.-+-..+.+.|...| T Consensus 309 eGgrgTEI~TLpG-g~lgemeDV~YF~kkLy~ALnVP~sRl~~e~~fgr~~eItRDEiKF-~KFI~rLR~rF~~ll~~qL 386 (533) T protein:vir:58 309 GDRRAVEIDILQG-SKVDLAEDVEYMLNRLISALKVPKAFIGYEGDVNAKNTLATQDIKF-NNTIKRIQGFFVEELERMV 386 (533) T ss_pred CCCccceeeecCC-CCCCcHHHHHHHHHHHHHHhCCCeeecCCCCCCccchhhhHHHHHH-HHHHHHHHHHHHHHHhccc Confidence 23456666652 2334678899999999999999999885 222221 11223 3445556667777777777 Q ss_pred CChhHhcC-CceEEEecchh----hhcC-HHHHHHHHHHHH--------hCCC--cCHHHHHHH------hCCCCC-CCc Q lcl|NC_010576. 334 LTKTAVSQ-GQVLVYYRNPF----KLVP-VEQLATVADVLT--------RNAI--YTPNEIREL------TGKAPH-PNP 390 (447) Q Consensus 334 l~~~e~~~-g~~i~f~~~~l----~~~d-~~~~~~~~~~~~--------~~G~--~t~NE~R~~------~gl~p~-~g~ 390 (447) +....... -+.+.|..|.. .... +..|+.++..+- ..-+ +| +|+.+. 
++-.|+ +.+ T Consensus 387 ilk~iit~eew~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~~dpyvgk~yi~k~ILr~t-dei~~q~e~ie~E~~~~~~~~~ 465 (533) T protein:vir:58 387 RMNKEFADQDFRLVMNRSNSIVEGERFAVIEQRIGIAERLKGWVREDWIYSNILQIP-YDLKPQEEVAEAAGGGGLFDTG 465 (533) T ss_pred ccccCcchhheeeeeeccchHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHhcCC-hhhhHHHHHHHHhhcCCCCCCC Confidence 54321111 12344433322 1111 112333322211 0001 12 122211 111111 111 Q ss_pred -cccccccccccchhhcccccCCCCCCCCCCC-----------cCCCCCCCcccccccCCccCcCCCCC Q lcl|NC_010576. 391 -LANELFNRNIADGNQVGGINTPGQITSDQPA-----------TASTDPLNNVSTSAIENGSLTDGGSY 447 (447) Q Consensus 391 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (447) .+. ++.+....+..++|.+.+..... +...+......+.+-.+|++.+-=-+ T Consensus 466 ~~~~-----e~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~g~~~~~~~~ 529 (533) T protein:vir:58 466 GFGE-----ETTPADFLGERGSPIESPRGRTEFDFGTEGGEELGGELNLGGAFEEFEEETGGGEEELPF 529 (533) T ss_pred Cccc-----ccCCcccCccccCcccCCCChhhHhcccCCcccccccccccccchhhhhhcCCcccCCCC Confidence 011 11111111111111111100000 00000011111111122222222122 No 177 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=85.77 E-value=0.05 Score=27.70 Aligned_cols=407 Identities=7% Similarity=0.017 Sum_probs=140.6 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) --+..+|...|.- +..+......++....-.... +. ..+.-.. +.........-+|+.+++.+--.. 
|+ T Consensus 15 ~~~~~~l~~~~~~---~~~r~~~l~~YY~G~~~i~~~-~~---~~~~~~~-~~~~v~n~~~~iVd~~~~~l~~~g---~~ 83 (486) T protein:vir:42 15 AVVREEMISAFED---ASKDLASNTSYYDAERRPEAI-GV---TVPREMQ-QLLAHVGYPRLYVDSVAERQAVEG---FR 83 (486) T ss_pred HHHHHHHHHHHHH---HHHHHHHHHHHhcccCcchhc-cc---ccchhHh-hhhhccchHHHHHHHHHhhhcccc---ee Confidence 1123333332221 111111111111110000000 00 0000000 000011233445555554442222 33 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCccc------ceeeeccC------- Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDS------GSFDINTA------- 147 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~------~~~~~~~~------- 147 (447) .. +. ...+..+.+++. -|... .....+..+++.+|.||+++.++..+... ..+.+.++ T Consensus 84 ~~--~~--~~~~~~~~~i~~--~N~~d---~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~i~ 154 (486) T protein:vir:42 84 LG--DA--DEADEELWQWWQ--ANNLD---IEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVEPPTRMHAEI 154 (486) T ss_pred cC--CC--chhHHHHHHHHH--hcChh---HHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEecccceEEEE Confidence 22 11 112344566653 35433 45567888899999999887665422110 01111110 Q ss_pred --CCccee----eec---CCceE-EEEeeec--------ccccc---eeeecc--ccccccccccc----ccccch---- Q lcl|NC_010576. 148 --RVGKIM----QFF---PRQVM-VRVWNDN--------TGLEQ---DLLVSK--ENCIIIESPFY----AILNDT---- 196 (447) Q Consensus 148 --~~~~~~----~~~---~~~~~-~~~~~~~--------~~~~~---~~~~~~--~~v~~~~~~~~----~~~~~~---- 196 (447) ...++. .++ .+.+. +.+|... .+.+. ...+.- =.|+++.+... 
.+.+.+ T Consensus 155 d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~~~n~~~~~~~~G~s~i~~~v 234 (486) T protein:vir:42 155 DPRINRVSKAIRVAYDKEGNEIQAATLYTPMETIGWFRADGEWAEWFNVPHGLGVVPVVPLPNRTRLSDLYGTSEITPEL 234 (486) T ss_pred eCCCCCeEEEEEEEEecCCCeEEEEEEEcCCcEEEEEecCCcEEeecceecCCCCceEEEeccccccCCCCCcccchhhH Confidence 111110 001 01111 1111110 01000 011110 01222322100 011111 Q ss_pred hHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecC-CCceeeecCCCh Q lcl|NC_010576. 197 NQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLD-TQEKFVSAGMGL 275 (447) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~-~g~~~~~l~~~~ 275 (447) .+....+...+.-+....++. +.+.-++. +... ++......+....| ....++++.++ ...+|.++.... T Consensus 235 ~~liDa~~~~~s~~~~~~e~~--a~p~~~i~-G~~~--~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~q~~~~~ 305 (486) T protein:vir:42 235 RSMTDAAARILMLMQATAELM--GVPQRLIF-GIKP--EEIGVDSETGQTLF----DAYLARILAFEDAEGKIQQFSAAE 305 (486) T ss_pred HHHHHHHHHHHHHHHHHHHhh--cchHHHhh-cCCc--cccccccccccchh----hhhhchhcccCCCCceEEeecccC Confidence 111222222222222222222 11211221 0011 10000000001111 11234555554 456776665444 Q ss_pred hhhhHHHHHHHHHHHHHHhCCCHHHhcCCc---HHHHHH---------------HHHHHHHhHHHHHHHHHHHhhcCChh Q lcl|NC_010576. 276 QNNLLSDVRQLQQDFYNQMGITEAILNGTA---NEQQTL---------------GYYNRCVDVLLQYVTDAISRIALTKT 337 (447) Q Consensus 276 ~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~---~e~~~~---------------~f~~~ti~P~~~~ie~~l~~kLl~~~ 337 (447) -..+++.++....++|..=++|++.++++. .....+ ..+...|.-+++.+....+..-.+. 
T Consensus 306 ~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~~- 384 (486) T protein:vir:42 306 LANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNLMFGGAWEEAMRIAYRIMKGGDVPP- 384 (486) T ss_pred HHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc- Confidence 445667778888889999999999997642 111111 2233344444333322221111110 Q ss_pred HhcCCceEEEecchhhhcCHHHHHHHHHHHHhC--CCcCHHHHHHHhCCCCCCCccccccccccccchhhcccccCCCCC Q lcl|NC_010576. 338 AVSQGQVLVYYRNPFKLVPVEQLATVADVLTRN--AIYTPNEIRELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQI 415 (447) Q Consensus 338 e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~--G~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 415 (447) ....+++.+..-...+..+.++++.++++. |+++..-+++.+|+-+-+-..-..+....-....+.- ....+.. T Consensus 385 ---d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~~s~et~~~~lg~~~d~~~e~~~~~~e~~~~~~~~~-~~~~~~~ 460 (486) T protein:vir:42 385 ---DMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARIDMGYSVKEREEMRRWDEEEAAMGLGLL-GTMVDAD 460 (486) T ss_pred ---cceeeeEEecCCCCCCHHHHHHHHHHHHhcccCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHH-HHhhcCC Confidence 113455555566677889999999999886 6788777788777654321100001000000000000 0000111 Q ss_pred CCCCCCcCCCCCCCcccccccCCccCc Q lcl|NC_010576. 416 TSDQPATASTDPLNNVSTSAIENGSLT 442 (447) Q Consensus 416 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 442 (447) ...+.....+. .+....++..+|+.. T Consensus 461 ~~~~~~~~~~~-~~~~~~~~~~~~~~~ 486 (486) T protein:vir:42 461 PTVPGSPSPTA-PPKPQPAIESSGGDA 486 (486) T ss_pred CCCCCCCCCCC-CCCCCcccCCCCCCC Confidence 11110000000 001111111222211 No 178 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=85.56 E-value=0.052 Score=27.63 Aligned_cols=394 Identities=8% Similarity=-0.062 Sum_probs=139.6 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 
1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) +....|+.+....+..+...-... ........+ ..+ .......-.|+..+.-+-.-|+. |. T Consensus 54 ~~~~~r~~~l~~YY~g~~~i~~~~-------------~~~~~~~~~---~~k--i~~n~~k~Ivd~~~~yl~g~p~~-~~ 114 (512) T protein:vir:97 54 DYQRPRLKVLSDYYEGKTKNLVEL-------------TRRKEEYMA---DNR--VAHDYASYISDFINGYFLGNPIQ-CQ 114 (512) T ss_pred HhhHHHHHHHHHHhcccCcccccc-------------CcccccccC---cce--eecchHHHHHHHHhhhhcccCce-ec Confidence 122222222222222221100000 000000000 001 11223445666666666666765 32 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceee------ Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQ------ 154 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~------ 154 (447) . ++. .....+..++.. |. .......+...++.+|.||+++..+..+.... -.+.|.....++. T Consensus 115 ~--~d~---~~~~~l~~~~~~--n~---~~~~~~~~~~~~~i~G~ay~~vy~ded~~~~i-~~~~p~~~~~iyd~~~~~~ 183 (512) T protein:vir:97 115 D--DDK---DVLEAIEAFNDL--ND---VESHNRSLGLDLSIYGKAYELMIRNQDDETRL-YKSDAMSTFVIYDNTIERN 183 (512) T ss_pred c--CCh---HHHHHHHHHHhh--cC---HHHHHHHHHHHHHhcCeEEEEEEeCCCCceEE-EEEcccceEEEEcCCCCCc Confidence 1 111 122345555432 32 23455667788889999999887765442211 1111111100000 Q ss_pred ------ecC---------Cce-EEEEeeecc--------cccc--------eeee--cccccccccccc--cccccchhH Q lcl|NC_010576. 155 ------FFP---------RQV-MVRVWNDNT--------GLEQ--------DLLV--SKENCIIIESPF--YAILNDTNQ 198 (447) Q Consensus 155 ------~~~---------~~~-~~~~~~~~~--------~~~~--------~~~~--~~~~v~~~~~~~--~~~~~~~~~ 198 (447) ++. ..+ .+.+|.... +... ...+ ..=.++++++.. .+....... 
T Consensus 184 ~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~gd~e~v~~ 263 (512) T protein:vir:97 184 SIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVIT 263 (512) T ss_pred eEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcccceEeecCCCCCCCchhhhHH Confidence 000 000 011111000 0000 0000 001133333211 111111222 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhh Q lcl|NC_010576. 199 TLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNN 278 (447) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~ 278 (447) ....+...++-..+...+.+ .+--++.-.....+...........-.+......+.....-.+.|.+++.+....... T Consensus 264 liDa~d~~~S~~~~~~~~~~--~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~ 341 (512) T protein:vir:97 264 LIDLYDNAESDTANYMSDLN--DAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQ 341 (512) T ss_pred HHHHHHHHHHHHHHHHHHhc--CceeeeecCccCCchhhhhhhhcccccccccchhhcccccCCCCCcceEEEeecCCHH Confidence 22222222222222222222 2222222111122211111111110000000000111112245666777776554443 Q ss_pred -hHHHHHHHHHHHHHHhCCCHHHh---cCCcHHH--------------HHHHHHHHHHhHHHHHHHHHHHhhcCChhHhc Q lcl|NC_010576. 279 -LLSDVRQLQQDFYNQMGITEAIL---NGTANEQ--------------QTLGYYNRCVDVLLQYVTDAISRIALTKTAVS 340 (447) Q Consensus 279 -~l~~~~~~~~~Ia~~fgVP~~~l---~g~~~e~--------------~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~ 340 (447) .....+.+.+.|+..-++|..-. +|+.+.. 
.....+...|.-.++.|...++.+--.....+ T Consensus 342 ~~e~~~~~L~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d 421 (512) T protein:vir:97 342 GTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKD 421 (512) T ss_pred HHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccc Confidence 33456777888888888886433 2332211 11224455555555555544432211100111 Q ss_pred CCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccc--ccc---ccchhhcccccCCCCC Q lcl|NC_010576. 341 QGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELF--NRN---IADGNQVGGINTPGQI 415 (447) Q Consensus 341 ~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~--~~~---~~~~~~~~~~~~~~~~ 415 (447) -..+++.++.-+-.|..+.++.+.++. |+++.--++++++. ++++. .++- ... ..+..+......+++. T Consensus 422 -~~~i~~~f~~~~p~~~~e~~~~~~kl~--giiS~et~~~~l~~--v~d~~-~E~eri~~E~~~~~~~~~~~~~~~~~~~ 495 (512) T protein:vir:97 422 -FNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSF--FQDPE-LEVKKIEEDEKESIKKAQKGIYKDPRDI 495 (512) T ss_pred -cccceEEeCCCCCcCHHHHHHHHHHHh--ccCchHHHHHhCCC--CCCHH-HHHHHHHHHHHHHHHHHhhcccCCCCCC Confidence 113555556666778889999988884 88998777777643 33321 1111 000 0111111111111111 Q ss_pred CCCCCCcCCCCCCCcccc Q lcl|NC_010576. 416 TSDQPATASTDPLNNVST 433 (447) Q Consensus 416 ~~~~~~~~~~~~~~~~~~ 433 (447) .++.++.++ .+.+.++. T Consensus 496 ~~~~~~~~~-~~~~~~~~ 512 (512) T protein:vir:97 496 NDDEQDDDT-KDTVDKKE 512 (512) T ss_pred CCCCCCCCc-cccccccC Confidence 111111111 11111111 No 179 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=83.63 E-value=0.067 Score=27.02 Aligned_cols=394 Identities=8% Similarity=-0.031 Sum_probs=144.4 Q ss_pred Cch---------------------------hHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchh Q lcl|NC_010576. 
1 MAS---------------------------SDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYS 53 (447) Q Consensus 1 Mg~---------------------------~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 53 (447) |-+ ..|+.+.+..++...........+.......+..+......... ...+ T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~k- 78 (474) T protein:vir:10 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVS-VNNK- 78 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccC-cccc- Confidence 111 11111111111110000000000000000000000000000000 0001 Q ss_pred hhhhHHHHHHHHHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeec Q lcl|NC_010576. 54 YNKADLIKSVITRIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDT 133 (447) Q Consensus 54 ~~~~~~v~~cv~~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~ 133 (447) +.......+|+..+.-+-.-|+.+ ....+....+.....+..++.. | ........+...++.+|.||.++..+ T Consensus 79 -i~~n~~~~ivd~~~~yl~g~pv~~-~~~~~~~~~e~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~~~~d 151 (474) T protein:vir:10 79 -LNNSFDSEIVDTRVGYLHGVPVTY-DLDENAEKNEKLKKFITNFAIR--N---SVDDEDSEIGKMAAICGYGARLAYID 151 (474) T ss_pred -cccchHHHHHHhHhhheeccceeE-eeCCCCcchHHHHHHHHHHHhh--c---CHhHHHHHHHHHHhhcCeEEEEEEeC Confidence 123345566777777666667763 2222111111111223333321 2 23346666778889999999887665 Q ss_pred cCCcccc-------eeeeccCCCccee--eec---C--Cce---EEEEeeec--------c-cccce---eeecc--ccc Q lcl|NC_010576. 134 TVDPDSG-------SFDINTARVGKIM--QFF---P--RQV---MVRVWNDN--------T-GLEQD---LLVSK--ENC 182 (447) Q Consensus 134 ~~~~~~~-------~~~~~~~~~~~~~--~~~---~--~~~---~~~~~~~~--------~-~~~~~---~~~~~--~~v 182 (447) ..+.... .+++......... .++ . +.. .+.+|... . +.... 
..++- =.+ T Consensus 152 ~~~~~~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv 231 (474) T protein:vir:10 152 TNGDIRIKNIDPYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYNPL 231 (474) T ss_pred CCCeeEEEEEcccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCccce Confidence 5442211 1111111110000 000 0 000 01111100 0 00000 00000 112 Q ss_pred ccccccccc--cccchhHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCccee Q lcl|NC_010576. 183 IIIESPFYA--ILNDTNQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVA 260 (447) Q Consensus 183 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 260 (447) +++++...+ ...........+...++-..+...+. +.+--+++ +..+.++... .++ ..+.+. T Consensus 232 v~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~--~~~~l~i~-g~~~~~~~~~----~~~---------~~~~i~ 295 (474) T protein:vir:10 232 FGVPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQT--RLAYLVLR-GMGMSEEMIQ----ETQ---------KSGAFE 295 (474) T ss_pred EEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHh--hcchhhhc-cCCCCchhhh----hhh---------hcceeE Confidence 333322111 11111122222222222222222222 22222222 1122221111 110 123455 Q ss_pred ecCCCceeeecCCChhh-hhHHHHHHHHHHHHHHhCCCHHHh---cCCcHH--------------HHHHHHHHHHHhHHH Q lcl|NC_010576. 261 TLDTQEKFVSAGMGLQN-NLLSDVRQLQQDFYNQMGITEAIL---NGTANE--------------QQTLGYYNRCVDVLL 322 (447) Q Consensus 261 vl~~g~~~~~l~~~~~~-~~l~~~~~~~~~Ia~~fgVP~~~l---~g~~~e--------------~~~~~f~~~ti~P~~ 322 (447) +.+.+.+++-+...... ......+.+.+.|...-++|..-. +|+.+. 
......+...|.-.+ T Consensus 296 ~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~ 375 (474) T protein:vir:10 296 LFDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQF 375 (474) T ss_pred ecCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 56777777777665443 344557778889999889886433 232221 111235566666666 Q ss_pred HHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcccccccc--ccc Q lcl|NC_010576. 323 QYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFN--RNI 400 (447) Q Consensus 323 ~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~--~~~ 400 (447) +.|...++.+-....+. ....+++.+..-+-.|.++.++.+.++. |+++.--+.++++.- +++. .++-. ..- T Consensus 376 ~li~~~l~~~~~~~~~~-~~~~i~~~f~~~~p~d~~e~a~~~~kl~--g~iS~et~~~~l~~v--~d~~-~E~eri~~E~ 449 (474) T protein:vir:10 376 KVILSALKRKGYNLDDD-SYLNLIFKFTRNIPVNKLEESQVLINLK--GQVSERTRLGQSQLV--DDVD-YELDEMEKES 449 (474) T ss_pred HHHHHHHhhccCCCCcc-ccccceEEeCCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCC--CCHH-HHHHHHHHHH Confidence 66666655432211111 1124666667777789999999998884 889988888886543 2321 11110 000 Q ss_pred cchhhcccccCCCCCCCCCCCcCCCCCCCcccc Q lcl|NC_010576. 
401 ADGNQVGGINTPGQITSDQPATASTDPLNNVST 433 (447) Q Consensus 401 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (447) ....+.-....+++..+ .+.+++++ T Consensus 450 ~e~~~~~~~~~~~~~~~--------~~~~~~s~ 474 (474) T protein:vir:10 450 LEFNDKLPDIDEGDAND--------KSQNNQSE 474 (474) T ss_pred HHHHhhcccccCCCcCC--------CCccccCC Confidence 00000000000011111 11111111 No 180 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=83.63 E-value=0.067 Score=27.02 Aligned_cols=394 Identities=8% Similarity=-0.031 Sum_probs=144.4 Q ss_pred Cch---------------------------hHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchh Q lcl|NC_010576. 1 MAS---------------------------SDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYS 53 (447) Q Consensus 1 Mg~---------------------------~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 53 (447) |-+ ..|+.+.+..++...........+.......+..+......... ...+ T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~k- 78 (474) T protein:vir:94 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVS-VNNK- 78 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccC-cccc- Confidence 111 11111111111110000000000000000000000000000000 0001 Q ss_pred hhhhHHHHHHHHHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeec Q lcl|NC_010576. 54 YNKADLIKSVITRIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDT 133 (447) Q Consensus 54 ~~~~~~v~~cv~~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~ 133 (447) +.......+|+..+.-+-.-|+.+ ....+....+.....+..++.. 
| ........+...++.+|.||.++..+ T Consensus 79 -i~~n~~~~ivd~~~~yl~g~pv~~-~~~~~~~~~e~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~~~~d 151 (474) T protein:vir:94 79 -LNNSFDSEIVDTRVGYLHGVPVTY-DLDENAEKNEKLKKFITNFAIR--N---SVDDEDSEIGKMAAICGYGARLAYID 151 (474) T ss_pred -cccchHHHHHHhHhhheeccceeE-eeCCCCcchHHHHHHHHHHHhh--c---CHhHHHHHHHHHHhhcCeEEEEEEeC Confidence 123345566777777666667763 2222111111111223333321 2 23346666778889999999887665 Q ss_pred cCCcccc-------eeeeccCCCccee--eec---C--Cce---EEEEeeec--------c-cccce---eeecc--ccc Q lcl|NC_010576. 134 TVDPDSG-------SFDINTARVGKIM--QFF---P--RQV---MVRVWNDN--------T-GLEQD---LLVSK--ENC 182 (447) Q Consensus 134 ~~~~~~~-------~~~~~~~~~~~~~--~~~---~--~~~---~~~~~~~~--------~-~~~~~---~~~~~--~~v 182 (447) ..+.... .+++......... .++ . +.. .+.+|... . +.... ..++- =.+ T Consensus 152 ~~~~~~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv 231 (474) T protein:vir:94 152 TNGDIRIKNIDPYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYNPL 231 (474) T ss_pred CCCeeEEEEEcccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCccce Confidence 5442211 1111111110000 000 0 000 01111100 0 00000 00000 112 Q ss_pred ccccccccc--cccchhHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCccee Q lcl|NC_010576. 183 IIIESPFYA--ILNDTNQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVA 260 (447) Q Consensus 183 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 260 (447) +++++...+ ...........+...++-..+...+. +.+--+++ +..+.++... .++ ..+.+. 
T Consensus 232 v~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~--~~~~l~i~-g~~~~~~~~~----~~~---------~~~~i~ 295 (474) T protein:vir:94 232 FGVPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQT--RLAYLVLR-GMGMSEEMIQ----ETQ---------KSGAFE 295 (474) T ss_pred EEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHh--hcchhhhc-cCCCCchhhh----hhh---------hcceeE Confidence 333322111 11111122222222222222222222 22222222 1122221111 110 123455 Q ss_pred ecCCCceeeecCCChhh-hhHHHHHHHHHHHHHHhCCCHHHh---cCCcHH--------------HHHHHHHHHHHhHHH Q lcl|NC_010576. 261 TLDTQEKFVSAGMGLQN-NLLSDVRQLQQDFYNQMGITEAIL---NGTANE--------------QQTLGYYNRCVDVLL 322 (447) Q Consensus 261 vl~~g~~~~~l~~~~~~-~~l~~~~~~~~~Ia~~fgVP~~~l---~g~~~e--------------~~~~~f~~~ti~P~~ 322 (447) +.+.+.+++-+...... ......+.+.+.|...-++|..-. +|+.+. ......+...|.-.+ T Consensus 296 ~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~ 375 (474) T protein:vir:94 296 LFDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQF 375 (474) T ss_pred ecCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 56777777777665443 344557778889999889886433 232221 111235566666666 Q ss_pred HHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcccccccc--ccc Q lcl|NC_010576. 323 QYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFN--RNI 400 (447) Q Consensus 323 ~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~--~~~ 400 (447) +.|...++.+-....+. ....+++.+..-+-.|.++.++.+.++. |+++.--+.++++.- +++. .++-. ..- T Consensus 376 ~li~~~l~~~~~~~~~~-~~~~i~~~f~~~~p~d~~e~a~~~~kl~--g~iS~et~~~~l~~v--~d~~-~E~eri~~E~ 449 (474) T protein:vir:94 376 KVILSALKRKGYNLDDD-SYLNLIFKFTRNIPVNKLEESQVLINLK--GQVSERTRLGQSQLV--DDVD-YELDEMEKES 449 (474) T ss_pred HHHHHHHhhccCCCCcc-ccccceEEeCCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCC--CCHH-HHHHHHHHHH Confidence 66666655432211111 1124666667777789999999998884 889988888886543 2321 11110 000 Q ss_pred cchhhcccccCCCCCCCCCCCcCCCCCCCcccc Q lcl|NC_010576. 
401 ADGNQVGGINTPGQITSDQPATASTDPLNNVST 433 (447) Q Consensus 401 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (447) ....+.-....+++..+ .+.+++++ T Consensus 450 ~e~~~~~~~~~~~~~~~--------~~~~~~s~ 474 (474) T protein:vir:94 450 LEFNDKLPDIDEGDAND--------KSQNNQSE 474 (474) T ss_pred HHHHhhcccccCCCcCC--------CCccccCC Confidence 00000000000011111 11111111 No 181 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=81.48 E-value=0.085 Score=26.44 Aligned_cols=399 Identities=10% Similarity=-0.015 Sum_probs=143.2 Q ss_pred Cch-----------------------hHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhh Q lcl|NC_010576. 1 MAS-----------------------SDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKA 57 (447) Q Consensus 1 Mg~-----------------------~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 57 (447) |.. ..||.+....+..+... . ... ........ ...+ ... T Consensus 31 ~~~~~~~~~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~---i---------~~~-~~~~~~~~---~~~k--i~~ 92 (502) T protein:vir:48 31 ADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHD---V---------LKS-GRRKDNEM---ADKR--AVH 92 (502) T ss_pred ccchhhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc---c---------ccc-cccccccc---ccce--eec Confidence 111 12222222222221000 0 000 00000000 0001 112 Q ss_pred HHHHHHHHHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCc Q lcl|NC_010576. 58 DLIKSVITRIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDP 137 (447) Q Consensus 58 ~~v~~cv~~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~ 137 (447) ......|+.++.-+-.-|+.+- ... +. .+..+...|+ +-...-........+...++.+|.||+++..+..+. 
T Consensus 93 n~~k~Ivd~~~~yl~g~p~~~~-~~d-~~----~~~~~~~~l~-~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~dedg~ 165 (502) T protein:vir:48 93 NYGRMISKFKTGYLAGNPIRVE-YDD-NE----DNSQNDDAIK-RIGRINDIDTHNRNLIRDLSQTGRAYEVIYRSEYDE 165 (502) T ss_pred chHHHHHHHHhhhhcccCeeEe-cCC-cc----chhHHHHHHH-HHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCc Confidence 3445566777777777777532 211 11 1233433332 122222334577778889999999999887765443 Q ss_pred ccceeeeccCCCcceee------------ecC-----Cc-eEEEEeeecc-------cccce---eeecc--cccccccc Q lcl|NC_010576. 138 DSGSFDINTARVGKIMQ------------FFP-----RQ-VMVRVWNDNT-------GLEQD---LLVSK--ENCIIIES 187 (447) Q Consensus 138 ~~~~~~~~~~~~~~~~~------------~~~-----~~-~~~~~~~~~~-------~~~~~---~~~~~--~~v~~~~~ 187 (447) .... .+.|..+..++. ++. +. ..+.+|.... +.... ..++- =.++++++ T Consensus 166 ~~i~-~~~p~~~~~vydd~~~~~~~~~ir~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~g~vPvv~~~n 244 (502) T protein:vir:48 166 TRIK-RLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNQHIYTLDASDSFNEISVTPHAFGTVPITEFLN 244 (502) T ss_pred eEEE-EEcccceEEEEcCCCCCceEEEEEEEEEeecCCcEEEEEEEeCCeEEEEEeCCceeeccceecCCCccceEEecC Confidence 2111 011111101110 000 00 0111111100 00000 00000 01222222 Q ss_pred ccc--ccccchhHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCC Q lcl|NC_010576. 188 PFY--AILNDTNQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQ 265 (447) Q Consensus 188 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g 265 (447) ... +......+.+..+...++...+...+.+ .+--++.-......+ +....+++.. 
.......+.....+.+ T Consensus 245 n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~--~~~lv~~g~~~~~~~---~~~~~~~~~~-~~~~~~~~~~~~~~~~ 318 (502) T protein:vir:48 245 NADGIGDYETELYLIDLYDSAESDTANHMSDMA--DAILAIYGDLALPQG---MQASDMKRTR-LMQLKPPKSADGKEGT 318 (502) T ss_pred CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhc--CceeeeecCcccccc---cchhhhhhcc-eeeccccccccccccC Confidence 111 1111112222222222332322222222 222222211111111 1111111100 0000000001112345 Q ss_pred ceeeecCCChhhhhH-HHHHHHHHHHHHHhCCCHHHh---cCCcHHH--------------HHHHHHHHHHhHHHHHHHH Q lcl|NC_010576. 266 EKFVSAGMGLQNNLL-SDVRQLQQDFYNQMGITEAIL---NGTANEQ--------------QTLGYYNRCVDVLLQYVTD 327 (447) Q Consensus 266 ~~~~~l~~~~~~~~l-~~~~~~~~~Ia~~fgVP~~~l---~g~~~e~--------------~~~~f~~~ti~P~~~~ie~ 327 (447) .+++.++.......+ ...+.+.+.|+..=++|+.-. +|+.+.. .....+...|.-.++.+.. T Consensus 319 ~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~ 398 (502) T protein:vir:48 319 VKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNHFSGNASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAAR 398 (502) T ss_pred cceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 566666555444433 446788899999999987543 2322211 1123455666666666555 Q ss_pred HHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccccccccchhhcc Q lcl|NC_010576. 328 AISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFNRNIADGNQVG 407 (447) Q Consensus 328 ~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~ 407 (447) .++.+--.. ..+ ...+++.+..-+..|.++.++++.++. |+++..-+.+++++ ++++. .++....- ...+.. 
T Consensus 399 ~~~~~~~~~-~~d-~~~i~i~f~~~~p~d~~e~a~~~~kl~--g~iS~et~l~~l~~--v~D~~-~E~~ri~~-E~~~~~ 470 (502) T protein:vir:48 399 IGSLVNEFK-DFD-ESRLKITFTPNLPKSLYEQVSILNDLG--GQVSQETALSLSGL--VENPT-EELDKINE-ESSKID 470 (502) T ss_pred HHhhccccc-ccc-cccceEEeCCCCCcCHHHHHHHHHHHh--ccCcHHHHHHhCCC--CCCHH-HHHHHHHH-HHHhhh Confidence 554332111 111 134566667778889999999998884 78998778787654 33321 11111000 000000 Q ss_pred cccCCCCCCCCCCCcCCCCCCCcccccccCCccCcC Q lcl|NC_010576. 408 GINTPGQITSDQPATASTDPLNNVSTSAIENGSLTD 443 (447) Q Consensus 408 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 443 (447) .........+ ....++++. +.+.+++..-.++ T Consensus 471 ~~~~~~~~~~--~~~~~~d~~--~e~~~~~~~~~~~ 502 (502) T protein:vir:48 471 FKGYPSYFYD--NVGKYTDEV--KETHTDDFERVYE 502 (502) T ss_pred hhcccccccc--cccccCCCc--cCCCCcCcCCCCC Confidence 0000000000 000011100 0111111111111 No 182 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=79.27 E-value=0.11 Score=25.93 Aligned_cols=393 Identities=8% Similarity=-0.069 Sum_probs=142.1 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) +.-..|+.+....+..+...-... . .......+ ..+ .......-.|+..+.-+-.-|+. |. 
T Consensus 54 ~~~~~r~~~l~~Yy~g~~~i~~~~-----------~--~~~~~~~~---~~k--i~~n~~k~Iv~~~~~yl~g~p~~-~~ 114 (511) T protein:vir:96 54 DYQRPRLKVLSDYYEGKTKNLVEL-----------T--RRKEEYMA---DNR--VAHDYASYISDFINGYFLGNPIQ-YQ 114 (511) T ss_pred HhhHHHHHHHHHHhcccCcccccc-----------C--cCcccccC---cce--eecchHHHHHHHHHhhhccCCce-ee Confidence 111222222222222211100000 0 00000000 001 11223445566666666666765 32 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeec-CCc Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFF-PRQ 159 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 159 (447) . ++. .....+..++.. |. .......+...++.+|.||+++-.+..+.+... ++.|..+..++... ... T Consensus 115 ~--~~~---~~~~~l~~~~~~--n~---~~~~~~~~~~~~~i~G~a~~~vy~ded~~~~i~-~~~p~~~~~vydd~~~~~ 183 (511) T protein:vir:96 115 D--DDK---DVLEAIEAFNDL--ND---VESHNRSLGLDLSIYGKAYELMIRNQDDETRLY-KSDAMSTFVIYDNTIERN 183 (511) T ss_pred c--Cch---HHHHHHHHHHhh--cC---HHHHHHHHHHHHHhcCeeEEEEEeCCCCceEEE-EEccceeEEEEcCCCCCc Confidence 1 111 123344555432 32 334556677888999999998777654432111 11111110000000 000 Q ss_pred e---------------------EEEEeeecc--------cccce--------eee--cccccccccccc--cccccchhH Q lcl|NC_010576. 160 V---------------------MVRVWNDNT--------GLEQD--------LLV--SKENCIIIESPF--YAILNDTNQ 198 (447) Q Consensus 160 ~---------------------~~~~~~~~~--------~~~~~--------~~~--~~~~v~~~~~~~--~~~~~~~~~ 198 (447) . .+.+|.... +.+.. ..+ ..=.++++++.. .+....... T Consensus 184 ~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~gd~e~v~~ 263 (511) T protein:vir:96 184 SIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVIT 263 (511) T ss_pred eEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCCceeeEEecCCCCCCCchhhhHH Confidence 0 011111000 00000 000 001123333211 111112222 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhh Q lcl|NC_010576. 
199 TLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNN 278 (447) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~ 278 (447) .+..+...++...+...+. +.+--+++-.............+............. +...-.+.+.+++-|+...... T Consensus 264 liDa~d~~~S~~~~~~~~~--~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~l~~~~~~~ 340 (511) T protein:vir:96 264 LIDLYDNAESDTANYMSDL--NDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYAD-SEGRETEGSVDGGYIYKQYDVQ 340 (511) T ss_pred HHHHHHHHHHHHHHHHHHh--hCceeeeecCccCCchhhcccccccceecccccccc-cccccCCCCcceeEEeecCCHH Confidence 2222222222222222222 222222221111222111111100000000000001 1111234566666666554443 Q ss_pred -hHHHHHHHHHHHHHHhCCCHHHh---cCCcHHH--------------HHHHHHHHHHhHHHHHHHHHHHhhcCChhHhc Q lcl|NC_010576. 279 -LLSDVRQLQQDFYNQMGITEAIL---NGTANEQ--------------QTLGYYNRCVDVLLQYVTDAISRIALTKTAVS 340 (447) Q Consensus 279 -~l~~~~~~~~~Ia~~fgVP~~~l---~g~~~e~--------------~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~ 340 (447) .....+.+.+.|+..-++|..-. +|+.+.. ....++...|.-.++.|...++.+--.....+ T Consensus 341 ~~e~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~d 420 (511) T protein:vir:96 341 GTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKD 420 (511) T ss_pred HHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccc Confidence 34556778888999888886433 2322211 11234556666666666655543321111111 Q ss_pred CCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccc--cc---cccchhhcccccCCCCC Q lcl|NC_010576. 341 QGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELF--NR---NIADGNQVGGINTPGQI 415 (447) Q Consensus 341 ~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~--~~---~~~~~~~~~~~~~~~~~ 415 (447) -..+++.++.-+-.|.++.++.+.++ .|+++.-.+.+++++- +++. .++- .. ...+..+......+.+. 
T Consensus 421 -~~~i~~~f~~~~p~n~~e~~~~~~kl--~G~iS~et~l~~l~~v--~D~~-~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 494 (511) T protein:vir:96 421 -FNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFF--QDPE-LEVKKIEEDEKESIKKAQKGIYKDPRDI 494 (511) T ss_pred -cccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHH-HHHHHHHHHHHHHHHHHhhccccCCCCC Confidence 12355556677778899999998887 5899998888876543 3321 1111 00 00111111111111111 Q ss_pred CCCCCCcCCCCCCCccc Q lcl|NC_010576. 416 TSDQPATASTDPLNNVS 432 (447) Q Consensus 416 ~~~~~~~~~~~~~~~~~ 432 (447) .++.+..++.+....++ T Consensus 495 ~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:96 495 NDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCCCcccccccccC Confidence 11111111111111111 No 183 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=77.60 E-value=0.12 Score=25.57 Aligned_cols=391 Identities=6% Similarity=0.035 Sum_probs=141.2 Q ss_pred CchhHh---------------------------hhhhcccccCCccccccccccccccccccccccccccCCcccccchh Q lcl|NC_010576. 1 MASSDR---------------------------LLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYS 53 (447) Q Consensus 1 Mg~~~~---------------------------l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 53 (447) |.+... +.+....+..+..-....... ....+...... .....+ T Consensus 13 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~YY~g~~~i~~~~~~~-------~~~~~~~~~~~-~~~~~r- 83 (503) T protein:vir:59 13 EELNEIIVESAKEIAEPDTTMIQKLIDEHNPEPLLKGVRYYMCENDIEKKRRTY-------YDAAGQQLVDD-TKTNNR- 83 (503) T ss_pred HhHHHhhhhhhhhccchhHHHHHHHHHhhcHHHHHHHHHHhccccchhhccchh-------ccccccccccc-ccccce- Confidence 222111 111111221111000000000 00000000000 000011 Q ss_pred hhhhHHHHHHHHHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeec Q lcl|NC_010576. 
54 YNKADLIKSVITRIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDT 133 (447) Q Consensus 54 ~~~~~~v~~cv~~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~ 133 (447) +.......+|+..+.-+-.-|+. |. .++. .....+..++ . | ........+....+.+|.+|+++..+ T Consensus 84 -i~~n~~~~ivd~~~~yl~g~~~~-~~--~~d~---~~~~~l~~~~--~-n---~~~~~~~~~~~~~~~~G~~~~~v~~d 150 (503) T protein:vir:59 84 -TSHAWHKLFVDQKTQYLVGEPVT-FT--SDNK---TLLEYVNELA--D-D---DFDDILNETVKNMSNKGIEYWHPFVD 150 (503) T ss_pred -eecchHHHHHHHHHhhhhcCCee-ec--cCcH---HHHHHHHHHH--h-c---CHHHHHHHHHHHHhhCCeEEEEEeec Confidence 12334566777777777766765 32 1111 1112232322 1 3 23345566778888999999988776 Q ss_pred cCCcccceeeeccCCC----------ccee---eecC---C-c--e-EEEEeeecc-------cccce------------ Q lcl|NC_010576. 134 TVDPDSGSFDINTARV----------GKIM---QFFP---R-Q--V-MVRVWNDNT-------GLEQD------------ 174 (447) Q Consensus 134 ~~~~~~~~~~~~~~~~----------~~~~---~~~~---~-~--~-~~~~~~~~~-------~~~~~------------ 174 (447) ..+... +.+..+.. .++. .++. . . . .+.+|.... ..... T Consensus 151 ~dg~~~--i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 228 (503) T protein:vir:59 151 EEGEFD--YVIFPAEEMIVVYKDNTRRDILFALRYYSYKGIMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRP 228 (503) T ss_pred CCCceE--EEEEccceeEEEEeCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEEcCCccccccccccccccc Confidence 554321 11111111 1100 0000 0 0 0 011111000 00000 Q ss_pred ------ee--eccccccccccccc--ccccchhHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHH Q lcl|NC_010576. 175 ------LL--VSKENCIIIESPFY--AILNDTNQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARR 244 (447) Q Consensus 175 ------~~--~~~~~v~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~ 244 (447) .. +..-.++.+++... 
+...........+...++...+...+ .+.+-.+++ +.-..+ .++ + T Consensus 229 ~~~~~~~~~~~~~vPiv~~~nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~--~~~~~~v~~--g~~~~~-~~~----~ 299 (503) T protein:vir:59 229 HMTKGGQAIGWGRVPIIPFKNNEEMVSDLKFYKDLIDNYDSITSSTMDSFSD--FQQIVYVLK--NYDGEN-PKE----F 299 (503) T ss_pred ceeecceeccCCccceEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHH--hcCCeeEee--cCCccc-cch----h Confidence 00 00001122221111 11111111222222222222222222 223322332 211111 111 1 Q ss_pred HHHHHHHhccCCcceeecCCCceeeecCCChhhh-hHHHHHHHHHHHHHHhCCC---HHHhcCCcHH------------- Q lcl|NC_010576. 245 KQEIENEMANNKYGVATLDTQEKFVSAGMGLQNN-LLSDVRQLQQDFYNQMGIT---EAILNGTANE------------- 307 (447) Q Consensus 245 ~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~-~l~~~~~~~~~Ia~~fgVP---~~~l~g~~~e------------- 307 (447) ...+ ...+++.++++.+++.+....... .....+.+.+.|+..-++| +..++|+.+. T Consensus 300 ~~~~------~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k 373 (503) T protein:vir:59 300 TANL------RYHSVIKVSGDGGVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSPETIGGGATGPALENLYALLDLK 373 (503) T ss_pred hhhh------hcccceeccCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCcccccccccHHHHHHHHHHHHHH Confidence 1111 123455666666666555443332 2334555555565555554 4444443221 Q ss_pred -HHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Q lcl|NC_010576. 308 -QQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAP 386 (447) Q Consensus 308 -~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p 386 (447) +.....+...|.-++..|...++..--. .......|.+.+..-+..|.++.++.+.+++.+|+++.-.+.++++. 
T Consensus 374 ~~~~~~~~~~~l~~~~~~i~~~~~~~~~~--~~~~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~GiiS~et~l~~l~~-- 449 (503) T protein:vir:59 374 ANMAERKIRAGLRLFFWFFAEYLRNTGKG--DFNPDKELTMTFTRTRIQNDSEIVQSLVQGVTGGIMSKETAVARNPF-- 449 (503) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCc--ccccccceeEEeCCCCCCCHHHHHHHHHHHHhCCCCchHHHHHhCCC-- Confidence 1122345666666666666555432211 11222346666678888899999999999999999999888888644 Q ss_pred CCCcccccccc--ccccc-hhhcccc--cCCCCCCCC-CCCcCCCCCCCcccccc Q lcl|NC_010576. 387 HPNPLANELFN--RNIAD-GNQVGGI--NTPGQITSD-QPATASTDPLNNVSTSA 435 (447) Q Consensus 387 ~~g~~~~~~~~--~~~~~-~~~~~~~--~~~~~~~~~-~~~~~~~~~~~~~~~~~ 435 (447) ++++.. ++-. ..... ..+.... ...+...++ .++..+.+..+.++.++ T Consensus 450 v~d~~~-E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 503 (503) T protein:vir:59 450 VQDPEE-ELARIEEEMNQYAEMQGNLLDDEGGDDDLEEDDPNAGAAESGGAGQVS 503 (503) T ss_pred CCCHHH-HHHHHHHHHHHHHhhhccccCccCCCCCCCcCCCCCCcccCCCCCCcC Confidence 333211 1110 00000 0000000 001111111 11111111111111111 No 184 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=77.24 E-value=0.13 Score=25.50 Aligned_cols=393 Identities=8% Similarity=-0.070 Sum_probs=142.2 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) +.-..|+.+....+..+...-.... . ......+ ..+ .......-.|+..+.-+-.-|+. +. 
T Consensus 54 ~~~~~r~~~l~~Yy~g~~~il~~~~--------~-----~~~~~~~---~~k--i~~n~~k~Iv~~~~~yl~g~p~~-~~ 114 (511) T protein:vir:93 54 DYQRPRLKVLSDYYEGKTKNLVELT--------R-----RKEEYMA---DNR--VAHDYASYISDFINGYFLGNPIQ-YQ 114 (511) T ss_pred HhhHHHHHHHHHHhcccCccccccC--------c-----CcccccC---cce--eecchHHHHHHHHhhhhcccCee-ec Confidence 1222223222223322211000000 0 0000000 001 11223445566666666566765 32 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeeec-CCc Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQFF-PRQ 159 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 159 (447) .++ +.....+..++. =|. .......+...++.+|.||+++..+..+..... .+.|..+..++... .+. T Consensus 115 --~~d---~~~~~~l~~~~~--~n~---~~~~~~~~~~~~~~~G~ay~~vy~de~~~~~i~-~~~p~~~~~vydd~~~~~ 183 (511) T protein:vir:93 115 --DDD---KDVLEVIEAFND--LND---VESHNRSLGLDLSIYGKAYELMIRNQDDETRLY-KSDAMSTFVIYDNTIERN 183 (511) T ss_pred --cCC---hHHHHHHHHHHh--hcC---HhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEE-EEccceeEEEEcCCCCCc Confidence 111 112233444443 232 334666777889999999998877654432111 11111110010000 000 Q ss_pred e---------------------EEEEeeecc--------cccc--------eeee--cccccccccccc--cccccchhH Q lcl|NC_010576. 160 V---------------------MVRVWNDNT--------GLEQ--------DLLV--SKENCIIIESPF--YAILNDTNQ 198 (447) Q Consensus 160 ~---------------------~~~~~~~~~--------~~~~--------~~~~--~~~~v~~~~~~~--~~~~~~~~~ 198 (447) . .+.+|.... +... ...+ ..=.++++++.. .+......+ T Consensus 184 ~~~~vr~~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~gd~e~v~~ 263 (511) T protein:vir:93 184 SIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVIT 263 (511) T ss_pred eEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccccccccccccCCCccceEEecCCCCCCCchhhHHH Confidence 0 011111100 0000 0000 001123333211 111111222 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhh Q lcl|NC_010576. 
199 TLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNN 278 (447) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~ 278 (447) ....+...++...+...+.. .+--++.-......+..++..+............. +...-.+.+.+++.++...... T Consensus 264 liDa~d~~~S~~~~~~~~~~--~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~l~~~~~~~ 340 (511) T protein:vir:93 264 LIDLYDNAESDTANYMSDLN--DAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYAD-SEGRETEGSVDGGYIYKQYDVQ 340 (511) T ss_pred HHHHHHHHHHHHHHHHHHhh--CcceeeecCcccCchhhcccccccceecccccccc-cccccCCCCcceeEEeecCCHH Confidence 22222222222222222222 22222221111121111111100000000000000 1111245566777666554443 Q ss_pred -hHHHHHHHHHHHHHHhCCCHHHh---cCCcHH--------------HHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhc Q lcl|NC_010576. 279 -LLSDVRQLQQDFYNQMGITEAIL---NGTANE--------------QQTLGYYNRCVDVLLQYVTDAISRIALTKTAVS 340 (447) Q Consensus 279 -~l~~~~~~~~~Ia~~fgVP~~~l---~g~~~e--------------~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~ 340 (447) .....+.+.+.|+..-++|..-. +|+.+. .....++...|...++.|...++.+--..... T Consensus 341 ~~~~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~~~- 419 (511) T protein:vir:93 341 GTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANK- 419 (511) T ss_pred HHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccc- Confidence 33456778888888888886432 232221 11123556666666666665554332111101 Q ss_pred CCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccc--ccc---ccchhhcccccCCCCC Q lcl|NC_010576. 341 QGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELF--NRN---IADGNQVGGINTPGQI 415 (447) Q Consensus 341 ~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~--~~~---~~~~~~~~~~~~~~~~ 415 (447) .-..+++.++.-+-.|.++.++.+.++ .|+++.-.+++++++ ++++. .++- ... .....+......+++. 
T Consensus 420 d~~~i~~~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~-~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 494 (511) T protein:vir:93 420 DFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSF--FQDPE-LEVKKIEEDEKESIKKAQKGIYKDPRDI 494 (511) T ss_pred ccccceEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCC--CCCHH-HHHHHHHHHHHHHHHHHhhhcccCCCCC Confidence 112356666677778899999998888 488998778777643 33321 1111 000 0111111111111111 Q ss_pred CCCCCCcCCCCCCCccc Q lcl|NC_010576. 416 TSDQPATASTDPLNNVS 432 (447) Q Consensus 416 ~~~~~~~~~~~~~~~~~ 432 (447) .++.+..++..+...|+ T Consensus 495 ~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:93 495 NDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCCCcccccccccC Confidence 11111111111111112 No 185 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=75.29 E-value=0.15 Score=25.13 Aligned_cols=403 Identities=9% Similarity=0.091 Sum_probs=138.9 Q ss_pred hhHhh--hhhcccccCCcccc--ccccccc-ccc--cccc----------ccccccccCCcc-----------cccchhh Q lcl|NC_010576. 3 SSDRL--LHSWNAFQSNQNQN--QNTNDFL-TPS--NGMT----------SFGGYYGRGQSN-----------YSRSYSY 54 (447) Q Consensus 3 ~~~~l--~~~~~~f~~~~~~~--~~~~~~~-~~~--~~~~----------~~~~~~~~~~~~-----------~~~~~~~ 54 (447) .|++| +++|.-+..++-++ .+.+.+. .|. .+.. +..|++...... +..=+.. T Consensus 1 ~~~~l~~~~~~~~~d~~~~~e~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~~~g~~~~~~~~e~~~~~~~eLI~~YR~m 80 (521) T protein:vir:65 1 MFSRLKMLARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNSPASSWNSLTQQFYSTDQKISTTKQLVNTYRGL 80 (521) T ss_pred CccchhhhhhccCchhhHHHhhhccCCCcccCCCCCCCceeecccCCccccccccceeeeccccchhhhHHHHHHHHHHH Confidence 56653 56665444332111 1111111 110 0100 011111000000 0011234 Q ss_pred hhhHHHHHHHHHHHHhhccC-----ceEEEEEcC--CCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCee Q lcl|NC_010576. 
55 NKADLIKSVITRIALDASMV-----DFKHLKIDP--ISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIA 127 (447) Q Consensus 55 ~~~~~v~~cv~~ia~~ia~l-----p~~~~r~~~--~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~ 127 (447) +.+|-|.+||+-|.+++.-. |+.+-=.+. ....++.......++|+. -|-...+++ ++..+...|-.| T Consensus 81 a~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~l-l~F~~~~~~----~fR~WYVDgRi~ 155 (521) T protein:vir:65 81 MNNHEVENAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNT-IQFDRRGQD----MFRRWYVDSRIF 155 (521) T ss_pred hhccchhhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHH-hccchhhhH----HHhhhhhcceeE Confidence 56788999999998887533 222211111 001111122223344432 122233333 445566788888 Q ss_pred EEEeec--cCCcccceeeeccCCCcceeee--------------------cCCceEEEEeeecccccceeeecccccccc Q lcl|NC_010576. 128 MVPIDT--TVDPDSGSFDINTARVGKIMQF--------------------FPRQVMVRVWNDNTGLEQDLLVSKENCIII 185 (447) Q Consensus 128 i~~~~~--~~~~~~~~~~~~~~~~~~~~~~--------------------~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ 185 (447) ..+.-+ ....+.....+.|..+..+... .++...+......+.....+.++.+-|.+. T Consensus 156 fhkiid~~pk~GI~ELr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI~y~ 235 (521) T protein:vir:65 156 FHKIIGKNPKDGIVELRQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAITYA 235 (521) T ss_pred EEEEEcCCccccceeeeeeCCcceeeeeeecccccCCcceecceeeeeeeecCCcceeccceeecCCcceeechhheeee Confidence 766533 2233333333333322222110 011111111111122233445555555544 Q ss_pred cccccccccc-hhHHHHHHHHHH---HHHHHH---HHHhhcCccc-ceeeeCCcCChHHHHHHHHHHHHHHHHHhcc--- Q lcl|NC_010576. 186 ESPFYAILND-TNQTLRMLEQKI---KLMNSQ---DNRASSGKLN-GFIQFPYSTKSTARAAQAARRKQEIENEMAN--- 254 (447) Q Consensus 186 ~~~~~~~~~~-~~~~~~~~~~~~---~~~~~~---~~~~n~~~~~-gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 254 (447) .+.+...... .-+-+..+...+ ..+..+ ....+.---+ +.|.++. +. 
+..+++....+...+++ T Consensus 236 hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGn-lP----k~KAeqYl~~im~k~kNklv 310 (521) T protein:vir:65 236 HSGLMDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGN-MN----NRKAAQHMNSVAQSFKNRVV 310 (521) T ss_pred eccceeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCC-CC----chhHHHHHHHHHHhcCceeE Confidence 4433222110 111122222222 221111 1111111111 2233332 32 22334444444443332 Q ss_pred ---CCcce-------eec-------CCC---ceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhc---------CCc Q lcl|NC_010576. 255 ---NKYGV-------ATL-------DTQ---EKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILN---------GTA 305 (447) Q Consensus 255 ---n~~~~-------~vl-------~~g---~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~---------g~~ 305 (447) +.|.| ..| -+| .+++.|.-...--+++..++..+...++++||.+-|. |.+ T Consensus 311 YDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr~ 390 (521) T protein:vir:65 311 YDASTGKLKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDG 390 (521) T ss_pred eecccccccccccccchhhhhcccccCCCCccceeecccCCCcChHHHHHHHHHHHHHHhCCCceeccCCCCcceecccc Confidence 11221 122 133 3444443323334688899999999999999998862 222 Q ss_pred H-----HHHHHHHHHHHHhHHHHHHHHHHHhhc-----CChhHhc-CCceEEEec--chhh----hcC-HHHHHHHHHHH Q lcl|NC_010576. 306 N-----EQQTLGYYNRCVDVLLQYVTDAISRIA-----LTKTAVS-QGQVLVYYR--NPFK----LVP-VEQLATVADVL 367 (447) Q Consensus 306 ~-----e~~~~~f~~~ti~P~~~~ie~~l~~kL-----l~~~e~~-~g~~i~f~~--~~l~----~~d-~~~~~~~~~~~ 367 (447) + |-....|+..-=.-+...+.+.|-..| +++.|+. ...+|+|++ |.-. ... 
+..|+.++..+ T Consensus 391 ~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~ 470 (521) T protein:vir:65 391 SEITRDELEFSKFIRTLQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERI 470 (521) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHh Confidence 2 222233433222233334444444443 4555552 233455543 2221 000 11222222221 Q ss_pred Hh--CCCcCHHHHHHH-hCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCC Q lcl|NC_010576. 368 TR--NAIYTPNEIREL-TGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATAST 425 (447) Q Consensus 368 ~~--~G~~t~NE~R~~-~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 425 (447) -. +-.++.+=+|+. +.+.-.+ + ... ..+..+...++--.+..++.++= T Consensus 471 dpyvGky~S~dyi~k~ILr~tDee------i-~~~---~k~I~~E~~~~~~~~p~~~~~~f 521 (521) T protein:vir:65 471 TPYIGKYFSNQTVMRDILKYTDDQ------M-DTE---KKQIEEEANDPRFKQTPDEIEDF 521 (521) T ss_pred hhhhccccchHHHHHHHhccCHHH------H-HHH---HHHHHHhhhCCCCCCCcccccCC Confidence 11 012233333321 1211000 0 000 00000001111111000000000 No 186 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=73.42 E-value=0.17 Score=24.80 Aligned_cols=407 Identities=8% Similarity=-0.042 Sum_probs=145.6 Q ss_pred CchhHhhhhhcccccCCc-ccccccccccccccccccc-ccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQ-NQNQNTNDFLTPSNGMTSF-GGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKH 78 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~-~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~ 78 (447) |--.++|.++.+.+..+. ++-+....++......... ........+ ..+ +........|+..+.-+-.-|+. 
T Consensus 39 ~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~---~~k--i~~n~~k~Iv~~~~~yl~g~p~~- 112 (511) T protein:vir:10 39 LQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMA---DNR--VAHDYASYISDFINGYFLGNPIQ- 112 (511) T ss_pred ccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccC---cce--eecchHHHHHHHHhhhhcccCce- Confidence 323333333332221111 0001111111110000000 000000000 001 11223445566666666666765 Q ss_pred EEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccc-------eeeeccCCC-c Q lcl|NC_010576. 79 LKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSG-------SFDINTARV-G 150 (447) Q Consensus 79 ~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~-------~~~~~~~~~-~ 150 (447) |.. ++ +.....+..++.. |. .......+...++.+|.||+++..+..+.... .+++..... . T Consensus 113 ~~~--~d---~~~~~~l~~~~~~--n~---~~~~~~~~~~~~~i~G~ay~~vy~dedg~~~i~~~~p~~~~~vydd~~~~ 182 (511) T protein:vir:10 113 YQD--DD---KDVLEAIEAFNDL--ND---VESHNRSLGLDLSIYGKAYEIMIRNQDDETRLYKSDAMSTFVIYDNTIER 182 (511) T ss_pred eec--Cc---hHHHHHHHHHHhh--cC---HHHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCCCC Confidence 321 11 1123345555432 33 22455667788899999999877665432211 111110000 0 Q ss_pred cee---eecC---------Cce-EEEEeeecc--------cccc--------eeee--cccccccccccc--cccccchh Q lcl|NC_010576. 151 KIM---QFFP---------RQV-MVRVWNDNT--------GLEQ--------DLLV--SKENCIIIESPF--YAILNDTN 197 (447) Q Consensus 151 ~~~---~~~~---------~~~-~~~~~~~~~--------~~~~--------~~~~--~~~~v~~~~~~~--~~~~~~~~ 197 (447) ++. .++. ..+ .+.+|.... +.+. ...+ ..-.++++++.. .+...... 
T Consensus 183 ~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f~nn~~g~gd~e~v~ 262 (511) T protein:vir:10 183 NSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVI 262 (511) T ss_pred ceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcceeEEEecCCCCCCCchhhhH Confidence 000 0000 000 011111100 0000 0000 001123333211 11111122 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhh Q lcl|NC_010576. 198 QTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQN 277 (447) Q Consensus 198 ~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~ 277 (447) ..+..+...++...+...+. +.+--++.-.....++...+..+............. +...-.+.+.+++-++..... T Consensus 263 ~liDa~d~~~S~~~~~~~~~--~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~d~~~l~~~~~~ 339 (511) T protein:vir:10 263 TLIDLYDNAESDTANYMSDL--NDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYAD-SEGRETEGSVDGGYIYKQYDV 339 (511) T ss_pred HHHHHHHHHHHHHHHHHHHh--hCceeeeeccccCCchhhccchhccceecccccccc-cccccCCCCcceeEEeecCCH Confidence 22222222222222222222 222222221111222211111111000000000001 111123556677777655444 Q ss_pred hh-HHHHHHHHHHHHHHhCCCHHHh---cCCcHH--------------HHHHHHHHHHHhHHHHHHHHHHHhhcCChhHh Q lcl|NC_010576. 278 NL-LSDVRQLQQDFYNQMGITEAIL---NGTANE--------------QQTLGYYNRCVDVLLQYVTDAISRIALTKTAV 339 (447) Q Consensus 278 ~~-l~~~~~~~~~Ia~~fgVP~~~l---~g~~~e--------------~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~ 339 (447) .. ....+.+.+.|+..-++|..-. +|+.+. .....++...|.-.++.|...+..+--.... 
T Consensus 340 ~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~- 418 (511) T protein:vir:10 340 QGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDAN- 418 (511) T ss_pred HHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccc- Confidence 43 4556778888888888886433 232221 1122345566666666665555433211111 Q ss_pred cCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccc--ccc---ccchhhcccccCCCC Q lcl|NC_010576. 340 SQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELF--NRN---IADGNQVGGINTPGQ 414 (447) Q Consensus 340 ~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~--~~~---~~~~~~~~~~~~~~~ 414 (447) ..-..+++.+..-+-.|.++.++.+.++. |+++.--+.+++++ ++++. .++- ... ..+..+......+.+ T Consensus 419 ~d~~~i~i~f~~~~p~d~~~~~~~~~kl~--G~iS~et~~~~l~~--v~d~~-~E~~ri~~E~~~~~~~~~~~~~~~~~~ 493 (511) T protein:vir:10 419 KDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSF--FQDPE-LEVKKIEEDEKESIKKAQKGIYKDPRD 493 (511) T ss_pred cccceeeEEeCCCCCcCHHHHHHHHHHHh--ccCcHHHHHHhCCC--CCCHH-HHHHHHHHHHHHHHHHHhhhcccCCCC Confidence 11124677777888889999999999885 88988777777643 33321 1111 000 000001011111111 Q ss_pred CCCCCCCcCCCCCCCccc Q lcl|NC_010576. 415 ITSDQPATASTDPLNNVS 432 (447) Q Consensus 415 ~~~~~~~~~~~~~~~~~~ 432 (447) ..++.+..++.+....++ T Consensus 494 ~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:10 494 INDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCCCCcccCcccccC Confidence 111111111111111111 No 187 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=68.51 E-value=0.24 Score=24.02 Aligned_cols=402 Identities=9% Similarity=0.044 Sum_probs=141.8 Q ss_pred CchhHhhhhhcccccCC-----c-ccccccccccccccccc---------cccccc----------ccCCcccccchhhh Q lcl|NC_010576. 
1 MASSDRLLHSWNAFQSN-----Q-NQNQNTNDFLTPSNGMT---------SFGGYY----------GRGQSNYSRSYSYN 55 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~-----~-~~~~~~~~~~~~~~~~~---------~~~~~~----------~~~~~~~~~~~~~~ 55 (447) |.+.+ |+++|.-.... . ....+++.+... .+.. +.+|.+ .....-+..=+..+ T Consensus 1 ~~~~~-lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~-DGa~~i~~~~~~~~~~g~~~~~~d~~~~~~~~~~LI~~YR~ma 78 (516) T protein:vir:10 1 MKFLD-LFKFWDRVDQNEYDERLKQGHESIATPKKD-DGATEIEAREGESSYNALMQQFFGIDNNISGTKDLINTYRQLT 78 (516) T ss_pred CCchH-hcccccchhhHHHHhhhcCCCCcccCCCCc-cCceeeecCcccccccceeeeeecccCccccHHHHHHHHHHhh Confidence 66543 44444211110 0 111222211111 0000 011111 10101111123355 Q ss_pred hhHHHHHHHHHHHHhhccC-----ceEEEEEcC--CCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeE Q lcl|NC_010576. 56 KADLIKSVITRIALDASMV-----DFKHLKIDP--ISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAM 128 (447) Q Consensus 56 ~~~~v~~cv~~ia~~ia~l-----p~~~~r~~~--~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i 128 (447) .+|-|..||+-|.+++.-. |+.+--.+. ....+........++|+. -|-..++++ ++..+...|-.|. T Consensus 79 ~~pEvd~Av~eIvneaiv~d~~~~pV~l~l~~~e~s~sik~kI~eeF~~Il~l-l~F~~~~~~----~fR~WYVDgRi~f 153 (516) T protein:vir:10 79 NNPEVERAVANIVNEAVVYEKGHKVVSLDLDDTEFSSSIKDKILEEFDEICRL-LDASRKLDT----LFRRWYIDSRIFF 153 (516) T ss_pred hccchhHHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHH-hccchhhhH----HHHhhhhcceEEE Confidence 6778899999998887533 222211000 000011112223344432 122233333 3445566777665 Q ss_pred EE-eeccCCcccceeeeccCCCcceeee--------------------cCCceEEEEeeecccccceeeecccccccccc Q lcl|NC_010576. 129 VP-IDTTVDPDSGSFDINTARVGKIMQF--------------------FPRQVMVRVWNDNTGLEQDLLVSKENCIIIES 187 (447) Q Consensus 129 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~--------------------~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~ 187 (447) .+ ..+....+.....+.|..+..+... 
.++...+.+....+.....+.++.+-|.+..+ T Consensus 154 hKiid~~k~GI~elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~daI~y~hS 233 (516) T protein:vir:10 154 HKIMPNPKEGIVELRRLDPRHVEYYREIVTSDVGGTSVVKGYREFFVYTTGNEGYAYNGRLFEPNTRIKIPRSAIVYAHS 233 (516) T ss_pred EEEecCcccceeeeeeeCCcceeeEEeeecccCcchhhhhceeeeeeeecCccceeccccccCCCCceecchhheeeeec Confidence 53 3333333334444444333222111 01111111101111222334555555444444 Q ss_pred ccccccc-chhHHHHHHHHH---HHHHHHH---HHHhhcCccc-ceeeeCCcCChHHHHHHHHHHHHHHHHHhcc----- Q lcl|NC_010576. 188 PFYAILN-DTNQTLRMLEQK---IKLMNSQ---DNRASSGKLN-GFIQFPYSTKSTARAAQAARRKQEIENEMAN----- 254 (447) Q Consensus 188 ~~~~~~~-~~~~~~~~~~~~---~~~~~~~---~~~~n~~~~~-gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~----- 254 (447) .+..... ..-+-+..+... |..+..+ ....+.---+ +.|.++. +. +..+++....+...+++ T Consensus 234 Gl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGn-LP----k~KAeqYl~~iM~k~KNklvYD 308 (516) T protein:vir:10 234 GLQDCSDRGIVGYLHNAVKPANQLKLLEDALVIYRITRAPERRVFYIDVGN-MP----NRKATEYVNGIMQSLKNRVVYD 308 (516) T ss_pred CcccCCCCceeceehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCC-CC----chhHHHHHHHHHHhcCceeEEe Confidence 3322211 111112222222 2221111 1111111111 2233332 32 22334444444443322 Q ss_pred -CCcce------e-ec-------CC---CceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhc---------CCcHH Q lcl|NC_010576. 255 -NKYGV------A-TL-------DT---QEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILN---------GTANE 307 (447) Q Consensus 255 -n~~~~------~-vl-------~~---g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~---------g~~~e 307 (447) +.|.| + .| -+ |.+++.|.-.-.--+++..++..+...++++||.+-|. 
|.++| T Consensus 309 a~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~E 388 (516) T protein:vir:10 309 SNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVTSLPGAQTMGEMDDVRWFNKKLYEALRIPLSRMPRDDGGMVIGGQDMA 388 (516) T ss_pred CCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccch Confidence 22222 1 11 13 33444443333334688899999999999999998773 23332 Q ss_pred -----HHHHHHHHHHHhHHHHHHHHHHHhhc-----CChhHhc-CCceEEEec--chh----hhcC-HHHHHHHHHHHH- Q lcl|NC_010576. 308 -----QQTLGYYNRCVDVLLQYVTDAISRIA-----LTKTAVS-QGQVLVYYR--NPF----KLVP-VEQLATVADVLT- 368 (447) Q Consensus 308 -----~~~~~f~~~ti~P~~~~ie~~l~~kL-----l~~~e~~-~g~~i~f~~--~~l----~~~d-~~~~~~~~~~~~- 368 (447) -....|+..-=.-+...+-+.|-..| +++.|+. ...+|+|++ |.- .... +..|+.++..+- T Consensus 389 ItRDEiKF~KFI~rLR~rFs~lF~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dp 468 (516) T protein:vir:10 389 ITRDELDFRKFIVQLQHNFEEIFLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELKDIETLRQRVDALSQIEP 468 (516) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhh Confidence 22233433222233344555555533 5666663 233455543 222 1111 113333333322 Q ss_pred -hCCCcCHHHHHHH-hCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCC Q lcl|NC_010576. 369 -RNAIYTPNEIREL-TGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTD 426 (447) Q Consensus 369 -~~G~~t~NE~R~~-~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 426 (447) -+.+++.+=+|+. 
+.+.-.+=..-+..+ .+...++--.+ |..+.+= T Consensus 469 yvGky~s~~yi~k~ILr~tDeei~~~~k~I----------~~E~~~~~~~~--p~~e~~f 516 (516) T protein:vir:10 469 YVGKYVSHDYVMKNILQMTDEQIAQEEKQI----------EKEANVKRFQN--PENEDDF 516 (516) T ss_pred hhccccchHHHHHHHhcCCHhHHHHHHHHH----------HHhhhCCCCCC--CCccccC Confidence 2235555555442 233211000000000 00011110000 0000000 No 188 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=66.94 E-value=0.26 Score=23.79 Aligned_cols=348 Identities=10% Similarity=0.015 Sum_probs=131.5 Q ss_pred cccccCCccccccccccccccccccccccccccCCcccccc-hhhh--hhHHHHHHHHHHHHhhccCceEEEEEcCCCce Q lcl|NC_010576. 11 WNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRS-YSYN--KADLIKSVITRIALDASMVDFKHLKIDPISGN 87 (447) Q Consensus 11 ~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~--~~~~v~~cv~~ia~~ia~lp~~~~r~~~~~~~ 87 (447) ++.++++ ......++........+ + ..+... +... ......-+|+.+++.+ -+.=|+ T Consensus 1 l~~~~~r---~~~~~~yY~g~~~~~~~-~------~~~p~~~~~~~~~v~nw~~~~Vds~a~rl---~~~Gf~------- 60 (410) T protein:vir:95 1 MNLYQSR---VNLRYKHYAMQHYEAPT-G------ITIPAHIRAKYQAVLGWAAKGVDSLADRL---IFRAFA------- 60 (410) T ss_pred CCcchhh---HHHHHHHhcCCCCcccc-c------hhccHHHHhHHHhhcchhHHHHHHhHhhh---cccccc------- Confidence 3444433 22222222111100000 0 000000 0000 1112223333333322 222222 Q ss_pred eccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccC--------CCcceee----e Q lcl|NC_010576. 88 QTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTA--------RVGKIMQ----F 155 (447) Q Consensus 88 ~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~--------~~~~~~~----~ 155 (447) ..+..+..++. =|... .....+..+.+.+|.||+++..+..+.+... ...|. ....+.. 
+ T Consensus 61 --~~d~~l~~i~~--~N~ld---~~~~~~~~~al~~G~sf~~v~~~~d~~~~i~-~~sP~~~~~i~Dp~~~~~~~al~~~ 132 (410) T protein:vir:95 61 --NDDFNVTEIFD--RNNPD---IFFDSAILSALIGSCSFVYISKGEDDEVRLQ-VIESSNATGVIDPITGLLVEGYAVL 132 (410) T ss_pred --CCCchHHHHHh--hcChH---HHHHHHHHHHHHhCceeEEEecCCCCceEEE-EEcccceEEEEeCCCCceEEEEEEE Confidence 12334666653 24433 3445677888999999999877655432111 11111 1111110 0 Q ss_pred c-C--Cce-EEEEeee-------cccccceeeecc--ccccccccc-c-c--cccc----chhHHHHHHHHHHHHHHHHH Q lcl|NC_010576. 156 F-P--RQV-MVRVWND-------NTGLEQDLLVSK--ENCIIIESP-F-Y--AILN----DTNQTLRMLEQKIKLMNSQD 214 (447) Q Consensus 156 ~-~--~~~-~~~~~~~-------~~~~~~~~~~~~--~~v~~~~~~-~-~--~~~~----~~~~~~~~~~~~~~~~~~~~ 214 (447) . . +.. ...+|.. ..+....+.++. -.++++.+. . . .+.+ ........+...+..+.... T Consensus 133 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~ 212 (410) T protein:vir:95 133 ARDDYNRPTLEAYFEPNATHFIPKDGEPYSVTNETGIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRTLERADITA 212 (410) T ss_pred EecCCCeEEEEEEEeCCcEEEEeeCCccccccCCCCCcceEEecccccCCccCCccccchhHHHHHHHHHHHHHHHHHHH Confidence 0 0 000 0111100 000000011100 012333210 0 0 0111 12222223333333333333 Q ss_pred HHhhcCcc-cceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCC-----CceeeecCCChhhhhHHHHHHHHH Q lcl|NC_010576. 215 NRASSGKL-NGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDT-----QEKFVSAGMGLQNNLLSDVRQLQQ 288 (447) Q Consensus 215 ~~~n~~~~-~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~-----g~~~~~l~~~~~~~~l~~~~~~~~ 288 (447) ++. +.| +.++-.+ ++. ...+.|+. ..++++.++. +.++.++....-..+++.++.+.. 
T Consensus 213 e~~--a~pqr~i~G~d----~d~--~~~~~~~~--------~~~~i~~~~~~~~~~~~~v~q~~~~~l~~~~~~l~~l~~ 276 (410) T protein:vir:95 213 EFY--SWPQKYILGLD----PDA--EPMEKWKA--------TVSSLLTISSSDKGVKPSVGQFTTASMSPFTEQLRTAAA 276 (410) T ss_pred HHh--cchhheeeccC----CCC--CcCchhhh--------hhhhheeccCCCCCCcceEEecCCCChHHHHHHHHHHHH Confidence 332 222 2222211 111 11112221 1234555543 346666654333345788889999 Q ss_pred HHHHHhCCCHHHhcCCcH---HHHHHHHHHHHHhHHHH--------HHHHHHHhhc--CChh--HhcCCceEEEecc--- Q lcl|NC_010576. 289 DFYNQMGITEAILNGTAN---EQQTLGYYNRCVDVLLQ--------YVTDAISRIA--LTKT--AVSQGQVLVYYRN--- 350 (447) Q Consensus 289 ~Ia~~fgVP~~~l~g~~~---e~~~~~f~~~ti~P~~~--------~ie~~l~~kL--l~~~--e~~~g~~i~f~~~--- 350 (447) +||..=++|++.+++... ....+.+....|.-.++ .+++.+-..+ .... .......+++... T Consensus 277 ~~a~~s~lP~~~lg~~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~ 356 (410) T protein:vir:95 277 GFAGEMGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRYTRSQFVRTAVKWEPLF 356 (410) T ss_pred HHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccccceeeEEeeecC Confidence 999999999999987432 11111211222211111 1222111111 0100 0011122333222 Q ss_pred hhhhcCHHHHHHHHHHHHhC--CCcCHHHHHHHhCCCCCCCccccccccccccchhhcccccCCCC Q lcl|NC_010576. 351 PFKLVPVEQLATVADVLTRN--AIYTPNEIRELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQ 414 (447) Q Consensus 351 ~l~~~d~~~~~~~~~~~~~~--G~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 414 (447) +....+....++++.|+++. |+....-+++++|+.+-+- .. -+... 
+ ...|+ T Consensus 357 d~~~~s~a~~aDa~~Kl~~a~~g~~~~~~~~~~lg~~~~~~--~~-----~~~~e-~----~~~g~ 410 (410) T protein:vir:95 357 EADANTMTMIGDGVVKLNQALPGYINAETIRDLTGIAGDMS--AK-----PVVSE-G----GSNGE 410 (410) T ss_pred CcchhhHHHHHHHHHHHHHhccCCccHHHHHHhcCCChHHH--HH-----HHHHH-H----HhCCC Confidence 22334678899999999998 6777777999999975321 11 01100 0 01111 No 189 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=63.27 E-value=0.32 Score=23.30 Aligned_cols=403 Identities=8% Similarity=0.080 Sum_probs=138.0 Q ss_pred hhHh--hhhhcccccCCcccc------cccccccccccc-------c--cccccccccCC----c-------ccccchhh Q lcl|NC_010576. 3 SSDR--LLHSWNAFQSNQNQN------QNTNDFLTPSNG-------M--TSFGGYYGRGQ----S-------NYSRSYSY 54 (447) Q Consensus 3 ~~~~--l~~~~~~f~~~~~~~------~~~~~~~~~~~~-------~--~~~~~~~~~~~----~-------~~~~~~~~ 54 (447) .|+. +|++|--|.-+..+. .+++.|-....- . ....|++.... + -+..=+.. T Consensus 1 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~s~~~P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~~e~~~~~~~eLI~~YR~m 80 (521) T protein:vir:81 1 MFSRLKMLARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNLPASAWNSLTQQFYSTDQKISTTKQLVNTYRGL 80 (521) T ss_pred CcchhhhhHhhcCchhhhHHhhhccCccccccCCCCCCceEecccCCCcceeecceeeeecccccchhhHHHHHHHHHHH Confidence 3444 345554443332111 122222111100 0 00001110000 0 00111234 Q ss_pred hhhHHHHHHHHHHHHhhccC-----ceEEEEEcC--CCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCee Q lcl|NC_010576. 55 NKADLIKSVITRIALDASMV-----DFKHLKIDP--ISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIA 127 (447) Q Consensus 55 ~~~~~v~~cv~~ia~~ia~l-----p~~~~r~~~--~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~ 127 (447) +.+|-|.+||+-|.+++.-. |+.+-=.+. ....++.......++|+. 
-|-..++++ ++..+...|-.| T Consensus 81 a~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~l-l~F~~~~~~----~fR~WYVDgRi~ 155 (521) T protein:vir:81 81 MNNHEVENAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNT-IQFDRRGQD----MFRRWYVDSRIF 155 (521) T ss_pred hhccchhhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHH-hccchhhhH----HHhhhhhcceEE Confidence 56788999999998887533 222211111 001111122223344432 122233333 445566788888 Q ss_pred EEEeec--cCCcccceeeeccCCCcceeee----cC------CceEEEEe----------eecccccceeeecccccccc Q lcl|NC_010576. 128 MVPIDT--TVDPDSGSFDINTARVGKIMQF----FP------RQVMVRVW----------NDNTGLEQDLLVSKENCIII 185 (447) Q Consensus 128 i~~~~~--~~~~~~~~~~~~~~~~~~~~~~----~~------~~~~~~~~----------~~~~~~~~~~~~~~~~v~~~ 185 (447) ..+.-+ ....+.....+.|..+..+... .+ +-..+.+| ...+.....+.++.+-|.+. T Consensus 156 fhkiid~~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI~y~ 235 (521) T protein:vir:81 156 FHKIIGKNPKDGIVELRQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAITYA 235 (521) T ss_pred EEEEEcCCccccceeeeeeCCcceeeeeeecccccCccceecceeeeeeeecCCccccccceeecCCcceeechhheeee Confidence 766533 2233333333333332222110 00 00011111 11112233445555555444 Q ss_pred ccccccccc-chhHHHHHHHHHHHHHHHHHH----H--hhcCccc-ceeeeCCcCChHHHHHHHHHHHHHHHHHhcc--- Q lcl|NC_010576. 186 ESPFYAILN-DTNQTLRMLEQKIKLMNSQDN----R--ASSGKLN-GFIQFPYSTKSTARAAQAARRKQEIENEMAN--- 254 (447) Q Consensus 186 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~----~--~n~~~~~-gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 254 (447) .+.+..... ..-+-+..+...+..+.-... + .+.---+ +.|.++ .+. 
+..+++....+...+++ T Consensus 236 hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvG-nlp----k~KAeqYl~~im~k~kNklv 310 (521) T protein:vir:81 236 HSGLMDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTG-NMN----NRKAAQHMNSVAQSFKNRVV 310 (521) T ss_pred eccceeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecC-CCC----chhHHHHHHHHHHhcCceeE Confidence 443322211 011112222222222111111 1 1111111 223333 232 23334444444443332 Q ss_pred ---CCcce-------eec-------CCC---ceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhc---------CCc Q lcl|NC_010576. 255 ---NKYGV-------ATL-------DTQ---EKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILN---------GTA 305 (447) Q Consensus 255 ---n~~~~-------~vl-------~~g---~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~---------g~~ 305 (447) +.|.| ..| -+| .+++.|.-...--+++..++..+...++++||.+-|. |.+ T Consensus 311 YDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~ 390 (521) T protein:vir:81 311 YDASTGKLKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDG 390 (521) T ss_pred eecccccccccccccchhhhhcccccCCCcccceeecccCCCCChHHHHHHHHHHHHHHhCCccccccCCCCcceecccc Confidence 11221 122 133 3444443223334688899999999999999999883 222 Q ss_pred H-----HHHHHHHHHHHHhHHHHHHHHHHHhhc-----CChhHhc-CCceEEEec--chhh----hcC-HHHHHHHHHHH Q lcl|NC_010576. 306 N-----EQQTLGYYNRCVDVLLQYVTDAISRIA-----LTKTAVS-QGQVLVYYR--NPFK----LVP-VEQLATVADVL 367 (447) Q Consensus 306 ~-----e~~~~~f~~~ti~P~~~~ie~~l~~kL-----l~~~e~~-~g~~i~f~~--~~l~----~~d-~~~~~~~~~~~ 367 (447) + |-....|+..-=.-+...+.+.|-..| +++.|+. ...+|+|++ |.-. ... 
+..|+.++..+ T Consensus 391 ~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~ 470 (521) T protein:vir:81 391 SEITRDELEFSKFIRTRQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERI 470 (521) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHh Confidence 2 222233433222233344444444443 4555552 233455543 2221 000 11222222221 Q ss_pred Hh--CCCcCHHHHHHH-hCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCccccc Q lcl|NC_010576. 368 TR--NAIYTPNEIREL-TGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTS 434 (447) Q Consensus 368 ~~--~G~~t~NE~R~~-~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (447) -. +-.++.+=+|+. +.+.-.+ + ... ..+..+...++--.+.. .++++= T Consensus 471 dpyvGky~s~dyi~k~ILr~tDee------i-~~~---~k~I~~E~~~~~~~~p~---------~~~~~f 521 (521) T protein:vir:81 471 TPYIGKYFSNQTVMRDILKYTDDQ------M-DTE---KKQIEEEANDPRFKQTP---------DEIEDF 521 (521) T ss_pred hhhhccccchHHHHHHHhccCHHH------H-HHH---HHHHHHHhhCCCCCCCc---------ccccCC Confidence 10 012233333221 1211000 0 000 00000000111111000 000000 No 190 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=62.70 E-value=0.33 Score=23.22 Aligned_cols=404 Identities=10% Similarity=0.067 Sum_probs=149.0 Q ss_pred CchhHhhhhhcccccCCccc---------ccccccccccccccc---------ccccc----cccCCc-------ccccc Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQ---------NQNTNDFLTPSNGMT---------SFGGY----YGRGQS-------NYSRS 51 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~---------~~~~~~~~~~~~~~~---------~~~~~----~~~~~~-------~~~~~ 51 (447) |.-|+.++++|++..+.... +.++..+-... +.. +..+. 
+...-+ -+..= T Consensus 1 ~~~~~~~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~~d-Ga~~I~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~Y 79 (524) T protein:vir:10 1 MANFNTILSFLKPWANEDEKEYKQQINNNLESVTAPKLDD-GAREIETQEQNIPYNALMQQMFGSNEPEVKNTRELIDTY 79 (524) T ss_pred CCchhhHHHHhhhhhcchhhhhhhhhccCCCccccCCCCC-CceeeccCcccccchhhhhhhhhcccchhhhHHHHHHHH Confidence 99999888877776543221 11222211111 110 00000 000000 00111 Q ss_pred hhhhhhHHHHHHHHHHHHhhccC-----ceEEEEEcCC--CceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcC Q lcl|NC_010576. 52 YSYNKADLIKSVITRIALDASMV-----DFKHLKIDPI--SGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEG 124 (447) Q Consensus 52 ~~~~~~~~v~~cv~~ia~~ia~l-----p~~~~r~~~~--~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~G 124 (447) +..+.+|-|.+||+-|.+++.-. |+.+-=.+.+ ...++.......++|+. -|-..++++ ++..+...| T Consensus 80 R~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~Ld~~~~s~siK~kI~eeF~~Il~l-l~F~~~~~~----~fR~WYVDg 154 (524) T protein:vir:10 80 RNLMNNYEVDNAVQEIVSDAIVYEDDKEVVALNLDGTDFSQSIKDKILAEFSEVLNL-LNFQRKGTD----HFQRWYVDS 154 (524) T ss_pred HHHhhccchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHH-hccchhhhH----HHhhheeec Confidence 23456788999999998887532 2222111111 00111112223344432 122233333 445566778 Q ss_pred CeeEEEeeccC---CcccceeeeccCCCcceeeecC----------CceEEEEee----------ecccccceeeecccc Q lcl|NC_010576. 125 QIAMVPIDTTV---DPDSGSFDINTARVGKIMQFFP----------RQVMVRVWN----------DNTGLEQDLLVSKEN 181 (447) Q Consensus 125 na~i~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~----------~~~~~~~~~~~~~~~ 181 (447) -.|..++-+.. ..+.....+.|..+..+..... +--.+.+|. ..+.....+.++.+. 
T Consensus 155 Ri~fHkiid~~~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~~~~~~~~~~~~~~ikI~~dA 234 (524) T protein:vir:10 155 RIFFHKIINPKKMKDGVQELRRLDPRQVQYIREIVTRMEDGVKIVDGYREFFVYDTGHESYCADGRIYSAGTKVKIPRAA 234 (524) T ss_pred eEEEEEEeeCCCccccceeeeeeCCccceeeeeecccCcccchhhcchhhheeecCCCcccccCcceecCCcceecchhh Confidence 77765543322 2233333333332222111100 000011111 112334556788888 Q ss_pred ccccccccccccc-chhHHHHHHHHHHHHHHHHHH----H--hhcCccc-ceeeeCCcCChHHHHHHHHHHHHHHHHHhc Q lcl|NC_010576. 182 CIIIESPFYAILN-DTNQTLRMLEQKIKLMNSQDN----R--ASSGKLN-GFIQFPYSTKSTARAAQAARRKQEIENEMA 253 (447) Q Consensus 182 v~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~----~--~n~~~~~-gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~ 253 (447) |.|..+.+.+... ..-+-|..+...+..+.-... + .+.---+ +.|.++. +. +..+++....+...++ T Consensus 235 Ivy~~SGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGn-lP----k~KAeqYl~~im~k~k 309 (524) T protein:vir:10 235 VVYAHSGLLDCCGKNIIGYLQRAIKPANQLKLMEDAMVIYRITRAPDRRVFYIDTGN-MP----SRKAAAQMQHIMNTMK 309 (524) T ss_pred eeeeccCcccCCCCceeccchHhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCC-CC----chhHHHHHHHHHHhcC Confidence 9998876544332 111222222222222111111 1 1111111 2233332 32 2233444444443332 Q ss_pred c------CCcce------e-ec-------CCC---ceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhc-------- Q lcl|NC_010576. 254 N------NKYGV------A-TL-------DTQ---EKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILN-------- 302 (447) Q Consensus 254 ~------n~~~~------~-vl-------~~g---~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~-------- 302 (447) + +.|.| + .| -+| .+++.|.-.-.--+++..++..+...++++||.+-|. 
T Consensus 310 NKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~f~ 389 (524) T protein:vir:10 310 NRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTMPGATGMSDMDDVLYFRTALYRALRIPESRIPSESNSGVM 389 (524) T ss_pred ceeEEeccCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCchhccCCCCcccc Confidence 1 11222 1 11 133 3444443333334688899999999999999999883 Q ss_pred -CCcH-----HHHHHHHHHHHHhHHHHHHHHHHHhhc-----CChhHhc-CCceEEEec--chhh----hcC-HHHHHHH Q lcl|NC_010576. 303 -GTAN-----EQQTLGYYNRCVDVLLQYVTDAISRIA-----LTKTAVS-QGQVLVYYR--NPFK----LVP-VEQLATV 363 (447) Q Consensus 303 -g~~~-----e~~~~~f~~~ti~P~~~~ie~~l~~kL-----l~~~e~~-~g~~i~f~~--~~l~----~~d-~~~~~~~ 363 (447) |.++ |-....|+..-=.-+...+.+.|-..| +++.|+. ...+|+|++ |.-. ... +..|+.+ T Consensus 390 ~gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~ 469 (524) T protein:vir:10 390 FDAGTAITRDELKFAKWIRQLQNKFEEIFLDPLKTNLILKKIITEDEWEREINNIKVTFNRDSYFSEMKDAEIMERRINM 469 (524) T ss_pred ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHH Confidence 1122 222223332222233334444444443 4556663 233455443 3221 100 1122332 Q ss_pred HHHHHh--CCCcCHHHHHHH-hCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCC Q lcl|NC_010576. 364 ADVLTR--NAIYTPNEIREL-TGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTD 426 (447) Q Consensus 364 ~~~~~~--~G~~t~NE~R~~-~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 426 (447) +..+-. +-.++.+=+|+. +.+.-.+ -.+ ... +..+...++--.+..++.+. 
= T Consensus 470 l~~~dpyvGky~s~~yi~k~ILr~tDee---i~~-~~k------~I~~E~k~~~~~~~~~~~~~-f 524 (524) T protein:vir:10 470 LTMAEPFIGKYISHQTAMKDFLQMTDEE---INQ-EAK------QIEEESKEARFQNPDEEEED-F 524 (524) T ss_pred HHHhhhhhcccchhHHHHHHHhccCHHH---HHH-HHH------HHHHHhhcCCCCCCChhhhc-C Confidence 222211 012233333321 2221000 000 000 00000011111110000000 0 No 191 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=60.16 E-value=0.38 Score=22.90 Aligned_cols=407 Identities=9% Similarity=-0.031 Sum_probs=137.9 Q ss_pred CchhHhhhhhcccccCCc-cccccccccccccccc-cccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQ-NQNQNTNDFLTPSNGM-TSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKH 78 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~-~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~ 78 (447) |-...+|.++.+.+..+. +.-+....++...... ........... ...+ .......-.|+..+.-+-.-|+. T Consensus 39 ~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~---~~~k--i~~n~~k~Iv~~~~~yl~g~p~~- 112 (511) T protein:vir:96 39 LQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYM---ADNR--VAHDYASYISDFINGYFLGNPIQ- 112 (511) T ss_pred hcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCccccccc---Ccce--eecchHHHHHHHHhhhhcccCce- Confidence 222222222221111100 0000001111100000 00000000000 0001 11223445556666666666765 Q ss_pred EEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccc-------eeeeccCCC-c Q lcl|NC_010576. 79 LKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSG-------SFDINTARV-G 150 (447) Q Consensus 79 ~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~-------~~~~~~~~~-~ 150 (447) |.. ++ +.....+..++.. |. ...+...+...++.+|.||+++..+..+.+.. .+++..... . 
T Consensus 113 ~~~--~d---~~~~~~l~~~~~~--n~---~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~~~p~~~~~v~dd~~~~ 182 (511) T protein:vir:96 113 YQD--DD---KDVLEAIEAFNDL--ND---VESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFIIYDNTVER 182 (511) T ss_pred eec--Cc---hHHHHHHHHHHhh--cC---hhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccceEEEEcCCCCC Confidence 321 11 1123345555432 33 23455667788889999999877765442211 111111000 0 Q ss_pred cee---eecC---------Cce-EEEEeeecc--------cccce--------ee--eccccccccccccc--ccccchh Q lcl|NC_010576. 151 KIM---QFFP---------RQV-MVRVWNDNT--------GLEQD--------LL--VSKENCIIIESPFY--AILNDTN 197 (447) Q Consensus 151 ~~~---~~~~---------~~~-~~~~~~~~~--------~~~~~--------~~--~~~~~v~~~~~~~~--~~~~~~~ 197 (447) ++. .++. ..+ .+.+|.... +.+.. .. +..-.++++++... +...... T Consensus 183 ~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~ 262 (511) T protein:vir:96 183 NSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERRKGDYEKVI 262 (511) T ss_pred ceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCCCCCCchhhhH Confidence 100 0000 000 011111110 00000 00 00011222222111 1111111 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhh Q lcl|NC_010576. 198 QTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQN 277 (447) Q Consensus 198 ~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~ 277 (447) .....+...++-..+...+. +.+--+++-......+..+...+...-.......-...+ .-.+.+.+++.++..... 
T Consensus 263 ~liDa~~~~~S~~~~~~~~~--~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~~~~~ 339 (511) T protein:vir:96 263 TLIDLYDNAESDTANYMSDL--NDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEG-RETEGSVDGGYIYKQYDV 339 (511) T ss_pred HHHHHHHHHHHHHHHHHHHh--hcchhheecCccCCchhhcccccccceeccccceecccc-ccCCCCcceeEEeecCCH Confidence 22222222222222222222 222222222111222211111100000000000000000 012344555556554444 Q ss_pred hh-HHHHHHHHHHHHHHhCCCHHHh---cCCcHHH--------------HHHHHHHHHHhHHHHHHHHHHHhhcCChhHh Q lcl|NC_010576. 278 NL-LSDVRQLQQDFYNQMGITEAIL---NGTANEQ--------------QTLGYYNRCVDVLLQYVTDAISRIALTKTAV 339 (447) Q Consensus 278 ~~-l~~~~~~~~~Ia~~fgVP~~~l---~g~~~e~--------------~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~ 339 (447) .. ....+.+.+.|+..-++|..-. +|+.+.. ....++...|...++.|...+..+--..... T Consensus 340 ~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~ 419 (511) T protein:vir:96 340 QGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANK 419 (511) T ss_pred HHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc Confidence 33 3456778889999998986433 2322211 1123455555555555555444322111011 Q ss_pred cCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccc--ccc---ccchhhcccccCCCC Q lcl|NC_010576. 340 SQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELF--NRN---IADGNQVGGINTPGQ 414 (447) Q Consensus 340 ~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~--~~~---~~~~~~~~~~~~~~~ 414 (447) .-..+++.++.-+-.|.++.++.+.++. |+++.--+.+++++ ++++. .++- ... ..+..+......+.+ T Consensus 420 -~~~~i~~~f~~~~p~n~~e~~d~~~kl~--G~iS~et~l~~l~~--v~d~~-~El~ri~~E~~~~~~~~~~~~~~~~~~ 493 (511) T protein:vir:96 420 -DFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSF--FQDPE-LEVKKIEEDEKESIKKAQKGIYKDPRD 493 (511) T ss_pred -ccccceEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCC--CCCHH-HHHHHHHHHHHHHHHHHhhccccCCCC Confidence 1123556666777788999999998885 88988777777533 33321 1111 100 000001101111111 Q ss_pred CCCCCCCcCCCCCCCcccccccCCc Q lcl|NC_010576. 
415 ITSDQPATASTDPLNNVSTSAIENG 439 (447) Q Consensus 415 ~~~~~~~~~~~~~~~~~~~~~~~~~ 439 (447) ..++.+ ++..++ .+++-+ T Consensus 494 ~~~~~~----~~~~~~---~~~e~~ 511 (511) T protein:vir:96 494 INDDEQ----DDDTKD---TVDKKE 511 (511) T ss_pred CCCCCC----CCCccC---cccccC Confidence 111111 111111 111111 No 192 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=60.16 E-value=0.38 Score=22.90 Aligned_cols=407 Identities=9% Similarity=-0.031 Sum_probs=137.9 Q ss_pred CchhHhhhhhcccccCCc-cccccccccccccccc-cccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQ-NQNQNTNDFLTPSNGM-TSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKH 78 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~-~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~ 78 (447) |-...+|.++.+.+..+. +.-+....++...... ........... ...+ .......-.|+..+.-+-.-|+. T Consensus 39 ~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~---~~~k--i~~n~~k~Iv~~~~~yl~g~p~~- 112 (511) T protein:vir:78 39 LQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYM---ADNR--VAHDYASYISDFINGYFLGNPIQ- 112 (511) T ss_pred hcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCccccccc---Ccce--eecchHHHHHHHHhhhhcccCce- Confidence 222222222221111100 0000001111100000 00000000000 0001 11223445556666666666765 Q ss_pred EEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccc-------eeeeccCCC-c Q lcl|NC_010576. 79 LKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSG-------SFDINTARV-G 150 (447) Q Consensus 79 ~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~-------~~~~~~~~~-~ 150 (447) |.. ++ +.....+..++.. |. ...+...+...++.+|.||+++..+..+.+.. .+++..... . 
T Consensus 113 ~~~--~d---~~~~~~l~~~~~~--n~---~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~~~p~~~~~v~dd~~~~ 182 (511) T protein:vir:78 113 YQD--DD---KDVLEAIEAFNDL--ND---VESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFIIYDNTVER 182 (511) T ss_pred eec--Cc---hHHHHHHHHHHhh--cC---hhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccceEEEEcCCCCC Confidence 321 11 1123345555432 33 23455667788889999999877765442211 111111000 0 Q ss_pred cee---eecC---------Cce-EEEEeeecc--------cccce--------ee--eccccccccccccc--ccccchh Q lcl|NC_010576. 151 KIM---QFFP---------RQV-MVRVWNDNT--------GLEQD--------LL--VSKENCIIIESPFY--AILNDTN 197 (447) Q Consensus 151 ~~~---~~~~---------~~~-~~~~~~~~~--------~~~~~--------~~--~~~~~v~~~~~~~~--~~~~~~~ 197 (447) ++. .++. ..+ .+.+|.... +.+.. .. +..-.++++++... +...... T Consensus 183 ~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~ 262 (511) T protein:vir:78 183 NSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERRKGDYEKVI 262 (511) T ss_pred ceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCCCCCCchhhhH Confidence 100 0000 000 011111110 00000 00 00011222222111 1111111 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhh Q lcl|NC_010576. 198 QTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQN 277 (447) Q Consensus 198 ~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~ 277 (447) .....+...++-..+...+. +.+--+++-......+..+...+...-.......-...+ .-.+.+.+++.++..... 
T Consensus 263 ~liDa~~~~~S~~~~~~~~~--~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~~~~~ 339 (511) T protein:vir:78 263 TLIDLYDNAESDTANYMSDL--NDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEG-RETEGSVDGGYIYKQYDV 339 (511) T ss_pred HHHHHHHHHHHHHHHHHHHh--hcchhheecCccCCchhhcccccccceeccccceecccc-ccCCCCcceeEEeecCCH Confidence 22222222222222222222 222222222111222211111100000000000000000 012344555556554444 Q ss_pred hh-HHHHHHHHHHHHHHhCCCHHHh---cCCcHHH--------------HHHHHHHHHHhHHHHHHHHHHHhhcCChhHh Q lcl|NC_010576. 278 NL-LSDVRQLQQDFYNQMGITEAIL---NGTANEQ--------------QTLGYYNRCVDVLLQYVTDAISRIALTKTAV 339 (447) Q Consensus 278 ~~-l~~~~~~~~~Ia~~fgVP~~~l---~g~~~e~--------------~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~ 339 (447) .. ....+.+.+.|+..-++|..-. +|+.+.. ....++...|...++.|...+..+--..... T Consensus 340 ~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~ 419 (511) T protein:vir:78 340 QGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANK 419 (511) T ss_pred HHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc Confidence 33 3456778889999998986433 2322211 1123455555555555555444322111011 Q ss_pred cCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccc--ccc---ccchhhcccccCCCC Q lcl|NC_010576. 340 SQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELF--NRN---IADGNQVGGINTPGQ 414 (447) Q Consensus 340 ~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~--~~~---~~~~~~~~~~~~~~~ 414 (447) .-..+++.++.-+-.|.++.++.+.++. |+++.--+.+++++ ++++. .++- ... ..+..+......+.+ T Consensus 420 -~~~~i~~~f~~~~p~n~~e~~d~~~kl~--G~iS~et~l~~l~~--v~d~~-~El~ri~~E~~~~~~~~~~~~~~~~~~ 493 (511) T protein:vir:78 420 -DFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSF--FQDPE-LEVKKIEEDEKESIKKAQKGIYKDPRD 493 (511) T ss_pred -ccccceEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCC--CCCHH-HHHHHHHHHHHHHHHHHhhccccCCCC Confidence 1123556666777788999999998885 88988777777533 33321 1111 100 000001101111111 Q ss_pred CCCCCCCcCCCCCCCcccccccCCc Q lcl|NC_010576. 
415 ITSDQPATASTDPLNNVSTSAIENG 439 (447) Q Consensus 415 ~~~~~~~~~~~~~~~~~~~~~~~~~ 439 (447) ..++.+ ++..++ .+++-+ T Consensus 494 ~~~~~~----~~~~~~---~~~e~~ 511 (511) T protein:vir:78 494 INDDEQ----DDDTKD---TVDKKE 511 (511) T ss_pred CCCCCC----CCCccC---cccccC Confidence 111111 111111 111111 No 193 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=59.52 E-value=0.39 Score=22.82 Aligned_cols=374 Identities=7% Similarity=-0.037 Sum_probs=141.1 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) .-...|+.+....+..+...... +-- ...+ ..+ +.......+|+..+.-+-.-|+.+ . T Consensus 39 ~~~~~~~~~l~~Yy~g~~~i~~~------------~~~----~~~~---~~k--i~~n~~~~Ivd~~~~~l~g~p~~~-~ 96 (470) T protein:vir:99 39 TVLKPRYRENMKLYLGKHKILTA------------PEK----ETGA---DNR--IVVNSAKYVVDVYNGYFCGIEPKL-A 96 (470) T ss_pred HhhHHHHHHHHHHhccccccccC------------ccc----ccCC---cce--eecchHHHHHHHHhhhhccCCeeE-e Confidence 22223333333333332110000 000 0000 000 112234455666666655557653 2 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCccee------- Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIM------- 153 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~------- 153 (447) ...+.. ....+..++.. | ........+....+.+|.||+++..+..+... .-.+.|..+..+. 
T Consensus 97 ~~~d~~----~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~-i~~~~p~~~~~i~d~~~~~~ 166 (470) T protein:vir:99 97 LLNDSS----KIDEIARWNRQ--E---NFFDTINEISKQCDIFGRSIASIYQGEDARPH-LMYSSPNHAFIIYDDTVQRQ 166 (470) T ss_pred eCCchh----HHHHHHHHHHh--c---CHhHHHHHHHHHHHhcCeeEEEEEeCCCCeEE-EEEEccceeEEEEcCCCCcc Confidence 211111 11234444431 3 33456677888899999999987765543221 0111111110000 Q ss_pred -----eec---CCce---EEEEeeeccc-------ccc------eeeec--ccccccccccccc--cccchhHHHHHHHH Q lcl|NC_010576. 154 -----QFF---PRQV---MVRVWNDNTG-------LEQ------DLLVS--KENCIIIESPFYA--ILNDTNQTLRMLEQ 205 (447) Q Consensus 154 -----~~~---~~~~---~~~~~~~~~~-------~~~------~~~~~--~~~v~~~~~~~~~--~~~~~~~~~~~~~~ 205 (447) .++ .+.. .+.+|..... ... ...++ .-.++++++...+ ........+..+.. T Consensus 167 ~~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~ 246 (470) T protein:vir:99 167 PLAFVHYQIDNSNNWTDAYGVIQYADKFYKFKGYDIEEDTNAAGYAINPYGLVPAVEFFENEERQGIFDSIKTLINALDK 246 (470) T ss_pred eEEEEEEEEEecCCeeEEEEEEEecCeEEEEEecccccccccccccccCCCccceEeecCCCCCCcchHhHHHHHHHHHH Confidence 000 0000 0111111000 000 00000 0112223221111 11111122222222 Q ss_pred HHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeec-----CCCceeeecCCChhhhh- Q lcl|NC_010576. 206 KIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATL-----DTQEKFVSAGMGLQNNL- 279 (447) Q Consensus 206 ~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl-----~~g~~~~~l~~~~~~~~- 279 (447) .++-......+ .+.+--++.- .....+...+... .+. ..+++.+ +.+.+++.+........ 
T Consensus 247 ~~s~~~~~~~~--~~~~~~~i~g-~~~~~~~~g~~~~----~~~------~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 313 (470) T protein:vir:99 247 VISQKANQVEY--FDNAYMYMIG-FKLPEDDEGNPKF----DFK------NNRVLYVSQLDPDTNPQIGFIAKPDADQMQ 313 (470) T ss_pred HHHHHHHHHHH--hcCceeeeec-CCcccccccchhh----hhh------hcceeeecCCCCCCCCcceEEeecCChHHH Confidence 22222222222 2233223321 1111111111111 111 1223322 34556677765544443 Q ss_pred HHHHHHHHHHHHHHhCCCHHHhc---CCcHH--------------HHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCC Q lcl|NC_010576. 280 LSDVRQLQQDFYNQMGITEAILN---GTANE--------------QQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQG 342 (447) Q Consensus 280 l~~~~~~~~~Ia~~fgVP~~~l~---g~~~e--------------~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g 342 (447) ....+.+.+.|+..-++|+.... |+.+. +.....+...|...++.+...++.+--... .. T Consensus 314 ~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~---~~ 390 (470) T protein:vir:99 314 ENLIQHLTDFIFMMAMVPNIQDKNFAGNSSGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFNNKQDQE---LW 390 (470) T ss_pred HHHHHHHHHHHHHHhCCccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccc---cc Confidence 34577888999999999975432 32221 111234555666666555555544322211 12 Q ss_pred ceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCc Q lcl|NC_010576. 343 QVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPAT 422 (447) Q Consensus 343 ~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 422 (447) ..+++.+..-+..|..+.++.+.++. |+++.-.++++++.- ++. .++-. +. ..+........+.....+.. T Consensus 391 ~~i~v~f~~~~p~~~~e~a~~~~kl~--giis~et~l~~l~~v---d~~-~E~er--i~-~E~~~~~~~~~~~~~~~d~~ 461 (470) T protein:vir:99 391 SELDFKFTRNLPEDMASAIDNAKNAE--GIVSKKTQLGMIPDI---EPD-AEMKQ--IA-KEKADAIKQTQQLSMPIDIL 461 (470) T ss_pred ccceEEeCCCCCcCHHHHHHHHHHHh--ccCCHHHHHHhCCCC---CHH-HHHHH--HH-HHHHHHHHHHHhhcCCCCcC Confidence 35566667777788999999998885 789987777776432 111 11110 00 00000000000001111111 Q ss_pred CCCCCCCcc Q lcl|NC_010576. 
423 ASTDPLNNV 431 (447) Q Consensus 423 ~~~~~~~~~ 431 (447) ..++..+.+ T Consensus 462 ~~d~~~ee~ 470 (470) T protein:vir:99 462 KRDNNAEEE 470 (470) T ss_pred CCCCCccCC Confidence 111111111 No 194 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=56.90 E-value=0.45 Score=22.51 Aligned_cols=383 Identities=10% Similarity=0.010 Sum_probs=136.9 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) +....|+.+....+..+... . ..... ........ ...+ +......-+|+..+.-+..-|+. |. T Consensus 44 ~~~~~~~~~~~~yY~g~~~~---i---~~~~~-------~~~~~~~~-~~~k--i~~n~~~~ivd~~~~~l~g~~~~-~~ 106 (481) T protein:vir:10 44 TEQVPRLEMLESYYLNRNTD---I---LAGER-------RLQKYGDK-ADHR--AVHNYAKYVSRFIVGYLTGNPIT-IT 106 (481) T ss_pred HHHHHHHHHHHHHhcCCCcc---c---ccCcc-------cccccccc-ccce--eecchHHHHHHHHHhhhccCCce-Ee Confidence 12223333333333222100 0 00000 00000000 0001 12334556777777766666664 32 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCC----------c Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARV----------G 150 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~----------~ 150 (447) .. + ...+..+..++.. |. ...+...+....+.+|.||+++..+..+... +.+.++.. . 
T Consensus 107 ~~--d---~~~~~~l~~~~~~--n~---~~~~~~~~~~~~~~~G~~~~~~~~d~dg~~~--i~~~~p~~~~~v~d~~~~~ 174 (481) T protein:vir:10 107 HQ--D---NQTNDKIIELNDL--ND---ADEVNSDLALNLSIYGRAYEIVYRDFEDRDT--FKVLDPKSTFVVYDQTLDK 174 (481) T ss_pred cC--C---hhHHHHHHHHHHh--cC---hhHHHHHHHHHHHhcCeEEEEEEeCCCCeEE--EEEEcccceEEEEcCCCCC Confidence 21 1 1123456666642 33 3357777888999999999987665543221 11111111 1 Q ss_pred ceee---ec---CC-c--e-EEEEeeec--------ccccce---eeec--cccccccccccc--ccccchhHHHHHHHH Q lcl|NC_010576. 151 KIMQ---FF---PR-Q--V-MVRVWNDN--------TGLEQD---LLVS--KENCIIIESPFY--AILNDTNQTLRMLEQ 205 (447) Q Consensus 151 ~~~~---~~---~~-~--~-~~~~~~~~--------~~~~~~---~~~~--~~~v~~~~~~~~--~~~~~~~~~~~~~~~ 205 (447) ++.. ++ .. . + .+.+|... .+.+.. ..++ .=.++++++... +...........+.. T Consensus 175 ~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~lida~~~ 254 (481) T protein:vir:10 175 KVVAGVRYFEKQDKDKVPVQHVEVYTTDKIYYIEIKGGTYHRVEEVEHYYNDVPIIEYLNDQFKQGDFENVIALIDLYDS 254 (481) T ss_pred ceEEEEEEEEEeeCCCceEEEEEEEecCeEEEEEecCCceeecccccccCCceeEEEeecCCCCCCchhhHHHHHHHHHH Confidence 1100 00 00 0 0 01111100 000000 0010 012333333211 111111222222222 Q ss_pred HHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhh-hHHHHH Q lcl|NC_010576. 206 KIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNN-LLSDVR 284 (447) Q Consensus 206 ~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~-~l~~~~ 284 (447) .++.+.+...+. +.+--++.-.....++.. +.++..- .............+.+.+++-+....... 
..+.++ T Consensus 255 ~~s~~~~~~~~~--~~~~~~~~g~~~~~~~~~----~~~~~~~-~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 327 (481) T protein:vir:10 255 AQSDTANYMTDL--NDAMLAIIGNVDLDSEDA----KAFRDAN-MIHLEPGTNANGSEGKAEVKYVYKQYDVAGVEAYKK 327 (481) T ss_pred HHHHHHHHHHHh--cCceeEeecCcCCCccch----hhhhhcc-ceeccccccccCCCCCcceeEEeecCCHHHHHHHHH Confidence 222222222222 222222321111122111 1111100 00000000011122344555554443333 345577 Q ss_pred HHHHHHHHHhCCCHHHhc---CCcHHHH--------------HHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEE Q lcl|NC_010576. 285 QLQQDFYNQMGITEAILN---GTANEQQ--------------TLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVY 347 (447) Q Consensus 285 ~~~~~Ia~~fgVP~~~l~---g~~~e~~--------------~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f 347 (447) .+.+.|+..-++|....+ |+.+... ....+...|.-.++.+...++..-... .....+++ T Consensus 328 ~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~---~~~~~i~v 404 (481) T protein:vir:10 328 RLQNDIHKYTNTPDLNDEQFSGVQSGESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLTGLKQ---HNYAELTI 404 (481) T ss_pred HHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCc---cccceeeE Confidence 888889999999875442 3222111 112233333333333333333221111 11234566 Q ss_pred ecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcccccccc--ccccchhhcccccCCCCCCCCCCCcCCC Q lcl|NC_010576. 348 YRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFN--RNIADGNQVGGINTPGQITSDQPATAST 425 (447) Q Consensus 348 ~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 425 (447) .+..-...|.++.++++.++. |+++.-.+.+++++ ++++. .++-. ..-... .+........+..+ .++ T Consensus 405 ~f~~~~~~~~~~~a~~~~kl~--g~is~et~~~~l~~--i~d~~-~E~~ri~~E~~~~---~~~~~~~~~~~~~~--~~~ 474 (481) T protein:vir:10 405 TFTPNLPKSMMESINAFNALS--GGVSESTRLSLLDF--IDNPK-EELEKMQEEEAQR---EKQADKRGYGEAFE--NHL 474 (481) T ss_pred EeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCC--CCCHH-HHHHHHHHHHHHH---HhhhhhccCCccCC--CCC Confidence 666777788999999998874 78888777777544 33321 11110 000000 00000000000000 000 Q ss_pred CCCCccc Q lcl|NC_010576. 
426 DPLNNVS 432 (447) Q Consensus 426 ~~~~~~~ 432 (447) ++.+.+. T Consensus 475 ~~dd~~g 481 (481) T protein:vir:10 475 NVDDSNG 481 (481) T ss_pred CCCCCCC Confidence 0001111 No 195 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=55.64 E-value=0.47 Score=22.36 Aligned_cols=376 Identities=9% Similarity=0.021 Sum_probs=140.2 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) +.-..|+.+....+..+..--... .. ....... .......-.......-+|+..+.-+-.-|+. |. T Consensus 57 ~~~~~r~~~l~~YY~g~~~i~~~~----~~------~~~~~~~---~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~-~~ 122 (492) T protein:vir:97 57 LEKLPEISIGQEYYEQRPDIVKEP----KP------VDATGAV---DPLKPDDRMITNFHANLVDQKVSYIVGKPIA-FK 122 (492) T ss_pred HHHHHHHHHHHHHhcccCcccccc----cc------ccccccc---cccccccccccchHHHHHHHHhhhhcccCce-ec Confidence 212222222333333321100000 00 0000000 0000000011234556677777666666665 32 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCC----------c Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARV----------G 150 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~----------~ 150 (447) . ++. .....+..++. |.. ......+...++.+|.||+++..+..+... +.+..+.. . 
T Consensus 123 ~--~d~---~~~~~l~~~~~---n~~---~~~~~~~~~~~~~~G~a~~~v~~d~dg~~~--~~~~~p~~~~~i~d~~~~~ 189 (492) T protein:vir:97 123 H--TDD---EVVKRIDEVLG---NRF---DDKLHSVLTGASNKGIEWLHPYLDEEGEFK--LFRVPAEQGIPIWTDKEHE 189 (492) T ss_pred c--Cch---HHHHHHHHHHh---ccH---HHHHHHHHHHHhhcCeEEEEEEecCCCceE--EEEEcccceEEEEcCCCCC Confidence 1 111 11223333332 332 234445678888999999887766544321 11111111 1 Q ss_pred cee---eecC--CceEEEEeee--------cccc----------cce---eee--ccccccccccccc--ccccchhHHH Q lcl|NC_010576. 151 KIM---QFFP--RQVMVRVWND--------NTGL----------EQD---LLV--SKENCIIIESPFY--AILNDTNQTL 200 (447) Q Consensus 151 ~~~---~~~~--~~~~~~~~~~--------~~~~----------~~~---~~~--~~~~v~~~~~~~~--~~~~~~~~~~ 200 (447) ++. .++. ....+.+|.. ..+. ... ..+ ..-.++++++... +......... T Consensus 190 ~~~~~vr~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~li 269 (492) T protein:vir:97 190 ELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLI 269 (492) T ss_pred ceEEEEEEEeeccceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecCCCCCCCchHhHHHHH Confidence 110 0000 0000011100 0000 000 000 0001222222111 1111111111 Q ss_pred HHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhh-h Q lcl|NC_010576. 201 RMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNN-L 279 (447) Q Consensus 201 ~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~-~ 279 (447) ..+...++-..+...+ ...+-.+++ +. ..+..++ +.... . ..+++.++.+.+++.+....... . 
T Consensus 270 Da~d~~~S~~~~~~~~--~~~~~l~~~--g~-~~~~~~~----~~~~~----~--~~~~~~~~~~~~~~~l~~~~~~~~~ 334 (492) T protein:vir:97 270 DAYNRRLSDLSNTFKD--SNELTYVLK--NY-DDQELPE----FKRLL----R--YYGAIKVSDNGGVDTIQVEVPVENS 334 (492) T ss_pred HHHHHHHHHHHHHHHH--hccceeeee--cC-Ccccchh----HHHHH----h--hccceecCCCCcceeEeccCCHHHH Confidence 2222222222222222 222222222 21 1111111 21111 1 23455566666666665444333 3 Q ss_pred HHHHHHHHHHHHHHhCCCHHH---hcCCcHHH--------------HHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCC Q lcl|NC_010576. 280 LSDVRQLQQDFYNQMGITEAI---LNGTANEQ--------------QTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQG 342 (447) Q Consensus 280 l~~~~~~~~~Ia~~fgVP~~~---l~g~~~e~--------------~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g 342 (447) ....+.+.+.|+..-++|..- ++|+.+.. .....+...|...++.|...++.+ ... T Consensus 335 ~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~-------~~~ 407 (492) T protein:vir:97 335 KKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK-------GEH 407 (492) T ss_pred HHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-------ccc Confidence 455677788888888887533 33332211 112244555666665555544321 122 Q ss_pred ceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcccccccc--ccccchhhcccccCCCCCCCCCC Q lcl|NC_010576. 343 QVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFN--RNIADGNQVGGINTPGQITSDQP 420 (447) Q Consensus 343 ~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 420 (447) ..+++.++.-+-.|.++.++++.++. |+++.--+.++++.- +++. .++-. ..-....+ ......+...+..+ T Consensus 408 ~~i~v~f~~~~p~~~~e~a~~~~kl~--G~iS~et~l~~l~~v--~d~~-~Eleri~~E~~~~~~-~~~~~~~~~~~~~~ 481 (492) T protein:vir:97 408 KDVDISFNYNKVANTELQVQTAQQSM--GIVSHETVLENHPFV--EDLQ-AELERIEQEQTEYNK-QLPNLDDGGADSAQ 481 (492) T ss_pred ceeeEEecCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCC--CCHH-HHHHHHHHHHHHHHH-hhhccccCCCCCCc Confidence 45666667777789999999998884 889987777776543 3321 11110 00000000 00011111122222 Q ss_pred CcCCCCCCCcc Q lcl|NC_010576. 
421 ATASTDPLNNV 431 (447) Q Consensus 421 ~~~~~~~~~~~ 431 (447) ..++.++..+| T Consensus 482 ~~~~~~~~~~e 492 (492) T protein:vir:97 482 QQERSNNKESE 492 (492) T ss_pred ccccccccccC Confidence 22222222222 No 196 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=55.45 E-value=0.48 Score=22.33 Aligned_cols=395 Identities=7% Similarity=-0.023 Sum_probs=138.2 Q ss_pred hhHhhh----hhcc-cccCCcccc----cc---cccccccc-ccccccccccc--------cCCcccccchhhhhhHHHH Q lcl|NC_010576. 3 SSDRLL----HSWN-AFQSNQNQN----QN---TNDFLTPS-NGMTSFGGYYG--------RGQSNYSRSYSYNKADLIK 61 (447) Q Consensus 3 ~~~~l~----~~~~-~f~~~~~~~----~~---~~~~~~~~-~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~v~ 61 (447) .|++|. .+++ .|..++... .. +...+... -|..-+-|... ..+... .+.......-. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~--~~~~~~~n~~k 78 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPV--NRRQLSMNLPK 78 (496) T ss_pred ChhHHHHHHHHHHHHhccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCcc--ccceeecchHH Confidence 344432 2221 111111000 00 00000000 01000011110 000000 11112223445 Q ss_pred HHHHHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCccc-c Q lcl|NC_010576. 62 SVITRIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDS-G 140 (447) Q Consensus 62 ~cv~~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~-~ 140 (447) .+++..|+-+..=|..+ .. ++ +.....|..+|. -| ....-...++...+..|.+|+.+..+..+.+. . 
T Consensus 79 ~i~~~~a~~l~~~p~~i-~~--~d---~~~~e~l~~~~~--~n---~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~ 147 (496) T protein:vir:38 79 VTAKYMSKLLFNEKVKI-NI--DD---KAAEEFVLNVLK--TN---GFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVS 147 (496) T ss_pred HHHHHHhhhhhCCcceE-ee--CC---hHHHHHHHHHHh--cc---CHHHHHHHHHHHHhhhCcEEEEEEEcCCCcEEEE Confidence 56677777666656542 21 11 112223344443 12 22334555667778889999887776543221 1 Q ss_pred ------eeeeccCCCccee---ee---cCCce-----------------EEEEeeecccc--cceeee------------ Q lcl|NC_010576. 141 ------SFDINTARVGKIM---QF---FPRQV-----------------MVRVWNDNTGL--EQDLLV------------ 177 (447) Q Consensus 141 ------~~~~~~~~~~~~~---~~---~~~~~-----------------~~~~~~~~~~~--~~~~~~------------ 177 (447) .+|+ ......+. -+ ..+.. ....|...... +..+.+ T Consensus 148 ~v~~~~~~P~-~~~~~~~~~~~f~~~~~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~~~~~~~ 226 (496) T protein:vir:38 148 FATADCMYPL-SNDSENVDECVIANSFHKNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFDDIEPVVP 226 (496) T ss_pred EEcccceEEE-EecCCcEEEEEEEEEEEeCCeEEEEEEEEEEeCceEEEEEEEEecCCccccCcccccccccccccccee Confidence 1111 11111111 00 00000 00111100000 000000 Q ss_pred ----cccccccccccccc----cccchhHHHHHHHHHHHHHHHHHHH-h---hcCccccee-----eeCCcCChHHHHHH Q lcl|NC_010576. 178 ----SKENCIIIESPFYA----ILNDTNQTLRMLEQKIKLMNSQDNR-A---SSGKLNGFI-----QFPYSTKSTARAAQ 240 (447) Q Consensus 178 ----~~~~v~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~-~---n~~~~~gvl-----~~~~~~~~~~~~~~ 240 (447) ..--+.|++.+..+ ....+.+.+..+...++.+...... . ..++.+-++ ........+ . T Consensus 227 ~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~g~----~ 302 (496) T protein:vir:38 227 LPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGS----T 302 (496) T ss_pred ecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhhcccceecchHHhhccCCCCCc----c Confidence 00001122222110 0011112233223333322211111 1 112222111 100000000 0 Q ss_pred HHHHHHHHHHHhccCCcceeecCCCceeeecCCCh-hhhhHHHHHHHHHHHHHHhCCCHHHhcCC------cHHHH--H- Q lcl|NC_010576. 
241 AARRKQEIENEMANNKYGVATLDTQEKFVSAGMGL-QNNLLSDVRQLQQDFYNQMGITEAILNGT------ANEQQ--T- 310 (447) Q Consensus 241 ~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~-~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~------~~e~~--~- 310 (447) ...+.... +.+. .......+++..++.++... .++.....+....+|+..-|+||..++.+ ++|-. . T Consensus 303 ~~~~~~~~-~~~~--~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~ 379 (496) T protein:vir:38 303 TQYFDSTD-EAFF--LYQGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKS 379 (496) T ss_pred ccCCCCcc-ceEE--EeecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHH Confidence 00000000 0000 00001122333466666554 34566777888889999999999998632 22211 1 Q ss_pred ---------HHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHH Q lcl|NC_010576. 311 ---------LGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIREL 381 (447) Q Consensus 311 ---------~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~ 381 (447) ...++.+|..++..+-+..+...........+..+.|.++.-+..|..+.++...+++.+|+++.-.+++. T Consensus 380 ~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~GiiS~et~l~~ 459 (496) T protein:vir:38 380 ETYQTKNSHSQLIEQGIKEMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRYTNAKNQGMIPLKIALQR 459 (496) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHh Confidence 11334444444444443222111111111224567777778788899999999999999999998877654 Q ss_pred hCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCcc Q lcl|NC_010576. 382 TGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNV 431 (447) Q Consensus 382 ~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (447) . +-++++...+-+ ...........+ ..+. 
++...+.| T Consensus 460 ~--~~~~d~ea~~el----~ri~~E~~~~~~-----~~d~--~~~~~~~e 496 (496) T protein:vir:38 460 A--WNITEAEADEWA----EMLAKEKQAEMP-----NNDM--NGIFGEEE 496 (496) T ss_pred c--CCCChHHHHHHH----HHHHHhhhccCc-----cccc--cCCCCCCC Confidence 3 222222222111 111000000000 0000 00000111 No 197 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=51.27 E-value=0.59 Score=21.86 Aligned_cols=376 Identities=10% Similarity=0.012 Sum_probs=138.6 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) +.-..|+.+....+..+...-... ... ...... .......=+......-+|+..+.-+-.-|+.+ . T Consensus 48 ~~~~~r~~~l~~YY~g~~~i~~~~----~~~------~~~~~~---~~~~~~~ki~~n~~k~Ivd~~~~~l~G~p~~~-~ 113 (483) T protein:vir:12 48 LEKLPEISIGQEYYEQRPDIVKEP----KPV------DATGAV---DPLKPDDRMITNFHANLVDQKVSYIVGKPIAF-K 113 (483) T ss_pred HHHHHHHHHHHHHhcccccccccc----ccc------cccccc---cccccccccccchHHHHHHHHhhhhcccCcee-c Confidence 222223333333333321000000 000 000000 00000000123345566677666665566652 1 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCC-C---------c Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTAR-V---------G 150 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~-~---------~ 150 (447) .++. .....+..++. |.. ......+....+.+|.||+++..+..+... +.+..+. + . 
T Consensus 114 --~~d~---~~~~~l~~~~~---n~~---~~~~~~~~~~~~~~G~~y~~v~~d~d~~~~--i~~~~p~~~~~v~d~~~~~ 180 (483) T protein:vir:12 114 --HTDD---EVVKRIDEVLG---NRF---DDKLHSVLTGASNKGIEWLHPYLDEEGEFK--LFRVPAEQGIPIWTDKEHE 180 (483) T ss_pred --cCCh---HHHHHHHHHHh---ccH---HHHHHHHHHHHhhCCeEEEEEEEcCCCceE--EEEEcccceEEEEcCCCCC Confidence 1111 11122333332 322 233445667888999999887766544321 1111111 1 0 Q ss_pred cee---eecC--CceEEEEeeec-------cccc---------ce-ee------eccccccccccccc--ccccchhHHH Q lcl|NC_010576. 151 KIM---QFFP--RQVMVRVWNDN-------TGLE---------QD-LL------VSKENCIIIESPFY--AILNDTNQTL 200 (447) Q Consensus 151 ~~~---~~~~--~~~~~~~~~~~-------~~~~---------~~-~~------~~~~~v~~~~~~~~--~~~~~~~~~~ 200 (447) ++. .++. ....+.+|... .+.. .. .. +..-.++++++... +......... T Consensus 181 ~~~~~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~li 260 (483) T protein:vir:12 181 ELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLI 260 (483) T ss_pred ceEEEEEEEEeecceEEEEEecCeEEEEEEeCCeeeecccccccccccccccCCCCccceEEecCCCCCCCchhhHHHHH Confidence 010 0000 00001111000 0000 00 00 00001222222111 1111111111 Q ss_pred HHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhh-hh Q lcl|NC_010576. 201 RMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQN-NL 279 (447) Q Consensus 201 ~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~-~~ 279 (447) ..+...++...+...+ .+.+-.+++ +.. .+..++. .... . ..+++-++.+.+++.+...... .. 
T Consensus 261 Da~d~~~S~~~~~~~~--~~~~~lv~~--g~~-~~~~~~~----~~~~----~--~~~~~~~~~~~~~~~l~~~~~~~~~ 325 (483) T protein:vir:12 261 DAYNRRLSDLSNTFKD--SNELTYVLT--NYD-DQELPEF----KRLL----R--YYGAIKVSDNGGVDTIQVEVPVENS 325 (483) T ss_pred HHHHHHHHHHHHHHHH--hcCceeeee--cCC-cccchhH----HHhh----h--hccccccCCCCcceEEeecCCHHHH Confidence 2222222222222222 223322332 211 1111111 1111 1 2234555666666666544333 33 Q ss_pred HHHHHHHHHHHHHHhCCCHHH---hcCCcHHH--------------HHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCC Q lcl|NC_010576. 280 LSDVRQLQQDFYNQMGITEAI---LNGTANEQ--------------QTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQG 342 (447) Q Consensus 280 l~~~~~~~~~Ia~~fgVP~~~---l~g~~~e~--------------~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g 342 (447) ....+.+.+.|+..-++|..- ++|+.+.. .....+...|...++.|...++.+. .. T Consensus 326 ~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~~~~-------~~ 398 (483) T protein:vir:12 326 KKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG-------EH 398 (483) T ss_pred HHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-------cc Confidence 455677778888888887533 23332211 1123455666666666655544221 23 Q ss_pred ceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcccccccc--ccccchhhcccccCCCCCCCCCC Q lcl|NC_010576. 343 QVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFN--RNIADGNQVGGINTPGQITSDQP 420 (447) Q Consensus 343 ~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 420 (447) ..+++.++.-+-.|.++.++.+.++ .|+++.--++++++.- +++. .++-. ..-....+.. ....+...++.. T Consensus 399 ~~i~v~f~~~~p~~~~~~a~~~~kl--~GiiS~et~~~~~~~v--~d~~-~E~~ri~~E~~~~~~~~-~~~~~~~~d~~~ 472 (483) T protein:vir:12 399 KDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFV--EDLQ-AELERIEQEQMEYNKQL-PNLDDGGADGAQ 472 (483) T ss_pred ceeeEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCC--CCHH-HHHHHHHHHHHHHHhhc-ccccccccCCcc Confidence 4566666777788999999999888 4899988887776543 3321 11111 0000000000 000111111111 Q ss_pred CcCCCCCCCcccccc Q lcl|NC_010576. 
421 ATASTDPLNNVSTSA 435 (447) Q Consensus 421 ~~~~~~~~~~~~~~~ 435 (447) ..+.. +++.++ T Consensus 473 ~~~~~----~~~e~e 483 (483) T protein:vir:12 473 QQERS----NNKESE 483 (483) T ss_pred cCCCC----CcccCC Confidence 11111 111111 No 198 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=51.01 E-value=0.59 Score=21.83 Aligned_cols=397 Identities=7% Similarity=-0.027 Sum_probs=136.4 Q ss_pred CchhHhhhhhcc-cccCCcccc----cccccc---ccc-cccccccccccc-------cCCcccccchhhhhhHHHHHHH Q lcl|NC_010576. 1 MASSDRLLHSWN-AFQSNQNQN----QNTNDF---LTP-SNGMTSFGGYYG-------RGQSNYSRSYSYNKADLIKSVI 64 (447) Q Consensus 1 Mg~~~~l~~~~~-~f~~~~~~~----~~~~~~---~~~-~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~v~~cv 64 (447) =++.++++.+++ .|-.++... .....+ ... .-|..-+-|... ....... .+...+......++ T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~-~~~~~s~n~~~~iv 81 (499) T protein:vir:80 3 NQIIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPV-NRRQLSMNLPKVTA 81 (499) T ss_pred hHHHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCcc-ccceeecchHHHHH Confidence 123333333332 111111110 000000 000 001000011100 0000000 01112223344556 Q ss_pred HHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCccc----- Q lcl|NC_010576. 65 TRIALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDS----- 139 (447) Q Consensus 65 ~~ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~----- 139 (447) +..|+-+..=|..+ .. ++ +..+..+..+|. -|.+ ..-.+..+...+..|.+|+.+..+..+.+. 
T Consensus 82 ~~~a~~l~~ep~~i-~~--~d---~~~~e~l~~~~~--~n~f---~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v~ 150 (499) T protein:vir:80 82 KYMSKLLFNEKVKI-NI--DD---ETAEEFVLNVLK--TNGF---TKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFAT 150 (499) T ss_pred HHHHHhhhCCcceE-ee--CC---HHHHHHHHHHHh--hccH---HHHHHHHHHHHhhcCcEEEEEEECCCCcEEEEEEc Confidence 66666665555542 11 11 111222333432 2222 223344455666678888877665432111 Q ss_pred -ce-eeeccCCCcceee---ec---CCce----------------EEEE----eeecccc--cceeee------------ Q lcl|NC_010576. 140 -GS-FDINTARVGKIMQ---FF---PRQV----------------MVRV----WNDNTGL--EQDLLV------------ 177 (447) Q Consensus 140 -~~-~~~~~~~~~~~~~---~~---~~~~----------------~~~~----~~~~~~~--~~~~~~------------ 177 (447) .. +|+. .....+.. +. .+.. .+.+ |...... +..+.+ T Consensus 151 a~~~~Pi~-~d~~~~~~~~f~~~~~~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~~ 229 (499) T protein:vir:80 151 ADCMYPLS-NDSENVDECLIANSFHKNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVVP 229 (499) T ss_pred CCceEEEE-ecCCCeEEEEEEEEEeecCeEEEEEEEEEecccceeeEEEEEEEEeccCccccCcccchhhhccCcCCcee Confidence 11 2211 11111110 00 0000 0000 0000000 000000 Q ss_pred ----cccccccccccccc----cccchhHHHHHHHHHHHHHHHHHHH----hhcCcccce-----eeeCCcCChHHHHHH Q lcl|NC_010576. 178 ----SKENCIIIESPFYA----ILNDTNQTLRMLEQKIKLMNSQDNR----ASSGKLNGF-----IQFPYSTKSTARAAQ 240 (447) Q Consensus 178 ----~~~~v~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~----~n~~~~~gv-----l~~~~~~~~~~~~~~ 240 (447) ..-.+.|++.+..+ ....+-+.+..+...++.+...... ...++.+-+ +........+. T Consensus 230 ~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~g~~---- 305 (499) T protein:vir:80 230 LPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGST---- 305 (499) T ss_pred ecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhcccceecchhhhhccCCCCCCc---- Confidence 00002233322111 0011122333333333332222111 111222211 11111000000 Q ss_pred HHHHHHHHHHHhccCCcceeecCCCceeeecCCCh-hhhhHHHHHHHHHHHHHHhCCCHHHhcCC------cHHHHH--- Q lcl|NC_010576. 
241 AARRKQEIENEMANNKYGVATLDTQEKFVSAGMGL-QNNLLSDVRQLQQDFYNQMGITEAILNGT------ANEQQT--- 310 (447) Q Consensus 241 ~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~-~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~------~~e~~~--- 310 (447) ...+... ...+. ......-+++..++.++... .++....++...++|...-|++++.++.. ++|-.. T Consensus 306 ~~~~~~~-~~~~~--~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~~~ 382 (499) T protein:vir:80 306 TQYFDST-DEAFF--LYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKS 382 (499) T ss_pred ccCCCcc-cceee--EeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHHHHH Confidence 0000000 00000 00000112222466666553 45667778888889999999999998632 223211 Q ss_pred ---------HHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHH Q lcl|NC_010576. 311 ---------LGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIREL 381 (447) Q Consensus 311 ---------~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~ 381 (447) ...++.+|..++..|-...+...+..........+.|+++.-...|..+.++...+++.+|+|+.-.++.. T Consensus 383 ~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~ 462 (499) T protein:vir:80 383 ETYQTKNSHSQLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTINRYTTAKNQGMIPLKIALQR 462 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHhh Confidence 11223333333333332222111111111223568888888888899999999999999999998888755 Q ss_pred h-CCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCcc Q lcl|NC_010576. 382 T-GKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNV 431 (447) Q Consensus 382 ~-gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (447) . |. ++....+.+. ..... 
+....++++.+ +--.+.| T Consensus 463 ~~~~---~d~ea~~el~----~i~~E-----~~~~~~~~d~~--g~~ge~e 499 (499) T protein:vir:80 463 AWNI---TEAEADEWAE----MLAKE-----KQAEIPNNDMT--GIFGEEE 499 (499) T ss_pred cCCC---ChHHHHHHHH----HHHHH-----hhcCCCCCCcc--ccCCCCC Confidence 3 32 2222222111 00000 00000111110 0000001 No 199 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=48.68 E-value=0.66 Score=21.57 Aligned_cols=403 Identities=8% Similarity=0.038 Sum_probs=138.1 Q ss_pred CchhHhhhhhcccccCCcc---------cccccccccccc-------ccccccccccccCCccc--------------cc Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQN---------QNQNTNDFLTPS-------NGMTSFGGYYGRGQSNY--------------SR 50 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~---------~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~--------------~~ 50 (447) |.|. ++++|++..+... ...++..+.... ....+..+...-.+..+ .. T Consensus 1 m~f~--~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~ 78 (523) T protein:vir:68 1 MKFN--ILSLFAPWAKMDERDYKDQEKENLESITSPKLDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPGLKSTRELIDT 78 (523) T ss_pred CCCc--hhhhhhhhhhhhhhhhhhhhhccCCCccccCCCCcceeeeccccccccccchhhhhhhhccccccchHHHHHHH Confidence 8872 3333333232111 011222221111 00111100000000001 11 Q ss_pred chhhhhhHHHHHHHHHHHHhhccC-----ceEEEEEcC--CCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhc Q lcl|NC_010576. 51 SYSYNKADLIKSVITRIALDASMV-----DFKHLKIDP--ISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDE 123 (447) Q Consensus 51 ~~~~~~~~~v~~cv~~ia~~ia~l-----p~~~~r~~~--~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~ 123 (447) =+..+.+|-|.+||+-|.+++.-. |+.+--.+. ....++.......++|+. -|-...+++ ++..+... 
T Consensus 79 YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~l-l~F~~~~~~----~fR~WYVD 153 (523) T protein:vir:68 79 YRNLMTNYEVDNAVSEIVSDAIVYEDDTEVVSINLDNTKFSPNIKSMMLDEFNEVLNH-LSFQRKGSD----HFRRWYVD 153 (523) T ss_pred HHHHhhccchhhHHHHhhcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHH-hccchhhhH----HHHhheee Confidence 123456788999999998887533 222111110 000111112223344432 122233333 44556677 Q ss_pred CCeeEEEeeccCC---cccceeeeccCCCcceeeecC----------CceEEEEee----------ecccccceeeeccc Q lcl|NC_010576. 124 GQIAMVPIDTTVD---PDSGSFDINTARVGKIMQFFP----------RQVMVRVWN----------DNTGLEQDLLVSKE 180 (447) Q Consensus 124 Gna~i~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~----------~~~~~~~~~~~~~~ 180 (447) |-.|..+.-+... .+.....+.|..+..+..... +-..+.+|. ..+..+..+.++.+ T Consensus 154 gRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~g~~vi~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~d 233 (523) T protein:vir:68 154 SRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREVITTTEAGVKIVKGYKEYFIYDTSHESYACDGRIYEAGTKIKIPKA 233 (523) T ss_pred eEEEEEEEeeCCCccccceeeeeeCCcceeEEEeecCCCCcchhhhhhhhhheeeccccccccccccccCCCcceecchh Confidence 8777665444332 222333333332222111100 000011111 11122345566666 Q ss_pred cccccccccccccc-chhHHHHHHHHHHHHHHHHHH----H--hhcCccc-ceeeeCCcCChHHHHHHHHHHHHHHHHHh Q lcl|NC_010576. 181 NCIIIESPFYAILN-DTNQTLRMLEQKIKLMNSQDN----R--ASSGKLN-GFIQFPYSTKSTARAAQAARRKQEIENEM 252 (447) Q Consensus 181 ~v~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~----~--~n~~~~~-gvl~~~~~~~~~~~~~~~~~~~~~~~~~~ 252 (447) -|.|..+.+.+... ..-+-+..+...+..+.-... + .+.---+ +.|.++ .+. 
+..+++....+.+.+ T Consensus 234 AI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvG-nlP----k~KAeqYl~~im~k~ 308 (523) T protein:vir:68 234 AIVYAHSGLVDCCGKNIIGYLHRAIKPANQLKLLEDAVVIYRITRAPDRRVWYVDTG-NMP----SRKAAEHMQHVMNTM 308 (523) T ss_pred heeeeeccceeCCCCceeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecC-CCC----chhHHHHHHHHHHhh Confidence 66666654433221 111122222222222111111 1 1111111 223333 232 223344444443333 Q ss_pred cc------CCcce------e-ec-------CC---CceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcC------ Q lcl|NC_010576. 253 AN------NKYGV------A-TL-------DT---QEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNG------ 303 (447) Q Consensus 253 ~~------n~~~~------~-vl-------~~---g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g------ 303 (447) ++ +.|.| + .| -+ |.+++.|.-.-.--+++..++..+...++++||.+-|.+ T Consensus 309 kNKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~ 388 (523) T protein:vir:68 309 KNRIAYDATTGKIKNQQHIMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDVRWFRNALYMALRIPITRIPSDQGGIQ 388 (523) T ss_pred cceeEEeccCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCcceeecCCCccee Confidence 21 11222 1 11 13 334444433333346888999999999999999988831 Q ss_pred --CcH-----HHHHHHHHHHHHhHHHHHHHHHHHhhc-----CChhHhc-CCceEEEec--chhh----hcC-HHHHHHH Q lcl|NC_010576. 304 --TAN-----EQQTLGYYNRCVDVLLQYVTDAISRIA-----LTKTAVS-QGQVLVYYR--NPFK----LVP-VEQLATV 363 (447) Q Consensus 304 --~~~-----e~~~~~f~~~ti~P~~~~ie~~l~~kL-----l~~~e~~-~g~~i~f~~--~~l~----~~d-~~~~~~~ 363 (447) .++ |-....|+..-=.-+...+.+.|-..| +++.|+. ...+|+|++ |.-. ... 
+..|+.+ T Consensus 389 ~Gr~~EItRDEikF~KFI~rLR~rFs~lf~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~ 468 (523) T protein:vir:68 389 FDAGTSITRDELSFGKFIRELQHKFEEIFLDPLKTNLILKGIITEDEWNDEINNIKIKFHRDSYFSELKDAEILERRINM 468 (523) T ss_pred cccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHH Confidence 112 222233332222223334444444443 4556663 223455443 3221 000 1122222 Q ss_pred HHHHHh--CCCcCHHHHHHH-hCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCccccc Q lcl|NC_010576. 364 ADVLTR--NAIYTPNEIREL-TGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTS 434 (447) Q Consensus 364 ~~~~~~--~G~~t~NE~R~~-~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (447) +..+-. +-.++.+=+|+. +.+.-.+ + ... ..+..+...++--.+..++ +.+= T Consensus 469 l~~~dpyvGky~s~~yi~k~ILr~tDee------i-~~~---~kqI~~E~k~~~~~~p~~e---------~~~f 523 (523) T protein:vir:68 469 LQMAEPFIGKYISHRTAMKDILQMSDEE------I-EQE---AKQIEEESKEARFQDPDQE---------QEDF 523 (523) T ss_pred HHHhhhhhcccchhHHHHHHHhccCHHH------H-HHH---HHHHHHHhhcCCCCCCchh---------hhcC Confidence 222110 012233333221 1221000 0 000 0000000111111110000 0000 No 200 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=45.97 E-value=0.75 Score=21.27 Aligned_cols=406 Identities=8% Similarity=-0.055 Sum_probs=137.7 Q ss_pred CchhHhhhhhcccccCCcc-ccccccccccccccc-cccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQN-QNQNTNDFLTPSNGM-TSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKH 78 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~-~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~ 78 (447) +--..+|.++.+.+..+.. +-+....++...... ...........+ ..+ .......-.|+..+.-+-.-|+. 
T Consensus 39 ~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~---~~k--i~~n~~k~Iv~~~~~yl~g~p~~- 112 (511) T protein:vir:99 39 LQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMA---DNR--VAHDYASYISDFINGYFLGNPIQ- 112 (511) T ss_pred hccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccC---cce--eecchHHHHHHHHHhhhcccCce- Confidence 1111222222211111100 000001111000000 000000000000 001 11223444556666666566765 Q ss_pred EEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccc-------eeeeccCCC-c Q lcl|NC_010576. 79 LKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSG-------SFDINTARV-G 150 (447) Q Consensus 79 ~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~-------~~~~~~~~~-~ 150 (447) +.. ++ +.....+..+++. |. .......+...++.+|.||+++..+..+.+.. .+++..... . T Consensus 113 ~~~--~d---~~~~~~l~~~~~~--n~---~~~~~~~~~~~~~i~G~a~~~vy~ded~~~~i~~~~p~~~~~vyd~~~~~ 182 (511) T protein:vir:99 113 YQD--DD---KDVLEAIEAFNDL--ND---VESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIER 182 (511) T ss_pred eec--Cc---hHHHHHHHHHHhh--cC---HhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCCCC Confidence 321 11 1123345555532 32 33455667788899999999887765432211 111110000 0 Q ss_pred ce---eeecC---------Cce-EEEEeeecc------cccc----------eeee--cccccccccccc--cccccchh Q lcl|NC_010576. 151 KI---MQFFP---------RQV-MVRVWNDNT------GLEQ----------DLLV--SKENCIIIESPF--YAILNDTN 197 (447) Q Consensus 151 ~~---~~~~~---------~~~-~~~~~~~~~------~~~~----------~~~~--~~~~v~~~~~~~--~~~~~~~~ 197 (447) ++ +.++. ..+ .+.+|.... .... ...+ ..=.++++++.- .+...... 
T Consensus 183 ~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~ 262 (511) T protein:vir:99 183 NSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVI 262 (511) T ss_pred ceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCccccccccccccccCCCCccceEEecCCCCCCCchhhhH Confidence 10 00000 000 011111100 0000 0000 001133333211 11111122 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhh Q lcl|NC_010576. 198 QTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQN 277 (447) Q Consensus 198 ~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~ 277 (447) .....+...++...+...+.. .+--++.-......+......+............+. ...-.+.|.+++.|+..... T Consensus 263 ~liDa~d~~~S~~~~~~~~~~--~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~d~~~l~~~~~~ 339 (511) T protein:vir:99 263 TLIDLYDNAESDTANYMSDLN--DAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADS-EGRETEGSVDGGYIYKQYDV 339 (511) T ss_pred HHHHHHHHHHHHHHHHHHHhh--chhhhhccCcccCchhhcccccccceeccccccccc-ccccCCCCcceeEEeecCCH Confidence 222222222322222222222 111122111111211111111100000000000011 11224556677777655444 Q ss_pred hh-HHHHHHHHHHHHHHhCCCHHHh---cCCcHHH--------------HHHHHHHHHHhHHHHHHHHHHHhhcCChhHh Q lcl|NC_010576. 278 NL-LSDVRQLQQDFYNQMGITEAIL---NGTANEQ--------------QTLGYYNRCVDVLLQYVTDAISRIALTKTAV 339 (447) Q Consensus 278 ~~-l~~~~~~~~~Ia~~fgVP~~~l---~g~~~e~--------------~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~ 339 (447) .. ....+.+.+.|+..-++|..-. +|+.+.. ....++...|.-.++.|...++.+--.... 
T Consensus 340 ~~~e~~~~~L~~~I~~~s~~P~~~~~~~~gn~Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~- 418 (511) T protein:vir:99 340 QGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDVS- 418 (511) T ss_pred HHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccc- Confidence 33 3456778888999888886433 2322211 112244455555555555444332110000 Q ss_pred cCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccc--ccc---ccchhhcccccCC-C Q lcl|NC_010576. 340 SQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELF--NRN---IADGNQVGGINTP-G 413 (447) Q Consensus 340 ~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~--~~~---~~~~~~~~~~~~~-~ 413 (447) ..-..+++.+..-+-.|.++.++.+.++. |+++.--++++++. ++++. .++- ... ..+..+......+ + T Consensus 419 ~~~~~i~i~f~~~~p~n~~e~~~~~~kl~--GiiS~et~l~~l~~--v~D~~-~E~~ri~~E~~~~~~~~~~~~~~~~~~ 493 (511) T protein:vir:99 419 KDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSF--FQDPE-LEVKKIEEDEKESIKKAQKNMYQDPRN 493 (511) T ss_pred cccccceEEeCCCCCcCHHHHHHHHHHHh--ccCCHHHHHHhCCC--CCCHH-HHHHHHHHHHHHHHHHHhhcccccCCC Confidence 00113455555666778999999988885 88998778887533 33321 1111 000 0000000000000 1 Q ss_pred CCCCCCCCcCCCCCCCcc Q lcl|NC_010576. 414 QITSDQPATASTDPLNNV 431 (447) Q Consensus 414 ~~~~~~~~~~~~~~~~~~ 431 (447) ...+.++..+..+....| T Consensus 494 ~~~~~~~~~~~~~~d~~e 511 (511) T protein:vir:99 494 INDDEQDDSTKDSIDKKE 511 (511) T ss_pred CCCCCCCCCCcCcccccC Confidence 111111111111111111 No 201 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=43.89 E-value=0.83 Score=21.04 Aligned_cols=377 Identities=10% Similarity=0.019 Sum_probs=138.9 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 
1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) +.-..|+.+....+..+...-.. . .. ....... .......=+......-+|+..+.-+-.-|+.+ . T Consensus 37 ~~~~~~~~~~~~YY~g~~~i~~~-~---~~------~~~~~~~---~~~~~~~ri~~n~~~~ivd~~~~~l~g~~~~~-~ 102 (472) T protein:vir:93 37 LEKLPEISIGQEYYEQRPDIVKE-P---KP------VDATGAV---DPLKPDDRMITNFHANLVDQKVSYIVGKPIAF-K 102 (472) T ss_pred HHHHHHHHHHHHHhccccccccc-c---ch------hhccccc---cccccccccccchHHHHHHHHhhhhcccCeee-c Confidence 22222333333333332100000 0 00 0000000 00000000123445666777777665566552 1 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccce-------eeec-cCCCcce Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGS-------FDIN-TARVGKI 152 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~-------~~~~-~~~~~~~ 152 (447) .++. .....+..++. |. .......+....+.+|.||+++..+..+..... +++. .....++ T Consensus 103 --~~d~---~~~~~l~~~~~---n~---~~~~~~~~~~~~~~~G~~~~~v~~d~d~~~~i~~~~p~~~~~i~d~~~~~~~ 171 (472) T protein:vir:93 103 --HTDD---EVVKRIDEVLG---NR---FDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEEL 171 (472) T ss_pred --cCCh---HHHHHHHHHHh---cc---HHHHHHHHHHHHhhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCCCCce Confidence 1111 11222333332 32 224444567888899999998776654422111 1110 0001111 Q ss_pred ee---ecC--CceEEEEeee--------cccc-----------cc--eee--ecccccccccccccccccchh---HHHH Q lcl|NC_010576. 153 MQ---FFP--RQVMVRVWND--------NTGL-----------EQ--DLL--VSKENCIIIESPFYAILNDTN---QTLR 201 (447) Q Consensus 153 ~~---~~~--~~~~~~~~~~--------~~~~-----------~~--~~~--~~~~~v~~~~~~~~~~~~~~~---~~~~ 201 (447) .. ++. ....+.+|.. ..+. +. ... +..-.++++++... +.+.+. .... 
T Consensus 172 ~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~-g~s~~e~v~~liD 250 (472) T protein:vir:93 172 EAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDL-EISDIFMYKTLID 250 (472) T ss_pred EEEEEEEEeecceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecCCCC-CCCchhhhHHHHH Confidence 00 000 0000111100 0000 00 000 00001233332111 111111 1112 Q ss_pred HHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhh-hhH Q lcl|NC_010576. 202 MLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQN-NLL 280 (447) Q Consensus 202 ~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~-~~l 280 (447) .+...++...+...+ .+.+-.+++ +.. .+..++. ...+ . ..+++.++.+.+.+.+...... ... T Consensus 251 a~~~~~s~~~~~~~~--~~~~~~~~~--g~~-~~~~~~~----~~~~----~--~~~~~~~~~~~~~~~l~~~~~~~~~~ 315 (472) T protein:vir:93 251 AYNRRLSDLSNTFKD--SNELTYVLT--NYD-DQELPEF----KRLL----R--YYGAIKVSDNGGVDTIQVEVPVENSK 315 (472) T ss_pred HHHHHHHHHHHHHHH--hcCceeEee--cCC-cccchhh----HHHH----h--hccccccCCCCcceeEeecCCHHHHH Confidence 222222222222222 233333332 211 1111111 1111 1 2345556666666666544333 344 Q ss_pred HHHHHHHHHHHHHhCCCHHH---hcCCcHHHH--------------HHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCc Q lcl|NC_010576. 281 SDVRQLQQDFYNQMGITEAI---LNGTANEQQ--------------TLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQ 343 (447) Q Consensus 281 ~~~~~~~~~Ia~~fgVP~~~---l~g~~~e~~--------------~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~ 343 (447) ...+.+.+.|+..-++|..- ++|+.+... ....+...|.-.++.+...++.+. ... 
T Consensus 316 ~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~-------~~~ 388 (472) T protein:vir:93 316 KYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG-------EHK 388 (472) T ss_pred HHHHHHHHHHHHHhCCCCCCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc-------ccc Confidence 55777788888888887533 333322111 112344555555555554443221 123 Q ss_pred eEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcccccccc--ccccchhhcccccCCCCCCCCCCC Q lcl|NC_010576. 344 VLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFN--RNIADGNQVGGINTPGQITSDQPA 421 (447) Q Consensus 344 ~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~ 421 (447) .+++.++.-+-.|..+.++.+.++. |+++.--+.+++++- +++.. ++.. ..-....+.. ...++...++.++ T Consensus 389 ~i~v~f~~~~p~~~~~~~~~~~k~~--giis~et~l~~l~~~--~d~~~-E~~ri~~E~~~~~~~~-~~~~~~~~d~~~~ 462 (472) T protein:vir:93 389 DVDISFNYNKVANTELQVQTAQQSM--GIVSHETVLENHPFV--EDLQA-ELERIEQEQMEYNKQL-PNLDDGGADGAQQ 462 (472) T ss_pred eeeEEeCCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCC--CCHHH-HHHHHHHHHHHHHHhc-cCcCcccCCCCCC Confidence 5566666777788999999988874 789887777776542 23211 1110 0000000000 0111112222222 Q ss_pred cCCCCCCCcc Q lcl|NC_010576. 422 TASTDPLNNV 431 (447) Q Consensus 422 ~~~~~~~~~~ 431 (447) .+..+..++| T Consensus 463 ~~~~~~~~~e 472 (472) T protein:vir:93 463 QERSNNKESE 472 (472) T ss_pred CCCCCcccCC Confidence 2222122222 No 202 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=41.95 E-value=0.91 Score=20.82 Aligned_cols=407 Identities=11% Similarity=0.034 Sum_probs=143.0 Q ss_pred Cc-hhHhhhhhcccccCCc-ccccccccccccccc-ccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceE Q lcl|NC_010576. 
1 MA-SSDRLLHSWNAFQSNQ-NQNQNTNDFLTPSNG-MTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFK 77 (447) Q Consensus 1 Mg-~~~~l~~~~~~f~~~~-~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~ 77 (447) |. ..+.|.++..-++.+. ++-..-..++..... ... .. ..........+ +......-+|+..+.-+-.-|+. T Consensus 37 ~~~~~~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~-~~--~~~~~~~~~~k--i~~n~~k~Ivd~~~~yl~g~p~~ 111 (501) T protein:vir:27 37 MVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLQ-FG--RRKDREMADKR--AVHNYGRMISKFKTGYLAGNPIR 111 (501) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc-cC--ccCccccccce--eccchHHHHHHHHhhhhcccCee Confidence 21 1122222221111110 000000011110000 000 00 00000000001 12334556677777766666765 Q ss_pred EEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCccc-ce------eeeccCCC- Q lcl|NC_010576. 78 HLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDS-GS------FDINTARV- 149 (447) Q Consensus 78 ~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~-~~------~~~~~~~~- 149 (447) + ....++. .+.....+..++. =|. .......+...++.+|.||+++..+..+... .. +++..... T Consensus 112 ~-~~~d~~~-~~~~~~~l~~~~~--~n~---~~~~~~~~~~~~~~~G~a~~~vy~ded~~~~i~~~~p~~~~~v~d~~~~ 184 (501) T protein:vir:27 112 V-EYDDNDN-NSQNDDTIKRIGR--IND---IDSHNRTLIRDLSQTGRAYEVIYRNEYDETRIKRLNPLETFVIYDNSLE 184 (501) T ss_pred E-ecCCccc-hHHHHHHHHHHHH--hcC---hhHHHHHHHHHHhhCCeEEEEEEeCCCCceEEEEEccceeEEEecCCCC Confidence 3 2221111 1111222333332 233 3356777888899999999988776544221 11 11110000 Q ss_pred ccee---eecC-----Cce-EEEEeeecc-------cccce---eeec--cccccccccccc--ccccchhHHHHHHHHH Q lcl|NC_010576. 150 GKIM---QFFP-----RQV-MVRVWNDNT-------GLEQD---LLVS--KENCIIIESPFY--AILNDTNQTLRMLEQK 206 (447) Q Consensus 150 ~~~~---~~~~-----~~~-~~~~~~~~~-------~~~~~---~~~~--~~~v~~~~~~~~--~~~~~~~~~~~~~~~~ 206 (447) .++. .++. +.. .+.+|.... +.... ..++ .=.++++++... +........+..+... 
T Consensus 185 ~~~~~~ir~~~~~~~~~~~~~~~vyt~~~v~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~ 264 (501) T protein:vir:27 185 DNSIAAVRYYNRGTLQNAKDVVEIYTNEHIYTLDASDDFNEISVTTHAFGTVPITEFLNNVDGIGDYETELYLIDLYDSA 264 (501) T ss_pred CceEEEEEEEEeeecCCcEEEEEEEeCCeEEEEEeCCceeeccccccCCCcccEEEecCCCCCCCchhhhHHHHHHHHHH Confidence 0010 0000 000 111111100 00000 0000 001333332111 1111122222222222 Q ss_pred HHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhhH-HHHHH Q lcl|NC_010576. 207 IKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNLL-SDVRQ 285 (447) Q Consensus 207 ~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~l-~~~~~ 285 (447) ++-..+...+.. .+ ++.+.+..... ..+....++... .......+.....+.+.+++.++.......+ ...+. T Consensus 265 ~S~~~~~~~~~~--~~--~~v~~g~~~~~-~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 338 (501) T protein:vir:27 265 ESDTANHMSDMA--DA--ILAIYGDLALP-KGMQASDMKRTR-LMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTR 338 (501) T ss_pred HHHHHHHHHHhc--Cc--eeeeecCccCC-cccchhhhhhcC-ceeecccccccCCCCCcceeeeeccCCHHHHHHHHHH Confidence 222222222222 22 22222211111 011111111100 0000111112224455666666655544444 44677 Q ss_pred HHHHHHHHhCCCHHHh---cCCcHHH--------------HHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEe Q lcl|NC_010576. 286 LQQDFYNQMGITEAIL---NGTANEQ--------------QTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYY 348 (447) Q Consensus 286 ~~~~Ia~~fgVP~~~l---~g~~~e~--------------~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~ 348 (447) +.+.|+..-++|..-. +|+.+.. .....+...|...++.+...++..--.. +. ....+++. 
T Consensus 339 l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~-~~-d~~~i~v~ 416 (501) T protein:vir:27 339 LNRDIHIFTNIPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFK-DF-DESLLKIT 416 (501) T ss_pred HHHHHHHHhCCcccCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-cc-ccccceEE Confidence 8888999888886433 2322211 1123455555555555555444322110 11 11235666 Q ss_pred cchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcccccccc--ccccchh---hcccccCC-CCCCCCCCCc Q lcl|NC_010576. 349 RNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFN--RNIADGN---QVGGINTP-GQITSDQPAT 422 (447) Q Consensus 349 ~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~--~~~~~~~---~~~~~~~~-~~~~~~~~~~ 422 (447) ++.-+-.|.++.++++.++ .|+++.--+.+++++ ++++.. ++-. ..-.... +.++-..+ +...+.++ T Consensus 417 f~~~~p~n~~e~ad~~~kl--~g~iS~et~l~~l~~--v~D~~~-E~eri~~E~~e~~~~~~~~~~~~~~~~~~d~~~-- 489 (501) T protein:vir:27 417 FTPNLPKSLNEQVSILTGL--GGQVSQETALSLSGL--VESPNE-ELDKINKEVSEIDFKGYSNDFNEHVGKYTDEVK-- 489 (501) T ss_pred eCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhCCC--CCCHHH-HHHHHHHHHHhhhHhhhcCccccccccccCCCC-- Confidence 6777888999999998887 488988777777543 333211 1110 0000000 00000000 11111111 Q ss_pred CCCCCCCcccccccCCccCcC Q lcl|NC_010576. 423 ASTDPLNNVSTSAIENGSLTD 443 (447) Q Consensus 423 ~~~~~~~~~~~~~~~~~~~~~ 443 (447) .+.+++...+++ T Consensus 490 ---------~~~~d~~e~~~~ 501 (501) T protein:vir:27 490 ---------ETHTDDFERAYE 501 (501) T ss_pred ---------CCccccccccCC Confidence 111111222222 No 203 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=37.34 E-value=1.1 Score=20.31 Aligned_cols=375 Identities=9% Similarity=0.009 Sum_probs=137.5 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 
1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) +.-..|+.+....+..+..--... ... ...... .......-+......-+|+..+.-+-.-|+. |. T Consensus 57 ~~~~~r~~~l~~YY~g~~~I~~~~----~~~------~~~~~~---~~~~~~~ri~~n~~k~Ivd~~~~yl~G~p~~-~~ 122 (492) T protein:vir:94 57 LEKLPEISIGQEYYEQRPDIVKEP----KPV------DATGAV---DPLKPDDRMITNFHANLVDQKVSYIVGKPIA-FK 122 (492) T ss_pred HHHHHHHHHHHHHhcccccccccc----ccc------cccccc---cccccccccccchHHHHHHHHHhhhcccCce-ec Confidence 122222333333333221000000 000 000000 0000000012334556677777766666765 32 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCcceeee-cCCc Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVGKIMQF-FPRQ 159 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 159 (447) .++. .....+..++. |. .......+...++.+|.||+++..+..+.... -.+.|..+..++.. ..+. T Consensus 123 --~~d~---~~~~~l~~~~~---n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~~-~~~~p~~~~~v~d~~~~~~ 190 (492) T protein:vir:94 123 --HTDD---EVVKRIDEVLG---NR---FDDKLHSVLTGASNKGIEWLHPYLDEEGEFKL-FRVPAEQGIPIWTDKEHEE 190 (492) T ss_pred --cCch---HHHHHHHHHHh---cc---HHHHHHHHHHHHhhCCeEEEEEEecCCCceEE-EEEcccceEEEEcCCCCCc Confidence 1111 11223333332 32 22445567788899999998877665432211 01111111000000 0000 Q ss_pred eE--EEEeeecc------------------ccc---------ceee-----ec--ccccccccccccccccchh---HHH Q lcl|NC_010576. 160 VM--VRVWNDNT------------------GLE---------QDLL-----VS--KENCIIIESPFYAILNDTN---QTL 200 (447) Q Consensus 160 ~~--~~~~~~~~------------------~~~---------~~~~-----~~--~~~v~~~~~~~~~~~~~~~---~~~ 200 (447) +. +++|.... ... .... ++ .=.++.+++... +.+.+. ... 
T Consensus 191 ~~a~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~-~~sd~e~v~~li 269 (492) T protein:vir:94 191 LEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDL-EISDIFMYKTLI 269 (492) T ss_pred eEEEEEEEeeccceeEEEEecCeEEEEEEecCeeeeccccccccccccccccCCCccceEEecCCCC-CCCchHHHHHHH Confidence 00 11110000 000 0000 00 001122221111 111111 111 Q ss_pred HHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhh-h Q lcl|NC_010576. 201 RMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNN-L 279 (447) Q Consensus 201 ~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~-~ 279 (447) ..+...++-..+...+ .+.+--+++ +. ..+..++ +.... ...+++.++.+.+++.+....... . T Consensus 270 Da~d~~~S~~~~~~~~--~~~p~lv~~--g~-~~~~~~~----~~~~~------~~~~~~~~~~~~~~~~l~~~~~~~~~ 334 (492) T protein:vir:94 270 DAYNRRLSDLSNTFKD--SNELTYVLK--NY-DDQELPE----FKRLL------RYYGAIKVSDNGGVDTIQVEVPVENS 334 (492) T ss_pred HHHHHHHHHHHHHHHH--hcCceeeee--cC-Ccccchh----hHHHH------hhccceecCCCCcceeEeccCCHHHH Confidence 2222222222222222 222322222 21 1111111 11111 123455566666666655443333 3 Q ss_pred HHHHHHHHHHHHHHhCCCHH---HhcCCcHHH--------------HHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCC Q lcl|NC_010576. 280 LSDVRQLQQDFYNQMGITEA---ILNGTANEQ--------------QTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQG 342 (447) Q Consensus 280 l~~~~~~~~~Ia~~fgVP~~---~l~g~~~e~--------------~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g 342 (447) ....+.+.+.|+..-++|.. -++|+.+.. .....+...|...++.+...++.+. .. 
T Consensus 335 ~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~-------~~ 407 (492) T protein:vir:94 335 KKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG-------EH 407 (492) T ss_pred HHHHHHHHHHHHHHhCCcCCCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc-------cc Confidence 34456777778888777742 334433211 1122445555555555555443221 12 Q ss_pred ceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcccccccc--ccccch-hhcccccCCCCCCCCC Q lcl|NC_010576. 343 QVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFN--RNIADG-NQVGGINTPGQITSDQ 419 (447) Q Consensus 343 ~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~--~~~~~~-~~~~~~~~~~~~~~~~ 419 (447) ..+++.++.-+..|.++.++++.++. |+++.--++++++.- +++. .++-. ..-... .+... ..+...++. T Consensus 408 ~~i~v~f~~~~p~~~~e~~~~~~kl~--giiS~et~~~~l~~v--~d~~-~E~eri~~E~~~~~~~~~~--~~~~~~~~~ 480 (492) T protein:vir:94 408 KDVDISFNYNKVANTELQVQTAQQSM--GIVSHETVLENHPFV--EDLQ-AELERIEQEQMEYNKQLPN--LDDGGADSA 480 (492) T ss_pred ceeeEEecCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCC--CCHH-HHHHHHHHHHHHHHhhccc--cccccCCCC Confidence 35666667777789999999998885 889988888876543 3321 11111 000000 00000 001111111 Q ss_pred CCcCCCCCCCcc Q lcl|NC_010576. 420 PATASTDPLNNV 431 (447) Q Consensus 420 ~~~~~~~~~~~~ 431 (447) +..+..+..++| T Consensus 481 ~~~~~~~~~e~e 492 (492) T protein:vir:94 481 QQQERSNNKESE 492 (492) T ss_pred ccccCCccccCC Confidence 111111112222 No 204 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=36.79 E-value=1.2 Score=20.24 Aligned_cols=370 Identities=8% Similarity=-0.017 Sum_probs=140.6 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 
1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ..-..|+.+..+.+..+........ . ...... .+ +......-.|+..+.-+-.-|+. |. T Consensus 30 ~~~~~r~~~~~~yy~g~~~i~~~~~----~---------~~~~~~-----~k--i~~n~~~~ivd~~~~~l~g~~~~-~~ 88 (453) T protein:vir:39 30 RLEVARYEYLKNMYRGIMAIDAEPT----K---------DLWKPD-----NR--LTVNFTKYIVDTFTGYFNGIPVK-KS 88 (453) T ss_pred HHHHHHHHHHHHHhhccCchhcCCC----c---------cccCcc-----ce--eecchHHHHHHHHhhhhcccCce-ec Confidence 1222233333333332211000000 0 000000 01 11234555666666666666654 22 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCccc-ce------eeeccCCCcc-e Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDS-GS------FDINTARVGK-I 152 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~-~~------~~~~~~~~~~-~ 152 (447) . .+ +.....+..++.. |... .....+....+.+|.||+++..+..+... .. +++....... + T Consensus 89 ~--~d---~~~~~~l~~i~~~--N~~~---~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~ 158 (453) T protein:vir:39 89 H--SD---KETLSKLQEFDNL--NDME---DEESELAKMACIYGRAFELLYQNEETQTNVIYNTPENMFMVYDDTIKQEP 158 (453) T ss_pred c--CC---hHHHHHHHHHHHh--cChh---HHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEecCCCCCeE Confidence 1 11 1123345666642 4333 45666778889999999987776544221 11 1111000000 0 Q ss_pred ---eeec--CCce-EEEEeee--------cccccce---eeec--cccccccccccc--ccccchhHHHHHHHHHHHHHH Q lcl|NC_010576. 153 ---MQFF--PRQV-MVRVWND--------NTGLEQD---LLVS--KENCIIIESPFY--AILNDTNQTLRMLEQKIKLMN 211 (447) Q Consensus 153 ---~~~~--~~~~-~~~~~~~--------~~~~~~~---~~~~--~~~v~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~ 211 (447) +.++ .+.. .+.+|.. ..+.+.. ..++ .-.++++++... +...........+...+.-.. 
T Consensus 159 ~~~ir~~~~~~~~~~~~~yt~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~s~~~ 238 (453) T protein:vir:39 159 LFAVRYGYDDDYKLYGEVYTKETTYALNGTMGFYNMTEQAPNPFDDLPVVEFYFNEERMSIFESVISLVNAFNKAISEKA 238 (453) T ss_pred EEEEEEEEeCCeEEEEEEEeCCeEEEEEecCCceeeecccccCCCceeEEEecCCCCCCcchhhhHHHHHHHHHHHHHHH Confidence 0111 0000 0111110 0000000 0011 012233332111 111111122222222222222 Q ss_pred HHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceee------cCCCceeeecCCChhhhh-HHHHH Q lcl|NC_010576. 212 SQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVAT------LDTQEKFVSAGMGLQNNL-LSDVR 284 (447) Q Consensus 212 ~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~v------l~~g~~~~~l~~~~~~~~-l~~~~ 284 (447) ....+. +.+.-++. +..+.++..++ ++. .+++. .+.+.++..++.+..... ....+ T Consensus 239 ~~~~~~--~~p~~~~~-g~~~~~~~~~~----~~~----------~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~ 301 (453) T protein:vir:39 239 NDVDYF--SDQYLTFL-GAAVEEEDLKN----IRS----------NRVINYYGESSEAKNVDVKFLEKPDSDSQTENLLD 301 (453) T ss_pred HHHHHh--hCceeeee-cCCCCchhhhh----hhh----------cceeeecCCCCCCCCCceeEEeecCCHHHHHHHHH Confidence 222222 23322332 22233222211 111 11111 233445555554444443 34567 Q ss_pred HHHHHHHHHhCCCHHHhc--CCcHH--------------HHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEe Q lcl|NC_010576. 285 QLQQDFYNQMGITEAILN--GTANE--------------QQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYY 348 (447) Q Consensus 285 ~~~~~Ia~~fgVP~~~l~--g~~~e--------------~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~ 348 (447) .+.+.|+..-++|..-.. |+.+. ......+...|...++.+...++.+--. .....|++. 
T Consensus 302 ~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~----~~~~~i~v~ 377 (453) T protein:vir:39 302 RLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVSNK----EAWKDIEYT 377 (453) T ss_pred HHHHHHHHHhCCcccccccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCc----cccccceEE Confidence 788888888888743221 22111 1122355666666666665544432111 112244555 Q ss_pred cchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcccccccc--ccccchhhcccccCCCCCCCCCCCcCCCC Q lcl|NC_010576. 349 RNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFN--RNIADGNQVGGINTPGQITSDQPATASTD 426 (447) Q Consensus 349 ~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 426 (447) +..-+..|.++.++++.++ .|+++.--+.++++.- +++. .++-. ..-....+......++ .+...++. T Consensus 378 f~~~~p~~~~~~a~~~~kl--~g~is~et~l~~l~~v--~D~~-~E~~ri~~E~~~~~~~~~~~~~~-----~~~~~~~~ 447 (453) T protein:vir:39 378 FTRNEPKDIKEQAETANIL--MGITSQETALSVISVI--PDVQ-AEMEKIKKEEASTAIFDKDKQPS-----EKGTDTVV 447 (453) T ss_pred eCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHH-HHHHHHHHHHHHHHHHHHhccCC-----CCCCCCCC Confidence 5677778899999999887 4789988788776543 3321 11111 1100010110111111 11111111 Q ss_pred CCCccc Q lcl|NC_010576. 427 PLNNVS 432 (447) Q Consensus 427 ~~~~~~ 432 (447) +.++++ T Consensus 448 ~~~~~e 453 (453) T protein:vir:39 448 PETNEE 453 (453) T ss_pred CCcCCC Confidence 222222 No 205 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=33.39 E-value=1.4 Score=19.85 Aligned_cols=363 Identities=10% Similarity=-0.027 Sum_probs=132.9 Q ss_pred CchhHhhhh-----------hcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHH Q lcl|NC_010576. 1 MASSDRLLH-----------SWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIAL 69 (447) Q Consensus 1 Mg~~~~l~~-----------~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~ 69 (447) .-+..+|.. ....+..+..-. . 
.+... ........ .......-+|+..++ T Consensus 6 ~~~i~~l~~~~~~~~~r~~~l~~Yy~G~~~i~---~------------~~~~~--~~~~~~~k--~~~n~~~~ivd~~~~ 66 (441) T protein:vir:80 6 LALIEGMYDRIQRLSSWHCCIEGYYEGSNRVR---D------------LGVAI--PPELQRVQ--TVVSWPGIAVDALEE 66 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCcch---h------------cCccc--chhhhhhh--hhcchHHHHHHHHHh Confidence 222222222 222222221000 0 00000 00000000 011123334444433 Q ss_pred hhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCccc-ce------e Q lcl|NC_010576. 70 DASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDS-GS------F 142 (447) Q Consensus 70 ~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~-~~------~ 142 (447) -+ -+.-|+. ..+..+..++. -|. .......+..+++.+|.||+++..+..+.+. .. + T Consensus 67 ~l---~~~g~~~--------~d~~~l~~i~~--~n~---~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~ 130 (441) T protein:vir:80 67 RL---DWLGWTN--------GDGYGLDGVYA--ANR---LATASCDVHLDALIFGLSFVAIIPHGDGTVSVRPQSPKNCT 130 (441) T ss_pred hh---ccccccC--------CChHHHHHHHH--hcC---HHHHHHHHHHHHhhcCeeEEEEEeCCCCceEEEEEccceEE Confidence 33 1112221 11234555554 243 3456677888999999999988776554321 11 1 Q ss_pred eeccCCCccee----eec---CCceEEEEeee---------cccccc---eeeec--ccccccccccccc----cccc-- Q lcl|NC_010576. 143 DINTARVGKIM----QFF---PRQVMVRVWND---------NTGLEQ---DLLVS--KENCIIIESPFYA----ILND-- 195 (447) Q Consensus 143 ~~~~~~~~~~~----~~~---~~~~~~~~~~~---------~~~~~~---~~~~~--~~~v~~~~~~~~~----~~~~-- 195 (447) ++......+.. .++ +......+|.. ..+.+. ...++ .-.++|+.+.-.. +.+. 
T Consensus 131 ~i~d~~~~~~~~~~~~~~~~~~~~~~~~vy~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~ 210 (441) T protein:vir:80 131 GKFSADGSRLDAGLVVQQTCDPEVVEAELLLPDVIVQVERRGSREWVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEIT 210 (441) T ss_pred EEEeCCCCceeEEEEEEEEecCceEEEEEEecCeEEEEEEcCCcceeeccccccCCCceeEEEeeccccCCccCCcccch Confidence 11011111110 000 00011111110 000000 00011 0122333321100 1111 Q ss_pred --hhHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCC-----cee Q lcl|NC_010576. 196 --TNQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQ-----EKF 268 (447) Q Consensus 196 --~~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g-----~~~ 268 (447) .......+...+..+....++. +.+.-++. +..+..... ..++. ..++++.++.+ .++ T Consensus 211 ~~v~~liDa~~~~~s~~~~~~~~~--~~~~~~i~-G~~~~~~~~----~~~~~--------~~~~i~~~~~~~~~~~~~~ 275 (441) T protein:vir:80 211 RSIRAYTDEAVRTLLGQSVNRDFY--AYPQRWVT-GVSADEFSQ----PGWVL--------SMASVWAVDKDDDGDTPNV 275 (441) T ss_pred hhHHHHHHHHHHHHHHHHHHHHhh--cCceeeee-cCCcccccc----chhhh--------cccccccCCCCCCCCccee Confidence 1111222222222222222222 22222232 111211110 01110 12344444432 345 Q ss_pred eecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCcH---HHHHH---------------HHHHHHHhHHHHHHHHHHH Q lcl|NC_010576. 269 VSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNGTAN---EQQTL---------------GYYNRCVDVLLQYVTDAIS 330 (447) Q Consensus 269 ~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~---e~~~~---------------~f~~~ti~P~~~~ie~~l~ 330 (447) .++.....+.+++.++....+|+..-++|++.++++.. ..... 
..+...|.-.++.+...++ T Consensus 276 ~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~ 355 (441) T protein:vir:80 276 GSFPVNSPTPYSDQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKALD 355 (441) T ss_pred EecCccchHHHHHHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 44443333445677788888999999999999875421 11111 1223333333333332222 Q ss_pred hhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCH--HHHHHHhCCCCCCCccccccccccccc-hhhcc Q lcl|NC_010576. 331 RIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTP--NEIRELTGKAPHPNPLANELFNRNIAD-GNQVG 407 (447) Q Consensus 331 ~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~--NE~R~~~gl~p~~g~~~~~~~~~~~~~-~~~~~ 407 (447) ...-. ......+++.+..-+..+..+.++++.+++.+|++.. .-+++.+|+.+-+- ..... .-.. ..+.+ T Consensus 356 ~~~~~---~~~~~~i~~~f~~~~~~~~~e~ad~~~kl~~~g~~~~s~~~~~~~l~~~~~e~---~~~~~-e~~e~~~~~~ 428 (441) T protein:vir:80 356 SRVDE---ADFFGDVGLRWRDASTPTRAATADAVTKLVGAGILPADSRTVLEMLGLDDVQV---EAVMR-HRAESSDPLA 428 (441) T ss_pred CCCcc---cccceeeeEEeCCCCCcCHHHHHHHHHHHHhcCcccccHHHHHHhCCCCHHHH---HHHHH-HHHHHHHHHH Confidence 21110 1122456666677778899999999999999997643 34666766654221 11100 0000 00000 Q ss_pred cccCCCCCCCCCCCcC Q lcl|NC_010576. 408 GINTPGQITSDQPATA 423 (447) Q Consensus 408 ~~~~~~~~~~~~~~~~ 423 (447) . ..+. .+.++.+. T Consensus 429 ~--~~~~-~~~~~~~~ 441 (441) T protein:vir:80 429 V--LAGA-ISRQTNEV 441 (441) T ss_pred H--Hhhh-hhcccccC Confidence 0 0000 01111111 No 206 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=29.06 E-value=1.7 Score=19.33 Aligned_cols=403 Identities=10% Similarity=0.022 Sum_probs=140.1 Q ss_pred CchhHhhhhhcccccCCcc---------cccccccccccccc-------cc--cccccccc---CCcc-------cccch Q lcl|NC_010576. 
1 MASSDRLLHSWNAFQSNQN---------QNQNTNDFLTPSNG-------MT--SFGGYYGR---GQSN-------YSRSY 52 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~---------~~~~~~~~~~~~~~-------~~--~~~~~~~~---~~~~-------~~~~~ 52 (447) |.|. ++++|++..+... ...++..+-....- .. +.++.... ..+. +..=+ T Consensus 1 m~~~--~l~lf~f~~k~~e~~~~~~~~~~~~s~~~p~~~dGa~~I~~~~~~~~~~~~~~~~~~~~~~~~~n~~eLI~~YR 78 (521) T protein:vir:10 1 MNPI--FLKLLQPWMKDDEKRVQSDLSDRIDSFAVPDTADGAIEVDKQIDTTAPKTAIVQSVLGYAPKIQNTKDLINQYR 78 (521) T ss_pred CCcc--hhHHhhhhhhhhhhHHhhhhccCccccccccCCCCceeeccCCCccccccchhhhhhccccccchHHHHHHHHH Confidence 7763 3333433333211 11122111111000 00 00111000 0000 01112 Q ss_pred hhhhhHHHHHHHHHHHHhhccC-----ceEEEEEcCCCc--eeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCC Q lcl|NC_010576. 53 SYNKADLIKSVITRIALDASMV-----DFKHLKIDPISG--NQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQ 125 (447) Q Consensus 53 ~~~~~~~v~~cv~~ia~~ia~l-----p~~~~r~~~~~~--~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn 125 (447) ..+.+|-|.+||+-|.+.+.-. |+.+--.+.+.. .++.......++|+. -|-...+++ ++..+...|- T Consensus 79 ~ma~~pEvd~Av~eIvneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~l-l~F~~~~~~----~fR~WYVDgR 153 (521) T protein:vir:10 79 SLSKYHEVDNAIDEIINDAIVQEDNRDTVYLDLDKTDWNESVKEMVREEFRTILKL-LKFEREGKR----HFRRWYVDSR 153 (521) T ss_pred HHhhccchhhHHHhhhcceEEecCCCceEEEEecCcccchHHHHHHHHHHHHHHHH-hccchhhhH----HHhhheeeee Confidence 3456788999999999887533 222221111111 111122233444432 122233333 3455667787 Q ss_pred eeEEEeeccC---CcccceeeeccCCCcceeeecC----------CceEEEEee--------ecccccceeeeccccccc Q lcl|NC_010576. 126 IAMVPIDTTV---DPDSGSFDINTARVGKIMQFFP----------RQVMVRVWN--------DNTGLEQDLLVSKENCII 184 (447) Q Consensus 126 a~i~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~--------~~~~~~~~~~~~~~~v~~ 184 (447) .|..++-+.. ..+.....+.|..+..+..... +-..+.+|. 
........+.++.+.|.| T Consensus 154 i~fHkiid~~~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~~g~~~~~vkI~~daI~y 233 (521) T protein:vir:10 154 IYFHKMIDPARPKDGIKELRLLDPRNVEYYRVNLKSNENGNDVYKGVKEFFTYGATEDNRYNISGNSNNLVQIPIDAIVY 233 (521) T ss_pred EEEEEEeeCCCccccceeeeeeCCcceeeeeeecCCCCCcchhhccceeeeeeccCCCceecCCCCCCcceeechhheee Confidence 7765543322 2233333333332222211100 000111121 111122234566666665 Q ss_pred ccccccccc-cchhHHHHHHHHHHHH---HHHH---HHHhhcCccc-ceeeeCCcCChHHHHHHHHHHHHHHHHHhcc-- Q lcl|NC_010576. 185 IESPFYAIL-NDTNQTLRMLEQKIKL---MNSQ---DNRASSGKLN-GFIQFPYSTKSTARAAQAARRKQEIENEMAN-- 254 (447) Q Consensus 185 ~~~~~~~~~-~~~~~~~~~~~~~~~~---~~~~---~~~~n~~~~~-gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 254 (447) ..+.+.... ...-+-+..+...+.. +..+ ....+.---+ +.|.++. +. +..+++....+.+.+++ T Consensus 234 ~hSGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGn-lp----k~KAeqYl~~iM~k~kNkl 308 (521) T protein:vir:10 234 SHSGKVDIDGKTIVGYLHNVIKPANQLKMLEDAMVIYRITRAPERRVFYIDVGT-MP----NKKATQHLNNVMQGLKNRV 308 (521) T ss_pred ecccceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCC-CC----chhHHHHHHHHHHhcCceE Confidence 555443321 1111112222222222 1111 1111111111 2233332 32 22333444444333321 Q ss_pred ----CCccee-------ec-------CC---CceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCC--------c Q lcl|NC_010576. 255 ----NKYGVA-------TL-------DT---QEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNGT--------A 305 (447) Q Consensus 255 ----n~~~~~-------vl-------~~---g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~--------~ 305 (447) ..|.|- .| -+ |.+++.|.-.-.--+++..++..+...++++||.+-|... . 
T Consensus 309 VYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEI~TLpggqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~f~~Gr~ 388 (521) T protein:vir:10 309 VYDSSTGKVKNSSNNLAMTEDYWLMRRDGKATTEVSTLPGAQSMGEMDDVRWFNRKLYESMKIPLSRLPQEGAGVTFGAG 388 (521) T ss_pred EEeccCceeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCCCCceecccc Confidence 122221 11 13 3344444333333468889999999999999999888421 1 Q ss_pred H-----HHHHHHHHHHHHhHHHHHHHHHHHhhc-----CChhHhc-CCceEEEec--chhh----hcC-HHHHHHHHHHH Q lcl|NC_010576. 306 N-----EQQTLGYYNRCVDVLLQYVTDAISRIA-----LTKTAVS-QGQVLVYYR--NPFK----LVP-VEQLATVADVL 367 (447) Q Consensus 306 ~-----e~~~~~f~~~ti~P~~~~ie~~l~~kL-----l~~~e~~-~g~~i~f~~--~~l~----~~d-~~~~~~~~~~~ 367 (447) + |-....|+..-=.-+...+.+.|-..| +++.|+. ...+|+|++ |.-. ... +..|+.++..+ T Consensus 389 ~EItRDEikF~KFI~rLR~rFs~~f~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~ 468 (521) T protein:vir:10 389 NDITRDELQFTKYIRGLQQQFEPIFLNPLRTNLMLKGKMSVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTL 468 (521) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhh Confidence 2 222223332222233334444444443 4556663 233455443 3221 110 11233333332 Q ss_pred ----HhCCCcCHHHHHHH-hCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCccccc Q lcl|NC_010576. 368 ----TRNAIYTPNEIREL-TGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTS 434 (447) Q Consensus 368 ----~~~G~~t~NE~R~~-~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (447) +-+-+++.+=+|+. +.+.-.+=..-++ +..+...++--++. 
.+++.+= T Consensus 469 dp~~yvGky~s~dyi~k~ILr~tDeeik~~~k----------~I~~E~~~~~~~~p---------~~e~~df 521 (521) T protein:vir:10 469 ASAEVTGKYLSHEYVMKNILRMSDEDIKTERE----------KIDGELKDSVYKNP---------EDPMEEF 521 (521) T ss_pred cCccccccccchHHHHHHHhcCCHhHHHHHHH----------HHHHhhhCCCCCCC---------cchhhcC Confidence 11113444444332 2222100000000 00000011100000 0000000 No 207 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=27.01 E-value=1.9 Score=19.07 Aligned_cols=398 Identities=7% Similarity=-0.053 Sum_probs=138.3 Q ss_pred CchhHh--------------hhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHH Q lcl|NC_010576. 1 MASSDR--------------LLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITR 66 (447) Q Consensus 1 Mg~~~~--------------l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ 66 (447) |.++-+ |.+..+-++.+-.+-.....++.... ......... ......+ +......-+|+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~---~i~~~~~~~-~~~~~~k--i~~n~~~~Iv~~ 74 (499) T protein:vir:10 1 MAVVIDKDLLDDVNEPNIEAINYAIRELQNRKKRLDKLSDYYNGKQ---EIEKHEFDN-ATVEAAN--VMVNHAKYITDM 74 (499) T ss_pred CccchhhhHHhhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcccc---chhcCCcCc-CCCCcce--eecchHHHHHHH Confidence 222211 11111111111000000001100000 000000000 0000011 112345556666 Q ss_pred HHHhhccCceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccc------ Q lcl|NC_010576. 67 IALDASMVDFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSG------ 140 (447) Q Consensus 67 ia~~ia~lp~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~------ 140 (447) .+.-+-.-|+. |.... . .....+..++.. |. ...+...+....+.+|.||.++..+..+.... 
T Consensus 75 ~~~~l~g~p~~-~~~~~-~----~~~~~l~~~~~~--n~---~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~ 143 (499) T protein:vir:10 75 NVGFMTGNPVK-YVAEK-G----KNIDDILEVFNQ--ID---IHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGN 143 (499) T ss_pred HhhhhcccCce-eecCC-h----hHHHHHHHHHhh--cC---HhHHHHHHHHHHHhcCceEEEEEecccccccccccccc Confidence 66666556765 32211 1 123345555532 32 23456677788899999998876555432110 Q ss_pred ---------eeee-ccCCCcceeeecCC-----------------c--e-EEEEeeecc--------cc----ccee--- Q lcl|NC_010576. 141 ---------SFDI-NTARVGKIMQFFPR-----------------Q--V-MVRVWNDNT--------GL----EQDL--- 175 (447) Q Consensus 141 ---------~~~~-~~~~~~~~~~~~~~-----------------~--~-~~~~~~~~~--------~~----~~~~--- 175 (447) .+.. .|..+..++....+ . + .+.+|.... .. .... T Consensus 144 ~~~~~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~ 223 (499) T protein:vir:10 144 EKLTPNTELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTMEVSANDPIVYD 223 (499) T ss_pred cccccccceEEEEEcccceEEEecCCCCcceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCCccccCcceeccc Confidence 0111 11111000000000 0 0 011111000 00 0000 Q ss_pred -ee--ccccccccccccc--ccccchhHHHHHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHH Q lcl|NC_010576. 176 -LV--SKENCIIIESPFY--AILNDTNQTLRMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIEN 250 (447) Q Consensus 176 -~~--~~~~v~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~ 250 (447) .+ ..=.++++++... +......+....+...++-..+...+ .+.+--++ .+...++.. .....+ T Consensus 224 ~~~~~g~vPvv~~~n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~--~~~~~lv~--~G~~~~~~~-~~~~~~------ 292 (499) T protein:vir:10 224 GENLFGAVPIIEFRNNEERQGDFEQLISLIDAYNLLQTDRISDKEA--FVDALLVT--FGFGLGDDK-DDIQRL------ 292 (499) T ss_pred ccCCCCccceEEecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHH--hcCceeee--ecCcccccc-chhhhh------ Confidence 00 0011233332111 11111222222222222222222222 22222222 222211111 111111 Q ss_pred HhccCCcceee--cCCCceeeecCCChhh-hhHHHHHHHHHHHHHHhCCCH---HHhcCCcHH--------------HHH Q lcl|NC_010576. 
251 EMANNKYGVAT--LDTQEKFVSAGMGLQN-NLLSDVRQLQQDFYNQMGITE---AILNGTANE--------------QQT 310 (447) Q Consensus 251 ~~~~n~~~~~v--l~~g~~~~~l~~~~~~-~~l~~~~~~~~~Ia~~fgVP~---~~l~g~~~e--------------~~~ 310 (447) ..+++.. .+.+.+++.+...... ......+.+.+.|...-++|. .-++|+.+. ... T Consensus 293 ----~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~k~~~k 368 (499) T protein:vir:10 293 ----KRGAIEAPPREEGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMNDEKFMGNVSGEAMKFKLFGLENLLSIK 368 (499) T ss_pred ----hhcceeccCCCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCchhhcccchHHHHHHHHHHHHHHHHHH Confidence 1122322 3566677777654433 334556777778888777773 223332221 112 Q ss_pred HHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc Q lcl|NC_010576. 311 LGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNP 390 (447) Q Consensus 311 ~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~ 390 (447) ..++...|.-.++.+...++.+- .+.+ ...+++.++.-+-.|..+.++.+.++ .|+++.--++++++. ++++ T Consensus 369 ~~~~~~~l~~~~~li~~~~~~~~---~~~d-~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~ 440 (499) T protein:vir:10 369 QRYFFDGLRRRLKLIQTIVNIKG---ANDD-ASGCKISLVANIPSNLSDVVNNVKNA--DGIIPRKYTYSWLPD--VDNP 440 (499) T ss_pred HHHHHHHHHHHHHHHHHHHhccC---Cccc-cccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCH Confidence 23455566666666655544321 1111 12345555566677899999999988 588999878877543 3332 Q ss_pred ccccc--cccc---ccchhhcccccCCCCCCCCCCCcCCCC------CCCcccccccCCc Q lcl|NC_010576. 391 LANEL--FNRN---IADGNQVGGINTPGQITSDQPATASTD------PLNNVSTSAIENG 439 (447) Q Consensus 391 ~~~~~--~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~ 439 (447) . .++ +... .....+....+..++..+..+..+.+. 
..++.+.++++-- T Consensus 441 ~-~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (499) T protein:vir:10 441 Q-DVIDEMNQQDAETIKKNQEALRGQDPDRLELEDKQDDSSENDKEAGSNHNQSHRTRAV 499 (499) T ss_pred H-HHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCCCcccCCCCCCCccccccCCCCCCC Confidence 1 111 0000 000000000000011100011111111 1111111111111 No 208 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=26.01 E-value=2 Score=18.94 Aligned_cols=379 Identities=8% Similarity=-0.076 Sum_probs=136.0 Q ss_pred hhH--------hhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccC Q lcl|NC_010576. 3 SSD--------RLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMV 74 (447) Q Consensus 3 ~~~--------~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~l 74 (447) +++ |+.+....+..+...-.... .......+ ..+ +......-.|+..+.-+-.- T Consensus 1 ~~~~~~~~~~~r~~~l~~yy~g~~~~~~~~~-------------~~~~~~~~---~~k--i~~n~~~~ivd~~~~~l~g~ 62 (440) T protein:vir:95 1 MLAAFLGSQKQRLAILASYAQGDNFSILSGH-------------RRLDDEKA---DYR--VRHKWGGYISSFATGYVIGN 62 (440) T ss_pred ChhhHHHHHHHHHHHHHHHhccCCccccccc-------------ccccccCC---cce--eecchHHHHHHhhhhheecc Confidence 211 12222222222111000000 00000000 001 12233455566666655555 Q ss_pred ceEEEEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCCc---- Q lcl|NC_010576. 75 DFKHLKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARVG---- 150 (447) Q Consensus 75 p~~~~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~~---- 150 (447) |+++ .....+. ......+..++. =| ........+....+.+|.||+++..+..+.... -.+.|..+. 
T Consensus 63 ~~~~-~~~~~~~--~~~~~~l~~~~~--~n---~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~i-~~~~p~~~~~~~d 133 (440) T protein:vir:95 63 PVSI-GVMEGGS--ADQLSTIKDIEW--QN---DINALNSDLAFDASVYGRAYEYHFRDKDKVDRV-VLISPLEMFVIRD 133 (440) T ss_pred CceE-eeCCCcc--HHHHHHHHHHHH--hc---CHhHHHHHHHHHHhhcCeEEEEEEecCCCceEE-EEEcccceEEEEc Confidence 6653 2222111 111123334432 12 222344566778889999999877665432210 011111110 Q ss_pred -----ceee----ec-CCceEEEEeeecc--------cc---cc---eeeecc--ccccccccccc--ccccchhHHHHH Q lcl|NC_010576. 151 -----KIMQ----FF-PRQVMVRVWNDNT--------GL---EQ---DLLVSK--ENCIIIESPFY--AILNDTNQTLRM 202 (447) Q Consensus 151 -----~~~~----~~-~~~~~~~~~~~~~--------~~---~~---~~~~~~--~~v~~~~~~~~--~~~~~~~~~~~~ 202 (447) .+.. +. .....+.+|.... .. .. ...++- =.++++++... +........... T Consensus 134 ~~~~~~~~~~i~~~~~~~~~~~~vyt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~lida 213 (440) T protein:vir:95 134 LTVEQNIIAAVHLPIYADKVNMTVYTKDKVITYKPYSNNSVRLVVDDVKKHSYNDVPVVEWWNNRFRMGDYESEISLIDA 213 (440) T ss_pred CCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEecCCccceeecceeeccCceeeEEEeeCCCCCCCchhhhHHHHHH Confidence 0100 00 0111111111100 00 00 000110 12344443221 111222222222 Q ss_pred HHHHHHHHHHHHHHhhcCcccceeeeC---CcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhh-h Q lcl|NC_010576. 203 LEQKIKLMNSQDNRASSGKLNGFIQFP---YSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQN-N 278 (447) Q Consensus 203 ~~~~~~~~~~~~~~~n~~~~~gvl~~~---~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~-~ 278 (447) +...++...+...+. +.+--+++.. ....++. ...+++.-. ............+.+.+++.+...... . 
T Consensus 214 ~~~~~s~~~~~~~~~--~~~~~v~~g~~~~~~~~~e~----~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~lt~~~~~~~ 286 (440) T protein:vir:95 214 YDAGQSDTANYMSDL--NDAMLLVKGDLDGIKLSPED----AAKMKDANM-LFLKTGISTTGQQTTADASYIYKQYDVNG 286 (440) T ss_pred HHHHHHHHHHHHHHh--hcceeeeecccccCCCCccc----hhhhhhccc-eecccccccccCCCCcceeEEeecCCHHH Confidence 222222222222222 2222233221 1112222 112221100 000011111122344455555544333 3 Q ss_pred hHHHHHHHHHHHHHHhCCCHHHh---cCCcHH--------------HHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcC Q lcl|NC_010576. 279 LLSDVRQLQQDFYNQMGITEAIL---NGTANE--------------QQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQ 341 (447) Q Consensus 279 ~l~~~~~~~~~Ia~~fgVP~~~l---~g~~~e--------------~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~ 341 (447) ....++.+.+.|+..-++|..-. .|+.+. .....++...|...++.|...++..--. +. . T Consensus 287 ~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~--~~-~ 363 (440) T protein:vir:95 287 TEAYKNRLANDIHRFSRIPNLDDDRFNSTSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAINGP--VI-E 363 (440) T ss_pred HHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCc--cc-c Confidence 44567788899999999986433 222211 1112345556666666555555432211 11 1 Q ss_pred CceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccccccccchhhcccccCCCCCCCCCCC Q lcl|NC_010576. 342 GQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPA 421 (447) Q Consensus 342 g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 421 (447) ...+++.+..-.-.|.++.++.+.++ .|+++.--+.++++. ++.. .+.... .. .+.... ...++..+ T Consensus 364 ~~~v~i~f~~~~p~~~~~~ad~~~kl--~g~iS~et~~~~l~~--~d~~--~E~~ri--~~-E~~~~~----~~~~~~~~ 430 (440) T protein:vir:95 364 ANKLTFTFHPNIPQDVWTEIKAYIEA--GGEISQETLMENASF--TDYK--TEHSRI--LK-QGGSSD----LEIGQIVG 430 (440) T ss_pred cccceEEeCCCCCCCHHHHHHHHHHH--hccCcHHHHHHhCCC--CCcH--HHHHHH--HH-HHHHhh----hhHHhhcc Confidence 23456666677788999999999888 478887666666543 2211 121110 00 000000 00000000 Q ss_pred cCCCCCCCcc Q lcl|NC_010576. 
422 TASTDPLNNV 431 (447) Q Consensus 422 ~~~~~~~~~~ 431 (447) ...+...+.| T Consensus 431 ~~~~~~~~~e 440 (440) T protein:vir:95 431 DADVGQADTE 440 (440) T ss_pred CCCCCCcCCC Confidence 0001111111 No 209 >protein:vir:105154 Length: 525 # NCBI annotation: conserved phage-related protein # Family: family:all:6660 # MgeID: mge:1466 # MgeName: C-St # Cross-refs: genbank:acc:YP_398597;genbank:gi:80159853;genbank:GeneID:3772992 Probab=25.98 E-value=2 Score=18.94 Aligned_cols=410 Identities=14% Similarity=0.124 Sum_probs=147.8 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) -||.+- .++ ..- ....+..+...|+. .........+..-.|. -++.+-|-.+=+-|-+||---|+ T Consensus 47 ~gfv~~---~~~---ng~---i~~v~~~~l~~~f~----npd~~~~~i~~l~~y~--yi~~~~v~ql~~li~~lp~l~y~ 111 (525) T protein:vir:10 47 DGFVMD---LCN---NGK---IKTVNLDTLQLWFN----NPDKYINNIVNLLTYY--YIIDGNVFQLYDLIFSLPPLDYQ 111 (525) T ss_pred HHHHHH---hhc---CCc---eeeeeHHHHHhhhc----ChHHHHHHHHHHHHHh--hhhcchHHHHHHHHHhcCCccee Confidence 333222 111 110 01111112223331 1111111111111110 01122233444556666644454 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeeccCCC----------c Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDINTARV----------G 150 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~~~~~----------~ 150 (447) .+.=.... ..+..+ .+||..-..-.-..++-+.+..++...|--. -.|- +....-|+-..... 
+ T Consensus 112 i~~~~~~k-~~~~~~-s~~n~~l~k~i~hk~ltrdll~q~a~~gtli--g~wl--g~~~~py~~vf~~~kyvfp~~r~~g 185 (525) T protein:vir:10 112 IKVLKRDK-DYKEDL-STINLYLEKKIQHKQLTRDLLVQLAHSGTLI--GTWL--GSKREPYFNVFNNLKYVFPYGRAKG 185 (525) T ss_pred ehhhhhcc-chhhHH-HHHHHHHHHhHHHHHHHHHHHHHhhccCcee--Eeee--cCCCCcchhhhhhhhhhccccccCC Confidence 33211111 111222 2222221121223344445555555566421 1111 11111111110000 1 Q ss_pred cee-----eecCC-----------ce-------EEEEee----ecccccceeeecccccccccccc----cc-cccchhH Q lcl|NC_010576. 151 KIM-----QFFPR-----------QV-------MVRVWN----DNTGLEQDLLVSKENCIIIESPF----YA-ILNDTNQ 198 (447) Q Consensus 151 ~~~-----~~~~~-----------~~-------~~~~~~----~~~~~~~~~~~~~~~v~~~~~~~----~~-~~~~~~~ 198 (447) ..+ .|+.. +. .+.-+. .+...-+.+.+|.+.++|.|... .. +.+..-. T Consensus 186 ~~v~vid~~~f~~~~~~~r~~~~~~lsp~i~~~~y~~~~~~~~~~~~~~r~i~LP~e~t~~lr~~tl~rnqrlG~s~vtp 265 (525) T protein:vir:10 186 KMVAVIDLQWFDEMSELERKLTFENLSPLITENKYKKWKEYNGENEDALRYIMLPISKTLVARIHTLSRNQRLGIPYGTQ 265 (525) T ss_pred ceEEEEehHHhhhhhHHHHHHHHHhhchhhhhhhhhHHhhcccccchhheeeecccceeEEeeecccccCcccCcchhhh Confidence 000 01100 00 011011 11112234567778888887421 11 1111112 Q ss_pred HHHHHHHHHHHHHHHHHH-hhcCcccceeeeCCc------CChHHHHHHHHHHHHHHHHHhccCCcceee--cCCCceee Q lcl|NC_010576. 199 TLRMLEQKIKLMNSQDNR-ASSGKLNGFIQFPYS------TKSTARAAQAARRKQEIENEMANNKYGVAT--LDTQEKFV 269 (447) Q Consensus 199 ~~~~~~~~~~~~~~~~~~-~n~~~~~gvl~~~~~------~~~~~~~~~~~~~~~~~~~~~~~n~~~~~v--l~~g~~~~ 269 (447) .+..+..--.+...-..- ..-..+-.+|++.+. +.+...++..+..++.+++.... 
..|+.+ ++.=.+++ T Consensus 266 ~l~dI~hk~klrd~EqsIA~kii~a~avLk~gg~~gn~mk~p~~~kqkil~gVk~aleK~~kd-K~Gi~vi~~Pdfa~~e 344 (525) T protein:vir:10 266 TLFDIQHKQKLRDLEQSIADKIIKAMAVLKFRGKDDNDSKVKESAKRKVLAGVKRALEKGVKD-KNGIACIAMPDFATFE 344 (525) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhheeeeeccccCccccCchHHHHHHHHHHHHHHhccccc-ccCeEEEeccceeecc Confidence 222221111111110000 111222235555442 22223344555555555443332 234433 44433333 Q ss_pred --ecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCcH----HHHH-HHHHHHHHhHHHHHHHHHHHhhcC----ChhH Q lcl|NC_010576. 270 --SAGMGLQNNLLSDVRQLQQDFYNQMGITEAILNGTAN----EQQT-LGYYNRCVDVLLQYVTDAISRIAL----TKTA 338 (447) Q Consensus 270 --~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~g~~~----e~~~-~~f~~~ti~P~~~~ie~~l~~kLl----~~~e 338 (447) .+.....-.+=.-.+.+.++|-.++|++..+++|+.- ..-+ .-|| .-|.-+++.||+.. ++|| +. T Consensus 345 fp~ik~~~~glDg~K~d~I~~DI~~A~GlS~sL~nGdggNyAtaslnld~fy-kkigVm~e~Iee~y-~kL~d~Vl~~-- 420 (525) T protein:vir:10 345 FPEIKNGDKTLDPKKYDSIDNDITNATGISQVLTNGTKGNYASAKLNLDVFY-KKIGVMLEIIEEIY-NQLIDIILGE-- 420 (525) T ss_pred cccccCcccCCCchhhhhhhhhhhhhhccceeeecCCCCceeeeeeeHHHHH-HHHHHHHHHHHHHH-HHHHhhhcCc-- Confidence 2221111111123345678999999999999987642 2223 3455 45777888888554 4554 33 Q ss_pred hcCCceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcccc-------ccccccccchhhcccccC Q lcl|NC_010576. 339 VSQGQVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLAN-------ELFNRNIADGNQVGGINT 411 (447) Q Consensus 339 ~~~g~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~-------~~~~~~~~~~~~~~~~~~ 411 (447) +.+..+.|++|.-...+.+.+++.+-++...||... 
-+....|+.--+.-..- .+-.+-+.+.+..--++ T Consensus 421 -~k~~nyifnydkd~pi~~kkk~d~LIkL~d~g~s~k-~vldl~gis~e~y~E~s~yEtE~lkl~EKi~pp~~~~v~SG- 497 (525) T protein:vir:10 421 -EKGCNYIFQYNKDTPIEREKKLDTLIKLEAQGYSAK-YVLDILGISSEEYFEESIYEIEKLKLREKIMPPLNTNVLSG- 497 (525) T ss_pred -ccCcceEEecCCCchhhhhhhhhhhhhhhccchhhh-hhhhhhccCcchHHHHHHHHHHHHHHhhhccccccceeeec- Confidence 345566777777777788888889989999887532 22223333321110000 00001111111110000 Q ss_pred CCCCCCCCCCcCCCCCCCcccccccCCccCcCCCC Q lcl|NC_010576. 412 PGQITSDQPATASTDPLNNVSTSAIENGSLTDGGS 446 (447) Q Consensus 412 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 446 (447) .+.+.-.-|.+.+++.+..--.+-+.| . T Consensus 498 -k~~n~iG~P~~dd~~~~dati~s~~~~------~ 525 (525) T protein:vir:10 498 -KDGNDIGSPKLDDSDSSDATIESKERG------V 525 (525) T ss_pred -cccccccCCccCCCcchhhhhhhhhcC------C Confidence 011111111111221111111111111 1 No 210 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=24.75 E-value=2.1 Score=18.77 Aligned_cols=402 Identities=8% Similarity=0.043 Sum_probs=138.9 Q ss_pred CchhHhhhhhcccccCCccc---------ccccccccccccccc-----------ccccccccCCc-----------ccc Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQ---------NQNTNDFLTPSNGMT-----------SFGGYYGRGQS-----------NYS 49 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~---------~~~~~~~~~~~~~~~-----------~~~~~~~~~~~-----------~~~ 49 (447) |.|- ++++++++.+.+.. ..+++.+.... +.. ++.|.+...-+ -+. 
T Consensus 1 m~~~--~L~~~~~w~~~de~~~~~~~~~~~~S~~~p~~~D-ga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~ 77 (524) T protein:vir:10 1 MKFN--VLSLFAPWAKMDERNFKDQEKEDLVSITAPKLDD-GAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELID 77 (524) T ss_pred CCCc--hhhHhhccccCcchhhhhhhccCCccccCccCCC-CceeeeecccccccccceeeeehhcccccccchHHHHHH Confidence 8872 23333333332211 11222221110 000 11111111000 001 Q ss_pred cchhhhhhHHHHHHHHHHHHhhccC-----ceEEEEEcCC--CceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHh Q lcl|NC_010576. 50 RSYSYNKADLIKSVITRIALDASMV-----DFKHLKIDPI--SGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLD 122 (447) Q Consensus 50 ~~~~~~~~~~v~~cv~~ia~~ia~l-----p~~~~r~~~~--~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll 122 (447) .=+..+.+|-|.+||+-|.+++.-. |+.+-=.+.+ ...+........++|+. -|-...+++ ++..+.. T Consensus 78 ~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~l-l~F~~~~~~----~fR~WYV 152 (524) T protein:vir:10 78 TYRNLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFNDVLNH-LSFQRKGSD----HFRRWYV 152 (524) T ss_pred HHHHHhhccchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHHH-hccchhhhH----HHhhhee Confidence 1123466788999999998887533 2222111111 01111122223344432 122233333 3455667 Q ss_pred cCCeeEEEeeccCC---cccceeeeccCCCcceeeecC----------CceEEEEee----------ecccccceeeecc Q lcl|NC_010576. 123 EGQIAMVPIDTTVD---PDSGSFDINTARVGKIMQFFP----------RQVMVRVWN----------DNTGLEQDLLVSK 179 (447) Q Consensus 123 ~Gna~i~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~----------~~~~~~~~~~~~~ 179 (447) .|-.|..+.-+... .+.....+.|..+..+..... +-..+.+|. ..+.....+.++. 
T Consensus 153 DgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~ 232 (524) T protein:vir:10 153 DSRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREIITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPK 232 (524) T ss_pred eeEEEEEEEeeCCCccccceeeeeeCCccceeeeeeccCCCccchhhcchhhheeeccCccccccCccccCCCcceecch Confidence 78777655443322 222333333332222111100 000011111 1112234455666 Q ss_pred ccccccccccccccc-chhHHHHHHHHHHHHHHHHHH----H--hhcCccc-ceeeeCCcCChHHHHHHHHHHHHHHHHH Q lcl|NC_010576. 180 ENCIIIESPFYAILN-DTNQTLRMLEQKIKLMNSQDN----R--ASSGKLN-GFIQFPYSTKSTARAAQAARRKQEIENE 251 (447) Q Consensus 180 ~~v~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~----~--~n~~~~~-gvl~~~~~~~~~~~~~~~~~~~~~~~~~ 251 (447) +-|.|..+.+.+... ..-+-+..+...+..+.-... + .+.---+ +.|.++ .+. +..+++....+... T Consensus 233 dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvG-nlP----k~KAeqYl~~im~k 307 (524) T protein:vir:10 233 AAIVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTG-NMP----ARKAAEHMQHVMNT 307 (524) T ss_pred hheeeeeccceeCCCCceeccchhhhHHHHhhhHHHhhHHHHhhhccccceEEEEecC-CCC----chhHHHHHHHHHHh Confidence 666666654433221 111222222222222111111 1 1111111 223333 232 22334444444443 Q ss_pred hcc------CCcce------e-ec-------CC---CceeeecCCChhhhhHHHHHHHHHHHHHHhCCCHHHhc------ Q lcl|NC_010576. 252 MAN------NKYGV------A-TL-------DT---QEKFVSAGMGLQNNLLSDVRQLQQDFYNQMGITEAILN------ 302 (447) Q Consensus 252 ~~~------n~~~~------~-vl-------~~---g~~~~~l~~~~~~~~l~~~~~~~~~Ia~~fgVP~~~l~------ 302 (447) +++ +.|.| + .| -+ |.+++.|.-.-.--+++..++..+...++++||.+-|. 
T Consensus 308 ~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~d~~~~ 387 (524) T protein:vir:10 308 MKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDVRWFRQALYMALRVPLSRIPQDQQGG 387 (524) T ss_pred cCceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCchhhcCCCCCcc Confidence 322 22222 1 11 13 33444443333334688899999999999999999882 Q ss_pred ---CCcH-----HHHHHHHHHHHHhHHHHHHHHHHHhhc-----CChhHhc-CCceEEEec--chhh----hcC-HHHHH Q lcl|NC_010576. 303 ---GTAN-----EQQTLGYYNRCVDVLLQYVTDAISRIA-----LTKTAVS-QGQVLVYYR--NPFK----LVP-VEQLA 361 (447) Q Consensus 303 ---g~~~-----e~~~~~f~~~ti~P~~~~ie~~l~~kL-----l~~~e~~-~g~~i~f~~--~~l~----~~d-~~~~~ 361 (447) |.++ |-....|+..-=.-+...+.+.|-..| +++.|+. ...+|+|++ |.-. ... +..|+ T Consensus 388 f~~gr~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~ 467 (524) T protein:vir:10 388 VMFDSGTSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFTELKEAEILERRI 467 (524) T ss_pred ccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHH Confidence 1222 222223332222223334444444443 4556663 233455443 3221 000 11222 Q ss_pred HHHHHHHh--CCCcCHHHHHHH-hCCCCCCCccccccccccccchhhcccccCCCCCCCCCCCcCCCCCCCccccc Q lcl|NC_010576. 362 TVADVLTR--NAIYTPNEIREL-TGKAPHPNPLANELFNRNIADGNQVGGINTPGQITSDQPATASTDPLNNVSTS 434 (447) Q Consensus 362 ~~~~~~~~--~G~~t~NE~R~~-~gl~p~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (447) .++..+-. +-.++.+=+|+. +.+.-.+ + ... 
..+..+...++--.+..++ +.+= T Consensus 468 ~~l~~~dpyvGky~s~~yi~k~ILr~tDee------i-~~~---~k~I~~E~k~~~~~~~~~~---------~~~f 524 (524) T protein:vir:10 468 NMLTMAEPFIGKYISHRTAMKDILQMTDEE------I-EQE---AKQIEEESKEARFQDPDQE---------QEDF 524 (524) T ss_pred HHHHHhhhhhcccchhHHHHHHHhccCHHH------H-HHH---HHHHHHHhhcCCCCCCchh---------hhcC Confidence 22221110 012233333221 1221000 0 000 0000000111111110000 0000 No 211 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=23.34 E-value=2.3 Score=18.58 Aligned_cols=375 Identities=6% Similarity=-0.007 Sum_probs=135.3 Q ss_pred CchhHhhhhhcccccCCccccccccccccccccccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQNQNQNTNDFLTPSNGMTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKHLK 80 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~~r 80 (447) ..-..|+.+....+..+...... . ....+ .+...... ...-+......-+|+..+.-+-.-|+. |. T Consensus 40 ~~~~~~~~~~~~Yy~g~~~i~~r-~---~~~~~----~~~~~~~~-----~~~ki~~n~~~~Ivd~~~~~l~g~p~~-~~ 105 (474) T protein:vir:95 40 RKQLDKITVGQRYYDKDNDIVKQ-M---KKVDV----YGNIDYDK-----PDWRITTNFHQNLVDQKVSYVASKPVT-YS 105 (474) T ss_pred HHHHHHHHHHHHHhcccCchhcc-c---ccccc----cccccccc-----ccceeccchHHHHHHHHHhhhccCCce-ec Confidence 22222333333333222110000 0 00000 00000000 000112234555667777766666765 32 Q ss_pred EcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccceeeec-cCCC---------c Q lcl|NC_010576. 81 IDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGSFDIN-TARV---------G 150 (447) Q Consensus 81 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~~~~~-~~~~---------~ 150 (447) . ++ +.....+..++. |. .......+....+.+|.||+++..+..+... +.+. +..+ . 
T Consensus 106 ~--~d---~~~~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~~--i~~~~p~~~~~v~d~~~~~ 172 (474) T protein:vir:95 106 C--ED---ESVLKIIHDVLD---TR---WDNKLIDILTATSNKGIDWLQVYINENGEMK--LFRVPAEQAIPIWVDKERE 172 (474) T ss_pred c--Cc---hHHHHHHHHHHh---cc---HHHHHHHHHHHHhhcCcEEEEEEecCCCceE--EEEEcccceEEEEcCCCCC Confidence 1 11 112223334432 32 2234455667888999999887665543221 1111 1111 1 Q ss_pred cee---eec--CCceEEEEeeec--------ccccce-------------eee--cccccccccccccc--cccchhHHH Q lcl|NC_010576. 151 KIM---QFF--PRQVMVRVWNDN--------TGLEQD-------------LLV--SKENCIIIESPFYA--ILNDTNQTL 200 (447) Q Consensus 151 ~~~---~~~--~~~~~~~~~~~~--------~~~~~~-------------~~~--~~~~v~~~~~~~~~--~~~~~~~~~ 200 (447) .+. .++ .+...+.+|... .+.... ..+ ..=.++++++...+ ......... T Consensus 173 ~~~~~i~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~e~v~~li 252 (474) T protein:vir:95 173 ELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHIQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKSLI 252 (474) T ss_pred ceEEEEEEEEEcCeeEEEEEeCCeEEEEEEcCCccccccccCcccccccccccCCCccceEeecCCCCCCCcHHHHHHHH Confidence 110 000 000111111100 000000 000 00112333221111 111111222 Q ss_pred HHHHHHHHHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhh-hh Q lcl|NC_010576. 201 RMLEQKIKLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQN-NL 279 (447) Q Consensus 201 ~~~~~~~~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~-~~ 279 (447) ..+...++...+...+ ...+--++ .+...++ .++ +... ....+++.++++.+++.+..+... +. 
T Consensus 253 Da~d~~~S~~~~~~~~--~~~p~lv~--~g~~~~~-~~~----~~~~------~~~~~~i~~~~~~~~~~l~~~~~~~~~ 317 (474) T protein:vir:95 253 DAIDKRLSDAQNMFDE--SVELIYIL--KGYEGQD-LEE----FMRG------LKYYKAINVDGDGGVETIQVEVPVSST 317 (474) T ss_pred HHHHHHHHHHHHHHHH--hcCceeee--ecCCccc-chh----hhhh------hhccceeeccCCCceeEEeecCCHHHH Confidence 2222222222222222 22232222 2221111 111 1111 123456677777777777655433 34 Q ss_pred HHHHHHHHHHHHHHhCCCHHH---hcCCcHH--------------HHHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCC Q lcl|NC_010576. 280 LSDVRQLQQDFYNQMGITEAI---LNGTANE--------------QQTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQG 342 (447) Q Consensus 280 l~~~~~~~~~Ia~~fgVP~~~---l~g~~~e--------------~~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g 342 (447) ...++.+.+.|+..-++|..- ++|+.+. +.....+...|..+++.|.+.++.+. .. T Consensus 318 ~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~~-------d~ 390 (474) T protein:vir:95 318 KEYIDLMRAYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNLKM-------DV 390 (474) T ss_pred HHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc-------cc Confidence 456778888999998988532 2222221 11123455666666666655443221 12 Q ss_pred ceEEEecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcccccccc--ccccchhhcccccCCCCCCCCCC Q lcl|NC_010576. 343 QVLVYYRNPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANELFN--RNIADGNQVGGINTPGQITSDQP 420 (447) Q Consensus 343 ~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 420 (447) ..+++.++.-...|.++.++. +.+.|+++.-.+.+++++ ++++. .++-. ..-....+.......++ T Consensus 391 ~~i~v~f~~~~p~d~~e~a~~---~~~~g~iS~et~i~~l~~--v~d~~-~E~~ri~~E~~~~~~~~~~~~~~~------ 458 (474) T protein:vir:95 391 KDIEISFNFNRMMNDAEQSQI---IAQSQYLSRETLVKSSPL--VDDYK-AELERIEQEQMEYNKQLPNLDDGG------ 458 (474) T ss_pred ceeeEEeccCCCcCHHHHHHH---HHhcCCCchHHHHHhCCC--CCCHH-HHHHHHHHHHHHHHhccccccccc------ Confidence 344444445555566655554 566799998888877543 33321 11111 00000000000000000 Q ss_pred CcCCCCCCCcccccccCCc Q lcl|NC_010576. 
421 ATASTDPLNNVSTSAIENG 439 (447) Q Consensus 421 ~~~~~~~~~~~~~~~~~~~ 439 (447) .+.....+....-+++ T Consensus 459 ---~d~~~~~~~~~~~~~~ 474 (474) T protein:vir:95 459 ---ADGAQQQERSNDKESE 474 (474) T ss_pred ---CCCCcCCCCCccCCCC Confidence 0100111110011111 No 212 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=22.78 E-value=2.4 Score=18.50 Aligned_cols=407 Identities=11% Similarity=0.024 Sum_probs=139.0 Q ss_pred CchhHhhhhhcccccCCcc-cccccccccccccc-ccccccccccCCcccccchhhhhhHHHHHHHHHHHHhhccCceEE Q lcl|NC_010576. 1 MASSDRLLHSWNAFQSNQN-QNQNTNDFLTPSNG-MTSFGGYYGRGQSNYSRSYSYNKADLIKSVITRIALDASMVDFKH 78 (447) Q Consensus 1 Mg~~~~l~~~~~~f~~~~~-~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cv~~ia~~ia~lp~~~ 78 (447) +-..+.+.++.+-++.+.. +-.....++..... ...-........ ...+ .......-+|+..+.-+-.-|+++ T Consensus 38 ~~~~~~i~~~i~~~~~~~~~r~~~~~~yY~g~~~~i~~~~~~~~~~~---~~~r--i~~n~~k~Ivd~~~~yl~g~p~~~ 112 (501) T protein:vir:96 38 VNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRRKDNEM---ADKR--AVHNYGRMISKFKTGYLAGNPIRV 112 (501) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccCccccCcccc---ccce--eecchHHHHHHHHhhhhcccCeeE Confidence 2222222222221111100 00000011110000 000000000000 0001 123345556666666666667653 Q ss_pred EEEcCCCceeccccchHHHHHhhhcCcccCHHHHHHHHHHHHHhcCCeeEEEeeccCCcccce-------eeeccCCC-c Q lcl|NC_010576. 79 LKIDPISGNQTPMPSGLINVLTRSANIDQTGRSFVFDLLYSLLDEGQIAMVPIDTTVDPDSGS-------FDINTARV-G 150 (447) Q Consensus 79 ~r~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~~~~~~-------~~~~~~~~-~ 150 (447) - ...+. ..+.....+..++.. | ........+...++.+|.||+++..+..+.+... +++..... . 
T Consensus 113 ~-~~~~~-~~~~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~i~~~~p~~~~~v~d~~~~~ 185 (501) T protein:vir:96 113 E-YDDND-DNSQNDDAIKRIGRI--N---DLDSLNRTLIRDLSQTGRAYEVIYRSEYDETRIKRLSPLETFVIYDNSLED 185 (501) T ss_pred e-eCCcc-chhHHHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEEEccceeEEEEcCCCCC Confidence 2 21111 111112223334321 3 3345677788889999999998877654422111 11111000 1 Q ss_pred cee---eecC-----Cce-EEEEeeecc-------cccce---eeec--cccccccccccc--ccccchhHHHHHHHHHH Q lcl|NC_010576. 151 KIM---QFFP-----RQV-MVRVWNDNT-------GLEQD---LLVS--KENCIIIESPFY--AILNDTNQTLRMLEQKI 207 (447) Q Consensus 151 ~~~---~~~~-----~~~-~~~~~~~~~-------~~~~~---~~~~--~~~v~~~~~~~~--~~~~~~~~~~~~~~~~~ 207 (447) ++. .++. +.. .+.+|.... +.... ..++ .=.++++++... +...........+...+ T Consensus 186 ~~~~~v~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~ 265 (501) T protein:vir:96 186 NSIAAVRYYNRGTLQSAKDVVEIYTDEHIYTLDASDDFNEISVTTHAFGTVPITEYLNNIDGIGDYETELYLIDLYDSAE 265 (501) T ss_pred ceEEEEEEEEeecCCCcEEEEEEEcCCcEEEEeeCCCceeccccccCCCccceEEecCCccCCCchhhhHHHHHHHHHHH Confidence 110 0000 000 111111100 00000 0000 001333332111 11111112222222222 Q ss_pred HHHHHHHHHhhcCcccceeeeCCcCChHHHHHHHHHHHHHHHHHhccCCcceeecCCCceeeecCCChhhhhH-HHHHHH Q lcl|NC_010576. 208 KLMNSQDNRASSGKLNGFIQFPYSTKSTARAAQAARRKQEIENEMANNKYGVATLDTQEKFVSAGMGLQNNLL-SDVRQL 286 (447) Q Consensus 208 ~~~~~~~~~~n~~~~~gvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~l-~~~~~~ 286 (447) +-..+...+. +.+--++. +..... ..+....++... 
.......+.......+.+++-+........+ ...+.+ T Consensus 266 s~~~~~~~~~--~~~~l~i~--G~~~~~-~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l 339 (501) T protein:vir:96 266 SDTANHMSDM--ADAILAIY--GDLALP-KGMQASDMKRTR-LMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRL 339 (501) T ss_pred HHHHHHHHHh--cCceeeee--cccccC-cccchhhhhhcC-eeeecccccccccccCcceeeEeccCCHHHHHHHHHHH Confidence 2222222222 22222222 211111 011111111100 0000111111123344455555544444333 445677 Q ss_pred HHHHHHHhCCCHHHh---cCCcHHH--------------HHHHHHHHHHhHHHHHHHHHHHhhcCChhHhcCCceEEEec Q lcl|NC_010576. 287 QQDFYNQMGITEAIL---NGTANEQ--------------QTLGYYNRCVDVLLQYVTDAISRIALTKTAVSQGQVLVYYR 349 (447) Q Consensus 287 ~~~Ia~~fgVP~~~l---~g~~~e~--------------~~~~f~~~ti~P~~~~ie~~l~~kLl~~~e~~~g~~i~f~~ 349 (447) .+.|+..-++|..-. +|+.+.. .....+...|...++.+..-++.+--.. ..+ ...+++.+ T Consensus 340 ~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~-~~d-~~~i~i~f 417 (501) T protein:vir:96 340 NRDIHIFTNTPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVNEFK-DFD-ESLLKITF 417 (501) T ss_pred HHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-ccc-cccceEEe Confidence 888888888886443 2322211 1122445555555555544443321110 011 12355566 Q ss_pred chhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcccccc--ccccccchh--hccccc--CCCCCCCCCCCcC Q lcl|NC_010576. 350 NPFKLVPVEQLATVADVLTRNAIYTPNEIRELTGKAPHPNPLANEL--FNRNIADGN--QVGGIN--TPGQITSDQPATA 423 (447) Q Consensus 350 ~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g~~~~~~--~~~~~~~~~--~~~~~~--~~~~~~~~~~~~~ 423 (447) ..-+..|.++.++++.++. |+++..-+.+++++ ++++.. ++ +...-.... ...+.. 
..+...+++...+ T Consensus 418 ~~~~p~n~~e~ad~~~kl~--g~iS~et~~~~l~~--v~D~~~-E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 492 (501) T protein:vir:96 418 TPNLPKSLNEQVSILTGLG--GQVSQETALSLSGL--VESPNE-ELDKINKEMSEIDFKGYSNDFNEHVGKYTDEVKETH 492 (501) T ss_pred CCCCCcCHHHHHHHHHHHh--ccCchHHHHHhCCC--CCCHHH-HHHHHHHHHHHhhccccccchhhcccccCCcCCCCC Confidence 6777889999999999885 78988777777543 333211 11 111000000 000000 0011111111111 Q ss_pred CCCCCCcccccccC Q lcl|NC_010576. 424 STDPLNNVSTSAIE 437 (447) Q Consensus 424 ~~~~~~~~~~~~~~ 437 (447) ++. .+++. + T Consensus 493 ~d~-~e~~~----~ 501 (501) T protein:vir:96 493 TDD-FEREY----E 501 (501) T ss_pred CCc-ccccc----C Confidence 111 11111 1 Done!