Query lcl|NC_012530.1_cdsid_YP_002790785.1 [gene=lb338_phage_106] [protein=putative portal protein] [protein_id=YP_002790785.1] [location=60644..62323] Match_columns 559 No_of_seqs 254 out of 1015 Neff 8.8 Searched_HMMs 1612 Date Thu Nov 7 14:24:28 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_106 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_106_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:80644 Length: 551 100.0 2E-124 1E-127 698.5 49.9 529 1-559 5-545 (551) 2 protein:vir:63755 Length: 547 100.0 5E-122 3E-125 685.5 51.1 531 1-559 1-541 (547) 3 protein:vir:96579 Length: 576 100.0 1E-118 6E-122 667.5 50.0 540 1-559 6-569 (576) 4 protein:vir:80796 Length: 574 100.0 2E-116 1E-119 655.3 50.1 549 1-559 2-572 (574) 5 protein:vir:95599 Length: 563 100.0 4E-112 3E-115 631.1 50.9 528 1-559 2-557 (563) 6 protein:vir:99312 Length: 563 100.0 4E-112 3E-115 631.1 50.9 528 1-559 2-557 (563) 7 protein:vir:100691 Length: 535 100.0 7.5E-98 5E-101 553.0 49.6 521 1-550 1-535 (535) 8 protein:vir:101648 Length: 518 100.0 5.5E-90 3.4E-93 509.9 42.3 473 33-559 1-487 (518) 9 protein:vir:7853 Length: 518 # 100.0 2E-89 1.2E-92 506.9 42.2 479 22-559 1-487 (518) 10 protein:vir:6240 Length: 457 # 100.0 3.3E-88 2.1E-91 500.2 41.4 448 1-532 1-457 (457) 11 protein:vir:107605 Length: 432 100.0 1.4E-87 8.7E-91 496.7 39.6 427 1-532 1-432 (432) 12 protein:vir:105002 Length: 432 100.0 1.4E-87 8.7E-91 496.7 39.6 427 1-532 1-432 (432) 13 protein:vir:102855 Length: 432 100.0 1.4E-87 8.7E-91 496.7 39.6 427 1-532 1-432 (432) 14 protein:vir:100249 Length: 431 100.0 3.5E-88 2.2E-91 500.0 35.4 420 1-495 1-431 (431) 15 protein:vir:1326 Length: 457 # 100.0 2.8E-87 1.7E-90 495.1 40.1 447 1-536 1-457 (457) 16 protein:vir:93610 Length: 454 100.0 2.5E-87 1.6E-90 495.3 39.8 447 19-538 1-454 (454) 17 protein:vir:1380 Length: 422 # 100.0 1.8E-87 1.1E-90 496.1 38.7 416 1-502 1-422 (422) 18 protein:vir:102080 Length: 429 100.0 7.2E-87 4.5E-90 492.8 38.8 424 1-511 1-429 (429) 19 protein:vir:105064 Length: 421 100.0 2.3E-86 1.4E-89 490.1 39.1 417 1-521 1-421 (421) 20 protein:vir:100150 Length: 437 100.0 4.3E-86 2.7E-89 488.6 39.3 430 1-518 1-437 (437) 21 protein:vir:4337 Length: 434 # 100.0 3.4E-86 2.1E-89 489.2 37.4 428 19-505 1-434 (434) 22 protein:vir:81152 Length: 411 100.0 9.5E-86 5.9E-89 486.7 38.9 404 1-498 1-411 (411) 23 protein:vir:5737 Length: 419 # 100.0 1.2E-85 7.6E-89 486.1 39.4 415 1-528 1-419 (419) 24 protein:vir:189 Length: 424 # 100.0 4.7E-86 2.9E-89 488.4 37.1 418 13-508 1-424 (424) 25 protein:vir:102727 Length: 945 100.0 3E-86 1.9E-89 489.4 36.0 509 1-559 5-560 (945) 26 protein:vir:1884 Length: 424 # 100.0 8.4E-86 5.2E-89 487.0 38.4 419 7-508 1-424 (424) 27 protein:vir:10362 Length: 432 100.0 3E-85 1.9E-88 483.9 38.7 421 1-516 7-432 (432) 28 protein:vir:4454 Length: 414 # 100.0 6.2E-85 3.9E-88 482.2 39.5 410 1-509 1-414 (414) 29 protein:vir:1431 Length: 419 # 100.0 5.1E-85 3.2E-88 482.7 38.6 412 35-519 1-419 (419) 30 protein:vir:4509 Length: 424 # 100.0 6.7E-85 4.2E-88 482.0 38.3 403 1-511 16-424 (424) 31 protein:vir:81072 Length: 432 100.0 1.9E-84 1.2E-87 479.6 39.5 421 1-511 7-432 (432) 32 protein:vir:102118 Length: 409 100.0 1.3E-84 7.9E-88 480.5 38.5 403 22-483 1-409 (409) 33 protein:vir:97060 Length: 432 100.0 1.2E-84 7.6E-88 480.6 38.0 421 1-516 7-432 (432) 34 protein:vir:81218 Length: 423 100.0 3E-84 1.9E-87 478.5 38.5 409 1-503 1-423 (423) 35 protein:vir:9408 Length: 441 # 100.0 4.3E-84 2.7E-87 477.6 37.8 409 1-500 26-441 (441) 36 protein:vir:79984 Length: 441 100.0 4.3E-84 2.7E-87 477.6 37.8 409 1-500 26-441 (441) 37 protein:vir:80333 Length: 419 100.0 4.3E-84 2.7E-87 477.6 37.4 415 26-522 1-419 (419) 38 protein:vir:483 Length: 413 # 100.0 9.8E-83 6.1E-86 470.2 40.5 409 22-510 1-413 (413) 39 protein:vir:98396 Length: 441 100.0 2.6E-83 1.6E-86 473.3 36.8 409 1-500 26-441 (441) 40 protein:vir:4598 Length: 416 # 100.0 3.8E-83 2.4E-86 472.4 37.6 409 1-500 1-416 (416) 41 protein:vir:81095 Length: 416 100.0 3.8E-83 2.4E-86 472.4 37.6 409 1-500 1-416 (416) 42 protein:vir:8418 Length: 409 # 100.0 1.5E-82 9.3E-86 469.2 38.7 406 1-511 1-409 (409) 43 protein:vir:101647 Length: 460 100.0 1.4E-82 9E-86 469.3 37.7 430 16-486 1-460 (460) 44 protein:vir:1266 Length: 416 # 100.0 2.5E-82 1.5E-85 468.0 37.9 411 22-507 1-416 (416) 45 protein:vir:8317 Length: 409 # 100.0 1.2E-81 7.5E-85 464.2 33.7 401 1-468 1-409 (409) 46 protein:vir:8100 Length: 466 # 100.0 8.4E-81 5.2E-84 459.6 38.1 443 1-530 1-466 (466) 47 protein:vir:96980 Length: 409 100.0 1.7E-80 1.1E-83 457.9 38.6 401 1-503 4-409 (409) 48 protein:vir:3868 Length: 417 # 100.0 1.9E-80 1.2E-83 457.6 38.8 410 25-536 1-417 (417) 49 protein:vir:94666 Length: 723 100.0 1.2E-80 7.2E-84 458.8 36.0 458 48-559 1-481 (723) 50 protein:vir:2683 Length: 412 # 100.0 5.9E-80 3.6E-83 455.0 39.7 405 1-503 1-412 (412) 51 protein:vir:93943 Length: 409 100.0 8.4E-80 5.2E-83 454.1 38.1 401 1-503 4-409 (409) 52 protein:vir:94426 Length: 409 100.0 5.3E-79 3.3E-82 449.7 38.1 401 1-503 4-409 (409) 53 protein:vir:95378 Length: 406 100.0 4.9E-79 3E-82 449.9 36.9 401 1-513 1-406 (406) 54 protein:vir:960 Length: 413 # 100.0 9E-79 5.6E-82 448.5 35.6 407 1-503 1-413 (413) 55 protein:vir:9702 Length: 406 # 100.0 2.8E-78 1.7E-81 445.8 37.6 400 25-512 1-406 (406) 56 protein:vir:3153 Length: 467 # 100.0 3.2E-77 2E-80 440.0 40.3 414 76-540 1-467 (467) 57 protein:vir:80134 Length: 403 100.0 5.4E-78 3.4E-81 444.2 35.3 398 1-504 1-403 (403) 58 protein:vir:4194 Length: 540 # 100.0 5.4E-76 3.4E-79 433.2 39.6 469 27-559 1-503 (540) 59 protein:vir:4156 Length: 542 # 100.0 2.5E-75 1.5E-78 429.6 37.2 477 3-559 1-508 (542) 60 protein:vir:6210 Length: 394 # 100.0 2.4E-74 1.5E-77 424.2 34.3 389 1-501 1-394 (394) 61 protein:vir:3843 Length: 397 # 100.0 5.1E-73 3.2E-76 416.9 38.0 392 1-511 1-397 (397) 62 protein:vir:104259 Length: 403 100.0 1.8E-73 1.1E-76 419.3 34.9 394 22-511 1-403 (403) 63 protein:vir:100882 Length: 383 100.0 2.8E-72 1.8E-75 412.8 36.7 380 1-481 1-383 (383) 64 protein:vir:9359 Length: 348 # 100.0 2E-72 1.2E-75 413.7 35.4 343 97-503 1-348 (348) 65 protein:vir:99452 Length: 651 100.0 3.8E-72 2.3E-75 412.2 34.7 479 22-559 1-588 (651) 66 protein:vir:100187 Length: 385 100.0 9E-72 5.6E-75 410.1 36.1 380 1-501 1-385 (385) 67 protein:vir:1082 Length: 359 # 100.0 4.1E-71 2.5E-74 406.5 36.0 356 1-456 1-359 (359) 68 protein:vir:7407 Length: 392 # 100.0 7.6E-71 4.7E-74 405.0 36.1 371 1-503 3-392 (392) 69 protein:vir:4854 Length: 386 # 100.0 5.4E-70 3.4E-73 400.3 35.9 381 1-502 1-386 (386) 70 protein:vir:3989 Length: 392 # 100.0 5.9E-70 3.7E-73 400.1 35.7 371 1-469 3-392 (392) 71 protein:vir:1023 Length: 392 # 100.0 5.9E-70 3.7E-73 400.1 35.7 371 1-469 3-392 (392) 72 protein:vir:4995 Length: 384 # 100.0 1.4E-69 8.9E-73 398.0 32.8 374 1-466 1-384 (384) 73 protein:vir:95965 Length: 385 100.0 9.3E-69 5.8E-72 393.5 33.3 373 1-481 1-385 (385) 74 protein:vir:9507 Length: 395 # 100.0 4.4E-68 2.7E-71 389.9 35.2 384 1-532 1-395 (395) 75 protein:vir:101289 Length: 395 100.0 4.4E-68 2.7E-71 389.9 35.2 384 1-532 1-395 (395) 76 protein:vir:100650 Length: 395 100.0 4.4E-68 2.7E-71 389.9 35.2 384 1-532 1-395 (395) 77 protein:vir:79772 Length: 648 100.0 8.3E-67 5.1E-70 382.9 38.7 483 1-559 8-539 (648) 78 protein:vir:4828 Length: 382 # 100.0 2E-66 1.3E-69 380.7 32.4 371 1-502 1-382 (382) 79 protein:vir:4952 Length: 386 # 100.0 1.3E-65 8.1E-69 376.3 35.6 381 1-502 1-386 (386) 80 protein:vir:94002 Length: 378 100.0 5.2E-66 3.3E-69 378.5 29.5 366 34-511 1-378 (378) 81 protein:vir:78310 Length: 376 100.0 1.9E-65 1.2E-68 375.4 31.6 367 25-477 1-376 (376) 82 protein:vir:93867 Length: 378 100.0 1.4E-65 8.4E-69 376.2 29.4 366 25-511 1-378 (378) 83 protein:vir:98643 Length: 395 100.0 5.7E-65 3.6E-68 372.8 32.2 381 1-503 1-395 (395) 84 protein:vir:9641 Length: 395 # 100.0 2.6E-65 1.6E-68 374.7 30.2 377 1-503 1-395 (395) 85 protein:vir:1661 Length: 378 # 100.0 3.9E-65 2.4E-68 373.7 30.2 366 34-511 1-378 (378) 86 protein:vir:4089 Length: 395 # 100.0 3.6E-64 2.3E-67 368.4 33.3 382 1-510 1-395 (395) 87 protein:vir:94869 Length: 378 100.0 3.6E-63 2.2E-66 362.9 29.8 366 1-511 1-378 (378) 88 protein:vir:858 Length: 378 # 100.0 1E-62 6.4E-66 360.4 29.7 363 25-511 1-378 (378) 89 protein:vir:267 Length: 348 # 100.0 6.2E-56 3.8E-59 323.3 30.3 331 23-432 1-348 (348) 90 protein:vir:79207 Length: 351 100.0 1.3E-55 8.3E-59 321.4 30.5 339 4-428 1-351 (351) 91 protein:vir:103971 Length: 376 100.0 2.6E-55 1.6E-58 319.8 31.5 341 10-428 1-376 (376) 92 protein:vir:79150 Length: 368 100.0 5.9E-56 3.7E-59 323.4 27.1 354 23-436 1-368 (368) 93 protein:vir:98567 Length: 340 100.0 6.2E-55 3.9E-58 317.8 31.8 328 23-425 1-340 (340) 94 protein:vir:78191 Length: 351 100.0 4.4E-55 2.7E-58 318.6 30.4 339 4-428 1-351 (351) 95 protein:vir:6058 Length: 344 # 100.0 1.4E-54 8.5E-58 315.9 31.5 329 4-426 1-344 (344) 96 protein:vir:78749 Length: 337 100.0 1.4E-54 8.6E-58 315.9 31.5 331 32-422 1-337 (337) 97 protein:vir:2013 Length: 344 # 100.0 1.1E-54 7.1E-58 316.3 28.9 331 4-426 1-344 (344) 98 protein:vir:100328 Length: 346 100.0 3E-54 1.8E-57 314.0 30.8 332 4-426 1-346 (346) 99 protein:vir:5691 Length: 344 # 100.0 3.9E-54 2.4E-57 313.4 30.9 331 4-426 1-344 (344) 100 protein:vir:78641 Length: 278 100.0 3.6E-54 2.3E-57 313.6 29.2 274 97-421 1-278 (278) 101 protein:vir:3743 Length: 345 # 100.0 4.7E-53 2.9E-56 307.5 31.6 330 23-423 1-345 (345) 102 protein:vir:1150 Length: 350 # 100.0 4.7E-53 2.9E-56 307.5 31.0 338 1-421 1-350 (350) 103 protein:vir:3780 Length: 345 # 100.0 5.4E-53 3.3E-56 307.1 28.9 332 23-425 1-345 (345) 104 protein:vir:98853 Length: 219 100.0 2.3E-45 1.4E-48 265.3 20.6 214 192-427 1-219 (219) 105 protein:vir:4698 Length: 251 # 100.0 1.8E-44 1.1E-47 260.5 23.0 248 1-320 1-251 (251) 106 protein:vir:5249 Length: 437 # 99.9 1.8E-24 1.1E-27 150.8 35.0 411 1-513 1-437 (437) 107 protein:vir:107742 Length: 537 99.9 8.9E-23 5.5E-26 141.5 34.3 460 1-546 25-537 (537) 108 protein:vir:94049 Length: 532 99.9 2.9E-21 1.8E-24 133.2 39.2 473 11-556 1-532 (532) 109 protein:vir:99563 Length: 862 99.8 5.4E-19 3.3E-22 120.7 34.5 491 1-559 36-608 (862) 110 protein:vir:79538 Length: 502 99.8 5.3E-18 3.3E-21 115.3 31.7 449 1-528 1-502 (502) 111 protein:vir:108215 Length: 469 99.8 1.8E-16 1.1E-19 106.9 38.4 425 36-547 1-469 (469) 112 protein:vir:104338 Length: 422 99.8 9.4E-18 5.9E-21 113.9 31.4 392 37-510 1-422 (422) 113 protein:vir:79647 Length: 435 99.8 2.1E-17 1.3E-20 112.0 32.3 403 1-513 1-435 (435) 114 protein:vir:107662 Length: 427 99.7 5.4E-17 3.3E-20 109.8 31.2 393 40-512 1-427 (427) 115 protein:vir:80040 Length: 461 99.7 3.3E-17 2E-20 111.0 29.8 411 25-519 1-461 (461) 116 protein:vir:96068 Length: 765 99.7 1.3E-16 7.8E-20 107.7 32.1 500 1-559 1-581 (765) 117 protein:vir:95254 Length: 488 99.7 4.3E-15 2.7E-18 99.3 35.9 434 29-533 1-488 (488) 118 protein:vir:103860 Length: 528 99.7 3.1E-15 1.9E-18 100.1 34.4 462 1-559 2-485 (528) 119 protein:vir:1986 Length: 512 # 99.7 1E-14 6.2E-18 97.3 35.6 440 1-559 2-472 (512) 120 protein:vir:96738 Length: 505 99.7 1.9E-16 1.2E-19 106.8 26.1 450 1-524 8-505 (505) 121 protein:vir:99232 Length: 526 99.7 9.3E-15 5.8E-18 97.5 34.7 448 1-559 2-483 (526) 122 protein:vir:79233 Length: 526 99.7 1.4E-14 8.5E-18 96.6 35.5 461 1-559 2-483 (526) 123 protein:vir:99853 Length: 488 99.7 6.3E-15 3.9E-18 98.4 33.7 424 23-559 1-441 (488) 124 protein:vir:95542 Length: 548 99.6 6.7E-16 4.2E-19 103.8 27.5 485 1-547 1-548 (548) 125 protein:vir:79511 Length: 448 99.6 3E-15 1.9E-18 100.2 31.0 416 23-519 1-448 (448) 126 protein:vir:77981 Length: 448 99.6 5E-14 3.1E-17 93.5 31.2 418 23-519 1-448 (448) 127 protein:vir:389 Length: 530 # 99.6 2.5E-14 1.5E-17 95.2 28.3 460 1-523 1-530 (530) 128 protein:vir:6382 Length: 553 # 99.6 1.5E-14 9E-18 96.5 27.0 457 1-527 2-553 (553) 129 protein:vir:3420 Length: 533 # 99.5 5.4E-14 3.3E-17 93.3 28.0 470 11-530 1-533 (533) 130 protein:vir:98816 Length: 446 99.5 1.3E-13 7.9E-17 91.3 30.0 386 23-460 1-446 (446) 131 protein:vir:107880 Length: 491 99.5 2.2E-12 1.4E-15 84.5 34.5 435 1-559 3-454 (491) 132 protein:vir:10321 Length: 495 99.5 9.1E-14 5.7E-17 92.1 25.6 437 1-523 1-495 (495) 133 protein:vir:105782 Length: 449 99.5 5.8E-13 3.6E-16 87.7 29.5 409 1-514 1-449 (449) 134 protein:vir:106716 Length: 698 99.5 1.5E-12 9E-16 85.5 30.3 497 1-559 52-590 (698) 135 protein:vir:3648 Length: 695 # 99.4 8.1E-12 5E-15 81.4 31.0 495 1-559 52-607 (695) 136 protein:vir:79063 Length: 491 99.4 2.8E-11 1.7E-14 78.4 37.3 434 1-559 3-454 (491) 137 protein:vir:78589 Length: 695 99.4 3.3E-11 2E-14 78.1 31.5 495 1-559 52-607 (695) 138 protein:vir:101541 Length: 694 99.3 7E-11 4.3E-14 76.3 31.8 498 1-559 41-606 (694) 139 protein:vir:78161 Length: 355 99.2 1.1E-09 6.7E-13 69.7 30.0 328 170-540 1-355 (355) 140 protein:vir:5839 Length: 533 # 99.1 1.1E-09 7.1E-13 69.6 26.1 482 1-559 1-525 (533) 141 protein:vir:98444 Length: 434 99.1 1E-09 6.5E-13 69.8 25.9 382 61-540 1-434 (434) 142 protein:vir:106639 Length: 481 99.1 1.5E-09 9.1E-13 69.0 25.7 430 1-529 6-481 (481) 143 protein:vir:95113 Length: 474 99.0 4E-09 2.5E-12 66.7 26.2 420 1-513 2-474 (474) 144 protein:vir:99916 Length: 504 99.0 8.6E-09 5.3E-12 64.8 31.9 437 1-540 1-504 (504) 145 protein:vir:1236 Length: 483 # 99.0 8.7E-09 5.4E-12 64.8 28.0 423 1-513 9-483 (483) 146 protein:vir:93747 Length: 472 99.0 9.8E-09 6.1E-12 64.5 28.0 423 1-515 5-472 (472) 147 protein:vir:104082 Length: 485 99.0 1.1E-08 6.6E-12 64.3 29.6 429 7-541 1-485 (485) 148 protein:vir:4223 Length: 486 # 98.9 1.2E-08 7.2E-12 64.1 30.5 433 1-557 1-486 (486) 149 protein:vir:7768 Length: 484 # 98.9 2.4E-08 1.5E-11 62.4 29.1 435 1-529 1-484 (484) 150 protein:vir:97336 Length: 492 98.9 2.5E-08 1.6E-11 62.2 27.2 423 1-516 18-492 (492) 151 protein:vir:7987 Length: 456 # 98.8 2.4E-08 1.5E-11 62.4 25.3 409 11-511 1-456 (456) 152 protein:vir:95806 Length: 440 98.8 4.1E-08 2.6E-11 61.1 29.9 416 3-511 1-440 (440) 153 protein:vir:94101 Length: 474 98.8 4.3E-08 2.6E-11 61.0 30.9 432 1-513 1-474 (474) 154 protein:vir:105889 Length: 474 98.8 4.3E-08 2.6E-11 61.0 30.9 432 1-513 1-474 (474) 155 protein:vir:97171 Length: 512 98.8 4.8E-08 3E-11 60.7 31.2 440 1-518 18-512 (512) 156 protein:vir:94805 Length: 492 98.8 6.1E-08 3.8E-11 60.1 30.0 423 1-515 18-492 (492) 157 protein:vir:5961 Length: 503 # 98.8 6.2E-08 3.8E-11 60.1 32.4 443 1-540 2-503 (503) 158 protein:vir:95899 Length: 474 98.8 6.5E-08 4.1E-11 60.0 25.8 423 1-513 2-474 (474) 159 protein:vir:96266 Length: 474 98.8 6.5E-08 4.1E-11 60.0 25.8 423 1-513 2-474 (474) 160 protein:vir:9815 Length: 500 # 98.7 4.8E-08 3E-11 60.7 23.9 413 1-515 1-500 (500) 161 protein:vir:3028 Length: 500 # 98.7 4.8E-08 3E-11 60.7 23.9 413 1-515 1-500 (500) 162 protein:vir:99781 Length: 511 98.7 6.9E-08 4.3E-11 59.8 29.4 443 1-528 15-511 (511) 163 protein:vir:9306 Length: 511 # 98.7 7.2E-08 4.4E-11 59.8 30.8 440 1-518 15-511 (511) 164 protein:vir:80680 Length: 441 98.7 8E-08 5E-11 59.5 28.8 382 68-506 1-441 (441) 165 protein:vir:4898 Length: 502 # 98.7 8.3E-08 5.1E-11 59.4 30.1 436 1-528 23-502 (502) 166 protein:vir:103951 Length: 511 98.7 8.3E-08 5.2E-11 59.4 31.4 440 1-539 15-511 (511) 167 protein:vir:96240 Length: 511 98.7 8.6E-08 5.3E-11 59.3 31.0 441 1-516 15-511 (511) 168 protein:vir:2427 Length: 485 # 98.7 1E-07 6.4E-11 58.9 30.3 433 4-541 1-485 (485) 169 protein:vir:96366 Length: 511 98.7 1.1E-07 6.7E-11 58.8 29.8 440 1-539 15-511 (511) 170 protein:vir:78805 Length: 511 98.7 1.1E-07 6.7E-11 58.8 29.8 440 1-539 15-511 (511) 171 protein:vir:78537 Length: 480 98.7 1.2E-07 7.4E-11 58.6 28.5 392 64-542 1-480 (480) 172 protein:vir:4782 Length: 522 # 98.7 1.3E-07 7.9E-11 58.4 25.7 424 1-529 1-522 (522) 173 protein:vir:99072 Length: 479 98.7 1.4E-07 8.7E-11 58.1 29.1 432 8-541 1-479 (479) 174 protein:vir:38 Length: 496 # N 98.6 1.5E-07 9.4E-11 58.0 28.4 421 1-504 3-496 (496) 175 protein:vir:94742 Length: 409 98.6 1.6E-07 9.8E-11 57.9 30.3 351 43-456 1-409 (409) 176 protein:vir:2732 Length: 501 # 98.6 1.6E-07 1E-10 57.8 32.2 440 1-521 22-501 (501) 177 protein:vir:8654 Length: 629 # 98.6 1.8E-08 1.1E-11 63.1 17.6 482 22-559 1-554 (629) 178 protein:vir:99088 Length: 629 98.6 2E-08 1.3E-11 62.7 17.4 482 22-559 1-554 (629) 179 protein:vir:2341 Length: 488 # 98.6 2.8E-07 1.7E-10 56.5 31.4 432 10-515 1-488 (488) 180 protein:vir:105819 Length: 456 98.6 3E-07 1.8E-10 56.4 26.5 410 11-511 1-456 (456) 181 protein:vir:102602 Length: 456 98.6 3E-07 1.8E-10 56.4 26.5 410 11-511 1-456 (456) 182 protein:vir:99522 Length: 470 98.5 3.3E-07 2E-10 56.1 31.1 424 1-517 1-470 (470) 183 protein:vir:96494 Length: 501 98.5 5.2E-07 3.2E-10 55.0 30.5 440 1-528 22-501 (501) 184 protein:vir:101806 Length: 516 98.5 3.1E-07 1.9E-10 56.3 20.7 449 1-513 1-516 (516) 185 protein:vir:101189 Length: 516 98.5 3.1E-07 1.9E-10 56.3 20.7 449 1-513 1-516 (516) 186 protein:vir:94498 Length: 474 98.5 5.9E-07 3.7E-10 54.7 32.1 412 52-513 1-474 (474) 187 protein:vir:97447 Length: 474 98.5 5.9E-07 3.7E-10 54.7 32.1 412 52-513 1-474 (474) 188 protein:vir:102426 Length: 631 98.4 1.3E-07 8E-11 58.4 18.5 474 22-559 1-554 (631) 189 protein:vir:3964 Length: 453 # 98.4 7.6E-07 4.7E-10 54.1 29.3 421 1-511 1-453 (453) 190 protein:vir:78227 Length: 480 98.4 7.8E-07 4.8E-10 54.1 28.3 392 64-542 1-480 (480) 191 protein:vir:79703 Length: 505 98.4 8.8E-07 5.5E-10 53.8 31.3 411 1-497 1-505 (505) 192 protein:vir:98883 Length: 517 98.4 9.8E-07 6.1E-10 53.5 29.7 425 1-513 1-517 (517) 193 protein:vir:1634 Length: 409 # 98.4 9.8E-07 6.1E-10 53.5 29.6 350 43-456 1-409 (409) 194 protein:vir:2500 Length: 501 # 98.4 1.1E-06 6.5E-10 53.4 29.0 433 1-530 5-501 (501) 195 protein:vir:80959 Length: 499 98.4 1.1E-06 6.8E-10 53.3 29.4 414 1-513 3-499 (499) 196 protein:vir:100598 Length: 516 98.4 1.1E-06 6.9E-10 53.2 23.4 449 1-513 1-516 (516) 197 protein:vir:98265 Length: 524 98.3 1.1E-06 7.1E-10 53.2 22.2 448 1-513 1-524 (524) 198 protein:vir:8184 Length: 474 # 98.3 1.2E-06 7.7E-10 53.0 31.0 414 1-499 1-474 (474) 199 protein:vir:5665 Length: 511 # 98.3 1.3E-06 8.1E-10 52.9 23.0 442 1-513 1-511 (511) 200 protein:vir:78083 Length: 537 98.3 1.4E-06 8.6E-10 52.7 32.3 461 1-554 8-537 (537) 201 protein:vir:9871 Length: 429 # 98.3 1.7E-06 1.1E-09 52.2 29.4 373 81-508 1-429 (429) 202 protein:vir:79043 Length: 479 98.3 2E-06 1.2E-09 51.9 32.3 422 1-512 9-479 (479) 203 protein:vir:97900 Length: 639 98.3 3.9E-07 2.4E-10 55.7 17.0 490 22-559 1-561 (639) 204 protein:vir:107517 Length: 639 98.3 3.9E-07 2.4E-10 55.7 17.0 490 22-559 1-561 (639) 205 protein:vir:103458 Length: 524 98.2 2.1E-06 1.3E-09 51.8 21.4 453 1-513 1-524 (524) 206 protein:vir:7208 Length: 524 # 98.2 2.1E-06 1.3E-09 51.7 21.4 453 1-513 1-524 (524) 207 protein:vir:1587 Length: 508 # 98.2 2.4E-06 1.5E-09 51.4 30.3 427 1-511 1-508 (508) 208 protein:vir:108049 Length: 524 98.2 1.5E-06 9.4E-10 52.5 19.4 452 1-513 1-524 (524) 209 protein:vir:96839 Length: 474 98.2 2.7E-06 1.7E-09 51.1 31.1 420 1-533 1-474 (474) 210 protein:vir:106571 Length: 499 98.2 3E-06 1.9E-09 50.8 30.3 440 1-542 1-499 (499) 211 protein:vir:81017 Length: 521 98.2 3.2E-06 2E-09 50.7 23.1 450 1-513 5-521 (521) 212 protein:vir:9751 Length: 422 # 98.2 3.3E-06 2E-09 50.7 28.6 364 43-480 1-422 (422) 213 protein:vir:106027 Length: 629 98.2 2.4E-06 1.5E-09 51.4 19.5 482 22-559 1-548 (629) 214 protein:vir:6596 Length: 521 # 98.1 4.3E-06 2.6E-09 50.0 23.7 450 1-513 5-521 (521) 215 protein:vir:105292 Length: 478 98.1 4.4E-06 2.7E-09 50.0 32.8 397 41-513 1-478 (478) 216 protein:vir:9568 Length: 410 # 98.1 4.5E-06 2.8E-09 49.9 28.2 361 36-487 1-410 (410) 217 protein:vir:733 Length: 453 # 98.1 4.5E-06 2.8E-09 49.9 29.1 423 1-519 1-453 (453) 218 protein:vir:106491 Length: 646 98.1 4.6E-06 2.8E-09 49.9 23.0 480 1-559 1-550 (646) 219 protein:vir:3609 Length: 452 # 98.1 4.8E-06 3E-09 49.7 30.7 416 1-511 1-452 (452) 220 protein:vir:105461 Length: 470 98.1 5.8E-06 3.6E-09 49.3 25.9 413 11-512 1-470 (470) 221 protein:vir:94546 Length: 506 98.0 6.5E-06 4.1E-09 49.0 25.2 441 1-537 3-506 (506) 222 protein:vir:9922 Length: 489 # 98.0 8.8E-06 5.5E-09 48.3 27.5 430 1-512 1-489 (489) 223 protein:vir:103219 Length: 201 97.9 2.1E-06 1.3E-09 51.8 15.2 192 277-510 1-201 (201) 224 protein:vir:107112 Length: 478 97.8 1.6E-05 9.8E-09 46.9 32.8 421 1-513 1-478 (478) 225 protein:vir:106282 Length: 521 97.8 2.2E-05 1.4E-08 46.1 22.7 446 1-513 1-521 (521) 226 protein:vir:78907 Length: 518 97.7 2.3E-05 1.4E-08 46.0 33.7 432 1-529 1-518 (518) 227 protein:vir:6896 Length: 523 # 97.6 3.6E-05 2.2E-08 44.9 18.9 452 1-513 1-523 (523) 228 protein:vir:104892 Length: 558 97.4 7.1E-05 4.4E-08 43.3 22.8 479 1-541 1-558 (558) 229 protein:vir:104500 Length: 537 97.4 8.2E-05 5.1E-08 43.0 25.9 456 23-546 1-537 (537) 230 protein:vir:102950 Length: 471 97.3 0.0001 6.5E-08 42.4 30.4 415 1-504 1-471 (471) 231 protein:vir:96179 Length: 468 97.1 0.00019 1.2E-07 41.0 30.5 411 1-504 1-468 (468) 232 protein:vir:106999 Length: 564 96.9 0.00027 1.7E-07 40.2 22.8 476 1-538 1-564 (564) 233 protein:vir:101418 Length: 569 96.8 0.0003 1.9E-07 39.9 17.9 460 22-529 1-569 (569) 234 protein:vir:103177 Length: 533 96.7 0.00038 2.4E-07 39.3 22.1 466 1-546 1-533 (533) 235 protein:vir:94709 Length: 522 96.1 0.00097 6E-07 37.1 27.4 433 1-535 1-522 (522) 236 protein:vir:3361 Length: 535 # 96.0 0.0011 6.8E-07 36.8 24.4 449 1-541 1-535 (535) 237 protein:vir:4073 Length: 279 # 95.3 0.00036 2.2E-07 39.5 8.3 274 99-459 1-279 (279) 238 protein:vir:1538 Length: 535 # 95.2 0.0025 1.5E-06 34.9 23.9 449 1-541 1-535 (535) 239 protein:vir:102330 Length: 451 94.1 0.0053 3.3E-06 33.0 29.6 383 43-500 1-451 (451) 240 protein:vir:102239 Length: 527 91.2 0.017 1.1E-05 30.3 26.1 420 41-536 1-527 (527) 241 protein:vir:101494 Length: 527 91.2 0.017 1.1E-05 30.3 26.1 420 41-536 1-527 (527) 242 protein:vir:100039 Length: 522 90.9 0.019 1.1E-05 30.1 20.8 430 1-539 1-522 (522) 243 protein:vir:2198 Length: 536 # 88.2 0.034 2.1E-05 28.7 22.9 448 4-556 1-536 (536) 244 protein:vir:78696 Length: 542 87.7 0.037 2.3E-05 28.4 23.3 428 1-525 1-542 (542) 245 protein:vir:10447 Length: 536 82.8 0.074 4.6E-05 26.8 23.0 447 4-520 1-536 (536) 246 protein:vir:102668 Length: 547 82.6 0.075 4.7E-05 26.7 25.3 433 1-545 4-547 (547) 247 protein:vir:103765 Length: 549 81.5 0.085 5.3E-05 26.5 22.7 443 1-541 1-549 (549) 248 protein:vir:8883 Length: 543 # 72.7 0.18 0.00011 24.7 22.0 449 1-535 1-543 (543) 249 protein:vir:1785 Length: 555 # 68.4 0.24 0.00015 24.0 26.6 439 1-559 1-550 (555) 250 protein:vir:97376 Length: 320 53.3 0.53 0.00033 22.1 12.9 301 34-463 1-320 (320) 251 protein:vir:94572 Length: 535 39.8 1 0.00062 20.6 21.5 448 1-528 1-535 (535) 252 protein:vir:6322 Length: 510 # 36.0 1.2 0.00074 20.2 24.4 414 1-530 1-510 (510) 253 protein:vir:96988 Length: 516 35.4 1.2 0.00076 20.1 21.1 427 1-543 1-516 (516) 254 protein:vir:95315 Length: 559 33.5 1.3 0.00084 19.9 28.4 458 1-545 1-559 (559) 255 protein:vir:7321 Length: 556 # 31.8 1.5 0.00091 19.7 28.0 448 1-538 1-556 (556) 256 protein:vir:7017 Length: 515 # 28.1 1.8 0.0011 19.2 25.4 425 1-544 1-515 (515) 257 protein:vir:78942 Length: 510 21.7 2.6 0.0016 18.3 26.7 415 20-520 1-510 (510) No 1 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=100.00 E-value=2.2e-124 Score=698.48 Aligned_cols=529 Identities=46% Similarity=0.742 Sum_probs=472.8 Q ss_pred CcchhhhccccccCCcchHHHHHHH---------HHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcc Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDS---------KIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFG 71 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 71 (559) |+||+||| +...+....+.++++ .+...+++|++.++.+|+.+|.++++.+..++. ++|++.+.. T Consensus 5 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~~~~~~~~~~~----~r~~~~~~~ 78 (551) T protein:vir:80 5 LGLFESIR--LVGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFK----TKPSIRNNQ 78 (551) T ss_pred hhhHHHhh--hccCChhhcccccccccceeeecccccHHHHHHhhccCcceeecccccceecCcccc----cCccccChh Confidence 99999999 555566666666666 677788999999999999999998887776644 566666677 Q ss_pred cHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeeccccc-ccChhHHHHHHHHHHHHHhcCCCCCCChh Q lcl|NC_012530. 72 RITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGD-KPTKEQQKKIDYAERYIERMGVDYSPIRD 150 (559) Q Consensus 72 ~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~-~~~~~~~~~~~~~~~~L~~~~p~~~~~~~ 150 (559) .+..+++.++++|+|++||++||++||++++++....+|++|.|++++.. +..++..++++.+.+||++|+++++++++ T Consensus 79 ~l~~~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p~~~ 158 (551) T protein:vir:80 79 DLHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRD 158 (551) T ss_pred HHHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCCCccc Confidence 88899999999999999999999999999999999999999999999875 67788889999999999999999999988 Q ss_pred hHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEe Q lcl|NC_012530. 151 DFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFI 230 (559) Q Consensus 151 ~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~ 230 (559) +|++|+++++.|+|++||+|++++||..|+|++||||+|.+|++..+.+|+......+|+++..+.....|+++||||++ T Consensus 159 s~~~f~~~lv~dlll~Gnay~~i~rd~~G~~~~L~~l~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~eiiH~~ 238 (551) T protein:vir:80 159 SFSSFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVIDQKIVATFNAREMAFAV 238 (551) T ss_pred hHHHHHHHHHHHHHhcCCEEEEEEECCCCcEEEEEEeCCceeEEEECCccccccCceEEEEEeCCcEEEEEcccceEEec Confidence 99999999999999999999999999999999999999999999999999877777788888888888899999999999 Q ss_pred cccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccc Q lcl|NC_012530. 231 RNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGING 310 (559) Q Consensus 231 ~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~n 310 (559) +||.++..+++||+|||.+++.+|..+.++++|+.+||+||++|+|||++++. ..++++++++|+++|++.++|.+| T Consensus 239 ~n~~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~---~~lt~e~~~~lk~~~~~~~~G~~n 315 (551) T protein:vir:80 239 RNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAA---QQQSQHALEIFKREWKNSLSGING 315 (551) T ss_pred ccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCC---CCCCHHHHHHHHHHHHHHhcCccc Confidence 99988888889999999999999999999999999999999999999998653 458999999999999999999999 Q ss_pred ccccccccCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHH Q lcl|NC_012530. 311 AYRIPMITAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKG 389 (559) Q Consensus 311 ag~~~vl~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~ 389 (559) +|++|||.+++++|++++. +.|+||+|++++++++||++|||||++||+.+.+++++..+++++++|++++...|+++| T Consensus 316 ag~~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~t 395 (551) T protein:vir:80 316 SWQIPVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKG 395 (551) T ss_pred cCccccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccchhhHHHHHHHHHHHH Confidence 9999999888899999995 799999999999999999999999999999999999989999999999999999999999 Q ss_pred hhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHcCCCCHHHHHHHhCCCC-CCCCCEeeccceec Q lcl|NC_012530. 390 LMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQTATTVNDYREKQGLPK-IAGGDIILSAVYIQ 468 (559) Q Consensus 390 l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~~~T~NE~R~~~gl~p-i~gGD~~~~~~~~~ 468 (559) |+||+.+||++||++|++.+ +..++|+|+.+++.+..+++++++...+||||+||+|+++|||| +||||++++|++++ T Consensus 396 L~P~~~~ie~~ln~~L~~~~-~~~~~f~f~~~~~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~ 474 (551) T protein:vir:80 396 LQPLLGFIEDFINKHIVAEF-GDKYTFQFVGGDIKSELESVKILAEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQ 474 (551) T ss_pred HHHHHHHHHHHHHhhhcccc-CCceEEEeeccChhhHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCceeecccccc Confidence 99999999999999999865 45789999999999999999988877788999999999999998 79999999999999 Q ss_pred ccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhccccccccccccccccccccccccccccccchh Q lcl|NC_012530. 469 RLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQLKNKKNT 548 (559) Q Consensus 469 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~k~~~~~ 548 (559) +++...+..+.+.+..+...+...+..+.++.++++.. +++++++...++|++.|++++. T Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------------p~~~~~~~~~~~~~~~~~~~~~ 534 (551) T protein:vir:80 475 RIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDI--------------------PDGKDTTGDIGKDGQRKDKDNA 534 (551) T ss_pred cccccccccCcchhhhhhccccccCcCCCCCCCCCCCC--------------------CCccccCCCccccccccCcccc Confidence 99887777776666666666665555544444444322 3344456678899999999999 Q ss_pred hhhhccCCCCC Q lcl|NC_012530. 549 NSYKQGGSSKK 559 (559) Q Consensus 549 ~~~~~~~~~~~ 559 (559) |++||+++|.| T Consensus 535 ~~~~~~~~~~~ 545 (551) T protein:vir:80 535 NAGKQGMKGDK 545 (551) T ss_pred chhhhhcCCCC Confidence 99999999999 No 2 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=100.00 E-value=5.3e-122 Score=685.47 Aligned_cols=531 Identities=45% Similarity=0.727 Sum_probs=466.7 Q ss_pred Ccchhhhccccc-------cCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccH Q lcl|NC_012530. 1 MGIFDRFRTKFY-------TDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRI 73 (559) Q Consensus 1 ~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~ 73 (559) |+||+|||..+. ..++++++.---..+....++|++.+++++|.+|.+++..++.++. ++|++.+...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~g~~----~~~~~~~~~~l 76 (547) T protein:vir:63 1 MGLFESIRLAGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFK----TKPSIRNNQDL 76 (547) T ss_pred CchhhhhhhhcCCccccccccccccccchhhhhhhHHHHHHhhcccchhhhchhhheeecccccc----cCCccCChhHH Confidence 999999996553 3444455442233455668899999999999999999888877654 66777777788 Q ss_pred HHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeeccccc-ccChhHHHHHHHHHHHHHhcCCCCCCChhhH Q lcl|NC_012530. 74 TDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGD-KPTKEQQKKIDYAERYIERMGVDYSPIRDDF 152 (559) Q Consensus 74 ~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~-~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~ 152 (559) .++++.++.+|+|++||++||++||++|.+...+..+.+|++++++.. +.+++...+++.+.+||++|++++++++++| T Consensus 77 ~~l~~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~~s~ 156 (547) T protein:vir:63 77 HGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSF 156 (547) T ss_pred HHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCccchH Confidence 999999999999999999999999999999999999999999998865 5678888899999999999999999988899 Q ss_pred HHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecc Q lcl|NC_012530. 153 TSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRN 232 (559) Q Consensus 153 ~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n 232 (559) ++|+++++.++|++||+|++++||.+|+|++||||+|.+|++..+.+|+......+|+++.++.....|+++||||+++| T Consensus 157 ~~f~~~lv~d~ll~Gn~~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~eiih~r~n 236 (547) T protein:vir:63 157 SSFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVIDQKIVATFNAREMAFAVRN 236 (547) T ss_pred HHHHHHHHHHHHhhCCEEEEEEECCCCcEEEEEEecCceeEEEECCccccccCceEEEEEcCCcEEEEeccccEEEeccc Confidence 99999999999999999999999999999999999999999999999988777788999888888889999999999999 Q ss_pred cCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccc Q lcl|NC_012530. 233 PRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAY 312 (559) Q Consensus 233 ~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag 312 (559) |..+...++||+|||.+++.+|..+.++++|+.+||+||++|+|||++++. ..++++++++|+++|++.++|.+|+| T Consensus 237 ~~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~---~~ls~e~~~~lk~~~~~~~~G~~nag 313 (547) T protein:vir:63 237 PRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAA---QQQSQHALEIFKREWKNSLSGINGSW 313 (547) T ss_pred CCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCC---CCCCHHHHHHHHHHHHHHhcCccccc Confidence 998888889999999999999999999999999999999999999998753 45899999999999999999999999 Q ss_pred ccccccCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhh Q lcl|NC_012530. 313 RIPMITAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLM 391 (559) Q Consensus 313 ~~~vl~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~ 391 (559) ++|||.+++++|++++. +.|+||+|++++++++||++|||||++||+.+.+++++..+++.+++|++++...|+++||+ T Consensus 314 k~~vl~~~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~~~tL~ 393 (547) T protein:vir:63 314 QIPVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQ 393 (547) T ss_pred ccccccCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHH Confidence 99999888899999995 79999999999999999999999999999999999888889999999999999999999999 Q ss_pred HHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHcCCCCHHHHHHHhCCCC-CCCCCEeeccceeccc Q lcl|NC_012530. 392 PLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQTATTVNDYREKQGLPK-IAGGDIILSAVYIQRL 470 (559) Q Consensus 392 P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~~~T~NE~R~~~gl~p-i~gGD~~~~~~~~~~l 470 (559) ||+++||++||++|++.. +..++|+|+.+++.+..+++++++...+|+||+||+|+++|||| +||||++++++++.++ T Consensus 394 P~~~~ie~~ln~~L~~~~-~~~~~~~f~~~~~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~ 472 (547) T protein:vir:63 394 PLLGFIEDFINKHIVAEF-GDKYTFQFVGGDIKSELESVKILAEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRI 472 (547) T ss_pred HHHHHHHHHHHhhccccc-CCceEEEeeccccccHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCceeecccccccc Confidence 999999999999999765 45789999999999999998888777788999999999999998 7999999999999998 Q ss_pred ccccccccccccccccccccccccCCCCCCCCCCCCccccccchhccccccccccccccccccccccccccccccchhhh Q lcl|NC_012530. 471 GQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQLKNKKNTNS 550 (559) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~k~~~~~~~ 550 (559) +...+..+.+.+..+.....+.+..+.++.++++ .++.+++++..+++|+..|++++.|+ T Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------------~~~~~~~~~~~~~~d~~~~~~~~~~~ 532 (547) T protein:vir:63 473 GQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVE--------------------DIPDGKDTTGDIGKDGQRKDKDNANA 532 (547) T ss_pred cccccccCCccccchhhccccccccCCCCCCCCC--------------------CCCCCcccCCCcCccccccCccccch Confidence 8776666555555555555544444433333332 22334455667899999999999999 Q ss_pred hhccCCCCC Q lcl|NC_012530. 551 YKQGGSSKK 559 (559) Q Consensus 551 ~~~~~~~~~ 559 (559) +||+++|+| T Consensus 533 ~~~~~~~~~ 541 (547) T protein:vir:63 533 GKQGMKGDK 541 (547) T ss_pred hhhhcCCCC Confidence 999999999 No 3 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=100.00 E-value=9.9e-119 Score=667.52 Aligned_cols=540 Identities=43% Similarity=0.717 Sum_probs=453.5 Q ss_pred Ccchhhhc----------cccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCC-C Q lcl|NC_012530. 1 MGIFDRFR----------TKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPI-A 69 (559) Q Consensus 1 ~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~-~ 69 (559) -+||+||| +-.++|+++++|+++++. +..++|+++|+++|+.+|++++++.+.+++ .+|++. . T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~a~~~p~~~~~~~~~~~~----~~p~~~~~ 79 (576) T protein:vir:96 6 ADIFKRLRLGRDYEDIIDTVPIDDGLQANIRNIEEK--SKELNKSLYGKQQAYAEPFLEVMDTNPEFR----TKRSYMKN 79 (576) T ss_pred HHHHHHHhccCccccchhhhhcccChhHHHHHhhhh--hhhhccccCCccchhhcceeeeeecCCCcc----ccCcchhh Confidence 79999999 777799999999999985 777999999999999999998876666554 445543 3 Q ss_pred cccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeeccccc-ccChhHHHHHHHHHHHHHhcCCCCCCC Q lcl|NC_012530. 70 FGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGD-KPTKEQQKKIDYAERYIERMGVDYSPI 148 (559) Q Consensus 70 ~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~-~~~~~~~~~~~~~~~~L~~~~p~~~~~ 148 (559) ...+..+++.+..+|+|++||++||++||++|.+...+.++.+|.|++++.+ ..+.+...+++.+.++|++++++++++ T Consensus 80 ~~~~~~~l~~~~~npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~ 159 (576) T protein:vir:96 80 SDNLHDVLKQFGNNPILNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKDID 159 (576) T ss_pred hhhhHHHHHHhhcCHHHHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCCCCCc Confidence 4567788889999999999999999999999999999999999999998886 456777888899999999999988888 Q ss_pred hhhHHHHHHHHHHHHHHcCCcceEEEEC--CCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccce Q lcl|NC_012530. 149 RDDFTSFLRKLVRDTYTYDQVNYENTYD--SNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEM 226 (559) Q Consensus 149 ~~~~~~f~~~~v~d~ll~Gna~~~i~rd--~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~ev 226 (559) +++|++|+++++.|++++||+|++++++ ..|+|++||||+|.+|++..+.+|+.+....+|+++.++.....|+++|| T Consensus 160 ~~t~~~f~~~lv~dlll~Gna~~~i~~~rd~~g~~~~L~pl~p~~V~v~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~di 239 (576) T protein:vir:96 160 RDSFQSFCRKIVRDTYTYDQVNFEKVFNKKNATTMDKFIAVDPSTIFYATDKNGKIIKGGKRFVQVINKKVVASFTSREM 239 (576) T ss_pred cccHHHHHHHHHHHHHhcCCeEEEEEEecCCCCceEEEEEeCCceeEEEECCCCceeeeeeEEEEecCCceEEEecccce Confidence 8899999999999999999999999865 45789999999999999999999988877788999999999899999999 Q ss_pred EEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhc Q lcl|NC_012530. 227 GMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSS 306 (559) Q Consensus 227 i~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~ 306 (559) ||+++++..+...++||+|||.+++.+|.+++++++|+.+||+||++|+|||++++. ..++++++++|++.|++.++ T Consensus 240 i~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~~---~~ls~e~~~~lr~~~~~~~~ 316 (576) T protein:vir:96 240 AMGIRNPRTELSSSGYGLSEVEIAMKQFIAYNNTETFNDRFFSHGGTTRGILQIKSE---QQQSQRALENFKREWKSSFS 316 (576) T ss_pred EEEeecCCCCcccCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC---CCCCHHHHHHHHHHHHHHhc Confidence 999999988888889999999999999999999999999999999999999998753 35799999999999999999 Q ss_pred CcccccccccccCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccc-cccchhhhhHHHHHHH Q lcl|NC_012530. 307 GINGAYRIPMITAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGN-KSNSLNESNNQNKIDA 384 (559) Q Consensus 307 G~~nag~~~vl~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~-~~~~~~~an~~~~~~~ 384 (559) |..|+|++|++.+++++|+++++ +.|+||+|++++++++||++|||||++||+.+.++++++ .+++.+++|++++.+. T Consensus 317 G~~nag~~p~vl~~G~~~~~ls~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~t~sn~e~~~~~ 396 (576) T protein:vir:96 317 GINGSWQVPVVMADDIKFVNMTPTANDMQFEKWLTYLINIISALYGIDPAEIGFPNRGGATGGKGGNTLNEADPGKKQQQ 396 (576) T ss_pred cccccccceeecCCCceEEeccCChhhHHHHHHHHHhHHHHHHHhCCCHHHccccccccccccccccccccccHHHHHHH Confidence 99999998777777899999995 799999999999999999999999999999998876653 4567789999999999 Q ss_pred HHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeecc Q lcl|NC_012530. 385 SKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSA 464 (559) Q Consensus 385 ~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~ 464 (559) |+++||+||+.+||++||++|++.. +..++|+|...+..+..+.+++.....+|+||+||+|+++||||+||||++++| T Consensus 397 f~~~tL~P~~~~ie~~ln~~Ll~~~-~~~~~~~f~r~d~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~piegGD~~~~~ 475 (576) T protein:vir:96 397 SQNKGLQPLLRFIEDLINTHIISEY-SDKYVFQFVGGDTKSELDKIKILQEEVKTYKTVNEARKEKGLKPIEGGDVLLDG 475 (576) T ss_pred HHHHHHHHHHHHHHHHHHhhhchhc-cCceEEEeccCCHHHHHHHHHHHHHHhcCccCHHHHHHHhCCCCCCCcceeccc Confidence 9999999999999999999999865 456788887655555444444444444688999999999999999999999999 Q ss_pred ceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhccccccccccccccccccccccccccccc Q lcl|NC_012530. 465 VYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQLKN 544 (559) Q Consensus 465 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~k~ 544 (559) +++++++......+.+.+..+...+...+........++. .++.+++ ..+++.+++++ .+..++|||++|+ T Consensus 476 ~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~-----~~s~~~~---~~g~~~~~~~~-~~~~~~~~~~~~~ 546 (576) T protein:vir:96 476 SFIQSMSLNTQKEQYEDTKQKERFDMIQQFLNSPDDEEPQ-----QESTEDK---VDGRESNDPTK-IDSPVGTDGQLKD 546 (576) T ss_pred cccccccccccCCCCCCccccccccccccccCCCCCCCCC-----CCCCCCc---ccccccccCCC-CCCccccccccCC Confidence 9999998877777777666666555543322221111111 1111222 23333333332 3444999999999 Q ss_pred cchhhhhh-----ccCCC---CC Q lcl|NC_012530. 545 KKNTNSYK-----QGGSS---KK 559 (559) Q Consensus 545 ~~~~~~~~-----~~~~~---~~ 559 (559) ++++||+| ||.|| +| T Consensus 547 ~~~~~~~~~~~~~~~~~~~~~~~ 569 (576) T protein:vir:96 547 QDNVKSQEGSNKGQGTKGKGNEK 569 (576) T ss_pred CCcccccccccccccccccCCCC Confidence 99999999 77777 55 No 4 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=100.00 E-value=1.7e-116 Score=655.27 Aligned_cols=549 Identities=43% Similarity=0.700 Sum_probs=463.8 Q ss_pred Ccchhhhcc---------------ccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccC Q lcl|NC_012530. 1 MGIFDRFRT---------------KFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKP 65 (559) Q Consensus 1 ~~~~~~~~~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p 65 (559) -+.||+-++ .+..-+.+.++.+.+. .-.+.+++++.++++++.+++.+.+++++++. .+| T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~ 76 (574) T protein:vir:80 2 PKWLDKALGIEKSSIEETRNMENYKMHLREIDTNVVNNEP-YSMESIEKGMNGKTTAYMQPIIGEMSVNPGYK----TKP 76 (574) T ss_pred cchhhhhhccchhhHHHHHhhhhhccccchhhhhhhhccC-CCHHHHHHhHhhhcccccchhhhhcccccccc----CcC Confidence 234444321 0112222233332222 11155789999999999999999888887764 456 Q ss_pred CCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccc-cChhHHHHHHHHHHHHHhcCCC Q lcl|NC_012530. 66 SPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDK-PTKEQQKKIDYAERYIERMGVD 144 (559) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~-~~~~~~~~~~~~~~~L~~~~p~ 144 (559) .+.+..++..+++.+..+++|++||++++++|++++..+..+.++++|+|+.++.+. .+.+.....+.+..||.++.++ T Consensus 77 ~~~~~~~~~~~l~~~~~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~ 156 (574) T protein:vir:80 77 SIRNSQDLHKTLKKFGNNIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQF 156 (574) T ss_pred ccCCcccHHHHHHhhccChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCccchhhhhhhHHHHHHhccCCC Confidence 666677888999999999999999999999999999999999999999999988764 4566677888999999998888 Q ss_pred CCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeeccc Q lcl|NC_012530. 145 YSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTAD 224 (559) Q Consensus 145 ~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~ 224 (559) ++|++++|++|+++++.+++++||+|++++|+..|+|++||||+|.+|++..+.+|+....+.+|+++.++.....|+++ T Consensus 157 ~nP~~~s~~ef~~~lv~~lll~Gnayi~i~r~~~G~~~~L~pl~p~~V~v~~d~~~~~~~~~~~y~~~~~g~~~~~~~~~ 236 (574) T protein:vir:80 157 RDPNRDNFTTFCKKLVRATYMYDQVNFEKVFDKDGNFIKFDTVDPTTIFLATNGEGKLIKNGERFVQVIDNRIVAKFNER 236 (574) T ss_pred CCCccccHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCccccccCceEEEEEeCCceEEEEccc Confidence 88888899999999999999999999999999999999999999999999999999888888899999999999999999 Q ss_pred ceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHH Q lcl|NC_012530. 225 EMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTAT 304 (559) Q Consensus 225 evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~ 304 (559) ||||++++|.++..+++||+|||.+++.+|..++++++|+.+||+||++|+|||+++++ ..++++++++|++.|++. T Consensus 237 eiih~~~~~~~~~~~~~~G~spi~~a~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~~~---~~ls~e~~~~lk~~~~~~ 313 (574) T protein:vir:80 237 ELAFAVRNPRADIEVGQYGYPELEIALKQFIAHENTEVFNDRFFSHGGTTRGILHVKTG---QQQSQQALDIFRREWRSS 313 (574) T ss_pred cEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC---CCCCHHHHHHHHHHHHHH Confidence 99999999998888899999999999999999999999999999999999999998653 458999999999999999 Q ss_pred hcCcccccccccccCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHH Q lcl|NC_012530. 305 SSGINGAYRIPMITAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKID 383 (559) Q Consensus 305 ~~G~~nag~~~vl~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~ 383 (559) ++|..|+|++|||++++++|++++. +.|+||+|++++++++||++|||||++||+.+.++++++++.+.+++|++++.. T Consensus 314 ~~G~~n~g~~~vl~~~G~~~~~l~~s~~D~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~~~ 393 (574) T protein:vir:80 314 LAGINGSWQIPVVSAEDVKFVNMTPSANDMQFEKWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSKEKMQ 393 (574) T ss_pred hccccccccceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhcccccccccccccccccchhHHHHHH Confidence 9999999999999888899999995 799999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeec Q lcl|NC_012530. 384 ASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQTATTVNDYREKQGLPKIAGGDIILS 463 (559) Q Consensus 384 ~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~ 463 (559) .|+++||+||+.+||++||++|++..+ ..++|+|+..+..+..+++.+.....+|+||+||+|+++||||+||||++++ T Consensus 394 ~f~~~tL~P~~~~ie~~ln~~Ll~~~~-~~~~~~f~~~d~~~~~~~~~~~~~~~~G~lT~NE~R~~lgl~Pi~gGD~~~~ 472 (574) T protein:vir:80 394 ASQNKGLQPLLRFIEDTVNTYIVAEFG-EKYQFQFRGGDLSAQLDKLKIIEQEGKVFRTVNEIRHDKGLEPIKGGDVILN 472 (574) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhcC-CceEEEecccchhhHHHHHHHHHHHhCCccCHHHHHHHhCCCCCCCCCEeee Confidence 999999999999999999999998654 5688899877666655555554444468899999999999999999999999 Q ss_pred cceecccccccccccccccccccccccccccCCCCCCCCC-----CCCccccccchhccccccccccccccccccccccc Q lcl|NC_012530. 464 AVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPP-----TLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGK 538 (559) Q Consensus 464 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 538 (559) ++++++++...+..+.+.+..+...+.+....+..++.++ +...++.+.+.......++++.+..|+- .+..++ T Consensus 473 ~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 551 (574) T protein:vir:80 473 GVHIQAIGQALQEEQLEYQRSQDRLNRLLELSGGDVEQPEPEEPKDSQNDTDVSFQDEQQGLNGKSKKVNGKV-DDNVGK 551 (574) T ss_pred ccceeecccccccccCCccchhccccccccccCCCCCCCCCCCCCCccccccchhhhhhhhhccchhhhcCCc-cccccc Confidence 9999999988777777766666666654443332222221 1223334445556666788888777664 556999 Q ss_pred cccccccchhhhhhccCCCCC Q lcl|NC_012530. 539 DGQLKNKKNTNSYKQGGSSKK 559 (559) Q Consensus 539 ~~~~k~~~~~~~~~~~~~~~~ 559 (559) ||++|+++|+|+++||++++| T Consensus 552 ~~~~~~~~~~~~~~~~~~~~~ 572 (574) T protein:vir:80 552 DGQLKSEENTNSTKHGTDGIK 572 (574) T ss_pred ccccccccccccccccCcccc Confidence 999999999999999999999 No 5 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=100.00 E-value=4.4e-112 Score=631.06 Aligned_cols=528 Identities=41% Similarity=0.669 Sum_probs=433.3 Q ss_pred Ccchhhhc-------------cccccCCcchHHHHHHHH-HHHHHHhhhhccccccccccccccccccccccccccccCC Q lcl|NC_012530. 1 MGIFDRFR-------------TKFYTDDPNAFFKHIDSK-IANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPS 66 (559) Q Consensus 1 ~~~~~~~~-------------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~ 66 (559) -.||.+|| +++ ||+++++|..+++. .....++|++.++++||.+|+...++.+.++ ..+|+ T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~----~~~~~ 76 (563) T protein:vir:95 2 ADLFKQFRLGKDYGNNSTIAQVPI-DEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEF----RDKRS 76 (563) T ss_pred hhhhhhhhcccccccccccceeec-cCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhcccccc----ccccc Confidence 67888998 455 99999999999998 7778889999999999999998877766544 34444 Q ss_pred CC-CcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccc-cChhHHHHHHHHHHHHHhcCCC Q lcl|NC_012530. 67 PI-AFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDK-PTKEQQKKIDYAERYIERMGVD 144 (559) Q Consensus 67 ~~-~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~-~~~~~~~~~~~~~~~L~~~~p~ 144 (559) .. +..++.++++.+.++++|++||++++++||++|+.++...++.+|.|++++.+. ...+..+.++.+.++|.+++++ T Consensus 77 ~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~ 156 (563) T protein:vir:95 77 YMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKD 156 (563) T ss_pred CCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCC Confidence 33 345788899999999999999999999999999999999999999999887653 4566677888999999999999 Q ss_pred CCCChhhHHHHHHHHHHHHHHcCCcceEEE--ECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeec Q lcl|NC_012530. 145 YSPIRDDFTSFLRKLVRDTYTYDQVNYENT--YDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFT 222 (559) Q Consensus 145 ~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~--rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~ 222 (559) +++++++|++|+++++.++|++||+|++++ ||..|+|++||||+|++|++..+.+|..+.....|+++.++.....|. T Consensus 157 ~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~ 236 (563) T protein:vir:95 157 KDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGKRFVQVVDKRVVASFT 236 (563) T ss_pred CCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccceeEEEEeCCceeEEec Confidence 999888999999999999999999999876 778899999999999999999999998888888899999999888999 Q ss_pred ccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHH Q lcl|NC_012530. 223 ADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWT 302 (559) Q Consensus 223 ~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~ 302 (559) ++|+||+++++..+...++||+|||.+++.+|.+++++++|+.+||+||++|+|||+++++ ..+++++++++++.|+ T Consensus 237 ~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~---~~ls~e~~~~~~~~~~ 313 (563) T protein:vir:95 237 SRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSD---QQQSQHALENFKREWK 313 (563) T ss_pred CcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCC---CCCCHHHHHHHHHHHH Confidence 9999999999988888889999999999999999999999999999999999999998753 3589999999999999 Q ss_pred HHhcCcccccccccccCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhcccccccccccc-ccchhhhhHHH Q lcl|NC_012530. 303 ATSSGINGAYRIPMITAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNK-SNSLNESNNQN 380 (559) Q Consensus 303 ~~~~G~~nag~~~vl~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~-~~~~~~an~~~ 380 (559) +.++|.+|+|++|++.+++++|++++. +.|+||+|++++++++||++|||||++||+.+.++++++. +++.+++|+++ T Consensus 314 ~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~ 393 (563) T protein:vir:95 314 SSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGK 393 (563) T ss_pred HHhccccccccceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHH Confidence 999999999999877788899999995 6999999999999999999999999999999998876544 46678899999 Q ss_pred HHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHH--HH-cCCCCHHHHHHHhCCCCCCC Q lcl|NC_012530. 381 KIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQL--EL-QTATTVNDYREKQGLPKIAG 457 (559) Q Consensus 381 ~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~--~~-~~~~T~NE~R~~~gl~pi~g 457 (559) +.+.|++.||+||+++||++||++|++.. ...++|+|... |.++++++++. ++ +|+||+||+|+++||||+|| T Consensus 394 ~~~~f~~~tL~P~l~~ie~~ln~~L~~~~-~~~~~~~f~r~---D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~g 469 (563) T protein:vir:95 394 KQQQSQNKGLQPLLRFIEDLVNRHIISEY-GDKYTFQFVGG---DTKSATDKLNILKLETQIFKTVNEAREEQGKKPIEG 469 (563) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhchhc-ccccEEEeccC---CHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCCCC Confidence 99999999999999999999999999865 45678887654 44555554432 23 57899999999999999999 Q ss_pred CCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhcccccccccccccccccccccc Q lcl|NC_012530. 458 GDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVG 537 (559) Q Consensus 458 GD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 537 (559) ||++++|++++++++..+....+.+..+...+...+...++.+.++...+. ..+++++.++ T Consensus 470 GD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~--------------- 530 (563) T protein:vir:95 470 GDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQST----DSSNDDKEIG--------------- 530 (563) T ss_pred cceeecccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCC----CCCCCccccc--------------- Confidence 999999999999988776666666655555554443333333222211111 1122222222 Q ss_pred ccccccccchhhhh-----hccCCCCC Q lcl|NC_012530. 538 KDGQLKNKKNTNSY-----KQGGSSKK 559 (559) Q Consensus 538 ~~~~~k~~~~~~~~-----~~~~~~~~ 559 (559) +|++.|+..+.++. +++.|++| T Consensus 531 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 557 (563) T protein:vir:95 531 TDAQIKGDDNVYRTQTSNKGQGRKGEK 557 (563) T ss_pred cccccccccccccccCccccccccCcC Confidence 23333333222222 34444444 No 6 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=100.00 E-value=4.4e-112 Score=631.06 Aligned_cols=528 Identities=41% Similarity=0.669 Sum_probs=433.3 Q ss_pred Ccchhhhc-------------cccccCCcchHHHHHHHH-HHHHHHhhhhccccccccccccccccccccccccccccCC Q lcl|NC_012530. 1 MGIFDRFR-------------TKFYTDDPNAFFKHIDSK-IANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPS 66 (559) Q Consensus 1 ~~~~~~~~-------------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~ 66 (559) -.||.+|| +++ ||+++++|..+++. .....++|++.++++||.+|+...++.+.++ ..+|+ T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~----~~~~~ 76 (563) T protein:vir:99 2 ADLFKQFRLGKDYGNNSTIAQVPI-DEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEF----RDKRS 76 (563) T ss_pred hhhhhhhhcccccccccccceeec-cCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhcccccc----ccccc Confidence 67888998 455 99999999999998 7778889999999999999998877766544 34444 Q ss_pred CC-CcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccc-cChhHHHHHHHHHHHHHhcCCC Q lcl|NC_012530. 67 PI-AFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDK-PTKEQQKKIDYAERYIERMGVD 144 (559) Q Consensus 67 ~~-~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~-~~~~~~~~~~~~~~~L~~~~p~ 144 (559) .. +..++.++++.+.++++|++||++++++||++|+.++...++.+|.|++++.+. ...+..+.++.+.++|.+++++ T Consensus 77 ~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~ 156 (563) T protein:vir:99 77 YMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKD 156 (563) T ss_pred CCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCC Confidence 33 345788899999999999999999999999999999999999999999887653 4566677888999999999999 Q ss_pred CCCChhhHHHHHHHHHHHHHHcCCcceEEE--ECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeec Q lcl|NC_012530. 145 YSPIRDDFTSFLRKLVRDTYTYDQVNYENT--YDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFT 222 (559) Q Consensus 145 ~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~--rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~ 222 (559) +++++++|++|+++++.++|++||+|++++ ||..|+|++||||+|++|++..+.+|..+.....|+++.++.....|. T Consensus 157 ~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~ 236 (563) T protein:vir:99 157 KDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGKRFVQVVDKRVVASFT 236 (563) T ss_pred CCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccceeEEEEeCCceeEEec Confidence 999888999999999999999999999876 778899999999999999999999998888888899999999888999 Q ss_pred ccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHH Q lcl|NC_012530. 223 ADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWT 302 (559) Q Consensus 223 ~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~ 302 (559) ++|+||+++++..+...++||+|||.+++.+|.+++++++|+.+||+||++|+|||+++++ ..+++++++++++.|+ T Consensus 237 ~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~---~~ls~e~~~~~~~~~~ 313 (563) T protein:vir:99 237 SRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSD---QQQSQHALENFKREWK 313 (563) T ss_pred CcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCC---CCCCHHHHHHHHHHHH Confidence 9999999999988888889999999999999999999999999999999999999998753 3589999999999999 Q ss_pred HHhcCcccccccccccCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhcccccccccccc-ccchhhhhHHH Q lcl|NC_012530. 303 ATSSGINGAYRIPMITAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNK-SNSLNESNNQN 380 (559) Q Consensus 303 ~~~~G~~nag~~~vl~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~-~~~~~~an~~~ 380 (559) +.++|.+|+|++|++.+++++|++++. +.|+||+|++++++++||++|||||++||+.+.++++++. +++.+++|+++ T Consensus 314 ~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~ 393 (563) T protein:vir:99 314 SSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGK 393 (563) T ss_pred HHhccccccccceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHH Confidence 999999999999877788899999995 6999999999999999999999999999999998876544 46678899999 Q ss_pred HHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHH--HH-cCCCCHHHHHHHhCCCCCCC Q lcl|NC_012530. 381 KIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQL--EL-QTATTVNDYREKQGLPKIAG 457 (559) Q Consensus 381 ~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~--~~-~~~~T~NE~R~~~gl~pi~g 457 (559) +.+.|++.||+||+++||++||++|++.. ...++|+|... |.++++++++. ++ +|+||+||+|+++||||+|| T Consensus 394 ~~~~f~~~tL~P~l~~ie~~ln~~L~~~~-~~~~~~~f~r~---D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~g 469 (563) T protein:vir:99 394 KQQQSQNKGLQPLLRFIEDLVNRHIISEY-GDKYTFQFVGG---DTKSATDKLNILKLETQIFKTVNEAREEQGKKPIEG 469 (563) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhchhc-ccccEEEeccC---CHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCCCC Confidence 99999999999999999999999999865 45678887654 44555554432 23 57899999999999999999 Q ss_pred CCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhcccccccccccccccccccccc Q lcl|NC_012530. 458 GDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVG 537 (559) Q Consensus 458 GD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 537 (559) ||++++|++++++++..+....+.+..+...+...+...++.+.++...+. ..+++++.++ T Consensus 470 GD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~--------------- 530 (563) T protein:vir:99 470 GDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQST----DSSNDDKEIG--------------- 530 (563) T ss_pred cceeecccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCC----CCCCCccccc--------------- Confidence 999999999999988776666666655555554443333333222211111 1122222222 Q ss_pred ccccccccchhhhh-----hccCCCCC Q lcl|NC_012530. 538 KDGQLKNKKNTNSY-----KQGGSSKK 559 (559) Q Consensus 538 ~~~~~k~~~~~~~~-----~~~~~~~~ 559 (559) +|++.|+..+.++. +++.|++| T Consensus 531 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 557 (563) T protein:vir:99 531 TDAQIKGDDNVYRTQTSNKGQGRKGEK 557 (563) T ss_pred cccccccccccccccCccccccccCcC Confidence 23333333222222 34444444 No 7 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=100.00 E-value=7.5e-98 Score=553.05 Aligned_cols=521 Identities=26% Similarity=0.416 Sum_probs=385.8 Q ss_pred Ccchhhhccccc----cCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFY----TDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDV 76 (559) Q Consensus 1 ~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~ 76 (559) |-||..+|.-|- -..-..++.+-++..+++.+.-...+ ..+...+-. ...+.-.++..+|++.+...+..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~-~~~~~~~g~~~~~~~~~~~~~~~l 75 (535) T protein:vir:10 1 MAILKDLRNAFSLSNKKSTSYIELGDYDKDIVNKAIRPGRAS----ARDTVDGID-IADGNVAGQYSVASISDVLSTKKL 75 (535) T ss_pred ChhhHHHHHHHHhhhhhhhhhHHHhhhhHHHHHhhhhhhhhh----hhccccccc-cccCCcccccccCccccccCHHHH Confidence 999888874432 11222444555555555444333222 222222211 111111246677787777788899 Q ss_pred HHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccc-cChhHHHHHHHHHHHHHhcCCCCCCChh-hHHH Q lcl|NC_012530. 77 LRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDK-PTKEQQKKIDYAERYIERMGVDYSPIRD-DFTS 154 (559) Q Consensus 77 ~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~-~~~~~~~~~~~~~~~L~~~~p~~~~~~~-~~~~ 154 (559) ++.+..+++|++||+++++.||.++.+.+.+..+.++.+++++.+. .+.+..+..+.+.++|.. .||++++.. +|.+ T Consensus 76 ~~~~~~~~~~~~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~lL~~-~PN~~~~~~~~~~~ 154 (535) T protein:vir:10 76 LKAYADNDIVQAIIRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIKRAHEIEDFIYN-TGSEYYEWRDTFPR 154 (535) T ss_pred HHHhccChhHHHHHHHHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhhhhhHHHHHHHh-CCCCCCChhHHHHH Confidence 9999999999999999999999998888888888889998887753 455666677777777754 355556554 5668 Q ss_pred HHHHHHHHHHHcC-CcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEeccc Q lcl|NC_012530. 155 FLRKLVRDTYTYD-QVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNP 233 (559) Q Consensus 155 f~~~~v~d~ll~G-na~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~ 233 (559) |+++++.|+|++| ++|++|+|+..|+|++||||+|.+|++..+.+++ .....|+++.++.....|+++||||++++| T Consensus 155 ~~~~lv~d~l~~~g~ay~~i~r~~~G~~~~L~~l~p~~V~v~~d~~~~--~~~~~~~~~~~~~~~~~~~~~eiih~~~~~ 232 (535) T protein:vir:10 155 LLTKIINDMYVQDQINIERIFKNDSNELDHFNAVDASKVVISYSPRSK--DQPRKFEQFVSETKSVKFSERNLTFINYWN 232 (535) T ss_pred HHHHHHHHHHhhCCceEEEEEECCCCcEEEEEEeCCceeEEEEcCccc--cCceEEEEEecCceeEEECcccEEEEeccC Confidence 9999999988776 6899999999999999999999999999887764 335678888888888899999999999999 Q ss_pred CCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccc Q lcl|NC_012530. 234 RSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYR 313 (559) Q Consensus 234 ~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~ 313 (559) ..+..+++||+|||.+++.+|..+.++++|+.++|+||++|+|||+++... ...++++++++|+++|++.++|.+|+|+ T Consensus 233 ~~~~~~~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~-~~~ls~e~~e~lk~~~~~~~~G~~nag~ 311 (535) T protein:vir:10 233 LSDTDRRGYGYSPVEASIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQDG-DAQANQMMLAGIRRQWTSQGSGLGGAWK 311 (535) T ss_pred CCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCC-CcccCHHHHHHHHHHHHHHhcCcccccc Confidence 888888999999999999999999999999999999999999999997642 3468999999999999999999999999 Q ss_pred cccccCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccc--cchhhhhHHHHHHHHHHHHh Q lcl|NC_012530. 314 IPMITAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKS--NSLNESNNQNKIDASKSKGL 390 (559) Q Consensus 314 ~~vl~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~--~~~~~an~~~~~~~~~~~~l 390 (559) +|||++++++|++++. +.|+||+|++++++++||++|||||++||+.+++++++... .+.+.++++++...|+++|| T Consensus 312 ~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~eIa~afgVPp~~lG~~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~~L 391 (535) T protein:vir:10 312 IPILAAKDAKFVNMTQNSRDMEFDKFLNFMIYDTAAIFQMQPEEINFPNNGGSTGKSGTKSVNEGSTAKAKLESSKDKGL 391 (535) T ss_pred cccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccccCcccccchhhhhhhhhhhHHHHHHHHHHHHH Confidence 9999888899999995 79999999999999999999999999999999999876543 34567889999999999999 Q ss_pred hHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceeccc Q lcl|NC_012530. 391 MPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRL 470 (559) Q Consensus 391 ~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l 470 (559) .||+++||++||++|++..+ ..++|+|+.+++.|.++++++++...+||||+||+|+++||||+||||++++......+ T Consensus 392 ~P~l~~ie~~ln~~Ll~~~~-~~~~f~f~~l~~~d~~~r~~~~~~~~~g~lT~NE~R~~~gl~piegGD~~~~~~~~~~~ 470 (535) T protein:vir:10 392 TPLLSFIEQVINDKIMRYVD-TDYRFSFTLGDAQDKLQEEQVWKLKLANGYFINEYRKDHGLKTVDGLDVPGFIGSAENF 470 (535) T ss_pred HHHHHHHHHHHhhhcccccC-CeEEEEeccccccCHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCccccccccchhhc Confidence 99999999999999998654 46999999999999999999999888999999999999999999999987754433322 Q ss_pred ccccccccccccccccccccccccCCCCCCCCCCCCccccccchhcccccccccc--ccc--cccccccccccccccccc Q lcl|NC_012530. 471 GQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDA--KPS--GKDNQQGVGKDGQLKNKK 546 (559) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~--g~~~~~~~~~~~~~k~~~ 546 (559) .......+ ...+.... +...+..+....+......+...++++ ++. ...++ +.+.++ T Consensus 471 ~~~~~~~~---------~~~p~~~~---~~~~~~~~~~~q~~~~~~~~~~~g~~~~~~~~~~~~~~~-------~~~~~~ 531 (535) T protein:vir:10 471 INATGFGQ---------PNVPDSSD---DSGSTLGERERQERIQHSKDYEKGKDDPKSPLPKPSESD-------DVSNNE 531 (535) T ss_pred cccccccc---------ccCCCCCC---CccccCCccccCcccccccccccCCCCCCCCCCcCCCCC-------cccccc Confidence 21100000 00000000 000000000000111111111111111 010 11111 122222 Q ss_pred hhhh Q lcl|NC_012530. 547 NTNS 550 (559) Q Consensus 547 ~~~~ 550 (559) ++-+ T Consensus 532 ~~~~ 535 (535) T protein:vir:10 532 DADT 535 (535) T ss_pred ccCC Confidence 2222 No 8 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=100.00 E-value=5.5e-90 Score=509.93 Aligned_cols=473 Identities=14% Similarity=0.117 Sum_probs=335.6 Q ss_pred hhhhccccc-----ccccccccc-ccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhh Q lcl|NC_012530. 33 SKALNGVDR-----AYTEPVDGN-LMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRAST 106 (559) Q Consensus 33 ~~~~~gr~~-----a~~~~~~~~-~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~ 106 (559) --=..|+.. +...|.... ..+.+... .... . ........|+.+++|++||++||++||++|+.+++ T Consensus 1 ~~~~~~~~~~~p~~~e~~~~~~~~~~~~~~~~---~~~~---~--~~~~~~~~a~~~~~V~acV~~IA~~iA~lpl~l~~ 72 (518) T protein:vir:10 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVG---MQLE---R--QFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMF 72 (518) T ss_pred CcccCceeecCchhhhhhhhhhcccccccccc---eecc---c--ccchhhHHHhhhHHHHHHHHHHHHhhccCceEEEE Confidence 000112211 111111111 01111000 0000 0 11122345788999999999999999999987655 Q ss_pred hcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEE Q lcl|NC_012530. 107 DDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRM 186 (559) Q Consensus 107 ~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~ 186 (559) ..++... ....+.+..++.+ ||++++ +++||+.++.+++++||+|++++|+.+|+|++||| T Consensus 73 ~~~~~~~--------------~~~~~~~~~Ll~~--PN~~~t---~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~ 133 (518) T protein:vir:10 73 TSGDTET--------------EESDTGYAKLLAD--PCEYLD---PFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMP 133 (518) T ss_pred EcCCCce--------------eccchHHHHHHcC--CCCCCC---HHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEE Confidence 4332111 1122334445554 555554 46899999999999999999999999999999999 Q ss_pred ecCceEEEEecCcccccccceEEEEEe---cCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_012530. 187 VDPTTIYFANDEHGHRRTRGKIYRQYI---DNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELF 263 (559) Q Consensus 187 l~p~~V~~~~~~~g~~~~~~~~y~~~~---~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~ 263 (559) |+|++|++..+.++... .|+... .+.....|+++||||++.+.. + ...+|+|||.+++.+|..+.+++++ T Consensus 134 l~p~~v~v~~~~~~~~~----~y~~~~~~~~~~~~~~~~~~eViHir~~s~-d--g~~~G~spi~~a~~~i~~~~a~~~~ 206 (518) T protein:vir:10 134 MHPSRVAIKRNSRTGRY----EYYFQAGAGVGTQLVSFADDEVVPIRFFNP-D--GLERGLSLMESLKSTIFSEDSSRNA 206 (518) T ss_pred ECCCceEEEEcCCCCEE----EEEEEecCCccceEEEecCCcEEEecCCCC-C--cccccccHHHHHHHHHHHHHHHHHH Confidence 99999999887654321 222221 123446789999999975432 1 2247999999999999999999999 Q ss_pred HHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccc-cchhHHHHHHHHH Q lcl|NC_012530. 264 NDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQ-AEDMQFQSWLNYL 342 (559) Q Consensus 264 ~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~-~~D~qf~e~~~~~ 342 (559) +.++|+||++|+|||+++ +.+++++++++++.|++.++|..|+|+++||++ +++|+++++ ++|+||+|+++++ T Consensus 207 ~~~~f~ng~~p~gil~~~-----~~ls~e~~~~~k~~~~~~~~G~~nag~v~vL~~-G~~~~~l~~s~~D~q~le~r~~~ 280 (518) T protein:vir:10 207 TAAMWKNAGRPNLVLRHE-----KRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEE-GMEPIPLQLTAVEMQFIEARQLN 280 (518) T ss_pred HHHHHhcCCCccEEEecC-----CCCCHHHHHHHHHHHHHHhcCccccCcceEcCC-CceEEEccCChhHHHHHHHHHHH Confidence 999999999999999875 358899999999999999999999999988855 599999995 7999999999999 Q ss_pred HHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccC-ccceeeecch Q lcl|NC_012530. 343 INIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILG-DNYMLEFVGG 421 (559) Q Consensus 343 ~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~-~~~~~~f~~l 421 (559) +++||++|||||++||+.+.++ ++|++++...|+++||+||+.+||++||++|++..+. ..++|+++.+ T Consensus 281 ~~eIa~afgVPp~~lg~~~~~t----------~sn~eq~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~~~~~~~fd~~~l 350 (518) T protein:vir:10 281 REEVCGVYDIAPPIVHILDRAT----------FSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDV 350 (518) T ss_pred HHHHHHHhCCCHHHhccCCCCC----------chhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCceEEEechhh Confidence 9999999999999999877653 5789999999999999999999999999999986543 3455566689 Q ss_pred hhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCC--CCCEeeccceecccccccccccccccccccccccccccCCCC Q lcl|NC_012530. 422 DTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIA--GGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNP 498 (559) Q Consensus 422 ~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~--gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (559) ++.|.+++++++..+++ |+||+||+|+++||||++ |||+++++.|+++++......... +..+........+.++. T Consensus 351 lr~D~~~r~~~~~~~~~~G~lT~NE~R~~~Gl~pie~~~gD~~~~~~n~~pl~~~~~~~~~g-~~~~~~~~~~~~~~~~~ 429 (518) T protein:vir:10 351 IQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEG-EEAPAPKRPASTPVASL 429 (518) T ss_pred hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeecccceecccccccccCC-CCCCCCCCCCccccccc Confidence 99999999999999986 568999999999999996 899999999999887543322111 11111111111111111 Q ss_pred CCCCCCCCccccccchhccccccccccccccccccccccccccccccchhhhhhccCCCCC Q lcl|NC_012530. 499 SGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQLKNKKNTNSYKQGGSSKK 559 (559) Q Consensus 499 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 559 (559) ++.++...+.... ....+....+...+... ....+++|...|+.+.+.+|...||++| T Consensus 430 ~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 487 (518) T protein:vir:10 430 DQSPPTSVPGLSP--TNSDRSTDSGKTEPRRL-MQKPPPKESSPKHLRAVKGAMGRGKDIK 487 (518) T ss_pred cccccccCCCCCc--ccccccccccccchhcc-ccCCCcccccchHHHHHHHHhhcCccch Confidence 1111111111110 11112222222223332 5788999999999999999999999999 No 9 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=100.00 E-value=2e-89 Score=506.92 Aligned_cols=479 Identities=14% Similarity=0.120 Sum_probs=335.6 Q ss_pred HHHHHHHHHHHhhhhcccccccccccc-ccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhh Q lcl|NC_012530. 22 HIDSKIANDTASKALNGVDRAYTEPVD-GNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEY 100 (559) Q Consensus 22 ~~~~~~~~~~~~~~~~gr~~a~~~~~~-~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~ 100 (559) +| =.+-+.+..-..+...+.+ .+..+.+.... .. .+ ........|+.+++|++||++||++||++ T Consensus 1 ~~------~~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~---~~---~~--~~~~~~~~~~~~~~V~acV~~IA~~iA~l 66 (518) T protein:vir:78 1 ML------LANGQTLSAPAMAELSPQMQDSYYYAPAVGM---QL---ER--QFSLYGGIYKNQPWVRTVIAKRAQALARL 66 (518) T ss_pred Cc------ccCceeeccchhhhhhhhhhhcccccceece---ec---cc--ccchhhHHhhhhHHHHHHHHHHHHhhccC Confidence 00 0000000000111111111 00011010000 00 00 11222356789999999999999999999 Q ss_pred hhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCc Q lcl|NC_012530. 101 AHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGR 180 (559) Q Consensus 101 ~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~ 180 (559) |+.+++..++... ....+.+..++.+ ||++++ .++||+.++.+++++||+|++|+|+..|+ T Consensus 67 p~~l~~~~~~~~~--------------~~~~~~~~~Ll~~--PN~~~t---~~~F~~~lv~~lll~Gnay~~i~r~~~G~ 127 (518) T protein:vir:78 67 PVKCMFTSGDTET--------------EEHDTGYAKLLAD--PCEYLD---PFAFWEWVASTLDIYGETYLAIQKNKSGT 127 (518) T ss_pred ceEEEEEcCCccc--------------cccchHHHHHHhC--CCCCCC---HHHHHHHHHHHHhhcCCeEEEEEEcCCCc Confidence 9876554332110 1112233444554 555554 46899999999999999999999999999 Q ss_pred EEEEEEecCceEEEEecCcccccccceEEEEEec--CceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHH Q lcl|NC_012530. 181 LSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYID--NKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHE 258 (559) Q Consensus 181 ~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~--~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~ 258 (559) |++||||+|++|++..+.++.... ++++... +.....|+++||||++.+.. + ...||+|||.+++.+|..+. T Consensus 128 ~~~L~~l~p~~Vtv~~~~~~~~~~---y~~~~~~~~~~~~~~~~~~eIiHir~~~~-d--g~~~G~Spi~~~~~~i~~~~ 201 (518) T protein:vir:78 128 PEKLMPMHPSRVAIKRNSRTGRYE---YYFQAGAGVGTQLVSFADDEVVPIRFFNP-D--GLERGLSLMESLKSTIFSED 201 (518) T ss_pred EEEEEEECCCceEEEEcCCCCEEE---EEEEecCCccceeEEecCCcEEEecCCCC-C--cccccccHHHHHHHHHHHHH Confidence 999999999999998876543321 1122222 23456799999999985332 1 12479999999999999999 Q ss_pred HHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccc-cchhHHHH Q lcl|NC_012530. 259 NTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQ-AEDMQFQS 337 (559) Q Consensus 259 ~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~-~~D~qf~e 337 (559) ++++|+.++|+||++|+|||+++ +.+++++++++++.|++.++|..|+|+++||++ +++|+++++ ++|+||+| T Consensus 202 aa~~~~~~~f~Ng~~p~gvl~~~-----~~ls~e~~~~~k~~~~~~~~G~~nag~~~vL~~-G~~~~~l~~~~~d~q~le 275 (518) T protein:vir:78 202 SSRNATAAMWKNAGRPNLVLRHE-----KRLSPEAQQRLREQFDRAHAGSSNTGKTMVVEE-GMEPIPLQLTAVEMQFIE 275 (518) T ss_pred HHHHHHHHHHhcCCCccEEEecC-----CCCCHHHHHHHHHHHHHHhcCcccCCceeEcCC-CceEEeccCChhHHHHHH Confidence 99999999999999999999875 458899999999999999999999999988855 599999996 79999999 Q ss_pred HHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccC-cccee Q lcl|NC_012530. 338 WLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILG-DNYML 416 (559) Q Consensus 338 ~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~-~~~~~ 416 (559) ++++++++||++|||||++||+.+.+ +++|++++...|+++||+||+.+||++||++|++..+. ..++| T Consensus 276 ~r~~~~~eIa~afgVPp~~lg~~~~s----------t~sn~e~~~~~f~~~tL~P~~~~ie~eln~~L~~~~~~~~~~~f 345 (518) T protein:vir:78 276 ARQLNREEVCGVYDIAPPIVHILDRA----------TFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKF 345 (518) T ss_pred HHHHHHHHHHHHhCCCHHHhccCCCC----------CchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcceEEe Confidence 99999999999999999999987654 35789999999999999999999999999999976543 34555 Q ss_pred eecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCC--CCCEeeccceecccccccccccccccccccccccccc Q lcl|NC_012530. 417 EFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIA--GGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLES 493 (559) Q Consensus 417 ~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~--gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 493 (559) +++.+++.|.+++++++..+++ |+||+||+|+++||||++ |||+++++.++++++......... ++.......... T Consensus 346 d~~~Llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~gl~pie~~~gD~~~v~~n~~pl~~~~~~~~~g-~~~~~~~~~~~~ 424 (518) T protein:vir:78 346 DIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEG-EEAPAPKRPAST 424 (518) T ss_pred echhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecccceecccccccccCC-CCCCCCCCCCcc Confidence 5568999999999999999986 568999999999999996 899999999999987654322211 111111111111 Q ss_pred cCCCCCCCCCCCCccccccchhccccccccccccccccccccccccccccccchhhhhhccCCCCC Q lcl|NC_012530. 494 ALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQLKNKKNTNSYKQGGSSKK 559 (559) Q Consensus 494 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 559 (559) +.++.++.++...+.. ++..+......++.+ +.. -....+++|...|+.+.+.+|...||++| T Consensus 425 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~-~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 487 (518) T protein:vir:78 425 PVASLDQSPPASVPGL-SPTNSDRSTDSGKTE-PRR-LMQKPPPKESSPKHLRAVKGAMGRGKDIK 487 (518) T ss_pred cccccccCccccCCCC-Ccccccccccccccc-hhc-ccCCCCcccccchHHHHHHHHhhcCCcch Confidence 1111111111111111 111111222222322 222 25778899999999999999999999999 No 10 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=100.00 E-value=3.3e-88 Score=500.18 Aligned_cols=448 Identities=14% Similarity=0.142 Sum_probs=313.5 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) ||||++|+....-. ....+....+ .+.. +.... ...++.+ +....-+.| T Consensus 1 Mg~~~~l~~~~~~~-------------------~~~~~~~~~~-~~~~------~~~~~--~~~~~~~---g~~v~~~~a 49 (457) T protein:vir:62 1 MGFWSALFGRGHSP-------------------ALDAAEGRAW-EPYD------PSIYN--LGATASS---GERVTPHDA 49 (457) T ss_pred Cchhhhhhcccccc-------------------cccccccccc-ccch------hhhhh--ccccccC---CceechHHh Confidence 99999984311110 0000001111 0000 00000 0011111 122233567 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.+++..++.... ...+.+..++.+|++ ++ ++++||+.++ T Consensus 50 l~~~~v~~~i~~ia~~iA~lp~~~~~~~~~~~~~--------------~~~~~~~~ll~~pn~--~~---t~~~f~~~~~ 110 (457) T protein:vir:62 50 LQVSAVFASVRLLSETIATLPLSTYSKRGGTRKE--------------IDTPEWLDFPNAEPG--GM---GRIDILSQTV 110 (457) T ss_pred hccHHHHHHHHHHHHhHhhCceEEEEecCCcccc--------------ccchHHHHhccccCC--CC---CHHHHHHHHH Confidence 8899999999999999999998876554332111 112234445544433 33 5678999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEe-cCc--eeeeecccceEEEecccCCCc Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYI-DNK--VRGSFTADEMGMFIRNPRSDI 237 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~-~~~--~~~~~~~~evi~~~~n~~~~~ 237 (559) .+++++||+|++|.++ .|++++||||+|.+|++..+..+.......+.+.+. .+. ....|+++||||++.+.. T Consensus 111 ~~l~l~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~y~~~~~g~~~~~~~~~~~eiih~r~~~~--- 186 (457) T protein:vir:62 111 LSLLLQGNAFLAVRWA-GPNIAGLDVLDPTKIHVHMVMVDGLRRKVFEAYDIDADGNEVLLGWFTPRDVLHIPGMML--- 186 (457) T ss_pred HHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEeccCCccceeEEEEEEccCCceeEEEeeCccceEEecCCCC--- Confidence 9999999999998765 689999999999999987765543322222222222 222 234689999999985432 Q ss_pred cCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc Q lcl|NC_012530. 238 LSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI 317 (559) Q Consensus 238 ~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl 317 (559) ....+|+||+.+++.+|..+.++++|+.++|+||++|+|||+++ +.+++++++++++.|++.++|.+|+|+++|| T Consensus 187 ~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-----~~ls~e~~~~~~~~~~~~~~G~~nag~~~vl 261 (457) T protein:vir:62 187 PGDFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVP-----GTMSEEGLARAREAWRAANSGVDNAHRVALL 261 (457) T ss_pred CCceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcC-----CCCCHHHHHHHHHHHHHHhcCccccCcceec Confidence 23468999999999999999999999999999999999999886 4689999999999999999999999999888 Q ss_pred cCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHH Q lcl|NC_012530. 318 TAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDM 396 (559) Q Consensus 318 ~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ 396 (559) ++ +++|++++. ++|+||+|++++++++||++|||||++||+.+.++++ .+|++++.+.|+++||+||+++ T Consensus 262 ~~-g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~--------~sn~eq~~~~f~~~~l~P~~~~ 332 (457) T protein:vir:62 262 TE-GAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSW--------GSGLAEQNIAFTMFSLRPWLER 332 (457) T ss_pred CC-CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccc--------cchHHHHHHHHHHHHHHHHHHH Confidence 55 599999985 7999999999999999999999999999998876653 4689999999999999999999 Q ss_pred HHHHHHhhccccccCccc--eeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCC--CEeeccceecccc Q lcl|NC_012530. 397 IAKNLTNGIIRQILGDNY--MLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGG--DIILSAVYIQRLG 471 (559) Q Consensus 397 ie~~ln~~L~~~~~~~~~--~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gG--D~~~~~~~~~~l~ 471 (559) ||++||++|+++.+...+ +|+++.++++|.++|++++..+++ |+||+||+|+++||||++|| |++++|+|+.+++ T Consensus 333 ie~~ln~~L~~~~~~~~~~i~fd~~~l~~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~D~~~~~~n~~~~~ 412 (457) T protein:vir:62 333 IEAGFNRLLFAETADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLGEKYRVPLNLGEIG 412 (457) T ss_pred HHHHHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccc Confidence 999999999987765544 455568899999999999999886 56899999999999999988 9999999998776 Q ss_pred cccccccccccccccccccccccCCCCCCCCCCCCccccccchhccccccccccccccccc Q lcl|NC_012530. 472 QQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDN 532 (559) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 532 (559) ...+..... ...... ++..++...+ ..+..++...+++.+.+.+. T Consensus 413 ~~~~~~~~~-----~~~~~~----------~~~~~~~~~~-~~~~~~~~~d~~~~~~~~~~ 457 (457) T protein:vir:62 413 EEPEPEPAP-----APPAID----------PPAEEPADDE-EPDNAEGDPDEGETEDDDDA 457 (457) T ss_pred ccccccccC-----CCccCC----------CCccCCCCCC-CCCCCCCCCccccccccccC Confidence 543221110 000000 0000000000 00111112212222222111 No 11 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=100.00 E-value=1.4e-87 Score=496.75 Aligned_cols=427 Identities=15% Similarity=0.157 Sum_probs=320.5 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+|+||++.-|.-. +. + ..+.....+ .......+.+. .++ .+......+ T Consensus 1 M~~~~r~~~~~~~~----------~r-------~---~~~~~~~~~----~~~~~~~~~g~--~~~-----~~~v~~~~a 49 (432) T protein:vir:10 1 MKIVDSVKKFFNFE----------KR-------Q---TSQVIELNK----DDEKLLEWLGI--SPS-----TISVKGKNA 49 (432) T ss_pred CChHHHHHHhcCcc----------cc-------C---cccccccCC----chHHHHHHhCC--CcC-----ccccchhhh Confidence 99999984322100 00 0 001000000 00000111110 111 122234567 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +++++|++||++||++||++|+.+++..++...+ .. ++...+|++..||++++ +++|++.++ T Consensus 50 l~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~~~~--------------~~-~~~l~~lL~~~PN~~~t---~~~f~~~~~ 111 (432) T protein:vir:10 50 LKVATVFACIKILSESVSKLPLKIYQEDEYGIQR--------------GT-KHYLNNLLRLRPNPYMS---SMNFFGSLE 111 (432) T ss_pred hccHHHHHHHHHHHHhhccCceEEEEecCCceee--------------cc-ccHHHHHHHhhccCCCC---HHHHHHHHH Confidence 8899999999999999999998765443221100 01 12223444445666655 468999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) .+++++||+|++++|+..|+|++||||+|++|++..++.+........|+.+..++....|+++||||+++++ ..++ T Consensus 112 ~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~~~y~~~~~g~~~~~~~~eiih~r~~~---~~~~ 188 (432) T protein:vir:10 112 AQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKPEEILHFKNGI---TLDG 188 (432) T ss_pred HHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccceEEEEEecCCeEEEEccccEEEecCCC---CCCC Confidence 9999999999999999999999999999999999999888766666667777777777889999999998643 2356 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE 320 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g 320 (559) .+|+||+..++.+|..+.++++++.++|+||++|+|||+++ +.+++++.+++++.|++.++|..|+|+++||++ T Consensus 189 ~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-----~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~- 262 (432) T protein:vir:10 189 LVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV-----GDLNEDAKKVFRENFESMSSGLQNSHRIALMPV- 262 (432) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-----CCCCHHHHHHHHHHHHHHhcccccCCcceecCC- Confidence 78999999999999999999999999999999999999875 458899999999999999999999999988855 Q ss_pred ceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_012530. 321 DAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAK 399 (559) Q Consensus 321 ~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~ 399 (559) +++|++++. +.|+||+|++++++++||++|||||++||+.+.++ ++|++++.+.|++.||+||+++||+ T Consensus 263 g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~----------~s~~e~~~~~~~~~~l~P~~~~ie~ 332 (432) T protein:vir:10 263 GYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT----------LNNIEQQQQQFYTDTLQATLTMYEQ 332 (432) T ss_pred CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC----------cccHHHHHHHHHHHHHHHHHHHHHH Confidence 599999985 79999999999999999999999999999866543 5789999999999999999999999 Q ss_pred HHHhhccccccC-c--cceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccc Q lcl|NC_012530. 400 NLTNGIIRQILG-D--NYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQ 475 (559) Q Consensus 400 ~ln~~L~~~~~~-~--~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~ 475 (559) +||++||+..+. . .++|+++.+++.|.+++++++..++. |+||+||+|+++||||+||||+++++.|++++....+ T Consensus 333 ~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD~~~~~~n~~~~~~~~~ 412 (432) T protein:vir:10 333 EMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRLLVNGNMLPIDMAGQ 412 (432) T ss_pred HHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeecccccchhhccc Confidence 999999986543 2 34555568999999999999999986 5689999999999999999999999999988865432 Q ss_pred cccccccccccccccccccCCCCCCCCCCCCccccccchhccccccccccccccccc Q lcl|NC_012530. 476 IKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDN 532 (559) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 532 (559) .... .++ +..+...+..+. | T Consensus 413 ~~~k---------------~~~-~~~~~~~~~~~~---------------------~ 432 (432) T protein:vir:10 413 AYLK---------------GGD-TNGEVSKEGNEG---------------------N 432 (432) T ss_pred cccC---------------CCC-CCCCCCCCCCCC---------------------C Confidence 1100 000 000000000000 0 No 12 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=100.00 E-value=1.4e-87 Score=496.75 Aligned_cols=427 Identities=15% Similarity=0.157 Sum_probs=320.5 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+|+||++.-|.-. +. + ..+.....+ .......+.+. .++ .+......+ T Consensus 1 M~~~~r~~~~~~~~----------~r-------~---~~~~~~~~~----~~~~~~~~~g~--~~~-----~~~v~~~~a 49 (432) T protein:vir:10 1 MKIVDSVKKFFNFE----------KR-------Q---TSQVIELNK----DDEKLLEWLGI--SPS-----TISVKGKNA 49 (432) T ss_pred CChHHHHHHhcCcc----------cc-------C---cccccccCC----chHHHHHHhCC--CcC-----ccccchhhh Confidence 99999984322100 00 0 001000000 00000111110 111 122234567 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +++++|++||++||++||++|+.+++..++...+ .. ++...+|++..||++++ +++|++.++ T Consensus 50 l~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~~~~--------------~~-~~~l~~lL~~~PN~~~t---~~~f~~~~~ 111 (432) T protein:vir:10 50 LKVATVFACIKILSESVSKLPLKIYQEDEYGIQR--------------GT-KHYLNNLLRLRPNPYMS---SMNFFGSLE 111 (432) T ss_pred hccHHHHHHHHHHHHhhccCceEEEEecCCceee--------------cc-ccHHHHHHHhhccCCCC---HHHHHHHHH Confidence 8899999999999999999998765443221100 01 12223444445666655 468999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) .+++++||+|++++|+..|+|++||||+|++|++..++.+........|+.+..++....|+++||||+++++ ..++ T Consensus 112 ~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~~~y~~~~~g~~~~~~~~eiih~r~~~---~~~~ 188 (432) T protein:vir:10 112 AQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKPEEILHFKNGI---TLDG 188 (432) T ss_pred HHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccceEEEEEecCCeEEEEccccEEEecCCC---CCCC Confidence 9999999999999999999999999999999999999888766666667777777777889999999998643 2356 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE 320 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g 320 (559) .+|+||+..++.+|..+.++++++.++|+||++|+|||+++ +.+++++.+++++.|++.++|..|+|+++||++ T Consensus 189 ~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-----~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~- 262 (432) T protein:vir:10 189 LVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV-----GDLNEDAKKVFRENFESMSSGLQNSHRIALMPV- 262 (432) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-----CCCCHHHHHHHHHHHHHHhcccccCCcceecCC- Confidence 78999999999999999999999999999999999999875 458899999999999999999999999988855 Q ss_pred ceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_012530. 321 DAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAK 399 (559) Q Consensus 321 ~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~ 399 (559) +++|++++. +.|+||+|++++++++||++|||||++||+.+.++ ++|++++.+.|++.||+||+++||+ T Consensus 263 g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~----------~s~~e~~~~~~~~~~l~P~~~~ie~ 332 (432) T protein:vir:10 263 GYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT----------LNNIEQQQQQFYTDTLQATLTMYEQ 332 (432) T ss_pred CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC----------cccHHHHHHHHHHHHHHHHHHHHHH Confidence 599999985 79999999999999999999999999999866543 5789999999999999999999999 Q ss_pred HHHhhccccccC-c--cceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccc Q lcl|NC_012530. 400 NLTNGIIRQILG-D--NYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQ 475 (559) Q Consensus 400 ~ln~~L~~~~~~-~--~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~ 475 (559) +||++||+..+. . .++|+++.+++.|.+++++++..++. |+||+||+|+++||||+||||+++++.|++++....+ T Consensus 333 ~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD~~~~~~n~~~~~~~~~ 412 (432) T protein:vir:10 333 EMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRLLVNGNMLPIDMAGQ 412 (432) T ss_pred HHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeecccccchhhccc Confidence 999999986543 2 34555568999999999999999986 5689999999999999999999999999988865432 Q ss_pred cccccccccccccccccccCCCCCCCCCCCCccccccchhccccccccccccccccc Q lcl|NC_012530. 476 IKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDN 532 (559) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 532 (559) .... .++ +..+...+..+. | T Consensus 413 ~~~k---------------~~~-~~~~~~~~~~~~---------------------~ 432 (432) T protein:vir:10 413 AYLK---------------GGD-TNGEVSKEGNEG---------------------N 432 (432) T ss_pred cccC---------------CCC-CCCCCCCCCCCC---------------------C Confidence 1100 000 000000000000 0 No 13 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=100.00 E-value=1.4e-87 Score=496.75 Aligned_cols=427 Identities=15% Similarity=0.157 Sum_probs=320.5 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+|+||++.-|.-. +. + ..+.....+ .......+.+. .++ .+......+ T Consensus 1 M~~~~r~~~~~~~~----------~r-------~---~~~~~~~~~----~~~~~~~~~g~--~~~-----~~~v~~~~a 49 (432) T protein:vir:10 1 MKIVDSVKKFFNFE----------KR-------Q---TSQVIELNK----DDEKLLEWLGI--SPS-----TISVKGKNA 49 (432) T ss_pred CChHHHHHHhcCcc----------cc-------C---cccccccCC----chHHHHHHhCC--CcC-----ccccchhhh Confidence 99999984322100 00 0 001000000 00000111110 111 122234567 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +++++|++||++||++||++|+.+++..++...+ .. ++...+|++..||++++ +++|++.++ T Consensus 50 l~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~~~~--------------~~-~~~l~~lL~~~PN~~~t---~~~f~~~~~ 111 (432) T protein:vir:10 50 LKVATVFACIKILSESVSKLPLKIYQEDEYGIQR--------------GT-KHYLNNLLRLRPNPYMS---SMNFFGSLE 111 (432) T ss_pred hccHHHHHHHHHHHHhhccCceEEEEecCCceee--------------cc-ccHHHHHHHhhccCCCC---HHHHHHHHH Confidence 8899999999999999999998765443221100 01 12223444445666655 468999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) .+++++||+|++++|+..|+|++||||+|++|++..++.+........|+.+..++....|+++||||+++++ ..++ T Consensus 112 ~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~~~y~~~~~g~~~~~~~~eiih~r~~~---~~~~ 188 (432) T protein:vir:10 112 AQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKPEEILHFKNGI---TLDG 188 (432) T ss_pred HHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccceEEEEEecCCeEEEEccccEEEecCCC---CCCC Confidence 9999999999999999999999999999999999999888766666667777777777889999999998643 2356 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE 320 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g 320 (559) .+|+||+..++.+|..+.++++++.++|+||++|+|||+++ +.+++++.+++++.|++.++|..|+|+++||++ T Consensus 189 ~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-----~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~- 262 (432) T protein:vir:10 189 LVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV-----GDLNEDAKKVFRENFESMSSGLQNSHRIALMPV- 262 (432) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-----CCCCHHHHHHHHHHHHHHhcccccCCcceecCC- Confidence 78999999999999999999999999999999999999875 458899999999999999999999999988855 Q ss_pred ceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_012530. 321 DAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAK 399 (559) Q Consensus 321 ~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~ 399 (559) +++|++++. +.|+||+|++++++++||++|||||++||+.+.++ ++|++++.+.|++.||+||+++||+ T Consensus 263 g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~----------~s~~e~~~~~~~~~~l~P~~~~ie~ 332 (432) T protein:vir:10 263 GYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT----------LNNIEQQQQQFYTDTLQATLTMYEQ 332 (432) T ss_pred CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC----------cccHHHHHHHHHHHHHHHHHHHHHH Confidence 599999985 79999999999999999999999999999866543 5789999999999999999999999 Q ss_pred HHHhhccccccC-c--cceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccc Q lcl|NC_012530. 400 NLTNGIIRQILG-D--NYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQ 475 (559) Q Consensus 400 ~ln~~L~~~~~~-~--~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~ 475 (559) +||++||+..+. . .++|+++.+++.|.+++++++..++. |+||+||+|+++||||+||||+++++.|++++....+ T Consensus 333 ~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD~~~~~~n~~~~~~~~~ 412 (432) T protein:vir:10 333 EMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRLLVNGNMLPIDMAGQ 412 (432) T ss_pred HHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeecccccchhhccc Confidence 999999986543 2 34555568999999999999999986 5689999999999999999999999999988865432 Q ss_pred cccccccccccccccccccCCCCCCCCCCCCccccccchhccccccccccccccccc Q lcl|NC_012530. 476 IKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDN 532 (559) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 532 (559) .... .++ +..+...+..+. | T Consensus 413 ~~~k---------------~~~-~~~~~~~~~~~~---------------------~ 432 (432) T protein:vir:10 413 AYLK---------------GGD-TNGEVSKEGNEG---------------------N 432 (432) T ss_pred cccC---------------CCC-CCCCCCCCCCCC---------------------C Confidence 1100 000 000000000000 0 No 14 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=100.00 E-value=3.5e-88 Score=500.03 Aligned_cols=420 Identities=13% Similarity=0.081 Sum_probs=307.6 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccc-cccccccccccccccccccccccCCCCCcccHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDR-AYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQ 79 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~-a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~ 79 (559) |+|||+||..-... .... ..++.+........+..|+.- +...|.+.. +....+ .. ........ T Consensus 1 Mgl~d~~r~~~~~~-~~~~-~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~----------~~~~~~-~~--g~~v~~~~ 65 (431) T protein:vir:10 1 MGLFDFIRREKQPE-AQAR-PHVEPSFQASTPTTSIPGETFEGLDDPRLKE----------YIRRGE-LN--GGTGRETR 65 (431) T ss_pred CcchhhhhcCcccc-cccc-cccccccccccccccccccccccccchHHHH----------hhccCc-cC--cceechhh Confidence 99999997532111 0111 011111000000111111100 000000000 000000 01 11223467 Q ss_pred HhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHH Q lcl|NC_012530. 80 YSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKL 159 (559) Q Consensus 80 ~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~ 159 (559) |+++++|++||++||+.||++|+.+++..++. +. ..++....|++..||+++++ ++|++.+ T Consensus 66 al~~~~V~~ci~~Ia~~iA~lp~~v~~~~~~~--~~--------------~~~~~~~~lL~~~PN~~~t~---~~f~~~l 126 (431) T protein:vir:10 66 ALRNMAVLRCVTLISGTIGMLPMNLISSDDSK--QV--------------LTDDPAHRLLKYKPNDWQTP---MEFKSLM 126 (431) T ss_pred hhccHHHHHHHHHHHHhhccCceEEEEecCce--ee--------------eccchHHHHHhhccCCCCCH---HHHHHHH Confidence 78999999999999999999998776543221 11 11122234444456666654 6799999 Q ss_pred HHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccC Q lcl|NC_012530. 160 VRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILS 239 (559) Q Consensus 160 v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~ 239 (559) +.+++++||+|++|+|+. |.+++||||+|.+|++..+.+|.. .|+....++....++++||||++... .+ T Consensus 127 ~~~lll~Gna~~~i~r~~-g~~~~L~pl~~~~v~~~~~~~~~~-----~y~~~~~~g~~~~~~~~dViHir~~~----~d 196 (431) T protein:vir:10 127 QLRALLDGESMARIVWSG-NRPIRLIPMDRGSAKGRLTSTWQI-----VYDYTTPTGDKIELPAREVFHLRDLS----ID 196 (431) T ss_pred HHHHhhcCCeEEEEEEcC-CceEEEEEEcCceeEEEEcCCCeE-----EEEEEeCCceEEEEchhhEEEecCcC----CC Confidence 999999999999999985 899999999999999988877643 34444445566789999999997532 24 Q ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccC Q lcl|NC_012530. 240 GGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITA 319 (559) Q Consensus 240 ~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~ 319 (559) +.+|+||+..+..+|.++.++++|+.++|+||++|+|||+++ +.+++++++++++.|++.++|.+|+|+++||++ T Consensus 197 g~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-----~~ls~e~~~~~~~~~~~~~~g~~n~g~~~vl~~ 271 (431) T protein:vir:10 197 GVSGVSRVKLSGNALELAEQAERAASRTFRTGVMAGGAIEVP-----KELSDNAYGRMKASVQENHTGSENAGSWMLLEE 271 (431) T ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecC-----CCCCHHHHHHHHHHHHHHhcCccccCCceecCC Confidence 678999999999999999999999999999999999999886 458999999999999999999999999988855 Q ss_pred Cceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHH Q lcl|NC_012530. 320 EDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIA 398 (559) Q Consensus 320 g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie 398 (559) +++|++++. +.|+||+|++++++++||++|||||++||+.+.+ +++|++++...|+++||.||+++|| T Consensus 272 -g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~----------t~sn~eq~~~~f~~~tL~P~~~~ie 340 (431) T protein:vir:10 272 -GATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDTS----------WGSGIEQLAIFFIQYGLSHWFVSWE 340 (431) T ss_pred -CceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCC----------ccccHHHHHHHHHHHHHHHHHHHHH Confidence 599999995 7999999999999999999999999999986543 3578999999999999999999999 Q ss_pred HHHHhhccccccCcc--ceeeecchhhhhHHHHHHHHHHHHc-----CCCCHHHHHHHhCCCCCCC--CCEeeccceecc Q lcl|NC_012530. 399 KNLTNGIIRQILGDN--YMLEFVGGDTRSQQDKLKSVQLELQ-----TATTVNDYREKQGLPKIAG--GDIILSAVYIQR 469 (559) Q Consensus 399 ~~ln~~L~~~~~~~~--~~~~f~~l~~~d~~~~~~~~~~~~~-----~~~T~NE~R~~~gl~pi~g--GD~~~~~~~~~~ 469 (559) ++||++||++.+... ++|+++.+++.|.+++++++..++. ++||+||+|+++||||++| ||++++|.|++. T Consensus 341 ~~ln~~Ll~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~~g~lT~NE~R~~~gl~p~~~~~gD~~~~p~n~~~ 420 (431) T protein:vir:10 341 QAAARAFLPEKMLGQRQFKFNEGALLRGTLNDQAAFFSKALGAGGQSPWMKQNEVREMLDLPRADDPVADQLRNPMTQKQ 420 (431) T ss_pred HHHHhhccChhhcCCceEEEechhhhccCHHHHHHHHHHHHhcccccCccCHHHHHHHhCCCCCCCccccceeccccccc Confidence 999999998655444 4555567899999999999988763 3589999999999999965 999999887754 Q ss_pred cccccccccccccccccccccccccC Q lcl|NC_012530. 470 LGQQEQIKQNEFQRQQTRLTQLESAL 495 (559) Q Consensus 470 l~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (559) .+...+. .+. + T Consensus 421 ~~~~~~~---------p~~------~ 431 (431) T protein:vir:10 421 KGSGDEP---------PAT------T 431 (431) T ss_pred CCCCCCC---------CCC------C Confidence 3221110 000 0 No 15 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=100.00 E-value=2.8e-87 Score=495.10 Aligned_cols=447 Identities=13% Similarity=0.165 Sum_probs=310.5 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+||+||+..+..- ......+|. . .|..... +.. ..++.+ +....-+.| T Consensus 1 Mg~~~~l~~r~~~~-----------------~~~~~~~~~--~-~~~~~~~-~~~-------~~~~~~---g~~V~~~~a 49 (457) T protein:vir:13 1 MGFWSALFGRGHSP-----------------ALDGIEARA--W-EPYDPSI-YNL-------GAVAAS---GETVTPHDA 49 (457) T ss_pred Cchhhhhhcccccc-----------------ccccccccc--c-cccchHH-Hhh-------cccccC---CceechHHh Confidence 99999983321110 001111111 1 1111000 000 001111 222334678 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +++++|++||++||++||++|+.+++..++...++ ..+.+ +..++.. ++.+++++|++.++ T Consensus 50 l~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~--------------~~~~l---~~~ln~~--~n~~t~~~f~~~~~ 110 (457) T protein:vir:13 50 LQVSAVFASVRLLSETIATLPLSTYSKRGGSRKEI--------------VTPEW---LDYPNAE--PGGMGRIDILSQTV 110 (457) T ss_pred hccHHHHHHHHHHHHhhccCceEEEEecCCccccc--------------ccchH---HHhcccc--CCCCCHHHHHHHHH Confidence 89999999999999999999988766543321111 11122 2222221 22245689999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEe-cCce--eeeecccceEEEecccCCCc Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYI-DNKV--RGSFTADEMGMFIRNPRSDI 237 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~-~~~~--~~~~~~~evi~~~~n~~~~~ 237 (559) .+++++||+|++|+++ .|+|++||||+|.+|++..+..+.........+++. .+.. ...|+++||||++.+.. T Consensus 111 ~~lll~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~diih~~~~~~--- 186 (457) T protein:vir:13 111 LSLLLQGNAFLAVRWQ-GPNIVGLDVLDPTKIHVHMVMVDGLRRKVFEAYDIDADGNEVLLGWFTPRDVLHIPGMML--- 186 (457) T ss_pred HHHhhcCCeEEEEEec-CCcEEEEEEEccCceEEEEecCCCccceeEEEEEEecCCceeeEEeeCccceEEecCCCC--- Confidence 9999999999999876 599999999999999998766553322222222222 2222 24688999999986432 Q ss_pred cCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc Q lcl|NC_012530. 238 LSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI 317 (559) Q Consensus 238 ~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl 317 (559) ....+|+||+..++.+|.++.++++|+.++|+||++|+|||+++ +.+++++++++++.|++.++|.+|+|+++|| T Consensus 187 ~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-----~~ls~e~~~~~~~~~~~~~~g~~nag~~~vl 261 (457) T protein:vir:13 187 PGDFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVP-----GTMSEEGLARAREAWRAANSGVDNAHRVALL 261 (457) T ss_pred CCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcC-----CCCCHHHHHHHHHHHHHHhcCccccCcceec Confidence 23468999999999999999999999999999999999999886 4589999999999999999999999999888 Q ss_pred cCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHH Q lcl|NC_012530. 318 TAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDM 396 (559) Q Consensus 318 ~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ 396 (559) ++ +++|++++. +.|+||+|++++++++||++|||||++||+.+.+++ ..+|++++...|+.+||.||+++ T Consensus 262 ~~-g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~--------~~sn~eq~~~~f~~~tl~P~~~~ 332 (457) T protein:vir:13 262 TE-GAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTS--------WGSGLAEQNIAFTMFSLRPWLER 332 (457) T ss_pred CC-CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCccc--------ccchHHHHHHHHHHHHHHHHHHH Confidence 55 599999985 799999999999999999999999999998887654 34689999999999999999999 Q ss_pred HHHHHHhhccccccCccc--eeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCC--CEeeccceecccc Q lcl|NC_012530. 397 IAKNLTNGIIRQILGDNY--MLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGG--DIILSAVYIQRLG 471 (559) Q Consensus 397 ie~~ln~~L~~~~~~~~~--~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gG--D~~~~~~~~~~l~ 471 (559) ||++||++|+++.+...+ +|+++.+++.|.+++++++..+++ |+||+||+|+++||||++|| |++++|+|+.+++ T Consensus 333 ie~~ln~~L~~~~~~~~~~i~fd~~~l~~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~Pi~~g~~d~~~~~~n~~~~~ 412 (457) T protein:vir:13 333 IEAGFNRLLFAETADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLGEKYRVPLNLGEVG 412 (457) T ss_pred HHHHHHHhhcCccccCceeEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccceeecccccccc Confidence 999999999987765544 455568899999999999998886 56899999999999999987 9999999998876 Q ss_pred cccccccccccccccccccccccCCCCC-CCCCCCCccccccchhccccccccccccccccccccc Q lcl|NC_012530. 472 QQEQIKQNEFQRQQTRLTQLESALQNPS-GTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGV 536 (559) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 536 (559) .......... +.+ ...+...+. ++++.+.++ .+...+++ .++.+ T Consensus 413 ~~~~~~~~~~---~~~---~~~~~~~~~~~~~~~g~~d-~~~~~~~~--------------~~~~~ 457 (457) T protein:vir:13 413 EEPEPEPAPA---PPA---IEPPAEEPDEEPEPEGKPD-DEGATEED--------------DEDDA 457 (457) T ss_pred ccccccccCC---CCC---CCCCccccCCCCCCCCCCc-cccCCCCc--------------ccccC Confidence 5432211110 000 000000000 001100000 00000000 00000 No 16 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=100.00 E-value=2.5e-87 Score=495.32 Aligned_cols=447 Identities=14% Similarity=0.141 Sum_probs=308.0 Q ss_pred HHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHH Q lcl|NC_012530. 19 FFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVT 98 (559) Q Consensus 19 ~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia 98 (559) +|..+.+ ...+...++. . ....+...+.. +...-.+.+..+.....+.|+++++|++||++||++|| T Consensus 1 ~~~~~~~-----~~~~~~~~~~------~-~~~~~~~~~~~-~~~~~~g~~~~g~~v~~~~al~~~~V~~~v~~Ia~~iA 67 (454) T protein:vir:93 1 MWNLLRR-----TRKNQKSGRD------V-REAGWTSLFQA-VAEPFAGAWQQGVKADPEAVLSFHAVFACISLISQDIA 67 (454) T ss_pred CCCcccc-----Cccccccccc------c-cchhhhhhhhh-hhhhhcchhhcCcccChHHhhccHHHHHHHHHHHHhhc Confidence 1111111 0011111111 0 01111111000 00000112223344556778999999999999999999 Q ss_pred hhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCC Q lcl|NC_012530. 99 EYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSN 178 (559) Q Consensus 99 ~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~ 178 (559) ++|+.+++.... +. ......+.+..++. .||++++ +++||+.++.+++++||+|++++|+.+ T Consensus 68 ~lp~~~~~~~~~-g~------------~~~~~~~~~~~L~~--~PN~~~t---~~~f~~~l~~~lll~Gna~~~i~r~~~ 129 (454) T protein:vir:93 68 KMRLRLMQTDAQ-GI------------RRETRRGDIARLCR--RPNAQQN---RIQFFELWLNAKLRHGNTVVLKIRNAR 129 (454) T ss_pred cCceEEEEeccC-Cc------------cchhhhHHHHHHHh--cCCCCCC---HHHHHHHHHHHHhhcCceEEEEEECCC Confidence 999876543221 10 01112223334444 4555554 468999999999999999999999999 Q ss_pred CcEEEEEEecCceEEEEecCcccccccceEEEEEecC----ceeeeecccceEEEecccCCCccCCcccccHHHHHHHHH Q lcl|NC_012530. 179 GRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDN----KVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREF 254 (559) Q Consensus 179 G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~----~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i 254 (559) |+|++||||+|++|++..+.+|... |...... .....++++||||+++++ ..++.||+||+..+..+| T Consensus 130 G~~~~L~~i~~~~v~v~~~~~g~~~-----y~~~~~~~~~~~~~~~~~~~eViH~k~~~---~~~~~~G~sp~~~~~~~i 201 (454) T protein:vir:93 130 GQIKELRILDWNRVEPLVADDGEVF-----YRITPDRNCGITEAVTVPAREVIHDRFNC---FFHPLIGLPPVYAAGLAA 201 (454) T ss_pred CcEEEEEEEcCcceEEEEcCCCcEE-----EEEEeccccccceeEEecCcceEEeccCC---CCCCceeccHHHHHHHHH Confidence 9999999999999999998877542 2222111 234579999999998643 335678999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccc-cchh Q lcl|NC_012530. 255 ISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQ-AEDM 333 (559) Q Consensus 255 ~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~-~~D~ 333 (559) .++.++++|+.++|+||++|+|||+++ +.+++++++++++.|++.++| .|+|+++||++ +++|++++. +.|+ T Consensus 202 ~~~~~~~~~~~~~f~ng~~p~gil~~~-----~~l~~e~~~~~~~~~~~~~~g-~n~g~~~vl~~-g~~~~~l~~~~~d~ 274 (454) T protein:vir:93 202 TQGHHIQENSTSFFRNGGRPSGVIEIP-----GSITEENAKKLKSNWDSGYTG-ENAGKTAILSN-GAKYNPTTFSPVDS 274 (454) T ss_pred HHHHHHHHHHHHHHhccCCccEEEecC-----CCCCHHHHHHHHHHHHHHhcc-cccCCceeccC-CceEEEcccChhHH Confidence 999999999999999999999999885 358899999999999999988 68999988855 599999985 7999 Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCcc Q lcl|NC_012530. 334 QFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDN 413 (559) Q Consensus 334 qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~ 413 (559) ||+|++++++++||++|||||++||+.+.+ +++|++++.+.|+++||.||+.+||++||++|++.. +.. T Consensus 275 q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~----------t~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~L~~~~-~~~ 343 (454) T protein:vir:93 275 QTVEQLKMTAEIVCSVFRVPAYKIGVGQPP----------SSDNVEALEQQYYSQCLQTLIESIELLLDEALETGE-NES 343 (454) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHcCCCCCC----------cchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC-CcE Confidence 999999999999999999999999986654 357899999999999999999999999999998754 446 Q ss_pred ceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceeccccccccccccccccccccccccc Q lcl|NC_012530. 414 YMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLE 492 (559) Q Consensus 414 ~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 492 (559) ++|+++.+++.|.+++++++..+++ |+||+||+|+++||||+||||+++++.+..+++.+.+....+. .....+.+ T Consensus 344 ~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~- 420 (454) T protein:vir:93 344 TEFDVTTLLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGGDALYLQQQNYSLEALSRRDARED--PFASSGKT- 420 (454) T ss_pred EEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeeccCccchHhhhccCcccC--CCCCCccC- Confidence 7777788999999999999998886 5689999999999999999999999988877765433211110 00000000 Q ss_pred ccCCCCCC-CCCCCCccccccchhccccccccccccccccccccccc Q lcl|NC_012530. 493 SALQNPSG-TPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGK 538 (559) Q Consensus 493 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 538 (559) ...++. ++.+......+.+.+. ..+.-..-.|| T Consensus 421 --~~~~~~~~~~d~~~~~~e~~~d~-----------~~~~~~~~~~~ 454 (454) T protein:vir:93 421 --ASVPQAVAASDGNKAITETEHDA-----------VKAMFRGILKK 454 (454) T ss_pred --CCCCCCCCCCCCCCCccCCccch-----------hhhhhhhhhcC Confidence 000000 0000000000000000 00000111122 No 17 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=100.00 E-value=1.8e-87 Score=496.12 Aligned_cols=416 Identities=15% Similarity=0.137 Sum_probs=316.0 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+|||||....... + ..+. ......++. .....++..+...+. ....-+.+ T Consensus 1 MG~f~~lf~~~~~~---~---------~~~~--------~~~~~~~~~---~~~~~~~~~~g~~~~------~~v~~~~a 51 (422) T protein:vir:13 1 MGFLRGLFNKKNNN---D---------EKRS--------NYDEDIGID---ISDSNFWEKFGIKLN------FSVRGKRA 51 (422) T ss_pred CchhhhhhhccCCc---c---------chhh--------hhhhccccc---cCcchhhhhccccCC------cccchhhh Confidence 99999982211100 0 0000 000000000 001111112211211 12334567 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.+++.... . +.+.+..+ ++..||++++ +++|++.++ T Consensus 52 l~~~~v~~ci~~ia~~iA~lp~~~~~~~~~------------~------~~~~~~~l-L~~~PN~~~t---~~~f~~~~~ 109 (422) T protein:vir:13 52 LKENTVYVCTKIRAESIGKLSLKIYKDKEE------------Y------KEHELYYL-LRYKPNPLMS---SINFWKCLE 109 (422) T ss_pred hccHHHHHHHHHHHHhhhhCceEEEecCcc------------c------ccchHHHH-HhhhcccCCC---HHHHHHHHH Confidence 889999999999999999999876543211 0 11223333 3445565654 478999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceE-EEEEecCceeeeecccceEEEecccCCCccC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKI-YRQYIDNKVRGSFTADEMGMFIRNPRSDILS 239 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~-y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~ 239 (559) .+++++||+|++|+|+.+|+|++||||+|++|++..+++|.....+.. |+....++....+.++||||++.++ ..+ T Consensus 110 ~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~~~~~~~~~~~~~~~~~y~~~~~~g~~~~~~~~eiih~~~~~---~~~ 186 (422) T protein:vir:13 110 TQRTLKGNAYAYIERDRKGKIIGLYPINSDNVTKIIDDDNFLSSLSKVWYVVTDKNGKEHKLLPDEMLHFIGDI---TLD 186 (422) T ss_pred HHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEcCCcceeccceEEEEEEeCCCeEEEEcccceEEEcCCC---CCC Confidence 999999999999999999999999999999999999998866544444 4444455666789999999998653 335 Q ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccC Q lcl|NC_012530. 240 GGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITA 319 (559) Q Consensus 240 ~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~ 319 (559) +.||+||+..+..+|.++.++++++.++|+||++|+|||+++ +.+++++.+++++.|++.++|.+|+|+++||++ T Consensus 187 ~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-----~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~ 261 (422) T protein:vir:13 187 GLIGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYV-----GDLDEKAKKIFKKEFESMSNGLENAHSISLLPF 261 (422) T ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeC-----CCCCHHHHHHHHHHHHHHhcCccccCCceecCC Confidence 678999999999999999999999999999999999999886 358899999999999999999999999988855 Q ss_pred Cceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHH Q lcl|NC_012530. 320 EDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIA 398 (559) Q Consensus 320 g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie 398 (559) +++|++++. +.|+||+|++++++++||++|||||++||..+.+ +++|++++...|++.||.||+++|| T Consensus 262 -g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~----------~~sn~e~~~~~f~~~~l~P~~~~ie 330 (422) T protein:vir:13 262 -GYQFQPISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERA----------TFNNLTEQQKDFYVTTLQSSLTVYE 330 (422) T ss_pred -CceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC----------CcccHHHHHHHHHHHHHHHHHHHHH Confidence 599999985 7999999999999999999999999999987654 3568999999999999999999999 Q ss_pred HHHHhhccccccC---ccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceeccccccc Q lcl|NC_012530. 399 KNLTNGIIRQILG---DNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQE 474 (559) Q Consensus 399 ~~ln~~L~~~~~~---~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~ 474 (559) ++||++|+++.+. ..++|+++.+++.|.++++++++.++. |+||+||+|+++||||+||||+++++.|++++.... T Consensus 331 ~~l~~~Ll~~~~~~~g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~l~~~~ 410 (422) T protein:vir:13 331 QEIQDKLFSQYETLQDVKAEFNVDTILRSDIKTRYEAYRIGIQGGFIEANEARRRENLPPVEGGDRLLVNGNMIPIEMAG 410 (422) T ss_pred HHHHHhhCChhhhcCCceEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccCccchhhcc Confidence 9999999987653 235555668999999999999999986 568999999999999999999999999999886543 Q ss_pred ccccccccccccccccccccCCCCCCCC Q lcl|NC_012530. 475 QIKQNEFQRQQTRLTQLESALQNPSGTP 502 (559) Q Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (559) +.... .+ ....+ T Consensus 411 ~~~~~---------------~g-~~~g~ 422 (422) T protein:vir:13 411 EQYKK---------------GG-EKGGK 422 (422) T ss_pred ccccc---------------CC-CcCCC Confidence 21110 00 00000 No 18 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=100.00 E-value=7.2e-87 Score=492.83 Aligned_cols=424 Identities=16% Similarity=0.167 Sum_probs=318.9 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |++|.|| |.- ..|..+..........+ ...+.+. .++ .+....+.+ T Consensus 1 M~~~~~~---f~~-----------------------~~r~~~~~~~~~~~~~~-~~~~~g~--~~~-----~~~v~~~~a 46 (429) T protein:vir:10 1 MDSVKKF---FNF-----------------------EKRQTSQVIELNKDDEK-LLEWLGI--SPS-----TISVKGKNA 46 (429) T ss_pred Cchhhhh---hcc-----------------------cccCcccccccCCChHH-HHHHhcC--CCC-----cceechhhh Confidence 9999999 310 00111011111111110 0111110 111 111233567 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +++++|++||++||++||++|+.+++..++. .+. ..++...+|++..||++++ +++|++.++ T Consensus 47 l~~~~v~~~i~~ia~~ia~l~~~~~~~~~~~-~~~--------------~~~~~l~~lL~~~PN~~~t---~~~f~~~~~ 108 (429) T protein:vir:10 47 LKVATVFACIKILSESVSKLPLKIYQEDEYG-IQR--------------GTKHYLNNLLRLRPNPYMS---SMNFFGSLE 108 (429) T ss_pred hccHHHHHHHHHHHHhhccCceEEEEecCCc-eee--------------ccccHHHHHHHhhccCCCC---HHHHHHHHH Confidence 8899999999999999999998765543221 110 1112223444445565554 468999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) .+++++||+|++++||..|+|++||||+|++|++..++.+........|+.+..++....|+++||||++++. ..++ T Consensus 109 ~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~evih~~~~~---~~~~ 185 (429) T protein:vir:10 109 AQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKPEEILHFKNGI---TLDG 185 (429) T ss_pred HHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccceEEEEEccCCeEEEEccccEEEecCCC---CCCC Confidence 9999999999999999999999999999999999999888766656666666677777889999999998643 2356 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE 320 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g 320 (559) .+|+||+..++.+|..+.++++++.++|+||++|+|+|+++ +.+++++.+++++.|++.++|..|+|+++||++ T Consensus 186 ~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~-----~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~- 259 (429) T protein:vir:10 186 LVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV-----GDLNEDAKKVFRENFESMSSGLQNSHRIALMPV- 259 (429) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-----CCCCHHHHHHHHHHHHHHhccccccCceeecCC- Confidence 78999999999999999999999999999999999999875 358899999999999999999999999988855 Q ss_pred ceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_012530. 321 DAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAK 399 (559) Q Consensus 321 ~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~ 399 (559) +++|++++. +.|+||+|++++++++||++|||||++||+.+.+ +++|++++.+.|++.||+||++.||+ T Consensus 260 g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~----------~~sn~e~~~~~f~~~~l~P~~~~ie~ 329 (429) T protein:vir:10 260 GYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKA----------TLNNIEQQQQQFYTDTLQATLTMYEQ 329 (429) T ss_pred CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC----------CcccHHHHHHHHHHHHHHHHHHHHHH Confidence 599999985 7999999999999999999999999999986654 35789999999999999999999999 Q ss_pred HHHhhccccccC-cc--ceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccc Q lcl|NC_012530. 400 NLTNGIIRQILG-DN--YMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQ 475 (559) Q Consensus 400 ~ln~~L~~~~~~-~~--~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~ 475 (559) +||++||++.+. .. ++|+++.+++.|.+++++++..++. |+||+||+|+++||||+||||+++++.|++++....+ T Consensus 330 ~ln~kl~~~~~~~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~~d~~~~ 409 (429) T protein:vir:10 330 EMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRLLVNGNMLPIDMAGQ 409 (429) T ss_pred HHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccchhhccc Confidence 999999986543 23 4555568899999999999999986 5699999999999999999999999999988865432 Q ss_pred cccccccccccccccccccCCCCCCCCCCCCccccc Q lcl|NC_012530. 476 IKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSN 511 (559) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (559) ... . .++ +..+.+.+..+.. T Consensus 410 ~~~-k--------------~g~-~~~~~~~~~~e~~ 429 (429) T protein:vir:10 410 AYL-K--------------GGD-TNGEVSKEGNEGN 429 (429) T ss_pred ccc-C--------------CCC-CCCCCCCCCCCCC Confidence 110 0 000 0000000000000 No 19 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=100.00 E-value=2.3e-86 Score=490.11 Aligned_cols=417 Identities=16% Similarity=0.149 Sum_probs=307.6 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |-+...|+ +++++. .+...|.... .+....++.+ +....-+.| T Consensus 1 m~~~~~~~-----------------------------~~~~~~----s~~~~w~~~~-~~~~~~~~~~---g~~vt~~~a 43 (421) T protein:vir:10 1 MFIPQMFE-----------------------------GKKRSV----SGGGFWEAML-GGVRSSHSKA---GVMITPETA 43 (421) T ss_pred CCCcchhc-----------------------------cccccc----CcchhhHHHh-hhhccCcccC---CceechHHh Confidence 33333331 111111 1111111110 1111122221 233345678 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +++++|++||++||++||++|+.+++..+..+.+. . ..+.+ .+|++..||++++ +++||+.++ T Consensus 44 l~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~g~~~~-------~------~~~~l-~~lL~~~PN~~~t---~~~f~~~~~ 106 (421) T protein:vir:10 44 LALSAVRACVTLLAESVAQLPVELYRRDKNGGRQR-------A------TDHPI-YDLIHSQPNKKDT---SFEYFEQQQ 106 (421) T ss_pred hccHHHHHHHHHHHHhhccCceEEEEEcCCCceee-------c------ccchH-HHHHhhcccCCCC---HHHHHHHHH Confidence 99999999999999999999987654332211111 0 11122 3344444555554 468999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) .+++++||+|++|+|+.+|+|++||||+|++|++..+.+|.. |+++...+ ..++++||||++.++ .++ T Consensus 107 ~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~g~~------~y~~~~~g--~~~~~~eiih~~~~~----~d~ 174 (421) T protein:vir:10 107 GLLGLEGNCYSIIDRDGKGYPKELIPINPKKVIVLKGPDGMP------YYEIPEIG--ETLPMRMMHHVKVFS----LDG 174 (421) T ss_pred HHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEECCCceE------EEEEcCCC--cEEchhhEEEecCcC----CCC Confidence 999999999999999999999999999999999998877643 33333332 368899999998654 346 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE 320 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g 320 (559) .||+||+..++.+|..+.++++|+.++|+||++|+|||+++... ++.+++++++++++.|++.++|.+|+|+++||++ T Consensus 175 ~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~-~~~~~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~- 252 (421) T protein:vir:10 175 YIGSSPIQTNADVLGLNLAVEEHASAVFRRGATMSGVIERPKEA-PAIKSQEKIDQLLAKWTDRYSGINNMFSVALLQE- 252 (421) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCcc-CccCCHHHHHHHHHHHHHHhcCccccCcceecCC- Confidence 78999999999999999999999999999999999999987643 3457999999999999999999999999888855 Q ss_pred ceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_012530. 321 DAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAK 399 (559) Q Consensus 321 ~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~ 399 (559) +++|++++. ++|+||+|++++++++||++|||||++||+.+.++ ++|++++...|+++||.||+.+||+ T Consensus 253 g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t----------~sn~e~~~~~f~~~tl~P~~~~ie~ 322 (421) T protein:vir:10 253 GMSYKQMSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKAT----------NNNIEHQGLQFVMYTLLAWLKRHEG 322 (421) T ss_pred CceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCc----------cccHHHHHHHHHHHHHHHHHHHHHH Confidence 599999985 69999999999999999999999999999877553 5789999999999999999999999 Q ss_pred HHHhhccccccCccc--eeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceeccccccccc Q lcl|NC_012530. 400 NLTNGIIRQILGDNY--MLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQI 476 (559) Q Consensus 400 ~ln~~L~~~~~~~~~--~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~ 476 (559) +||++|+++.+...+ +|+++.++++|.+++++++..+++ |+||+||+|+++|+||+||||++++|+++..++..... T Consensus 323 ~ln~kL~~~~~~~~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~ 402 (421) T protein:vir:10 323 ALQRDLLLPSERRDLYIEFNVSGLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLPPIAGGDKYLTPLNMVDSAQIIPG 402 (421) T ss_pred HHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccccccccccccC Confidence 999999987665554 455567899999999999999886 56899999999999999999999999888765443211 Q ss_pred ccccccccccccccccccCCCCCCCCCCCCccccccchhcccccc Q lcl|NC_012530. 477 KQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYT 521 (559) Q Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 521 (559) ...+. +.+ ..+. ++...+| T Consensus 403 ~~~~~----------------~~~-~~e~---------d~~~~~~ 421 (421) T protein:vir:10 403 DKKPT----------------AQQ-MAEI---------DTILSRT 421 (421) T ss_pred CCCcc----------------ccc-Cccc---------ccccccC Confidence 10000 000 0000 1111111 No 20 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=100.00 E-value=4.3e-86 Score=488.60 Aligned_cols=430 Identities=12% Similarity=0.069 Sum_probs=303.4 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHH--hhhhccccccccccccccccccccccccccccCCCCCcccHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTA--SKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLR 78 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~ 78 (559) ||. ..+.++.+.. ..+-.|. |+..+. +.++..+...++. .......+ T Consensus 1 ~~~-------------------~~~~~~~~~~~~~~~~~g~------~~s~~~---~~~~~~~~~~~~~---~g~~v~~~ 49 (437) T protein:vir:10 1 MKQ-------------------GKQRALGRIKSSFLKWLGV------PISLTD---GSFWSAWGGMGSS---SGETVTAD 49 (437) T ss_pred CCc-------------------chhhhhhhhHHhhhhhcCC------cccCCc---hhHHHhhcccccC---CCceechH Confidence 321 1111111110 0111122 211111 0011111111111 12223346 Q ss_pred HHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHH Q lcl|NC_012530. 79 QYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRK 158 (559) Q Consensus 79 ~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~ 158 (559) .|+.+++|++||++||++||++|+.+++...... +.. ...+.+. .|++..||++++ +++|++. T Consensus 50 ~al~~~~v~~ci~~Ia~~ia~lp~~~~~~~~~g~-~~~------------~~~~~l~-~lL~~~PN~~~t---~~~f~~~ 112 (437) T protein:vir:10 50 SALQLSAVWSCVRLIAETIATLPLNLYQTKPDGT-RVL------------AKQHRLY-TVIHSQPNAENT---AAEFWEV 112 (437) T ss_pred hhhccHHHHHHHHHHHHHHhhCceeEEEEcCCCc-eee------------ccccHHH-HHhhccCCcCCC---HHHHHHH Confidence 7889999999999999999999987654332111 111 0112233 344444665655 4689999 Q ss_pred HHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCcc Q lcl|NC_012530. 159 LVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDIL 238 (559) Q Consensus 159 ~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~ 238 (559) ++.+++++||+|++|+|+ .|+|++||||+|.+|++..+.+|.. .|+....++....++++||||++.++ . T Consensus 113 ~~~~lll~Gnay~~i~r~-~g~~~~L~~l~p~~v~i~~~~~g~~-----~y~~~~~~g~~~~~~~~dIih~r~~~----~ 182 (437) T protein:vir:10 113 IVASMLLWGNGYARKLRS-AGVLIGLELMLPQRTTVKRLTSGAL-----QYTYRNVDGTVSTLAEDDVFHVRGFS----L 182 (437) T ss_pred HHHHHhhcCCeEEEEEec-CCcEEEEEEEcCcceEEEECCCCeE-----EEEEEecCceEEEEccccEEEecCcC----C Confidence 999999999999999998 5999999999999999998877643 34433445566789999999997532 3 Q ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc Q lcl|NC_012530. 239 SGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT 318 (559) Q Consensus 239 ~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~ 318 (559) ++.+|+||+..++.+|.++.++++|+.++|+||++|+|||+++ +.+++++++++++.|++.++|..|+|+++||+ T Consensus 183 d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-----~~l~~e~~~~~~~~~~~~~~g~~nag~~~vl~ 257 (437) T protein:vir:10 183 DGLMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTD-----QILQKEKRAEIRTDLAEQFGGAMQAGKTMVLE 257 (437) T ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-----CCCCHHHHHHHHHHHHHHhcCccccCcceecc Confidence 4678999999999999999999999999999999999999875 35889999999999999999999999988885 Q ss_pred CCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHH Q lcl|NC_012530. 319 AEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMI 397 (559) Q Consensus 319 ~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~i 397 (559) + +++|+++++ +.|+||+|++++++++||++|||||++||+.+.+++ +++|++++.+.|+++||+||+.+| T Consensus 258 ~-g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~--------~~sn~e~~~~~f~~~tl~P~~~~i 328 (437) T protein:vir:10 258 A-GMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTS--------WGTGIEQQTLGFLTFTLRPWLTRI 328 (437) T ss_pred C-CceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccc--------ccchHHHHHHHHHHHHHHHHHHHH Confidence 4 599999985 799999999999999999999999999999876654 457899999999999999999999 Q ss_pred HHHHHhhccccccCccce--eeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEee-ccceecccccc Q lcl|NC_012530. 398 AKNLTNGIIRQILGDNYM--LEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIIL-SAVYIQRLGQQ 473 (559) Q Consensus 398 e~~ln~~L~~~~~~~~~~--~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~-~~~~~~~l~~~ 473 (559) |++|+++||++.++..++ |+++.+++.|.+++++++..++. |+||+||+|+++||||++|||.++ ++.++.++... T Consensus 329 e~~l~~kll~~~e~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gg~~~~~~~~~~~~~~~~ 408 (437) T protein:vir:10 329 EQAARRSLLRPGERDQFYAEFSVEGLLRADSAGRAAFYSTMTQNGLMTRDECRAKENLPPMGGNAAVLTVQSALLPIDKL 408 (437) T ss_pred HHHHHhhccCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcceEeecCcccchhhc Confidence 999999999876655544 45567899999999999998886 568999999999999999888754 56666555432 Q ss_pred cccccccccccccccccccccCCCCCCCCCCCCccccccchhccc Q lcl|NC_012530. 474 EQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQE 518 (559) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 518 (559) .+... ........+. .+.+++ .. ..++++ T Consensus 409 ~~~~~-------~~~~~~~~~~--~~~~~~-----~~--~~~~e~ 437 (437) T protein:vir:10 409 GEHTT-------ATAAQDALKA--WLYQEE-----KT--RATQER 437 (437) T ss_pred cCcCC-------Ccchhccccc--cCCCCC-----CC--CccccC Confidence 11100 0000000000 000000 00 000000 No 21 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=100.00 E-value=3.4e-86 Score=489.17 Aligned_cols=428 Identities=16% Similarity=0.139 Sum_probs=309.6 Q ss_pred HHHHHHHHHHH--HHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_012530. 19 FFKHIDSKIAN--DTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQ 96 (559) Q Consensus 19 ~~~~~~~~~~~--~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ 96 (559) +.+.|.+-... .....+..++. .+.+.. ....++..+.-.++. .......+.|+.+++|++||++||++ T Consensus 1 ~~~~l~~~~~~~~~~~~~~~~~~~----~~~~~~--~~~~~~~~~~g~~~~---~g~~v~~~~al~~~~V~~~i~~ia~~ 71 (434) T protein:vir:43 1 MSKSLGKVLSSATSAPRSSLFGWG----GKTIRL--TDGAFWSQFLGRESS---SGKKVTVDKAMKLSAVWACVRLISTS 71 (434) T ss_pred Cccchhhhhhhcccccchhhhccc----cccccc--CchHHHHHHhcCCcc---CCceechhhhhccHHHHHHHHHHHHh Confidence 22222221100 00011111111 111000 001111111111111 12233456789999999999999999 Q ss_pred HHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEEC Q lcl|NC_012530. 97 VTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYD 176 (559) Q Consensus 97 ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd 176 (559) ||++|+.+++.... +-++ ... ++....|++..||++++. ++|++.++.+++++||+|++|.++ T Consensus 72 ia~lp~~~~~~~~~-g~~~------------~~~-~~~l~~lL~~~PN~~~t~---~~f~~~~~~~lll~Gnay~~i~~~ 134 (434) T protein:vir:43 72 VAGLPLGVYERKAD-GSRV------------DAR-SFPLYDVVHNSPNDDMTA---FQFWQAMVASMLLWGNAYAEIRRA 134 (434) T ss_pred hhhCceEEEEEcCC-Cccc------------ccc-ccHHHHHHhccCCCCCCH---HHHHHHHHHHHhhcCCeEEEEEeC Confidence 99999876543221 1000 011 122334444456666654 689999999999999999999887 Q ss_pred CCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHH Q lcl|NC_012530. 177 SNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFIS 256 (559) Q Consensus 177 ~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~ 256 (559) +|+|++||||+|++|++..+.+|.. .|+++..++....++++||||++.++ .++.+|+||+..++.+|.. T Consensus 135 -~G~~~~L~~l~p~~v~~~~~~~g~~-----~y~~~~~~g~~~~~~~~eVih~~~~~----~dg~~G~spi~~~~~~i~~ 204 (434) T protein:vir:43 135 -AGRPAALDFLLPSRVDLECDENGRL-----KYFYTTKKGARREIERTNMLHIPAFT----LDGRIGLSAIRYGVDVFGS 204 (434) T ss_pred -CCcEEEEEEEcCcceEEEEcCCCeE-----EEEEEecCceEEEEccccEEEecCcC----CCCccccCHHHHHHHHHHH Confidence 6999999999999999999887753 45555566667889999999997543 3467899999999999999 Q ss_pred HHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccc-cchhHH Q lcl|NC_012530. 257 HENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQ-AEDMQF 335 (559) Q Consensus 257 ~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~-~~D~qf 335 (559) +.++++|+.++|+||++|+|||+++ +.+++++.+++|+.|++ +.|..|+|+++||++ +++|++++. +.|+|| T Consensus 205 ~~~~~~~~~~~f~ng~~~~gil~~~-----~~l~~e~~~~~r~~~~~-~~g~~nag~~~vl~~-g~~~~~l~~~~~d~q~ 277 (434) T protein:vir:43 205 VMSAEDAANGTFKNGLLPTVAFKVD-----RILQPAQREEFREYVKS-VSGAMNSGRSPVLEQ-GITPETIGINPVDAQL 277 (434) T ss_pred HHHHHHHHHHHHhccCCcceEEecC-----CCCCHHHHHHHHHHHHH-hcCccccCCccccCC-CceEEEccCChhHHHH Confidence 9999999999999999999999885 45789999999999975 677889999988865 599999985 799999 Q ss_pred HHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccce Q lcl|NC_012530. 336 QSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYM 415 (559) Q Consensus 336 ~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~ 415 (559) +|++++++++||++|||||++||+.+.+++ .++|++++...|+++||.||+.+||++||++|+++.+...++ T Consensus 278 ~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~--------~~s~~e~~~~~f~~~~L~P~~~~ie~~ln~kL~~~~~~~~~~ 349 (434) T protein:vir:43 278 LETREHGVIEICRWFGVPPWMIGQTDKGSN--------WGTGLEQQMLAFLTFSISSITNQIQQCVNKRLLTAPERIRYY 349 (434) T ss_pred HHHHHHHHHHHHHHhCCCHHHhCCCcCCcc--------ccchHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhhcCce Confidence 999999999999999999999998776543 357899999999999999999999999999999876654555 Q ss_pred eee--cchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceeccccccccccccccccccccccccc Q lcl|NC_012530. 416 LEF--VGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLE 492 (559) Q Consensus 416 ~~f--~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 492 (559) |+| +.++++|.+++++++..++. |+||+||+|+++||||+||||+++++.|++++....+..... ...... T Consensus 350 ~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~------~~~~~~ 423 (434) T protein:vir:43 350 AEFSLEGFLKADSAGRAAWYSTMAQNGFMTRNEGRRKENLPELPGGDILTVQSNLVPIDQLGQSNKSQ------AVRAAL 423 (434) T ss_pred EEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeeccCccchhhhhccCCCc------chhhhh Confidence 555 57889999999999999886 568999999999999999999999999998886543221111 000000 Q ss_pred ccCCCCCCCCCCC Q lcl|NC_012530. 493 SALQNPSGTPPTL 505 (559) Q Consensus 493 ~~~~~~~~~~~~~ 505 (559) .+..++++|+. T Consensus 424 --~~~~~~~~~~~ 434 (434) T protein:vir:43 424 --MNWFSQPEPQE 434 (434) T ss_pred --hccCCCCCCCC Confidence 00011111211 No 22 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=100.00 E-value=9.5e-86 Score=486.69 Aligned_cols=404 Identities=15% Similarity=0.144 Sum_probs=314.0 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) ||||+|++.-|.... .......|... .+.+. +. ...+.+ T Consensus 1 MG~~~~~~~~~~~~~-----------------------~~~~~~~~~~~-------~~~g~---~~--------~~~~~a 39 (411) T protein:vir:81 1 MGWWSRLTRFFRPRN-----------------------ETVDMTNPLLL-------QWLGV---DP--------DTPRNQ 39 (411) T ss_pred CchHHHHHhhccCcc-----------------------cccccchHHHH-------HHhcC---cc--------cChhhh Confidence 999999853322100 00011111110 01010 00 112456 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.+++..++...++ ..+.+ ..|++..||++++ +++|++.++ T Consensus 40 l~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~--------------~~~~l-~~lL~~~PN~~~t---~~~f~~~l~ 101 (411) T protein:vir:81 40 LSEATYFACLKILSESLGKLPLKMYQKTERGIVKS--------------DREEL-YNLLKLRPNPYMT---SSVFWSTVE 101 (411) T ss_pred hccHHHHHHHHHHHHhHhhCceeEEEecCCceeee--------------cccHH-HHHHhhccCCCCC---HHHHHHHHH Confidence 78999999999999999999988766443321111 11223 3344445666665 468999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEec--CceeeeecccceEEEecccCCCcc Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYID--NKVRGSFTADEMGMFIRNPRSDIL 238 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~--~~~~~~~~~~evi~~~~n~~~~~~ 238 (559) .+++++||+|++++|+ .|++++||||+|++|++..++.+........++.+.. ++....|+++||||++.++. . T Consensus 102 ~~lll~Gna~~~i~r~-~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~eiih~k~~~~---~ 177 (411) T protein:vir:81 102 MNRNHYGNAYVWCQYS-GPQLQALWILPSQYVTIVVDDRGLLGEKNAIWYRYNDPYDGKMYVFRNDEILHFKTSVT---F 177 (411) T ss_pred HHHhhcCCeEEEEEec-CCceEEEEEECCceEEEEEcCcccccccceEEEEEEecCCceEEEEccccEEEEcCCCC---C Confidence 9999999999999998 5999999999999999999988866555555555443 44566799999999986543 3 Q ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc Q lcl|NC_012530. 239 SGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT 318 (559) Q Consensus 239 ~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~ 318 (559) ++.+|+||+.+++.+|..+.++++|+.++|+||++|+|||+++ +.+++++++++++.|++.++|.+|+|+++|++ T Consensus 178 ~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-----~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~ 252 (411) T protein:vir:81 178 DGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYT-----GDLNQEARDRLVKGFEQFANGSKNAGKIIPVP 252 (411) T ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC-----CCCCHHHHHHHHHHHHHHhcCccccCCceecC Confidence 4678999999999999999999999999999999999999875 45889999999999999999999999987774 Q ss_pred CCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHH Q lcl|NC_012530. 319 AEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMI 397 (559) Q Consensus 319 ~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~i 397 (559) ++++|++++. +.|+||+|++++++++||++|||||++||+.+.+ +++|++++...|+++||.||+++| T Consensus 253 -~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~----------t~~n~e~~~~~f~~~~l~P~~~~i 321 (411) T protein:vir:81 253 -LGMKLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKS----------SYASAEAQNLAFYVDTLLYVLKQY 321 (411) T ss_pred -CCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC----------CchhHHHHHHHHHHHHHHHHHHHH Confidence 5599999985 7999999999999999999999999999987654 357899999999999999999999 Q ss_pred HHHHHhhcccccc---CccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccc Q lcl|NC_012530. 398 AKNLTNGIIRQIL---GDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQ 473 (559) Q Consensus 398 e~~ln~~L~~~~~---~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~ 473 (559) |++|+++||++.+ +..++|+++.+++.|.+++++++..++. |+||+||+|+++||||+||||+++++.+++++..+ T Consensus 322 e~~l~~~ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R~~~gl~p~~ggD~~~~~~n~~pl~~~ 401 (411) T protein:vir:81 322 EEEITYKILSNDLISQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEARDYLDMPADDYGNNLMANGNYIPLSML 401 (411) T ss_pred HHHHHhhcCChhhcCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeeccCccchhhh Confidence 9999999998654 2345566668899999999999999886 56899999999999999999999999999988654 Q ss_pred cccccccccccccccccccccCCCC Q lcl|NC_012530. 474 EQIKQNEFQRQQTRLTQLESALQNP 498 (559) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (559) .+... + +++. T Consensus 402 ~~~~~------k---------gGd~ 411 (411) T protein:vir:81 402 GANYG------K---------GGDS 411 (411) T ss_pred hhhhc------c---------CCCC Confidence 32111 0 0000 No 23 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=100.00 E-value=1.2e-85 Score=486.09 Aligned_cols=415 Identities=14% Similarity=0.133 Sum_probs=310.1 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |.|++.|+..-.- .......+.+ ++...+ +..+....-+.+ T Consensus 1 m~~~~~~~~~~~~--------------------------~~~~~~~~~~----------~~~~~~---~~~g~~v~~~~a 41 (419) T protein:vir:57 1 MFIPQFWKGRPSE--------------------------NRVNWQVVPG----------GMRSSS---SQAGVIITPETA 41 (419) T ss_pred CcchhhhccCCcc--------------------------cccccccccc----------cccccc---ccCCceechHHh Confidence 9998888432110 0000000000 000111 111222344567 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +++++||+||++||++||++|+.+++...+.+.++. ..+.+..+ ++..||++++ +++|++.++ T Consensus 42 l~~~~v~~~i~~ia~~ia~lp~~~~~~~~~g~~~~~-------------~~~~l~~l-L~~~PN~~~t---~~~f~~~~~ 104 (419) T protein:vir:57 42 LALSAVRACVTLLAESVAQLPCVLYRRTENGGREIA-------------FDHPLHDL-IRYQPNRKDT---AFEYHEQTQ 104 (419) T ss_pred hccHHHHHHHHHHHHhhccCceEEEEEcCCCceecc-------------ccchHHHH-HhhccccCCC---HHHHHHHHH Confidence 889999999999999999999876554333221111 11223333 3334555554 478999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) .+++++||+|++|+|+.+|+|++||||+|++|++..+.+|.. |+++... ...++.++|||++.++ .++ T Consensus 105 ~~l~l~Gna~~~i~r~~~G~~~~L~pl~~~~v~v~~~~~g~~------~y~~~~~--~~~~~~~~vih~r~~~----~d~ 172 (419) T protein:vir:57 105 GVLGLEGNSYSLIDRNGRGDITELIPINPHKVIVLKGPDGMP------YYDIPSI--GEILPMRMVHHIKSFS----LDG 172 (419) T ss_pred HHHhhcCCeEEEEEECCCCcEEEEEEEcCcceEEEECCCceE------EEEEcCC--ceEEchhhEEEecCcC----CCC Confidence 999999999999999999999999999999999998877643 3333322 2468999999998643 346 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE 320 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g 320 (559) .||+||+..++.+|..+.++++|+.++|.||++|+|||+++... ...+++++++++++.|.+.++|..|+|+++||++ T Consensus 173 ~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~-~~~~~~e~~~~~~~~~~~~~~g~~nag~~~vl~~- 250 (419) T protein:vir:57 173 YIGTSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFEA-KAIASQAAVDAILAKWTERYGGVRNAFSVGMLQE- 250 (419) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEecCcC-CcccCHHHHHHHHHHHHHHhccccccccceecCC- Confidence 78999999999999999999999999999999999999987643 3468999999999999999999999999988855 Q ss_pred ceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_012530. 321 DAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAK 399 (559) Q Consensus 321 ~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~ 399 (559) +++|++++. ++|+||+|++++++++||++|||||++||+.+.+ +++|++++.+.|++.||+||+++||+ T Consensus 251 g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~----------t~sn~e~~~~~f~~~~l~P~~~~ie~ 320 (419) T protein:vir:57 251 GMTYKQLSQDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKS----------TNNNIEHQGLQYVIYTMLAILKRHES 320 (419) T ss_pred CceEEEcCCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC----------ccccHHHHHHHHHHHHHHHHHHHHHH Confidence 599999995 7999999999999999999999999999986654 35789999999999999999999999 Q ss_pred HHHhhccccccCccce--eeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceeccccccccc Q lcl|NC_012530. 400 NLTNGIIRQILGDNYM--LEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQI 476 (559) Q Consensus 400 ~ln~~L~~~~~~~~~~--~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~ 476 (559) +|+++||++.+...++ |+++.+++.|.++++++++.++. |+||+||+|+++||||+||||++++|+|+.+++.+... T Consensus 321 ~l~~~ll~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~ 400 (419) T protein:vir:57 321 AMMRDLLLPSERRDFYIEFNVSSLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLTPIPGGDKYLTPLNMVDSKALTGI 400 (419) T ss_pred HHHhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccccccccccccc Confidence 9999999876555554 55558899999999999999886 56899999999999999999999999988766443221 Q ss_pred ccccccccccccccccccCCCCCCCCCCCCccccccchhccccccccccccc Q lcl|NC_012530. 477 KQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPS 528 (559) Q Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 528 (559) .+.. +++. ++.+... .+ . | T Consensus 401 ~~~~-----------------~~~~-~~~~~~~----~~---------~--~ 419 (419) T protein:vir:57 401 GKAT-----------------PQQL-KDIEAIL----CT---------R--N 419 (419) T ss_pred cCCC-----------------cccC-cchhhhh----hc---------c--C Confidence 1000 0000 0000000 00 0 0 No 24 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=100.00 E-value=4.7e-86 Score=488.36 Aligned_cols=418 Identities=11% Similarity=0.095 Sum_probs=305.5 Q ss_pred cCCcchHHHHHHHH-HHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHH Q lcl|NC_012530. 13 TDDPNAFFKHIDSK-IANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIIN 91 (559) Q Consensus 13 ~~~~~~~~~~~~~~-~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~ 91 (559) -++++.-+..+.+. +.++ ...-..|+.... |..+.. +.+..+.++ ........+.|+++++|++||+ T Consensus 1 ~~~~~~~~~~~~~~g~~~~-~~~~f~~~~~~~--~~~~~~-~~~~~~~~~--------~~~~~v~~~~al~~~~v~~cv~ 68 (424) T protein:vir:18 1 MEEPKYTIDLRTNNGWWAR-LKSWFVGGRLVT--PNQGSQ-TGPVSAHGY--------LGDSSINDERILQISTVWRCVS 68 (424) T ss_pred CCCCccccccCCCCchHHH-HHhhcccccccc--ccchhh-ccccccccc--------cccccccHHHhhccHHHHHHHH Confidence 22222222211111 1111 111111221111 000000 001111111 1112234467899999999999 Q ss_pred HHHHHHHhhhhHhhhhcCCcce-eeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcc Q lcl|NC_012530. 92 TRANQVTEYAHRASTDDNGMGY-QVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVN 170 (559) Q Consensus 92 ~ia~~ia~~~~~~~~~~~g~~~-~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~ 170 (559) +||++||++|+.+++.....+. ++. ..+....|++..||+++++ ++||+.++.+++++||+| T Consensus 69 ~Ia~~iA~lp~~vy~~~~~~~~~~~~--------------~~~~l~~lL~~~PN~~~t~---~~f~~~~~~~lll~Gnay 131 (424) T protein:vir:18 69 LISTLTACLPLDVFETDQNDNRKKVD--------------LSNPLARLLRYSPNQYMTA---QEFREAMTMQLCFYGNAY 131 (424) T ss_pred HHHHhhccCceEEEEeccCCceeeec--------------cccHHHHHHhhccCCCCCH---HHHHHHHHHHHhhcCCeE Confidence 9999999999877554332111 110 1112233444456655554 689999999999999999 Q ss_pred eEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHH Q lcl|NC_012530. 171 YENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMG 250 (559) Q Consensus 171 ~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~ 250 (559) ++|+|+..|+|++||||+|.+|++..+.. ..++++..++....|+++||||++... .++.+|+|||..+ T Consensus 132 ~~i~r~~~G~~~~L~~l~~~~v~v~~~~~-------~~~y~~~~~g~~~~~~~~eVihir~~~----~dg~~G~spi~~~ 200 (424) T protein:vir:18 132 ALVDRNSAGDVISLLPLQSANMDVKLVGK-------KVVYRYQRDSEYADFSQKEIFHLKGFG----FTGLVGLSPIAFA 200 (424) T ss_pred EEEEECCCCcEEEEEEecCcceEEEEcCC-------eEEEEEEeCCeEEEeccccEEEecCcC----CCCcccccHHHHH Confidence 99999999999999999999999876532 233444555666789999999997532 3467899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccc- Q lcl|NC_012530. 251 LREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQ- 329 (559) Q Consensus 251 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~- 329 (559) +.+|..+.++++|+.++|+||++|+|||+++.. .+++++++++++.|++.++| .|+|+++||++ +++|++++. T Consensus 201 ~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~----~l~~e~~~~~~~~~~~~~~~-~nag~~~vl~~-g~~~~~l~~~ 274 (424) T protein:vir:18 201 CKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK----VLTEQQRSQVEENFKEIAGG-PVKKRLWILEA-GFSTSAIGVT 274 (424) T ss_pred HHHHHHHHHHHHHHHHHHhccCCcceEEEeCCc----CCCHHHHHHHHHHHHHHhCC-cccCCceeccC-CceEEecCCC Confidence 999999999999999999999999999998642 36899999999999987665 68899988855 599999985 Q ss_pred cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccc Q lcl|NC_012530. 330 AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQI 409 (559) Q Consensus 330 ~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~ 409 (559) +.|+||+|++++++++||++|||||++||+.+.+++ .++|++++...|+++||.||+++||++||++|+++. T Consensus 275 ~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~--------~~sn~eq~~~~f~~~tl~P~~~~ie~~ln~~L~~~~ 346 (424) T protein:vir:18 275 PQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTS--------WGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPSK 346 (424) T ss_pred hhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCccc--------ccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcc Confidence 799999999999999999999999999999887654 346899999999999999999999999999999876 Q ss_pred cCccc--eeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceeccccccccccccccccccc Q lcl|NC_012530. 410 LGDNY--MLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQT 486 (559) Q Consensus 410 ~~~~~--~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~ 486 (559) +...+ +|+++.+++.|.++|++++..+++ |+||+||+|+++||||+||||+++++.+++++..+..... T Consensus 347 ~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~~~~~~n~~~l~~~~~~~~-------- 418 (424) T protein:vir:18 347 DVGRLHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNMPPLPGGDVAMRQAQYVPITDLGTNKE-------- 418 (424) T ss_pred ccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccCccchhhhhccCC-------- Confidence 65444 455568899999999999999986 5689999999999999999999999999988865432110 Q ss_pred ccccccccCCCCCCCCCCCCcc Q lcl|NC_012530. 487 RLTQLESALQNPSGTPPTLPPS 508 (559) Q Consensus 487 ~~~~~~~~~~~~~~~~~~~~~~ 508 (559) +.+... T Consensus 419 ----------------~~~n~a 424 (424) T protein:vir:18 419 ----------------PRNNGA 424 (424) T ss_pred ----------------ccccCC Confidence 000000 No 25 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=100.00 E-value=3e-86 Score=489.42 Aligned_cols=509 Identities=19% Similarity=0.199 Sum_probs=330.5 Q ss_pred Ccchhhhcc-------ccccCCc----------------chH--HHHHHHH----------HHHHHHhhhhcccc-cccc Q lcl|NC_012530. 1 MGIFDRFRT-------KFYTDDP----------------NAF--FKHIDSK----------IANDTASKALNGVD-RAYT 44 (559) Q Consensus 1 ~~~~~~~~~-------~~~~~~~----------------~~~--~~~~~~~----------~~~~~~~~~~~gr~-~a~~ 44 (559) -||+.-|.+ +.....| ..- ++-|+=. ..++...|...-.. +... T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~kk~~i~~pfkkk~ 84 (945) T protein:vir:10 5 ENIIKGFIVNANEQKRPSFSSNIKANVDSLSRGKDYPGFKPLLTYRALAWNSTVVYSIIIFRKNQVLKKEKIIVPYNHQE 84 (945) T ss_pred hhHhhhheeccccccCccccccchhchhhhhcccCCCCcchhhhhhhhhccceeeeeeeeehhhhHHHhhcccccccccc Confidence 222222210 0000000 000 0111000 00111111110000 0011 Q ss_pred ccccccccccccccccccccCCC-CCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeeccccccc Q lcl|NC_012530. 45 EPVDGNLMFSTLEDTSIVPKPSP-IAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKP 123 (559) Q Consensus 45 ~~~~~~~~~~~~~~~~~~~~p~~-~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~ 123 (559) .|+.+......+.+..+.+.++. ..........+.++.+++|++||++||++||++|+.+++....... .. T Consensus 85 ~~~~~d~f~~s~es~s~vtsls~pdaf~~vnVs~~~AlknsaV~scI~~IA~sIAsLPlklYrr~edG~~-------~~- 156 (945) T protein:vir:10 85 PPFKFNLFEYSPESLMYLPSISDPDAFFLINLFRKYRFNNDSKLIKVSEIPKKLTSKELEIYKHIEDKHV-------NY- 156 (945) T ss_pred cchhhhhhhccCccceecccccCccceeeehhhhhhhhccHHHHHHHHHHHhhhccCceEEEEecccCcc-------cc- Confidence 22222111111111111111110 1111123456788999999999999999999999876544222110 00 Q ss_pred ChhHHHHHHHHHHHHHhcCCCCCCChhh-HHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccc Q lcl|NC_012530. 124 TKEQQKKIDYAERYIERMGVDYSPIRDD-FTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHR 202 (559) Q Consensus 124 ~~~~~~~~~~~~~~L~~~~p~~~~~~~~-~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~ 202 (559) ........+.+..+|.+| |+++++.. |.+|+++++.+++++||+|++++|+.+|+|++||||+|++|++..+++|.. T Consensus 157 ~~kk~~~~hpL~~LL~rP--Np~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiRd~~G~ii~L~pLdPs~Vti~~ddDG~~ 234 (945) T protein:vir:10 157 YLKRIRDARNILEFLERP--DPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIRDEQGNLVAITPVDGTTIKPILSEDTGI 234 (945) T ss_pred cccccccchHHHHHHhCC--CcccChhHHHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEcCCCcE Confidence 001122345667777655 55566654 446999999999999999999999999999999999999999999888754 Q ss_pred cccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHH-hcCCCceEEEec Q lcl|NC_012530. 203 RTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFT-HGGTTKGILLVK 281 (559) Q Consensus 203 ~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~-ng~~p~gil~~~ 281 (559) . .+|++..++.....+.++|+||+++++..+....+||+|||.+++++|..+.++++|+.++|. ||++|+|||+++ T Consensus 235 ~---y~Yv~~idG~~~~~v~a~DvIlhirn~s~DG~~~GyGlSPIeaa~~aI~~alAaek~aar~FskNGa~PsGILsvk 311 (945) T protein:vir:10 235 V---VGYVQEVDGAIVAHFDKRDVVLFRQNLTPDVYMYGYSLPPIEILYKVILSDIFIDKGNLDYYRKGGSIPEGILAIE 311 (945) T ss_pred E---EEEEEecCCceEEEecCCceEEEeccCCCCcccccCCchHHHHHHHHHHHHHHHHHHHHHHHHhCCCccceEEEec Confidence 3 246666777777789999999988888777666778999999999999999999999999995 789999999987 Q ss_pred Ccc-----CCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHH Q lcl|NC_012530. 282 PSP-----SVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPA 355 (559) Q Consensus 282 ~~~-----~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~ 355 (559) ++. .++.+++++++++++.|++.++|. ++|+++| .++|++|++++. +.|+||+|++++++++||++|||||+ T Consensus 312 g~~~~d~k~~~~LseEq~erlKe~wee~~sG~-NnG~piV-LdeGmef~pLs~s~~DaQfLEsrkfs~eeIArAFGVPP~ 389 (945) T protein:vir:10 312 PPSYKEGDIYPQLSREQLESIQRQLQAIMMGD-YTQVPIL-SGGKFTWIDFKGKRRDMQFKELAEFVARKICAVYQVSPQ 389 (945) T ss_pred CccccccccccccCHHHHHHHHHHHHHHhCCc-cccccee-cCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHH Confidence 653 346789999999999999999984 5566444 466799999985 78999999999999999999999999 Q ss_pred HhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHH Q lcl|NC_012530. 356 EIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQL 435 (559) Q Consensus 356 ~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~ 435 (559) +||+.+.+ +++|++++...|+++||+||+.+||++||++|++...+..++|+|+.+++.|.+++++++.. T Consensus 390 lLG~~e~s----------t~SNiEqq~~~Fv~~tL~Pil~~IEqeLNrkLl~~~eg~~i~fdFd~ldl~D~ksraEal~k 459 (945) T protein:vir:10 390 DVGILEGS----------NKATAEVMASLTKAKGLEPLMATISKGFDEVVSEFRNEKDIKLWFKEDDLEKERDWWNIIQG 459 (945) T ss_pred HcccCCCC----------CcchHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccCceeEEEecchhccCHHHHHHHHHH Confidence 99986643 45789999999999999999999999999999877777789999999999999999999999 Q ss_pred HHc-CCCCHHHHHHHhCCCCCCCCCEeeccce-ecccccccccccccccccccccccccccCCCCCCCCCCCCccccccc Q lcl|NC_012530. 436 ELQ-TATTVNDYREKQGLPKIAGGDIILSAVY-IQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 436 ~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (559) +++ |+||+||+|+++||||+||||+++++.+ +++..+....... ..+. .......+++.+.+++ T Consensus 460 li~sGiLTiNEvRe~lGLpPIeGGD~lli~~nn~~P~d~~~ka~~g---a~p~-----q~aq~~~dqp~~kGGe------ 525 (945) T protein:vir:10 460 QLNTGFRSINEARMEKGLEPVPWGDVPFSGLRNWKPEDEQAKAQQG---AMPP-----QLAQAMADQPSQQGGG------ 525 (945) T ss_pred HHhCCCcCHHHHHHHhCCCCCCCcceeeeccccccccccccccccC---CCCc-----ccccCCCCCCCCCCCC------ Confidence 886 5689999999999999999999998753 3333221111000 0000 0000000000000000 Q ss_pred hhccccccccccccccccccccccccccccccchhhhhhccCCCCC Q lcl|NC_012530. 514 QQNQEGYTGKDAKPSGKDNQQGVGKDGQLKNKKNTNSYKQGGSSKK 559 (559) Q Consensus 514 ~~~~~~~~~~~~~~~g~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 559 (559) . ++ +.....+..+++-+-.++..+...+.++. .++ T Consensus 526 -------~-dE-ns~~psE~kda~~e~~~~l~~~~~~~a~e--~i~ 560 (945) T protein:vir:10 526 -------V-DE-NSSVPSEQKNAGLEVLRNLFKSLDANASE--NLK 560 (945) T ss_pred -------C-CC-CCCCCCcccchHHHHHHHHHHHHHHHHHH--HHH Confidence 0 00 00001112222222222233333332210 111 No 26 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=100.00 E-value=8.4e-86 Score=486.99 Aligned_cols=419 Identities=11% Similarity=0.070 Sum_probs=304.7 Q ss_pred hccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChHH Q lcl|NC_012530. 7 FRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVL 86 (559) Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v 86 (559) .-.+-++=++.-+ .-+.++. ..-..|+.+....+.....++... . +........+.|+++++| T Consensus 1 ~~~~~~~~~~~~~-----~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~---~--------~~~~~~v~~~~al~~~~v 63 (424) T protein:vir:18 1 MEEPKYTIDLRTN-----NGWWARL-QSWFVGGRLVTPNQGSQTGPVSAH---G--------HLGDSSINDERILQISTV 63 (424) T ss_pred CCCCcceEeecCC-----CchHHHH-Hhhhcccccccccccccccccccc---c--------ccccccccHHHhhccHHH Confidence 0011111111111 1111111 111112121111100000000000 0 011122345678999999 Q ss_pred HHHHHHHHHHHHhhhhHhhhhcCCcce-eeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHH Q lcl|NC_012530. 87 NAIINTRANQVTEYAHRASTDDNGMGY-QVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYT 165 (559) Q Consensus 87 ~acv~~ia~~ia~~~~~~~~~~~g~~~-~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll 165 (559) ++||++||++||++|+.+++..+..+. ++. ..+.+. .|++..||+++++ ++||+.++.++++ T Consensus 64 ~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~-------------~~~~l~-~lL~~~PN~~~t~---~~f~~~~~~~lll 126 (424) T protein:vir:18 64 WRCVSLISTLTACLPLDVFETDQNDNRKKVD-------------LSNPLA-RLLRYSPNQYMTA---QEFREAMTMQLCF 126 (424) T ss_pred HHHHHHHHHhhccCceEEEEeecCCceeeec-------------cccHHH-HHHhhccCCCCCH---HHHHHHHHHHHhh Confidence 999999999999999876554332111 111 112233 3444445655554 6899999999999 Q ss_pred cCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCccccc Q lcl|NC_012530. 166 YDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLS 245 (559) Q Consensus 166 ~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~S 245 (559) +||+|++|+|+..|+|++||||+|.+|++..+.. ..++.+..++....|+++||||++... .++.+|+| T Consensus 127 ~Gnay~~i~r~~~G~~~~L~pl~~~~V~v~~~~~-------~~~y~~~~~g~~~~~~~~eIih~r~~~----~dg~~G~s 195 (424) T protein:vir:18 127 YGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK-------KVVYRYQRDSEYADFSQKEIFHLKGFG----FTGLVGLS 195 (424) T ss_pred cCCeEEEEEECCCCcEEEEEEecCcceEEEEcCC-------eEEEEEEeCCeEEEeccccEEEecCcC----CCCccccc Confidence 9999999999999999999999999999876532 223344455666789999999997532 35678999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeee Q lcl|NC_012530. 246 ELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFV 325 (559) Q Consensus 246 pl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~ 325 (559) |+.+++++|+++.++++|+.++|+||++|+|||+++. ..+++++++++++.|++.++| .|+|+++||++ +++|+ T Consensus 196 pi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~----~~l~~e~~~~~~~~~~~~~~g-~nag~~~vl~~-g~~~~ 269 (424) T protein:vir:18 196 PIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGE----KVLTEQQRSQVEENFKEIAGG-PVKKRLWILEA-GFSTS 269 (424) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCC----cCCCHHHHHHHHHHHHHHhCC-cccCCceeccC-CceEE Confidence 9999999999999999999999999999999998864 247899999999999988766 68999988855 59999 Q ss_pred eccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhh Q lcl|NC_012530. 326 SMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNG 404 (559) Q Consensus 326 ~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~ 404 (559) ++++ +.|+||+|++++++++||++|||||++||+.+.+++ .++|++++...|+++||.||+++||++||++ T Consensus 270 ~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~--------~~sn~eq~~~~f~~~tl~P~~~~ie~~l~~~ 341 (424) T protein:vir:18 270 AIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTS--------WGSGIEQQNLGFLQYTLQPYISRWENSIQRW 341 (424) T ss_pred ecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccc--------ccccHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 9985 799999999999999999999999999999887654 3468999999999999999999999999999 Q ss_pred ccccccCccce--eeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccc Q lcl|NC_012530. 405 IIRQILGDNYM--LEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEF 481 (559) Q Consensus 405 L~~~~~~~~~~--~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~ 481 (559) |+++.+...++ |+++.+++.|.++|++++..+++ |+||+||+|+++||||+||||+++++.++.++..+.... T Consensus 342 L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGD~~~~~~n~~~l~~~~~~~---- 417 (424) T protein:vir:18 342 LIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNK---- 417 (424) T ss_pred cCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccCccchHhhhccC---- Confidence 99887655544 45567899999999999999986 568999999999999999999999999998886532211 Q ss_pred cccccccccccccCCCCCCCCCCCCcc Q lcl|NC_012530. 482 QRQQTRLTQLESALQNPSGTPPTLPPS 508 (559) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (559) ++.++.. T Consensus 418 --------------------~p~~~ga 424 (424) T protein:vir:18 418 --------------------EPRNNGA 424 (424) T ss_pred --------------------CCccCCC Confidence 0000000 No 27 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=100.00 E-value=3e-85 Score=483.93 Aligned_cols=421 Identities=15% Similarity=0.174 Sum_probs=303.4 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |++|+|.+.-|.--+ -.+..+++. ..|+. ...+. ....++ .......-+.| T Consensus 7 ~~~~~~~~~~~~~~~-----------------~~~~~~~~~--~~~~~------~~~~~-~~~~~s---~~g~~v~~~~a 57 (432) T protein:vir:10 7 LGLLGQLKAMFVPPD-----------------PVDIGGGQT--FTPVN------ATARD-LGIIIS---DTGAAVNADAI 57 (432) T ss_pred cchhhhhHhhcCCcc-----------------ccccccccc--cccCc------chhhh-hccccc---ccCcccchhhh Confidence 999998733221111 111111111 11110 00000 000111 11223344678 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +++|+|++||++||++||++|+.+++.......+ ..++....|++..||++++ +++||+.++ T Consensus 58 l~~~~V~~~i~~Ia~~ia~lp~~~y~~~~~g~~~---------------~~~~~l~~lL~~~PN~~~t---~~~f~~~l~ 119 (432) T protein:vir:10 58 MRLDAVAACVKLVSQAIAAMPLTMYMRTPDGRKE---------------AVNHPLYTLLLDGPNSTQT---AFDFWQVVV 119 (432) T ss_pred hcchHHHHHHHHHHHhhhhCceeEEEecCCCccc---------------ccccHHHHHHHhcccccCC---HHHHHHHHH Confidence 8999999999999999999998765432211100 1112223344445565655 468999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) .+++++||+|++++|+ +|++++||||+|++|++..+.+|.. .|+....++....++++||||++.++ .++ T Consensus 120 ~~lll~Gnay~~~~~~-~g~~~~L~~l~~~~v~v~~~~~g~~-----~y~~~~~~g~~~~~~~~~iih~~~~~----~dg 189 (432) T protein:vir:10 120 TRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDTKGNT-----AYRYRRTDGQMIDIPKQQIWKIMGYS----LDG 189 (432) T ss_pred HHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEEcCCCcE-----EEEEEecCceEEEEcCccEEEecCCC----CCC Confidence 9999999999999997 5999999999999999999887743 34444455666789999999997543 346 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE 320 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g 320 (559) .+|+|||..++++|..+.++++|+.++|+||++|+|||+++ +.+++++++++++.| +|..|+|+++||++ T Consensus 190 ~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-----~~l~~e~~~~~~~~~----~~~~nag~~~vl~~- 259 (432) T protein:vir:10 190 ENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQID-----RFLTDDQYDSFAKKV----SGSVEAGRAPLLEG- 259 (432) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecC-----CCCCHHHHHHHHHHH----hhhhhCCCceecCC- Confidence 78999999999999999999999999999999999999876 357899988887766 45678999888855 Q ss_pred ceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_012530. 321 DAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAK 399 (559) Q Consensus 321 ~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~ 399 (559) +++|++++. ++|+||+|++++++++||++|||||++||+.+.++++ +++|++++.+.|+++||.||+++||+ T Consensus 260 g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~-------~~sn~e~~~~~f~~~tl~P~~~~ie~ 332 (432) T protein:vir:10 260 GMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTS-------WGSGIESQQLGFLSMTLSPWLRRIEQ 332 (432) T ss_pred CceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCccc-------ccchHHHHHHHHHHHHHHHHHHHHHH Confidence 599999996 7999999999999999999999999999998876543 35789999999999999999999999 Q ss_pred HHHhhccccccCccceeee--cchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEe-eccceecccccccc Q lcl|NC_012530. 400 NLTNGIIRQILGDNYMLEF--VGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDII-LSAVYIQRLGQQEQ 475 (559) Q Consensus 400 ~ln~~L~~~~~~~~~~~~f--~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~-~~~~~~~~l~~~~~ 475 (559) +||++|+++.+...++|+| +.++++|.++|++++..+++ |+||+||+|+++||||++|||.+ .++.+.+++..+.+ T Consensus 333 ~ln~kL~~~~~~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~glppi~g~~~~~~~~~~~~pl~~~~~ 412 (432) T protein:vir:10 333 SIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNAAVLTVQSAMVPLDSIGL 412 (432) T ss_pred HHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCcceEeecCcccchhhhcc Confidence 9999999877665565555 57889999999999999986 56899999999999999987654 46666666654321 Q ss_pred cccccccccccccccccccCCCCCCCCCCCCccccccchhc Q lcl|NC_012530. 476 IKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQN 516 (559) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (559) .... .+....++.++.+ .+ + T Consensus 413 ~~~~----------~~~~~~~~~~~~~------~~-----~ 432 (432) T protein:vir:10 413 QASP----------EPASGLGNQQQDK------VS-----K 432 (432) T ss_pred cCCC----------CCCCCCCCccccc------cc-----C Confidence 1100 0000000000000 00 0 No 28 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=100.00 E-value=6.2e-85 Score=482.22 Aligned_cols=410 Identities=14% Similarity=0.145 Sum_probs=304.8 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+||+|+..+ .... ....+. .+...... ..++.. .....-..+ T Consensus 1 Mg~f~~lf~r---~~~~------------------------~~~~~~----~~~~~~~~---~~~~~~---g~~v~~~~a 43 (414) T protein:vir:44 1 MVFFSGLFQR---KSDA------------------------PVTTPA----ELADAIGL---SYDTYT---GKQISSQRA 43 (414) T ss_pred Cchhhhhhcc---CccC------------------------cccchh----hHhHhhcc---CccccC---Cceechhhh Confidence 9999997211 1000 000000 00000000 011111 111122456 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.+++..++... ....+.+. .|++..||++++ +++|++.++ T Consensus 44 l~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~~~--------------~~~~~~~~-~lL~~~PN~~~t---~~~f~~~~~ 105 (414) T protein:vir:44 44 MRLTAVFSCVRVLAESVGMLPCNLYHLNGSLKQ--------------RATGERLH-KLISTHPNGYMT---PQEFWELVV 105 (414) T ss_pred hccHHHHHHHHHHHHHhccCceEEEEecCCcee--------------ecccchHH-HHHHhhcccCCC---HHHHHHHHH Confidence 789999999999999999999876554332110 01112233 344444555554 468999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) .+++++||+|++++++ .|+|++||||+|.+|++..+.+|.. .|.....++....|+++||||++.++ .++ T Consensus 106 ~~~ll~Gna~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~~~-----~y~~~~~~g~~~~~~~~evih~~~~~----~d~ 175 (414) T protein:vir:44 106 TCLCLRGNFYAYKVKA-FGEVAELLPVDPGCVVPKLNSSWEP-----VYQVTFPDGSTDVLSQEDIWHVRTLT----LDG 175 (414) T ss_pred HHHhhcCCeEEEEEeC-CCcEEEEEEEcCceEEEEECCCCcE-----EEEEEecCceEEEEccccEEEecCCC----CCC Confidence 9999999999999987 6999999999999999988876642 44444455566789999999998543 246 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE 320 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g 320 (559) .||+||+..+..+|+++.++++|+.++|+||++|+|||++++ .+++++++++++.|++.++|.+|+|+++|+++ T Consensus 176 ~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-----~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~- 249 (414) T protein:vir:44 176 LVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQ-----TLSDQAYERLKKDFEERHTGLGNAHRPMILEM- 249 (414) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC-----CCCHHHHHHHHHHHHHHhcCccccCcceecCC- Confidence 789999999999999999999999999999999999998864 58899999999999999999999999888754 Q ss_pred ceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_012530. 321 DAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAK 399 (559) Q Consensus 321 ~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~ 399 (559) +++|++++. ++|+||+|++++++++||++|||||++||+.+.+ +++|++++.+.|+++||+||+++||+ T Consensus 250 g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~----------t~~n~e~~~~~~~~~~l~P~~~~ie~ 319 (414) T protein:vir:44 250 GLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRA----------TFNNIEELGLGFINYSLVPYLTRIEQ 319 (414) T ss_pred CceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC----------CcccHHHHHHHHHHHHHHHHHHHHHH Confidence 599999985 6999999999999999999999999999976544 36789999999999999999999999 Q ss_pred HHHhhccccccCccc--eeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceeccccccccc Q lcl|NC_012530. 400 NLTNGIIRQILGDNY--MLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQI 476 (559) Q Consensus 400 ~ln~~L~~~~~~~~~--~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~ 476 (559) +||++|+++.+...+ +|+++.+++.|.++++++++.+++ |+||+||+|+++||||+||||++++|.++......... T Consensus 320 ~ln~~L~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~ 399 (414) T protein:vir:44 320 RINTGLVRKSKQGVFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGDVYLTPMNMTTKPSDGSK 399 (414) T ss_pred HHHhhcCCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceecccccccccCCcccc Confidence 999999987765554 455568899999999999999986 56899999999999999999999998877543221100 Q ss_pred ccccccccccccccccccCCCCCCCCCCCCccc Q lcl|NC_012530. 477 KQNEFQRQQTRLTQLESALQNPSGTPPTLPPSS 509 (559) Q Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (559) . +...++...+++++ T Consensus 400 ~------------------~~~~~~~~~d~~~~ 414 (414) T protein:vir:44 400 A------------------GKQKDNANADETTS 414 (414) T ss_pred C------------------CCCCCCCCCCCCCC Confidence 0 00000000000111 No 29 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=100.00 E-value=5.1e-85 Score=482.70 Aligned_cols=412 Identities=12% Similarity=0.089 Sum_probs=302.1 Q ss_pred hhccccccc-c-ccccccccccccccccc-cccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCc Q lcl|NC_012530. 35 ALNGVDRAY-T-EPVDGNLMFSTLEDTSI-VPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGM 111 (559) Q Consensus 35 ~~~gr~~a~-~-~~~~~~~~~~~~~~~~~-~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~ 111 (559) =.++|.... . +...... +++..+ ...++.+ .....-+.|+.+++|++||++||++||++|+.+++..++. T Consensus 1 ~~~~r~~~~~~~~~~~~~~----~~~~~~~g~~~s~~---~~~vt~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~ 73 (419) T protein:vir:14 1 MFFSRQLLSNLGQTQMSAG----GWVSALLGSSRSDS---GQVVTPASALALTVLQNCVTLLAESIAQLPIELYERSGED 73 (419) T ss_pred CcccccccccccccccCcc----hhhHHhhcCCCccC---CcccchHHhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 111111100 0 0000011 111111 1111111 2223446789999999999999999999998765433221 Q ss_pred ceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCce Q lcl|NC_012530. 112 GYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTT 191 (559) Q Consensus 112 ~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~ 191 (559) . + ....+.+.. |++..||++++ +++|++.++.+++++||+|++|+|+.+|+|++||||+|++ T Consensus 74 ~-~-------------~~~~~~l~~-lL~~~PN~~~t---~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~l~pl~~~~ 135 (419) T protein:vir:14 74 R-K-------------PATDHPLYS-ILKYEPNSWQT---PFEYQEQSQVAVGLRGNSYSFIDRDSDGVIQGLYPLDNEA 135 (419) T ss_pred c-c-------------cccccHHHH-HHHhhcccCCC---HHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCce Confidence 1 0 111123333 33334555554 4789999999999999999999999999999999999999 Q ss_pred EEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_012530. 192 IYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHG 271 (559) Q Consensus 192 V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng 271 (559) |++..+.+|.. .| ++... ..+++++|+|+++++ .++.||+||+..++.+|..+.++++++.++|+|| T Consensus 136 v~v~~~~~~~~-----~y-~~~~~---~~~~~~~i~h~~~~~----~dg~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng 202 (419) T protein:vir:14 136 VTVMRGSDLKP-----VY-RVRGS---DPMPQRLVHHVRWMS----INGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNG 202 (419) T ss_pred EEEEECCCceE-----EE-EEccC---cccchhheeEecCcC----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 99998877642 22 22221 236789999997654 3467899999999999999999999999999999 Q ss_pred CCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccc-cchhHHHHHHHHHHHHHHHHh Q lcl|NC_012530. 272 GTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALV 350 (559) Q Consensus 272 ~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~f 350 (559) ++|+|||++++... ..+++++++++++.|++.++|.+|+|+++||+. +++|++++. +.|+||+|++++++++||++| T Consensus 203 ~~p~gil~~~~~~~-~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~-g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~f 280 (419) T protein:vir:14 203 TALSGVIERPKDAP-ALKDQASVDRITDGWNAKFGGSGNAKKVALLQE-GMTFRPLSMTNVDAALIDALRLSALDIARIY 280 (419) T ss_pred CCccEEEEecCCCC-cccCHHHHHHHHHHHHHHhcCccccCCceecCC-CceEEEccCChhhHHHHHHHHHHHHHHHHHh Confidence 99999999876543 456799999999999999999999999988855 599999995 799999999999999999999 Q ss_pred CCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeee--cchhhhhHHH Q lcl|NC_012530. 351 AMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEF--VGGDTRSQQD 428 (559) Q Consensus 351 gVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f--~~l~~~d~~~ 428 (559) ||||++||+.+.+ +++|+|++.+.|+++||.||+++||++||++||++.+...++++| +.+++.|.++ T Consensus 281 gVpp~~lg~~~~~----------t~s~~E~~~~~f~~~~L~P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~r~d~~~ 350 (419) T protein:vir:14 281 KIPAHMVNELERA----------TFSNIEHQSLQFVIYTLLPWVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSS 350 (419) T ss_pred CCCHHHhcCCCCC----------CcccHHHHHHHHHHHHHHHHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHH Confidence 9999999976654 357899999999999999999999999999999876655555555 5788999999 Q ss_pred HHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCc Q lcl|NC_012530. 429 KLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPP 507 (559) Q Consensus 429 ~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (559) +++++..+++ |+||+||+|+++|+||+||||++++|+++++++...+.... .+++ . T Consensus 351 ~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gGD~~~~~~n~~~~~~~~~~~~~--------------------~~~~---~ 407 (419) T protein:vir:14 351 RYAAYAVGRQWGWLSINDIRRLENMPPVKGGDIYLSPMNMVDASKPQQLPVG--------------------KSEP---T 407 (419) T ss_pred HHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccccccccccccccCC--------------------CCCC---c Confidence 9999999886 56899999999999999999999999988765533211000 0000 0 Q ss_pred cccccchhcccc Q lcl|NC_012530. 508 SSSNSFQQNQEG 519 (559) Q Consensus 508 ~~~~~~~~~~~~ 519 (559) ....++..+.-+ T Consensus 408 ~~~~~e~~~~l~ 419 (419) T protein:vir:14 408 KAAIDEIGRILS 419 (419) T ss_pred cccccchhcccC Confidence 000011111111 No 30 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=100.00 E-value=6.7e-85 Score=482.04 Aligned_cols=403 Identities=11% Similarity=0.071 Sum_probs=300.5 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) +.+|++| | .+. ....|... +. ..+.... .....++....+.| T Consensus 16 ~~~~~~l---f-----------------~~~----------~~~~~~~~-~~--~~~~~~~-----~~~~~~~~vs~~~a 57 (424) T protein:vir:45 16 RVLLDAL---F-----------------RSK----------SLENPSTP-IT--GDAVDTD-----GLFRADVYVSPETA 57 (424) T ss_pred hHHHHhh---c-----------------ccc----------CCCCCccc-cc--hhhhhhh-----ccccCCceechHHh Confidence 5556665 1 110 01111100 00 0000000 00111223345678 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +++++|++||++||++||++|+.+++..+|...++ ..+.+.. |++..||++++ .++|++.++ T Consensus 58 l~~~~v~~cv~~Ia~~iA~lp~~v~~~~~~~~~~~--------------~~~~l~~-lL~~~PN~~~t---~~~f~~~~v 119 (424) T protein:vir:45 58 MKLAAVYSCIYVLSSSLAQMPLHVMRRHKGKVEPA--------------RDHPAFY-LVHDEPNTWQT---SYKWRELKQ 119 (424) T ss_pred hccHHHHHHHHHHHHHHhhCceEEEEecCCceeec--------------ccchHHH-HHHhhcccCCC---HHHHHHHHH Confidence 89999999999999999999987765443321111 1122333 33334555554 468999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) .+++++||+|++|+|+..|+|++||||+|++|++..+... .++.+........++++||||++... .++ T Consensus 120 ~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~i~~~~~~-------~~y~~~~~~~~~~~~~~eVih~r~~~----~d~ 188 (424) T protein:vir:45 120 RHILGWGNGYTWVKRNRRGEVISLDCCMPWETTLMNTGGR-------YTYGLYNEYGAFAISPDDMIHIRALG----NNQ 188 (424) T ss_pred HHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEEcCCe-------EEEEEEecCceEEECcccEEEecCcC----CCC Confidence 9999999999999999999999999999999998765321 22333444456679999999998532 346 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCc-ccccccccccC Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGI-NGAYRIPMITA 319 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~-~nag~~~vl~~ 319 (559) .+|+||+..++++|..+.++++|+.++|+||++|+|||+++. .+++++++++++.|++.++|. +|+|+++||++ T Consensus 189 ~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-----~l~~e~~~~~~~~~~~~~~g~~~n~g~~~vl~~ 263 (424) T protein:vir:45 189 KMGLSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKS-----GLNKESWGWLKDQWQKASQALRRQENKTMLLPA 263 (424) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC-----CCCHHHHHHHHHHHHHHhccccccCCceeEcCC Confidence 789999999999999999999999999999999999998864 478999999999999999985 68999888855 Q ss_pred Cceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHH Q lcl|NC_012530. 320 EDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIA 398 (559) Q Consensus 320 g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie 398 (559) +++|++++. ++|+||+|++++++++||++|||||++||+.+.++ ++|++++.+.|++.||.||+++|| T Consensus 264 -g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t----------~sn~eq~~~~f~~~tL~P~~~~ie 332 (424) T protein:vir:45 264 -DLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKAT----------FSNISAQAIQFVRYTMMPWVTNWE 332 (424) T ss_pred -CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC----------cccHHHHHHHHHHHHHHHHHHHHH Confidence 599999995 79999999999999999999999999999876543 578999999999999999999999 Q ss_pred HHHHhhccccccC---ccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceeccccccc Q lcl|NC_012530. 399 KNLTNGIIRQILG---DNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQE 474 (559) Q Consensus 399 ~~ln~~L~~~~~~---~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~ 474 (559) ++||++|+++.+. ..++|+++.+++.|.+++++++..++. |+||+||+|+++||||+||||++++|.|+.+.... T Consensus 333 ~~ln~kLl~~~e~~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g~~T~NE~R~~~gl~pi~ggD~~~~~~n~~~~~~~- 411 (424) T protein:vir:45 333 QELNRRLFTRAELAAGYYVRFNLTGLLRGTPQERAQFYHFAITDGWMSRNEARAFEDMNPVEGLDEMLVSVNAANPAGD- 411 (424) T ss_pred HHHHHhcCChhhhcCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeecccccccccc- Confidence 9999999987643 235555668899999999999999986 56899999999999999999999998876532100 Q ss_pred ccccccccccccccccccccCCCCCCCCCCCCccccc Q lcl|NC_012530. 475 QIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSN 511 (559) Q Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (559) ..+++.+++...+ T Consensus 412 ------------------------~~~~~~~~~~~~~ 424 (424) T protein:vir:45 412 ------------------------FKPPKNDEGKTNE 424 (424) T ss_pred ------------------------cCCCCCCCCCCCC Confidence 0000111111110 No 31 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=100.00 E-value=1.9e-84 Score=479.63 Aligned_cols=421 Identities=15% Similarity=0.172 Sum_probs=302.7 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) ||+|+|.+-.|.- ....+..+.. . ..|... .. ..+...++. .+....-..| T Consensus 7 mg~f~r~~~~~~~-----------------~~~~~~~~~~-~-~~~~~~------~~-~~~~~~~~~---~g~~v~~~~a 57 (432) T protein:vir:81 7 LGLFGQLKAMFVP-----------------PDPVDIGGGQ-T-FTPVNA------TA-RDLGIIISD---TGAAVNADAI 57 (432) T ss_pred cchhhhhhhhccc-----------------cccccccccc-c-cccCcc------ch-hhhcccccc---cCcccchHhh Confidence 9999996211111 0000111111 0 011100 00 001111111 1222344678 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +++|+|++||++||++||++|+.+++..+....+ ...+.+. .|++..||+++++ ++||+.++ T Consensus 58 l~~~~V~~~i~~Ia~~ia~lp~~~y~~~~~g~~~--------------~~~~~l~-~lL~~~PN~~~t~---~~f~~~l~ 119 (432) T protein:vir:81 58 MRLDAVAACVKLVSQAIAAMPLTMYMRTPDGRKE--------------AVNHPLY-TLLLDGPNSTQTA---FDFWQVVV 119 (432) T ss_pred hccHHHHHHHHHHHHhhhhCceeeEEecCCccee--------------cccchHH-HHHHhcccccCCH---HHHHHHHH Confidence 8999999999999999999998765433211111 0112233 3444456666654 68999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) .+++++||+|++++|+ +|+|++||||+|+.|++..+.+|.. .|.....++....++++||||++.++ .++ T Consensus 120 ~~lll~Gnayv~i~~~-~g~~~~L~~l~~~~v~v~~~~~g~~-----~y~~~~~~g~~~~~~~~~iih~r~~~----~dg 189 (432) T protein:vir:81 120 TRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDPKGNT-----AYRYRRTDGQMIDIPKQQIWKIMGYS----LDG 189 (432) T ss_pred HHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEECCCCcE-----EEEEEecCceEEEEccccEEEecCCC----CCC Confidence 9999999999999997 5999999999999999999877743 34444445666789999999998653 346 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE 320 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g 320 (559) .||+|||..++.+|..+.++++|+.++|+||++|+|||+++ +.+++++++++++.| +|..|+|+++||++ T Consensus 190 ~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-----~~l~~e~~~~~~~~~----~~~~nag~~~vl~~- 259 (432) T protein:vir:81 190 ENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQID-----RFLTDDQYDSFAKKV----SGSVEAGRAPLLEG- 259 (432) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecC-----CCCCHHHHHHHHHHH----hhhhcCCCceecCC- Confidence 78999999999999999999999999999999999999886 357899988888766 45678999888855 Q ss_pred ceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_012530. 321 DAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAK 399 (559) Q Consensus 321 ~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~ 399 (559) +++|++++. ++|+||+|++++++++||++|||||++||+.+.++++ +.+|++++.+.|++.||.||+++||+ T Consensus 260 g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~-------~~sn~eq~~~~f~~~tl~P~~~~ie~ 332 (432) T protein:vir:81 260 GMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTS-------WGSGIESQQLGFLTMTLSPWLRRIEQ 332 (432) T ss_pred CceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccc-------ccchHHHHHHHHHHHHHHHHHHHHHH Confidence 599999996 7999999999999999999999999999998877643 35789999999999999999999999 Q ss_pred HHHhhccccccCccceeee--cchhhhhHHHHHHHHHHHHcC-CCCHHHHHHHhCCCCCCCCCEe-eccceecccccccc Q lcl|NC_012530. 400 NLTNGIIRQILGDNYMLEF--VGGDTRSQQDKLKSVQLELQT-ATTVNDYREKQGLPKIAGGDII-LSAVYIQRLGQQEQ 475 (559) Q Consensus 400 ~ln~~L~~~~~~~~~~~~f--~~l~~~d~~~~~~~~~~~~~~-~~T~NE~R~~~gl~pi~gGD~~-~~~~~~~~l~~~~~ 475 (559) +|+++|+++.+...++|+| +.++++|.++|++++..++++ +||+||+|+++||||++|||.+ .++.++.++....+ T Consensus 333 ~l~~kLl~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~NE~R~~~glpp~~g~~~~~~~~~~~~pl~~~~~ 412 (432) T protein:vir:81 333 SIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNAAVLTVQSAMVPLDSIGL 412 (432) T ss_pred HHHhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCcceEeecCcccchhhhcc Confidence 9999999877655555555 578999999999999999864 6899999999999999987654 46677766644322 Q ss_pred cccccccccccccccccccCCCCCCCCCCCCccccc Q lcl|NC_012530. 476 IKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSN 511 (559) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (559) .... .+....++.++.+. ++ T Consensus 413 ~~~~----------~~~~~~~n~~~~~~------~~ 432 (432) T protein:vir:81 413 QASP----------EPASGLGNQQQDKV------SK 432 (432) T ss_pred CCCC----------CCCCCCCCcccccc------cC Confidence 1100 00000000000000 00 No 32 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=100.00 E-value=1.3e-84 Score=480.51 Aligned_cols=403 Identities=17% Similarity=0.165 Sum_probs=308.6 Q ss_pred HHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhh Q lcl|NC_012530. 22 HIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYA 101 (559) Q Consensus 22 ~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~ 101 (559) |+=++.+ .++ .++ +...+.....+.+. .++ ......+.|+++++|++||++||++||++| T Consensus 1 m~f~~~~---------~~~---~~~-~~~~~~~~~~~~g~--~~~-----~~~v~~~~al~~~~v~~~i~~ia~~ia~lp 60 (409) T protein:vir:10 1 MLFRKGF---------KNQ---SQE-ISIDDKKILEWLGI--NPS-----ETYVNGKSCLKQATVFGCIRILSDNISKLP 60 (409) T ss_pred Ccccccc---------cCc---CCC-CCCChHHHHHHhcC--CcC-----cceechhhhhccHHHHHHHHHHHHhhhhCc Confidence 1111111 110 011 00111111111111 111 112234578899999999999999999999 Q ss_pred hHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcE Q lcl|NC_012530. 102 HRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRL 181 (559) Q Consensus 102 ~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~ 181 (559) +.+++..+|. .++ ..+.+ .+|++..||++++ +++|++.++.+++++||+|++++|+..|++ T Consensus 61 ~~~~~~~~~~-~~~--------------~~~~l-~~lL~~~PN~~~t---~~~f~~~~~~~lll~Gna~~~i~r~~~G~~ 121 (409) T protein:vir:10 61 IKIYQKKDGI-KRV--------------PDHYL-EYLLKLRPNPYMS---SSDFWKCIEVQRNIYGNAYVALDFKKNGEI 121 (409) T ss_pred eEEEEecCCe-eec--------------cCchH-HHHHhhccCCCCC---HHHHHHHHHHHHhhcCCeEEEEEEcCCCcE Confidence 8776544331 111 11122 3444445665554 468999999999999999999999999999 Q ss_pred EEEEEecCceEEEEecCcccccccc-eEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHH Q lcl|NC_012530. 182 SHTRMVDPTTIYFANDEHGHRRTRG-KIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENT 260 (559) Q Consensus 182 ~~L~~l~p~~V~~~~~~~g~~~~~~-~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~ 260 (559) ++||||+|++|++..+.+|...... ..|+.....+....++++||||++.+. .++.||+|||..+..+|..+.++ T Consensus 122 ~~L~~i~~~~V~v~~~~~~~~~~~~~~~y~~~~~~g~~~~~~~~evih~r~~~----~d~~~G~s~i~~~~~~i~~~~~~ 197 (409) T protein:vir:10 122 KGLYPLKSDGMKIFVDDTGLLNSENNVWYLYTDDLGQRHKFMSDEILHFKGLT----ADGLAGLSVIELLNHLIENGKSS 197 (409) T ss_pred EEEEEEcCCceEEEEcCCccccccceEEEEEEeCCceeEEeccccEEEecCcC----CCCcccccHHHHHHHHHHHHHHH Confidence 9999999999999998887654433 344444455566789999999997542 24678999999999999999999 Q ss_pred HHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccc-cchhHHHHHH Q lcl|NC_012530. 261 ELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQ-AEDMQFQSWL 339 (559) Q Consensus 261 ~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~-~~D~qf~e~~ 339 (559) ++++.++|+||++|+|||+++ +.+++++.+++++.|++.++|..|+|+++|+++ +++|++++. +.|+||+|++ T Consensus 198 ~~~~~~~f~ng~~~~gil~~~-----~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~-g~~~~~l~~~~~d~q~~e~~ 271 (409) T protein:vir:10 198 ETYLNNFFKNGLQVKGLVQYA-----GDLNPEAEEVFKENFERMSSGLKNAHRIAMLPI-GYKFEPISQKLVDAQFLENS 271 (409) T ss_pred HHHHHHHHhccCCCcEEEEcC-----CCCCHHHHHHHHHHHHHHhccccccCCceecCC-CceEEEccCChhhHHHHHHH Confidence 999999999999999999875 358899999999999999999999999988855 599999985 7999999999 Q ss_pred HHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccC-c--ccee Q lcl|NC_012530. 340 NYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILG-D--NYML 416 (559) Q Consensus 340 ~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~-~--~~~~ 416 (559) ++++++||++|||||++||+.+.+ +++|++++.+.|++.||+||+++||++||++|++..+. . .++| T Consensus 272 ~~~~~~Ia~~fgVPp~~lg~~~~~----------~~~~~e~~~~~f~~~~l~P~~~~ie~~ln~kL~~~~~~~~~~~~~f 341 (409) T protein:vir:10 272 QLTIRQIASVFGVKMHQLNDLDRA----------THSNITEQNREFYIDTLQSILNMYELEINYKLFLISEIKNGFYSKF 341 (409) T ss_pred HHHHHHHHHHhCCCHHHcCCCCCC----------ccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCchhccCCcEEEE Confidence 999999999999999999987654 35789999999999999999999999999999976542 3 4555 Q ss_pred eecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccc Q lcl|NC_012530. 417 EFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQR 483 (559) Q Consensus 417 ~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~ 483 (559) +++.+++.|.+++++++..++. |+||+||+|+++||||+||||+++++.|++++....+......++ T Consensus 342 d~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~ggD~~~~~~n~~~~~~~~~~~~kgGe~ 409 (409) T protein:vir:10 342 NVDTILRADIKTRYESYKEAIQNGFKTPNEIRELEEDEPLEGGDVLLINGNMIPVKMAGEQYSKGGEK 409 (409) T ss_pred echhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccCccchhhccccccccCCC Confidence 5668899999999999999886 568999999999999999999999999998876543211100000 No 33 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=100.00 E-value=1.2e-84 Score=480.61 Aligned_cols=421 Identities=16% Similarity=0.170 Sum_probs=303.2 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |++|+|.+.-|..-+ -.+..+++. ..|+. ...+ .+...++ ........+.| T Consensus 7 ~g~~~~~~~~~~~~~-----------------~~~~~~~~~--~~~~~------~~~~-~~~~~~~---~~g~~v~~~~a 57 (432) T protein:vir:97 7 LGLLGQLKAMFVPPD-----------------PVDIGGGQT--FTPVN------ATAR-DLGIIIS---DTGAAVNADAI 57 (432) T ss_pred CchhhhhHhhcCCcc-----------------ccccccccc--cccCc------hhhh-hhccccc---ccCcccchHhh Confidence 999998743331111 111111111 01110 0000 0000111 11233345678 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +++++|++||++||++||++|+.+++.......+ ...+.+ ..|++..||++++ +++||+.++ T Consensus 58 ~~~~aV~~~v~~Ia~~ia~lp~~~y~~~~~g~~~--------------~~~~pl-~~lL~~~PN~~~t---~~~f~~~l~ 119 (432) T protein:vir:97 58 MRLDAVAACVKLVSQAVAAMPLMMYMRTPDGRKE--------------AVNHPL-YTLLLDGPNSTQT---AFDFWQVVV 119 (432) T ss_pred hcchHHHHHHHHHHHhhccCceEEEEecCCCccc--------------ccccHH-HHHHHhcccccCC---HHHHHHHHH Confidence 8999999999999999999998765433211100 011222 3344444555554 468999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) .+++++||+|++++|+ +|++++||||+|++|++..+.+|.. .|+....++....++++||||++.++ .++ T Consensus 120 ~~lll~Gnay~~~~~~-~g~~~~L~~l~p~~v~v~~~~~g~~-----~y~~~~~~g~~~~~~~~~iih~r~~~----~dg 189 (432) T protein:vir:97 120 TRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDTKGNT-----AYRYRRTDGQMIDIPRQQIWKIMGYS----LDG 189 (432) T ss_pred HHHhhcCCeEEEEEec-CCcEEEEEEEcCcceEEEEcCCCcE-----EEEEEecCceEEEEccccEEEecCcC----CCC Confidence 9999999999999997 5999999999999999999887743 34444455666789999999997543 346 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE 320 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g 320 (559) .+|+|||..++++|.++.++++|+.++|+||++|+|||++++ .+++++++++++.| .|..|+|+++||++ T Consensus 190 ~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~-----~l~~e~~~~~~~~~----~~~~nag~~~vl~~- 259 (432) T protein:vir:97 190 ENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDR-----FLTDDQYDSFSKKV----SGSVEAGRAPLLEG- 259 (432) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecCC-----CCCHHHHHHHHHHH----hhhhcCCCceecCC- Confidence 789999999999999999999999999999999999998864 57889888877665 56678999988855 Q ss_pred ceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_012530. 321 DAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAK 399 (559) Q Consensus 321 ~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~ 399 (559) +++|++++. +.|+||+|++++++++||++|||||++||+.+.++++ +.+|++++...|+++||.||+++||+ T Consensus 260 g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~-------~~s~~e~~~~~f~~~tl~P~~~~ie~ 332 (432) T protein:vir:97 260 GMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTS-------WGSGIESQQLGFLTMTLSPWLRRIEQ 332 (432) T ss_pred CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccc-------cchhHHHHHHHHHHHHHHHHHHHHHH Confidence 599999995 7999999999999999999999999999998776542 35789999999999999999999999 Q ss_pred HHHhhccccccCccceeee--cchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEee-ccceecccccccc Q lcl|NC_012530. 400 NLTNGIIRQILGDNYMLEF--VGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIIL-SAVYIQRLGQQEQ 475 (559) Q Consensus 400 ~ln~~L~~~~~~~~~~~~f--~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~-~~~~~~~l~~~~~ 475 (559) +||++|+++.+...++|+| +.++++|.++|++++..++. |+||+||+|+++||||++|||.++ ++.++.++..+.+ T Consensus 333 ~ln~kLl~~~e~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~glpp~~g~~~~~~~~~~~~pl~~~~~ 412 (432) T protein:vir:97 333 SIALNLLTPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNAAVLTVQSAMVPLDSIGL 412 (432) T ss_pred HHhhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCcceEeecccccchhhhcc Confidence 9999999877655555555 57899999999999999986 568999999999999999887654 6677666644322 Q ss_pred cccccccccccccccccccCCCCCCCCCCCCccccccchhc Q lcl|NC_012530. 476 IKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQN 516 (559) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (559) ... + .+....++.++.+ .+ + T Consensus 413 ~~~------~----~~~~~~~~~~~~~------~~-----~ 432 (432) T protein:vir:97 413 QAS------P----EPASGLGNQQQDK------VS-----K 432 (432) T ss_pred cCC------C----CCCCCCCCccccc------cc-----C Confidence 110 0 0000000000000 00 0 No 34 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=100.00 E-value=3e-84 Score=478.47 Aligned_cols=409 Identities=12% Similarity=0.087 Sum_probs=303.1 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+||+|+. .|......+.. ..+ .+ .+...++.++. . ..+.+.+ T Consensus 1 Mg~~~~~~-----------------------------~~~~~~~~~~~--~~~-~~---~~~~~~~~~~~-~-~~~~~~~ 43 (423) T protein:vir:81 1 MGFLQKLG-----------------------------LAPSVVATPEP--IEL-VG---PIFESLKLSTK-N-MTVEQIW 43 (423) T ss_pred CchhHhhc-----------------------------cccccccCccc--ccc-cc---ccccccccccc-h-hhHHHHH Confidence 99999982 01111111110 000 00 00011111111 1 1234456 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhc-CCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDD-NGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKL 159 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~-~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~ 159 (559) ..+|+|++||++||++||++|+.+++.. +|..- ....+.+..+|.+ ||++++ +++|++.+ T Consensus 44 ~~~~~v~~~i~~ia~~ia~lp~~~~~~~~dg~~~--------------~~~~~~~~~ll~~--PN~~~t---~~~f~~~~ 104 (423) T protein:vir:81 44 EDQPHLRTVTTFIARNVASLQLQAFERVEDGGRE--------------RVREGHLARVCKL--ANSDMT---MYDLLERT 104 (423) T ss_pred HhhhHHHHHHHHHHHhHhhCceEEEEEecCCcee--------------eeccchHHHHhhc--CCCCCC---HHHHHHHH Confidence 7899999999999999999998765432 22110 1122345555654 555554 47899999 Q ss_pred HHHHHHcCCcceEEEECCC--CcEEEEEEecCceEEEEecCcccccccceEEEEE---ecCceeeeecccceEEEecccC Q lcl|NC_012530. 160 VRDTYTYDQVNYENTYDSN--GRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQY---IDNKVRGSFTADEMGMFIRNPR 234 (559) Q Consensus 160 v~d~ll~Gna~~~i~rd~~--G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~---~~~~~~~~~~~~evi~~~~n~~ 234 (559) +.+++++||+|++|.||.. +.++.|+|+++..|++....++.. ...|... ..++....++++||||++.+.. T Consensus 105 ~~~l~l~Gna~~~i~rd~~~~~~~~~l~p~~~~~v~~~~~~~~~~---~~~Y~~~~~~~~~g~~~~~~~~evih~r~~~~ 181 (423) T protein:vir:81 105 MFDLCLYDEFFWLLPGDLGVDTPTLDIRPIPVSWVQRRAYKDGWG---SLDYIIIESGDNDGRSVKVPGERVIHRHGYNP 181 (423) T ss_pred HHHHhhcCCeEEEEEecCCcCcceEEEeecccceeeeeeccCCCc---ceEEEEEEecCCCceEEEEcccceEEecCCCC Confidence 9999999999999999853 467889999999888776655421 1122222 2344556789999999974322 Q ss_pred CCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhc-Ccccccc Q lcl|NC_012530. 235 SDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSS-GINGAYR 313 (559) Q Consensus 235 ~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~-G~~nag~ 313 (559) ....||+||+..++++|+.+.++++|+.++|+||++|+|||+++....++++++++++++++.|++.++ |..|+|+ T Consensus 182 ---~~~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e~~~~~~~~~~~~~~~~~~n~g~ 258 (423) T protein:vir:81 182 ---KTMKRGKSPVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAESRTRFMANLRASFSPKSSDVGG 258 (423) T ss_pred ---CCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHHHHHHHHHHHHHHhccccccCCc Confidence 233479999999999999999999999999999999999999987777778999999999999999985 6788999 Q ss_pred cccccCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhH Q lcl|NC_012530. 314 IPMITAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMP 392 (559) Q Consensus 314 ~~vl~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P 392 (559) ++||++ +++|+++++ ++|+||+|++++++++||++|||||++||+.+.+ +++|++++.+.|+++||.| T Consensus 259 ~~vl~~-g~~~~~l~~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~----------t~sn~e~~~~~f~~~~L~P 327 (423) T protein:vir:81 259 TLLLED-GMKAENFHTTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNA----------NYSNVREFRKALYGDNLGS 327 (423) T ss_pred ceecCC-CceEEeccCChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCC----------CcccHHHHHHHHHHHHHHH Confidence 888855 599999985 7999999999999999999999999999987654 3578999999999999999 Q ss_pred HHHHHHHHHHhhccccccC--cc--ceeeecchhhhhHHHHHHHHHHHH-c-CCCCHHHHHHHhCCCCCCCCCEeeccce Q lcl|NC_012530. 393 LLDMIAKNLTNGIIRQILG--DN--YMLEFVGGDTRSQQDKLKSVQLEL-Q-TATTVNDYREKQGLPKIAGGDIILSAVY 466 (559) Q Consensus 393 ~~~~ie~~ln~~L~~~~~~--~~--~~~~f~~l~~~d~~~~~~~~~~~~-~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~ 466 (559) |++.||++|+++|+++.+. .. ++|+++.+++.|.++|++++..++ + |+||+||+|+++||||+||||++++|.| T Consensus 328 ~~~~ie~~l~~~L~~~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~~T~NE~R~~~gl~p~~gGD~~~~p~n 407 (423) T protein:vir:81 328 WIRIIQDVMNLFLLPRVGIDNEKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAWMTINEVRAMDNLPSIDGGDDLARPLN 407 (423) T ss_pred HHHHHHHHHhhhhcCccccccCccEEEecchhhhccCHHHHHHHHHHHHhCCCCcCHHHHHHHhCCCCCCCcceeecccc Confidence 9999999999999987542 33 455556889999999999998876 3 6799999999999999999999999988 Q ss_pred ecccccccccccccccccccccccccccCCCCCCCCC Q lcl|NC_012530. 467 IQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPP 503 (559) Q Consensus 467 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (559) +.+.++..... ++.++ T Consensus 408 ~~~~~~~~~~~---------------------~~~~t 423 (423) T protein:vir:81 408 TEFGDSEDAPG---------------------EEVET 423 (423) T ss_pred cccCccCCCCC---------------------CCCCC Confidence 76543211100 00000 No 35 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=100.00 E-value=4.3e-84 Score=477.63 Aligned_cols=409 Identities=15% Similarity=0.152 Sum_probs=294.4 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) ||||.++- +++...+..+ . ..+ +...+.+.........-..| T Consensus 26 ~~lf~~~e-------------------------------~R~~~~~~~~---~--~~~--~~~~~~~~~~~~~~~~~~~a 67 (441) T protein:vir:94 26 VGIFYKNE-------------------------------KRDLQYNEDD---L--QMM--VQTLPGFQGTKLRQYKDIEA 67 (441) T ss_pred cccccccc-------------------------------cccccCCCcc---h--HHH--HHHhcccCcccccccchhhh Confidence 33332220 0000001000 0 000 00001111111111223467 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.++.+ + ++ ...+.+. +|++..||+++++ ++||+.++ T Consensus 68 l~~~~V~~cv~~Ia~~iA~lp~~~~~~--~---~~-------------~~~~~~~-~lL~~~PN~~~t~---~~f~~~~~ 125 (441) T protein:vir:94 68 IRHSDIFTAVMMIASDLARMPIRVTVN--G---QI-------------NYSDRIV-NLLNTRPNPMYNG---YIFKLVVF 125 (441) T ss_pred hccHHHHHHHHHHHHhhccCceeeecC--c---cc-------------cccchHH-HHHhcccCcCCCH---HHHHHHHH Confidence 889999999999999999999765432 1 01 0112233 3444456666654 68999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEec---CceeeeecccceEEEecccCCCc Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYID---NKVRGSFTADEMGMFIRNPRSDI 237 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~---~~~~~~~~~~evi~~~~n~~~~~ 237 (559) .+++++||||++|+|+..|+|++||||+|++|++..+.+|...+ +++..+ ......++++||||+++++ T Consensus 126 ~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~g~~~~----~~~~~~~~~~~~~~~~~~~dvih~k~~~---- 197 (441) T protein:vir:94 126 VSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYY----FHQRIDSNGNNIERNVKFEDMLDIKFYS---- 197 (441) T ss_pred HHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEE----EEEEeccCCceeEEEEccccEEEeccCC---- Confidence 99999999999999999999999999999999999988875421 222222 2345679999999998654 Q ss_pred cCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc Q lcl|NC_012530. 238 LSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI 317 (559) Q Consensus 238 ~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl 317 (559) .++.+|+|||..++++|+++.++++|+.++|+||++|+|||++++.. .++++++++|+.|++.++|..|+|+++|| T Consensus 198 ~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~----~~~e~~e~~r~~~~~~~~G~~nag~~~vl 273 (441) T protein:vir:94 198 LDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVL----DNKKARDRAREEFHKSFSGTKQAGKVVVL 273 (441) T ss_pred CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCC----CCHHHHHHHHHHHHHHhcCccccCcceec Confidence 34678999999999999999999999999999999999999987532 35788999999999999999999998887 Q ss_pred cCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHH Q lcl|NC_012530. 318 TAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDM 396 (559) Q Consensus 318 ~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ 396 (559) ++ |++|++++. ++|+||+|++++++++||++|||||++||+...+ .+.+++...| .+||+||+++ T Consensus 274 ~~-G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~------------~s~~q~~~~~-~~tl~P~~~~ 339 (441) T protein:vir:94 274 DE-SMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETAN------------MSITDANLDY-LSTLKPYITC 339 (441) T ss_pred CC-CceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCC------------ccHHHHHHHH-HHHHHHHHHH Confidence 54 599999985 7999999999999999999999999999963321 1234444445 5699999999 Q ss_pred HHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCC--Eeeccceecccccc Q lcl|NC_012530. 397 IAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGD--IILSAVYIQRLGQQ 473 (559) Q Consensus 397 ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD--~~~~~~~~~~l~~~ 473 (559) ||++||++|+++..+..++|+++.+++.|.++++++++.++. |+||+||+|+++||||+|||| +++++++++++... T Consensus 340 ie~eln~kl~~~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~~~n~~~~~~~ 419 (441) T protein:vir:94 340 VCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELV 419 (441) T ss_pred HHHHHhhhccccccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccccccc Confidence 999999999987666667777788999999999999999985 578999999999999999998 46678888777654 Q ss_pred cccccccccccccccccccccCCCCCC Q lcl|NC_012530. 474 EQIKQNEFQRQQTRLTQLESALQNPSG 500 (559) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 500 (559) .+. +..+........++++.++ T Consensus 420 ~~~-----~~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:94 420 DEY-----QMNKSRATDKKLKGGEENE 441 (441) T ss_pred ccc-----ccccccccccccCCCCCCC Confidence 321 1111111111111111111 No 36 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=100.00 E-value=4.3e-84 Score=477.63 Aligned_cols=409 Identities=15% Similarity=0.152 Sum_probs=294.4 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) ||||.++- +++...+..+ . ..+ +...+.+.........-..| T Consensus 26 ~~lf~~~e-------------------------------~R~~~~~~~~---~--~~~--~~~~~~~~~~~~~~~~~~~a 67 (441) T protein:vir:79 26 VGIFYKNE-------------------------------KRDLQYNEDD---L--QMM--VQTLPGFQGTKLRQYKDIEA 67 (441) T ss_pred cccccccc-------------------------------cccccCCCcc---h--HHH--HHHhcccCcccccccchhhh Confidence 33332220 0000001000 0 000 00001111111111223467 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.++.+ + ++ ...+.+. +|++..||+++++ ++||+.++ T Consensus 68 l~~~~V~~cv~~Ia~~iA~lp~~~~~~--~---~~-------------~~~~~~~-~lL~~~PN~~~t~---~~f~~~~~ 125 (441) T protein:vir:79 68 IRHSDIFTAVMMIASDLARMPIRVTVN--G---QI-------------NYSDRIV-NLLNTRPNPMYNG---YIFKLVVF 125 (441) T ss_pred hccHHHHHHHHHHHHhhccCceeeecC--c---cc-------------cccchHH-HHHhcccCcCCCH---HHHHHHHH Confidence 889999999999999999999765432 1 01 0112233 3444456666654 68999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEec---CceeeeecccceEEEecccCCCc Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYID---NKVRGSFTADEMGMFIRNPRSDI 237 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~---~~~~~~~~~~evi~~~~n~~~~~ 237 (559) .+++++||||++|+|+..|+|++||||+|++|++..+.+|...+ +++..+ ......++++||||+++++ T Consensus 126 ~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~g~~~~----~~~~~~~~~~~~~~~~~~~dvih~k~~~---- 197 (441) T protein:vir:79 126 VSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYY----FHQRIDSNGNNIERNVKFEDMLDIKFYS---- 197 (441) T ss_pred HHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEE----EEEEeccCCceeEEEEccccEEEeccCC---- Confidence 99999999999999999999999999999999999988875421 222222 2345679999999998654 Q ss_pred cCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc Q lcl|NC_012530. 238 LSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI 317 (559) Q Consensus 238 ~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl 317 (559) .++.+|+|||..++++|+++.++++|+.++|+||++|+|||++++.. .++++++++|+.|++.++|..|+|+++|| T Consensus 198 ~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~----~~~e~~e~~r~~~~~~~~G~~nag~~~vl 273 (441) T protein:vir:79 198 LDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVL----DNKKARDRAREEFHKSFSGTKQAGKVVVL 273 (441) T ss_pred CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCC----CCHHHHHHHHHHHHHHhcCccccCcceec Confidence 34678999999999999999999999999999999999999987532 35788999999999999999999998887 Q ss_pred cCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHH Q lcl|NC_012530. 318 TAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDM 396 (559) Q Consensus 318 ~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ 396 (559) ++ |++|++++. ++|+||+|++++++++||++|||||++||+...+ .+.+++...| .+||+||+++ T Consensus 274 ~~-G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~------------~s~~q~~~~~-~~tl~P~~~~ 339 (441) T protein:vir:79 274 DE-SMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETAN------------MSITDANLDY-LSTLKPYITC 339 (441) T ss_pred CC-CceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCC------------ccHHHHHHHH-HHHHHHHHHH Confidence 54 599999985 7999999999999999999999999999963321 1234444445 5699999999 Q ss_pred HHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCC--Eeeccceecccccc Q lcl|NC_012530. 397 IAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGD--IILSAVYIQRLGQQ 473 (559) Q Consensus 397 ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD--~~~~~~~~~~l~~~ 473 (559) ||++||++|+++..+..++|+++.+++.|.++++++++.++. |+||+||+|+++||||+|||| +++++++++++... T Consensus 340 ie~eln~kl~~~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~~~n~~~~~~~ 419 (441) T protein:vir:79 340 VCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELV 419 (441) T ss_pred HHHHHhhhccccccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccccccc Confidence 999999999987666667777788999999999999999985 578999999999999999998 46678888777654 Q ss_pred cccccccccccccccccccccCCCCCC Q lcl|NC_012530. 474 EQIKQNEFQRQQTRLTQLESALQNPSG 500 (559) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 500 (559) .+. +..+........++++.++ T Consensus 420 ~~~-----~~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:79 420 DEY-----QMNKSRATDKKLKGGEENE 441 (441) T ss_pred ccc-----ccccccccccccCCCCCCC Confidence 321 1111111111111111111 No 37 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=100.00 E-value=4.3e-84 Score=477.62 Aligned_cols=415 Identities=12% Similarity=0.106 Sum_probs=303.7 Q ss_pred HHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhh Q lcl|NC_012530. 26 KIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRAS 105 (559) Q Consensus 26 ~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~ 105 (559) =++++... +......|.. ..+-. ..+...++ ........+.|+++|+|++||++||++||++|+.++ T Consensus 1 m~~~~~~~-----~~~~~~~~~~--~~~~~---~~~g~~~s---~~~~~v~~~~al~~~~v~~cv~~ia~~ia~lp~~~~ 67 (419) T protein:vir:80 1 MFFSRQLL-----SNLGQTQPGS--GGWVS---ALLGSARS---EAGQVVTPASALSLTVLQNCVTLLAESIAQLPVELY 67 (419) T ss_pred CCcccccc-----cccCcCCCCc--chhhH---Hhhccccc---ccCcccChHHhhccHHHHHHHHHHHHhhccCceEEE Confidence 01111100 0000111110 00100 00111121 122334557789999999999999999999998765 Q ss_pred hhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEE Q lcl|NC_012530. 106 TDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTR 185 (559) Q Consensus 106 ~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~ 185 (559) +..+.... +...+.+. .|++..||++++ +++|++.++.+++++||+|++|+|+.+|+|++|| T Consensus 68 ~~~~~~~~--------------~~~~~~l~-~lL~~~PN~~~t---~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~ 129 (419) T protein:vir:80 68 ERSGDDRK--------------PATDHPLY-SILKYEPNPWQT---PFEYQEQSQVAVGLRGNSYSFIDRDQDGVIQGLY 129 (419) T ss_pred EecCCCcc--------------cccccHHH-HHHHhhcccCCC---HHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEE Confidence 54322110 01112233 344444555554 4689999999999999999999999999999999 Q ss_pred EecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012530. 186 MVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFND 265 (559) Q Consensus 186 ~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~ 265 (559) ||+|++|++..+.+|.. .| ++. + ...+++++|+|+++++ .++.||+||+.+++.+|..+.++++|+. T Consensus 130 ~i~~~~v~i~~~~~~~~-----~y-~~~-~--~~~~~~~~i~h~~~~~----~d~~~G~s~i~~~~~~i~~~~~~~~~~~ 196 (419) T protein:vir:80 130 PLDNEAVTVMKGPDLKP-----MY-RVA-G--ADPLPQRLVHHVRWMS----INGYTGLSPVLLHANAIGHAQAIQQYAG 196 (419) T ss_pred EecCceEEEEECCCceE-----EE-EEc-C--ccccchhheEEecCCC----CCCcccccHHHHHHHHHHHHHHHHHHHH Confidence 99999999998877642 22 222 2 2247889999998654 3467899999999999999999999999 Q ss_pred HHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccc-cchhHHHHHHHHHHH Q lcl|NC_012530. 266 RFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQ-AEDMQFQSWLNYLIN 344 (559) Q Consensus 266 ~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~-~~D~qf~e~~~~~~~ 344 (559) ++|+||++|+|||++++.. .+..++++++++++.|++.++|..|+|+++||++ +++|++++. +.|+||+|+++++++ T Consensus 197 ~~f~ng~~~~gil~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~-g~~~~~l~~s~~d~q~~e~~~~~~~ 274 (419) T protein:vir:80 197 KSFMNGTALSGVIERPTDA-PALKDQASVDRITDGWNAKFGGSGNAKKVALLQE-GMKFKPLSMTNVDAALIDALRLSAL 274 (419) T ss_pred HHHhcCCCccEEEEecCCC-CcccCHHHHHHHHHHHHHHhcCccccCCceecCC-CceEEeccCChhhHHHHHHHHHHHH Confidence 9999999999999987643 3456899999999999999999999999988855 599999985 799999999999999 Q ss_pred HHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeee--cchh Q lcl|NC_012530. 345 IICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEF--VGGD 422 (559) Q Consensus 345 ~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f--~~l~ 422 (559) +||++|||||++||+.+.+ +++|++++.+.|++.||.||+++||++|+++||++.++..++|+| +.++ T Consensus 275 ~Ia~~fgVPp~llg~~~~~----------t~~n~e~~~~~f~~~~l~P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~ 344 (419) T protein:vir:80 275 DIARIYKIPAHMVNELERA----------TFSNIEHQSLQFVIYTLLPWVKRHEQAKTRDLLLPSERKQYFIEYNLAGLL 344 (419) T ss_pred HHHHHhCCCHHHhcCCCCC----------CcccHHHHHHHHHHHHHHHHHHHHHHHHhhhccCccccCCeEEEEechhhh Confidence 9999999999999976654 357899999999999999999999999999999876655555555 5788 Q ss_pred hhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCC Q lcl|NC_012530. 423 TRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGT 501 (559) Q Consensus 423 ~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (559) +.|.++++++++.+++ |+||+||+|+++|+||+||||++++|+|++.++........+ + T Consensus 345 ~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~gGD~~~~~~n~~~~~~~~~~~~~~--------------------~ 404 (419) T protein:vir:80 345 RGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGGDIYLSPMNMVDASKPQPIPMGK--------------------T 404 (419) T ss_pred ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccccccccccccccCCC--------------------C Confidence 9999999999999886 568999999999999999999999998887654432110000 0 Q ss_pred CCCCCccccccchhccccccc Q lcl|NC_012530. 502 PPTLPPSSSNSFQQNQEGYTG 522 (559) Q Consensus 502 ~~~~~~~~~~~~~~~~~~~~~ 522 (559) ++ .+...++.+.... T Consensus 405 ~~------~~~~~~~~~~~l~ 419 (419) T protein:vir:80 405 EP------TKAALDEIGRILS 419 (419) T ss_pred Cc------hhhhHHHHHhhcC Confidence 00 0111111122221 No 38 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=100.00 E-value=9.8e-83 Score=470.19 Aligned_cols=409 Identities=14% Similarity=0.126 Sum_probs=304.0 Q ss_pred HHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhh Q lcl|NC_012530. 22 HIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYA 101 (559) Q Consensus 22 ~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~ 101 (559) ++=..+ ++|... .+......+......+ .+++. +.....+.|+.+++|++||++||++||++| T Consensus 1 ~~f~~~---------f~r~~~--~~~~~~~~~~~~~~~~---~~~~~---g~~v~~~~~l~~~~v~~~i~~Ia~~iA~~p 63 (413) T protein:vir:48 1 MFFSGL---------FQRKSD--APVTTPAELAEAIGLS---YDTYT---GKRISSQRAMRLTAVYSCVRVLAESVGMLP 63 (413) T ss_pred Cccchh---------hccCcc--CCccchHHHHHhhhcC---ccccc---CceechhhhhccHHHHHHHHHHHHhhhhCc Confidence 222222 222111 1111111111111111 11111 111223567889999999999999999999 Q ss_pred hHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcE Q lcl|NC_012530. 102 HRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRL 181 (559) Q Consensus 102 ~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~ 181 (559) +.+++..++...++ ..+.+..+ ++..||++++ +++|++.++.+++++||+|++++|+ .|+| T Consensus 64 ~~~~~~~~~~~~~~--------------~~~~~~~l-L~~~PN~~~t---~~~f~~~~~~~lll~Gn~~~~i~~~-~g~~ 124 (413) T protein:vir:48 64 CSLYKISGTLKTRV--------------VDERLHKL-VSAKPNGYMT---PQEFWELVIVCLCLRGNFYAYKVKA-LGEV 124 (413) T ss_pred eEEEEecCCcceee--------------cccHHHHH-HHhhccCCCC---HHHHHHHHHHHHhhcCceEEEEEeC-CCcE Confidence 87665433221111 11233333 3444555554 4689999999999999999999997 6899 Q ss_pred EEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHH Q lcl|NC_012530. 182 SHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTE 261 (559) Q Consensus 182 ~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~ 261 (559) ++||||+|++|++..+.++.. .|+....++....++++||||++.++. ++.||+||+..+..+|..+.+++ T Consensus 125 ~~L~~l~~~~v~~~~~~~~~~-----~y~~~~~~g~~~~~~~~evih~~~~~~----d~~~G~s~i~~~~~~i~~~~~~~ 195 (413) T protein:vir:48 125 VELLPIDPGCVEPKLNSQWQP-----VYQVTFPDGSVDVLTQDEIWHVRTLTL----DGLVGLNPIAYAREAISLAAATE 195 (413) T ss_pred EEEEEEcCceEEEEEcCCceE-----EEEEEecCceEEEEccccEEEecCcCC----CCcccccHHHHHHHHHHHHHHHH Confidence 999999999999988876642 455455566667899999999986542 45789999999999999999999 Q ss_pred HHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccc-cchhHHHHHHH Q lcl|NC_012530. 262 LFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQ-AEDMQFQSWLN 340 (559) Q Consensus 262 ~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~-~~D~qf~e~~~ 340 (559) +++.++|+||++|+|||++++ .+++++++++++.|++.++|..|+|+++|++ ++++|++++. ++|+||+|+++ T Consensus 196 ~~~~~~~~ng~~p~gil~~~~-----~~~~e~~~~~~~~~~~~~~g~~n~g~~~vl~-~g~~~~~l~~~~~d~q~~e~~~ 269 (413) T protein:vir:48 196 EHGARLFGNGAVTSGVLRTEQ-----KLTPDAYERLKKDFEERHTGLGNAHRPMILE-MGLDWKSMALNAEDSQFLETRK 269 (413) T ss_pred HHHHHHHhccCCcceEEEeCC-----CCCHHHHHHHHHHHHHHhcCccccCcceecC-CCceEEeccCChhHHHHHHHHH Confidence 999999999999999998863 5789999999999999999999999987775 4599999985 79999999999 Q ss_pred HHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceee--e Q lcl|NC_012530. 341 YLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLE--F 418 (559) Q Consensus 341 ~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~--f 418 (559) +++++||++|||||++||+.+.+ +++|++++...|++.||.||+++||++||++|+++.+...++|+ + T Consensus 270 ~~~~~Ia~~fgVPp~~lg~~~~~----------t~~n~e~~~~~f~~~~i~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~ 339 (413) T protein:vir:48 270 FQLEEICRLFRVPLHMVQNTDRA----------TFNNIEELGLGFINYSLVPYLTRIEQRINTGLVRESKQGKFYAKFNA 339 (413) T ss_pred HHHHHHHHHhCCCHHHhCCCcCC----------CcccHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccCCeEEEEec Confidence 99999999999999999976543 45789999999999999999999999999999987665455554 5 Q ss_pred cchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCC Q lcl|NC_012530. 419 VGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQN 497 (559) Q Consensus 419 ~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (559) +.+++.|.++++++++.+++ |+||+||+|+++|+||+||||++++|.++.+.......... ...+ T Consensus 340 ~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~--------------~~~~ 405 (413) T protein:vir:48 340 GALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGDVYLTPMNMTTSPSAGDDNGK--------------KKES 405 (413) T ss_pred hhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeccccccccccccccCCC--------------CCCC Confidence 57889999999999999986 46899999999999999999999999987765432111100 0000 Q ss_pred CCCCCCCCCcccc Q lcl|NC_012530. 498 PSGTPPTLPPSSS 510 (559) Q Consensus 498 ~~~~~~~~~~~~~ 510 (559) .+++++ .+ T Consensus 406 ~~~~~~-----~~ 413 (413) T protein:vir:48 406 GDADKT-----AS 413 (413) T ss_pred CCcccc-----CC Confidence 011111 00 No 39 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=100.00 E-value=2.6e-83 Score=473.31 Aligned_cols=409 Identities=15% Similarity=0.153 Sum_probs=295.3 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+||.++- +++...+.. +.. .+ +...+.+.........-..| T Consensus 26 ~~~f~~~e-------------------------------~r~~~~~~~---~~~--~~--~~~~~~~~~~~~~~~~~~~a 67 (441) T protein:vir:98 26 VGIFYKNE-------------------------------KRDLQYNED---DLQ--MM--VQTLPGFQGTKLRQYKDIEA 67 (441) T ss_pred cccccccc-------------------------------cccccCCCc---chH--HH--HHHhhcccccCccccchhhh Confidence 33332220 000000000 000 00 00001111110111223457 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.++.+ | ++ ...+.+. .|++..||+++++ ++|++.++ T Consensus 68 l~~~~V~acv~~Ia~~iA~lpl~~~~~--~---~~-------------~~~~~~~-~lL~~~PN~~~t~---~~f~~~l~ 125 (441) T protein:vir:98 68 IRHSDIFTAVMMIASDLARMPIRVTVN--G---QI-------------NYSDRIV-NLLNTRPNPMYNG---YIFKLVVF 125 (441) T ss_pred hccHHHHHHHHHHHHhhccCceEEecC--C---cc-------------cccchHH-HHHhcccccCCCH---HHHHHHHH Confidence 889999999999999999999766532 1 00 0112233 3444456666654 68999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEec---CceeeeecccceEEEecccCCCc Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYID---NKVRGSFTADEMGMFIRNPRSDI 237 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~---~~~~~~~~~~evi~~~~n~~~~~ 237 (559) .+++++||+|++|+|+.+|+|++||||+|++|++..+.+|...+ +++..+ ......++++||||+++++ T Consensus 126 ~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~----~~~~~~~~~~~~~~~~~~~dviHir~~~---- 197 (441) T protein:vir:98 126 VSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKLDARGRLYY----FHQRIDSNGNNIERNVKFEDMLDIKFYS---- 197 (441) T ss_pred HHHhhcCCeEEEEEEcCCCcEEEEEEEcCceeEEEECCCCcEEE----EEEEeccCcceeeEEEccccEEEeccCC---- Confidence 99999999999999999999999999999999999988875432 222222 2345679999999998653 Q ss_pred cCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc Q lcl|NC_012530. 238 LSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI 317 (559) Q Consensus 238 ~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl 317 (559) .++.+|+||+..++++|.++.++++|+.++|+||++|+|||++++.. .++++++++++.|++.++|.+|+|+++|| T Consensus 198 ~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~~----~~~e~~~~~~~~~~~~~~G~~nag~~~vl 273 (441) T protein:vir:98 198 LDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVL----DNKKARDRAREEFHKSFSGTKQAGKVVVL 273 (441) T ss_pred CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCC----CCHHHHHHHHHHHHHHhcCccccCcceec Confidence 34678999999999999999999999999999999999999987532 25788999999999999999999998887 Q ss_pred cCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHH Q lcl|NC_012530. 318 TAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDM 396 (559) Q Consensus 318 ~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ 396 (559) ++ +++|++++. ++|+||+|++++++++||++|||||++||+...+ .+.+++...|+ +||+||+++ T Consensus 274 ~~-g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~------------~s~~q~~~~y~-~tl~P~~~~ 339 (441) T protein:vir:98 274 DE-SMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETAN------------MSITDANLDYL-STLKPYITC 339 (441) T ss_pred CC-CceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCC------------ccHHHHHHHHH-HHHHHHHHH Confidence 54 599999985 7999999999999999999999999999863321 23455555565 599999999 Q ss_pred HHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCC--Eeeccceecccccc Q lcl|NC_012530. 397 IAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGD--IILSAVYIQRLGQQ 473 (559) Q Consensus 397 ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD--~~~~~~~~~~l~~~ 473 (559) ||++||++|+++..+..++|+++.+++.|.++++++++.++. |+||+||+|+++||||+|||| ++++++|+.++... T Consensus 340 ie~~ln~~L~~~~~~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGd~~~~~~~~n~~~~~~~ 419 (441) T protein:vir:98 340 VCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELV 419 (441) T ss_pred HHHHHHhhccccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccccccc Confidence 999999999987766667777788999999999999999885 568999999999999999998 56677887777554 Q ss_pred cccccccccccccccccccccCCCCCC Q lcl|NC_012530. 474 EQIKQNEFQRQQTRLTQLESALQNPSG 500 (559) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 500 (559) .+ ++..+...+....++++.++ T Consensus 420 ~~-----~q~~~~~~~~~~~kgGe~ne 441 (441) T protein:vir:98 420 DE-----YQMNKSRATDKKLKGGEENE 441 (441) T ss_pred cc-----cccccccccccccCCCCCCC Confidence 22 11111111111111111111 No 40 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=100.00 E-value=3.8e-83 Score=472.43 Aligned_cols=409 Identities=16% Similarity=0.172 Sum_probs=299.4 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+||.||-. |.... +......+ +...+.+.........-..| T Consensus 1 Mg~f~~~~~-----------------------------r~~~~--~~~~~~~~-------~~~~~~~~~~~~~~~~~~~a 42 (416) T protein:vir:45 1 MGIFYKNEK-----------------------------RDLQY--NEDDLQMM-------VQTLPGFQGTKLRQYKDIEA 42 (416) T ss_pred CCccccccc-----------------------------ccccC--CCcchhHH-------HHHhccccccCccccchhhh Confidence 999988711 00000 00000000 00011111111111223456 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.++.+ | ++ ...+.+ .+|++..||+++++ ++||+.++ T Consensus 43 l~~~~v~~cv~~Ia~~iA~~p~~~~~~--~---~~-------------~~~~~~-~~lL~~~PN~~~t~---~~f~~~~~ 100 (416) T protein:vir:45 43 IRHSDIFTAVMMIASDLARMPIRVTVN--G---QI-------------NYSDRI-VNLLNTRPNPMYNG---YIFKLVVF 100 (416) T ss_pred hcchHHHHHHHHHHHhhccCceEEecC--c---cc-------------cccchH-HHHHhcccccCCCH---HHHHHHHH Confidence 789999999999999999998765431 1 00 011222 34455556666654 68999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEec---CceeeeecccceEEEecccCCCc Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYID---NKVRGSFTADEMGMFIRNPRSDI 237 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~---~~~~~~~~~~evi~~~~n~~~~~ 237 (559) .+++++||+|++|+|+.+|+|++||||+|++|++..+.+|...+ +++..+ ......++++||||+++++ T Consensus 101 ~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~----~~~~~~~~~~~~~~~~~~~evihir~~~---- 172 (416) T protein:vir:45 101 VSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYY----FHQRIDSNGNNIERNVKFEDMLDIKFYS---- 172 (416) T ss_pred HHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEE----EEEEecCCCceeEEEEccccEEEeccCC---- Confidence 99999999999999999999999999999999999988875431 222222 2234679999999998654 Q ss_pred cCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc Q lcl|NC_012530. 238 LSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI 317 (559) Q Consensus 238 ~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl 317 (559) .++.+|+||+..++++|.++.++++|+.++|+||++|+|||++++.. .++++++++++.|++.++|..|+|+++|| T Consensus 173 ~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~----~~~~~~~~~~~~~~~~~~g~~nag~~~vl 248 (416) T protein:vir:45 173 LDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVL----DNKKARDRAREEFHKSFSGTKQAGKVVVL 248 (416) T ss_pred CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCC----CCHHHHHHHHHHHHHHhcCccccCceeec Confidence 24678999999999999999999999999999999999999987532 35788999999999999999999998877 Q ss_pred cCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHH Q lcl|NC_012530. 318 TAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDM 396 (559) Q Consensus 318 ~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ 396 (559) + ++++|++++. ++|+||+|++++++++||++|||||++||+...+ .+.+++...| .+||.|++++ T Consensus 249 ~-~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~------------~~~~~~~~~~-~~~l~P~~~~ 314 (416) T protein:vir:45 249 D-ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETAN------------MSITDANLDY-LSTLKPYITC 314 (416) T ss_pred C-CCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCC------------ccHHHHHHHH-HHHHHHHHHH Confidence 5 5599999985 7999999999999999999999999999963321 1234444444 5699999999 Q ss_pred HHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCC--Eeeccceecccccc Q lcl|NC_012530. 397 IAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGD--IILSAVYIQRLGQQ 473 (559) Q Consensus 397 ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD--~~~~~~~~~~l~~~ 473 (559) ||++||++|+++..+..++|+++.+++.|.+++++++..++. |+||+||+|+++||||+|||| +++++.++.++... T Consensus 315 ie~~ln~~l~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n~~~~~~~ 394 (416) T protein:vir:45 315 VCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELV 394 (416) T ss_pred HHHHHhhhccccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccccccc Confidence 999999999987766677888889999999999999999886 568999999999999999998 57788888877654 Q ss_pred cccccccccccccccccccccCCCCCC Q lcl|NC_012530. 474 EQIKQNEFQRQQTRLTQLESALQNPSG 500 (559) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 500 (559) .+ ++..+........++++.++ T Consensus 395 ~~-----~~~~~~~~~~~~~kgGe~n~ 416 (416) T protein:vir:45 395 DE-----YQMNKSRATDKKLKGGEENE 416 (416) T ss_pred cc-----cCcccccccccccCCCCCCC Confidence 32 11111111111111111111 No 41 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=100.00 E-value=3.8e-83 Score=472.43 Aligned_cols=409 Identities=16% Similarity=0.172 Sum_probs=299.4 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+||.||-. |.... +......+ +...+.+.........-..| T Consensus 1 Mg~f~~~~~-----------------------------r~~~~--~~~~~~~~-------~~~~~~~~~~~~~~~~~~~a 42 (416) T protein:vir:81 1 MGIFYKNEK-----------------------------RDLQY--NEDDLQMM-------VQTLPGFQGTKLRQYKDIEA 42 (416) T ss_pred CCccccccc-----------------------------ccccC--CCcchhHH-------HHHhccccccCccccchhhh Confidence 999988711 00000 00000000 00011111111111223456 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.++.+ | ++ ...+.+ .+|++..||+++++ ++||+.++ T Consensus 43 l~~~~v~~cv~~Ia~~iA~~p~~~~~~--~---~~-------------~~~~~~-~~lL~~~PN~~~t~---~~f~~~~~ 100 (416) T protein:vir:81 43 IRHSDIFTAVMMIASDLARMPIRVTVN--G---QI-------------NYSDRI-VNLLNTRPNPMYNG---YIFKLVVF 100 (416) T ss_pred hcchHHHHHHHHHHHhhccCceEEecC--c---cc-------------cccchH-HHHHhcccccCCCH---HHHHHHHH Confidence 789999999999999999998765431 1 00 011222 34455556666654 68999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEec---CceeeeecccceEEEecccCCCc Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYID---NKVRGSFTADEMGMFIRNPRSDI 237 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~---~~~~~~~~~~evi~~~~n~~~~~ 237 (559) .+++++||+|++|+|+.+|+|++||||+|++|++..+.+|...+ +++..+ ......++++||||+++++ T Consensus 101 ~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~----~~~~~~~~~~~~~~~~~~~evihir~~~---- 172 (416) T protein:vir:81 101 VSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYY----FHQRIDSNGNNIERNVKFEDMLDIKFYS---- 172 (416) T ss_pred HHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEE----EEEEecCCCceeEEEEccccEEEeccCC---- Confidence 99999999999999999999999999999999999988875431 222222 2234679999999998654 Q ss_pred cCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc Q lcl|NC_012530. 238 LSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI 317 (559) Q Consensus 238 ~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl 317 (559) .++.+|+||+..++++|.++.++++|+.++|+||++|+|||++++.. .++++++++++.|++.++|..|+|+++|| T Consensus 173 ~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~----~~~~~~~~~~~~~~~~~~g~~nag~~~vl 248 (416) T protein:vir:81 173 LDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVL----DNKKARDRAREEFHKSFSGTKQAGKVVVL 248 (416) T ss_pred CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCC----CCHHHHHHHHHHHHHHhcCccccCceeec Confidence 24678999999999999999999999999999999999999987532 35788999999999999999999998877 Q ss_pred cCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHH Q lcl|NC_012530. 318 TAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDM 396 (559) Q Consensus 318 ~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ 396 (559) + ++++|++++. ++|+||+|++++++++||++|||||++||+...+ .+.+++...| .+||.|++++ T Consensus 249 ~-~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~------------~~~~~~~~~~-~~~l~P~~~~ 314 (416) T protein:vir:81 249 D-ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETAN------------MSITDANLDY-LSTLKPYITC 314 (416) T ss_pred C-CCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCC------------ccHHHHHHHH-HHHHHHHHHH Confidence 5 5599999985 7999999999999999999999999999963321 1234444444 5699999999 Q ss_pred HHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCC--Eeeccceecccccc Q lcl|NC_012530. 397 IAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGD--IILSAVYIQRLGQQ 473 (559) Q Consensus 397 ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD--~~~~~~~~~~l~~~ 473 (559) ||++||++|+++..+..++|+++.+++.|.+++++++..++. |+||+||+|+++||||+|||| +++++.++.++... T Consensus 315 ie~~ln~~l~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n~~~~~~~ 394 (416) T protein:vir:81 315 VCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELV 394 (416) T ss_pred HHHHHhhhccccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccccccc Confidence 999999999987766677888889999999999999999886 568999999999999999998 57788888877654 Q ss_pred cccccccccccccccccccccCCCCCC Q lcl|NC_012530. 474 EQIKQNEFQRQQTRLTQLESALQNPSG 500 (559) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 500 (559) .+ ++..+........++++.++ T Consensus 395 ~~-----~~~~~~~~~~~~~kgGe~n~ 416 (416) T protein:vir:81 395 DE-----YQMNKSRATDKKLKGGEENE 416 (416) T ss_pred cc-----cCcccccccccccCCCCCCC Confidence 32 11111111111111111111 No 42 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=100.00 E-value=1.5e-82 Score=469.17 Aligned_cols=406 Identities=15% Similarity=0.158 Sum_probs=298.7 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+|||||..+-. + +++. .+..+.. .......+ .......+.| T Consensus 1 Mgl~~~~f~~~~--------------------~------~~~~-~~~~~~~--~~~~~~~~---------~g~~v~~~~a 42 (409) T protein:vir:84 1 MSLFTRIFSGPS--------------------E------ERTL-TKISGIP--SPAEDWAM---------HGDRPGANSA 42 (409) T ss_pred CchhhhhhcCCC--------------------c------cccc-ccccccc--cccchhhc---------cCcccchhhh Confidence 999999732210 0 0000 0000000 00000000 0111233567 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +++++|++||++||++||++|+.+++..++. +. ..+.+..+ ++..||++++ +++|++.++ T Consensus 43 l~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~----------~~------~~~~l~~l-L~~~PN~~~t---~~~f~~~l~ 102 (409) T protein:vir:84 43 MTLGAFYACVTLLADTVASLSIDAYRKKDNV----------RI------PVSPAPKL-LESTPYPGLT---WFDWLWMLM 102 (409) T ss_pred hccHHHHHHHHHHHHhhhhCceEEEEecCCc----------cc------ccchHHHH-hhccCCCCCC---HHHHHHHHH Confidence 8899999999999999999998665433221 10 11233333 3445665554 478999999 Q ss_pred HHHHHcCCcceEEE-ECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccC Q lcl|NC_012530. 161 RDTYTYDQVNYENT-YDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILS 239 (559) Q Consensus 161 ~d~ll~Gna~~~i~-rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~ 239 (559) .+++++||+|++|. ++..|+|++||||+|++|++....++.. .++++........++++||||+++++.. + T Consensus 103 ~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~p~~v~v~~~~~~~~-----~~~~~~~~~~g~~~~~~dvih~~~~~~~---~ 174 (409) T protein:vir:84 103 ESLAVTGNAFGYISARDEANRPTAIMPIHPDCIHVTDAKDEDG-----DWIEPVYRIDGKVVPNHRIMHIKRYPVA---G 174 (409) T ss_pred HHHhhcCCeEEEEEEECCCCceEEEEEEcCceeEEEEcCCCcc-----eEEEEEecCCceEEchhhEEEecCCCCC---c Confidence 99999999999986 6888999999999999999876554322 2222222222246899999999876533 2 Q ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccC Q lcl|NC_012530. 240 GGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITA 319 (559) Q Consensus 240 ~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~ 319 (559) ..||+||+..+..+|..+.++++|+.++|+||++|+|||+++ +.+++++++++++.|.+.+ .|+|+++|| + T Consensus 175 ~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-----~~l~~e~~~~~~~~~~~~~---~n~g~~~vl-~ 245 (409) T protein:vir:84 175 CALGMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSD-----ADLTPDQVKQTQKQWIQSH---HNRRLPAVM-S 245 (409) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecC-----CCCCHHHHHHHHHHHHHHh---ccCCCeeec-C Confidence 358999999999999999999999999999999999999875 3588999999999998875 467888777 4 Q ss_pred Cceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHH Q lcl|NC_012530. 320 EDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIA 398 (559) Q Consensus 320 g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie 398 (559) ++++|++++. +.|+||+|++++++++||++|||||++||+.+.+++ ..+|++++...|+++||.||++.|| T Consensus 246 ~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~--------~~sn~e~~~~~f~~~~l~P~~~~ie 317 (409) T protein:vir:84 246 AGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTS--------WGTGIEEQGINFVRHTLLPWLRCIE 317 (409) T ss_pred CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccc--------ccchHHHHHHHHHHHHHHHHHHHHH Confidence 5699999985 699999999999999999999999999998776554 3468999999999999999999999 Q ss_pred HHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccc Q lcl|NC_012530. 399 KNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIK 477 (559) Q Consensus 399 ~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~ 477 (559) ++|+++|.. +..++|+++.+++.|.+++++++..+++ |+||+||+|+++||||+||||++++|.|+.+++...... T Consensus 318 ~~l~~~L~~---g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~ 394 (409) T protein:vir:84 318 QALDTFLPR---GQFVKFNVDGLMRGDVTARFTAYQMGLQNGIWSVNEVRAWEDAPPIPEGDIHLQPMNFVPLGYVPPEE 394 (409) T ss_pred HHHHHhccC---CCeEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeecccccccccCCccc Confidence 999998733 4567888889999999999999999986 568999999999999999999999999998876542210 Q ss_pred cccccccccccccccccCCCCCCCCCCCCccccc Q lcl|NC_012530. 478 QNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSN 511 (559) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (559) . . +++++......++ T Consensus 395 ~-----~--------------~~~~~~~~~~gn~ 409 (409) T protein:vir:84 395 P-----A--------------QEPQPNSATEGNK 409 (409) T ss_pred c-----C--------------cCCCCCCccCCCC Confidence 0 0 0000000000000 No 43 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=100.00 E-value=1.4e-82 Score=469.25 Aligned_cols=430 Identities=13% Similarity=0.064 Sum_probs=306.7 Q ss_pred cchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHH Q lcl|NC_012530. 16 PNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRAN 95 (559) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~ 95 (559) +-..|.++.+. .+.......+ .+.. .+.+..++.+......+...|+.+|+|++||++||+ T Consensus 1 ~~~~~~~~~~~--------------~~~~~~~~~~-~~~~----~~g~~~~~~~~~~~~~~~~~a~~~~~v~~~v~~ia~ 61 (460) T protein:vir:10 1 MANRIIRALRE--------------LTGLDNKFND-AFIK----YIGQTFTKYDNNGKTYLEQGYNINPDVYSCISQMAA 61 (460) T ss_pred CchhHHHHHhh--------------hhccCCCchH-HHHH----hhccccCCCccchhhhhHHHHhcchHHHHHHHHHHH Confidence 22222222111 1110000000 0100 011111222222334556779999999999999999 Q ss_pred HHHhhhhHhhhhcCCcceeeeccccc----------ccC----hhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHH Q lcl|NC_012530. 96 QVTEYAHRASTDDNGMGYQVRLKNGD----------KPT----KEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVR 161 (559) Q Consensus 96 ~ia~~~~~~~~~~~g~~~~v~~~d~~----------~~~----~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~ 161 (559) +||++|+.+++.......+...+... +.. .......+....++.+| |++++ +++||+.++. T Consensus 62 ~iA~lp~~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~P--N~~~t---~~~f~~~~~~ 136 (460) T protein:vir:10 62 KTVAVPYTIKVVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESP--NPTQT---WADIYSLYKT 136 (460) T ss_pred hhhhCceEEEeccCCccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCC--CCCCC---HHHHHHHHHH Confidence 99999998876554432221100000 000 00011112233344444 44444 4789999999 Q ss_pred HHHHcCCcceEEEECC----CCcEEEEEEecCceEEEEecCcccccc--cceEEEEEecCceeeeecccceEEEecccCC Q lcl|NC_012530. 162 DTYTYDQVNYENTYDS----NGRLSHTRMVDPTTIYFANDEHGHRRT--RGKIYRQYIDNKVRGSFTADEMGMFIRNPRS 235 (559) Q Consensus 162 d~ll~Gna~~~i~rd~----~G~~~~L~~l~p~~V~~~~~~~g~~~~--~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~ 235 (559) +++++||+|++|+|+. .|+|++||||+|++|++..+.++.... ....++.+..++....|+++||||++++... T Consensus 137 ~lll~Gnay~~i~r~~~~~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~evih~r~~~~~ 216 (460) T protein:vir:10 137 YMRLNGNCYFYLMSPDDGINAGVPSQMYVLPAHLIKIVLKDDINLLSTDSPIKSYMLIQGDQFIEFNEDEVIHTKYANPN 216 (460) T ss_pred HHhhcCCeEEEEEecCCCccCceeEEEEEEcCceEEEEEcCCCceeeeeeeeeEEEEecCceeEEecccceEEEecCCCC Confidence 9999999999999964 478999999999999999988875432 2345566667788889999999999875433 Q ss_pred Ccc--CCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccc Q lcl|NC_012530. 236 DIL--SGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYR 313 (559) Q Consensus 236 ~~~--~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~ 313 (559) ... .+.||+||+..++.+|..+.++++|+.++|+||+.|++|++.+ +.+++++++++++.|++.++|.+|+|+ T Consensus 217 ~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~~~~i~~~~-----~~l~~e~~~~~~~~~~~~~~g~~n~g~ 291 (460) T protein:vir:10 217 FDLQGSHLYGMSPIRAILRNINSQNSTIDNNVKTMQNGGVFGFIHGGS-----TGLTQPQADSLKQRLTEMDKSPDRLSQ 291 (460) T ss_pred cccccCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceeeecC-----CCCCHHHHHHHHHHHHHHhcCccccCC Confidence 222 3468999999999999999999999999999999999998764 468999999999999999999999999 Q ss_pred cccccCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhH Q lcl|NC_012530. 314 IPMITAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMP 392 (559) Q Consensus 314 ~~vl~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P 392 (559) +++| +++++|+++++ +.|+||+|++++++++||++|||||++||+.+.++ .+++|++++...|++.||.| T Consensus 292 ~~vl-~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t--------~~~sn~e~~~~~f~~~~l~P 362 (460) T protein:vir:10 292 IAGA-SGEIAFTKISLNTDELKPFDYLKYDQKAICNALGWSDKLLNNNEGGG--------LNTGNLEEERKRVVTDNIQP 362 (460) T ss_pred ceec-CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--------CccccHHHHHHHHHHHHHHH Confidence 8777 45699999996 69999999999999999999999999999876554 35789999999999999999 Q ss_pred HHHHHHHHHHhhccccccC-ccc--eeeecch--hhhhHHHHHHHHHHHHcCCCCHHHHHHHhCCCCC--CCCCEeeccc Q lcl|NC_012530. 393 LLDMIAKNLTNGIIRQILG-DNY--MLEFVGG--DTRSQQDKLKSVQLELQTATTVNDYREKQGLPKI--AGGDIILSAV 465 (559) Q Consensus 393 ~~~~ie~~ln~~L~~~~~~-~~~--~~~f~~l--~~~d~~~~~~~~~~~~~~~~T~NE~R~~~gl~pi--~gGD~~~~~~ 465 (559) |+++||++||++|+++.+. ..+ +|+|+.+ +++|.+++++++ .+|+||+||+|+++||||+ +|||++++|+ T Consensus 363 ~~~~ie~~ln~kl~~~~~~~~~~~i~~d~~~l~~l~~d~~~~~~~~---~~g~~T~NE~R~~~g~~pi~~~~gD~~~~~~ 439 (460) T protein:vir:10 363 DLVILKQAFDKKFIKRFKGYENAVIEWDISELPEMQTDMVAMASWL---NTIPVTPNEIRIAMKYETLNQDGMDIVFMPS 439 (460) T ss_pred HHHHHHHHHHHhhcCcccccCCceEEeecchhhhHHHHHHHHHHHH---hCCCCCHHHHHHHhCCCCCCCCCCCeeeecc Confidence 9999999999999987653 234 4555555 455666655543 2478999999999999999 6899999999 Q ss_pred eeccccccccccccccccccc Q lcl|NC_012530. 466 YIQRLGQQEQIKQNEFQRQQT 486 (559) Q Consensus 466 ~~~~l~~~~~~~~~~~~~~~~ 486 (559) |+++++..........+++.. T Consensus 440 n~~~~~~~~~~~~~~~~nq~~ 460 (460) T protein:vir:10 440 NKVRIDDVSNNLIDSAFNQNQ 460 (460) T ss_pred cccchhhcccccCCCcccCCC Confidence 998886543211111000000 No 44 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=100.00 E-value=2.5e-82 Score=467.98 Aligned_cols=411 Identities=10% Similarity=0.072 Sum_probs=300.4 Q ss_pred HHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhh Q lcl|NC_012530. 22 HIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYA 101 (559) Q Consensus 22 ~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~ 101 (559) |+-++++.+.... ......+.+ ... ..+...++.. ......+.++.+|+|++||++||++||++| T Consensus 1 m~~~~~f~~~~~~------~~~~~~~~~---~~~---~~~~~~~~~~---~~~v~~~~al~~~~v~~~i~~Ia~~ia~l~ 65 (416) T protein:vir:12 1 MLLERMFEKRSGS------SDHEDGFNN---ILL---NMFGGRKTAS---GERVSESNSLVQPDIFACVNVLSDDIAKLP 65 (416) T ss_pred CccchhcccccCc------cccCccchh---HHH---HhhcCccccc---CceechhhhhccHHHHHHHHHHHHhhhhCc Confidence 3333333222110 001111000 000 0011111111 122234567889999999999999999999 Q ss_pred hHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcE Q lcl|NC_012530. 102 HRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRL 181 (559) Q Consensus 102 ~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~ 181 (559) +.+++..++...+ ...+.+...| +..||++++ +++|++.++.+++++||+|++|+|+..|+| T Consensus 66 ~~~~~~~~~~~~~--------------~~~~~l~~~l-~~~PN~~~t---~~~f~~~~v~~lll~Gna~~~i~r~~~G~~ 127 (416) T protein:vir:12 66 IHTYKRTDGGIER--------------KPEHKSAHAV-YARPNPYMT---AFTWKKLMMTHVLTWGNAYSYIQFGSHGYP 127 (416) T ss_pred eEEEEecCCcccc--------------ccccHHHHHH-HhhcccCCC---HHHHHHHHHHHHhhcCCeEEEEEECCCCcE Confidence 8765543321111 0112222222 223444454 468999999999999999999999999999 Q ss_pred EEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHH Q lcl|NC_012530. 182 SHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTE 261 (559) Q Consensus 182 ~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~ 261 (559) .+||||+|.+|++..+.++. ..|+++..++....++++||||++.++ .++.+|+||+.+++.+|..+.+++ T Consensus 128 ~~L~~l~~~~v~v~~~~~~~-----~~~~~~~~~g~~~~~~~~eiih~~~~~----~~~~~G~s~i~~~~~~i~~~~~~~ 198 (416) T protein:vir:12 128 EALFPLRPDYTNAYVHPTTG-----MLWYQTVLNGKAIELYDYEVLHFKGLS----TDGIHGKSPIGVVREHIGAQAAAT 198 (416) T ss_pred EEEEEECCcceEEEEeCCCc-----EEEEEEecCCeEEEecCccEEEecCcC----CCCcccccHHHHHHHHHHHHHHHH Confidence 99999999999998876653 345555566667789999999998543 246789999999999999999999 Q ss_pred HHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccc-cchhHHHHHHH Q lcl|NC_012530. 262 LFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQ-AEDMQFQSWLN 340 (559) Q Consensus 262 ~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~-~~D~qf~e~~~ 340 (559) +|+.++|+||++|+|||++++ .+++++++++++.|+... ++++++||++ +++|++++. +.|+||+|+++ T Consensus 199 ~~~~~~~~ng~~p~~il~~~~-----~~~~e~~~~~~~~~~~~~----~~~~~~vl~~-g~~~~~l~~~~~d~q~~e~~~ 268 (416) T protein:vir:12 199 KYNAKLYKNEATPRGILKVPA-----FLDEKPKENVRKEWKRVN----KVENIAIIDY-GLEYQSISMPLQEAQFVESMK 268 (416) T ss_pred HHHHHHHhcCCCCceEEecCC-----CCCHHHHHHHHHHHHHHh----cCCCeeecCC-CceEEEccCChhhHHHHHHHH Confidence 999999999999999998853 588999999999998654 5678877754 599999985 79999999999 Q ss_pred HHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccC---ccceee Q lcl|NC_012530. 341 YLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILG---DNYMLE 417 (559) Q Consensus 341 ~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~---~~~~~~ 417 (559) +++++||++|||||++||....+ +++|++++.+.|++.||.||+.+||++||++|+++.+. ..++|+ T Consensus 269 ~~~~~Ia~~fgVPp~~lg~~~~~----------t~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd 338 (416) T protein:vir:12 269 FNKAQISMIYKVPLHKLNELDKA----------TFSNIEHQSIEYVRNTLQPWIVNFEQELNVKLFLDHDQKSGHYVKFN 338 (416) T ss_pred HHHHHHHHHhCCCHHHhCCccCC----------CcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCchhhcCCceEEee Confidence 99999999999999999976654 46789999999999999999999999999999986543 345666 Q ss_pred ecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCC Q lcl|NC_012530. 418 FVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQ 496 (559) Q Consensus 418 f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 496 (559) ++.+++.|.+++++++..++. |+||+||+|+++||||+||||+++++.|+.+++....... .+... T Consensus 339 ~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~n~~~~~~~~~~~~-----~~~~~-------- 405 (416) T protein:vir:12 339 IDSELRGDSKTQAEYLKTLHETGVLNKDEIRELLERNPIENGDKYISSLNYVFLDFLEEYQR-----LKAGG-------- 405 (416) T ss_pred chhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccccccccccchhhc-----ccccc-------- Confidence 678899999999999998886 5689999999999999999999999999988865432111 00000 Q ss_pred CCCCCCCCCCc Q lcl|NC_012530. 497 NPSGTPPTLPP 507 (559) Q Consensus 497 ~~~~~~~~~~~ 507 (559) ...+.++.+++ T Consensus 406 ~~~gge~~~~g 416 (416) T protein:vir:12 406 AMKGGDNKNEG 416 (416) T ss_pred ccCCCCCcCCC Confidence 00011111111 No 45 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=100.00 E-value=1.2e-81 Score=464.20 Aligned_cols=401 Identities=14% Similarity=0.069 Sum_probs=294.1 Q ss_pred Ccchhhhccc-cccCCcc----hHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHH Q lcl|NC_012530. 1 MGIFDRFRTK-FYTDDPN----AFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITD 75 (559) Q Consensus 1 ~~~~~~~~~~-~~~~~~~----~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~ 75 (559) |+..|+.++. .+.+=.+ .++..-+++.+. +.+..+..+.-.. .......+.++.. .+........ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~-~~~~~~~~~g~~~--~~~~~~~~~~ 70 (409) T protein:vir:83 1 MGFWSNLFGIPSIPDLPNDNGPVDYNPGDPDMVE-------FRGPEEEPEARAL-PWIRPTAWSGYPE--SWATPSWGSA 70 (409) T ss_pred CchhhhhcccccCCCcccccccccccCCCCceee-------ccCCCcchhhhhc-ccccccccccccc--cccccCcccc Confidence 9999999642 1111111 112222222111 1111100000000 0011111211111 1111112233 Q ss_pred HHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHH Q lcl|NC_012530. 76 VLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSF 155 (559) Q Consensus 76 ~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f 155 (559) ..+.++.+++|++||++||+.||++|+.+++... +. +. ..+++++.||++++. ++| T Consensus 71 t~~~~~~~~~v~acV~~Ia~~iA~lpl~~~~~~~------------~~--------~~-~~~ll~~~PN~~~t~---~~f 126 (409) T protein:vir:83 71 QDKLRTLIDVAWACIDLNASVLSSMPIYRMRNGR------------II--------DS-VAWMSNPDPEVYTSW---QEF 126 (409) T ss_pred chhhHhhhHHHHHHHHHHHHhhccCceEEeeCCc------------cc--------cc-hhhhcccCCCCCCCH---HHH Confidence 4567889999999999999999999976543211 00 11 134567778877764 678 Q ss_pred HHHHHHHHHHcCCcceEE-EECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccC Q lcl|NC_012530. 156 LRKLVRDTYTYDQVNYEN-TYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPR 234 (559) Q Consensus 156 ~~~~v~d~ll~Gna~~~i-~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~ 234 (559) +++++.++++ ||+|+++ .|+.+|+|++|+||+|++|++..+.+|.. +|++.. .+.++||||+++++. T Consensus 127 ~~~l~~~lll-Gnay~~~i~r~~~G~~~~L~pl~p~~v~v~~~~~g~~------~y~~~~-----~~~~~eiiHir~~~~ 194 (409) T protein:vir:83 127 AKQLFWDFQL-GEAFVLPMAHGSDGYPIRFRVVPPWLVNVELKKGARR------EYRIGG-----LNVTDEILHIRYQGN 194 (409) T ss_pred HHHHHHHHhh-CCcEEEEEEECCCCcEEEEEEECCcceEEEEcCCceE------EEEEcc-----ccCccceEEeCCCCC Confidence 9999999887 9999975 58999999999999999999988877632 233321 234689999986533 Q ss_pred CCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccc Q lcl|NC_012530. 235 SDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRI 314 (559) Q Consensus 235 ~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~ 314 (559) .++.||+|||+.++.+|.++.++++|+.++|+||++|+|||+++ +.++++++++++++|++.++| |+|++ T Consensus 195 ---~~~~~G~spi~~~~~~i~~~~a~~~~~~~~f~nga~p~gil~~~-----~~ls~e~~~~~~~~~~~~~~~--nag~~ 264 (409) T protein:vir:83 195 ---TADAHGHGPLESAAPRQVVIGLLQKYVQNLAETGGVPLYWLGVE-----RRLSETEAVDLMDRWIESRSK--YAGHP 264 (409) T ss_pred ---CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEeecC-----CCCCHHHHHHHHHHHHHhhCC--ccCcc Confidence 34568999999999999999999999999999999999999875 458999999999999998876 78998 Q ss_pred ccccCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHH Q lcl|NC_012530. 315 PMITAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPL 393 (559) Q Consensus 315 ~vl~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~ 393 (559) +||++|...+++++. ++|+||+|++++++++||++|||||++||+.+.++. .+++|++++...|+++||.|| T Consensus 265 ~il~~g~~~~~~~~~s~~d~q~le~r~~~~~eIa~~fgVPp~llg~~~~~~~-------~tysn~eq~~~~f~~~tL~P~ 337 (409) T protein:vir:83 265 ALVTGGATLNQAKSMSAQDLSLMELTQFNEARIAILLGVPPFLVGLPGATGS-------LTYSNIEQLFSFHDRSSLRPK 337 (409) T ss_pred ceecCCcccccccCCCHHHHHHHHHHHhhHHHHHHHhCCCHHHccCCCCccc-------cccccHHHHHHHHHHHHHHHH Confidence 888666444467874 799999999999999999999999999998765432 347899999999999999999 Q ss_pred HHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceec Q lcl|NC_012530. 394 LDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQ 468 (559) Q Consensus 394 ~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~ 468 (559) +++||++||++|++.. ..++|+++.++++|.++|+++++.+++ |+||+||+|+++||||++|||.+-... + T Consensus 338 ~~~ie~~l~~~Ll~~~--~~~~f~~~~llr~d~~~r~~~~~~~~~~G~lT~NE~R~~~glpp~~ggd~l~~~g--v 409 (409) T protein:vir:83 338 ATAVMAALDRWALPSP--QHLELNRDDYTRPSLVERATAYKIMIEAGVMEPNEARAMERLHSEAAAVRLSGGG--V 409 (409) T ss_pred HHHHHHHHHHhhCCCC--cEEEeehhhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccCCCC--C Confidence 9999999999999753 457788889999999999999999986 578999999999999999999863211 1 No 46 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=100.00 E-value=8.4e-81 Score=459.60 Aligned_cols=443 Identities=13% Similarity=0.075 Sum_probs=303.2 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |++|||+|..+. ..+...+...... .......-.|..-....|-+... . .+. .+++.+........+.| T Consensus 1 M~~~~~l~~~~~-~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~----~--~g~--~~~~~~~~g~~v~~~~a 69 (466) T protein:vir:81 1 MRLIDRLLSTRG-AAPRMSIDDYAQM--LNEFAFNGIGYGFGGGVPRIQQT----L--AGP--STELAPDTFVGLATQAY 69 (466) T ss_pred CchhHHHhhccC-cccccchhhhhhh--hhhhhccccccccccccHHHHHh----h--ccc--cccccCccccccchhhh Confidence 999999975443 2222111111111 00001011111111112111110 0 001 11222222333456778 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.+++..++...++ ..+.+..++.+ ||++++ +++|++.++ T Consensus 70 ~~~~~v~~~i~~Ia~~ia~lp~~~~~~~~~~~~~~--------------~~~~~~~L~~~--PN~~~t---~~~f~~~l~ 130 (466) T protein:vir:81 70 QANGPVFACMLVRQLVFSSVRFRWQRLRDGKPSDT--------------FGSRDLQILET--PWKGGT---TQDMLSRMI 130 (466) T ss_pred hccHHHHHHHHHHHHhhccCceEEEEecCCceeec--------------cccHHHHHhhC--CCCCCC---HHHHHHHHH Confidence 99999999999999999999988766544321111 11223344444 555554 468999999 Q ss_pred HHHHHcCCcceEEEECCC--------CcEEEEEEecCceEEEEecCcccccccceEEEEEecC----ceeeeecccceEE Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSN--------GRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDN----KVRGSFTADEMGM 228 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~--------G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~----~~~~~~~~~evi~ 228 (559) .+++++||+|++|+|+.. |.+++|+||+|.+|++..+.+++... .|.+...+ .....++++|||| T Consensus 131 ~~lll~Gnay~~i~r~~~g~l~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~---~y~~~~~~~~~~~~~~~~~~~dviH 207 (466) T protein:vir:81 131 QDADLAGNSYWTIVDGEFVRMRPDWVDVVVEERMVRGGRGELGGGQLGWRKV---GYLYTEGGRQSGNESVGFLAEDVVH 207 (466) T ss_pred HHHHhcCCeEEEEEecCccccccccCcceeEEEEecCcceEEEEcCCCceEE---EEEEEecCcccccceeeeccccEEE Confidence 999999999999999765 55899999999999999988875432 23322222 2345799999999 Q ss_pred EecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCc Q lcl|NC_012530. 229 FIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGI 308 (559) Q Consensus 229 ~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~ 308 (559) ++.++. ..++.||+||+.+++++|.++.++++|+.++|+||++|+|||+++ +.+++++++++++.|++.++|. T Consensus 208 ir~~~~--~~d~~~G~s~i~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~-----~~l~~e~~~~~~~~~~~~~~g~ 280 (466) T protein:vir:81 208 FAPIPD--PLASYRGMSWLTPILREIRADQAMSKHQAKFFDNGATVNLVIKHN-----PMADPAAVKKWADEVNSKHAGV 280 (466) T ss_pred EcCCCC--cccccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecC-----CCCCHHHHHHHHHHHHHHhcCc Confidence 986432 235678999999999999999999999999999999999999875 4588999999999999999999 Q ss_pred ccccccccccCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHH Q lcl|NC_012530. 309 NGAYRIPMITAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKS 387 (559) Q Consensus 309 ~nag~~~vl~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~ 387 (559) +|+|+++||+ ++++|++++. ++|+||+|++++++++||++|||||++||+.+... +.+++|++++.+.|++ T Consensus 281 ~n~g~~~vl~-~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lG~~~~~~-------~st~sn~eq~~~~f~~ 352 (466) T protein:vir:81 281 DNAWKNLNLY-PGADADVVGSNLQEIDFKNVRGGGETRIAAAAGVPPVIVGLSEGLA-------AATYSNYGQARRRLAD 352 (466) T ss_pred cccccceEcC-CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccccCCC-------ccccccHHHHHHHHHH Confidence 9999987775 4599999995 79999999999999999999999999999865422 2357899999999999 Q ss_pred HHhhHHHHHHHHHHHhhccccccCccceeeec--chhhhhHHHHHHH-------HHHHHcCCCCHHHHHHHhCCCCCCCC Q lcl|NC_012530. 388 KGLMPLLDMIAKNLTNGIIRQILGDNYMLEFV--GGDTRSQQDKLKS-------VQLELQTATTVNDYREKQGLPKIAGG 458 (559) Q Consensus 388 ~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~--~l~~~d~~~~~~~-------~~~~~~~~~T~NE~R~~~gl~pi~gG 458 (559) +||.||+++||++|+++|++..+...++|+|+ .++++|.++++++ +..++.+++|+||+|+ ++++| T Consensus 353 ~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~llr~d~~~r~~~~~~~~~~~~~~~~~g~t~nE~r~-----~~~~g 427 (466) T protein:vir:81 353 GTAHPLWQNLSGCIGHVMPDMGPDVRLWYDADDVPFLREDEKDAADIQKVRAETINTLITAGYEPESVVA-----AVNSG 427 (466) T ss_pred HHHHHHHHHHHHHHHhhcCCcccCcceEEEecchhhhccCHHHHHHHHHHHHHHHHHHHHcCCChhhccc-----cccCC Confidence 99999999999999999998766666666665 7889999988876 4456667789999995 56788 Q ss_pred CEee-ccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhccccccccccccccc Q lcl|NC_012530. 459 DIIL-SAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGK 530 (559) Q Consensus 459 D~~~-~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 530 (559) |..+ .+.++.++...... +.... ..+++... + ..+||. T Consensus 428 d~~~~~~~~~~~~~~~~~~-----~~~~~------------~~~~~~~~-------G----------g~~ngn 466 (466) T protein:vir:81 428 DLRLLKHTGLTSVQLLPPG-----VSASA------------SSDTPTSG-------G----------ADDNGN 466 (466) T ss_pred ccccccCCCcchhhhcccc-----ccccc------------CCCCcccC-------C----------CCcCCC Confidence 8654 33333322211100 00000 00000000 0 001111 No 47 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=100.00 E-value=1.7e-80 Score=457.86 Aligned_cols=401 Identities=10% Similarity=0.060 Sum_probs=293.0 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |||++|+...+.+.-+ ++ +..+..++.. |.+ ....++ ..+.+ T Consensus 4 ~~~~~~~k~~~~~~~~---------------------~~------~~~~~~~~~~--~~~-------~~~~~v--~~~~a 45 (409) T protein:vir:96 4 ENIVTRIKKKLIDNWI---------------------DQ------SASKLYDFSP--WKN-------KSFWGV--INNTL 45 (409) T ss_pred ccchhhhhhHHhhhhh---------------------cc------cccccccccc--ccC-------cccccc--chhhH Confidence 7777777222211111 11 1111111111 100 011111 23457 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.+++.... .. +.+.. |++..||+++++ ++|++.++ T Consensus 46 ~~~~~V~~ci~~ia~~ia~lp~~~~~~~~~-------------~~------~~l~~-lL~~~PN~~~t~---~~f~~~~~ 102 (409) T protein:vir:96 46 ETNETIFSAITKLSNSMASLPLKMYEDYKV-------------VN------TEVSD-LLTVSPNNSLSS---FDFINQIE 102 (409) T ss_pred hhhHHHHHHHHHHHHhhhhCceEEeecccc-------------cc------hhHHH-HHhhhcccCCCH---HHHHHHHH Confidence 889999999999999999999776543211 11 22333 344445656554 67999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) .+++++||+|++|+|+..|+|++||||+|++|++..+.++.. ..|.....++....|+++||||++.++ ..++ T Consensus 103 ~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~----~~y~~~~~~g~~~~~~~~evih~r~~~---~~~~ 175 (409) T protein:vir:96 103 TIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRE----LYYSIHAATGNKLIVHNMDMLHFKHIV---ASNM 175 (409) T ss_pred HHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcE----EEEEEEcCCceEEEEccccEEEeCCCC---CCCc Confidence 999999999999999999999999999999999998776543 234444444556689999999998643 2356 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE 320 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g 320 (559) .||+||+..++.++..+.++++++ ++.++..++++++. ++.+++++++++++.|++.++ |+|+++++ ++ T Consensus 176 ~~G~s~l~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~-----~~~l~~e~~~~~~~~~~~~~~---n~g~~~vl-~~ 244 (409) T protein:vir:96 176 VQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKY-----GSNVSTEKRQQVLEDFKQYYE---ENGGILFQ-EP 244 (409) T ss_pred cccccHHHHHHHHHHHHHHHHHHH--HHhcCCCceeEEec-----CCCCCHHHHHHHHHHHHHHhh---cCCCeeec-CC Confidence 789999999999999999998874 45555555556554 356899999999999998875 56787777 45 Q ss_pred ceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_012530. 321 DAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAK 399 (559) Q Consensus 321 ~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~ 399 (559) +++|++++. +.|+||+|++++++++||++|||||++||....+ +++|++++.+.|+++||.||+++||+ T Consensus 245 g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~----------~~s~~e~~~~~f~~~~l~P~~~~ie~ 314 (409) T protein:vir:96 245 GVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNT----------NFAKNEELNRFYLQHTLLPIVKQYEE 314 (409) T ss_pred CceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC----------CcccHHHHHHHHHHHHHHHHHHHHHH Confidence 699999985 7999999999999999999999999999975543 46789999999999999999999999 Q ss_pred HHHhhccccccC---ccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccc Q lcl|NC_012530. 400 NLTNGIIRQILG---DNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQ 475 (559) Q Consensus 400 ~ln~~L~~~~~~---~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~ 475 (559) +||++||++.+. ..++|+.+.+++.|.+++++++..++. |+||+||+|+++|+||+||||+++++.|+++++.... T Consensus 315 ~l~~~Ll~~~~~~~g~~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~NE~R~~~g~~pi~ggD~~~~~~n~~~~~~~~~ 394 (409) T protein:vir:96 315 EFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPIDTPLE 394 (409) T ss_pred HHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcceeeecccccccccchh Confidence 999999987653 334555568899999999999999986 5689999999999999999999999999988754321 Q ss_pred cccccccccccccccccccCCCCCCCCC Q lcl|NC_012530. 476 IKQNEFQRQQTRLTQLESALQNPSGTPP 503 (559) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (559) ..+ ...+++.++.+. T Consensus 395 ~~~-------------~~~gG~~n~~e~ 409 (409) T protein:vir:96 395 LRK-------------SLKGGDKNVNES 409 (409) T ss_pred hcc-------------cccCCCCCcCCC Confidence 110 001111111111 No 48 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=100.00 E-value=1.9e-80 Score=457.58 Aligned_cols=410 Identities=11% Similarity=0.041 Sum_probs=286.9 Q ss_pred HHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHh Q lcl|NC_012530. 25 SKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRA 104 (559) Q Consensus 25 ~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~ 104 (559) ++.+ -+...... ..+ ..+.......|+.. ..+.. ..|+++++||+||++||+.||++|+.+ T Consensus 1 m~~~---------~~~~~~~~-----~~~-~~~~~~~~~~~~~~--g~~~~--~~Al~~~~V~~cv~~ia~~iA~lp~~~ 61 (417) T protein:vir:38 1 MKLF---------RGLATEVD-----PHW-ADHLLDSGVIPSFR--GGYLG--ISALRNSDVLTAVSIVSGDVSRFPLVI 61 (417) T ss_pred Cccc---------cccccCCC-----ccc-hhhhcccccccccC--Cceec--hhhcccHHHHHHHHHHHHhhccCeeEE Confidence 1111 00000000 000 00000111122222 11211 357889999999999999999998765 Q ss_pred hhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCC-CcEEE Q lcl|NC_012530. 105 STDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSN-GRLSH 183 (559) Q Consensus 105 ~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~-G~~~~ 183 (559) ++.... +.. ..+.+. +|++..||++++ +++|++.++.+++++||+|++|+|+.. |.|.+ T Consensus 62 ~~~~~~-----------~~~-----~~~~~~-~lL~~~PN~~~t---~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~ 121 (417) T protein:vir:38 62 TDSSTD-----------EVI-----DLANIE-YLMNTKVNKRLS---AYQWKFPMMVNAILTGNAYSRIVRDPITNEPAM 121 (417) T ss_pred EEcCCc-----------cee-----ccchHH-HHHhcccCcCCC---HHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEE Confidence 442211 110 112233 344455666665 468999999999999999999999864 67999 Q ss_pred EEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_012530. 184 TRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELF 263 (559) Q Consensus 184 L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~ 263 (559) |+||+|++|++.....|... +++...++.....++++||||++.++ .++.+|+||+.+++.+|.++.++++| T Consensus 122 l~~l~p~~v~v~~~~~~~~~----y~~~~~~~~~~~~~~~~dviH~r~~~----~d~~~G~s~l~~~~~~i~~~~~~~~~ 193 (417) T protein:vir:38 122 FEFYAPSQTQVDTSDPDNII----YRFTPYNSSMQKVCGFEDVIHWKFFS----YDTIMGRSPLLSLGDEIGLQESGVST 193 (417) T ss_pred EEEeCCceEEEEEcCCCeEE----EEEEEcCCcEEEEecCcceEEecCCC----CCCccccCHHHHHHHHHHHHHHHHHH Confidence 99999999999887665331 12333445566779999999998643 34678999999999999999999999 Q ss_pred HHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccc-cchhHHHHHHHHH Q lcl|NC_012530. 264 NDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQ-AEDMQFQSWLNYL 342 (559) Q Consensus 264 ~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~-~~D~qf~e~~~~~ 342 (559) +.+||+||++|+|||+++ +.+++++++++++.|++.++|. |+|+++||+ ++++|++++. +.|+||+|+++++ T Consensus 194 ~~~~f~ng~~p~~il~~~-----~~l~~e~~~~~~~~~~~~~~g~-n~g~~~vl~-~g~~~~~l~~~~~d~q~le~~~~~ 266 (417) T protein:vir:38 194 LQKFFKSGLKGSIIKAKE-----SRLSAEARQKIREDFERAQAGA-DAGSPIIVD-ATMDYQPLEVDTNVLNLINSNNYS 266 (417) T ss_pred HHHHHhccCCCcEEEEeC-----CCCCHHHHHHHHHHHHHHhccc-ccCCceecc-CCceEEEccCCHHHHHHHHHHHhh Confidence 999999999999999875 4588999999999999999885 899988875 4599999985 7999999999999 Q ss_pred HHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecc-- Q lcl|NC_012530. 343 INIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVG-- 420 (559) Q Consensus 343 ~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~-- 420 (559) +++||++|||||++||. . .+++|++++...|+++||.||+++||++|+++||++.+...++|+|+. T Consensus 267 ~~~Ia~~fgVPp~~lg~--~----------~~~s~~e~~~~~~~~~tl~P~~~~ie~~l~~~Ll~~~~~~~~~~~fd~~~ 334 (417) T protein:vir:38 267 TAQIAKALRVPAYRLAQ--N----------SPNQSVKQLADDYIRNDLPFYFEPITSEFELKLLDDAQRHQYCIGFDTKS 334 (417) T ss_pred HHHHHHHhCCCHHHhCC--C----------CcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhcChhhcccceEEechhh Confidence 99999999999999983 1 135789999999999999999999999999999987766667777763 Q ss_pred hhhhhHHHHHHHHHHHH-cCCCCHHHHHHHhCCCCCCCC--CEeeccceecccccccccccccccccccccccccccCCC Q lcl|NC_012530. 421 GDTRSQQDKLKSVQLEL-QTATTVNDYREKQGLPKIAGG--DIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQN 497 (559) Q Consensus 421 l~~~d~~~~~~~~~~~~-~~~~T~NE~R~~~gl~pi~gG--D~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (559) +++.+ ++ .++.++ .|+||+||+|+++||||+||| |.+++++|+.+++...+...... ...+++ T Consensus 335 l~~~~---~~-~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~d~~~~~~n~~~~d~~~~~~~~~~---------~~~kgg- 400 (417) T protein:vir:38 335 VNGLP---IA-DVNTAVNGGLWTGNEGRAELGKKPLKDPNMDRIQSTLNTVFLDQKEAYQAEHA---------AELKGG- 400 (417) T ss_pred hhHHH---HH-HHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeecccccccccccccccccc---------cccCCC- Confidence 33222 22 244455 467899999999999999987 78999999888875433211000 000000 Q ss_pred CCCCCCCCCccccccchhccccccccccccccccccccc Q lcl|NC_012530. 498 PSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGV 536 (559) Q Consensus 498 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 536 (559) ++. .+ .+.++.++.+++ T Consensus 401 ----~~~-----~~-------------~~~~~~~~~~~~ 417 (417) T protein:vir:38 401 ----DTN-----AK-------------GNQNGSGTNANS 417 (417) T ss_pred ----CCC-----CC-------------CCCcCCCCcCCC Confidence 000 00 000000111111 No 49 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=100.00 E-value=1.2e-80 Score=458.82 Aligned_cols=458 Identities=12% Similarity=0.057 Sum_probs=297.3 Q ss_pred cccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhH Q lcl|NC_012530. 48 DGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQ 127 (559) Q Consensus 48 ~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~ 127 (559) ..+.+...+.+..+ ++... .....+.|+++++||+||++||++||++|+.++.. ++ . T Consensus 1 ~~~~~~~~g~~~~~----~~~~~--~~~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~-~~----------~------ 57 (723) T protein:vir:94 1 MTTFPSGAGGWNAW----SADSV--FGNGAKGWSNSAVAYRCISMLANNAASVDLVVRGP-DG----------E------ 57 (723) T ss_pred CcccccCCCccccc----ccccc--ccccHHHHhhhHHHHHHHHHHHHhhccceeEEEcC-CC----------c------ Confidence 11222222222211 11111 12234578899999999999999999999765432 11 0 Q ss_pred HHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECC---CCcEEEEEEecCceEEEEecCcccccc Q lcl|NC_012530. 128 QKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDS---NGRLSHTRMVDPTTIYFANDEHGHRRT 204 (559) Q Consensus 128 ~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~---~G~~~~L~~l~p~~V~~~~~~~g~~~~ 204 (559) ....+.+..+|. ..||++++. ++|++.++.+++++||+|++|+|+. .|.|++||||+|..+.+.....+.... T Consensus 58 ~~~~~~l~~lL~-~~PN~~~t~---~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~~~~~ 133 (723) T protein:vir:94 58 LDELHPLSQLWN-VMPNRAMPA---QVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAADAVP 133 (723) T ss_pred cchhhHHHHHHh-hCCCCCCCH---HHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCCccce Confidence 011233444443 345556554 6899999999999999999999764 589999999999888777665544332 Q ss_pred c--ceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecC Q lcl|NC_012530. 205 R--GKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKP 282 (559) Q Consensus 205 ~--~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~ 282 (559) . ...|.....++....++++||||++.+. ..++.||+|||..++.+|..+.++++|+.+||+||++|+|||+++ T Consensus 134 ~~~~~~y~~~~~~G~~~~~~~~dIiHir~~~---~~dg~~G~Spi~~a~~~i~~~~aa~~~~~~~f~NG~~p~giL~~~- 209 (723) T protein:vir:94 134 QAQIIGYVIERTDGVRVPVLADEMLWLRFSD---PYDPLAVMAPWKAARAAVDADFYAATWQRQSFKNGARPGGVVNLG- 209 (723) T ss_pred eeeeeEEEEEecCceeEEecccceEEecCCC---CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcC- Confidence 2 1223333345556789999999998542 345678999999999999999999999999999999999999863 Q ss_pred ccCCccCCHHHHHHHHHHHHHHhcCcccccccccccC---------Cceeeeeccc-cchhHHHHHHHHHHHHHHHHhCC Q lcl|NC_012530. 283 SPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITA---------EDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAM 352 (559) Q Consensus 283 ~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~---------g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgV 352 (559) .+++++++++++.|++.++|..|+|+++||++ .|++|++++. ++|+||+|++++++++||++||| T Consensus 210 -----~l~~e~~~~~~~~~~~~~~G~~Nagk~~vL~g~~~~~~vl~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgV 284 (723) T protein:vir:94 210 -----DMDEQTFTKTVAAFRSQVEGVQNAGRHLLIAGQGSDGGAAGKGATFTSLSMSPAEMDYINSRMHSAEEVMLAFGI 284 (723) T ss_pred -----CCCHHHHHHHHHHHHHHhhchhhcCcceeecccccccccccCCceEEEccCCHHHHHHHHHHHHhHHHHHHHhCC Confidence 47899999999999999999999999998863 4689999985 79999999999999999999999 Q ss_pred CHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeec--chhhhhHHHHH Q lcl|NC_012530. 353 DPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFV--GGDTRSQQDKL 430 (559) Q Consensus 353 Pp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~--~l~~~d~~~~~ 430 (559) ||++||.. .+++|.+++...|+++||+||+++||++||++|++.. +..++|+|+ .++++|.++++ T Consensus 285 Pp~~i~~~------------st~sN~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~-g~~~~~~f~~~~lLr~D~~~r~ 351 (723) T protein:vir:94 285 RKDALLGG------------STYENQAEAKAAVWTETLIPQMEVMASITDLQLLPDI-GWTVEWDFNSVPALQEDLEAQA 351 (723) T ss_pred ChhHcCCC------------CCcccHHHHHHHHHHHHHHHHHHHHHHHHhHhhcccc-cCceEEeecchhhhhcCHHHHH Confidence 99999631 2357899999999999999999999999999999754 445777776 46899999999 Q ss_pred HHHHHHHc-CCCCHHHHHHHhCCCCCCCCCE--eeccceecccccccccccccccccccccccccccCCCCCCCCCCCCc Q lcl|NC_012530. 431 KSVQLELQ-TATTVNDYREKQGLPKIAGGDI--ILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPP 507 (559) Q Consensus 431 ~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~--~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (559) +++..+++ |+||+||+|+++||||+||||. .+.|.+.. +......... .++...+.-...... ..+.+. +..| T Consensus 352 ~~~~~~v~~G~~T~NE~R~~lglpPi~gGd~~~~~~p~~~~-~a~~~~~~p~-~~e~~~~~~~~~~~~-~~~~p~-~~~~ 427 (723) T protein:vir:94 352 GRNQGYLVNDVLMVDEVRATIGLDPLPGGIGQMTLTPYRAQ-FAPAPAPAPA-VEEGAARMLALLERV-AADRPL-PELP 427 (723) T ss_pred HHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccceecccccc-ccCCCCCCcc-chhhhHhhhhhcccc-ccccCc-CCCC Confidence 99999886 5689999999999999999983 34554332 1111111000 000000000000000 000110 0111 Q ss_pred cccccchhcccccccccccc---ccccccccccccccccccchhhhhhccCCCCC Q lcl|NC_012530. 508 SSSNSFQQNQEGYTGKDAKP---SGKDNQQGVGKDGQLKNKKNTNSYKQGGSSKK 559 (559) Q Consensus 508 ~~~~~~~~~~~~~~~~~~~~---~g~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 559 (559) ....+...++++....+.-. ++-. ..-..+=+++-..-.-.+.+--++++. T Consensus 428 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 (723) T protein:vir:94 428 VRATTVLHHDPGPDPQQTLYERLEALL-QPLLVELGRRQAAVTLREFDLLMRGER 481 (723) T ss_pred CCCCCCCCCCcccCCchhHHHHHHHHH-hhhHHHHHHHHHHHHHHhhchhhcchH Confidence 11111111211111110000 0000 000000000000000001111122222 No 50 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=100.00 E-value=5.9e-80 Score=454.96 Aligned_cols=405 Identities=10% Similarity=0.050 Sum_probs=295.9 Q ss_pred Ccchhh--hccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHH Q lcl|NC_012530. 1 MGIFDR--FRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLR 78 (559) Q Consensus 1 ~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~ 78 (559) |+++.| +. .++.+++..+... .+..+...+ ..+ .++.......+ T Consensus 1 m~~~~~~~~~------------~~~~~~~~~~~~~-----------~~~~~~~~~--~~~---------~~~~~~~v~~~ 46 (412) T protein:vir:26 1 MNVIAKENIV------------TRIKKKLIDNWID-----------QSTSKLYDF--SPW---------KNRSFWGVINN 46 (412) T ss_pred Cccchhhhhh------------hhhhhhHhhhhhc-----------ccccccccc--ccc---------CCccccccchh Confidence 777743 31 2233333322221 111100011 000 00111112345 Q ss_pred HHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHH Q lcl|NC_012530. 79 QYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRK 158 (559) Q Consensus 79 ~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~ 158 (559) .++.+|+|++||++||++||++|+++++..+. .. +.+ .+|++..||++++ +++||+. T Consensus 47 ~a~~~~~v~~~i~~ia~~iA~lp~~~~~~~~~-------------~~------~~~-~~lL~~~PN~~~t---~~~f~~~ 103 (412) T protein:vir:26 47 TLETNETIFSAITKLSNSMASLPLKMYEDYKV-------------VN------TEV-SDLLTVSPNNSLS---SFDFINQ 103 (412) T ss_pred hhhccHHHHHHHHHHHHhHhhCceeEeecccc-------------cc------chH-HHHHHhhcccCCC---HHHHHHH Confidence 67899999999999999999999776543221 11 122 2344445665655 4689999 Q ss_pred HHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCcc Q lcl|NC_012530. 159 LVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDIL 238 (559) Q Consensus 159 ~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~ 238 (559) ++.+++++||+|++|+|+..|++++||||+|++|++..+.+++. ..|.....++....|+++||||++.++ .. T Consensus 104 ~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~----~~y~~~~~~g~~~~~~~~evih~~~~~---~~ 176 (412) T protein:vir:26 104 IETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRE----LYYSIHAATGNKLIVHNMDMLHFKHIV---AS 176 (412) T ss_pred HHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcE----EEEEEEcCCceEEEEccccEEEeCCCC---CC Confidence 99999999999999999999999999999999999998876643 234444445556679999999998643 23 Q ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc Q lcl|NC_012530. 239 SGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT 318 (559) Q Consensus 239 ~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~ 318 (559) ++.||+||+.+++.+|.++.++++++ ++.++..++++++.+ +.+++++++++++.|++.++ ++|+++|++ T Consensus 177 ~~~~G~s~i~~~~~~i~~~~a~~~~~--~~~~~~~~~~i~~~~-----~~l~~e~~~~~~~~~~~~~~---~~g~~~vl~ 246 (412) T protein:vir:26 177 NMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYG-----SNVGKEKRQQVLEDFKQYYE---ENGGILFQE 246 (412) T ss_pred CCcccccHHHHHHHHHHHHHHHHHHH--HHhcCCCCceEEecC-----CCCCHHHHHHHHHHHHHHhh---cCCCeeecC Confidence 56789999999999999999999885 566666666676553 46899999999999998764 567877774 Q ss_pred CCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHH Q lcl|NC_012530. 319 AEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMI 397 (559) Q Consensus 319 ~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~i 397 (559) ++++|++++. +.|+||+|++++++++||++|||||++||....+ +++|++++.+.|++.||.||+++| T Consensus 247 -~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~----------~~sn~e~~~~~f~~~~l~P~~~~i 315 (412) T protein:vir:26 247 -PGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNT----------NFAKNEELNRFYLQHTLLPIVKQY 315 (412) T ss_pred -CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC----------CcccHHHHHHHHHHHHHHHHHHHH Confidence 5699999985 7999999999999999999999999999964433 568899999999999999999999 Q ss_pred HHHHHhhccccccC---ccceeeecchhhhhHHHHHHHHHHHHcC-CCCHHHHHHHhCCCCCCCCCEeeccceecccccc Q lcl|NC_012530. 398 AKNLTNGIIRQILG---DNYMLEFVGGDTRSQQDKLKSVQLELQT-ATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQ 473 (559) Q Consensus 398 e~~ln~~L~~~~~~---~~~~~~f~~l~~~d~~~~~~~~~~~~~~-~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~ 473 (559) |++||++|++..+. ..++|+++.+++.|.++++++++.++.+ +||+||+|+++||||+||||+++++.|+++++.. T Consensus 316 e~~ln~kLl~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~ 395 (412) T protein:vir:26 316 EEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPIDTP 395 (412) T ss_pred HHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccccccc Confidence 99999999987653 2345555688999999999999999865 6899999999999999999999999998877543 Q ss_pred cccccccccccccccccccccCCCCCCCCC Q lcl|NC_012530. 474 EQIKQNEFQRQQTRLTQLESALQNPSGTPP 503 (559) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (559) ....+. .+.++.+..+. T Consensus 396 ~~~~~~-------------~~gG~~n~~e~ 412 (412) T protein:vir:26 396 LELRKS-------------LKGGDKNVNES 412 (412) T ss_pred hhhccc-------------ccCCCCCcCCC Confidence 211100 01111111111 No 51 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=100.00 E-value=8.4e-80 Score=454.10 Aligned_cols=401 Identities=10% Similarity=0.041 Sum_probs=292.0 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) -||++|+ ..+...+-. ..++.+...+.+ +..+.......+.+ T Consensus 4 ~~~~~~~-------------------------~~~~~~~~~--~~~~~~~~~~~~-----------~~~~~~~~v~~~~~ 45 (409) T protein:vir:93 4 ENIVTRI-------------------------KKKLIDNWI--DQSTSKLYDFSP-----------WKNRSFWGVINNTL 45 (409) T ss_pred cchhhhh-------------------------hhhhhhhhh--cccccccccccc-----------ccCccccccchhhh Confidence 2333333 111111100 011111111100 00111111233567 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.+++.... .+ +.+. .|++..||+++++ ++|++.++ T Consensus 46 ~~~~~V~~ci~~Ia~~ia~lp~~~~~~~~~-------------~~------~~~~-~lL~~~PN~~~t~---~~f~~~~~ 102 (409) T protein:vir:93 46 ETNETIFSAITKLSNSMASLPLKMYEDYKV-------------VN------TEVS-DLLTVSPNNSLSS---FDFINQIE 102 (409) T ss_pred hccHHHHHHHHHHHHhhhhCceeEeecccc-------------cc------chHH-HHHhhhcccCCCH---HHHHHHHH Confidence 899999999999999999999876543211 11 1222 3444455655554 68999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) .+++++||+|++++|+..|++++||||+|++|++..+.++.. ..|.....++....++++||||++.++ ..++ T Consensus 103 ~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~~~~~~~~~~----~~y~~~~~~g~~~~~~~~eVih~r~~~---~~~~ 175 (409) T protein:vir:93 103 TIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRE----LYYSIHAATGNKLIVHNMDMLHFKHIV---ASNM 175 (409) T ss_pred HHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcE----EEEEEEcCCceEEEEccccEEEeCCCC---CCCc Confidence 999999999999999999999999999999999988776542 234444445556679999999998543 2356 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE 320 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g 320 (559) .||+|||.++..+|.++.++++++ ++.++..++++++.+ +.+++++++++++.|++.++ ++|+++|++ + T Consensus 176 ~~G~s~i~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~~-----~~l~~e~~~~~~~~~~~~~~---~~g~~~vl~-~ 244 (409) T protein:vir:93 176 VQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYG-----SNVGKEKRQQVLEDFKQYYE---ENGGILFQE-P 244 (409) T ss_pred cccccHHHHHHHHHHHHHHHHHHH--HHhcCCCCceEEecC-----CCCCHHHHHHHHHHHHHHhh---cCCCeeecC-C Confidence 789999999999999999998885 566666667777653 46899999999999998774 567877774 5 Q ss_pred ceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_012530. 321 DAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAK 399 (559) Q Consensus 321 ~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~ 399 (559) +++|+++++ +.|+||+|++++++++||++|||||++||+.+.+ +++|++++.+.|++.||+||+++||+ T Consensus 245 g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~----------~~sn~e~~~~~f~~~~l~P~~~~ie~ 314 (409) T protein:vir:93 245 GVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNT----------NFAKNEELNRFYLQHTLLPIVKQYEE 314 (409) T ss_pred CceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC----------CcccHHHHHHHHHHHHHHHHHHHHHH Confidence 699999985 7999999999999999999999999999975543 46789999999999999999999999 Q ss_pred HHHhhccccccC---ccceeeecchhhhhHHHHHHHHHHHHcC-CCCHHHHHHHhCCCCCCCCCEeeccceecccccccc Q lcl|NC_012530. 400 NLTNGIIRQILG---DNYMLEFVGGDTRSQQDKLKSVQLELQT-ATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQ 475 (559) Q Consensus 400 ~ln~~L~~~~~~---~~~~~~f~~l~~~d~~~~~~~~~~~~~~-~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~ 475 (559) +||++|+++.+. ..++|+++.+++.|.++++++++.++++ +||+||+|+++|+||+||||+++++.|+++++.... T Consensus 315 ~l~~~Ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~ 394 (409) T protein:vir:93 315 EFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPIDTPLE 394 (409) T ss_pred HHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccccccchh Confidence 999999987653 2345555688999999999999999864 689999999999999999999999999988765322 Q ss_pred cccccccccccccccccccCCCCCCCCC Q lcl|NC_012530. 476 IKQNEFQRQQTRLTQLESALQNPSGTPP 503 (559) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (559) .... ..+++.+..|. T Consensus 395 ~~~~-------------~~gG~~n~~e~ 409 (409) T protein:vir:93 395 LRKS-------------LKGGDKNVNES 409 (409) T ss_pred hccc-------------ccCCCCCcCCC Confidence 1110 01111111111 No 52 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=100.00 E-value=5.3e-79 Score=449.71 Aligned_cols=401 Identities=10% Similarity=0.051 Sum_probs=291.4 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) =||++|+ .+++.++. ...+......+.. | ......+ ...+.| T Consensus 4 ~~~~~~~-------------------------k~~~~~~~--~~~~~~~~~~~~~--~-------~~~~~~~--v~~~~a 45 (409) T protein:vir:94 4 ENIVTRI-------------------------KKKLIDNW--IDQSASKLYDFSP--W-------KNKSFWG--VINNTL 45 (409) T ss_pred cccchhh-------------------------hhHHhhhh--hcCCccccccccc--c-------cCccccc--cchhhh Confidence 1233333 11111111 1111111111111 0 0001111 234567 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.+++..+. .+ +.+.. |++..||+++++ ++|++.++ T Consensus 46 ~~~~~v~~~i~~Ia~~ia~lp~~~~~~~~~-------------~~------~~~~~-lL~~~PN~~~t~---~~f~~~~~ 102 (409) T protein:vir:94 46 ETNETIFSAITKLSNSMASLPLKMYEDYKV-------------VN------TEVSD-LLTVSPNNSLSS---FDFINQIE 102 (409) T ss_pred hccHHHHHHHHHHHHhhhhCceeEeecccc-------------cc------hhHHH-HHhhhcccCCCH---HHHHHHHH Confidence 889999999999999999999876543221 11 12333 344445655554 68999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) .+++++||+|++|+|+..|+|++||||+|++|++..+.++.. ..|.....++....++++||||++..+ ..++ T Consensus 103 ~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~----~~y~~~~~~g~~~~~~~~dvih~r~~~---~~~~ 175 (409) T protein:vir:94 103 TIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRE----LYYSIHAATGNKLIVHNMDMLHFKHIV---ASNM 175 (409) T ss_pred HHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcE----EEEEEEcCCceEEEEccccEEEecCCC---CCCc Confidence 999999999999999999999999999999999988776543 223333344555679999999998532 2356 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE 320 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g 320 (559) .||+||+..++.++..+.+++.++ ++.++..++++++.+ +.+++++++++++.|++.++ ++|+++|++ + T Consensus 176 ~~G~s~l~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~~-----~~l~~e~~~~~~~~~~~~~~---~~g~~~vl~-~ 244 (409) T protein:vir:94 176 VQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYG-----SNVGKEKRQQVLEDFKQYYE---ENGGILFQE-P 244 (409) T ss_pred cccccHHHHHHHHHHHHHHHHHHH--HHhcCCCCeeEEecC-----CCCCHHHHHHHHHHHHHHhh---cCCCeeecC-C Confidence 789999999999999999998885 566666666676553 46899999999999998874 567877774 5 Q ss_pred ceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_012530. 321 DAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAK 399 (559) Q Consensus 321 ~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~ 399 (559) +++|++++. ++|+||+|++++++++||++|||||++||..+.+ +++|++++.+.|++.||.|++++||+ T Consensus 245 g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~----------~~sn~e~~~~~f~~~~l~P~~~~ie~ 314 (409) T protein:vir:94 245 GVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNT----------NFAKNEELNRFYLQHTLLPIVKQYEE 314 (409) T ss_pred CceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC----------CcccHHHHHHHHHHHHHHHHHHHHHH Confidence 599999995 7999999999999999999999999999975433 46789999999999999999999999 Q ss_pred HHHhhccccccC---ccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccc Q lcl|NC_012530. 400 NLTNGIIRQILG---DNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQ 475 (559) Q Consensus 400 ~ln~~L~~~~~~---~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~ 475 (559) +||++|+++.+. ..++|+.+.+++.|.+++++++..+++ |+||+||+|+++|+||+||||+++++.++.+++...+ T Consensus 315 ~ln~~Ll~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~ 394 (409) T protein:vir:94 315 EFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPIDTPLE 394 (409) T ss_pred HHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeEeecccccccccchh Confidence 999999987653 234555568899999999999999886 5689999999999999999999999999988765322 Q ss_pred cccccccccccccccccccCCCCCCCCC Q lcl|NC_012530. 476 IKQNEFQRQQTRLTQLESALQNPSGTPP 503 (559) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (559) ..+ ..++++.+..|. T Consensus 395 ~~~-------------~~kGG~~n~~e~ 409 (409) T protein:vir:94 395 LRK-------------SLKGGDKNVNES 409 (409) T ss_pred hcc-------------cccCCCCCcCCC Confidence 110 011111111111 No 53 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=100.00 E-value=4.9e-79 Score=449.93 Aligned_cols=401 Identities=13% Similarity=0.069 Sum_probs=296.2 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) ||||++|+....... ....+... ..+..+.+ .. ........+ T Consensus 1 Mg~f~~~~~~~~~~~--------------------------~~~~~~~~-~~~~~~~~--------~~---~~~~~~~~~ 42 (406) T protein:vir:95 1 MGLFDRWRRTKRKSK--------------------------IRADTGYV-GLFMSGED--------VS---FLVPGYVRL 42 (406) T ss_pred Ccchhhhcccccccc--------------------------ccccchhh-hhhccCcc--------cC---ccccCHHHH Confidence 999999943111100 00001000 00111100 00 111123456 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.+++..++.. +.. ..+....++.+ ||++++ +++|+++++ T Consensus 43 ~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~---------~~~-----~~~~~~~l~~~--PN~~~t---~~~f~~~~~ 103 (406) T protein:vir:95 43 SDNPEVRMAVHKIADLISSMTIYLMQNTEDGD---------IRI-----RNELSRKIDIT--PYSLMT---RKSWMYNIV 103 (406) T ss_pred hhcHHHHHHHHHHHHhhccCceEEEEecCCcc---------eee-----cchHHHHHhhc--cCCCCC---HHHHHHHHH Confidence 88999999999999999999887655443211 000 11122233344 444444 578999999 Q ss_pred HHHHHcCCcce--EEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCcc Q lcl|NC_012530. 161 RDTYTYDQVNY--ENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDIL 238 (559) Q Consensus 161 ~d~ll~Gna~~--~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~ 238 (559) .+++++|++|+ ++.|+..|+|++||||+|.+|++..+.+|+.. ..+. ..|+++||||+++++.+ . T Consensus 104 ~~~ll~g~g~a~~~~~~~~~g~~~~l~~i~~~~v~~~~~~~~~~~---------~~~~--~~~~~~evih~~~~~~~--~ 170 (406) T protein:vir:95 104 YTMLLDGEGNSVVFPKYTADGLIDELVPLTPSKVNFLDTPDGYQV---------LYGG--QTFNYDEVLHFIYNPDP--E 170 (406) T ss_pred HHHHhcCCceEEEEEEECCCCcEEEEEEEcCceeEEEEcCCeEEE---------Eecc--EEEchhHEEEeeccCCC--C Confidence 99999977655 56788999999999999999999988775321 1122 36899999999976543 3 Q ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc Q lcl|NC_012530. 239 SGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT 318 (559) Q Consensus 239 ~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~ 318 (559) ++.+|+||+..+..+|.++.++++++.++|.||++|+|+|++++ .+++++.++++++|.+.++|..|+|+++|++ T Consensus 171 ~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~-----~l~~e~~~~~~~~~~~~~~g~~n~~~~~v~~ 245 (406) T protein:vir:95 171 RPYIGRGYRVVLKDIADNLKQATATKKSFMSGKYMPSLIVKVDA-----ATAELSSEEGRNAVFKKYLQATEAGQPWIIP 245 (406) T ss_pred CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-----CCCHHHHHHHHHHHHHHhccccccCCceeec Confidence 56789999999999999999999999999999999999998864 5889999999999999999999999999998 Q ss_pred CCceeeeecc--ccchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHH Q lcl|NC_012530. 319 AEDAKFVSMT--QAEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDM 396 (559) Q Consensus 319 ~g~~~~~~ls--~~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ 396 (559) .++.+++++. +++|+||+|++++++++||++|||||++||.. ++.+++...|++.||.||+++ T Consensus 246 ~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~---------------~~~~~~~~~~~~~~l~P~~~~ 310 (406) T protein:vir:95 246 AELLEVEQVKPLSLKDIAINEAVELDKRTVAGMFGVPAFLLGIG---------------EFNRDEYNNFINSTILPIAKG 310 (406) T ss_pred CCCccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCC---------------CchHHHHHHHHHHHHHHHHHH Confidence 7777776654 36899999999999999999999999999842 234667788999999999999 Q ss_pred HHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccc Q lcl|NC_012530. 397 IAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQ 475 (559) Q Consensus 397 ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~ 475 (559) ||++||++|+++. +..++|+++.+++.|.+++++.+..++. |+||+||+|+++||||+||||+++++.++++++.... T Consensus 311 ie~~l~~~l~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~l~~~G~~t~NE~R~~~gl~p~~~gd~~~~~~n~~~~~~~~~ 389 (406) T protein:vir:95 311 IEQELTRKLLISP-DLYFKFNPRSLYAYDLKELAEVGSNMYVRGIMEGNEVRDWLGLSPKEGLSELVILENYIPLDKIGD 389 (406) T ss_pred HHHHHHHhcCCCC-CcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccCccchhhccc Confidence 9999999999764 3457777788999999999999998886 5689999999999999999999999999988865432 Q ss_pred cccccccccccccccccccCCCCCCCCCCCCccccccc Q lcl|NC_012530. 476 IKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (559) .... ++++.++.+. +.+ T Consensus 390 ~~~~--------------k~g~~~~~~~-------~~~ 406 (406) T protein:vir:95 390 QSKL--------------KGGDNSGADG-------QTD 406 (406) T ss_pred cccc--------------CCCCCCCCCC-------CCC Confidence 1110 0000000000 000 No 54 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=100.00 E-value=9e-79 Score=448.46 Aligned_cols=407 Identities=13% Similarity=0.083 Sum_probs=292.0 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhcccccccccc--ccccccccccccccccccCCCCCcccHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEP--VDGNLMFSTLEDTSIVPKPSPIAFGRITDVLR 78 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~--~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~ 78 (559) |-+++--| +.... ..+++.++..++ ........+.... ..+.+..... ..... T Consensus 1 ~~~~~~~~------------~~~~m---------~~F~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~-~~~~~ 55 (413) T protein:vir:96 1 MPGVSEIR------------KDKNL---------KFFNNKRSPTEESKAKDEIPKAPQVVM---TLPNFFKELI-SDGYT 55 (413) T ss_pred CCccchhh------------hhhcC---------CccccCCCcchhhhhhccccccccccc---cchhhHhhhc-cchhH Confidence 33333221 11111 122221111100 0000000000000 0000000000 01123 Q ss_pred HHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHH Q lcl|NC_012530. 79 QYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRK 158 (559) Q Consensus 79 ~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~ 158 (559) .++.+++|++||++||++||++|+.+++...+. .+ ..++....|.+..||++++ +++|++. T Consensus 56 ~~~~~~~v~~cI~~ia~~ia~~~~~~~~~~~~~---------~~-------~~~~~~~~ll~~~PN~~~t---~~~f~~~ 116 (413) T protein:vir:96 56 KLSDSPEVRMAVDCIADLVSNMTIQLMQNGETG---------DK-------RIKNDLSRVVDIEPNKYLS---RKTFIQW 116 (413) T ss_pred HHhhchHHHHHHHHHHHhhccCceEEEEecCCC---------cc-------ccccHHHHHHHhccccCCC---HHHHHHH Confidence 467899999999999999999987765432211 00 0112223344445555554 4789999 Q ss_pred HHHHHHHcCCcceEEEECCCC-cEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCc Q lcl|NC_012530. 159 LVRDTYTYDQVNYENTYDSNG-RLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDI 237 (559) Q Consensus 159 ~v~d~ll~Gna~~~i~rd~~G-~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~ 237 (559) ++.+++++||+|++++|+.+| .+++||||+|.+|++..+... ..|.....+ ..+.++||||++.++.+ T Consensus 117 ~~~~lll~Gn~~~~i~r~~~g~~~~~L~~l~~~~v~~~~~~~~------~~y~~~~~~---~~~~~~evih~k~~~~~-- 185 (413) T protein:vir:96 117 LVRSMLLEGNGNAVVKPQVSGDKIIGLTPISPYKVTFNVSDDD------LDYSITFDN---KEYDPSTLLHFVLNPSI-- 185 (413) T ss_pred HHHHHhhcCCeEEEEEEcCCCCceEEEEEecCceeEEEEcCCe------EEEEEeecC---cEEchhhEEEEeccCCC-- Confidence 999999999999999999877 578999999999999876432 234333333 35789999999876533 Q ss_pred cCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc Q lcl|NC_012530. 238 LSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI 317 (559) Q Consensus 238 ~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl 317 (559) .++.+|+||+.++..+|.++.++++|+.++|+||++|+|+|+++ +.+++++.++++++|++.++|..|+|+++|| T Consensus 186 ~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-----~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl 260 (413) T protein:vir:96 186 ERPFIGTGYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVD-----SDSDELSDEEGRENFEEMYLKRKEAGKPWII 260 (413) T ss_pred CCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeC-----CCCCHHHHHHHHHHHHHHhcCccccCceeee Confidence 34568999999999999999999999999999999999999885 3588999999999999999999999999999 Q ss_pred cCCceeeeecc--ccchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHH Q lcl|NC_012530. 318 TAEDAKFVSMT--QAEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLD 395 (559) Q Consensus 318 ~~g~~~~~~ls--~~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~ 395 (559) ++++.++..+. .++|+||+|++++++++||++|||||++||.. .+.+++...|+++||+||++ T Consensus 261 ~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~---------------~~~~~~~~~~~~~~l~P~~~ 325 (413) T protein:vir:96 261 PEGMVNVQQIKPLTLNDLAINDAVTLDKKTVAGIFGVPAFLLGVG---------------TYNKDEFNNFINTKIMSIAQ 325 (413) T ss_pred cCCcccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCC---------------cchHHHHHHHHHHHHHHHHH Confidence 88877777653 36899999999999999999999999999842 12456677899999999999 Q ss_pred HHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceeccccccc Q lcl|NC_012530. 396 MIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQE 474 (559) Q Consensus 396 ~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~ 474 (559) .||++||++|+++ +..++|+++.+++.|.+++++++..++. |+||+||+|+++|+||+||||+++++.|+++++... T Consensus 326 ~ie~~ln~~ll~~--~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~~gd~~~~~~n~~~~~~~~ 403 (413) T protein:vir:96 326 VIQQTYNKLIVEE--DMYFSLNPRSLYNYSLTEMVSAGAQMTQLNALRRNEFRNWVGMPPDAEMDDLLVLENYLQQKDLV 403 (413) T ss_pred HHHHHHHHhhCCC--CcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeecccccchhhcc Confidence 9999999999874 3456777778999999999999988885 568999999999999999999999999998886543 Q ss_pred ccccccccccccccccccccCCCCCCCCC Q lcl|NC_012530. 475 QIKQNEFQRQQTRLTQLESALQNPSGTPP 503 (559) Q Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (559) ..... .+.++ T Consensus 404 ~~~~~-------------------~~~dt 413 (413) T protein:vir:96 404 NQKKL-------------------IQDET 413 (413) T ss_pred cccCC-------------------CCCCC Confidence 21110 00001 No 55 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=100.00 E-value=2.8e-78 Score=445.80 Aligned_cols=400 Identities=11% Similarity=0.035 Sum_probs=287.1 Q ss_pred HHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHh Q lcl|NC_012530. 25 SKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRA 104 (559) Q Consensus 25 ~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~ 104 (559) ++++... ... +......+ .....+ .++ . . .+...|+.+++|++||++||++||++|+.. T Consensus 1 m~~f~~~--------~~~---~~~~~~~~-~~~~~~---~~~---~-~--~~~~~Al~~~~V~~~i~~Ia~~iA~lp~~~ 59 (406) T protein:vir:97 1 MSFFQPL--------GTS---KVSYDDYI-SSVLAG---DVS---Q-K--YLGVSALKNSDILTATSIIAGDIARFPLVK 59 (406) T ss_pred Ccccccc--------CCC---CCCcchHH-HHHhcC---CCC---c-c--cccchhhccHHHHHHHHHHHHhhhhCeeEE Confidence 4444321 100 00000001 000000 010 1 1 112247889999999999999999998643 Q ss_pred hhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECC-CCcEEE Q lcl|NC_012530. 105 STDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDS-NGRLSH 183 (559) Q Consensus 105 ~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~-~G~~~~ 183 (559) ++ .+| ... ..+.+ ..|++..||++++ +++||+.++.+++++||+|++|+|+. .|++.+ T Consensus 60 ~~-~~g----------~~~------~~~~~-~~lL~~~PN~~~t---~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~ 118 (406) T protein:vir:97 60 KD-VNG----------DII------HDEDI-NYLLNVKSTSNAS---ARTWKFAMAVNAILTGNSFSRILRDPKTNQALQ 118 (406) T ss_pred Ee-cCc----------ccc------ccchH-HHHhhccCCCCCC---HHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEE Confidence 22 111 111 11223 3445555666664 47899999999999999999999985 689999 Q ss_pred EEEecCceEEEEecCcccccccceEEE-EEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHH Q lcl|NC_012530. 184 TRMVDPTTIYFANDEHGHRRTRGKIYR-QYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTEL 262 (559) Q Consensus 184 L~~l~p~~V~~~~~~~g~~~~~~~~y~-~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~ 262 (559) ||||+|++|++..++.|.. .|. ....++....++++||||++.++ .++.+|+|||.+++.+|.++.++++ T Consensus 119 L~~i~p~~v~v~~~~~~~~-----~y~~~~~~~~~~~~~~~~evih~r~~~----~dg~~G~spi~~~~~~i~~~~a~~~ 189 (406) T protein:vir:97 119 FQFYRPSETTVEETDNHEI-----VYTFTDMLTAKQVKCFAHDVIHWKFFS----HDTILGRSPLLSLGDEIDLQTGGIN 189 (406) T ss_pred EEEECCCeeEEEEcCCceE-----EEEEEecCCceEEEEccccEEEecCCC----CCCcccccHHHHHHHHHHHHHHHHH Confidence 9999999999988776643 232 22345566789999999998653 3466799999999999999999999 Q ss_pred HHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccc-cchhHHHHHHHH Q lcl|NC_012530. 263 FNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQ-AEDMQFQSWLNY 341 (559) Q Consensus 263 ~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~-~~D~qf~e~~~~ 341 (559) |+.++|+||+.|++++..+ ..+++++++++++.|++.++| .|+|+++||++ +++|++++. +.|+||+|++++ T Consensus 190 ~~~~~f~ng~~~~~i~~~~-----~~l~~e~~~~~~~~~~~~~~g-~n~g~~~vl~~-g~~~~~l~~~~~d~q~le~~~~ 262 (406) T protein:vir:97 190 TLIKFFKDGFSSGILTMKG-----AQLSGDARQRARQEFEKMREG-SVGGSPLVFDS-TMEYTPLEIDTNVLQLITSNNF 262 (406) T ss_pred HHHHHHhccCCCceEEecC-----CCCCHHHHHHHHHHHHHHhcc-cccCceeecCC-CceEEEccCCHHHHHHHHHHHh Confidence 9999999999988776543 468999999999999999988 58899888754 599999984 799999999999 Q ss_pred HHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecch Q lcl|NC_012530. 342 LINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGG 421 (559) Q Consensus 342 ~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l 421 (559) ++++||++|||||++||... .++|++++...|++.||.||+++||++|+++|+++.+...++++|+. T Consensus 263 ~~~~Ia~afgVPp~~lg~~~------------~~~~~e~~~~~f~~~~l~P~~~~ie~~l~~kll~~~~~~~~~i~fd~- 329 (406) T protein:vir:97 263 STAQIAKALRVPSYKLGVNS------------PNQSVAQLMEDYVTNDLPFYFDAITSELGLKTLNDKDRRLYHIEFDT- 329 (406) T ss_pred hHHHHHHHhCCCHHHcCCCC------------CcchHHHHHHHHHHHHHHHHHHHHHHHHhhhhcChhhccceeEEEec- Confidence 99999999999999998411 24688999999999999999999999999999987766667777752 Q ss_pred hhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCC--CCEeeccceecccccccccccccccccccccccccccCCCC Q lcl|NC_012530. 422 DTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAG--GDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNP 498 (559) Q Consensus 422 ~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~g--GD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (559) +.+.+.+++.+..++. |+||+||+|+++|+||+++ ||++++|.|+.+++...+. +...... .+ T Consensus 330 -~~~~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~-----~~~~~~~----~~---- 395 (406) T protein:vir:97 330 -RSVTGRNVDEIVKLVNNQILTPNQGLVELGKQKSTDPNMDRYQSSLNYVFLDKKEEY-----QDKVGIK----GK---- 395 (406) T ss_pred -CccchhhHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeEeeccCccchhccccc-----ccccccc----cC---- Confidence 3455566666666665 5799999999999999966 9999999999887654211 0000000 00 Q ss_pred CCCCCCCCcccccc Q lcl|NC_012530. 499 SGTPPTLPPSSSNS 512 (559) Q Consensus 499 ~~~~~~~~~~~~~~ 512 (559) ..+...+ ..++ T Consensus 396 -gg~~~~~--~~~~ 406 (406) T protein:vir:97 396 -GGEVNAE--EDKS 406 (406) T ss_pred -CCCCCCC--CCCC Confidence 0000000 0000 No 56 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=100.00 E-value=3.2e-77 Score=439.96 Aligned_cols=414 Identities=12% Similarity=0.094 Sum_probs=292.3 Q ss_pred HHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCCh-----h Q lcl|NC_012530. 76 VLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIR-----D 150 (559) Q Consensus 76 ~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~-----~ 150 (559) |++.+..||+|++||++||++||++|+.++. +...+.........+.+..+|+++.|+..++. + T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~-----------~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~ 69 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIP-----------HPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERA 69 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEE-----------ccCcccccchhhhhhhHHHHhhccCCCccccchhhHhh Confidence 5566677999999999999999988765432 22222223334566777888888888776543 3 Q ss_pred hHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCccccccc--ceEEEE-E--------------- Q lcl|NC_012530. 151 DFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTR--GKIYRQ-Y--------------- 212 (559) Q Consensus 151 ~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~--~~~y~~-~--------------- 212 (559) ++.+||++++.|++++||+|++++|+..|+|++||||+|++|++..+..+++... ...|+. + T Consensus 70 t~~~~~~~~~~~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 149 (467) T protein:vir:31 70 TATNVLQTAWTDYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDERGFVQLLEEKEKYFGVAGDRYQTNGNGDLDPV 149 (467) T ss_pred HHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeecceeEeecCCceeeEEeccccceeecccceeee Confidence 6778999999999999999999999999999999999999999988765433211 011111 0 Q ss_pred ------ecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCC Q lcl|NC_012530. 213 ------IDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSV 286 (559) Q Consensus 213 ------~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~ 286 (559) ...+....++++||||++.+ ...++.||+||+.+++.+|.++.+++.|+.++|+||++|+|||++++ T Consensus 150 ~~~~~~~~~~~~~~~~~~diih~r~~---~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---- 222 (467) T protein:vir:31 150 FVDADDGSTGTSVSNPANELIFKRNH---SPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFENDGVPRIAIIVKG---- 222 (467) T ss_pred eeeeccccccceeEeccccEEEecCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC---- Confidence 11233456899999999753 23456799999999999999999999999999999999999998753 Q ss_pred ccCCHHHHHHHHHHHHHHhc-----------CcccccccccccCC------ceeeeecc--ccchhHHHHHHHHHHHHHH Q lcl|NC_012530. 287 TNTSMRALEDFKRHWTATSS-----------GINGAYRIPMITAE------DAKFVSMT--QAEDMQFQSWLNYLINIIC 347 (559) Q Consensus 287 ~~~~~e~~~~l~~~~~~~~~-----------G~~nag~~~vl~~g------~~~~~~ls--~~~D~qf~e~~~~~~~~Ia 347 (559) +.+++++++++++.|++.++ |..|++++.++..+ ++++++++ .++|+||+|++++++++|| T Consensus 223 ~~l~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~~Ia 302 (467) T protein:vir:31 223 AELTEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLEFRGRNEHDIL 302 (467) T ss_pred cCCCHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhhHHHHHHHHHHHHHHH Confidence 35899999999999998776 55678887777554 35677776 3589999999999999999 Q ss_pred HHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccC---ccceeeecchhhh Q lcl|NC_012530. 348 ALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILG---DNYMLEFVGGDTR 424 (559) Q Consensus 348 ~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~---~~~~~~f~~l~~~ 424 (559) ++|||||++||+.+.+++ .+|++++...|++.||+|++++||++||++|++.... ..++|+++.+++. T Consensus 303 ~~fgVpp~~lG~~~~~~~---------~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~~~~~~~~~i~f~~~~l~~~ 373 (467) T protein:vir:31 303 KVHDVPPVIAGVVESGAF---------STDAEEQRKEFAEETIQPKQHDFGELLYELVHKQGLDAPDWTIEFELAKPDTK 373 (467) T ss_pred HHhCCCHHHcccCCCCCc---------ccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhccCCceEEEecchhhcc Confidence 999999999998765432 3578999999999999999999999999999986543 3467777899999 Q ss_pred hHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCC Q lcl|NC_012530. 425 SQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPP 503 (559) Q Consensus 425 d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (559) |.+++++++..+++ |+||+||+|+++||||+++++. .+........ ..+..+. . +..+++++ T Consensus 374 d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~pi~d~~~--~~~~~~~~~~-------~~~~~~~------~--~~~~~~~~ 436 (467) T protein:vir:31 374 LQDVEIASQRVQAMQGLLTVNELRDEFGFEPFPEEHV--YGGETLVAEV-------TGGSGPG------G--GIGDQIEQ 436 (467) T ss_pred CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCcccc--cCCccccccc-------ccccCCC------C--cccCcCCC Confidence 99999999999886 5689999999999999965432 2211110000 0000000 0 00000000 Q ss_pred CCCccccccchhcccccccc-ccccccccccccccccc Q lcl|NC_012530. 504 TLPPSSSNSFQQNQEGYTGK-DAKPSGKDNQQGVGKDG 540 (559) Q Consensus 504 ~~~~~~~~~~~~~~~~~~~~-~~~~~g~~~~~~~~~~~ 540 (559) . .....+...+...++..+ ...|.|. ..|. T Consensus 437 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~ 467 (467) T protein:vir:31 437 L-VEDRADEIIDSYQADLETEQLIEIGA------NADS 467 (467) T ss_pred C-CCCcccchHhhhhhccccchhhhhcc------ccCC Confidence 0 000000000000011100 0111121 1111 No 57 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=100.00 E-value=5.4e-78 Score=444.18 Aligned_cols=398 Identities=13% Similarity=0.088 Sum_probs=289.5 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+||+.|+.+.-.+-.. +. ..+....+.... ... .. ..+ T Consensus 1 Mg~~~~f~~k~~~~~~~----------------------------~~---~~~~~~~~~~~~---~~~---~~----~~~ 39 (403) T protein:vir:80 1 MGLFNFFRRKTRSEPTN----------------------------AI---SWFLTQEAYDTL---AIP---GY----TRL 39 (403) T ss_pred Ccccccccccccccccc----------------------------hh---hhhccccccccc---ccc---hh----hhh Confidence 99999885432110000 00 000001000000 000 11 124 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+|+|++||++||++||++|+.+++..++.. .+ ..+.+. .|++..||++++ +++||+.+| T Consensus 40 ~~~~~V~~~I~~ia~~iA~~p~~~~~~~~~g~--~~-------------~~~~~~-~lL~~~PN~~~t---~~~f~~~~v 100 (403) T protein:vir:80 40 SDNPEVRMAVHKIAELISSMTIHLMQNTDNGD--IR-------------IKNELS-RKIDINPYSLMT---RKAWMYNIV 100 (403) T ss_pred hhhHHHHHHHHHHHHhhhhCceEEEEecCCce--ee-------------cCChHH-HHHhccCCcCCC---HHHHHHHHH Confidence 56899999999999999999987655433211 11 011232 334445666665 468999999 Q ss_pred HHHHHc--CCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCcc Q lcl|NC_012530. 161 RDTYTY--DQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDIL 238 (559) Q Consensus 161 ~d~ll~--Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~ 238 (559) .++|+. ||+|+++.|+..|+|++||||+|++|++..+.+|+. .+++ ...+.++||||++.++.+ . T Consensus 101 ~~~ll~~~Gna~i~~~~~~~g~~~~L~~l~p~~v~~~~~~~g~~-----~~y~------~~~~~~~eiih~~~~~~~--~ 167 (403) T protein:vir:80 101 YTMLLDGEGNSVVFPKYTTSGLIDELIPLAPSKVSFVDTDTGYQ-----IWYQ------GKAYNYDEVLHFIVNPDP--E 167 (403) T ss_pred HHHhhcCCccEEEEEEEcCCCcEEEEEEEcCCeeEEEEcCCceE-----EEEe------ecccchhhEEEEeccCCC--c Confidence 999985 778999999999999999999999999998887643 1221 135789999999976543 3 Q ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc Q lcl|NC_012530. 239 SGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT 318 (559) Q Consensus 239 ~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~ 318 (559) ++.+|+||+..++.++....++++++.++|+||++|+|||+++. .+++++.++++++|.+.+.|..++|++++++ T Consensus 168 ~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 242 (403) T protein:vir:80 168 KPYMGRGYRVVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDA-----ATAELSSEEGRNAVFKKYLEASEAGQPWIIP 242 (403) T ss_pred CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-----CCChHHHHHHHHHHHHHHhhhhhcCCeeeec Confidence 45679999999999999999999999999999999999998864 4678888999999999999999999998887 Q ss_pred CCceeeeecc--ccchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHH Q lcl|NC_012530. 319 AEDAKFVSMT--QAEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDM 396 (559) Q Consensus 319 ~g~~~~~~ls--~~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ 396 (559) .++.++..+. .++|+||+|++++++++||++|||||++||+.+ +.++....|+..||.||+++ T Consensus 243 ~~~~~~~~~~~l~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~---------------~~~~~~~~f~~~~l~P~~~~ 307 (403) T protein:vir:80 243 AELLDVEQVKPLSLKDLAIHETVELDKRTVAGIFGVPAFLLGVGK---------------YDKDEYNNFINSTILPIAKG 307 (403) T ss_pred ccccccceeccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHcCCCC---------------ccHHHHHHHHHHHHHHHHHH Confidence 6655444332 368999999999999999999999999998522 11233456999999999999 Q ss_pred HHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccc Q lcl|NC_012530. 397 IAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQ 475 (559) Q Consensus 397 ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~ 475 (559) ||++|+++||++.+ ..++|+.+.++++|.+++++++..+++ |+||+||+|+++||||+||||+++++.+++++....+ T Consensus 308 ie~~l~~kll~~~~-~~~~f~~~~ll~~d~~~~~~~~~~~~~~Gi~t~NE~R~~~gl~p~~ggd~~~~~~n~~pl~~~~~ 386 (403) T protein:vir:80 308 IEQELTRKLLISPD-LYFKFNPRSLYAYDLKELAEVGSNMYVRGLMEGNEVRDWLGLSPKEGLSELVILENYIPLDKIGD 386 (403) T ss_pred HHHHHHHhccCCCC-cEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeecccccchhhccc Confidence 99999999998643 345556668899999999999998886 5689999999999999999999999999998865432 Q ss_pred cccccccccccccccccccCCCCCCCCCC Q lcl|NC_012530. 476 IKQNEFQRQQTRLTQLESALQNPSGTPPT 504 (559) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (559) .......+ .+..+.+.+ T Consensus 387 ~~~~k~ge------------~~~~~~~~~ 403 (403) T protein:vir:80 387 QNKLKGGE------------KGGADGQTD 403 (403) T ss_pred hhhccCCC------------CCCCCCCCC Confidence 21100000 000000000 No 58 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=100.00 E-value=5.4e-76 Score=433.22 Aligned_cols=469 Identities=14% Similarity=0.106 Sum_probs=288.0 Q ss_pred HHHHHHhhhhccccccc----cccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhh Q lcl|NC_012530. 27 IANDTASKALNGVDRAY----TEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAH 102 (559) Q Consensus 27 ~~~~~~~~~~~gr~~a~----~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~ 102 (559) .|+..-.-..-+|-.+. ..+.... ...+.+. .| ..++..+.+.+..+++|++||++||++||++|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~-~p----p~~~~~La~~~~~n~~v~scI~~ia~~ia~~~~ 70 (540) T protein:vir:41 1 MFNYHLSIKSLEKYRAIKGDTDSQALKE-----DRFEEYV-EP----KVHPLVLLSLLQVNPYHASACSIKANDILRTGY 70 (540) T ss_pred CCCcccChhhccchhhhhcccccccccc-----CCCCccc-cC----CCCHHHHHHHHHhcHHHHHHHHHHHHHHhcCCc Confidence 22211111111111100 1111111 1111111 11 125566777888999999999999999998876 Q ss_pred HhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEE Q lcl|NC_012530. 103 RASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLS 182 (559) Q Consensus 103 ~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~ 182 (559) .+ ..++. .+.+++ ||.++ ++.+|+++++.|++++||+|++++|+..|+|+ T Consensus 71 ~i-----------~~~~~------------~~~~~l----pN~~~---t~~~f~~~~v~dlll~Gnayv~i~r~~~G~~~ 120 (540) T protein:vir:41 71 LI-----------DGDDG------------GVEELL----RACRP---SFEFILLQALEDLQVFNYCTLEVVRDDQGEPV 120 (540) T ss_pred eE-----------ecCcc------------chhhhc----cCCCC---CHHHHHHHHHHHHHhcCCeEEEEEECCCCcEE Confidence 43 22221 122232 44444 45789999999999999999999999999999 Q ss_pred EEEEecCceEEEEecCccccccc---ceEEEE---------EecCceeeeecccceEEEecccCCCccCCcccccHHHHH Q lcl|NC_012530. 183 HTRMVDPTTIYFANDEHGHRRTR---GKIYRQ---------YIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMG 250 (559) Q Consensus 183 ~L~~l~p~~V~~~~~~~g~~~~~---~~~y~~---------~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~ 250 (559) +||||+|.+|++..+..++.... ...|+. ...+.....++++||||++.+ ...++.||+||+.++ T Consensus 121 ~L~~i~~~~V~v~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~eViHir~~---~~~~~~~G~Spi~~~ 197 (540) T protein:vir:41 121 RLDYIPAHTVRVHRDGSRYMQTWDGIHVTYFKDYRYEGEVNPDNGEDQDGVGANEIIFIHLP---SPICSYYGVPRYLSA 197 (540) T ss_pred EEEEeCCcceEEeEcCceeEeeecCceeeeeecccccceeeccccccceeecccceEEecCC---CCCCCcccccHHHHH Confidence 99999999999988766543211 111111 112333457899999999753 234577999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCcc-----CCHHHHHHHHHHHHHHhcCc-cccccccccc-----C Q lcl|NC_012530. 251 LREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTN-----TSMRALEDFKRHWTATSSGI-NGAYRIPMIT-----A 319 (559) Q Consensus 251 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~-----~~~e~~~~l~~~~~~~~~G~-~nag~~~vl~-----~ 319 (559) +.+|..+.++++|+.+||+||++|+|||++++...... ..++.++++++.|++.++|. +|+|+++||+ . T Consensus 198 ~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~~~~~ 277 (540) T protein:vir:41 198 APSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIPGGDT 277 (540) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHHHHHhccccccccceEEEecCCCcc Confidence 99999999999999999999999999999986543321 22345577888888888874 5788887775 3 Q ss_pred Cceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHH Q lcl|NC_012530. 320 EDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIA 398 (559) Q Consensus 320 g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie 398 (559) ++++|++++. ++|+||+|++++++++||++|||||++||+.+.++ .+++|++++...|+++||.|++++|| T Consensus 278 ~g~~~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~--------~n~sn~eq~~~~f~~~tL~P~~~~ie 349 (540) T protein:vir:41 278 VEVTFTPLNTSQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGP--------LGGNFAEVARRTYYESVVRPQQEIVS 349 (540) T ss_pred cceeEEecccchhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCC--------CCcccHHHHHHHHHHHHHHHHHHHHH Confidence 5799999985 79999999999999999999999999999977554 35789999999999999999999999 Q ss_pred HHHHhhccccccCccceeeec--chhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCC-CEeeccceeccccccc Q lcl|NC_012530. 399 KNLTNGIIRQILGDNYMLEFV--GGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGG-DIILSAVYIQRLGQQE 474 (559) Q Consensus 399 ~~ln~~L~~~~~~~~~~~~f~--~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gG-D~~~~~~~~~~l~~~~ 474 (559) ++||++|++... ..++|+|+ .+++.|.++++ ..+++ |+||+||+|+.+ +|+++| |.++.|.++....... T Consensus 350 ~~ln~~L~~~~~-~~~~i~f~~~~ll~~D~~~~~---~~lv~~G~lT~NE~Re~L--~g~e~gdd~~l~p~n~~~~~~~~ 423 (540) T protein:vir:41 350 SVLTDFIQLKLD-PGARFVFNEEILMESEFVHNY---ALLVQCGVLTPSEVREKL--FGLDGGPDMFMVPSSIGKSAMKR 423 (540) T ss_pred HHHHHhhhhccC-CceEEEecchhhcchHHHHHH---HHHHhCCCCCHHHHHHHh--CcCcCCCcccccccccccccccc Confidence 999999987643 35666665 56666665554 34454 679999999753 444444 6667776664322221 Q ss_pred ccccccccccccccccccccC-CCCCCCCCCC-Cccccccchhccccccccccccccccccccccccccccccchhhhhh Q lcl|NC_012530. 475 QIKQNEFQRQQTRLTQLESAL-QNPSGTPPTL-PPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQLKNKKNTNSYK 552 (559) Q Consensus 475 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~k~~~~~~~~~ 552 (559) +.. ........+........ ....+...+. ..++..++.++..+...++..++||-..+=.+.-|+- .+-- T Consensus 424 ~~~-~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~ 496 (540) T protein:vir:41 424 QKR-NYEKNQINEIKRTYAKYKPRIQEIISSESPLEDKKKKIDEVLSDFRAEAYENGKKMLSIAGDMGTM------SAIN 496 (540) T ss_pred ccc-ccCCCCccccccccchhcccccCccccccccccccccccccccccCCccccchhHHHHHhhhhhhh------hhhh Confidence 110 00000000000000000 0000000000 0111112222222222233333333222111111110 0001 Q ss_pred ccCCCCC Q lcl|NC_012530. 553 QGGSSKK 559 (559) Q Consensus 553 ~~~~~~~ 559 (559) +|-+-+- T Consensus 497 ~~~~~~~ 503 (540) T protein:vir:41 497 RGVSMIP 503 (540) T ss_pred cCceecC Confidence 1111111 No 59 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=100.00 E-value=2.5e-75 Score=429.58 Aligned_cols=477 Identities=14% Similarity=0.132 Sum_probs=294.5 Q ss_pred chhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhh Q lcl|NC_012530. 3 IFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSM 82 (559) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~ 82 (559) +|+.. |.--++ ...+++.... ..+..+.......+. .|+ .+...+.+.+.. T Consensus 1 ~~~~~---~~i~s~--------------~~~~~i~~~~-------~~s~~~~~~~~~~~~-~pp----~~~~~la~l~~~ 51 (542) T protein:vir:41 1 MFNYH---LSIRSL--------------EKYKAIKREE-------VESQALGETRFEEYV-EPK----VNPLVLLSLLQV 51 (542) T ss_pred Ccccc---cccccc--------------ccchhhhhcc-------ccccccccccCCccc-cCC----CCHHHHHHHHhh Confidence 22211 000000 0011111011 011111111111111 121 244556677788 Q ss_pred ChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHH Q lcl|NC_012530. 83 NVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRD 162 (559) Q Consensus 83 ~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d 162 (559) +++|++||++||++||++|+.+... + .+ .+.+..||+++ ++++|++.++.+ T Consensus 52 n~~v~scI~~ia~~IA~l~~~~~~~-----------~-----------~~----~l~~~lpN~~~---s~~~f~~~~v~~ 102 (542) T protein:vir:41 52 NPYHASACSIKANDIIRTGYILEGD-----------D-----------EG----VVDEFIRACKP---SFEYVLLRALED 102 (542) T ss_pred cHHHHHHHHHHHHHHhhCceeeecc-----------c-----------ch----hhhhhcCCCCC---CHHHHHHHHHHH Confidence 9999999999999999887654211 0 01 22333355554 457899999999 Q ss_pred HHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccc---cce-EEEEEec--------CceeeeecccceEEEe Q lcl|NC_012530. 163 TYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRT---RGK-IYRQYID--------NKVRGSFTADEMGMFI 230 (559) Q Consensus 163 ~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~---~~~-~y~~~~~--------~~~~~~~~~~evi~~~ 230 (559) ++++||+|++++||..|+|++||||+|.+|++..+..+.... ... .|..+.+ +.....++++||||++ T Consensus 103 lll~Gnayi~i~rd~~G~~~~L~~l~~~~v~v~~d~~~~~~~~~~~~~~~~~~y~~~~~~~~~~g~~~~~~~~~eIiHir 182 (542) T protein:vir:41 103 LQVFNYCTLEVVRDDRGDPIRFEYIPSHTIRVHKDGSRYRQTWDGVNITHFKDYRYEGEINPETGEDQDSVGANELVFIH 182 (542) T ss_pred HhhcCCeEEEEEEcCCCcEEEEEEEcCcceEEEEcCCeeEeeecCCcceeEEeecccccccccccccccccCcccEEEec Confidence 999999999999999999999999999999998876653321 111 1222211 2223457889999997 Q ss_pred cccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccC-----CccCCHHHHHHHHHHHHHHh Q lcl|NC_012530. 231 RNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPS-----VTNTSMRALEDFKRHWTATS 305 (559) Q Consensus 231 ~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~-----~~~~~~e~~~~l~~~~~~~~ 305 (559) .+. ..++.||+|||..++.+|..+.++++|+.++|+||++|+|||++++... ...+++++++++++.|++.+ T Consensus 183 ~~~---~~~~~~Glspi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~ 259 (542) T protein:vir:41 183 IPS---PVCSYYGVPRYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNF 259 (542) T ss_pred CCC---CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHH Confidence 543 3457799999999999999999999999999999999999999876532 34688999999999999999 Q ss_pred cCc-cccccccccc-----CCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhH Q lcl|NC_012530. 306 SGI-NGAYRIPMIT-----AEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNN 378 (559) Q Consensus 306 ~G~-~nag~~~vl~-----~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~ 378 (559) +|. +|+|+++||. .++++|++++. +.|+||+|++++++++||++|||||++||+.+.+++ +++|+ T Consensus 260 ~g~~~n~gk~~vL~~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~--------n~sn~ 331 (542) T protein:vir:41 260 KHLKEAPHTPLVFSIPGGDTVKVTFTPLNTSQKELSFREYAAEKKYDIAAAHMIDPYRLGIADTGPL--------GGNFA 331 (542) T ss_pred hhhhcccCceeEeeccCCcccceeEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCCccc--------ccccH Confidence 886 5788887774 46799999985 799999999999999999999999999999776543 56789 Q ss_pred HHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeec--chhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCC Q lcl|NC_012530. 379 QNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFV--GGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKI 455 (559) Q Consensus 379 ~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~--~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi 455 (559) +++...|+++||+|++++||++||++|+++.+. .++|+|+ .+++.|..++ +..+++ |+||+||+|+. |+++ T Consensus 332 Eq~~~~f~~~tL~P~~~~ie~~ln~~L~~~~~~-~~~~~f~~~~ll~~d~~~~---~~~~v~~GilT~NE~Re~--L~g~ 405 (542) T protein:vir:41 332 EVTRRTYYESVVRPQQNIISSILTDFFQVKFNP-KTRFKFNDETLLESDSVRN---CALLVQSGVLTPAEARER--LFGL 405 (542) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcccccCC-ceEEEecchhhcchHHHHH---HHHHHhCCCCCHHHHHHh--hCCC Confidence 999999999999999999999999999886543 4666665 5566665544 344554 56899999974 3444 Q ss_pred CCCC-EeeccceecccccccccccccccccccccccccccCCCCCCCC---CCCCccccccchhcccccccccccccccc Q lcl|NC_012530. 456 AGGD-IILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTP---PTLPPSSSNSFQQNQEGYTGKDAKPSGKD 531 (559) Q Consensus 456 ~gGD-~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 531 (559) ++|| .++.|.++.......+ +..++..+............+...+ ......+.++..++..+...++..+.||- T Consensus 406 ~pgdd~~l~p~~~~~~~~~~~--~~n~~~~~~~~~~k~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 483 (542) T protein:vir:41 406 DGGPDIFMVPSKGAAKSVKRQ--ERNYEKNQIREIRKIYAKYRPRFNEIISSKLSAEEKKKKIDESLAEFRAEAYEAGKK 483 (542) T ss_pred CCCCccccccccccccccccC--CcCCCCCchhhhhhcccccCccccccccccccchhhcccccchhhhhHHhHHhcCce Confidence 4454 5555655532211111 1111111111000000111111110 11111122222223333333444444432 Q ss_pred ccccccccccccccchhhhhhccCCCCC Q lcl|NC_012530. 532 NQQGVGKDGQLKNKKNTNSYKQGGSSKK 559 (559) Q Consensus 532 ~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 559 (559) ..- ++-|-. +-..-|.+-.-.-++- T Consensus 484 ~~~-~~~~~~--~~~~~~~~~~~~~~~~ 508 (542) T protein:vir:41 484 MLI-IGGDMG--SMSALNQGVSVIPSKP 508 (542) T ss_pred EEE-eecCch--hhhhhhccceeccCCC Confidence 210 111111 1111111111111111 No 60 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=100.00 E-value=2.4e-74 Score=424.19 Aligned_cols=389 Identities=11% Similarity=0.059 Sum_probs=284.1 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+|||||+..+. +.+ ..+..+. ..+.. .++. . ........+ T Consensus 1 MGl~~~~~~~~~--------------------------~~~-~~~~~~~-~~~~~--------~~~~-~--~~~vt~~~a 41 (394) T protein:vir:62 1 MGLRDRFSNYLF--------------------------KKA-EKRGYLD-NVLGK--------SIRY-S--GVYVTDSNI 41 (394) T ss_pred Cchhhhhhhhcc--------------------------CCC-Cchhhhh-hhhhc--------cccc-C--ccccChhhh Confidence 999999964332 000 0000000 00000 0010 0 111234557 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.++.. +| .+. +.+.+..++.+| |++++ +++|++.++ T Consensus 42 l~~~~v~~~i~~Ia~~iA~lp~~v~~~-~g----------~~~------~~~~~~~Ll~~P--N~~~t---~~~f~~~~~ 99 (394) T protein:vir:62 42 LQSSDVYELLQDISNQMVLADIVVEDE-FG----------NEI------KDDIALQILRNP--NNYLT---QSEFIKLMT 99 (394) T ss_pred hccHHHHHHHHHHHHhhcccceEEEcC-CC----------ccc------chhhHHHHhccC--CCCCC---HHHHHHHHH Confidence 889999999999999999998765432 11 111 123344455554 44444 478999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) .+++++||+|++|.++..+ + +..|.+..++++. +.+..+ ...|+++||||++.++ .++ T Consensus 100 ~~lll~Gn~~~~i~~~~~~----~----~~~~~~~~~~~~~--------~~~~~~--~~~~~~~eiih~r~~~----~d~ 157 (394) T protein:vir:62 100 NTYLLEGETFPILNGAQIH----L----ASNVFTELDDNLV--------EHFNIG--GHEIPPCMIRHVKNIG----ADH 157 (394) T ss_pred HHHHhcCCeEEEEecceee----c----cccceEEECCceE--------EEEeeC--CEEechhheEEecCcC----CCC Confidence 9999999999998765433 2 2345566655442 112222 2468999999998643 245 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE 320 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g 320 (559) .+|+||+..+..+|..+.++++|+.++|+||++|+|+|++++... .++++++++++.|++.++|..|+|+++|++.+ T Consensus 158 ~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~---~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g 234 (394) T protein:vir:62 158 LRGKGILDLGRDTLEGVMSAEKTLTDKYKKGGLLTFLLNLDAHIN---PQNGAQSKLINAILDQLESIDEARSVKMIPLG 234 (394) T ss_pred ccccChHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCCCC---cCHHHHHHHHHHHHHHhccccccCceeEeeCC Confidence 789999999999999999999999999999999999999976532 35777899999999999999999999888654 Q ss_pred -ceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHH Q lcl|NC_012530. 321 -DAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIA 398 (559) Q Consensus 321 -~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie 398 (559) ++++.+++. +.|+||+|++++++++||++|||||++||.. .++|++++.+.|++.||+||+++|| T Consensus 235 ~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~-------------~~sn~e~~~~~~~~~~l~P~~~~ie 301 (394) T protein:vir:62 235 KGYSIDTLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTEL-------------IKEDIEKAMMYIHNKAVRPIMKNFE 301 (394) T ss_pred CceeEEecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCC-------------CCcCHHHHHHHHHHHHHHHHHHHHH Confidence 567778874 6899999999999999999999999999842 2468899999999999999999999 Q ss_pred HHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCC--CCCCEeeccceecccccccc Q lcl|NC_012530. 399 KNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKI--AGGDIILSAVYIQRLGQQEQ 475 (559) Q Consensus 399 ~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi--~gGD~~~~~~~~~~l~~~~~ 475 (559) ++|+++|+++.++.+++|+|+.....+..++++++..+++ |+||+||+|+++||||+ ++||+++++.++++++.... T Consensus 302 ~~l~~kll~~~~~~~~~~~fd~~~~~~~~~~~~~~~~~~~~g~~T~NE~R~~~gl~p~~~~~gd~~~~~~n~~~~~~~~~ 381 (394) T protein:vir:62 302 DHLSLLFYAQNSGKRIKFKINILDFVTYSNKTNIGYNLVRTAITSPDNVADMLGFPKQNTKESQAIYISNDVTEIGKKEA 381 (394) T ss_pred HHHhhhhcCccccCceEEEechhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeccccccccccccc Confidence 9999999988777889999998877777888888888886 56899999999999999 78999999988876643211 Q ss_pred cccccccccccccccccccCCCCCCC Q lcl|NC_012530. 476 IKQNEFQRQQTRLTQLESALQNPSGT 501 (559) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (559) .. ...++++.+++ T Consensus 382 ~~-------------~~~kgge~~en 394 (394) T protein:vir:62 382 TD-------------GSLGGGEENEN 394 (394) T ss_pred cc-------------ccCCCCCCCCC Confidence 00 00011111111 No 61 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=100.00 E-value=5.1e-73 Score=416.90 Aligned_cols=392 Identities=11% Similarity=0.044 Sum_probs=278.8 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+||+|... + .+......+ .+......+ .. ......+.+ T Consensus 1 M~~f~~~~~-------------------------~--~~~~~~~~~-----~~~~~~~~~-------~~--~~~v~~~~a 39 (397) T protein:vir:38 1 MPLLKLNKS-------------------------H--SQGFSLNDP-----DWVNFLTGG-------EA--QKYVSADTA 39 (397) T ss_pred Ccchhhhhc-------------------------c--cCcccCCch-----hhhhhhcCC-------cC--CceechHHh Confidence 888876510 0 000000001 111110000 00 111233567 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.. ++ +.+..++.+|+| ++ ++++|++.++ T Consensus 40 l~~~~V~~~v~~ia~~ia~~p~~~-------------~~------------~~~~~l~~~PN~--~~---s~~~f~~~~~ 89 (397) T protein:vir:38 40 LKNSDIFSLIMQLSGDLAMVRYTS-------------ES------------DRSQSIISNPSV--TA---NGYSFWQGMF 89 (397) T ss_pred hccHHHHHHHHHHHHHHhhCcccc-------------cc------------cHHHHHHhcCCC--CC---CHHHHHHHHH Confidence 889999999999999999988631 01 123345555544 44 4578999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEe---cCceeeeecccceEEEecccCCCc Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYI---DNKVRGSFTADEMGMFIRNPRSDI 237 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~---~~~~~~~~~~~evi~~~~n~~~~~ 237 (559) .+++++||||++++|+..|++++||||+|++|++..+.+|... .|.... .++....++++||||++++.. T Consensus 90 ~~lll~Gna~~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~~~~----~y~~~~~~~~~~~~~~~~~~eiih~~~~~~--- 162 (397) T protein:vir:38 90 AQLLLDGNCYAYRHKNTNGVDLSWEYLRPSQVQPMLLQDGSGL----IYNINFDEPAIGYMENVPAADVIHIRLLSK--- 162 (397) T ss_pred HHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceE----EEEEEeccccccceeEecCccEEEecCCCC--- Confidence 9999999999999999999999999999999999998877532 222221 133456799999999987542 Q ss_pred cCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc Q lcl|NC_012530. 238 LSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI 317 (559) Q Consensus 238 ~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl 317 (559) .+..||+|||.++..+|..+.++++++.++|+||++|+|+|+++. .+++++.+++++.|+..+++ .|+|+++|+ T Consensus 163 ~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~-----~~~~e~~~~~~~~~~~~~~~-~n~~~~~vl 236 (397) T protein:vir:38 163 NGGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQK-----GGLLDAETRIARSKEISKQI-HNSDGPVVI 236 (397) T ss_pred CCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC-----CCCHHHHHHHHHHHHHHhcc-cccCCceec Confidence 334689999999999999999999999999999999999999864 47788999999999887665 688887776 Q ss_pred cCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHH Q lcl|NC_012530. 318 TAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDM 396 (559) Q Consensus 318 ~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ 396 (559) +++++|++++. +.|+||+|++++++++||++|||||++||..+.+ .+|.++ ...|+.+||+||+.. T Consensus 237 -~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~-----------~~~~e~-~~~~~~~~l~P~~~~ 303 (397) T protein:vir:38 237 -DALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQ-----------QSSITQ-ISGQYAKSLNRYVQA 303 (397) T ss_pred -CCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc-----------ccHHHH-HHHHHHHHHHHHHHH Confidence 45699999985 6999999999999999999999999999965432 235554 456788999999999 Q ss_pred HHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccc Q lcl|NC_012530. 397 IAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQ 475 (559) Q Consensus 397 ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~ 475 (559) ||++||++|+++. ++++..+.+.|.+++++.++.+++ |+||+||+|+++|+||++|||.+.......+...... T Consensus 304 ie~~ln~~l~~~~-----~~~~~~~~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~~~~~~~~~~~ 378 (397) T protein:vir:38 304 IVGELNDKLHANI-----SANIRFAIDAMGDQYASTISSSVKGGTIAGNQARFILQNSGYLAKDLPDPEKEPQQAIQLIQ 378 (397) T ss_pred HHHHHHHhccChh-----cccccccccCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCccccccccccccccccc Confidence 9999999998853 344455667899999999998886 5689999999999999999997643322221111100 Q ss_pred cccccccccccccccccccCCCCCCCCCCCCccccc Q lcl|NC_012530. 476 IKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSN 511 (559) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (559) . .+++.+..+......++| T Consensus 379 ~-----------------~~g~~~~~~~~e~~~~~~ 397 (397) T protein:vir:38 379 Q-----------------EGGENDGNNSDERGSDPE 397 (397) T ss_pred c-----------------ccCCCCCCCCCCCCCCCC Confidence 0 000000000000000000 No 62 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=100.00 E-value=1.8e-73 Score=419.33 Aligned_cols=394 Identities=14% Similarity=0.075 Sum_probs=278.3 Q ss_pred HHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhh Q lcl|NC_012530. 22 HIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYA 101 (559) Q Consensus 22 ~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~ 101 (559) |--.+|+.+.... |+ ++... + .+.. ......|. ...+.++.+++|++||++||+.||++| T Consensus 1 mg~~~~~~~~~~~---~~-~~~~~-----~--~~~~-~~~~~~~~--------~t~~~~~~~~~v~~cv~~Ia~~ia~~p 60 (403) T protein:vir:10 1 MGFKSWITEKLNP---GQ-RIIRD-----M--EPVS-HRTNRKPF--------TTGQAYSKIEILNRTANMVIDSAAECS 60 (403) T ss_pred Ccchhhhhhccch---hh-hhhhc-----c--cccc-cccCCccc--------ccHHHHHHHHHHHHHHHHHHHHHhhCc Confidence 2223333222211 00 11000 0 0000 00000011 123566789999999999999999999 Q ss_pred hHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcE Q lcl|NC_012530. 102 HRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRL 181 (559) Q Consensus 102 ~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~ 181 (559) +.++....... + ......+.+. .|++..||++++ +++|++.++.+++++||+|+++.+ T Consensus 61 ~~v~~~~~~~~------~------~~~~~~~~l~-~lL~~~PN~~~t---~~~f~~~~~~~~ll~Gnayi~~~~------ 118 (403) T protein:vir:10 61 YTVGDKYNIVT------Y------ANGVKTKTLD-TLLNVRPNPFMD---ISTFRRLVVTDLLFEGCAYIYWDG------ 118 (403) T ss_pred eeEeecccccc------c------ccccccchHH-HHHhhCCCCCCC---HHHHHHHHHHHHhhcCCeEEEEeC------ Confidence 77643322110 0 0001112233 344555666665 468999999999999999987643 Q ss_pred EEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEeccc-CCCccCCcccccHHHHHHHHHHHHHHH Q lcl|NC_012530. 182 SHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNP-RSDILSGGYGLSELEMGLREFISHENT 260 (559) Q Consensus 182 ~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~-~~~~~~~~~G~Spl~~~~~~i~~~~~~ 260 (559) ..|++|+|..|++..+..+.. .++ . .+. ...+.++||+|++.+. .....++.+|+||+.+++.++..+.++ T Consensus 119 ~~l~~l~~~~~~v~~~~~~~~-----~~~-~-~~~-~~~~~~~eiih~~~~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~ 190 (403) T protein:vir:10 119 TSLYHVPAALMQVEADANKFI-----KKF-I-FNN-QINYRVDEIIFIKDNSYVCGTNSQISGQSRVATVIDSLEKRSKM 190 (403) T ss_pred ceeEeecCcceEEEEcCCceE-----EEE-E-ecC-ceeecccceEEecccccccCCCCCcccccHHHHHHHHHHHHHHH Confidence 268999999999987765432 122 1 122 2457889999998543 233446789999999999999999999 Q ss_pred HHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccc---cchhHHHH Q lcl|NC_012530. 261 ELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQ---AEDMQFQS 337 (559) Q Consensus 261 ~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~---~~D~qf~e 337 (559) ++|+.++|+||++|+|||+++ +.+++++++++++.|++.++|..|+|+++||+ ++++|++++. +.|+||+| T Consensus 191 ~~~~~~~f~ng~~~~gil~~~-----~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~-~g~~~~~~~~~~~~~d~q~~e 264 (403) T protein:vir:10 191 LNFKEKFLDNGTVIGLILETD-----EILNKKLRERKQEELQLDYNPSTGQSSVLILD-GGMKAKPYSQISSFKDLDFKE 264 (403) T ss_pred HHHHHHHHhccCCcceEEEeC-----CCCCHHHHHHHHHHHHHHhCCcccCcceeecC-CCceeEEecccCCHHHHHHHH Confidence 999999999999999999875 46899999999999999999999999988875 4699999873 57999999 Q ss_pred HHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceee Q lcl|NC_012530. 338 WLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLE 417 (559) Q Consensus 338 ~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~ 417 (559) ++++++++||++|||||++||.. +++|++++.+.|+++||.||+++||++|+++|. ..+.|+ T Consensus 265 ~~~~~~~~Ia~~fgVPp~~lg~~-------------~~sn~e~~~~~f~~~tl~P~~~~ie~~l~~~L~-----~~~~~d 326 (403) T protein:vir:10 265 DIEGFNKSICLAFGVPQVLLDGG-------------NNANIRPNIELFYYMTIIPMLNKLTSSLTFFFG-----YKITPN 326 (403) T ss_pred HHHHHHHHHHHHhCCCHHHcCCC-------------CCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-----ceeeec Confidence 99999999999999999999731 346889999999999999999999999999873 346667 Q ss_pred ecch--hhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCC--CCCEeeccceeccccccccccccccccccccccccc Q lcl|NC_012530. 418 FVGG--DTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIA--GGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLE 492 (559) Q Consensus 418 f~~l--~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~--gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 492 (559) ++.+ ++.|.+++++++..+++ |+||+||+|+++|+||++ +||+++.|.++....... . . T Consensus 327 ~~~~~~l~~D~~~~~~~~~~~~~~G~lT~NE~R~~~gl~pi~~~~~d~~~~p~n~~~~~~~~-~-~-------------- 390 (403) T protein:vir:10 327 TKEVAALTPDKEAEAKHLTSLVNNGIITGNEARSELNLEPLDDEQMNKIRIPANVAGSATGV-S-G-------------- 390 (403) T ss_pred cchhhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCcccccccccccccccccccC-C-C-------------- Confidence 7644 78899999999998885 579999999999999994 799999988775321100 0 0 Q ss_pred ccCCCCCCCCCCCCccccc Q lcl|NC_012530. 493 SALQNPSGTPPTLPPSSSN 511 (559) Q Consensus 493 ~~~~~~~~~~~~~~~~~~~ 511 (559) ++..+ + +...+++ T Consensus 391 ---~e~~~--~-~~~~~g~ 403 (403) T protein:vir:10 391 ---QEGGR--P-KGSTEGD 403 (403) T ss_pred ---CcCCC--C-CCCcCCC Confidence 00000 0 0000000 No 63 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=100.00 E-value=2.8e-72 Score=412.83 Aligned_cols=380 Identities=11% Similarity=0.022 Sum_probs=281.7 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) ||||+++... -.+ .+....+.. ...+.... ... .......+.+ T Consensus 1 Mg~~~~~~~~--k~~------------------------~~~~~~~~~-~~~~~~~~--------~~~--~~~~v~~~~~ 43 (383) T protein:vir:10 1 MGLLTPKNFS--KRN------------------------AKNMVYPSN-PAFFTTTV--------GGM--QLSYVSALSA 43 (383) T ss_pred CCcccccccc--ccc------------------------ccccccccc-hhhhhhhc--------cCc--cccccchhHh Confidence 9999975111 000 001111110 00010000 000 0111223567 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.+.. +....+|.+||| ++ ++++|++.++ T Consensus 44 l~~~~v~~~i~~ia~~ia~~~~~~~~-------------------------~~~~~ll~~PN~--~~---t~~~f~~~~~ 93 (383) T protein:vir:10 44 LQNTNVYSVINRIASDVSSAHFKTEN-------------------------TATLNRLESPSS--LI---GRFSFWQGAL 93 (383) T ss_pred hcchHHHHHHHHHHHhhccCceeecc-------------------------cchhhhhhCCCC--CC---CHHHHHHHHH Confidence 88999999999999999998764321 112235665544 44 4578999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) .+++++||+|++++|+ +.+++|+++.+|++..+.++.. +++....++....|+++||||++... .+..++ T Consensus 94 ~~l~l~Gn~~~~i~~~----~~~~~p~~~~~v~~~~~~~~~~-----~~~~~~~~~~~~~~~~~evih~r~~~-~~~~~~ 163 (383) T protein:vir:10 94 MQLCLSGNDYIPLVGQ----NLEHIPNSDVQINYLPGNMGIV-----YTVLESNDRPKMVLRQDQMLHFRLMP-DPQYRY 163 (383) T ss_pred HHhhhcCCeEEEEEcC----ceeEeecCcceEEEEEcCCceE-----EEEEEcCCceEEEEcccceEEeccCC-CCcccc Confidence 9999999999999875 4678999999998887665432 33444556677889999999997543 344567 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE 320 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g 320 (559) .||+|||.++..+|..+.++++++.++|+||++|+|+|++++. ..++++.+++++.|++.++| .|+|+++||++ T Consensus 164 ~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~----~~~~e~~~~~~~~~~~~~~~-~n~~~~~vl~~- 237 (383) T protein:vir:10 164 LIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNY----LSDGKDLESAREEFEKANTG-DNSGRLMVLPD- 237 (383) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCC----CCCHHHHHHHHHHHHHHhCc-cccCCccccCC- Confidence 8999999999999999999999999999999999999998753 24688999999999999887 68999888854 Q ss_pred ceeeeeccc-cchhHHH-HHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHH Q lcl|NC_012530. 321 DAKFVSMTQ-AEDMQFQ-SWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIA 398 (559) Q Consensus 321 ~~~~~~ls~-~~D~qf~-e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie 398 (559) +++|++++. +.|+||+ |++++++++||++|||||++||+.+.++ .+++|++++...|. +||.||++.|| T Consensus 238 g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~--------~~~sn~eq~~~~~~-~~l~P~~~~ie 308 (383) T protein:vir:10 238 GFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTE--------SQHSNIDQIKATYL-ANLNSYVNPIV 308 (383) T ss_pred CceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCC--------CccccHHHHHHHHH-HHHHHHHHHHH Confidence 599999985 6899975 9999999999999999999999755332 34677888776555 69999999999 Q ss_pred HHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccc Q lcl|NC_012530. 399 KNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIK 477 (559) Q Consensus 399 ~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~ 477 (559) ++|+++|+.+ .++|+++.+++.|.+++++++..+++ |+||+||+|+++|++|+++||.+....+..+.. + T Consensus 309 ~~l~~~l~~~----~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~~~~~~~-----g 379 (383) T protein:vir:10 309 DELRLKMNAP----DLELDIKDMLDVDDSILINQVSNLAKSGVLGAEQAQFILTRSGFLPDNLPEFKPLTNETK-----G 379 (383) T ss_pred HHHHHhhCCc----eEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCcccCCcccccCCCcccCC-----C Confidence 9999999753 48888889999999999999998886 568999999999999999999764332221110 0 Q ss_pred cccc Q lcl|NC_012530. 478 QNEF 481 (559) Q Consensus 478 ~~~~ 481 (559) .+++ T Consensus 380 Gd~e 383 (383) T protein:vir:10 380 GDDK 383 (383) T ss_pred CCCC Confidence 0000 No 64 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=100.00 E-value=2e-72 Score=413.68 Aligned_cols=343 Identities=10% Similarity=0.046 Sum_probs=265.6 Q ss_pred HHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEEC Q lcl|NC_012530. 97 VTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYD 176 (559) Q Consensus 97 ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd 176 (559) ||++|+.+++.... .. +.+. .|++..||++++ +++||+.++.+++++||+|++++|+ T Consensus 1 ia~lp~~~~~~~~~-------------~~------~~l~-~lL~~~PN~~~t---~~~f~~~~~~~l~l~Gna~~~i~r~ 57 (348) T protein:vir:93 1 MASLPLKMYEDYKV-------------VN------TEVS-DLLTVSPNNSLS---SFDFINQIETIRNEKGNAYVLIERD 57 (348) T ss_pred CcccceEeEecCcC-------------cc------cHHH-HHHHhCCCCCCC---HHHHHHHHHHHHhhcCCeEEEEEEC Confidence 88888765442211 11 2233 344445666654 4689999999999999999999999 Q ss_pred CCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHH Q lcl|NC_012530. 177 SNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFIS 256 (559) Q Consensus 177 ~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~ 256 (559) ..|+|++||||+|++|++..+.++.. ..|.....++....|+++||||++.++. .++.||+||+..++.++.. T Consensus 58 ~~G~~~~L~~l~~~~v~~~~~~~~~~----~~y~~~~~~g~~~~~~~~eiih~r~~~~---~~~~~G~s~~~~~~~~i~~ 130 (348) T protein:vir:93 58 IYHQPSKLFLLNPDVVEMLIENQSRE----LYYSIHAATGNKLIVHNMDMLHFKHIVA---SNMVQGISPIDVLKNTTDF 130 (348) T ss_pred CCCcEEEEEEEcCCceEEEEeCCCcE----EEEEEEcCCCeEEEEccccEEEecCCCC---CCceeeccHHHHHHHHHHH Confidence 99999999999999999988876643 2344444445566799999999986432 3567899999999999999 Q ss_pred HHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccc-cchhHH Q lcl|NC_012530. 257 HENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQ-AEDMQF 335 (559) Q Consensus 257 ~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~-~~D~qf 335 (559) +.++++++ ++.++..++++++.+ +.+++++++++++.|++.+. |+|+++|++ ++++|++++. ++|+|| T Consensus 131 ~~~~~~~~--~~~~~~~~~~i~~~~-----~~l~~e~~~~~~~~~~~~~~---n~~~~~vl~-~g~~~~~l~~~~~d~q~ 199 (348) T protein:vir:93 131 DNAVRTFN--LTEMQKPDSFMLKYG-----SNVSTEKRQQVLEDFKQYYE---ENGGILFQE-PGVEIEPLPKKYVSEDI 199 (348) T ss_pred HHHHHHHH--HHhcCCCceeEEecC-----CCCCHHHHHHHHHHHHHHhh---cCCCeeecC-CCceEEEcCCChhHHHH Confidence 99999886 344444456666543 46899999999999999874 567877774 5699999985 799999 Q ss_pred HHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccC---c Q lcl|NC_012530. 336 QSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILG---D 412 (559) Q Consensus 336 ~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~---~ 412 (559) +|++++++++||++|||||++||..+.+ +++|++++.+.|+++||+||++.||++||++||+..+. . T Consensus 200 ~e~~~~~~~~Ia~~fgVP~~~lg~~~~~----------~~~~~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~g~ 269 (348) T protein:vir:93 200 VASENLTRERVANVFQLPSIFLNARSNT----------NFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNR 269 (348) T ss_pred HHHHHHHHHHHHHHhCCCHHHhCCCCCC----------CcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccCcc Confidence 9999999999999999999999965433 56789999999999999999999999999999987653 3 Q ss_pred cceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccc Q lcl|NC_012530. 413 NYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQL 491 (559) Q Consensus 413 ~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 491 (559) .++|+++.+++.|.+++++++..++. |+||+||+|+++|+||+||||+++++.++.+++....... T Consensus 270 ~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~------------- 336 (348) T protein:vir:93 270 YFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPIDTPLELRK------------- 336 (348) T ss_pred eEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcCeEeecccccccccchhhcc------------- Confidence 45566678899999999999999986 5689999999999999999999999999987755322110 Q ss_pred cccCCCCCCCCC Q lcl|NC_012530. 492 ESALQNPSGTPP 503 (559) Q Consensus 492 ~~~~~~~~~~~~ 503 (559) ..+.++.+..++ T Consensus 337 ~~~gg~~n~~~~ 348 (348) T protein:vir:93 337 SLKGGDKNVNES 348 (348) T ss_pred cccCCCCCcCCC Confidence 001111111111 No 65 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=100.00 E-value=3.8e-72 Score=412.15 Aligned_cols=479 Identities=16% Similarity=0.132 Sum_probs=305.3 Q ss_pred HHHHH--HHHHHHhhhhccccccc-----cccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHH Q lcl|NC_012530. 22 HIDSK--IANDTASKALNGVDRAY-----TEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRA 94 (559) Q Consensus 22 ~~~~~--~~~~~~~~~~~gr~~a~-----~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia 94 (559) +-+++ +...+ -++....-+|. ....+.+..+ +.....-.|++ +...|...+..+++|++||++++ T Consensus 1 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~p~~----~~~~L~~~~e~~~~~~~~i~~~~ 72 (651) T protein:vir:99 1 MTDTTGETQETK-VHVEGLGGEADLAKSPNSTQIPDHRI---QSHNVGVNPPY----NPDRLAAFLELNETLATGIRKKS 72 (651) T ss_pred CCCccceeeeeE-EEeecccccccccccccccccchhhh---cccCCCCCCCC----CHHHHHHHHhcChHHHHHHHHHh Confidence 11111 00000 00000000111 1111222222 11111222322 45666666777999999999999 Q ss_pred HHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCC------CCCChhhHHHHHHHHHHHHHHcCC Q lcl|NC_012530. 95 NQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVD------YSPIRDDFTSFLRKLVRDTYTYDQ 168 (559) Q Consensus 95 ~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~------~~~~~~~~~~f~~~~v~d~ll~Gn 168 (559) ++|| |.||+++.+.....++...++...+.+|++++.+. ..+...++.+|++.++.|++.+|| T Consensus 73 ~~ia-----------g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGn 141 (651) T protein:vir:99 73 RYEV-----------GFGFDLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGW 141 (651) T ss_pred hhhh-----------ccCceeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhh Confidence 9998 45667776655555555666677788888765432 223335778999999999999999 Q ss_pred cceEEEECCCCcEEEEEEecCceEEEEecCcccc--------------------------------cc--cceE------ Q lcl|NC_012530. 169 VNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHR--------------------------------RT--RGKI------ 208 (559) Q Consensus 169 a~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~--------------------------------~~--~~~~------ 208 (559) +|++++++..|+|+.|+++++..+++..+..... +. .+.. T Consensus 142 a~ieiIrn~~g~pv~L~~lp~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~ 221 (651) T protein:vir:99 142 LALEMLTDIEGRPVGLAYVPARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEV 221 (651) T ss_pred HhhhhhhcCccchhhhhhcChhheeeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccceee Confidence 9999999999999999999999887644321100 00 0000 Q ss_pred -------------------------------EEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHH Q lcl|NC_012530. 209 -------------------------------YRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISH 257 (559) Q Consensus 209 -------------------------------y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~ 257 (559) .+...+......++++||||++.+. ..++.||+|||..++.+|.++ T Consensus 222 ~~~~~~~~v~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~eViHir~~~---~~~g~~G~spl~~a~~~i~~a 298 (651) T protein:vir:99 222 VIDESGDEPTIRYREDEESEREPIFVDRETGDVTTGDANGLENRPANELIFIPNPS---ILEDDYGVPDWVSAIRTISAD 298 (651) T ss_pred eeccCCcceeEEeccCcceeeeeecccceeeeEEEcCCCceeEecccceEEecCCC---CCCCcccccHHHHHHHHHHHH Confidence 0111122334567899999997542 346779999999999999999 Q ss_pred HHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC----------ceeeeec Q lcl|NC_012530. 258 ENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE----------DAKFVSM 327 (559) Q Consensus 258 ~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g----------~~~~~~l 327 (559) .++++|+.++|+||++|+|||++++ +.++++++++|+++|++.++ |+|+++||+.+ +++|+++ T Consensus 299 ~~a~~~~~~~f~NG~~p~gil~~~~----~~ls~e~~~~lr~~~~~~~~---nagk~~vL~~~~~~~~~~~~~g~~~~pl 371 (651) T protein:vir:99 299 EAAKDYNRDFFDNDTIPRMVIKVTG----GELSEESKRDLRQMLNGLRE---ESHRAVVLEVEKFQSQLDEDVEIELEPM 371 (651) T ss_pred HHHHHHHHHHHhccCCCceEEEecC----CCCCHHHHHHHHHHHHHHhc---cCCceEEeecccccccccccCCceEEEc Confidence 9999999999999999999999853 35899999999999998653 67898888642 7999999 Q ss_pred cc-c-chhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhc Q lcl|NC_012530. 328 TQ-A-EDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGI 405 (559) Q Consensus 328 s~-~-~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L 405 (559) +. + +|+||+|++++++++||++|||||++||+.+.+ +++|++++.+.|+++||+||+++||++||++| T Consensus 372 s~~~~~D~qfle~r~~~~~eIa~afgVPp~~lG~~~~~----------~~sn~E~~~~~f~~~tL~P~~~~ie~eln~kL 441 (651) T protein:vir:99 372 GQGISEEMDFRQFREKNEHEIAKVLEVPPVKIGVTDSA----------NRSNSDQQDKDFALEVIQPEQHTFAEWLYQII 441 (651) T ss_pred CcCchhhHHHHHHHHHHHHHHHHHhCCCHHHhccCCCC----------CcccHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 85 4 699999999999999999999999999987653 46789999999999999999999999999999 Q ss_pred cccccC---ccceeeec--chhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCC--CCCEeeccceecccccccccc Q lcl|NC_012530. 406 IRQILG---DNYMLEFV--GGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIA--GGDIILSAVYIQRLGQQEQIK 477 (559) Q Consensus 406 ~~~~~~---~~~~~~f~--~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~--gGD~~~~~~~~~~l~~~~~~~ 477 (559) +++.++ ..++|+|+ .+++.|.+++++++..+++ |+||+||+|+++||||++ +||.++.+.+....+...... T Consensus 442 l~~~e~~~~~~i~~ef~~~~llr~D~~~~~e~~~~~i~~G~~T~NE~R~~lglppi~~~~gd~~l~~~~~~~~g~~~~gg 521 (651) T protein:vir:99 442 HQQALGVTDWTIEYELRGADQPKQEAQLAEQRVRAMRLAGVGLVDEAREELGLDPLGEPYGEMTLSEFEAEVAGDVAGGG 521 (651) T ss_pred cCccccccCceEEEEeccchhhhccHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccccccccccccccccCC Confidence 987653 24566665 5889999999999998886 579999999999999995 488877665443332211000 Q ss_pred cccccccccccccccccCCCCCCCCCCCCccccccchhcccc-ccccccc---cccccccccccccccccccchhhhhh- Q lcl|NC_012530. 478 QNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEG-YTGKDAK---PSGKDNQQGVGKDGQLKNKKNTNSYK- 552 (559) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~---~~g~~~~~~~~~~~~~k~~~~~~~~~- 552 (559) . +....+++.+....+.+.+...+ +..++.. +--+.+-..++=|...+-... .+. T Consensus 522 e------------------~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~~~~v~ss~~~~~gyd~~~~~l~~--~f~~ 581 (651) T protein:vir:99 522 E------------------TEAVHEPPEENKIGEREWDTVKSELTTKDPIEQMQFSSSNLDEGLYDFGENELYL--SFLR 581 (651) T ss_pred C------------------CcccccCccccccccchhhhhhhhhcccchhhhhhHHHHHHHhhcCCCccceEEE--EEee Confidence 0 00000000000011111111100 0101000 000000111111111000000 000 Q ss_pred ccCCCCC Q lcl|NC_012530. 553 QGGSSKK 559 (559) Q Consensus 553 ~~~~~~~ 559 (559) .+++|-- T Consensus 582 ~~~~~~~ 588 (651) T protein:vir:99 582 DEGQSSL 588 (651) T ss_pred cCCCCce Confidence 0000000 No 66 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=100.00 E-value=9e-72 Score=410.08 Aligned_cols=380 Identities=11% Similarity=0.020 Sum_probs=278.9 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+||+||.-.. .+. . ..++.. .+..+..... +. . ......+.| T Consensus 1 Mg~~~~~~~~~-~~~-----------------~------~~~~~~---~~~~~~~~~~-~~-~--------~~~v~~~~a 43 (385) T protein:vir:10 1 MGLLTPRNFNK-RKA-----------------K------NMVYPS---NPAFFTTTVG-GM-Q--------LSYVSALSA 43 (385) T ss_pred Cccccchhccc-ccc-----------------c------cccccc---chhhhhhhcc-cc-C--------ccccCHHHh Confidence 99999873110 000 0 000000 0000001000 00 0 011223567 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +++++|++||++||+.||++|+.+.. +....+|.+||| ++ ++++|+++++ T Consensus 44 l~~~~v~~~i~~ia~~ia~~p~~v~~-------------------------~~~~~ll~~PN~--~~---t~~~f~~~~~ 93 (385) T protein:vir:10 44 LQNTNVYSVINRIASDVASAHFKTEN-------------------------TATLNRLESPSS--LI---GRFSFWQGAL 93 (385) T ss_pred hccHHHHHHHHHHHHHHhhCceeeec-------------------------cchhhhhhcCCC--CC---CHHHHHHHHH Confidence 88999999999999999998864321 112335555544 44 4478999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) .+++++||+|++++|+ +.+++|+++.+|++..+..+.. +++....++....|+++||||+++.. .+..++ T Consensus 94 ~~l~l~Gn~~~~i~r~----~~~~~p~~~~~v~~~~~~~~~~-----~~~~~~~~~~~~~~~~~eiihik~~~-~~~~~~ 163 (385) T protein:vir:10 94 MQLCLSGNDYIPLVGQ----NLEHIPNSDVQINYLPGNMGIV-----YTVLESNDRPQMVLRQDQMLHFRLMP-DPQYRY 163 (385) T ss_pred HHhhhcCCeEEEEEcC----ceeEeecCCceEEEEEcCCceE-----EEEEEcCCceEEEEccccEEEeccCC-CCcccc Confidence 9999999999999986 4679999999999887765532 23333455567789999999998643 344567 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE 320 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g 320 (559) .||+||+..++.+|..+.++++++.++|+||++|+|+|++++. ..++++++++++.|++.++| .|+|++++|++ T Consensus 164 ~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~----~~~~e~~~~~~~~~~~~~~~-~n~~~~~vl~~- 237 (385) T protein:vir:10 164 LIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNY----LSDGKDLESAREEFEKANTG-DNSGRLMVLPD- 237 (385) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCC----CCCHHHHHHHHHHHHHHhCc-cccCCccccCC- Confidence 7899999999999999999999999999999999999998743 24678999999999999887 68999888855 Q ss_pred ceeeeeccc-cchhHHH-HHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHH Q lcl|NC_012530. 321 DAKFVSMTQ-AEDMQFQ-SWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIA 398 (559) Q Consensus 321 ~~~~~~ls~-~~D~qf~-e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie 398 (559) |++|++++. +.|+||+ |++++++++||++|||||++||..+.+. .+++|++++... +.+||.||++.|| T Consensus 238 g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~--------~~~sn~eq~~~~-~~~~l~P~~~~ie 308 (385) T protein:vir:10 238 GFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTE--------SQHSNIDQIKAT-YLANLNSYVNPIV 308 (385) T ss_pred CceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCC--------cccccHHHHHHH-HHHHHHHHHHHHH Confidence 599999985 6899975 9999999999999999999999755332 245778766554 4569999999999 Q ss_pred HHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCC--CEeeccceecccccccc Q lcl|NC_012530. 399 KNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGG--DIILSAVYIQRLGQQEQ 475 (559) Q Consensus 399 ~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gG--D~~~~~~~~~~l~~~~~ 475 (559) ++|+++|+++ .++|+++.+++.|.+++++++..++. |+||+||+|+++|++|+|+| |.+..+.+.... T Consensus 309 ~~l~~~l~~~----~~~f~~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~p~~~~~~~~~~~~~~~~----- 379 (385) T protein:vir:10 309 DELRLKMNAP----DLELDIKDMLDVDDSALINQVSNLAKSGVLGAEQAQFILTRSGFLPDNLPEFKPLTTQVKG----- 379 (385) T ss_pred HHHHHhhCCc----eEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCccCCCCCccccCcccccCC----- Confidence 9999999864 47888889999999999999998886 56899999999999999754 444433322100 Q ss_pred cccccccccccccccccccCCCCCCC Q lcl|NC_012530. 476 IKQNEFQRQQTRLTQLESALQNPSGT 501 (559) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (559) +. .+++ T Consensus 380 --g~------------------~~dn 385 (385) T protein:vir:10 380 --GD------------------EGDN 385 (385) T ss_pred --CC------------------CCCC Confidence 00 0000 No 67 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=100.00 E-value=4.1e-71 Score=406.49 Aligned_cols=356 Identities=11% Similarity=0.069 Sum_probs=264.4 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |++|.+|.... ...+.. +......+ . .+..+.....+.| T Consensus 1 M~~~~~f~~r~-------------------------------~~~~~~----~~~~~~~~---~---~~~~~~~v~~~~a 39 (359) T protein:vir:10 1 MSILNPFERRS-------------------------------SITPNN----YYPFMVQN---G---SIVPNSLVDATEA 39 (359) T ss_pred Ccccchhhccc-------------------------------cCCCCc----chhhhhcc---c---cccCCcccCHHHh Confidence 99999882110 000000 00000000 0 1111222344567 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +++++|++||++||++||++|+. ..+....++.+|| ++++ .++||+.++ T Consensus 40 l~~~av~~cv~~ia~~ia~~p~~--------------------------~~~~~~~L~~~PN--~~~t---~~~f~~~~~ 88 (359) T protein:vir:10 40 LKNSDLYAVTSLISSDIAGTRFI--------------------------GNQVFTSVLNNPS--HLTN---AFSFWQTAI 88 (359) T ss_pred hcchHHHHHHHHHHHhhhcCccc--------------------------cchHHHHHhhccc--ccCC---HHHHHHHHH Confidence 88999999999999999988752 0112334455544 4444 468999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccC-CCccC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPR-SDILS 239 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~-~~~~~ 239 (559) .+++++||+|++|+|+.+|+|++||||+|++|++..+.++.. +.+....++....++++||||++.+.. .+..+ T Consensus 89 ~~lll~Gnay~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~~~-----y~~~~~~~~~~~~~~~~evih~~~~~~~~~~~d 163 (359) T protein:vir:10 89 LNLLLNGNVFLAILKGDNSLMKELRLIPSNAITIDLTDDTLT-----YEVNQFDDYPSAKYNASEMIHVKIMAYGVDTLH 163 (359) T ss_pred HhccccCceEEEEEECCCCeEEEEEEeCCceEEEEEcCCeEE-----EEEEecCCceEEEEcccceEEeccCCCCCCccC Confidence 999999999999999999999999999999999977654311 112233456677899999999986532 33457 Q ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccC Q lcl|NC_012530. 240 GGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITA 319 (559) Q Consensus 240 ~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~ 319 (559) +.+|+||+.+++.+|..+.++++|+.++|+||++|+|||+++. +.+++++++++++.|++.++| .|+|+++||++ T Consensus 164 g~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~----~~l~~e~~~~~~~~~~~~~~~-~n~g~~~vl~~ 238 (359) T protein:vir:10 164 NLVGHSPLESLTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQ----GTLSSEAKDSIRKEFEKANGG-NNSGRVMVLDQ 238 (359) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC----CCCCHHHHHHHHHHHHHHhCc-cccCCceecCC Confidence 7899999999999999999999999999999999999999864 257899999999999887655 79999988854 Q ss_pred Cceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHH Q lcl|NC_012530. 320 EDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIA 398 (559) Q Consensus 320 g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie 398 (559) +++|++++. +.|+||+|++++++++||++|||||++||..+... .+++|++++...|+..+|.||+..|+ T Consensus 239 -g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~--------~~~~~~e~~~~~~l~~~l~p~~~~l~ 309 (359) T protein:vir:10 239 -SADFSTVSINADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQQ--------SSLDQIKDLYVNALNRFIEPLISELR 309 (359) T ss_pred -CcceeeecCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCccc--------ccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 599999985 79999999999999999999999999998654322 25788899999999999999999998 Q ss_pred HHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHH-cCCCCHHHHHHHhCCCCCC Q lcl|NC_012530. 399 KNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLEL-QTATTVNDYREKQGLPKIA 456 (559) Q Consensus 399 ~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~-~~~~T~NE~R~~~gl~pi~ 456 (559) ..|++++.... ...++|+ .......+..++ .|+||+||+|+++|+|||= T Consensus 310 ~~l~~~~~~~~---~~~~~~d------~~~~~~~~~~~~~~G~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 310 IKCDSSIGVDM---SPITDYS------NSVFKADILNWVKEGIIEPTEAKTLLESKGII 359 (359) T ss_pred HHhhhhhcccc---hhhhhcC------HHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 88887653321 2223332 122223344455 4679999999999999985 No 68 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=100.00 E-value=7.6e-71 Score=405.01 Aligned_cols=371 Identities=12% Similarity=0.032 Sum_probs=267.4 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+||+.|+..-.. . + ........ ..+.+..+. +......+....-+.| T Consensus 3 m~~~~~~~~~~~~---~---------------~------~~~~~~~~------~~~~~~~~~--~~~~~~~g~~v~~~~a 50 (392) T protein:vir:74 3 LPILNFINQTNDP---P---------------E------AGSVQSYF------PDGNDAQIM--ESLLGDNNEWVSARAA 50 (392) T ss_pred chhhhhhhcccCc---c---------------c------cccccccc------ccCchhhhh--hhccCCCCcccchhhh Confidence 9999888421000 0 0 00000000 000000000 0001111222334667 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.++... ...++.+||+ ++ +.++|++.++ T Consensus 51 l~~~~v~~~v~~ia~~ia~lp~~~~~~~-------------------------~~~l~~~PN~--~~---t~~~f~~~~~ 100 (392) T protein:vir:74 51 LRNSDLFSIILQLSSDLAIVKINAEKKK-------------------------NQGIIDNPST--NA---NKHGFWQSMF 100 (392) T ss_pred hcchHHHHHHHHHHHhhccCceeeccch-------------------------hhhhhhhcCC--CC---CHHHHHHHHH Confidence 8999999999999999999887543211 1124555554 44 4478999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEec---CceeeeecccceEEEecccCCCc Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYID---NKVRGSFTADEMGMFIRNPRSDI 237 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~---~~~~~~~~~~evi~~~~n~~~~~ 237 (559) .+++++||+|++++|+.+|++++||||+|++|++..+.+|... .|..... ......++++||||++.++.. T Consensus 101 ~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~~----~y~~~~~~~~~~~~~~~~~~evih~~~~~~~-- 174 (392) T protein:vir:74 101 AQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGM----YYNITFDDPKIEPILQAPQSDLIHMKLLSID-- 174 (392) T ss_pred HHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceE----EEEEEecCCccceeEEEcCccEEEecCCCCC-- Confidence 9999999999999999999999999999999999988776432 2222222 223567999999999865422 Q ss_pred cCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc Q lcl|NC_012530. 238 LSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI 317 (559) Q Consensus 238 ~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl 317 (559) ...||+|||.+++.+|.++.++++|+.++|+||++|+|+|++++... .+++ .++.|.+.+.|..|+|+++|| T Consensus 175 -~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~---~~~~----~~~~~~~~~~~~~n~g~~~vl 246 (392) T protein:vir:74 175 -GGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGL---LSDK----DKASRSRSFMKRSRSGGPVVL 246 (392) T ss_pred -CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCC---chHH----HHHHHHHHHhccccCCCeeec Confidence 34689999999999999999999999999999999999999875421 2333 346666778888999998887 Q ss_pred cCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHH Q lcl|NC_012530. 318 TAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDM 396 (559) Q Consensus 318 ~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ 396 (559) + ++++|++++. ++|+||+|++++++++||++|||||++||+...++ +..++.++|+++||.||++. T Consensus 247 ~-~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~------------~~~e~~~~~~~~~l~p~~~~ 313 (392) T protein:vir:74 247 D-DLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQ------------SSIQQISGMYASALNRYLRP 313 (392) T ss_pred C-CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc------------cHHHHHHHHHHHHHHHHHHH Confidence 5 5599999985 79999999999999999999999999999754321 23456778999999999999 Q ss_pred HHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHcC-CCCHHHHHHHh--------------CCCCCCCCCEe Q lcl|NC_012530. 397 IAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQT-ATTVNDYREKQ--------------GLPKIAGGDII 461 (559) Q Consensus 397 ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~-~~T~NE~R~~~--------------gl~pi~gGD~~ 461 (559) ||++|+++|++. ++|++..+.+.|..++++.+..++++ ++|+||+|+++ |+||++|||. T Consensus 314 ie~~l~~~l~~~-----~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~pne~r~~enl~~~~~Gd~- 387 (392) T protein:vir:74 314 AISELEYKLSDH-----ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPAPENTNKKTTGQS- 387 (392) T ss_pred HHHHHHHhccch-----hcccchhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCCccccchhcCCCCCCCCCC- Confidence 999999999864 45666677788888888888888764 57999998876 4555555442 Q ss_pred eccceecccccccccccccccccccccccccccCCCCCCCCC Q lcl|NC_012530. 462 LSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPP 503 (559) Q Consensus 462 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (559) +++-| T Consensus 388 -------------------------------------~~p~p 392 (392) T protein:vir:74 388 -------------------------------------NEPVP 392 (392) T ss_pred -------------------------------------CCCCC Confidence 11111 No 69 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=100.00 E-value=5.4e-70 Score=400.33 Aligned_cols=381 Identities=12% Similarity=0.058 Sum_probs=274.4 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+||+++...-. ..+ +. ...+.......+. .. +........+.+ T Consensus 1 M~~f~~~~~~~~-------------------------~~~-~~------~~~~~~~~~~~~~--~~--~~~~~~v~~~~~ 44 (386) T protein:vir:48 1 MPIFNITNLATE-------------------------SPP-IS------QGGFFDITDPDFL--ST--LNGSEWVSAESA 44 (386) T ss_pred Cccccccccccc-------------------------ccc-cc------cccccccccchhc--cc--ccCCceechhhh Confidence 999987621100 000 00 0000000000000 01 111222334567 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+|+|++||++||++||++|+.+++. ....++.+||| ++ ++++|++.++ T Consensus 45 ~~~~~v~~~i~~ia~~ia~~p~~~~~~-------------------------~~~~l~~~pN~--~~---t~~~f~~~~~ 94 (386) T protein:vir:48 45 LRNSDLFSIINQLSNDLATVKLTASRK-------------------------QLQGIIDNPSN--NA---NRFNFYQSIF 94 (386) T ss_pred hcchHHHHHHHHHHHhhccCceeeccc-------------------------hhHHHhhcCCC--CC---CHHHHHHHHH Confidence 889999999999999999998754321 12235555555 34 4578999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecC---ceeeeecccceEEEecccCCCc Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDN---KVRGSFTADEMGMFIRNPRSDI 237 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~---~~~~~~~~~evi~~~~n~~~~~ 237 (559) .+++++||+|++|+||..|+|++||||+|++|++..+.+|... .|.....+ .....|+++||||++.+.. T Consensus 95 ~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~~----~y~~~~~~~~~~~~~~~~~~evih~~~~~~--- 167 (386) T protein:vir:48 95 AQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDGI----YYNITFDDPRIPPKQHVPQGDVLHFKLLSV--- 167 (386) T ss_pred HHhhhcCcEEEEEEECCCCcEEEEEEecCceeEEEEcCCCceE----EEEEEecCccccceeEecCccEEEecCCCC--- Confidence 9999999999999999999999999999999999988776432 23332222 2345789999999986432 Q ss_pred cCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc Q lcl|NC_012530. 238 LSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI 317 (559) Q Consensus 238 ~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl 317 (559) ....||+||+..+..+|..+.++++++.++|+||++|+|+|++++ .+++++.+++++.|... ..|+|+++|| T Consensus 168 ~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~-----~~~~e~~~~~~~~~~~~---~~n~g~~~vl 239 (386) T protein:vir:48 168 DGGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKG-----GGLLDFKTKLSRSRQAM---KQMQGGPLVL 239 (386) T ss_pred CCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-----CCCHHHHHHHHHHHHHh---hcCCCCceec Confidence 234689999999999999999999999999999999999998864 47888999999988764 4578998777 Q ss_pred cCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHH Q lcl|NC_012530. 318 TAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDM 396 (559) Q Consensus 318 ~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ 396 (559) + ++++|++++. ++|+||+|++++++++||++|||||++||+.. +++|++++.+.|++.||.||++. T Consensus 240 ~-~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~------------~~~~~e~~~~~~~~~~l~P~~~~ 306 (386) T protein:vir:48 240 D-DLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVGGQG------------DQQSSLEMSLDLYNKAVSRYLRP 306 (386) T ss_pred C-CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC------------CcccHHHHHHHHHHHHHHHHHHH Confidence 5 5599999995 79999999999999999999999999998521 34678999999999999999999 Q ss_pred HHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccc Q lcl|NC_012530. 397 IAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQ 475 (559) Q Consensus 397 ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~ 475 (559) ||++|+++|+++ +.+++......|...++..+..+++ +++|+||+|+.+|++|+++||.+... . T Consensus 307 ie~~l~~~l~~~-----~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~~~~~~~----~------ 371 (386) T protein:vir:48 307 FLSELSQKLSCD-----VDADILPAVDPTGSNSVSRINSMVKSGTLAQNQGLYILQQAEILPKELPEGE----N------ 371 (386) T ss_pred HHHHHHHhhcch-----hhcchhhhhccChHHHHHHHHHHHhCCCcCHHHHHHHhhcCCCCCccchhhc----C------ Confidence 999999999874 3344444455666666666666665 46899999999999999888754210 0 Q ss_pred cccccccccccccccccccCCCCCCCC Q lcl|NC_012530. 476 IKQNEFQRQQTRLTQLESALQNPSGTP 502 (559) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (559) .+.+...+++.+..+ T Consensus 372 ------------~~~~~~~gGd~~~~~ 386 (386) T protein:vir:48 372 ------------PNKTTLKGGEINGED 386 (386) T ss_pred ------------CCCCccCCCCCCCCC Confidence 000011111111111 No 70 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=100.00 E-value=5.9e-70 Score=400.12 Aligned_cols=371 Identities=12% Similarity=0.026 Sum_probs=264.6 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |++|++|+.. .... ......... ..+....+.......+ +....-+.+ T Consensus 3 m~~f~~~~~~---~~~~---------------------~~~~~~~~~------~~~~~~~~~~~~~~~~--~~~v~~~~a 50 (392) T protein:vir:39 3 LPILNFINQT---NDPP---------------------EVGSVQSYF------PDGNDAQIMESLLGDN--NEWVSARAA 50 (392) T ss_pred chhhhhhhcc---cccc---------------------ccccccccc------ccCchhhhhhhhcCCC--CceechHHh Confidence 9999887320 0000 000000000 0000000000000011 111233567 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +++++|++||++||++||++|+.++... ...++.+| |++++ .++|++.++ T Consensus 51 l~~~~v~~~i~~ia~~ia~lp~~~~~~~-------------------------~~~l~~~P--N~~~t---~~~f~~~~~ 100 (392) T protein:vir:39 51 LRNSDLFSIILQLSSDLAIVKINAEKKK-------------------------NQGIIDNP--STNAN---KHGFWQSMF 100 (392) T ss_pred hccHHHHHHHHHHHHhhccCceeeccch-------------------------hhhHhhcC--CCCCC---HHHHHHHHH Confidence 8899999999999999999887543211 01244454 44444 478999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEec---CceeeeecccceEEEecccCCCc Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYID---NKVRGSFTADEMGMFIRNPRSDI 237 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~---~~~~~~~~~~evi~~~~n~~~~~ 237 (559) .+++++||+|++++|+.+|++++||||+|++|++..+.+|... .|..... +.....++++||||+++++.. T Consensus 101 ~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~~----~y~~~~~~~~~~~~~~~~~~eiih~~~~~~~-- 174 (392) T protein:vir:39 101 AQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGM----YYNITFDDPKIEPILQAPQSDLIHMKLLSID-- 174 (392) T ss_pred HHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceE----EEEEEecCcccceeEEEccccEEEecCCCCC-- Confidence 9999999999999999999999999999999999988776432 2222222 223467999999999865422 Q ss_pred cCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc Q lcl|NC_012530. 238 LSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI 317 (559) Q Consensus 238 ~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl 317 (559) ...||+|||.++..+|..+.++++++.++|+||++|+|+|++++.. ..++++ ++.|.+.+.|..|+|+++|| T Consensus 175 -~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~---~~~~~~----~~~~~~~~~~~~~~g~~~vl 246 (392) T protein:vir:39 175 -GGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGG---LLSDKD----KASRSRSFMKRSRSGGPVVL 246 (392) T ss_pred -CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCC---CchHHH----HHHHHHHHhccccCCCeeec Confidence 3468999999999999999999999999999999999999997542 233333 45666777888899998887 Q ss_pred cCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHH Q lcl|NC_012530. 318 TAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDM 396 (559) Q Consensus 318 ~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ 396 (559) + ++++|++++. +.|+||+|++++++++||++|||||++||+.... .+..++.+.|++.||.|+++. T Consensus 247 ~-~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~------------~~~~~~~~~f~~~~l~P~~~~ 313 (392) T protein:vir:39 247 D-DLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQ------------QSSIQQISGMYASALNRYLRP 313 (392) T ss_pred C-CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc------------ccHHHHHHHHHHHHHHHHHHH Confidence 5 5599999985 7999999999999999999999999999964322 133456788999999999999 Q ss_pred HHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHcC-CCCHHHHHHHh--------------CCCCCCCCCEe Q lcl|NC_012530. 397 IAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQT-ATTVNDYREKQ--------------GLPKIAGGDII 461 (559) Q Consensus 397 ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~-~~T~NE~R~~~--------------gl~pi~gGD~~ 461 (559) ||++|+++|++. ++|++..+.+.|..++++.+..++++ ++|+||+|+++ |+||++|||.- T Consensus 314 ie~~l~~~L~~~-----~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r~~e~l~~~~~Gd~~ 388 (392) T protein:vir:39 314 AISELEYKLSDH-----ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPAPENTNKKTTGQSN 388 (392) T ss_pred HHHHHHHhcccc-----ccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccccchhcCCCCCCCCCCC Confidence 999999999864 45555666777888888887777754 57888888876 66666666531 Q ss_pred eccceecc Q lcl|NC_012530. 462 LSAVYIQR 469 (559) Q Consensus 462 ~~~~~~~~ 469 (559) .| .| T Consensus 389 -~p---~p 392 (392) T protein:vir:39 389 -EP---VP 392 (392) T ss_pred -CC---CC Confidence 00 00 No 71 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=100.00 E-value=5.9e-70 Score=400.12 Aligned_cols=371 Identities=12% Similarity=0.026 Sum_probs=264.6 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |++|++|+.. .... ......... ..+....+.......+ +....-+.+ T Consensus 3 m~~f~~~~~~---~~~~---------------------~~~~~~~~~------~~~~~~~~~~~~~~~~--~~~v~~~~a 50 (392) T protein:vir:10 3 LPILNFINQT---NDPP---------------------EVGSVQSYF------PDGNDAQIMESLLGDN--NEWVSARAA 50 (392) T ss_pred chhhhhhhcc---cccc---------------------ccccccccc------ccCchhhhhhhhcCCC--CceechHHh Confidence 9999887320 0000 000000000 0000000000000011 111233567 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +++++|++||++||++||++|+.++... ...++.+| |++++ .++|++.++ T Consensus 51 l~~~~v~~~i~~ia~~ia~lp~~~~~~~-------------------------~~~l~~~P--N~~~t---~~~f~~~~~ 100 (392) T protein:vir:10 51 LRNSDLFSIILQLSSDLAIVKINAEKKK-------------------------NQGIIDNP--STNAN---KHGFWQSMF 100 (392) T ss_pred hccHHHHHHHHHHHHhhccCceeeccch-------------------------hhhHhhcC--CCCCC---HHHHHHHHH Confidence 8899999999999999999887543211 01244454 44444 478999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEec---CceeeeecccceEEEecccCCCc Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYID---NKVRGSFTADEMGMFIRNPRSDI 237 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~---~~~~~~~~~~evi~~~~n~~~~~ 237 (559) .+++++||+|++++|+.+|++++||||+|++|++..+.+|... .|..... +.....++++||||+++++.. T Consensus 101 ~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~~----~y~~~~~~~~~~~~~~~~~~eiih~~~~~~~-- 174 (392) T protein:vir:10 101 AQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGM----YYNITFDDPKIEPILQAPQSDLIHMKLLSID-- 174 (392) T ss_pred HHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceE----EEEEEecCcccceeEEEccccEEEecCCCCC-- Confidence 9999999999999999999999999999999999988776432 2222222 223467999999999865422 Q ss_pred cCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc Q lcl|NC_012530. 238 LSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI 317 (559) Q Consensus 238 ~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl 317 (559) ...||+|||.++..+|..+.++++++.++|+||++|+|+|++++.. ..++++ ++.|.+.+.|..|+|+++|| T Consensus 175 -~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~---~~~~~~----~~~~~~~~~~~~~~g~~~vl 246 (392) T protein:vir:10 175 -GGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGG---LLSDKD----KASRSRSFMKRSRSGGPVVL 246 (392) T ss_pred -CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCC---CchHHH----HHHHHHHHhccccCCCeeec Confidence 3468999999999999999999999999999999999999997542 233333 45666777888899998887 Q ss_pred cCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHH Q lcl|NC_012530. 318 TAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDM 396 (559) Q Consensus 318 ~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ 396 (559) + ++++|++++. +.|+||+|++++++++||++|||||++||+.... .+..++.+.|++.||.|+++. T Consensus 247 ~-~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~------------~~~~~~~~~f~~~~l~P~~~~ 313 (392) T protein:vir:10 247 D-DLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQ------------QSSIQQISGMYASALNRYLRP 313 (392) T ss_pred C-CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc------------ccHHHHHHHHHHHHHHHHHHH Confidence 5 5599999985 7999999999999999999999999999964322 133456788999999999999 Q ss_pred HHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHcC-CCCHHHHHHHh--------------CCCCCCCCCEe Q lcl|NC_012530. 397 IAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQT-ATTVNDYREKQ--------------GLPKIAGGDII 461 (559) Q Consensus 397 ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~-~~T~NE~R~~~--------------gl~pi~gGD~~ 461 (559) ||++|+++|++. ++|++..+.+.|..++++.+..++++ ++|+||+|+++ |+||++|||.- T Consensus 314 ie~~l~~~L~~~-----~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r~~e~l~~~~~Gd~~ 388 (392) T protein:vir:10 314 AISELEYKLSDH-----ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPAPENTNKKTTGQSN 388 (392) T ss_pred HHHHHHHhcccc-----ccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccccchhcCCCCCCCCCCC Confidence 999999999864 45555666777888888887777754 57888888876 66666666531 Q ss_pred eccceecc Q lcl|NC_012530. 462 LSAVYIQR 469 (559) Q Consensus 462 ~~~~~~~~ 469 (559) .| .| T Consensus 389 -~p---~p 392 (392) T protein:vir:10 389 -EP---VP 392 (392) T ss_pred -CC---CC Confidence 00 00 No 72 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=100.00 E-value=1.4e-69 Score=398.01 Aligned_cols=374 Identities=13% Similarity=0.090 Sum_probs=274.2 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+||+|-...- ..+. ....... .. ....+. .. +..+.....+.| T Consensus 1 Mglf~~~~~~~--~~~~------------------------~~~~~~~-----~~-~~~~~~--~~--~~~~~~v~~~~a 44 (384) T protein:vir:49 1 MPIFNITNLAT--ESPP------------------------SNQDSFF-----DI-TDPEFL--DA--LNGSEWVSAETA 44 (384) T ss_pred CccccccccCc--cccc------------------------ccchhhc-----cc-cchhhc--cc--ccCCceechhhh Confidence 99998731100 0000 0000000 00 000000 00 111122234567 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.+++.. ...++.+||| ++ ++++|+++++ T Consensus 45 l~~~~V~~~i~~Ia~~ia~l~~~~~~~~-------------------------~~~l~~~PN~--~~---t~~~f~~~l~ 94 (384) T protein:vir:49 45 LKNSDLFSIISQLSNDLATAKITTSRKQ-------------------------LQGIVDNPSN--NA---NRFNFYQSIF 94 (384) T ss_pred hccHHHHHHHHHHHHHHhhCceeeecch-------------------------hhhhhhccCC--CC---CHHHHHHHHH Confidence 8899999999999999999987653211 1124455544 44 4478999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEec---CceeeeecccceEEEecccCCCc Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYID---NKVRGSFTADEMGMFIRNPRSDI 237 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~---~~~~~~~~~~evi~~~~n~~~~~ 237 (559) .+++++||+|++++|+..|+|++||||+|++|++..+.++... .|..... .+....++++||||++.+.. T Consensus 95 ~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~~----~y~~~~~~~~~~~~~~~~~~eVih~~~~~~--- 167 (384) T protein:vir:49 95 AQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNQNGL----YYNITFDDPRIPPKQHVPQGDILHFRLLSV--- 167 (384) T ss_pred HHhhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceE----EEEEEecCccccceeEecCccEEEecCCCC--- Confidence 9999999999999999999999999999999999887665321 2222222 23446799999999986432 Q ss_pred cCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc Q lcl|NC_012530. 238 LSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI 317 (559) Q Consensus 238 ~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl 317 (559) ....+|+||+.+++.+|..+.++++++.++|+||++|+|||++++. ++.++. ++.+.+.+.|..|+|++++| T Consensus 168 ~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~-----~~~~~~---~~~~~~~~~~~~n~~~~~vl 239 (384) T protein:vir:49 168 DGGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIKGG-----GLLDFK---TKQSRSRQAMKQMQGGPLVL 239 (384) T ss_pred CCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC-----CChHHH---HHHHHHHHhcccCCccceec Confidence 2346899999999999999999999999999999999999998754 333333 23455667788899998777 Q ss_pred cCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHH Q lcl|NC_012530. 318 TAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDM 396 (559) Q Consensus 318 ~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ 396 (559) ++ +++|++++. +.|+||+|++++++++||++|||||++||....++ .++++++++...|++.+|.||+.. T Consensus 240 ~~-g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~--------~~~~~~~~~~~~~i~~~l~pi~~~ 310 (384) T protein:vir:49 240 DD-LEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDKQ--------SSLEMIYNIYFKAVSRFLRPFVSE 310 (384) T ss_pred CC-CceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc--------ccHHHHHHHHHHHHHHHHHHHHHH Confidence 54 599999985 79999999999999999999999999999754433 246788999999999999999999 Q ss_pred HHHHHHhhcccc---c---cCccceeeecchhhhhHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccce Q lcl|NC_012530. 397 IAKNLTNGIIRQ---I---LGDNYMLEFVGGDTRSQQDKLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVY 466 (559) Q Consensus 397 ie~~ln~~L~~~---~---~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~ 466 (559) |+++|+++|... . .+..++|.++.+.+.+..++.+++..+...++++||+|+.+|++|++|||.= ..+ T Consensus 311 i~~~l~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~ne~r~~~~~~p~~gGd~~--~~~ 384 (384) T protein:vir:49 311 LSKKLSCEVDADILPAVDPTGSNYIGLINSMVKTGTLAQNQGLYVLQQAEILPKDLPEGETDSTLKGGETN--EQY 384 (384) T ss_pred HHHHhchhhhhhhhhhhhccchHHHHHHHHHhhcCcccHHHHHHHHhhCCCCChhHHHHcCCCCCCCCCCC--CCC Confidence 999999987432 1 1233555667888999999999999888776666999999999999999741 111 No 73 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=100.00 E-value=9.3e-69 Score=393.54 Aligned_cols=373 Identities=13% Similarity=0.126 Sum_probs=270.5 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+|||+|. .... +.... ... .+.. . ..-+.| T Consensus 1 Mg~f~~~f---~~~~--------------------------~~~~~----~~~------~~~~--~--------~~~~~a 31 (385) T protein:vir:95 1 MGLFDSVF---KRHS--------------------------ELSWM----YDL------EFLQ--D--------KSKKAY 31 (385) T ss_pred Cchhhhhh---ccCc--------------------------ccccc----cch------hhhh--c--------cchhhh Confidence 99999983 2110 00000 000 0000 0 012456 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.+++... . ..+.+. .|++..||++++ +++|++.++ T Consensus 32 ~~~~~v~~~i~~ia~~ia~~p~~~~~~~~-----------~--------~~~~l~-~lL~~~PN~~~t---~~~f~~~~~ 88 (385) T protein:vir:95 32 LKQIALNTVVEMVARTISQSEFRVMKNNT-----------K--------EKGTLY-YLLNVRPNRNQN---AVDFWQKFI 88 (385) T ss_pred hhhHHHHHHHHHHHHHHcccceeeeecCc-----------c--------ccchHH-HHHhcccCcCCC---HHHHHHHHH Confidence 78999999999999999999876653211 0 112233 344445666554 478999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) .+++++||+|+++.++. +.++.++++.+..+.+..+.. ..+.+........++++||||+++++.. .. T Consensus 89 ~~l~l~Gna~i~~~~~~-~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~eiih~~~~~~~---~~ 156 (385) T protein:vir:95 89 FKLIMDNEVLVVKNDEG-HFFVADDFEKEDELGLYSHRF--------TNVLVNDFEFKRVFTMDDVIYLKYNNQK---LD 156 (385) T ss_pred HHHhhcCceEEEEecCC-Ceeeccccccccccccccccc--------eeeeecccceeeeeccccEEEecCCCCC---cc Confidence 99999999999877653 456666666666554432211 1222233445567999999999875432 34 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE 320 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g 320 (559) .+|+||++.+..++..+.++.. | ++.|+|+|++++. ..+++++.+++++.|++.++|..++++.+++.++ T Consensus 157 ~~G~s~~~~~~~~i~~~~~~~~-----~--~~~~~g~l~~~~~---~~~~~e~~~~~~~~~~~~~~g~~~~~~~i~~l~~ 226 (385) T protein:vir:95 157 AFSLGLFEDYGEIFGRMIDLQM-----L--NNQIRGILKVDAT---KFYNKEKQKELQAYIDTLFDAFQNNTIAVVPLTE 226 (385) T ss_pred cccchHHHHHHHHHHHHHHHHH-----h--cCCCceEEEeCCc---cCCCHHHHHHHHHHHHHHhhhhhhcCCceEEcCC Confidence 6799999999999987665532 3 3458899988653 4578999999999999999998766665555577 Q ss_pred ceeeeeccc-------cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHH Q lcl|NC_012530. 321 DAKFVSMTQ-------AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPL 393 (559) Q Consensus 321 ~~~~~~ls~-------~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~ 393 (559) +++|+++++ +.|+||+|++++++++||++|||||++|+ + +++|.+++...|++.||+|| T Consensus 227 g~~~~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~----~----------~~sn~e~~~~~~~~~~l~P~ 292 (385) T protein:vir:95 227 GLAYEEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLVL----G----------EMADLEKTIESYLQFCINPL 292 (385) T ss_pred CceeEeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhc----C----------CCcCHHHHHHHHHHHHHHHH Confidence 799999862 36999999999999999999999999995 1 35789999999999999999 Q ss_pred HHHHHHHHHhhccccccCcc--ceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCC--CCCCEeeccceec Q lcl|NC_012530. 394 LDMIAKNLTNGIIRQILGDN--YMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKI--AGGDIILSAVYIQ 468 (559) Q Consensus 394 ~~~ie~~ln~~L~~~~~~~~--~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi--~gGD~~~~~~~~~ 468 (559) +.+||++||++|+++.+... ++|+++.+++.|.+++++++..+++ |+||+||+|+++|+||+ ||||+++++.|++ T Consensus 293 ~~~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~NE~R~~~g~~p~~~~~gd~~~~~~n~~ 372 (385) T protein:vir:95 293 LRKIEAELNSKFFYQDEYLNDDMHIKVVGIDKRDPLKLSEAIDKLVASGTFTRNQVRIMTGEEPADDPELDKFIITKNLQ 372 (385) T ss_pred HHHHHHHHHhhcCChhhcccceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecccce Confidence 99999999999999765433 4555568899999999999998885 56899999999999999 7899999999988 Q ss_pred ccccccccccccc Q lcl|NC_012530. 469 RLGQQEQIKQNEF 481 (559) Q Consensus 469 ~l~~~~~~~~~~~ 481 (559) +++........++ T Consensus 373 ~~~~~kgge~~~e 385 (385) T protein:vir:95 373 SADAFKGGESNEE 385 (385) T ss_pred ecccccCCCCCCC Confidence 7754211100000 No 74 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=100.00 E-value=4.4e-68 Score=389.87 Aligned_cols=384 Identities=14% Similarity=0.104 Sum_probs=268.6 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+||||++.. ... ..+ . ..+.. ......+.+ T Consensus 1 Mg~f~~lf~~---~~~------------------------~~~---~---~~~~~----------------~~~v~~~~~ 31 (395) T protein:vir:95 1 MSILEKIFKT---RKD------------------------ITY---M---LDLDM----------------IEDLSQQAY 31 (395) T ss_pred Cchhhhhhcc---Ccc------------------------ccc---c---ccchh----------------ccccchhhh Confidence 9999998321 100 000 0 00000 001122456 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.+++... .. .+.+. +|++..||+++++ ++|++.++ T Consensus 32 ~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~-------------~~------~~~~~-~ll~~~PN~~~t~---~~f~~~~~ 88 (395) T protein:vir:95 32 VKRLAIDSCIEFVARAVAQSHFKVLEGNR-------------IQ------KNDVY-YKLNIKPNTDLSS---DSFWQQVI 88 (395) T ss_pred hhhHHHHHHHHHHHHhhccceeEeccCCc-------------cc------cchHH-HHHHhccCcCCCH---HHHHHHHH Confidence 78999999999999999999876553210 01 11222 3333456655554 67899999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) .++++.|++|+++.++. .++++++..+++...... ...++..........++++||||+++++.. .. T Consensus 89 ~~lll~g~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~evih~~~~~~~---~~ 155 (395) T protein:vir:95 89 YKLIYDNEVLIVVSDSK-----ELLIADSFYREEYALYDD-----IFKDVTVKDYTYQRTFTMQEVIYLKYNNNK---VT 155 (395) T ss_pred HHHhhCCceEEEEecCC-----CeEecCCccceeEeecCc-----ceeEEEEcCceeeeeeccccEEEEccCCCC---cc Confidence 99999999887665443 256666666555433221 112333334444568999999999876543 34 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc-cC Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI-TA 319 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl-~~ 319 (559) .||+||+..+..++..+.+ .|.+|+.|+|+|+++.. .+++++.+++++.|++.++|. ++++.+|+ .+ T Consensus 156 ~~G~spi~~~~~~~~~~~~-------~~~~~~~~~gii~~~~~----~~~~e~~~~~~~~~~~~~~~~-~~~~~~v~~l~ 223 (395) T protein:vir:95 156 HFVESLFEDYGKIFGRMIG-------AQLKNYQIRGILKSASS----AYDEKNIEKLQAFTNKLFNTF-NKNQLAIAPLI 223 (395) T ss_pred cccchHHHHHHHHHHHHHH-------HHHhcCCCceEEEeCCC----CCCHHHHHHHHHHHHHHhccc-cccCcceEEcC Confidence 5899999999988876553 46778889999988643 468999999999999998886 45555443 56 Q ss_pred Cceeeeeccc-cch-----hHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHH Q lcl|NC_012530. 320 EDAKFVSMTQ-AED-----MQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPL 393 (559) Q Consensus 320 g~~~~~~ls~-~~D-----~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~ 393 (559) ++++|++++. +.| +||+|++++++++||++|||||++||- +++|++++.+.|+++||.|| T Consensus 224 ~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~--------------~~sn~e~~~~~~~~~~l~P~ 289 (395) T protein:vir:95 224 EGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG--------------ETADLEKNTLVFEKFCLTPL 289 (395) T ss_pred CCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC--------------cccCHHHHHHHHHHHHHHHH Confidence 7799999984 555 499999999999999999999999961 35789999999999999999 Q ss_pred HHHHHHHHHhhccccccC-ccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCC--CEeeccceecc Q lcl|NC_012530. 394 LDMIAKNLTNGIIRQILG-DNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGG--DIILSAVYIQR 469 (559) Q Consensus 394 ~~~ie~~ln~~L~~~~~~-~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gG--D~~~~~~~~~~ 469 (559) +.+||++||++|+++.+. ..++|+++.+++.|.+++++++..+++ |+||+||+|+++||||++|| |+++++.++.+ T Consensus 290 ~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~~n~~~ 369 (395) T protein:vir:95 290 LKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYLITKNYEK 369 (395) T ss_pred HHHHHHHHHHhhcChhhhcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecccccc Confidence 999999999999986543 457888899999999999999998886 56899999999999999876 99999998887 Q ss_pred cccccccccccccccccccccccccCCCCCCCCCCCCccccccchhccccccccccccccccc Q lcl|NC_012530. 470 LGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDN 532 (559) Q Consensus 470 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 532 (559) ++......... ... ...+.+.+++| + T Consensus 370 ~~~~~~~~~~~--------~~~---------------------------~~kgg~~~~~g--~ 395 (395) T protein:vir:95 370 ANSGENDEKEK--------DEN---------------------------TLKGGDEDESG--D 395 (395) T ss_pred ccccccccCcc--------ccc---------------------------ccCCCCCCCCC--C Confidence 75432111000 000 00001111111 1 No 75 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=100.00 E-value=4.4e-68 Score=389.87 Aligned_cols=384 Identities=14% Similarity=0.104 Sum_probs=268.6 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+||||++.. ... ..+ . ..+.. ......+.+ T Consensus 1 Mg~f~~lf~~---~~~------------------------~~~---~---~~~~~----------------~~~v~~~~~ 31 (395) T protein:vir:10 1 MSILEKIFKT---RKD------------------------ITY---M---LDLDM----------------IEDLSQQAY 31 (395) T ss_pred Cchhhhhhcc---Ccc------------------------ccc---c---ccchh----------------ccccchhhh Confidence 9999998321 100 000 0 00000 001122456 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.+++... .. .+.+. +|++..||+++++ ++|++.++ T Consensus 32 ~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~-------------~~------~~~~~-~ll~~~PN~~~t~---~~f~~~~~ 88 (395) T protein:vir:10 32 VKRLAIDSCIEFVARAVAQSHFKVLEGNR-------------IQ------KNDVY-YKLNIKPNTDLSS---DSFWQQVI 88 (395) T ss_pred hhhHHHHHHHHHHHHhhccceeEeccCCc-------------cc------cchHH-HHHHhccCcCCCH---HHHHHHHH Confidence 78999999999999999999876553210 01 11222 3333456655554 67899999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) .++++.|++|+++.++. .++++++..+++...... ...++..........++++||||+++++.. .. T Consensus 89 ~~lll~g~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~evih~~~~~~~---~~ 155 (395) T protein:vir:10 89 YKLIYDNEVLIVVSDSK-----ELLIADSFYREEYALYDD-----IFKDVTVKDYTYQRTFTMQEVIYLKYNNNK---VT 155 (395) T ss_pred HHHhhCCceEEEEecCC-----CeEecCCccceeEeecCc-----ceeEEEEcCceeeeeeccccEEEEccCCCC---cc Confidence 99999999887665443 256666666555433221 112333334444568999999999876543 34 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc-cC Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI-TA 319 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl-~~ 319 (559) .||+||+..+..++..+.+ .|.+|+.|+|+|+++.. .+++++.+++++.|++.++|. ++++.+|+ .+ T Consensus 156 ~~G~spi~~~~~~~~~~~~-------~~~~~~~~~gii~~~~~----~~~~e~~~~~~~~~~~~~~~~-~~~~~~v~~l~ 223 (395) T protein:vir:10 156 HFVESLFEDYGKIFGRMIG-------AQLKNYQIRGILKSASS----AYDEKNIEKLQAFTNKLFNTF-NKNQLAIAPLI 223 (395) T ss_pred cccchHHHHHHHHHHHHHH-------HHHhcCCCceEEEeCCC----CCCHHHHHHHHHHHHHHhccc-cccCcceEEcC Confidence 5899999999988876553 46778889999988643 468999999999999998886 45555443 56 Q ss_pred Cceeeeeccc-cch-----hHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHH Q lcl|NC_012530. 320 EDAKFVSMTQ-AED-----MQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPL 393 (559) Q Consensus 320 g~~~~~~ls~-~~D-----~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~ 393 (559) ++++|++++. +.| +||+|++++++++||++|||||++||- +++|++++.+.|+++||.|| T Consensus 224 ~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~--------------~~sn~e~~~~~~~~~~l~P~ 289 (395) T protein:vir:10 224 EGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG--------------ETADLEKNTLVFEKFCLTPL 289 (395) T ss_pred CCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC--------------cccCHHHHHHHHHHHHHHHH Confidence 7799999984 555 499999999999999999999999961 35789999999999999999 Q ss_pred HHHHHHHHHhhccccccC-ccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCC--CEeeccceecc Q lcl|NC_012530. 394 LDMIAKNLTNGIIRQILG-DNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGG--DIILSAVYIQR 469 (559) Q Consensus 394 ~~~ie~~ln~~L~~~~~~-~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gG--D~~~~~~~~~~ 469 (559) +.+||++||++|+++.+. ..++|+++.+++.|.+++++++..+++ |+||+||+|+++||||++|| |+++++.++.+ T Consensus 290 ~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~~n~~~ 369 (395) T protein:vir:10 290 LKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYLITKNYEK 369 (395) T ss_pred HHHHHHHHHHhhcChhhhcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecccccc Confidence 999999999999986543 457888899999999999999998886 56899999999999999876 99999998887 Q ss_pred cccccccccccccccccccccccccCCCCCCCCCCCCccccccchhccccccccccccccccc Q lcl|NC_012530. 470 LGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDN 532 (559) Q Consensus 470 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 532 (559) ++......... ... ...+.+.+++| + T Consensus 370 ~~~~~~~~~~~--------~~~---------------------------~~kgg~~~~~g--~ 395 (395) T protein:vir:10 370 ANSGENDEKEK--------DEN---------------------------TLKGGDEDESG--D 395 (395) T ss_pred ccccccccCcc--------ccc---------------------------ccCCCCCCCCC--C Confidence 75432111000 000 00001111111 1 No 76 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=100.00 E-value=4.4e-68 Score=389.87 Aligned_cols=384 Identities=14% Similarity=0.104 Sum_probs=268.6 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+||||++.. ... ..+ . ..+.. ......+.+ T Consensus 1 Mg~f~~lf~~---~~~------------------------~~~---~---~~~~~----------------~~~v~~~~~ 31 (395) T protein:vir:10 1 MSILEKIFKT---RKD------------------------ITY---M---LDLDM----------------IEDLSQQAY 31 (395) T ss_pred Cchhhhhhcc---Ccc------------------------ccc---c---ccchh----------------ccccchhhh Confidence 9999998321 100 000 0 00000 001122456 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.+++... .. .+.+. +|++..||+++++ ++|++.++ T Consensus 32 ~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~-------------~~------~~~~~-~ll~~~PN~~~t~---~~f~~~~~ 88 (395) T protein:vir:10 32 VKRLAIDSCIEFVARAVAQSHFKVLEGNR-------------IQ------KNDVY-YKLNIKPNTDLSS---DSFWQQVI 88 (395) T ss_pred hhhHHHHHHHHHHHHhhccceeEeccCCc-------------cc------cchHH-HHHHhccCcCCCH---HHHHHHHH Confidence 78999999999999999999876553210 01 11222 3333456655554 67899999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) .++++.|++|+++.++. .++++++..+++...... ...++..........++++||||+++++.. .. T Consensus 89 ~~lll~g~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~evih~~~~~~~---~~ 155 (395) T protein:vir:10 89 YKLIYDNEVLIVVSDSK-----ELLIADSFYREEYALYDD-----IFKDVTVKDYTYQRTFTMQEVIYLKYNNNK---VT 155 (395) T ss_pred HHHhhCCceEEEEecCC-----CeEecCCccceeEeecCc-----ceeEEEEcCceeeeeeccccEEEEccCCCC---cc Confidence 99999999887665443 256666666555433221 112333334444568999999999876543 34 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc-cC Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI-TA 319 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl-~~ 319 (559) .||+||+..+..++..+.+ .|.+|+.|+|+|+++.. .+++++.+++++.|++.++|. ++++.+|+ .+ T Consensus 156 ~~G~spi~~~~~~~~~~~~-------~~~~~~~~~gii~~~~~----~~~~e~~~~~~~~~~~~~~~~-~~~~~~v~~l~ 223 (395) T protein:vir:10 156 HFVESLFEDYGKIFGRMIG-------AQLKNYQIRGILKSASS----AYDEKNIEKLQAFTNKLFNTF-NKNQLAIAPLI 223 (395) T ss_pred cccchHHHHHHHHHHHHHH-------HHHhcCCCceEEEeCCC----CCCHHHHHHHHHHHHHHhccc-cccCcceEEcC Confidence 5899999999988876553 46778889999988643 468999999999999998886 45555443 56 Q ss_pred Cceeeeeccc-cch-----hHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHH Q lcl|NC_012530. 320 EDAKFVSMTQ-AED-----MQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPL 393 (559) Q Consensus 320 g~~~~~~ls~-~~D-----~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~ 393 (559) ++++|++++. +.| +||+|++++++++||++|||||++||- +++|++++.+.|+++||.|| T Consensus 224 ~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~--------------~~sn~e~~~~~~~~~~l~P~ 289 (395) T protein:vir:10 224 EGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG--------------ETADLEKNTLVFEKFCLTPL 289 (395) T ss_pred CCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC--------------cccCHHHHHHHHHHHHHHHH Confidence 7799999984 555 499999999999999999999999961 35789999999999999999 Q ss_pred HHHHHHHHHhhccccccC-ccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCC--CEeeccceecc Q lcl|NC_012530. 394 LDMIAKNLTNGIIRQILG-DNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGG--DIILSAVYIQR 469 (559) Q Consensus 394 ~~~ie~~ln~~L~~~~~~-~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gG--D~~~~~~~~~~ 469 (559) +.+||++||++|+++.+. ..++|+++.+++.|.+++++++..+++ |+||+||+|+++||||++|| |+++++.++.+ T Consensus 290 ~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~~n~~~ 369 (395) T protein:vir:10 290 LKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYLITKNYEK 369 (395) T ss_pred HHHHHHHHHHhhcChhhhcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecccccc Confidence 999999999999986543 457888899999999999999998886 56899999999999999876 99999998887 Q ss_pred cccccccccccccccccccccccccCCCCCCCCCCCCccccccchhccccccccccccccccc Q lcl|NC_012530. 470 LGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDN 532 (559) Q Consensus 470 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 532 (559) ++......... ... ...+.+.+++| + T Consensus 370 ~~~~~~~~~~~--------~~~---------------------------~~kgg~~~~~g--~ 395 (395) T protein:vir:10 370 ANSGENDEKEK--------DEN---------------------------TLKGGDEDESG--D 395 (395) T ss_pred ccccccccCcc--------ccc---------------------------ccCCCCCCCCC--C Confidence 75432111000 000 00001111111 1 No 77 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=100.00 E-value=8.3e-67 Score=382.87 Aligned_cols=483 Identities=10% Similarity=0.054 Sum_probs=277.1 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhcccccccc-------ccccccc-----cccccccccccccCCCC Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYT-------EPVDGNL-----MFSTLEDTSIVPKPSPI 68 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~-------~~~~~~~-----~~~~~~~~~~~~~p~~~ 68 (559) -+.+||-+-.+-+++=+..=.-|+.+ ++..+ +-.+.+++.+ .|..... ....+..++.... . T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~-~p~~~~~~~~~~~~~~~d~~~~~~~r~g~~~~~~~~g~~~~~---e 81 (648) T protein:vir:79 8 RGFWSRISLMWRDEDDDKEPLVLEES--MQLGE-APGAMPKGGGGGGSAKRDPKMSLVKRIGLAIMDGGGGGRDFE---E 81 (648) T ss_pred chhhhhhhhhccCccccccccccccc--cccCC-CccccCCCCcccccccccchhHHHHHhHHHHHhhcCCccccc---c Confidence 57777776666544332211122222 11110 0000000000 0000000 0001111111111 1 Q ss_pred CcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCC Q lcl|NC_012530. 69 AFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPI 148 (559) Q Consensus 69 ~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~ 148 (559) ...++..+.+.+..+|+|++||++||++||++|+. ++.++....... .....+.+|++ +++ T Consensus 82 pp~d~~~l~~l~~~np~V~~aI~iia~~ia~l~~~-----------i~~~~~~~~~~~------~~~~ll~rPn~--~~t 142 (648) T protein:vir:79 82 PEFDFNEITSAYNTEGYVRQAVDKYIEMMFKADWD-----------FVSKNPNAVEYI------RMRFTLMAEAT--QIP 142 (648) T ss_pred CCcCHHHHHHHHhcChHHHHHHHHHHHHHhhCcce-----------EEecCCccchhh------HHHHHhhccCC--CCC Confidence 23366777788888999999999999999987754 333332211111 11223444444 444 Q ss_pred hhhHHHHHHHHHHHHHHcCCcceEEEECCCCc---------------EEEEEEecCceEEEEecCcccccccceEEEEEe Q lcl|NC_012530. 149 RDDFTSFLRKLVRDTYTYDQVNYENTYDSNGR---------------LSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYI 213 (559) Q Consensus 149 ~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~---------------~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~ 213 (559) + ++|++.++.|++++||||++++|+.+|. +.+||||+|.+|++..+++|... .|++.. T Consensus 143 ~---~~f~~~l~~~lll~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~d~~g~~~----~Y~y~~ 215 (648) T protein:vir:79 143 T---NQLFIEIAEDLVKYCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGYFPLNLASMKVKRDKFGMIK----GWQQEQ 215 (648) T ss_pred H---HHHHHHHHHHHHhcCCeEEEEEecCCCccchhhhhhhhccccceeeeEeecCceeEEEEcCCCcee----eeEEEe Confidence 3 5789999999999999999999998883 57899999999999998877542 243333 Q ss_pred cC-ceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHH Q lcl|NC_012530. 214 DN-KVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMR 292 (559) Q Consensus 214 ~~-~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e 292 (559) .+ .....|.++||||+++++. .++.||+|||.+++.+|.++.++++|+.+||+||++|+|||+++. ++...+ T Consensus 216 ~g~~~~~~~~~~dIIHik~~~~---~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~~~----~~~~~e 288 (648) T protein:vir:79 216 EGQDKPQKFKPEDIVHIYYKRE---KGRAFGTPWLLPALDDIRALRQVEENVLRLVYRNLHPLWHVKVGL----EQEGFG 288 (648) T ss_pred cCCceeEEecCccEEEEccCCC---CCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC----CccchH Confidence 33 3445689999999986542 346799999999999999999999999999999999999998753 233456 Q ss_pred HHHHHHHHHHHHhcCcccccccccccCCceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccc Q lcl|NC_012530. 293 ALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNS 372 (559) Q Consensus 293 ~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~ 372 (559) +.+++++.|.+.+.+..- +...+. ...+.+.+.++++|+||++++++++++||++|||||++||+.+.+ T Consensus 289 ~~k~~~e~~~~~~~~~~i-~gg~v~-~~~~~i~~~~s~~dlqfle~rk~~~~eIa~aFgVPP~lLG~~~~s--------- 357 (648) T protein:vir:79 289 AEEGEVDLVRGEVENMDV-EGGMVT-TERVNISSIASNQIIDAKEYLKHFEQRAFTVLGVSELMMGRGGTA--------- 357 (648) T ss_pred HHHHHHHHHHHhcccccc-cccccc-cceeeccccCCHHHHHHHHHHHHHHHHHHHHhCCCHhHcccCCCc--------- Confidence 667777778777765321 111121 112334444456899999999999999999999999999986543 Q ss_pred hhhhhHHHHHHHHHHHHhhHHHHHHHHHHHh----hccccc-------cCccceeeecchhhhhHHHHHHHHHHHHc-CC Q lcl|NC_012530. 373 LNESNNQNKIDASKSKGLMPLLDMIAKNLTN----GIIRQI-------LGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TA 440 (559) Q Consensus 373 ~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~----~L~~~~-------~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~ 440 (559) +++|.+++... +..++.|++..|+..++. .++.+. ....++|+|+.+++.|.+++++.+..++. |+ T Consensus 358 -s~stae~~~~~-~~~~i~~l~~~i~~~le~~~~~~ll~e~~l~~~l~~d~~ieF~~~~Llr~D~~~~a~~~~~l~~~Gi 435 (648) T protein:vir:79 358 -SRSTGDNLSSD-FKDRIKALQKVMATFINEFMVKEILMEGGFDPVLNPDDKVEFRFNEIDMDSKIKLENQAVFLYEHNA 435 (648) T ss_pred -cchHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccceEEEeecccchhhHHHHHHHHHHHHhCCC Confidence 34555555554 456677776665544433 332211 12346888899999999999999887764 67 Q ss_pred CCHHHHHHHhCCCCCCCCCEe-eccceecccccccccccccccccccccccccccCCCCCCCCCCCCcc-ccccchhccc Q lcl|NC_012530. 441 TTVNDYREKQGLPKIAGGDII-LSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPS-SSNSFQQNQE 518 (559) Q Consensus 441 ~T~NE~R~~~gl~pi~gGD~~-~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 518 (559) ||+||+|+++||||+|+|+.. +...++.+..+. . ......+ .++.... +..++..+.. T Consensus 436 lT~NEaR~~lGlpPi~~g~~~~~l~~~~~~~~~~------~----~~~~~~~----------~~~~~~~~~a~~eg~~~e 495 (648) T protein:vir:79 436 ISEDEMRELIGRDPVDDGEGRAKMHLQMVTIAQA------T----ALAALAP----------TPAGGSSASASGDKKKKA 495 (648) T ss_pred cCHHHHHHHhCCCCCCCCCCccccccccccchhc------c----ccccCCC----------CCCCCCCCCccccccccc Confidence 999999999999999988642 222221111110 0 0000000 0000000 0000000101 Q ss_pred c-ccccccccccccccccccccccccccchhhh------hhccCCCCC Q lcl|NC_012530. 519 G-YTGKDAKPSGKDNQQGVGKDGQLKNKKNTNS------YKQGGSSKK 559 (559) Q Consensus 519 ~-~~~~~~~~~g~~~~~~~~~~~~~k~~~~~~~------~~~~~~~~~ 559 (559) + ...++++.+|+ +...+|....|+-+.-.+ ....+ .| T Consensus 496 ~~~~~~~~~~~g~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 539 (648) T protein:vir:79 496 TDNKTKPTNQHGT--KTSPKKQTNGRHVRYMQEMLLEYTTLNEA--IK 539 (648) T ss_pred cCCCCCCCCCCCc--CCCCccccchhhhhhhhhhhhcchhhhHH--Hh Confidence 0 00011111111 111222222222220000 00000 00 No 78 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=100.00 E-value=2e-66 Score=380.75 Aligned_cols=371 Identities=12% Similarity=0.046 Sum_probs=254.3 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+||+|..... .+ .+........+. +.. ++ ..+.......+ T Consensus 1 Mg~f~~~~~~~---------------------~~-----~~~~~~~~~~~~-----~~~------~~--~~~~~v~~~~~ 41 (382) T protein:vir:48 1 MPIFNLATESP---------------------PD-----NQGGFFDVVDSD-----FLA------SL--KGNEWVSAETA 41 (382) T ss_pred CccccccccCC---------------------cc-----cccccccchhhh-----ccc------cc--cCCcccchHhh Confidence 99999861110 00 000000001100 000 00 11122334567 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+|+|++||++||++||++|+.+++... ..++.+|+| ++ ++++|++.++ T Consensus 42 l~~~~v~~~i~~ia~~ia~~~~~~~~~~~-------------------------~~L~~~PN~--~~---t~~~f~~~l~ 91 (382) T protein:vir:48 42 LRNSDLFSIINQLSNDLATVKLITSRKKL-------------------------QGIVDNPSN--NA---NRFNFYQSIF 91 (382) T ss_pred hccHHHHHHHHHHHHhhccCceeeecchh-------------------------hhhhhhcCC--CC---CHHHHHHHHH Confidence 88999999999999999999876543211 123445444 44 4578999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecC---ceeeeecccceEEEecccCCCc Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDN---KVRGSFTADEMGMFIRNPRSDI 237 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~---~~~~~~~~~evi~~~~n~~~~~ 237 (559) .+++++||+|++++||.+|++++||||+|++|++..+.+|... .|....++ +....|+++||||++.+.. T Consensus 92 ~~l~l~Gna~~~i~rd~~G~~~~l~~i~~~~v~v~~~~~~~~~----~y~~~~~~~~~~~~~~~~~~evih~~~~~~--- 164 (382) T protein:vir:48 92 AQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDGI----YYNITFDDPRIPPKQHVPQNDVLHFRLLSV--- 164 (382) T ss_pred HHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCCeE----EEEEEecCccccceeEEcCccEEEecCCCC--- Confidence 9999999999999999999999999999999999988776432 23322222 2345799999999986432 Q ss_pred cCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc Q lcl|NC_012530. 238 LSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI 317 (559) Q Consensus 238 ~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl 317 (559) .+..||+|||.+++.+|..+.++++|+.++|+||++|+|||++++ .+++++.+++++.|.+. ..|+|+++|+ T Consensus 165 ~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-----~~~~e~~~~~~~~~~~~---~~n~g~~~vl 236 (382) T protein:vir:48 165 DGGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKG-----GGLLDFKTKLSRSRQAM---KQMQGGPLVL 236 (382) T ss_pred CCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC-----CCChHHHHHHHHHHHhh---ccCCCCeeEc Confidence 234689999999999999999999999999999999999999864 46788889998888764 4678998777 Q ss_pred cCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHH Q lcl|NC_012530. 318 TAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDM 396 (559) Q Consensus 318 ~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ 396 (559) + ++++|++++. +.|+||+|++++++++||++|||||++||....+ ++.+++.+.|++.||.|+++. T Consensus 237 ~-~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~~------------~~~~~~~~~~~~~~l~p~~~~ 303 (382) T protein:vir:48 237 D-DLEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGDQ------------QSSLEMSSDLYSKAVSRYLRP 303 (382) T ss_pred C-CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc------------ccHHHHHHHHHHHHHHHHHHH Confidence 5 4599999984 7999999999999999999999999999964322 356777889999999999999 Q ss_pred HHHHHHhhccccccCc-cceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCC-----CCCCCEeeccceecc Q lcl|NC_012530. 397 IAKNLTNGIIRQILGD-NYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPK-----IAGGDIILSAVYIQR 469 (559) Q Consensus 397 ie~~ln~~L~~~~~~~-~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~p-----i~gGD~~~~~~~~~~ 469 (559) ||++|+++|+++.+.. ...++++ ..........+++ +++|+||+|+.++..+ +++|+.+..+ T Consensus 304 i~~~l~~~l~~~~~~~~~~~~~~~------~~~~~~~~~~l~~~g~~t~~e~r~~l~~~g~~~~~~~~~~~~~~~----- 372 (382) T protein:vir:48 304 FLSELSQKLSCDVDADIFPAVDPT------GSNYISRINSLVKTGTLAQNQGLYILQQAEILPKELPNGENPNST----- 372 (382) T ss_pred HHHHHHHHhcChhhhhhhhhhccc------hhHHHHHHHHHhhcCccCHHHHHHHHhhCCCCCcchhhhhcCCCC----- Confidence 9999999998765321 1111111 1111112223333 4578888887764322 2222221100 Q ss_pred cccccccccccccccccccccccccCCCCCCCC Q lcl|NC_012530. 470 LGQQEQIKQNEFQRQQTRLTQLESALQNPSGTP 502 (559) Q Consensus 470 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (559) ..+++.++.+ T Consensus 373 -----------------------~~GGd~~~~~ 382 (382) T protein:vir:48 373 -----------------------LKGGEEDGQD 382 (382) T ss_pred -----------------------CCCCCCCCCC Confidence 0111111111 No 79 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=100.00 E-value=1.3e-65 Score=376.30 Aligned_cols=381 Identities=11% Similarity=0.042 Sum_probs=268.2 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+||+|....-. + +... ...+.......+. +.+.. ......+.+ T Consensus 1 M~~f~~~~~~~~--~-----------------------~~~~-------~~~~~~~~~~~~~--~~~~~--~~~v~~~~a 44 (386) T protein:vir:49 1 MPIFNITNLATE--S-----------------------PPIN-------QESFFDIADSDFL--ASLNS--SEWVSAENA 44 (386) T ss_pred CchhhhhccCCC--C-----------------------cccc-------hhhhhhhhhcccc--ccccC--Cceechhhh Confidence 999988722110 0 0000 0000000000000 01111 111233467 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+|+|++||++||++||++|+.+++.. ...++.+|+| ++ ++++|++.++ T Consensus 45 l~~~~v~~~i~~ia~~ia~~p~~~~~~~-------------------------~~~l~~~PN~--~~---t~~~f~~~~~ 94 (386) T protein:vir:49 45 LKNSDLFSIISQLSNDLATAKITTSRKQ-------------------------LQGIVDNPSN--NA---NRFNFYQSIF 94 (386) T ss_pred hccHHHHHHHHHHHHHhhhCceeeccch-------------------------hhhhhhccCC--CC---CHHHHHHHHH Confidence 8899999999999999999887654311 1124555544 44 4478999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEe---cCceeeeecccceEEEecccCCCc Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYI---DNKVRGSFTADEMGMFIRNPRSDI 237 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~---~~~~~~~~~~~evi~~~~n~~~~~ 237 (559) .+++++||||++|+|+.+|++++||||+|++|++..+.++... .|.... .++....|+++||||++.+. . T Consensus 95 ~~lll~Gna~~~i~r~~~g~~~~l~~i~~~~v~v~~~~~~~~~----~y~~~~~~~~~~~~~~~~~~evih~~~~~---~ 167 (386) T protein:vir:49 95 AQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGL----YYNITFDDPHIAPKQHVPQNDILHFRLLS---V 167 (386) T ss_pred HHhhhcCCEEEEEEECCCCcEEEEEEecCceeEEEEcCCCceE----EEEEEEcCccccceeEEccccEEEecCCC---C Confidence 9999999999999999999999999999999999988776432 222221 23445689999999998643 2 Q ss_pred cCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc Q lcl|NC_012530. 238 LSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI 317 (559) Q Consensus 238 ~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl 317 (559) .+..||+|||.+++.+|..+.++++++.++|+||++|+|+|++++ .+++++.+++++.|+.. ..|+|+++|| T Consensus 168 ~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~-----~~~~~~~~~~~~~~~~~---~~n~g~~~vl 239 (386) T protein:vir:49 168 DGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKG-----GGLLDFKTKVSRSRQAM---KQMQGGPLVL 239 (386) T ss_pred CCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCC-----CCChHHHHHHHHHHHHh---ccCCCCceec Confidence 234689999999999999999999999999999999999999864 46778888899888753 4688998777 Q ss_pred cCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHH Q lcl|NC_012530. 318 TAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDM 396 (559) Q Consensus 318 ~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ 396 (559) + ++++|++++. +.|+||+|++++++++||++|||||++||....++ +|.+ +.+.|+..+|.|++.. T Consensus 240 ~-~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~-----------~~~~-~~~~~~~~~i~~~l~~ 306 (386) T protein:vir:49 240 D-DLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDGDQQ-----------SSLE-MIYNIYFKSVSRYLRP 306 (386) T ss_pred C-CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc-----------chHH-HHHHHHHHHHHHHHHH Confidence 5 4599999985 79999999999999999999999999999643322 2232 3456788899999999 Q ss_pred HHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHcC-CCCHHHHHHHhCCCCCCCCCEeeccceecccccccc Q lcl|NC_012530. 397 IAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQT-ATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQ 475 (559) Q Consensus 397 ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~-~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~ 475 (559) |+++|+++|++ .++|+...+++.|..+++..+..++++ ++|+||+|++++..++..+|.+.. ..... T Consensus 307 i~~~~~~~l~~-----~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~~~~~~~~~~~---~~~~~---- 374 (386) T protein:vir:49 307 FVSEMSKKLSC-----EVDVDISPAVDPTGSNYISLINSMVKSGTLAQNQGLYILQQAEILPKELPDG---KNPNR---- 374 (386) T ss_pred HHHHHHHHhcc-----hhcccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHhhCCCCCCcCcch---hccCC---- Confidence 99999999865 356777778888988998888888865 589999999998766543332210 00000 Q ss_pred cccccccccccccccccccCCCCCCCC Q lcl|NC_012530. 476 IKQNEFQRQQTRLTQLESALQNPSGTP 502 (559) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (559) +...+++.++.. T Consensus 375 ---------------~~~~gGd~~~~~ 386 (386) T protein:vir:49 375 ---------------TSLKGGEINEQD 386 (386) T ss_pred ---------------CCCCCCCCCCCC Confidence 000000000000 No 80 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=100.00 E-value=5.2e-66 Score=378.47 Aligned_cols=366 Identities=12% Similarity=0.107 Sum_probs=241.5 Q ss_pred hhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcce Q lcl|NC_012530. 34 KALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGY 113 (559) Q Consensus 34 ~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~ 113 (559) =..+++-+.+... ..+....+.........++++++|++||++||++||++|+.+++...+.+. T Consensus 1 Mg~f~~~~~~~~~----------------~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~IA~~iA~lp~~~~~~~~~~~~ 64 (378) T protein:vir:94 1 MNLFGKVVSFSRG----------------KLNNDTQRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVG 64 (378) T ss_pred CCccccchhcccc----------------cccCCcceeeeeccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccCcc Confidence 1111111111000 000000000000112345788999999999999999999865443332111 Q ss_pred eeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEEC-CCCcEEEEEEecCceE Q lcl|NC_012530. 114 QVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYD-SNGRLSHTRMVDPTTI 192 (559) Q Consensus 114 ~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd-~~G~~~~L~~l~p~~V 192 (559) .. ......++....|++..||+++++ ++|++.++.+++++||+|++++++ ..|+++.|+|... T Consensus 65 ~~----------~~~~~~~~~l~~lL~~~PN~~~t~---~~f~~~~~~~lll~Gna~i~~~~~~~~g~~~~l~p~~~--- 128 (378) T protein:vir:94 65 SD----------TLISMAGSDLDEVLNWSPKGERNS---MDFWRKVIKKLLSAPYVDLYAVFDDNTGELLDLLFADD--- 128 (378) T ss_pred cc----------cccccccchHHHHHhhcCCCCCCH---HHHHHHHHHHHhhcCceEEEEEeeCCCceEEEEEecCC--- Confidence 00 001111122233444456666654 689999999999999999998765 4577777766321 Q ss_pred EEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_012530. 193 YFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGG 272 (559) Q Consensus 193 ~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~ 272 (559) ...|.++||||++. | .++..|+||++.+++++..+ +.+ + T Consensus 129 -------------------------~~~~~~~diiH~~~-~----~~~~~g~s~l~~~~~~i~~~----------~~~-~ 167 (378) T protein:vir:94 129 -------------------------KKEYKPEELVRLTS-P----FYINEDTSILDNALASIQTK----------LEQ-G 167 (378) T ss_pred -------------------------eeEeeeeeeEEecC-c----CCccchhHHHHHHHHHHHHH----------Hhc-c Confidence 12356789999973 3 23456999999998877532 344 4 Q ss_pred CCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhC Q lcl|NC_012530. 273 TTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVA 351 (559) Q Consensus 273 ~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fg 351 (559) .|+|+|++++...+ +..+++++++++.|+..+.| .++|++++|+ ++++|++++. +.|+|+ +.+++++++||++|| T Consensus 168 ~~~gil~~~~~l~~-~~~~~~~~~~~~~~~~~~~~-~~~g~~~vl~-~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fg 243 (378) T protein:vir:94 168 KLRGLLKINAFLDI-DNTQEYREKALTTIKNMQEG-SSYNGLTPVD-NKTEIVELKKDYSVLNK-DEIDLIKSELLTGYF 243 (378) T ss_pred cccceeeeCCcCCH-HHHHHHHHHHHHHHHHhhcc-cccccceecC-CCceEEEccCChhhhhH-HHHHHHHHHHHHHhC Confidence 68999998754321 22344455555666555555 5788887775 5699999985 789997 667899999999999 Q ss_pred CCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCc---------cceeeecchh Q lcl|NC_012530. 352 MDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGD---------NYMLEFVGGD 422 (559) Q Consensus 352 VPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~---------~~~~~f~~l~ 422 (559) |||++|+ + .+.+++...|+++||.||+++||++|+++||++.++. .++|+++.++ T Consensus 244 VP~~~l~----~------------~~se~~~~~f~~~tL~P~~~~ie~~l~~~Ll~~~er~~g~~~~~~~~~~f~~~~l~ 307 (378) T protein:vir:94 244 MNENILL----G------------TASQEQQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFK 307 (378) T ss_pred CCHHHhc----C------------ChHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeecchhhh Confidence 9999994 1 2346778899999999999999999999999864321 2567777899 Q ss_pred hhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCC Q lcl|NC_012530. 423 TRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGT 501 (559) Q Consensus 423 ~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (559) +.|.+++++++..+++ |+||+||+|+++||||+||||+++++.|+++++........ ... T Consensus 308 ~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gGD~~~~~~n~~~~~~~~~~~~~-------------------~~~ 368 (378) T protein:vir:94 308 FATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDVYIANLNAVAVKNLSDLQGS-------------------RKD 368 (378) T ss_pred hcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeecccccccccchhhcCC-------------------cCC Confidence 9999999999998886 57999999999999999999999999999887654322110 000 Q ss_pred CCCCCccccc Q lcl|NC_012530. 502 PPTLPPSSSN 511 (559) Q Consensus 502 ~~~~~~~~~~ 511 (559) .++.++.+.+ T Consensus 369 ~~~~~e~~n~ 378 (378) T protein:vir:94 369 VTSTDETNNQ 378 (378) T ss_pred CCCCCCCCCC Confidence 0000111111 No 81 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=100.00 E-value=1.9e-65 Score=375.42 Aligned_cols=367 Identities=9% Similarity=0.030 Sum_probs=254.0 Q ss_pred HHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHh Q lcl|NC_012530. 25 SKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRA 104 (559) Q Consensus 25 ~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~ 104 (559) +.++.+ +++|..+... . ..+ .... ...-..|+.+++|++||++||++||++|+.+ T Consensus 1 Mg~f~~-----l~~~~~~~~~-~---~~~------~~~~----------~~~~~~~l~~~~v~~~i~~Ia~~ia~~p~~~ 55 (376) T protein:vir:78 1 MGFFSE-----LFKRNKEIEW-M---WDL------DFLE----------DKTTKVYLKKMALNTCVKHIARTIAKSDFRL 55 (376) T ss_pred Cchhhh-----hhccCCcccc-c---cch------hhcc----------ccchhhhhhhHHHHHHHHHHHHhhcccceee Confidence 444433 2223221100 0 000 0000 0123457789999999999999999999765 Q ss_pred hhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEE Q lcl|NC_012530. 105 STDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHT 184 (559) Q Consensus 105 ~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L 184 (559) ++.. ...+ +.+ ..|++..||+++++ ++|++.++.+++++||+|+++.|+..|.+.++ T Consensus 56 ~~~~-------------~~~~------~~l-~~ll~~~PN~~~t~---~~f~~~~~~~lll~Gn~~~~~~r~~~~~~~~~ 112 (376) T protein:vir:78 56 KNGE-------------TSVR------DKL-YYKLNIRPNTDMSS---SSFWEKVIYKLIYDNECLIVLSDTDDFLIADS 112 (376) T ss_pred cccc-------------cccc------chH-HHHHhhccccCCCH---HHHHHHHHHHHhHcCcEEEEEEeCCCeeeccc Confidence 4311 0011 122 23344456656554 68999999999999999999999999999999 Q ss_pred EEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_012530. 185 RMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFN 264 (559) Q Consensus 185 ~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~ 264 (559) ||+.+..+...... .+...+......|+++||||++++..+. ..++.+++..+...+.... . T Consensus 113 ~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~evih~~~~~~~~---~~~~~~~~~~~~~~~~~~~-----~ 174 (376) T protein:vir:78 113 YVRKEFAFFPDVFE----------GVTVKDYRYNRNFSMDDVIFLEYGNERL---SAFTDGMFEDYGELFGKMI-----R 174 (376) T ss_pred eeecccceeeeeee----------eeeeecceeeeeeccccEEEeccCCCCc---hhhhhHHHHHHHHHHHHHH-----H Confidence 99998876543211 1111222334578999999998754321 1223233333332222211 1 Q ss_pred HHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccc-cch-----hHHHHH Q lcl|NC_012530. 265 DRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQ-AED-----MQFQSW 338 (559) Q Consensus 265 ~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~-~~D-----~qf~e~ 338 (559) ..++.||.++.+++.. +..+++++.+++++.|++.++|..+.++.+++.++|++|++++. +.| +||+|+ T Consensus 175 ~~~~~~~~~~~~~~~~-----~~~~~~e~~~~~~~~~~~~~~g~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~ 249 (376) T protein:vir:78 175 AQMRNFQIRGAVNFKM-----AGVADKDKQTKLQEYIDKVYASFNNNEIAIVPQLEGFNYEEFGTTSVNNSQSFDEVKKL 249 (376) T ss_pred HHHhcCCCceeEEEcc-----CCCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceEEeeccCccccchhHHHHHHH Confidence 2233444443333322 34688999999999999999997554443443466799999974 544 599999 Q ss_pred HHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeee Q lcl|NC_012530. 339 LNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEF 418 (559) Q Consensus 339 ~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f 418 (559) +++++++||++|||||++||. +++|++++...|+++||.||+.+||++||++|+++.+ ..+.|++ T Consensus 250 ~~~~~~~Ia~~fgVPp~~l~~--------------~~s~~e~~~~~f~~~~l~P~~~~ie~~l~~kll~~~~-~~~~~~~ 314 (376) T protein:vir:78 250 RKEMIDYVASILGIPSSLLHG--------------DMADLSNNMKAYMEYCIDPLTKKLEDELNAKLFTFSE-FLAGEHI 314 (376) T ss_pred HHHHHHHHHHHhCCCHHHhCC--------------CCCCHHHHHHHHHHHHHHHHHHHHHHHHHhhhCCccc-ceecccc Confidence 999999999999999999962 3568899999999999999999999999999998643 3456677 Q ss_pred cchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCC--CEeeccceecccccccccc Q lcl|NC_012530. 419 VGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGG--DIILSAVYIQRLGQQEQIK 477 (559) Q Consensus 419 ~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gG--D~~~~~~~~~~l~~~~~~~ 477 (559) +.+++.|.+++++++..++. |+||+||+|+++|+||+||| |+++++.|+++++...+.+ T Consensus 315 ~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~lg~~p~~~g~~d~~~~~~n~~~~~~~~e~g 376 (376) T protein:vir:78 315 KIIHKKDIIENAEAVDKLVASGSFNRNEVRELLGAERVDNPELDKYLITKNYQSADEGGEDG 376 (376) T ss_pred hhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeeccCceehhccccCC Confidence 78899999999999998886 56899999999999999987 9999999998876432222 No 82 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=100.00 E-value=1.4e-65 Score=376.21 Aligned_cols=366 Identities=11% Similarity=0.087 Sum_probs=242.4 Q ss_pred HHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHh Q lcl|NC_012530. 25 SKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRA 104 (559) Q Consensus 25 ~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~ 104 (559) +.++. +-+.... ...+........-.....++++++|++||++||++||++|+.+ T Consensus 1 Mg~f~---------~~~~f~~----------------~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~Ia~~iA~lp~~~ 55 (378) T protein:vir:93 1 MNLFG---------KVVSFSR----------------GKLNNDTQRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNH 55 (378) T ss_pred Cccch---------hhhhhhc----------------cccCCCcceeeecccchhHHHHHHHHHHHHHHHhhhhhCceee Confidence 11111 1110000 0000000000000112245688999999999999999999866 Q ss_pred hhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECC-CCcEEE Q lcl|NC_012530. 105 STDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDS-NGRLSH 183 (559) Q Consensus 105 ~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~-~G~~~~ 183 (559) ++..++.+...+ ......+....|++..||+++++ ++||+.++.+++++||+|++++++. .|+++. T Consensus 56 ~~~~~~~~~~~~----------~~~~~~~~l~~lL~~~PN~~~t~---~~f~~~~~~~lll~Gn~~i~~~~~~~~g~~~~ 122 (378) T protein:vir:93 56 VKYKKSDVGSDT----------LISMAGSDLDEVLNWSPKGERNS---MDFWRKVIKKLLRAPYVDLYAVFDDNTGELLD 122 (378) T ss_pred EEEccccccccc----------ccccccchHHHHHhhcCCCCCCH---HHHHHHHHHHHhhcCceEEEEEeecCCceEEE Confidence 544333221111 01111122334444456666654 6899999999999999999998874 366666 Q ss_pred EEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_012530. 184 TRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELF 263 (559) Q Consensus 184 L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~ 263 (559) |+|... ...|+++||||++ +| .++..|.||+..++.++. T Consensus 123 l~~~~~----------------------------~~~~~~~diih~r-~~----~~~~~~~s~l~~~~~~i~-------- 161 (378) T protein:vir:93 123 LLFADD----------------------------KKEYKTEELVRLT-SP----FYINEDTSILDNALASIQ-------- 161 (378) T ss_pred EEecCC----------------------------eeEeccceeEEec-Cc----cccchhhHHHHHHHHHHH-------- Confidence 655321 1246789999996 23 234458999988776653 Q ss_pred HHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccc-cchhHHHHHHHHH Q lcl|NC_012530. 264 NDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQ-AEDMQFQSWLNYL 342 (559) Q Consensus 264 ~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~-~~D~qf~e~~~~~ 342 (559) .+|.+| .|+|+|++++... .+..+++++++++.|++.++| .++|++++|+ ++++|++++. +.|+|+ +.++++ T Consensus 162 --~~~~~~-~~~g~l~~~~~l~-~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~l~-~g~~~~~l~~~~~~~~~-~~~~~~ 234 (378) T protein:vir:93 162 --TKLEQG-KLRGLLKINAFLD-IDNTQEYREKALTTIKNMQEG-SSYNGLTPVD-NKTEIVELKKDYSVLNK-DEIDLI 234 (378) T ss_pred --HHHhcC-cccceeeeCCcCC-HHHHHHHHHHHHHHHHHhhcc-cccccceEcC-CCceEEEccCChhhhhH-HHHHHH Confidence 355665 5899999875432 223344555566666555555 5778887775 4699999985 789997 667899 Q ss_pred HHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccC---------cc Q lcl|NC_012530. 343 INIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILG---------DN 413 (559) Q Consensus 343 ~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~---------~~ 413 (559) +++||++|||||++|+ ..+.+++...|++.||.||+++||++||++||++.+. .. T Consensus 235 ~~~Ia~~fgVPp~~l~----------------g~~~e~~~~~f~~~tl~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~ 298 (378) T protein:vir:93 235 KSELLTGYFMNENILL----------------GTATQEQQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYER 298 (378) T ss_pred HHHHHHHhCCCHHHhc----------------CCcHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccc Confidence 9999999999999994 1234677889999999999999999999999986432 13 Q ss_pred ceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceeccccccccccccccccccccccccc Q lcl|NC_012530. 414 YMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLE 492 (559) Q Consensus 414 ~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 492 (559) ++|+++.+++.|.++++++++.++. |+||+||+|+++||||+||||+++++.|.++++......... T Consensus 299 ~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~------------ 366 (378) T protein:vir:93 299 IIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDVYIANLNAVAVKNLSDLQGSR------------ 366 (378) T ss_pred eeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeeccccccccchhhhcCcc------------ Confidence 5677788999999999999999986 568999999999999999999999999998886543321100 Q ss_pred ccCCCCCCCCCCCCccccc Q lcl|NC_012530. 493 SALQNPSGTPPTLPPSSSN 511 (559) Q Consensus 493 ~~~~~~~~~~~~~~~~~~~ 511 (559) ..+++.++.+.+ T Consensus 367 -------~~~~~~~e~~n~ 378 (378) T protein:vir:93 367 -------KDVTSTDETNNQ 378 (378) T ss_pred -------CCCCCCCCCCCC Confidence 000111111111 No 83 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=100.00 E-value=5.7e-65 Score=372.78 Aligned_cols=381 Identities=11% Similarity=0.035 Sum_probs=256.3 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+||++|..+- + + . .+... +... ......+.+ T Consensus 1 MGlf~~~~~~~-~---------------------------~-~-~~~~~--------~~~~----------~~~~~~~~~ 32 (395) T protein:vir:98 1 MGILDFFSFKK-S---------------------------G-T-LSDDD--------SGST----------TSEKLTNVV 32 (395) T ss_pred CcchhhhcCCC-c---------------------------c-c-ccccc--------cchh----------hhhhcchhh Confidence 99999994321 0 0 0 00000 0000 001122446 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.+++..++. .. ++....|++..||++++. ++|++.++ T Consensus 33 ~~~~~v~~~I~~ia~~iA~lp~~~~~~~~~~-----------~~-------~~~~~~lL~~~PN~~~t~---~~f~~~~~ 91 (395) T protein:vir:98 33 LKEDALYKCVNYLARIISKSTFRLKTPEKLT-----------EN-------QKDWLYWINTKANPNQSA---SQFWVEVI 91 (395) T ss_pred hhhHHHHHHHHHHHHHHhhCceeEEecCCcc-----------cc-------cchHHHHHhhcCCCCCCH---HHHHHHHH Confidence 7899999999999999999998765432210 00 112344555567766654 68999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) .+++++||||++++++..+. + ++..+.... ..+.. ..++.........++.++||||++++.. +. . T Consensus 92 ~~lll~Gnayi~~~~~~~~~-----~-~~~~~~~~~-~~~~~----~~~~~~~~~~~~~~~~~~evih~k~~~~-~~--~ 157 (395) T protein:vir:98 92 QKLLVDGETLIFVIPGKGIY-----V-ADSFTQDKK-ISGSQ----FKVSRVQGQTYEKTFTFDQVIYLKNDNS-DL--M 157 (395) T ss_pred HHHhhcCceEEEEEeCCcee-----c-CCccccccc-ccCcc----cceeeecCceeeeEecCccEEEecCCCC-Cc--c Confidence 99999999999999975432 2 222222111 01100 1122222222346789999999986543 22 2 Q ss_pred cccccHHHHHHHHHHHHHHH--HHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcc-cccccccc Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENT--ELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGIN-GAYRIPMI 317 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~--~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~-nag~~~vl 317 (559) .++.+++..+...+...+.. ..+..++|.++..+.+++..... ..++++.++.++.|+..+++.. +.+++ ++ T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~v-~~ 232 (395) T protein:vir:98 158 SKVESLWEEYGELLGHVINNQKIANQIRFTMIPPKDKVRERAQEN----SDGGRQSKSDKDFFKRTVEKIRTESVVG-IP 232 (395) T ss_pred ccccchhhhHHHHHHHHHHHHHHHHHHHHhhcccccccccccccc----CCcHHHHHHHHHHHHHHHhhhhcCCcce-ee Confidence 23444455555555444333 44556788888888888765432 2345666777888888776643 33333 33 Q ss_pred cCCceeeeeccc-------cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHh Q lcl|NC_012530. 318 TAEDAKFVSMTQ-------AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGL 390 (559) Q Consensus 318 ~~g~~~~~~ls~-------~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l 390 (559) .++|++|++++. +.|+||++++++++++||++|||||++||. +++|.+++...|+++|| T Consensus 233 l~~g~~~~~l~~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l~~--------------~~sn~e~~~~~f~~~tl 298 (395) T protein:vir:98 233 VTANTNYEEYGSKNTGAVKSYVDDIKKLKDQYMAEFAEMLGIPISLLHG--------------DIADNQKNYELLLEGPI 298 (395) T ss_pred cCCCceeEecccccccccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcC--------------CcccHHHHHHHHHHHHH Confidence 466799999863 357899999999999999999999999961 35789999999999999 Q ss_pred hHHHHHHHHHHHhhcccccc-CccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCC--CCEeeccce Q lcl|NC_012530. 391 MPLLDMIAKNLTNGIIRQIL-GDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAG--GDIILSAVY 466 (559) Q Consensus 391 ~P~~~~ie~~ln~~L~~~~~-~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~g--GD~~~~~~~ 466 (559) .||+.+||++||++||++.+ ..+++|+|+.+++.|.+++++++..+++ |+||+||+|+++|+||++| ||++++++| T Consensus 299 ~P~~~~ie~~l~~kll~~~~~~~g~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~Pi~~~~gD~~~~~~n 378 (395) T protein:vir:98 299 ESLITNIVDGLEYAIFDKSETLQGSFIKVTGLKNYDLFSISNQADKLISSGFVFIDEVREEIGLPELPDGLGKVLYMTKN 378 (395) T ss_pred HHHHHHHHHHHHHhcCChhhhcCcceeeehhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeeccc Confidence 99999999999999998765 3567899999999999999999999986 5689999999999999977 999999999 Q ss_pred ecccccccccccccccccccccccccccCCCCCCCCC Q lcl|NC_012530. 467 IQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPP 503 (559) Q Consensus 467 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (559) ++++........ ++.++ T Consensus 379 ~~~~~~~gge~~--------------------~~~~~ 395 (395) T protein:vir:98 379 YESVLERGGEVD--------------------EEVET 395 (395) T ss_pred ceecccccCCCC--------------------CCCCC Confidence 988753100000 00000 No 84 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=100.00 E-value=2.6e-65 Score=374.68 Aligned_cols=377 Identities=11% Similarity=0.028 Sum_probs=247.2 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+|||+|+-+- .+... . ..+... .+ ...-..+ T Consensus 1 Mgl~d~~~~~~----------------------------~~~~~--~--------~~~~~~---~~-------~~~~~~~ 32 (395) T protein:vir:96 1 MGILDFFSFKK----------------------------SGTLS--D--------DDSGST---TS-------EKLTNVV 32 (395) T ss_pred CcchhhhcCCC----------------------------Ccccc--c--------cccccc---hh-------hhcchhh Confidence 99999983310 00000 0 000000 00 0122456 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +++++|++||++||++||++|+.+++..+. .. . ++....|++..||++++. ++||+.++ T Consensus 33 l~~~~v~~~i~~Ia~~ia~lp~~v~~~~~~-----------~~------~-~~~~~~lL~~~PN~~~t~---~~f~~~l~ 91 (395) T protein:vir:96 33 LKEDALYKCVNYLARIISKSTFRIKAPEKL-----------TE------N-QKDWLYWINTKANPNQSA---SQFWVEVV 91 (395) T ss_pred hhhHHHHHHHHHHHHhhccceeEEEeCCcc-----------cc------c-cchHHHHHhhcCCCCCCH---HHHHHHHH Confidence 789999999999999999998765432110 00 1 122234555567766654 68999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) .+++++||+|++++|+..+.+...++.... . .+.. ...+..........++++||||+++++.... T Consensus 92 ~~lll~Gna~~~~~~~~~~~~~~~~~~~~~-----~--~~~~----~~~v~~~~~~~~~~~~~~dvih~k~~~~~~~--- 157 (395) T protein:vir:96 92 QKLLVDGETLIFVIPGKGIYVADAFTQDKK-----L--SGNK----FKVSRVQGQTYEKIFTFDQVIYLKNDNSDLM--- 157 (395) T ss_pred HHHhhcCceEEEEEcCCceecCCccccccc-----c--ccce----eeeeeeccceeeeEeccCceEEecccCCccc--- Confidence 999999999999999864322222221100 0 0000 0111112222346789999999987653221 Q ss_pred cccccHHHHHHH------HHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcc-cccc Q lcl|NC_012530. 241 GYGLSELEMGLR------EFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGIN-GAYR 313 (559) Q Consensus 241 ~~G~Spl~~~~~------~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~-nag~ 313 (559) .++.+++..+.. .+.....+.++..++|.+|+.|.+++..++. ..++ .+++.|++.+++.. +++. T Consensus 158 ~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~---~~~~~~~~~~~~~~~~~~~ 229 (395) T protein:vir:96 158 LKVESLWEEYGELLGHVINNQKIANQIRFTMTPPKDKVRERAQENSDGG-----RQPK---SDKDFFKRTIEKIRTESVV 229 (395) T ss_pred cccccccchHHHHHHHHHHHHHHHHHHHHHhhhcccccccceeeccCch-----hhHH---HHHHHHHHHHHHhhcCCcc Confidence 222222222222 2222334557888999999999999976532 2333 34444444443332 2333 Q ss_pred cccccCCceeeeeccc-cchhHHHHHHHHH------HHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHH Q lcl|NC_012530. 314 IPMITAEDAKFVSMTQ-AEDMQFQSWLNYL------INIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASK 386 (559) Q Consensus 314 ~~vl~~g~~~~~~ls~-~~D~qf~e~~~~~------~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~ 386 (559) ++++ ++|++|++++. +.|+|++|.+++. +++||++|||||++||- +++|++++.+.|+ T Consensus 230 v~~l-~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp~~l~~--------------~~sn~e~~~~~f~ 294 (395) T protein:vir:96 230 GIPV-TANTNYEEYGSKNTGSVKSYVDDIKKLKDQYMAEFAEMLGIPISLLHG--------------DIADNQKNYELLL 294 (395) T ss_pred eEEc-cCCceeEecccChhhhhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcC--------------CCccHHHHHHHHH Confidence 3334 66799999985 7899999888765 58999999999999961 3578999999999 Q ss_pred HHHhhHHHHHHHHHHHhhccccccC-ccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCC--CCEee Q lcl|NC_012530. 387 SKGLMPLLDMIAKNLTNGIIRQILG-DNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAG--GDIIL 462 (559) Q Consensus 387 ~~~l~P~~~~ie~~ln~~L~~~~~~-~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~g--GD~~~ 462 (559) +.||.||+.+||++|+++|+++.+. .+++|+|+.+++.|.++++++++.++. |+||+||+|+++||||++| ||+++ T Consensus 295 ~~~L~P~~~~ie~~l~~~Ll~~~e~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~~~~gD~~~ 374 (395) T protein:vir:96 295 EGPIESLITNIVDGLEYAIFDKSETLEGSFIKVTGLKNYDLFSISSQADKLISSGFVFIDEVREEIGLPELPDGLGKVLY 374 (395) T ss_pred HHHHHHHHHHHHHHHHhhcCChhhhcCceeEeecchhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceee Confidence 9999999999999999999986543 457899999999999999999999986 5689999999999999977 99999 Q ss_pred ccceecccccccccccccccccccccccccccCCCCCCCCC Q lcl|NC_012530. 463 SAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPP 503 (559) Q Consensus 463 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (559) .+.|++++.+.. ... .++.++ T Consensus 375 ~~~N~~~~~~~g---ge~-----------------~~~~~~ 395 (395) T protein:vir:96 375 MTKNYESVLERG---GEV-----------------DEEVET 395 (395) T ss_pred ecccceechhcc---CCC-----------------CCCCCC Confidence 999988775310 000 000000 No 85 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=100.00 E-value=3.9e-65 Score=373.71 Aligned_cols=366 Identities=12% Similarity=0.104 Sum_probs=241.5 Q ss_pred hhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcce Q lcl|NC_012530. 34 KALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGY 113 (559) Q Consensus 34 ~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~ 113 (559) =..+++-+.+.+ ...+......-.......++++++|++||++||++||++|+.+++...+.+. T Consensus 1 Mg~f~~~~~~~~----------------~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~~~ 64 (378) T protein:vir:16 1 MNLFGKVVSFSR----------------GKLNNDTQRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVG 64 (378) T ss_pred Cccchhhhhhhc----------------ccccCCcceeeecccchhhHHHHHHHHHHHHHHhhhhhCceeEEEEcccccc Confidence 111111110000 0000000000000112245688999999999999999999865544332211 Q ss_pred eeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCC-CcEEEEEEecCceE Q lcl|NC_012530. 114 QVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSN-GRLSHTRMVDPTTI 192 (559) Q Consensus 114 ~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~-G~~~~L~~l~p~~V 192 (559) .... .....+.+. .|++..||+++++ ++||+.++.+++++||+|++++|+.. |+++.|+|.+. T Consensus 65 ~~~~---------~~~~~~~l~-~lL~~~PN~~~t~---~~f~~~~~~~lll~Gna~i~~~~d~~~g~~~~l~~~~~--- 128 (378) T protein:vir:16 65 SDTL---------ISMAGSDLD-EVLNWSPKGERNS---MDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDLLFADD--- 128 (378) T ss_pred cccc---------cccccchHH-HHHhhcCCCCCCH---HHHHHHHHHHHhhcCceEEEEEeecCCceEEEEEecCC--- Confidence 1100 001112333 3444456666654 68999999999999999999999854 66666655321 Q ss_pred EEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_012530. 193 YFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGG 272 (559) Q Consensus 193 ~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~ 272 (559) ...|+++||||++. | .++..|.||++.++..+.. +|.+ + T Consensus 129 -------------------------~~~~~~~diih~r~-~----~~~~~~~s~l~~~~~~i~~----------~~~~-~ 167 (378) T protein:vir:16 129 -------------------------KKEYKPEELVRLTS-P----FYINEDTSILDNALASIQT----------KLEQ-G 167 (378) T ss_pred -------------------------eeEecccceEEecC-c----cCccchhHHHHHHHHHHHH----------HHhc-C Confidence 12356789999972 2 2344588999888876642 3444 4 Q ss_pred CCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhC Q lcl|NC_012530. 273 TTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVA 351 (559) Q Consensus 273 ~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fg 351 (559) .|+|+|++++...+ +..+++++++++.|++.+.| .|+|++++|+ ++++|++++. +.|+|+ +.+++++++||++|| T Consensus 168 ~~~g~l~~~~~l~~-~~~~~~~~~~~~~~~~~~~~-~~~g~~~vl~-~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fg 243 (378) T protein:vir:16 168 KLRGLLKINAFLDI-DNTQEYREKALTTIKNMQEG-SSYNGLTPVD-NKTEIVELKKDYSVLNK-DEIDLIKSELLTGYF 243 (378) T ss_pred ccceeeEeCCcCCH-HHHHHHHHHHHHHHHHhhcc-cccccceEcC-CCceEEEccCChhhhhH-HHHHHHHHHHHHHhC Confidence 68999988754322 23344555666666555554 5788887775 4599999985 789997 456899999999999 Q ss_pred CCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccC---------ccceeeecchh Q lcl|NC_012530. 352 MDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILG---------DNYMLEFVGGD 422 (559) Q Consensus 352 VPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~---------~~~~~~f~~l~ 422 (559) |||.+|+ ..+.+++...|++.||.||+++||++|+++||++.+. ..++|+++.++ T Consensus 244 VPp~~l~----------------g~~~e~~~~~f~~~tl~P~~~~ie~~l~~kLl~~~e~~~~~~~~~~~~~~f~~~~l~ 307 (378) T protein:vir:16 244 MNENILL----------------GTASQEQQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFK 307 (378) T ss_pred CCHHHhc----------------CCchHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhhhhhhhcccccceeeccchhh Confidence 9999994 1234678889999999999999999999999986432 13567778899 Q ss_pred hhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCC Q lcl|NC_012530. 423 TRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGT 501 (559) Q Consensus 423 ~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (559) +.|.+++++++..++. |+||+||+|+++||||+||||++++|.|+++++......... .. T Consensus 308 ~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~~-------------------~~ 368 (378) T protein:vir:16 308 FATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDVYIANLNAVAVKNLSDLQGSR-------------------KD 368 (378) T ss_pred hcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeeccccccccchhhhcCcc-------------------CC Confidence 9999999999999886 568999999999999999999999999999886543321100 00 Q ss_pred CCCCCccccc Q lcl|NC_012530. 502 PPTLPPSSSN 511 (559) Q Consensus 502 ~~~~~~~~~~ 511 (559) ++..++.+.| T Consensus 369 ~~~~~e~~ne 378 (378) T protein:vir:16 369 VTSTDETNNQ 378 (378) T ss_pred CCCCCCCCCC Confidence 0111111111 No 86 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=100.00 E-value=3.6e-64 Score=368.38 Aligned_cols=382 Identities=10% Similarity=0.015 Sum_probs=251.3 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |||+|||+..|... .+. ..... ...+.. ......+.+ T Consensus 1 Mg~~~~~~~~~~~~-------------------------~~~-----~~~~~--~~~~~~-----------~~~~~~~~~ 37 (395) T protein:vir:40 1 MGFKSWVSGFFNEE-------------------------QRT-----LNLTD--TVWCSI-----------PSEKLKELS 37 (395) T ss_pred CchHHHHHhhhccc-------------------------ccc-----ccccc--chhhcc-----------ccccchhhh Confidence 99999995443211 000 00000 000000 001122456 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+++|++||++||++||++|+.+++... + .++....|++..||+++++ ++|++.++ T Consensus 38 l~~~~v~~~v~~Ia~~ia~~p~~~~~~~~------------~--------~~~~~~~lL~~~PN~~~t~---~~f~~~~~ 94 (395) T protein:vir:40 38 IKKWAIDSCANKIANTLSCAEVLTYEKGE------------E--------VRKKNWYMFNVEANQNQNA---TEFWKKAI 94 (395) T ss_pred hhhHHHHHHHHHHHHHHhhCceeeccCCc------------c--------ccchHHHHHHhcCCCCCCH---HHHHHHHH Confidence 78999999999999999999976543211 0 0111234555566666654 68999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEE-ecCc-eeeeecccceEEEecccCCCcc Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQY-IDNK-VRGSFTADEMGMFIRNPRSDIL 238 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~-~~~~-~~~~~~~~evi~~~~n~~~~~~ 238 (559) .+++++||+|+++.++. +++.++..+ ....... ..|..+ .++. ....|+++||||+++++... T Consensus 95 ~~lll~Gnay~~~~~~~------~~~~~~~~~-~~~~~~~------~~~~~v~~~~~~~~~~~~~~evih~r~~~~~~-- 159 (395) T protein:vir:40 95 YKLVYDNEALIFMQDEY------IYVADSFTK-NDKSLYE------NTYTEVTLKDLTLKKEFKESEVLHLTLNNESI-- 159 (395) T ss_pred HHHhhcCceEEEEecCc------eeecCCccc-ccccccc------ceeeeeeecCceeeeeeccccEEEeecCCCCc-- Confidence 99999999999988764 233222211 1111111 112211 2222 23568999999998765322 Q ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCc-ccccccccc Q lcl|NC_012530. 239 SGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGI-NGAYRIPMI 317 (559) Q Consensus 239 ~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~-~nag~~~vl 317 (559) ..++.+.+..+...+.... +..++.|+. ++++.++. +..+++++.+++++.|++.++|. .++++++++ T Consensus 160 -~~~~~~l~~~~~~~~~~~~-----~~~~~~~~~--~~~l~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vl 228 (395) T protein:vir:40 160 -KSIIDGFYLLYGDLLTAAV-----NKYKKLNSR--KIIVKLKA---MFGQTPEAEEKLRLMLSERMKKFLAEGDSALPV 228 (395) T ss_pred -cccchhHHHHHHHHHHHHH-----HHHHhcCCC--CceEEEec---ccCCCHHHHHHHHHHHHHHHHHhhccCCceeec Confidence 2234344444444433322 223344444 44454432 24578999999999999999875 456776665 Q ss_pred cCCceeeeeccc-cchhHHHHHHHHHH---HHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHH Q lcl|NC_012530. 318 TAEDAKFVSMTQ-AEDMQFQSWLNYLI---NIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPL 393 (559) Q Consensus 318 ~~g~~~~~~ls~-~~D~qf~e~~~~~~---~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~ 393 (559) +++++|++++. +.|+||+|+++++. ++||++|||||++||. +++|.+++...|++.||.|| T Consensus 229 -~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~Ia~~fgVPp~~l~~--------------~~sn~e~~~~~f~~~~L~P~ 293 (395) T protein:vir:40 229 -EDGMEIDELAGDSKIAESRDIKKMIDDVFEMVANSFNIPLGLAKG--------------DTVGLSEQVNSFLMFSINPI 293 (395) T ss_pred -CCCceEEeccCChhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcC--------------CCcCHHHHHHHHHHHHHHHH Confidence 55699999995 68999999999874 7999999999999961 35688999999999999999 Q ss_pred HHHHHHHHHhhccccccC---ccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCC--CCEeecccee Q lcl|NC_012530. 394 LDMIAKNLTNGIIRQILG---DNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAG--GDIILSAVYI 467 (559) Q Consensus 394 ~~~ie~~ln~~L~~~~~~---~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~g--GD~~~~~~~~ 467 (559) +++||++|+++||++.+. ..++|+++.+++.|.+++++++..++. |+||+||+|+++|+||++| ||+++++.|+ T Consensus 294 ~~~ie~~l~~kLl~~~~~~~g~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~~~~gD~~~~~~n~ 373 (395) T protein:vir:40 294 AEMFTDEGNRKFYGRDSVLERTYMKLDTTRIKVQDIQEIASSMDVLFHIGVNTIDDNLRMIGREPVMSPETQERFVTKNY 373 (395) T ss_pred HHHHHHHHHHhcCChhhhcCCceEEEechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCCceeeecccc Confidence 999999999999987543 345666678999999999999998886 5689999999999999965 9999999998 Q ss_pred cccccccccccccccccccccccccccCCCCCCCCCCCCcccc Q lcl|NC_012530. 468 QRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSS 510 (559) Q Consensus 468 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (559) ++++..... .++++.++ +..++ T Consensus 374 ~~~~~~~~~----------------~kgge~~~-----~~~~~ 395 (395) T protein:vir:40 374 APLGENEED----------------LKGGDINE-----NKGDS 395 (395) T ss_pred ccccccccc----------------cCCCCCCC-----CcCCC Confidence 877643211 01110000 00000 No 87 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=100.00 E-value=3.6e-63 Score=362.93 Aligned_cols=366 Identities=12% Similarity=0.110 Sum_probs=242.6 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) ||||.||+. ++ + .+...+.. +...+.. ...+ T Consensus 1 M~if~~~~~-----------------~~-~--~~~~~~~~------------------------~~~~~~~-----~~~~ 31 (378) T protein:vir:94 1 MNLFGKVVS-----------------FS-R--GKLNNDTQ------------------------RVTAWQN-----EAVE 31 (378) T ss_pred CchhHHhHh-----------------hh-h--cccccCcc------------------------eeeeeec-----chhh Confidence 999999952 10 0 00000100 0000000 0234 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +++++|++||++||++||++|+.+++...+.+..- .......+...+|++..||+++++ ++||+.++ T Consensus 32 ~~~~~v~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~----------~~~~~~~~~l~~lLn~~PN~~~t~---~~f~~~~~ 98 (378) T protein:vir:94 32 YTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSD----------TLISMAGSDLDEVLNWSSKGERNS---MEFWQKVI 98 (378) T ss_pred hhhHHHHHHHHHHHHhHhhCceeeeeecccccccc----------cccccccchHHHHHhhcCCCCCCH---HHHHHHHH Confidence 66789999999999999999986554433221100 001111122334555556666654 68999999 Q ss_pred HHHHHcCCcceEEE-ECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccC Q lcl|NC_012530. 161 RDTYTYDQVNYENT-YDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILS 239 (559) Q Consensus 161 ~d~ll~Gna~~~i~-rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~ 239 (559) .+++++||+|++++ ++..|++..+++.. + ...++++||||++. |... + T Consensus 99 ~~lll~Gnayi~~i~~~~~g~~~~~~~~~--------------------------~--~~~~~~~dvih~~~-~~~~--~ 147 (378) T protein:vir:94 99 KKLLTTRYIDLYPIFDSETGELLDLLFAN--------------------------D--KKEYKPEELVRLTS-PFYI--N 147 (378) T ss_pred HHHhhcCCeEEEEEeeCCCCcEEEEEEec--------------------------C--cEEechhceeeecC-cCCc--c Confidence 99999999999855 45667766554421 1 13467899999963 2111 1 Q ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccC Q lcl|NC_012530. 240 GGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITA 319 (559) Q Consensus 240 ~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~ 319 (559) -+.+++..+...+. ..+++| .++|+|+++...++ +..+++++++++.|++.++| .|+|++++| + T Consensus 148 --~~~~~~~~~~~~~~----------~~~~~~-~~~g~l~~~~~l~~-~~~~~~~e~~~~~~~~~~~~-~n~~~~~vl-~ 211 (378) T protein:vir:94 148 --EDTSILDNALASIQ----------TKLEQG-KLRGLLKINAFLDI-DNTQEYREKALATIKNMQEG-SSYNGLTPV-D 211 (378) T ss_pred --cchhHHHHHHHHHH----------HHHhhC-CcccceeeCCcCCH-HHHHHHHHHHHHHHHHhhcc-cccccceec-c Confidence 14566666655443 223444 68899988764332 23355667777777766665 467787777 4 Q ss_pred Cceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHH Q lcl|NC_012530. 320 EDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIA 398 (559) Q Consensus 320 g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie 398 (559) ++++|+++++ +.|+++ +.+++++++||++|||||++|+ + ...+++...|+++||.||+.+|| T Consensus 212 ~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgvPp~~l~----g------------~~~e~~~~~f~~~tl~P~~~~ie 274 (378) T protein:vir:94 212 NKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILL----G------------TATQEQQIYFYNSTIIPLLIQLE 274 (378) T ss_pred CCceEEEccCChHHhhH-HHHHHHHHHHHHHhCCCHHHhc----C------------CchHHHHHHHHHHHHHHHHHHHH Confidence 5699999985 789996 7789999999999999999994 1 12356778899999999999999 Q ss_pred HHHHhhccccccC---------ccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceec Q lcl|NC_012530. 399 KNLTNGIIRQILG---------DNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQ 468 (559) Q Consensus 399 ~~ln~~L~~~~~~---------~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~ 468 (559) ++||++||++.+. ..++|+++.+++.|.+++++++..++. |+||+||+|+++|+||+||||+++++.|++ T Consensus 275 ~~l~~~Ll~~~e~~~g~~~~~~~~~~f~~~~l~~~d~~~~~e~~~~~~~~G~~t~NE~R~~~g~~p~~ggd~~~~~~n~~ 354 (378) T protein:vir:94 275 KELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDVYIANLNAV 354 (378) T ss_pred HHHHhhcCChhHhhhhhhhcccceeEeecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeeccccc Confidence 9999999986432 125567788999999999999999986 568999999999999999999999999998 Q ss_pred ccccccccccccccccccccccccccCCCCCCCCCCCCccccc Q lcl|NC_012530. 469 RLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSN 511 (559) Q Consensus 469 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (559) +++.......... ..+..++.+.+ T Consensus 355 ~~~~~~~~~~~~~-------------------~~~~~~e~~n~ 378 (378) T protein:vir:94 355 AVKNLSDLQGNRK-------------------DVTSTDETNNQ 378 (378) T ss_pred chhcchhcccccC-------------------CCCCCCCCCCC Confidence 8876543311100 00000111111 No 88 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=100.00 E-value=1e-62 Score=360.40 Aligned_cols=363 Identities=11% Similarity=0.091 Sum_probs=231.5 Q ss_pred HHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHh Q lcl|NC_012530. 25 SKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRA 104 (559) Q Consensus 25 ~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~ 104 (559) +.++.+..+- .|.+... + ..+...+ + ..+.++++++|++||++||++||++|+.+ T Consensus 1 M~~f~k~~~~---~~~~~~~-----------~------~~~~~~~---~--~~~~~~~~~~v~~~v~~ia~~iA~lp~~~ 55 (378) T protein:vir:85 1 MNLFGKVVSF---SRGKLNN-----------D------TQRVTAW---Q--NEAVEYTSAFVTNIHNKIANEITKVEFNH 55 (378) T ss_pred Cchhhhhhhh---hhccccc-----------C------Ccceeee---e--ccchhhhhHHHHHHHHHHHHhHhhCceeE Confidence 2222211100 0000000 0 0001010 0 11345788999999999999999999876 Q ss_pred hhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEE-ECCCCcEEE Q lcl|NC_012530. 105 STDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENT-YDSNGRLSH 183 (559) Q Consensus 105 ~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~-rd~~G~~~~ 183 (559) ++......- .. ..+....+....|++..||+++++ ++||+.++.+++++||+|++++ ++..|++.. T Consensus 56 ~~~~~~~~~---------~~-~~~~~~~~~l~~lL~~~PN~~~t~---~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~ 122 (378) T protein:vir:85 56 VKYKKSDVG---------SD-TLISMAGSDLDEVLNWSYKGEHNS---MEFWQKVIKKLLCTRYVDLYPIFDSETGELLD 122 (378) T ss_pred EEEeccccc---------cc-cccccccchHHHHHhccCCCCCCH---HHHHHHHHHHHhhcCCeEEEEeecCCCceEEE Confidence 544332110 00 001111222334555556666654 6899999999999999999865 455666554 Q ss_pred EEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_012530. 184 TRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELF 263 (559) Q Consensus 184 L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~ 263 (559) +++.. + ...+.++|+||++. |.+ ..+ +.+++..+...+ T Consensus 123 ~~~~~--------------------------~--~~~~~~~dvih~~~-~~~--~~~--~~~~~~~a~~~~--------- 160 (378) T protein:vir:85 123 LLFAN--------------------------D--KKEYKPEELVRLVS-PFY--INE--DTSILDNALASI--------- 160 (378) T ss_pred EEecC--------------------------C--CEEEcccceEEEec-CcC--ccc--hhhHHHHHHHHH--------- Confidence 44321 1 12456789999873 211 111 334454444433 Q ss_pred HHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHh---cCcccccccccccCCceeeeeccc-cchhHHHHHH Q lcl|NC_012530. 264 NDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATS---SGINGAYRIPMITAEDAKFVSMTQ-AEDMQFQSWL 339 (559) Q Consensus 264 ~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~---~G~~nag~~~vl~~g~~~~~~ls~-~~D~qf~e~~ 339 (559) ..+|++ +.|+|+|++++. +++++.+++++.|++.+ .+..++|++++|+ ++++|++++. +.++++ +.+ T Consensus 161 -~~~~~~-~~~~g~l~~~~~-----l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~l~~~~~~~~~-~~~ 231 (378) T protein:vir:85 161 -QTKLEQ-GKLRGLLKINAF-----LDIDNTQEYREKALATIKNMQEGSSYNGLTPVD-NKTEIVELKKDYSVLNK-DEI 231 (378) T ss_pred -HHHHhc-CCcceEEEeCCc-----CCHHHHHHHHHHHHHHHHHhhcccccccceecC-CCceEEeccCChhhhhH-HHH Confidence 233444 478999988653 45555555555554432 2446788887775 4599999985 688886 678 Q ss_pred HHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCc------- Q lcl|NC_012530. 340 NYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGD------- 412 (559) Q Consensus 340 ~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~------- 412 (559) ++++++||++|||||++|+ .++.+++...|++.||.||+.+||++|+++||++.+.. T Consensus 232 ~~~~~~Ia~~fgVPp~~l~----------------~s~~e~~~~~f~~~tL~P~~~~ie~~l~~kLl~~~er~~~~~~~~ 295 (378) T protein:vir:85 232 ELIKSELLTGYFMNENILL----------------GTATQEQQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLY 295 (378) T ss_pred HHHHHHHHHHhCCCHHHhc----------------CCchHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhhhhhhhccc Confidence 9999999999999999994 12356778889999999999999999999999864321 Q ss_pred --cceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccc Q lcl|NC_012530. 413 --NYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLT 489 (559) Q Consensus 413 --~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 489 (559) .+.|+++.+++.|.+++++++..++. |+||+||+|+++||||+||||++++|.|+++++......... T Consensus 296 ~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~gGD~~~~~~N~~~~~~~~~~~~~~--------- 366 (378) T protein:vir:85 296 YERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDIYIANLNAVAVKNLSDLQGSR--------- 366 (378) T ss_pred cceeeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeecccccccccchhhcCcc--------- Confidence 24566678999999999999999986 568999999999999999999999999998886543221100 Q ss_pred cccccCCCCCCCCCCCCccccc Q lcl|NC_012530. 490 QLESALQNPSGTPPTLPPSSSN 511 (559) Q Consensus 490 ~~~~~~~~~~~~~~~~~~~~~~ 511 (559) ...+..++.+.+ T Consensus 367 ----------~~~~~~~e~~n~ 378 (378) T protein:vir:85 367 ----------KDVASTDETNNQ 378 (378) T ss_pred ----------CCCCCCCCCCCC Confidence 000111111111 No 89 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=100.00 E-value=6.2e-56 Score=323.25 Aligned_cols=331 Identities=13% Similarity=0.123 Sum_probs=232.7 Q ss_pred HHHHHHHHHHhhhhccccccc-----cccccccc---ccccccc---ccccccCCCCCcccHHHHHHHHhhChHHHHHHH Q lcl|NC_012530. 23 IDSKIANDTASKALNGVDRAY-----TEPVDGNL---MFSTLED---TSIVPKPSPIAFGRITDVLRQYSMNVVLNAIIN 91 (559) Q Consensus 23 ~~~~~~~~~~~~~~~gr~~a~-----~~~~~~~~---~~~~~~~---~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~ 91 (559) +-++....... ....+..++ .+|++... ++-.... +.+.+ | ...+..+.+.+..++.+.+||. T Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~e-p----p~~~~~La~l~~~n~~h~~~i~ 74 (348) T protein:vir:26 1 MTEQLIHSHTT-DGTESKSVYSFDPNPEPVDTNSWMTRYCELFYNDFDDYWE-P----PISLKGLAEIANANGYHGSLLK 74 (348) T ss_pred CCccccchhhc-cccCCceEEEecCCCeeecCcchHHHHHHHHhcCCCcccc-C----CCCHHHHHHHHhhhhhhhhhHh Confidence 11111111100 000011111 12222111 0000000 01111 1 1234445555556777777877 Q ss_pred HHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcce Q lcl|NC_012530. 92 TRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNY 171 (559) Q Consensus 92 ~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~ 171 (559) ++++.++. ...|++++++ .+|+ +++.|++++||+|+ T Consensus 75 ~k~N~l~~----------------------------------------~~~Pn~~~t~---~~f~-~~~~d~ll~Gnay~ 110 (348) T protein:vir:26 75 ARANYVAG----------------------------------------RFMNGGGLPM---YKMN-SACWDYFGLGMSAF 110 (348) T ss_pred hhhhHHhh----------------------------------------cccCCCCCCH---HHHH-HHHHHHHhcCCeEE Confidence 76655441 1135666665 3454 45679999999999 Q ss_pred EEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHH Q lcl|NC_012530. 172 ENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGL 251 (559) Q Consensus 172 ~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~ 251 (559) +++||..|+|++|+||+|.+|++..+. .|+++..++....|.++||||++. .+..++.||+||+..++ T Consensus 111 ~~~rn~~G~~~~L~~l~~~~v~~~~d~---------~~~~~~~~g~~~~f~~~dIiHir~---~~~~~~~~Gls~~~~a~ 178 (348) T protein:vir:26 111 VKIRSYLKNVIALEPLPMVHMRKRKNG---------DFVQLLRNNEQKVFKAKDVIFIPQ---YDPQQQIYGLPDYLGSI 178 (348) T ss_pred EEEEcCCCcEEEEEEecCceeEeeecC---------cEEEEEecCeEEEEcCccEEEEcC---CCCCCCcccccHHHHHH Confidence 999999999999999999999986542 134444556667899999999974 34456789999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc----CCceeeeec Q lcl|NC_012530. 252 REFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT----AEDAKFVSM 327 (559) Q Consensus 252 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~----~g~~~~~~l 327 (559) +++.++.+++.|+++||+||++|+|||.++. +.+++++++++|++|++. .|.+|++++.|+. .++++++|+ T Consensus 179 ~si~l~~~a~~~~~~~f~NGa~pg~Il~~~~----~~ls~e~~~~lk~~~~~~-~G~~n~~~~~vl~~~g~~~Gi~~~pi 253 (348) T protein:vir:26 179 QSSLLNRDATLFRRRYYLNGAHMGFIFYATD----PNLSEADEKALKEKIASS-KGIGNFRSMFVNIPNGKEKGIQLIPV 253 (348) T ss_pred HHHHHHHHHHHHHHHHHhccCCCceEEEecC----CCCCHHHHHHHHHHHHHh-cCcccccceeEEcCCCCccceeEEEc Confidence 9999999999999999999999999997643 468999999999999986 6888999977763 346899999 Q ss_pred cc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhcc Q lcl|NC_012530. 328 TQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGII 406 (559) Q Consensus 328 s~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~ 406 (559) +. ++|+||+|.+++++++||++|||||++||+.+.++. +++|++++.+.|+++||.||+++||++||++|. T Consensus 254 s~~~~d~qf~e~k~~t~~dIa~af~VPp~llGi~~~~~~--------~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~l~ 325 (348) T protein:vir:26 254 GDIATKDEFERIKNITAQDIFVGHRFPAGMGGMLPQQGA--------NVPDPLKVSQVYDFYEVIPVCKRFMDAVNNDPE 325 (348) T ss_pred cCChhHHHHHHHHHhhHHHHHHHhCCCHHHccccCCCCC--------ccccHHHHHHHHHHHHHHHHHHHHHHHHhhhhC Confidence 85 789999999999999999999999999998765432 467999999999999999999999999999886 Q ss_pred ccccCccceeeecc-hhhhhHHHHHHH Q lcl|NC_012530. 407 RQILGDNYMLEFVG-GDTRSQQDKLKS 432 (559) Q Consensus 407 ~~~~~~~~~~~f~~-l~~~d~~~~~~~ 432 (559) .+. ...++|+|+. .++.+..+ + T Consensus 326 ~~~-~~~~~fdl~~~~e~~~~~a---~ 348 (348) T protein:vir:26 326 IPD-NLKLKFNLNPGVESANGSA---V 348 (348) T ss_pred CCC-ccEEEEecCcccccchhhc---C Confidence 432 3345666653 22323222 2 No 90 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=100.00 E-value=1.3e-55 Score=321.42 Aligned_cols=339 Identities=14% Similarity=0.151 Sum_probs=236.7 Q ss_pred hhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccc----ccccccccc---ccccccccccccCCCCCcccHHHH Q lcl|NC_012530. 4 FDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAY----TEPVDGNLM---FSTLEDTSIVPKPSPIAFGRITDV 76 (559) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~----~~~~~~~~~---~~~~~~~~~~~~p~~~~~~~~~~~ 76 (559) .||=|.. ....... .-......+..++-.++ .+|++...+ +--....+-...| ...+..+ T Consensus 1 ~~~~~~~--~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~p----p~~~~~l 67 (351) T protein:vir:79 1 MSKRRSR--APRTFAA-------APNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEP----PVSFAGL 67 (351) T ss_pred CCCCCCC--CCCCCCC-------CCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhhcCceecC----CCCHHHH Confidence 1110000 0000000 00000000000111111 122222110 0000000000111 2234455 Q ss_pred HHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHH Q lcl|NC_012530. 77 LRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFL 156 (559) Q Consensus 77 ~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~ 156 (559) .+.+..++.+.+||.+.++.++. ...||+++++ .+|+ T Consensus 68 a~~~~~~~~h~~~l~~k~n~l~~----------------------------------------~~~Pnp~~t~---~~f~ 104 (351) T protein:vir:79 68 AKSFRASTHHSSALFFKANVLAS----------------------------------------TFRPHRWLSR---HAFE 104 (351) T ss_pred HHHHhhhHhhhhhhhhhhhHHhh----------------------------------------cccCCCCCCH---HHHH Confidence 56666677777787766654431 1135666665 4564 Q ss_pred HHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCC Q lcl|NC_012530. 157 RKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSD 236 (559) Q Consensus 157 ~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~ 236 (559) +++.|++++||+|++++||..|++++|+||+|.+|++..+.++ |+++..++....|.++||||++. .+ T Consensus 105 -~~v~d~ll~Gnay~~~~r~~~G~~~~L~~l~~~~v~~~~~~~~--------~~~~~~~g~~~~~~~~eIihir~---~~ 172 (351) T protein:vir:79 105 -RWALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFSG--------FVYVNGWQERHEFEPDSVFQLVR---PD 172 (351) T ss_pred -HHHHHHHhcCCeEEEEEECCCCCEEEEEEeCCcceeeeecCCe--------EEEEecCceEEEEcCccEEEeCC---CC Confidence 5678999999999999999999999999999999998776553 44455566677899999999974 24 Q ss_pred ccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccc Q lcl|NC_012530. 237 ILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPM 316 (559) Q Consensus 237 ~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~v 316 (559) ..++.||+||+..++.+|.++.+++.|+++||+||++|+|||.++. +.+++++.++++++|++ ..|.+|++++.| T Consensus 173 ~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~il~~~~----~~ls~e~~~~lk~~~~~-~~G~~N~~~~~v 247 (351) T protein:vir:79 173 INQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTD----AAQKQDDVDNMRDALKN-AKGPGNFRNVFM 247 (351) T ss_pred CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC----CCCCHHHHHHHHHHHHH-hcCccccCceeE Confidence 4567899999999999999999999999999999999999998753 46899999999999986 688999999877 Q ss_pred cc----CCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhh Q lcl|NC_012530. 317 IT----AEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLM 391 (559) Q Consensus 317 l~----~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~ 391 (559) +. .++++|+|++. ++|+||+|++++++++||++|||||.+||+.+.++. +++|++++.+.|++.||. T Consensus 248 ~~~~g~~~gi~~~pl~~~~~d~ef~e~k~~s~~eI~~a~~VPp~llGi~~~~t~--------~~~n~e~~~~~f~~~~l~ 319 (351) T protein:vir:79 248 YAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSG--------GFGTPDTAARVFGRNEIR 319 (351) T ss_pred ecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCC--------CcccHHHHHHHHHHHHHH Confidence 64 34699999995 799999999999999999999999999999876543 467999999999999999 Q ss_pred HHHHHHHHHHHhhccccccCccceeeecchhhhhHHH Q lcl|NC_012530. 392 PLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQD 428 (559) Q Consensus 392 P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~ 428 (559) ||+++||+ +|..|-. ..++|+...++++|.++ T Consensus 320 Pl~~~ie~-ln~~lg~----~~~~F~~~~llr~d~~a 351 (351) T protein:vir:79 320 PLQARFAE-LNDWLGD----EVVTFDDYEIPPAPVAA 351 (351) T ss_pred HHHHHHHH-HHhhcCc----ceeeeChhhhccccccC Confidence 99999985 7776521 23456556788888877 No 91 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=100.00 E-value=2.6e-55 Score=319.83 Aligned_cols=341 Identities=13% Similarity=0.140 Sum_probs=240.3 Q ss_pred ccccCCcchHHHH----------HHHHHHH----------HHHhhhhc--cccccc--c--ccccccccc---c-ccccc Q lcl|NC_012530. 10 KFYTDDPNAFFKH----------IDSKIAN----------DTASKALN--GVDRAY--T--EPVDGNLMF---S-TLEDT 59 (559) Q Consensus 10 ~~~~~~~~~~~~~----------~~~~~~~----------~~~~~~~~--gr~~a~--~--~~~~~~~~~---~-~~~~~ 59 (559) --.-+.+.+..++ +.++..- .....+.. ++--++ + +|++...+. . ..+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~fg~p~~v~~~~~~~~~~~~~~~~ 80 (376) T protein:vir:10 1 MPARDRPRAARRRRHSFIFIHGVLRMSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNG 80 (376) T ss_pred CCCCccchhhhhhcccchhhcccccchhccCCCcccchhhhhHhhhccCcceeEEEEcCCceeccCcchhhhhhhhhhcC Confidence 0111111121111 0000000 00000011 111111 1 122211100 0 00000 Q ss_pred cccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHH Q lcl|NC_012530. 60 SIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIE 139 (559) Q Consensus 60 ~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~ 139 (559) .+. . ...++..|.+.+..++.+.+||.++++.++. T Consensus 81 ~~~-~----pp~~~~~La~~~~~~~~h~s~l~~k~n~l~~---------------------------------------- 115 (376) T protein:vir:10 81 EWF-E----PPVSFAGLAKSFRASTHHSSALFFKANVLAS---------------------------------------- 115 (376) T ss_pred cee-c----CCCCHHHHHHHHhhhHHhhhhHHHHhHHHHh---------------------------------------- Confidence 010 1 1234455556666777888888777665441 Q ss_pred hcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceee Q lcl|NC_012530. 140 RMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRG 219 (559) Q Consensus 140 ~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~ 219 (559) ...||+++++ .+|+ +++.|++++||+|++++|+..|+|++|+||+|.+|++..+.++ |+++..++... T Consensus 116 ~~~Pnp~lT~---~~f~-~~v~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~d~~~--------~~~~~~~~~~~ 183 (376) T protein:vir:10 116 TFRPHRWLSR---HAFE-RWALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFNG--------FVYVNGWQERH 183 (376) T ss_pred ccCCCCCCCH---HHHH-HHHHHHHhcCCeEEEEEECCCCCEEEEEEeCCcceEEEeeCCe--------EEEEEcCCeEE Confidence 1125666665 3454 4567999999999999999999999999999999998876543 44455666677 Q ss_pred eecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHH Q lcl|NC_012530. 220 SFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKR 299 (559) Q Consensus 220 ~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~ 299 (559) .|.++||||++. .+..++.||+||+.+++.++.++.+++.|+.+||+||++|+|||.+++ +.++++++++|++ T Consensus 184 ~~~~~eViHir~---~~~~~~~yGls~~~~a~~si~l~~aa~~f~~~~f~NGa~pggIl~~~d----~~l~~e~~~~lr~ 256 (376) T protein:vir:10 184 EFEPDSVFQLVR---PDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTD----AAQKQDDVDNMRD 256 (376) T ss_pred EEccccEEEecC---CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC----CCCCHHHHHHHHH Confidence 899999999974 244567899999999999999999999999999999999999998753 4689999999999 Q ss_pred HHHHHhcCccccccccccc----CCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchh Q lcl|NC_012530. 300 HWTATSSGINGAYRIPMIT----AEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLN 374 (559) Q Consensus 300 ~~~~~~~G~~nag~~~vl~----~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~ 374 (559) +|++ +.|.+|+++++|+. .+|++|+|++. ++|+||+|++++++++||++|||||.+||+.+.++. + T Consensus 257 ~~~~-~~G~~N~~~~~vl~~~g~~~Gi~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~--------~ 327 (376) T protein:vir:10 257 ALKN-AKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSG--------G 327 (376) T ss_pred HHHH-hcCccccCceeEecCCCCccceEEEEccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCC--------C Confidence 9987 68999999987774 34799999995 799999999999999999999999999999886543 4 Q ss_pred hhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHH Q lcl|NC_012530. 375 ESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQD 428 (559) Q Consensus 375 ~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~ 428 (559) ++|++++.+.|+++||.||+++|| ++|.+|..+ .++|+...++++|.++ T Consensus 328 ~sn~eq~~~~f~~~~L~Pl~~~ie-eln~~L~~~----~~~F~~~~Llr~d~ka 376 (376) T protein:vir:10 328 FGTPDTAARVFGRNEIRPLQARFA-ELNDWLGEE----VVRFDDYEIPPAPVAA 376 (376) T ss_pred cccHHHHHHHHHHHHHHHHHHHHH-HHHhhcccc----ccccChhHhhcccccC Confidence 689999999999999999999998 488776332 3556666788888888 No 92 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=100.00 E-value=5.9e-56 Score=323.36 Aligned_cols=354 Identities=12% Similarity=0.057 Sum_probs=230.4 Q ss_pred HHHHHHHHHHhhhhcccccccccccccc---ccccccccccccccCCCCCcccHHHHHHHHh-----hChHHHHHHHHHH Q lcl|NC_012530. 23 IDSKIANDTASKALNGVDRAYTEPVDGN---LMFSTLEDTSIVPKPSPIAFGRITDVLRQYS-----MNVVLNAIINTRA 94 (559) Q Consensus 23 ~~~~~~~~~~~~~~~gr~~a~~~~~~~~---~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~-----~~~~v~acv~~ia 94 (559) +.|....+ ......++..+-....... -.........+....++.....+.+....+. +.|+-..|+.-+. T Consensus 1 m~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~fg~p~~~~~~~~~~~~~~~~~~~~~~~~pi~~~~la~~~ 79 (368) T protein:vir:79 1 MSRNKTRR-AARAASAHVRTANTDAPTEHHTDRAAQAEVFSFGDPVEVLDRRELLDYVECMRMGQWYEPPMPWDGLARSF 79 (368) T ss_pred CCcccccc-chhccCcccccccccCcchhhccccCceEEEEcCCceeecchhhHHHHHHHHhccchhccCcCHHHHHHHH Confidence 11110000 0000000000000000000 0000111111111111222222222222222 2222222221111 Q ss_pred HHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEE Q lcl|NC_012530. 95 NQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENT 174 (559) Q Consensus 95 ~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~ 174 (559) + ... .+.. .....+.+..++.+ ||+++++ ++|+ +++.|++++||+|++++ T Consensus 80 ~-----------~~~--------~h~~-----~~~~~~n~l~l~~~--Pn~~~t~---~~f~-~l~~d~ll~Gnay~~~~ 129 (368) T protein:vir:79 80 R-----------AAA--------HHSS-----AVYVKRNILVSTFI--PHPLLSR---ATFE-RLVLDWQVFGNAYLERR 129 (368) T ss_pred h-----------hcc--------ccch-----hhhhhcchhhhhcC--CCcCCCH---HHHH-HHHHHHhhcCCeEEEEE Confidence 1 000 0000 01111122233444 4555555 4564 47889999999999999 Q ss_pred ECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHH Q lcl|NC_012530. 175 YDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREF 254 (559) Q Consensus 175 rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i 254 (559) ||..|+|++|+||+|.+|++..+.+ .|++...++....|+++||||++. .+..++.||+||+.+++.++ T Consensus 130 r~~~G~~~~L~~l~~~~v~~~~~~~--------~~~~~~~~~~~~~~~~~dIihir~---~~~~~~~yGlsp~~~a~~si 198 (368) T protein:vir:79 130 ENVLGGTIRLDTPLAKYVRRGLDLN--------TYFFVQNWQQPYTFAAGSVFHLQE---PDINQEVYGLPEYLSALNAT 198 (368) T ss_pred EcCCCCEEEEEEeCcccceeeccCC--------EEEEEecCCeEEEEccccEEEecC---CCCCCCcccccHHHHHHHHH Confidence 9999999999999999998765532 244445566667899999999974 23456789999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc----CCceeeeeccc- Q lcl|NC_012530. 255 ISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT----AEDAKFVSMTQ- 329 (559) Q Consensus 255 ~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~----~g~~~~~~ls~- 329 (559) .++.+++.|+.+||+||++|+|||.++. +.++++++++++++|++ +.|.+|+|+++|+. +++++|+|++. T Consensus 199 ~l~~aa~~~~~~~~~NGa~~~gil~~~~----~~l~~e~~~~lk~~~~~-~~G~~N~g~~~vl~~~g~~~g~~~~pls~~ 273 (368) T protein:vir:79 199 WLNESATLFRRRYYKNGSHAGFILYMTD----AAQKQEDVDTLREAMKS-AKGPGNFRNLFMYAPNGKKDGIQLLPVSEV 273 (368) T ss_pred HHHHHHHHHHHHHHhccCCCceEEEeCC----CCCCHHHHHHHHHHHHH-hcCCcccCceeEecCCCCccceeEEEcCCC Confidence 9999999999999999999999998753 46899999999999987 68899999988874 35799999995 Q ss_pred cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccc Q lcl|NC_012530. 330 AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQI 409 (559) Q Consensus 330 ~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~ 409 (559) ++|+||+|++++++++||++|||||.+||+.+.++. +++|++++.+.|+++||.||+++|| ++|.+|.. T Consensus 274 ~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~--------~~sn~e~~~~~f~~~~l~Pl~~~ie-~ln~~l~~-- 342 (368) T protein:vir:79 274 AAKDEFWNIKNVTRDDQLAAHRVPPQLMGIIPNNTG--------GFGDVEKAAMVFARNEVKPLQDRLL-AINDWIGD-- 342 (368) T ss_pred HHHHHHHHHHHHhHHHHHHHhCCCHHHccccCCCCC--------ccccHHHHHHHHHHHHHHHHHHHHH-HHHhccCc-- Confidence 799999999999999999999999999999776542 4689999999999999999999998 68877633 Q ss_pred cCccceeeecchhhhhHHHHHHHH-HHH Q lcl|NC_012530. 410 LGDNYMLEFVGGDTRSQQDKLKSV-QLE 436 (559) Q Consensus 410 ~~~~~~~~f~~l~~~d~~~~~~~~-~~~ 436 (559) ..++|+-..+++.|.++++.-. +.+ T Consensus 343 --e~~rF~~~~l~~~D~~a~a~~~~rsa 368 (368) T protein:vir:79 343 --EVVRFAPYALGGHDQPAAAPGGQRSA 368 (368) T ss_pred --ceeeechhHhhcccccccCCcccccC Confidence 1344555567888888877521 222 No 93 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=100.00 E-value=6.2e-55 Score=317.76 Aligned_cols=328 Identities=13% Similarity=0.109 Sum_probs=230.5 Q ss_pred HHHHHHHHHHh--hhhccccccc--c--ccccccccccccccccccccCC-CCCcccHHHHHHHHhhChHHHHHHHHHHH Q lcl|NC_012530. 23 IDSKIANDTAS--KALNGVDRAY--T--EPVDGNLMFSTLEDTSIVPKPS-PIAFGRITDVLRQYSMNVVLNAIINTRAN 95 (559) Q Consensus 23 ~~~~~~~~~~~--~~~~gr~~a~--~--~~~~~~~~~~~~~~~~~~~~p~-~~~~~~~~~~~~~~~~~~~v~acv~~ia~ 95 (559) +-|..-.+... .+..++-.++ + +|++...+. ..+..-+.- -. +.....+..+.+.+..++.+.+||.+.++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~-~~~~~~~~~-~~~~~pp~~~~~la~l~~a~~~h~s~i~~k~n 78 (340) T protein:vir:98 1 MSKRKPRKAVAMTASAPQKMEAFTFGEPVPVLDKRDI-LDYVECISN-GKWYEPPVSFSGLAKSLRSAVHHSSPIYVKRN 78 (340) T ss_pred CCCCCCCccccccccCccceeEEEcCCceeecCcchh-hhhhhhhhc-CceecCCCCHHHHHHHHHhccccchhhhhhhh Confidence 00000000000 0000011111 1 111111100 000000000 00 01122344455555667777777776665 Q ss_pred HHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEE Q lcl|NC_012530. 96 QVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTY 175 (559) Q Consensus 96 ~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~r 175 (559) .++. ...|++++++. +| ++++.|++++||+|++++| T Consensus 79 ~l~~----------------------------------------~~~Pn~~lt~~---~f-~~~~~d~ll~Gnay~~~~r 114 (340) T protein:vir:98 79 VLAS----------------------------------------TYIPHPLLSRQ---DF-SRFALDYLVFGNAFLEQRH 114 (340) T ss_pred HHhh----------------------------------------ccCCCCCCCHH---HH-HHHHHHHHhcCCeEEEEEE Confidence 5442 11356666653 44 4667899999999999999 Q ss_pred CCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHH Q lcl|NC_012530. 176 DSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFI 255 (559) Q Consensus 176 d~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~ 255 (559) |..|++++|+|++|.+|++..+.+ .|+++..++....|.++||||++. .+..++.||+||+..++.++. T Consensus 115 n~~G~~~~L~pl~~~~vr~~~~~~--------~~~~~~~~~~~~~~~~~eViHir~---~~~~~~~~Gls~~~~a~~si~ 183 (340) T protein:vir:98 115 SVTGQLIKLLTSPAKYTRRGVDDS--------VFWFVENFTQPHEFAPDTVFHLLE---PDINQEIYGLPEYLSALNSAW 183 (340) T ss_pred CCCCcEEEEEEeCCceEEEcccCc--------EEEEEecCCeEEEEccccEEEEcC---CCCCCCcccccHHHHHHHHHH Confidence 999999999999999998755322 345555666677899999999974 244567899999999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc----CCceeeeeccc-c Q lcl|NC_012530. 256 SHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT----AEDAKFVSMTQ-A 330 (559) Q Consensus 256 ~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~----~g~~~~~~ls~-~ 330 (559) ++.+++.|+++||+||++|+|||.++. +.+++++++++|++|++ .+|.+|++++.|+. .++++++|++. + T Consensus 184 l~~aa~~~~~~~f~NGa~pg~il~~~~----~~ls~e~~~~lk~~~~~-~~G~~n~~~~~vl~~~g~~~g~~~~pls~~~ 258 (340) T protein:vir:98 184 LNESATLFRRKYYQNGAHAGYIMYVTD----PAQSATDVESLRDAMRN-SKGLGNFKNLFFYSPNGKPDGIKIVPLSEVA 258 (340) T ss_pred HHHHHHHHHHHHHhccCCCceEEEecC----CCCCHHHHHHHHHHHHH-hcCccccCceeEecCCCCccceEEEEcCCCh Confidence 999999999999999999999998753 46899999999999987 58999999987764 34799999995 7 Q ss_pred chhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhcccccc Q lcl|NC_012530. 331 EDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQIL 410 (559) Q Consensus 331 ~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~ 410 (559) +|+||+|++++++++||++|||||++||+.+.++. +++|++++.+.|+++||.||+++||+ +|.+|..+ T Consensus 259 ~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~--------~~sn~e~~~~~f~~~~l~Pl~~~iee-~n~~L~~e-- 327 (340) T protein:vir:98 259 TKDDFFNIKKASAADLMDAHRVPFQLMGGKPENIG--------SLGDVEKVAKVFVRNELSPLQDRFRE-VNDWLGME-- 327 (340) T ss_pred hHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCC--------ccccHHHHHHHHHHHHHHHHHHHHHH-HHhccccc-- Confidence 99999999999999999999999999999876543 46789999999999999999999995 88877432 Q ss_pred Cccceeeecchhhhh Q lcl|NC_012530. 411 GDNYMLEFVGGDTRS 425 (559) Q Consensus 411 ~~~~~~~f~~l~~~d 425 (559) .++|+-..+++.| T Consensus 328 --~~rF~~~~l~~~d 340 (340) T protein:vir:98 328 --VIRFKEYTLDNPE 340 (340) T ss_pred --ccccCccccccCC Confidence 2445445666777 No 94 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=100.00 E-value=4.4e-55 Score=318.60 Aligned_cols=339 Identities=14% Similarity=0.151 Sum_probs=236.4 Q ss_pred hhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccc----ccccccccc---ccccccccccccCCCCCcccHHHH Q lcl|NC_012530. 4 FDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAY----TEPVDGNLM---FSTLEDTSIVPKPSPIAFGRITDV 76 (559) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~----~~~~~~~~~---~~~~~~~~~~~~p~~~~~~~~~~~ 76 (559) .||=|.. ....... .-......+..++-.++ .+|++...+ +--....+-...| ...+..+ T Consensus 1 ~~~~~~~--~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~p----p~~~~~l 67 (351) T protein:vir:78 1 MSKRRSR--APRTFAA-------APNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEP----PVSFAGL 67 (351) T ss_pred CCCCCCC--CCCCCCC-------CCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhccCceecC----CCCHHHH Confidence 1110000 0000000 00000000000111111 122222111 0000000101111 2234455 Q ss_pred HHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHH Q lcl|NC_012530. 77 LRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFL 156 (559) Q Consensus 77 ~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~ 156 (559) .+.+..++.+.+||.+.++.++. ...|++++++ ++|+ T Consensus 68 a~~~~~~~~h~~~l~~k~n~l~~----------------------------------------~~~Pn~~~t~---~~f~ 104 (351) T protein:vir:78 68 AKSFRASTHHSSALFFKANVLAS----------------------------------------TFRPHRWLSR---HAFE 104 (351) T ss_pred HHHHhhhHhhhhhhhhhhhHHhh----------------------------------------cccCCCCCCH---HHHH Confidence 55556677777787766655431 1125666665 3454 Q ss_pred HHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCC Q lcl|NC_012530. 157 RKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSD 236 (559) Q Consensus 157 ~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~ 236 (559) .++.|+|++||+|++++||..|+|++|+||+|.+|++..+.++ |++...++....|.++||||++. .+ T Consensus 105 -~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~v~~~~~~~~--------~~~~~~~~~~~~~~~~eVihir~---~~ 172 (351) T protein:vir:78 105 -RWALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFSG--------FVYVNGWQERHEFAPDSVFQLVR---PD 172 (351) T ss_pred -HHHHHHHhcCCeEEEEEECCCCCEEEEEEecCcceEEeeeCCe--------EEEEecCCeEEEEccccEEEEcC---CC Confidence 5567999999999999999999999999999999998876653 34444556667899999999974 24 Q ss_pred ccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccc Q lcl|NC_012530. 237 ILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPM 316 (559) Q Consensus 237 ~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~v 316 (559) ..++.||+||+..++.++.++.++..|+++||+||++|+|||.++. +.++++++++++++|++ ..|.+|+|+++| T Consensus 173 ~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pggIl~~~~----~~ls~e~~~~lr~~~~~-~~G~~N~~~~~v 247 (351) T protein:vir:78 173 INQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTD----AAQKQDDVDNMRDALKN-AKGPGNFRNVFM 247 (351) T ss_pred CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC----CCCCHHHHHHHHHHHHH-hcCcccccceee Confidence 4567899999999999999999999999999999999999998753 46899999999999986 689999999877 Q ss_pred cc----CCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhh Q lcl|NC_012530. 317 IT----AEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLM 391 (559) Q Consensus 317 l~----~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~ 391 (559) +. .++++++|++. ++|+||+|++++++++||++|||||.+||+.+.++. +++|++++.+.|+++||. T Consensus 248 ~~~~g~~~g~k~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~--------~~sn~e~~~~~f~~~~l~ 319 (351) T protein:vir:78 248 YAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSG--------GFGTPDTAARVFGRNEIR 319 (351) T ss_pred ecCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCC--------CcccHHHHHHHHHHHHHH Confidence 74 34689999995 799999999999999999999999999999876542 468999999999999999 Q ss_pred HHHHHHHHHHHhhccccccCccceeeecchhhhhHHH Q lcl|NC_012530. 392 PLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQD 428 (559) Q Consensus 392 P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~ 428 (559) ||+++||+ ++.+|.. ..++|+...+++.|.++ T Consensus 320 P~~~~iee-~n~~l~~----~~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 320 PLQARFAE-LNDWLGD----EVVRFDDYEIPPAPVAA 351 (351) T ss_pred HHHHHHHH-HHhhcCc----cceecChhhhccccccC Confidence 99999995 6666532 23555555788888888 No 95 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=100.00 E-value=1.4e-54 Score=315.89 Aligned_cols=329 Identities=14% Similarity=0.165 Sum_probs=227.3 Q ss_pred hhhhccccccCCcchHHHHHHHHHHHHHHhhhhcccccc--c--c--cccccccc---ccccccccccccCCCCCcccHH Q lcl|NC_012530. 4 FDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRA--Y--T--EPVDGNLM---FSTLEDTSIVPKPSPIAFGRIT 74 (559) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a--~--~--~~~~~~~~---~~~~~~~~~~~~p~~~~~~~~~ 74 (559) .+|=+.. . ......++.....++ + + +|++...+ +.-....+....|+ ..+. T Consensus 1 m~~~~~~---------------~-~~~~~~~~~~~~~~~~~~~f~~p~~v~~~~~~~~~~~~~~~~~~~~pp----~~~~ 60 (344) T protein:vir:60 1 MSKKKGK---------------T-LQPAAKKMTASAPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPP----ISFT 60 (344) T ss_pred CCcccCC---------------C-CCchHHhhcCCcCcEEEEEcCCceeecCCcchhHHHHhhhcCccccCC----CCHH Confidence 0000000 0 000001111111111 1 1 11111110 00000001001111 1233 Q ss_pred HHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHH Q lcl|NC_012530. 75 DVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTS 154 (559) Q Consensus 75 ~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~ 154 (559) .+.+.+..++.+.+||.+.++-|+. ...||+++++ .+ T Consensus 61 ~la~~~~a~~~h~~~i~~k~n~l~~----------------------------------------~~~Pn~~~t~---~~ 97 (344) T protein:vir:60 61 GLAKSLRAAVHHSSPIYVKRNILAS----------------------------------------TFIPHPWLSQ---QD 97 (344) T ss_pred HHHHHHHhhhhhccchhhhhhHHHh----------------------------------------hccCCCCCCH---HH Confidence 3333334455555555544433321 1135666665 34 Q ss_pred HHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccC Q lcl|NC_012530. 155 FLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPR 234 (559) Q Consensus 155 f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~ 234 (559) | ++++.|++++||||++++||..|+|++|+||+|.+|++..+.+ .|+++..++....|.++||||++. T Consensus 98 f-~~~~~d~ll~Gnay~~i~rn~~G~~~~L~~l~~~~vr~~~~~~--------~~~~v~~~~~~~~~~~~eIiHir~--- 165 (344) T protein:vir:60 98 F-SRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGVEED--------VYWWVPSFNEPTAFAPGSVFHLLE--- 165 (344) T ss_pred H-HHHHHHHHhcCCeEEEEEECCCCcEEEEEEcCcceEEEeecCC--------eEEEEccCCeEEEEcCccEEEEcC--- Confidence 5 5788999999999999999999999999999999999876543 355566677778899999999974 Q ss_pred CCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccc Q lcl|NC_012530. 235 SDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRI 314 (559) Q Consensus 235 ~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~ 314 (559) .+..++.||+||+..++.++.++.+++.|+.+||+||++|+|||.++. +.++++++++++++|++.. |. ++|+. T Consensus 166 ~~~~~~~yGlsp~~~a~~si~l~~~a~~~~~~~f~NG~~pg~il~~~~----~~ls~e~~~~ik~~~~~~~-g~-~~~r~ 239 (344) T protein:vir:60 166 PDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTD----AVQDRNDIEMLRENMVKSK-GR-NNFKN 239 (344) T ss_pred CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC----cCCCHHHHHHHHHHHHHhc-CC-CCCcc Confidence 234567899999999999999999999999999999999999998753 4689999999999999875 44 66887 Q ss_pred cccc-----CCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHH Q lcl|NC_012530. 315 PMIT-----AEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSK 388 (559) Q Consensus 315 ~vl~-----~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~ 388 (559) ++|. .++++|+|++. ++|+||+|++++++++||++|||||++||+.+.++. +++|++++.+.|+++ T Consensus 240 ~~l~~p~g~~~g~~~~pis~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~--------~~~n~e~~~~~f~~~ 311 (344) T protein:vir:60 240 LFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVG--------SLGDIEKVAKVFVRN 311 (344) T ss_pred eEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCC--------ccccHHHHHHHHHHH Confidence 7774 35799999985 789999999999999999999999999999876543 468899999999999 Q ss_pred HhhHHHHHHHHHHHhhccccccCccceeeecchhhhhH Q lcl|NC_012530. 389 GLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQ 426 (559) Q Consensus 389 ~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~ 426 (559) ||.||+++|| +||.+|-. ..++|++..++..|. T Consensus 312 ~L~Pl~~~~e-~ln~~lg~----~~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 312 ELIPLQDRIR-EINGWLGQ----EVIRFKNYSLDTDNG 344 (344) T ss_pred HHHHHHHHHH-HHHHhcCC----cccccCccccCCCCC Confidence 9999999998 58887632 335666666666665 No 96 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=100.00 E-value=1.4e-54 Score=315.85 Aligned_cols=331 Identities=13% Similarity=0.136 Sum_probs=215.9 Q ss_pred HhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCc Q lcl|NC_012530. 32 ASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGM 111 (559) Q Consensus 32 ~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~ 111 (559) -+|... + ++.... ..+.....+.. |.+ +. ....+..|+...-+....+ T Consensus 1 m~~~~~-~-~~~~~~------~~~~~~~~~~~-p~~-----~~-------~~~~~~~~~~~~~~~~~~~----------- 48 (337) T protein:vir:78 1 MTKRQQ-Q-PAQAAA------SSPRPSVVFSM-PEA-----ID-------PTAWMTDYTGVFYNPYGEY----------- 48 (337) T ss_pred CCCccc-C-cccccc------cCceeEEEecC-ccc-----cc-------CcchhHhhhhhhhccCcce----------- Confidence 000000 0 000000 00010111111 111 10 0111122222222211110 Q ss_pred ceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChh-hHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCc Q lcl|NC_012530. 112 GYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRD-DFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPT 190 (559) Q Consensus 112 ~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~-~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~ 190 (559) |+ -+-+..... +......+....|. ..||+.++.. ...+++++++.|++++||||++++||..|+|++|+||+|. T Consensus 49 -~~-pP~~~~~La-~l~~~~~~h~~~L~-~k~N~~~~~f~~~~~~~~~~~~d~ll~GNay~~~~rn~~G~~~~L~pl~~~ 124 (337) T protein:vir:78 49 -YQ-PPIDRKGLA-KVARANAHHGAILM-ARRNMVAGRFTNQRATITAFVHNYLQFGDGGLLKLRNSFGQVVGLHPLSSV 124 (337) T ss_pred -ec-CCCCHHHHH-HHhhcchhhhhHHH-hhhccccccCcCcHHHHHHHHHHHHhhCCeEEEEEECCCCcEEEEEEeCCc Confidence 00 000000000 00000001111111 1223222211 1146788999999999999999999999999999999999 Q ss_pred eEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_012530. 191 TIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTH 270 (559) Q Consensus 191 ~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~n 270 (559) +|++..+.. |++...++....|+++||||++. .+..++.||+||+..++.++.++.+++.|+++||+| T Consensus 125 ~v~~~~d~~---------~~~~~~~~~~~~~~~~eIiHik~---~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~N 192 (337) T protein:vir:78 125 YLRRREDGC---------FVYLQQGKPNLIYRPDDVIWLAQ---YDPEQQVYGMPDYLGGLQSALLNQDATLFRRRYFLN 192 (337) T ss_pred eeEeeeCCe---------EEEEEcCCceEEECCccEEEECC---CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 998765421 22233455667899999999974 234567899999999999999999999999999999 Q ss_pred cCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc----CCceeeeeccc-cchhHHHHHHHHHHHH Q lcl|NC_012530. 271 GGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT----AEDAKFVSMTQ-AEDMQFQSWLNYLINI 345 (559) Q Consensus 271 g~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~----~g~~~~~~ls~-~~D~qf~e~~~~~~~~ 345 (559) |++|+|||.++. +.+++++++++++.|++ +.|.+|.+++.|+. +++++|+|++. ++|+||+|++++++++ T Consensus 193 Ga~p~~il~~~~----~~l~~e~~~~lk~~~~~-~~G~~n~~~~~v~~~~g~~~Gi~~~pis~~~~d~qfle~k~~s~~e 267 (337) T protein:vir:78 193 GAHMGFIFYATD----PNMDDDTEEEMKEMIAN-SKGVGNFRSMFVNIPDGKPDGIKLIPVGDIATKDEFAAIKGITAQD 267 (337) T ss_pred cCCCceeEEcCC----CCCCHHHHHHHHHHHHH-hcCcccccceEEEcCCCCccceeEEEcCCChhHHHHHHHHHHhHHH Confidence 999999998643 36899999999999986 67888888876664 34689999995 7899999999999999 Q ss_pred HHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchh Q lcl|NC_012530. 346 ICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGD 422 (559) Q Consensus 346 Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~ 422 (559) ||++|||||++||+.+.+.. .+++|++++.+.|+++||.||+++||+++|++|++......|++.-..++ T Consensus 268 Ia~a~~VPp~llGi~~~~~~-------~~~~n~e~~~~~f~~~~L~P~~~~ie~~~n~~ll~~~~~~~f~~~~~~~~ 337 (337) T protein:vir:78 268 VLTAHRYPPALAGIIPTNGG-------GGLGDPEKYDATYARNEVLPLCELVQDAINSAGLPRALWVTFRETIGAAV 337 (337) T ss_pred HHHHhCCCHHHcccccCCCc-------CccccHHHHHHHHHHHHHHHHHHHHHHHHhhhcCChhhceeccccccccC Confidence 99999999999998776532 23578999999999999999999999999999887543333333333344 No 97 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=100.00 E-value=1.1e-54 Score=316.32 Aligned_cols=331 Identities=14% Similarity=0.134 Sum_probs=229.8 Q ss_pred hhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccc--c--cccccccccc--cccc-ccccccCCCCCcccHHHH Q lcl|NC_012530. 4 FDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAY--T--EPVDGNLMFS--TLED-TSIVPKPSPIAFGRITDV 76 (559) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~--~--~~~~~~~~~~--~~~~-~~~~~~p~~~~~~~~~~~ 76 (559) .+|=+.. +.++. .....+..++-.++ + +|++...+.. ...+ .+-...| ...+..| T Consensus 1 ~~~~~~~-----------~~~~~---~~~~~~~~~~~~~~~f~~p~~v~~~~~~~~~~~~~~~~~~~~p----p~~~~~l 62 (344) T protein:vir:20 1 MSKKKGK-----------TPQPA---AKTMTASGPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEP----PVSFTGL 62 (344) T ss_pred CCcccCC-----------CCcch---hhhhhccCCceEEEEcCCceEecCcchhhhhhhhhhcCceecC----CCCHHHH Confidence 1111000 00000 00000011111111 1 2222111100 0000 0000111 1223344 Q ss_pred HHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHH Q lcl|NC_012530. 77 LRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFL 156 (559) Q Consensus 77 ~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~ 156 (559) .+.+..++.+.+||.+.++-++. .. .||+++++ .+| T Consensus 63 a~~~~a~~~h~~~i~~k~n~l~~--------------------------------------~~--~Pn~~lt~---~~f- 98 (344) T protein:vir:20 63 AKSLRAAVHHSSPIYVKRNILAS--------------------------------------TF--IPHPWLSQ---QDF- 98 (344) T ss_pred HHHHhhhhhhCccceehhhhHHH--------------------------------------hc--cCCCCCCH---HHH- Confidence 44444555555555554443331 11 25566665 345 Q ss_pred HHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCC Q lcl|NC_012530. 157 RKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSD 236 (559) Q Consensus 157 ~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~ 236 (559) ++++.|++++||||++++|+..|+|++|+||+|.+|++..+.+ .|+++..++....|.++||||++.. + T Consensus 99 ~~~~~d~ll~Gnay~~i~rn~~G~~~~L~pl~~~~vr~~~~~~--------~~~~~~~~~~~~~~~~~eIiHir~~---~ 167 (344) T protein:vir:20 99 SRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGVEED--------VYWWVPSFNEPTAFAPGSVFHLLEP---D 167 (344) T ss_pred HHHHHHHHhcCCeEEEEEECCCCcEEEEEEcCCceeEeeecCC--------EEEEEccCCeEEEEcCccEEEeCCC---C Confidence 5788899999999999999999999999999999999866543 3555666777788999999999742 3 Q ss_pred ccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccc Q lcl|NC_012530. 237 ILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPM 316 (559) Q Consensus 237 ~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~v 316 (559) ..++.||+||+..++.++.++.+++.|+.+||+||++|+|||.++. +.++++++++++++|++.. |. ++|+.++ T Consensus 168 ~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~p~~Il~~~d----~~l~~e~~~~ik~~~~~~~-g~-~n~r~l~ 241 (344) T protein:vir:20 168 INQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTD----AVQDRNDIEMLRENMVKSK-GR-NNFKNLF 241 (344) T ss_pred CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC----cCCCHHHHHHHHHHHHHhc-CC-CCccceE Confidence 4467899999999999999999999999999999999999998753 4689999999999999875 43 6688777 Q ss_pred cc-----CCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHh Q lcl|NC_012530. 317 IT-----AEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGL 390 (559) Q Consensus 317 l~-----~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l 390 (559) |. .++++|+|++. ++|+||+|++++++++||++|||||++||+.+.++. +++|++++.+.|++++| T Consensus 242 l~~p~g~~~gi~~~pis~~~~d~qf~e~k~~s~~eIa~af~VPp~llGi~~~~t~--------~~~n~e~~~~~f~~~~l 313 (344) T protein:vir:20 242 LYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVG--------SLGDIEKVAKVFVRNEL 313 (344) T ss_pred EecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCC--------ccccHHHHHHHHHHHHH Confidence 74 35799999995 789999999999999999999999999999876543 46789999999999999 Q ss_pred hHHHHHHHHHHHhhccccccCccceeeecchhhhhH Q lcl|NC_012530. 391 MPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQ 426 (559) Q Consensus 391 ~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~ 426 (559) .||+++|| +||..|-. ..++|++..++..|+ T Consensus 314 ~P~~~~~e-~in~~lg~----~~i~F~~~~l~~~d~ 344 (344) T protein:vir:20 314 IPLQDRIR-EINGWLGQ----EVIRFKNYSLDTDND 344 (344) T ss_pred HHHHHHHH-HHHHhcCC----cccccCccccccCCC Confidence 99999998 57777632 346777777777777 No 98 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=100.00 E-value=3e-54 Score=314.03 Aligned_cols=332 Identities=12% Similarity=0.134 Sum_probs=225.6 Q ss_pred hhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccc----ccccccccccc--cccc---ccccccCCCCCcccHH Q lcl|NC_012530. 4 FDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAY----TEPVDGNLMFS--TLED---TSIVPKPSPIAFGRIT 74 (559) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~----~~~~~~~~~~~--~~~~---~~~~~~p~~~~~~~~~ 74 (559) .+|.+..-... .+.+ ...++-.++ .+|++...... ...+ +.+. .| ...+. T Consensus 1 m~~~~~~~~~~--------------~~~~--~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~-~p----p~~~~ 59 (346) T protein:vir:10 1 MKKQLRKNLTQ--------------NDRL--QPQAQTEIFSFGDPIPVLDRADILNYLECSAMYEKWY-NP----PMSFD 59 (346) T ss_pred CCcccCCCCCc--------------cccc--ccccCeEEEecCCcceecCchhHHHHHHHhhcCCceE-ec----CCCHH Confidence 11110000000 0000 000000000 01111100000 0000 0000 00 01122 Q ss_pred HHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHH Q lcl|NC_012530. 75 DVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTS 154 (559) Q Consensus 75 ~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~ 154 (559) .+.+.+..++.+..|+.+.+ +.+..++.+ ||+++++ ++ T Consensus 60 ~la~l~~~~~~h~~~i~~k~-------------------------------------n~l~~l~~~--Pn~~~t~---~~ 97 (346) T protein:vir:10 60 GLAKSLRSSTHHESAIITKA-------------------------------------NILLSTCEV--DSRYLSR---RD 97 (346) T ss_pred HHHHHHHhhhhcchhhhhhh-------------------------------------hhHHHHHhC--CCCCCCH---HH Confidence 22222233333333333322 233344444 5556655 45 Q ss_pred HHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccC Q lcl|NC_012530. 155 FLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPR 234 (559) Q Consensus 155 f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~ 234 (559) |++ ++.|++++||+|++++|+..|++++|+||+|.+|++..+.+++ .|+....++....|+++||||++.. T Consensus 98 f~~-~~~d~ll~Gnay~~i~r~~~G~~~~L~pl~~~~v~~~~~~~~~------~~~~~~~~g~~~~~~~~dIih~r~~-- 168 (346) T protein:vir:10 98 LSS-FVKDYLVFGNAYFEVVRNRLGQVQRIESPLAKYVRKGLEAGQF------YYVPQRFDHQEHEFAKGSIYHLLEP-- 168 (346) T ss_pred HHH-HHHHHHhcCCeEEEEEEcCCCcEEEEEEecCCceEEEEcCCeE------EEEEEccCCeEEEEecccEEEecCC-- Confidence 654 5679999999999999999999999999999999987766543 2444445566778999999999742 Q ss_pred CCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccc Q lcl|NC_012530. 235 SDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRI 314 (559) Q Consensus 235 ~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~ 314 (559) +..++.||+||+..++.++.++.+++.|+.++|+||++|+|||.++. +.++++++++++++|++. .|.+|++++ T Consensus 169 -~~~~~~~G~~~~~~a~~si~l~~~a~~~~~~~~~NG~~~~~il~~~d----~~l~~e~~~~i~~~~~~~-~g~~n~~~~ 242 (346) T protein:vir:10 169 -DINQDIYGLPQYLSALQSAWLNESATLFRRKYFLNGAHAGFVFYMSD----ASQKQEDVENIRQQLKQS-KGVGNFKNL 242 (346) T ss_pred -CCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC----CCCCHHHHHHHHHHHHHh-cCccccCce Confidence 44567899999999999999999999999999999999999998743 358999999999999886 577899998 Q ss_pred ccccCC----ceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHH Q lcl|NC_012530. 315 PMITAE----DAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKG 389 (559) Q Consensus 315 ~vl~~g----~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~ 389 (559) .|+.++ +++++|++. ++|+||+|.+++++++||++|||||.+||+.+.++. +++|++++.+.|++++ T Consensus 243 ~vl~~~~~~~gi~~~pis~~~~d~qf~e~k~~~~~~I~~af~VPp~llG~~~~~~~--------~~s~~e~~~~~f~~~~ 314 (346) T protein:vir:10 243 FVHAPNGKKDGIQIIPIADVSAKDEFFNIKNVSRDDVLAAHRVPPQLMGIIPNNTG--------GFGNVADAAEVFFITE 314 (346) T ss_pred eEecCCCCccceeEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCC--------CcccHHHHHHHHHHHH Confidence 887543 689999985 689999999999999999999999999999876543 4678999999999999 Q ss_pred hhHHHHHHHHHHHhhccccccCccceeeecchhhhhH Q lcl|NC_012530. 390 LMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQ 426 (559) Q Consensus 390 l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~ 426 (559) |.||+++||+ +|.+|.. ..++|+-..+++.|+ T Consensus 315 l~P~~~~iee-~n~~L~~----e~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 315 IEPLQERLKE-FNQWLGQ----EVIKFKPSKLLQRTQ 346 (346) T ss_pred HHHHHHHHHH-HHhhccc----ceeeechhhhcccCC Confidence 9999999985 7766643 235566567888887 No 99 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=100.00 E-value=3.9e-54 Score=313.37 Aligned_cols=331 Identities=14% Similarity=0.145 Sum_probs=224.8 Q ss_pred hhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccc--c--cccccccccc--cccc-ccccccCCCCCcccHHHH Q lcl|NC_012530. 4 FDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAY--T--EPVDGNLMFS--TLED-TSIVPKPSPIAFGRITDV 76 (559) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~--~--~~~~~~~~~~--~~~~-~~~~~~p~~~~~~~~~~~ 76 (559) .+|=+.. - ..+. .....+..++-.++ + +|++...+.. ...+ .+....| ...+..+ T Consensus 1 ~~~~~~~----~-------~~~~---~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~p----p~~~~~l 62 (344) T protein:vir:56 1 MSKKKGK----T-------PQPA---AKTMTASAPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEP----PVSFTGL 62 (344) T ss_pred CCCCCCC----C-------Cchh---hHHhhcCCCceEEEEcCCceeecCcchhhhHHHhhhcCccccC----CCCHHHH Confidence 1111100 0 0000 00001111111111 1 2222111100 0000 0000111 1233444 Q ss_pred HHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHH Q lcl|NC_012530. 77 LRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFL 156 (559) Q Consensus 77 ~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~ 156 (559) .+.+..++.+.+||...++-++. ...||+++++ .+| T Consensus 63 a~~~~a~~~h~s~i~~k~n~l~~----------------------------------------~~~Pnp~~t~---~~f- 98 (344) T protein:vir:56 63 AKSLRAAVHHSSPIYVKRNILAS----------------------------------------TFIPHPWLSQ---QDF- 98 (344) T ss_pred HHHHhhhhhhCccceehhhhHHh----------------------------------------hcCCCCCCCH---HHH- Confidence 44445555555566554443331 1135666665 345 Q ss_pred HHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCC Q lcl|NC_012530. 157 RKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSD 236 (559) Q Consensus 157 ~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~ 236 (559) ++++.|++++||+|++++||..|+|++|+||+|.+|++..+.+ .|+++..++....|.++||||++. .+ T Consensus 99 ~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~v~~~~~~~--------~~~~~~~~g~~~~~~~~dIiHir~---~~ 167 (344) T protein:vir:56 99 SRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGVEED--------VYWWVPSFNEPTAFAPGSVFHLLE---PD 167 (344) T ss_pred HHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceeEEeecCC--------EEEEEecCCeEEEEcCccEEEECC---CC Confidence 5778899999999999999999999999999999999876543 245556666777899999999974 23 Q ss_pred ccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccc Q lcl|NC_012530. 237 ILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPM 316 (559) Q Consensus 237 ~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~v 316 (559) ..++.||+||+..++.++.++.+++.|+++||+||++|+|||.++. +.++++++++||++|++.. |. |+|++++ T Consensus 168 ~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~d----~~ls~e~~~~lk~~~~~~~-g~-~~~r~l~ 241 (344) T protein:vir:56 168 INQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTD----AVQDRNDIEMLRENMVKSK-GR-NNFKNLF 241 (344) T ss_pred CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC----CCCCHHHHHHHHHHHHHhc-CC-CCccceE Confidence 4567899999999999999999999999999999999999998753 4689999999999999865 43 6789888 Q ss_pred cc-----CCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHh Q lcl|NC_012530. 317 IT-----AEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGL 390 (559) Q Consensus 317 l~-----~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l 390 (559) |. ++|++++|++. ++|+||+|++++++++||++|||||++||+.+.++. +++|++++.+.|+++|| T Consensus 242 l~~p~g~~~G~~~~pis~~~~d~qf~e~k~~s~~eIa~afrVPp~llGi~~~~t~--------~~~n~eq~~~~f~~~tL 313 (344) T protein:vir:56 242 LYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVG--------SLGDIEKVAKVFVRNEL 313 (344) T ss_pred EecCCCCccceeEEEcCCChHHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCC--------ccccHHHHHHHHHHHHH Confidence 84 35799999995 789999999999999999999999999999876543 46789999999999999 Q ss_pred hHHHHHHHHHHHhhccccccCccceeeecchhhhhH Q lcl|NC_012530. 391 MPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQ 426 (559) Q Consensus 391 ~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~ 426 (559) .||+++||+ +|.+|..+. ++|+--.+...|. T Consensus 314 ~Pl~~~ie~-~n~~l~~~~----~~F~~y~l~~~~~ 344 (344) T protein:vir:56 314 IPLQDRIRE-INGWIGQEV----IRFKNYSLDTDNG 344 (344) T ss_pred HHHHHHHHH-HHhhhcccc----ccCCCccccccCC Confidence 999999985 777775422 2222112222222 No 100 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=100.00 E-value=3.6e-54 Score=313.56 Aligned_cols=274 Identities=8% Similarity=0.005 Sum_probs=216.5 Q ss_pred HHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEEC Q lcl|NC_012530. 97 VTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYD 176 (559) Q Consensus 97 ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd 176 (559) ||++|+.+++... . . .+.+..+| +..||+++ ++++|++.++.+++++||||++++|+ T Consensus 1 ia~l~~~~~~~~~-----------~--~------~~~l~~lL-~~~PN~~~---t~~~f~~~~~~~ll~~Gna~~~i~r~ 57 (278) T protein:vir:78 1 MASLPLKMYEDYK-----------V--V------NTEVSDLL-TVSPNNSL---SSFDFINQIETIRNEKGNAYVLIERD 57 (278) T ss_pred CccceeEEEecCc-----------c--c------ccHHHHHH-HhcCCCCC---CHHHHHHHHHHHHhhcCCEEEEEEEC Confidence 7877765543211 1 1 12333333 33455555 45789999999999999999999999 Q ss_pred CCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHH Q lcl|NC_012530. 177 SNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFIS 256 (559) Q Consensus 177 ~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~ 256 (559) .+|+|++||||+|++|++..+.+|.. ..|.....++....|+++||||++++. ..++.||+||+.++..++.. T Consensus 58 ~~G~~~~l~~l~~~~v~v~~~~~~~~----~~y~~~~~~g~~~~~~~~evih~~~~~---~~~~~~G~s~~~~~~~~i~~ 130 (278) T protein:vir:78 58 IYHQPSKLFLLNPDVVEMLIENQSRE----LYYSIHAATGNKLIVHNMDMLHFKHIV---ASNMVQGISPIDVLKNTTDF 130 (278) T ss_pred CCCcEEEEEEECCceeEEEEcCCCce----EEEEEEcCCceEEEEccccEEEECCCC---CCCCeeeccHHHHHHHHHHH Confidence 99999999999999999998877643 223333445556789999999998542 23567899999999999999 Q ss_pred HHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccc-cchhHH Q lcl|NC_012530. 257 HENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQ-AEDMQF 335 (559) Q Consensus 257 ~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~-~~D~qf 335 (559) +.++++++...|.+ .|+++++.+ +.+++++.+++++.|++.+. ++|++++++ ++++|+++++ +.|++| T Consensus 131 ~~~~~~~~~~~~~~--~~~~i~~~~-----~~l~~e~~~~~~~~~~~~~~---~~g~~~vl~-~g~~~~~l~~~~~d~~~ 199 (278) T protein:vir:78 131 DNAVRTFNLTEMQK--PDSFMLKYG-----SNVGKEKRQQVLEDFKQYYE---ENGGILFQE-PGVEIEPLPKKYVSEDI 199 (278) T ss_pred HHHHHHHHHHHhcC--CCcEEEEeC-----CCCCHHHHHHHHHHHHHHhc---cCCCceecC-CCceEEEccCChhHHHH Confidence 99999997665555 478888764 46889999999999998764 578888775 5599999985 799999 Q ss_pred HHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccC---c Q lcl|NC_012530. 336 QSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILG---D 412 (559) Q Consensus 336 ~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~---~ 412 (559) +|++++++++||++|||||++||+.+.+ +++|++++.+.|++.||+|+++.||++||++||++.+. . T Consensus 200 ~e~~~~~~~~Ia~~fgVpp~~lg~~~~~----------~~sn~~~~~~~~~~~~l~P~~~~i~~~ln~~L~~~~e~~~g~ 269 (278) T protein:vir:78 200 VASENLTRERVANVFQLPSVFLNARSNT----------NFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKIG 269 (278) T ss_pred HHHHHHHHHHHHHHhCCCHHHhCCCCCC----------CcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhcCCc Confidence 9999999999999999999999976543 46789999999999999999999999999999987653 3 Q ss_pred cceeeecch Q lcl|NC_012530. 413 NYMLEFVGG 421 (559) Q Consensus 413 ~~~~~f~~l 421 (559) .++|+++.+ T Consensus 270 ~~~f~~~~l 278 (278) T protein:vir:78 270 ILNLTLNLI 278 (278) T ss_pred eEEEecccC Confidence 344444556 No 101 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=100.00 E-value=4.7e-53 Score=307.47 Aligned_cols=330 Identities=11% Similarity=0.136 Sum_probs=228.3 Q ss_pred HHHHHHHHHHhhhhcccccc--c--cccccccc-cccccc---cccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHH Q lcl|NC_012530. 23 IDSKIANDTASKALNGVDRA--Y--TEPVDGNL-MFSTLE---DTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRA 94 (559) Q Consensus 23 ~~~~~~~~~~~~~~~gr~~a--~--~~~~~~~~-~~~~~~---~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia 94 (559) +.|..+.........+..++ + .+|..... ++.... .+.+. .| ...+..+.+.+..++.+.+||...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~-ep----p~~~~~la~~~~~~~~h~~~i~~k~ 75 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRTFSLSEITASPALDYVGIGFDENYNCY-LP----PVNRHALAKLPHQNAQHGGILHSRA 75 (345) T ss_pred CCccccccchhhhcCCCceEEEeecCCcccchhhcccceeeecCCccc-cC----CCCHHHHHHHhhcchhhcchhhhhh Confidence 21211111111111111111 1 12221000 110000 00011 11 2234445555566778888887766 Q ss_pred HHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEE Q lcl|NC_012530. 95 NQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENT 174 (559) Q Consensus 95 ~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~ 174 (559) +-+++ ...||+++++ .+|+ .++.|++++||+|++++ T Consensus 76 n~l~~----------------------------------------~~~Pn~~~t~---~~f~-~~v~d~ll~Gnay~~i~ 111 (345) T protein:vir:37 76 NMVSA----------------------------------------TYEGGKALSK---MEMR-ALCLNLIQFGDVGLLKV 111 (345) T ss_pred hHHhh----------------------------------------ccCCCCCCCH---HHHH-HHHHHHHhcCCeEEEEE Confidence 65541 1135666665 4564 45679999999999999 Q ss_pred ECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHH Q lcl|NC_012530. 175 YDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREF 254 (559) Q Consensus 175 rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i 254 (559) ||..|++++|+||+|.+|++..+...... ..++.+..++....|+++||||++. .+..++.||+||+..++.++ T Consensus 112 rn~~G~~~~L~pl~~~~vr~~~d~~~~~~---~~~~~~~~~g~~~~~~~~eViHir~---~~~~~~~~Gl~~~~~a~~si 185 (345) T protein:vir:37 112 RNGFGQVVRLVPLSSLYLRVHKDGGYSYL---MKKSLYDTAQEIYRYDAKDIIFIKL---YDPMQQVYGSPDYVGGIQSA 185 (345) T ss_pred ECCCCCEEEEEEecCceeEEeecCCeeEE---EeeeeeccCceEEEEccccEEEEcC---CCCCCCcccchHHHHHHHHH Confidence 99999999999999999998765443211 1222333445567899999999974 24456789999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc----CCceeeeeccc- Q lcl|NC_012530. 255 ISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT----AEDAKFVSMTQ- 329 (559) Q Consensus 255 ~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~----~g~~~~~~ls~- 329 (559) .++.+++.|+++||+||++|+|||.++. +.+++++.++++++|++.++ .+|.+.+.|+. .+|++++|++. T Consensus 186 ~l~~~a~~~~~~~f~NGa~~~~Il~~t~----~~l~~e~~~~lk~~~~~~~g-~~n~~~~~i~~~~g~~~G~~~~pl~~~ 260 (345) T protein:vir:37 186 LLNSDATVFRRRYFSNGAHMGFILYSTD----PDLTEEMEEEIARKISESKG-VGNFRSMFVNIAGGHPDGLKVIPIGDT 260 (345) T ss_pred HHHHHHHHHHHHHHhccCCcceEEEeCC----CCCCHHHHHHHHHHHHHhcC-ccccCceeEecCCCCccceeEEEccCC Confidence 9999999999999999999999998643 46899999999999999864 45655544442 34699999985 Q ss_pred cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccc Q lcl|NC_012530. 330 AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQI 409 (559) Q Consensus 330 ~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~ 409 (559) ++|+||++++++++++||++|||||.+||+.+.++. +++|++++.+.|++.||.||+++||+++|+.+ + T Consensus 261 ~~d~qf~e~k~~~~~dI~~a~~VPp~liGi~~~~t~--------~~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~~--e- 329 (345) T protein:vir:37 261 GTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTG--------GLGDPLKYREVYHYDEVMPLQEIIAETINQDP--E- 329 (345) T ss_pred hhHHHHHHHHHHhHHHHHHHhCCCHHHhccccCCCC--------CcccHHHHHHHHHHHHHHHHHHHHHHHhhhhh--c- Confidence 789999999999999999999999999999876543 46789999999999999999999999999742 1 Q ss_pred cCccceeeec--chhh Q lcl|NC_012530. 410 LGDNYMLEFV--GGDT 423 (559) Q Consensus 410 ~~~~~~~~f~--~l~~ 423 (559) ....+.++|+ .+++ T Consensus 330 ~~~~~~i~F~~~~l~k 345 (345) T protein:vir:37 330 IKNLLKIKFREQNFAK 345 (345) T ss_pred cCCcceEEECchhhcC Confidence 1234555665 4555 No 102 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=100.00 E-value=4.7e-53 Score=307.48 Aligned_cols=338 Identities=14% Similarity=0.112 Sum_probs=219.3 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccc--c--ccccccccc---cccccccccccCCCCCcccH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAY--T--EPVDGNLMF---STLEDTSIVPKPSPIAFGRI 73 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~--~--~~~~~~~~~---~~~~~~~~~~~p~~~~~~~~ 73 (559) |.=..+++..---.-.. ...........++-.++ + +|++..... .-....+-...| ...+ T Consensus 1 m~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~y~~~~~~~~~~~p----p~~~ 67 (350) T protein:vir:11 1 MSKRRSHRRQQPVTVQS---------AQEGEFIPRQGGRAEAFTFGDPMPVLDGRGILDYLECWPNGRWYEP----PLSM 67 (350) T ss_pred CCccccCCCcCccccCC---------cchhhhccccccceEEEEeCCceeecCcchhhHHHHHhhcCccccC----CCCH Confidence 21111111000000000 00000000000000000 0 111110000 000000000000 0111 Q ss_pred HHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHH Q lcl|NC_012530. 74 TDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFT 153 (559) Q Consensus 74 ~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~ 153 (559) ..+.+.+..++.+.+||.+.++-++ ....||+++++ + T Consensus 68 ~~la~~~~~~~~h~~~l~~k~n~l~----------------------------------------~~~~Pn~~~t~---~ 104 (350) T protein:vir:11 68 EGLAKSVGSSVYLQSGLKFKRNMLA----------------------------------------KTFIPHRLLSR---A 104 (350) T ss_pred HHHHHHHhhhhhhccchhhhhhhhh----------------------------------------hcccCCCCCCH---H Confidence 2222222233333333333222111 11135666665 3 Q ss_pred HHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEeccc Q lcl|NC_012530. 154 SFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNP 233 (559) Q Consensus 154 ~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~ 233 (559) +|+ +++.|++++||||++++||..|+|++|+||+|.+|++..+.+ .|+++..++....|.++||||++.. T Consensus 105 ~f~-~~v~d~ll~Gnay~~~~rn~~G~~~~L~~l~~~~vr~~~~~~--------~~~~~~~~~~~~~~~~~eVihir~~- 174 (350) T protein:vir:11 105 TFE-QFSLDWLTFGSAYLEQPRSRLGTRMPLQAPLAKYMRRGTDLE--------TFYQVRSWKDEHEFEKGSVIQLREA- 174 (350) T ss_pred HHH-HHHHHHHhcCCeEEEEEEcCCCCEEEEEEeCCceeEeeecCC--------eEEEEeeCCeEEEECcccEEEeCCC- Confidence 454 467899999999999999999999999999999999876543 2455566777788999999999742 Q ss_pred CCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccc Q lcl|NC_012530. 234 RSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYR 313 (559) Q Consensus 234 ~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~ 313 (559) +..++.||+||+..++.++.++.+++.|+++||+||++|+|||.+++ +.+++++++++++.|++. .|.+|+|+ T Consensus 175 --~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~~~gil~~~~----~~ls~e~~~~l~~~~~~~-~G~~N~~~ 247 (350) T protein:vir:11 175 --DINQEIYGVPEWFCALQSALLNESATLFRRKYYNNGSHAGFILYMTD----AAQNEEDIDALRTALKTA-KGPGNFRN 247 (350) T ss_pred --CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC----CCCCHHHHHHHHHHHHHh-cCccccCc Confidence 34457799999999999999999999999999999999999998753 468999999999999885 78899999 Q ss_pred cccccC----Cceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHH Q lcl|NC_012530. 314 IPMITA----EDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSK 388 (559) Q Consensus 314 ~~vl~~----g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~ 388 (559) +.|+.. ++++++|++. ++|+||+|.+++++++||++|||||++||+.+.++. +++|++++.+.|+++ T Consensus 248 ~~v~~~~g~~~g~~~~pl~~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~--------~~sn~e~~~~~f~~~ 319 (350) T protein:vir:11 248 LFVYAPNGKKEGIQLIPVSEVAAKDEFGSIKNISRDDQLAGLRVYPQLMGVVPQNAG--------GFGSISDAAAVWASL 319 (350) T ss_pred eeeecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCC--------CcCCHHHHHHHHHHH Confidence 877743 4689999995 789999999999999999999999999999876542 467899999999999 Q ss_pred HhhHHHHHHHHHHHhhccccccCccceeeecch Q lcl|NC_012530. 389 GLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGG 421 (559) Q Consensus 389 ~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l 421 (559) ||.||+++||+ +|.+|..+. ....+|+..++ T Consensus 320 ~L~P~~~~ie~-ln~~l~~~~-~~F~~~~~~~l 350 (350) T protein:vir:11 320 ELAPMQTRLQQ-VNEMIGEEV-VRFAQFDAPGL 350 (350) T ss_pred HHHHHHHHHHH-HHhhcCccc-cccCcccccCC Confidence 99999999985 888875432 12223333444 No 103 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=100.00 E-value=5.4e-53 Score=307.13 Aligned_cols=332 Identities=13% Similarity=0.155 Sum_probs=225.6 Q ss_pred HHHHHHHHHHhhhhccc--cccc--cccccccc-cccccc---cccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHH Q lcl|NC_012530. 23 IDSKIANDTASKALNGV--DRAY--TEPVDGNL-MFSTLE---DTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRA 94 (559) Q Consensus 23 ~~~~~~~~~~~~~~~gr--~~a~--~~~~~~~~-~~~~~~---~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia 94 (559) +.+............+. -.++ .+|..... ++...+ .+.+.+ | ...+..+.+.+..++.+.+||.+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~y~~~~~~~~~~~~e-p----p~~~~~la~l~~~~~~h~~~i~~k~ 75 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRTFSLNEISASPALDYVGIGFDENYNCYL-P----PVNRHALAKLPHQNAQHGGILHSRA 75 (345) T ss_pred CCCCccccchhhcccCcceeEEeecCCcccccchhhhhhhhcCCccccC-C----CCCHHHHHHHhhcccccccceeeec Confidence 11111000000000110 0111 12221100 010000 000101 1 1123334444445566666664444 Q ss_pred HHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEE Q lcl|NC_012530. 95 NQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENT 174 (559) Q Consensus 95 ~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~ 174 (559) +-++ .+ . .||+++++ .+|+ .++.|++++||+|++++ T Consensus 76 n~l~-------------------------------------~~-~--~Pn~~lt~---~~f~-~~~~d~ll~Gnay~~~~ 111 (345) T protein:vir:37 76 NMVS-------------------------------------SL-Y--EGGKALSR---MDMR-ALCLNLIQFGDVGLLKV 111 (345) T ss_pred hHHH-------------------------------------hh-c--cCCCCCCH---HHHH-HHHHHHHhcCCeEEEEE Confidence 3332 11 1 25555665 4564 45679999999999999 Q ss_pred ECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHH Q lcl|NC_012530. 175 YDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREF 254 (559) Q Consensus 175 rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i 254 (559) ||..|+|++|+||+|.+|++..+.+... ...++....++....|+++||||++. .+...+.||+||+..++.++ T Consensus 112 rn~~G~~~~L~pl~~~~vr~~~d~~~~~---~~~~~~~~~~g~~~~~~~~dVihir~---~~~~~~~~Gls~~~~a~~si 185 (345) T protein:vir:37 112 RNGFGQVVRLVPLSSLYLRVRKDGGYSY---LMKKSLYDTAQEIYRYDAKDIIFIKL---YDPMQQVYGSPDYVGGIQSA 185 (345) T ss_pred EcCCCcEEEEEEEcCceeEEEEeCCeeE---EEEEeEecCCceEEEEccccEEEecC---CCCCCCcccccHHHHHHHHH Confidence 9999999999999999999877654322 12233334455667899999999974 23445779999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc----CCceeeeeccc- Q lcl|NC_012530. 255 ISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT----AEDAKFVSMTQ- 329 (559) Q Consensus 255 ~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~----~g~~~~~~ls~- 329 (559) .++.+++.|+++||+||++|+|||.++. +.+++++++++|++|++ ..|.+|.+++.|+. .++++++|++. T Consensus 186 ~l~~~a~~~~~~~f~NG~~p~~Il~~~d----~~l~~e~~~~lk~~~~~-~~g~~n~~~~~i~~p~g~~~G~~~~pls~~ 260 (345) T protein:vir:37 186 LLNSDATVFRRRYFSNGAHMGFILYSTD----PDLTEEMEEEIARKISE-SKGVGNFRSMFVNIANGHPDGLKVIPIGDT 260 (345) T ss_pred HHHHHHHHHHHHHHhccCCcceEEEecC----CCCCHHHHHHHHHHHHH-hcCcccccceEEEcCCCcccceEEEEccCC Confidence 9999999999999999999999998742 46899999999999987 47888888877664 35799999995 Q ss_pred cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccc Q lcl|NC_012530. 330 AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQI 409 (559) Q Consensus 330 ~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~ 409 (559) ++|+||+|++++++++||++|||||++||+.+.++. +++|++++.+.|+++||.||+++||+++|+.+ +. T Consensus 261 ~~d~qf~e~k~~~~~dIa~a~~VPp~llGi~~~~~~--------~~~~~e~~~~~f~~~~l~P~~~~ie~~ln~~~--~~ 330 (345) T protein:vir:37 261 GTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTG--------GLGDPLKYREVYHYDEVMPLQEIIAETINQDP--EI 330 (345) T ss_pred hhHHHHHHHHHHhHHHHHHHhCCCHHHhCccCCCCC--------CcccHHHHHHHHHHHHHHHHHHHHHHHhhhhc--cC Confidence 789999999999999999999999999999776542 46789999999999999999999999999743 11 Q ss_pred cCccceeeecchhhhh Q lcl|NC_012530. 410 LGDNYMLEFVGGDTRS 425 (559) Q Consensus 410 ~~~~~~~~f~~l~~~d 425 (559) .....++|+.-+..- T Consensus 331 -~~~~~i~F~~~~L~~ 345 (345) T protein:vir:37 331 -KNLLKIKFREQNFAK 345 (345) T ss_pred -CCcceEEecchhhcC Confidence 234556665322111 No 104 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=100.00 E-value=2.3e-45 Score=265.35 Aligned_cols=214 Identities=12% Similarity=0.197 Sum_probs=162.9 Q ss_pred EEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_012530. 192 IYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHG 271 (559) Q Consensus 192 V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng 271 (559) |++.. +|...+. .....+..++....+.++||+|++.. .+.++.||+|||.+++.+|..+.++++|+.+||+|| T Consensus 1 ~r~~~--dg~~~y~-~~~~~~~~~g~~~~~~~~eilH~r~~---~~~~~~~Glspi~~a~~~i~~~~aa~~~~~~~f~Ng 74 (219) T protein:vir:98 1 MRVCK--DGNYKYL-MKKSLYDTKSEIYEYNKNDVIFIKLY---DPMQQVYGSPDYVGGITSALLNSDATIFRRRYYSNG 74 (219) T ss_pred Cceee--cCeEEEE-EecceecCCceeEEeccccEEEecCC---CCCCCcceecHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 33222 2211100 00011122345668999999999742 234577899999999999999999999999999999 Q ss_pred CCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc----CCceeeeeccc-cchhHHHHHHHHHHHHH Q lcl|NC_012530. 272 GTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT----AEDAKFVSMTQ-AEDMQFQSWLNYLINII 346 (559) Q Consensus 272 ~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~----~g~~~~~~ls~-~~D~qf~e~~~~~~~~I 346 (559) ++|+|||.++. +.++++++++++++|++. .|.+|+++++|+. ++|++|++++. ++|+||+|++++++++| T Consensus 75 ~~p~gil~~~~----~~l~~e~~~~~~~~~~~~-~g~~n~~~~~l~~~gg~~~G~~~~~~~~~~~d~qfle~rk~~~~eI 149 (219) T protein:vir:98 75 AHMGFILYSTD----PDMTEEMEDEIAERIRDS-KGVGNFRSMFVNIAGGHPDGLKVIPIGDTGQKDEFANIKNISAQDV 149 (219) T ss_pred CCCceEEEeCC----CCCCHHHHHHHHHHHHHh-cCcccccceeEecCCCCccceeEEEccCCHHHHHHHHHHHhhHHHH Confidence 99999998753 368999999999999885 6777776654442 35799999985 78999999999999999 Q ss_pred HHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhH Q lcl|NC_012530. 347 CALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQ 426 (559) Q Consensus 347 a~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~ 426 (559) |++|||||++||+.+.++. +++|++++...|+++||+||+++||++||++++.+. ..+++|+.....|. T Consensus 150 a~~fgVPp~~lG~~~~~~~--------~~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~~~~~~---~~~~~F~~~~~~d~ 218 (219) T protein:vir:98 150 LTSHRFPPGLSGIIPVNTA--------GLGDPLKIREAYQADEVLPLQEIIAESINSDYEIKS---ALKVNFKQPEKRDK 218 (219) T ss_pred HHHhCCCHHHcccccCCCC--------CccCHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCC---ccEEeecCcccccC Confidence 9999999999998775443 468899999999999999999999999998755432 24566654443333 Q ss_pred H Q lcl|NC_012530. 427 Q 427 (559) Q Consensus 427 ~ 427 (559) . T Consensus 219 ~ 219 (219) T protein:vir:98 219 N 219 (219) T ss_pred C Confidence 3 No 105 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=100.00 E-value=1.8e-44 Score=260.47 Aligned_cols=248 Identities=17% Similarity=0.107 Sum_probs=175.9 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+||.++-..... ...... ..+. . ..++..........-+.| T Consensus 1 MglF~~~~~r~~~-----------------------------~~~~~~--~~~~-~------~~~~~~~~~~~~v~~~~a 42 (251) T protein:vir:46 1 MGIFYKNEKRDLQ-----------------------------YNEDDL--QMMV-Q------TLPSFQGTKLRQYKDIEA 42 (251) T ss_pred CCccccccccccC-----------------------------CCccch--hhhh-h------hhccccCcCcceechhhh Confidence 9998776110000 000000 0000 0 000101111111233567 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +.+|+|++||++||++||++|+.+++... .. ..+.+..+ ++..||++++ +++|++.++ T Consensus 43 l~~~~v~~~i~~ia~~iA~lp~~~~~~~~------------~~------~~~~~~~l-l~~~Pn~~~t---~~~f~~~l~ 100 (251) T protein:vir:46 43 IRHSDIFTAVMMIASDLARMPIRVTVNGQ------------IN------YSDRIVNL-LNTRPNPMYN---GYIFKLVVF 100 (251) T ss_pred hccHHHHHHHHHHHHhHhhCceEEeeCcc------------cc------ccchHHHH-HhccCCCCCC---HHHHHHHHH Confidence 88999999999999999999977643211 00 11223333 3344555554 478999999 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEe---cCceeeeecccceEEEecccCCCc Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYI---DNKVRGSFTADEMGMFIRNPRSDI 237 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~---~~~~~~~~~~~evi~~~~n~~~~~ 237 (559) .+++++||||++++||.+|+|++|+||+|++|++..+.+|...+ +++.. .++....++++||||++..+ T Consensus 101 ~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~----~~~~~~~~~~g~~~~~~~~diiH~r~~~---- 172 (251) T protein:vir:46 101 VSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYY----FHQRIDSNGNNIERNVKFEDMLDIKFYS---- 172 (251) T ss_pred HHHhhcCCeEEEEEECCCCcEEEEEEECCceEEEEECCCCcEEE----EEEEeccCCcceeEEECCccEEEecCcC---- Confidence 99999999999999999999999999999999999988775421 22222 23445789999999998643 Q ss_pred cCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc Q lcl|NC_012530. 238 LSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI 317 (559) Q Consensus 238 ~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl 317 (559) .++.+|+||+.+++.+|.++.++++|+.++|+||++|+|||++++.. .++++++++++.|++.++|.+|+|++++. T Consensus 173 ~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l----~~~e~~~~~~~~~~~~~~g~~n~g~~~~g 248 (251) T protein:vir:46 173 LDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVL----DNKKARDRAREEFPKVLVELNKLGKLSYS 248 (251) T ss_pred CCCeeecCHHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeCCCC----CCHHHHHHHHHHHHHHhcCcccccccccc Confidence 24678999999999999999999999999999999999999997432 35778899999999999999999998874 Q ss_pred cCC Q lcl|NC_012530. 318 TAE 320 (559) Q Consensus 318 ~~g 320 (559) .++ T Consensus 249 m~~ 251 (251) T protein:vir:46 249 MNQ 251 (251) T ss_pred cCC Confidence 333 No 106 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.92 E-value=1.8e-24 Score=150.75 Aligned_cols=411 Identities=12% Similarity=0.138 Sum_probs=223.2 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |++.|-|. +.+.... ..|.++. +..+ . ....+..++...| T Consensus 1 ~~~~D~~~-----------------~~~~~~g----~~~~~~~---------~~~~---------~-~~~~~~~~l~a~Y 40 (437) T protein:vir:52 1 MKFFDGIK-----------------SLALKLG----SKQEQTY---------YSPS---------L-SLTDDLVQLEALW 40 (437) T ss_pred CchhhhhH-----------------hHHhcCC----Cccccce---------eecC---------c-cccccHHHHHHHH Confidence 55555440 0000000 0011000 0000 0 0112455677788 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) ..+.+++++|+++|+...+- +++|...+. ..+..+.+...+.+.+ +++-+...+ T Consensus 41 ~~~~l~~~~vd~~a~d~~r~-----------~~~i~~~d~------~~~~~~~~~~~~~~l~---------~~~~l~~a~ 94 (437) T protein:vir:52 41 RDNWIANKVCIKRPEDMVRN-----------WREIYSNDL------NSKQLDLFTKFERSLK---------LRETLTKAL 94 (437) T ss_pred HhCchhhHHhhcchHHhhcC-----------CceEecCCC------CHHHHHHHHHHHHhhc---------HHHHHHHHH Confidence 89999999999999876532 344443221 1223344555555542 233444445 Q ss_pred HHHHHcCCcceEEEECC---------CCcEEEEEEecCceEEEEecCccc---ccccceEEEEEecCceeeeecccceEE Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDS---------NGRLSHTRMVDPTTIYFANDEHGH---RRTRGKIYRQYIDNKVRGSFTADEMGM 228 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~---------~G~~~~L~~l~p~~V~~~~~~~g~---~~~~~~~y~~~~~~~~~~~~~~~evi~ 228 (559) ...-++|.|+++++.+. .|.+..|.++++..|++....+.. ..+.-+.++++..+.....+.++.||| T Consensus 95 ~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~~~~~~~dp~s~~fg~p~~y~v~~~~~~~~iH~SRii~ 174 (437) T protein:vir:52 95 QWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPKWKISPTGTKDDDVLSPNFGRYSEYSILGGSQSITVHHSRLII 174 (437) T ss_pred HhcccccceEEEEEecCCCcccccccCCceeEEEEechhhccccccccccccccccCcceEEEEecCCcceeEccceeEE Confidence 55567999999999875 378999999999998753322111 112334556665555555788999999 Q ss_pred EecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCcc-CCccCCHHHHHHHHHHHHHHhcC Q lcl|NC_012530. 229 FIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSP-SVTNTSMRALEDFKRHWTATSSG 307 (559) Q Consensus 229 ~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~-~~~~~~~e~~~~l~~~~~~~~~G 307 (559) |...+.+......+|.|.++.+.+.|.....+.......+.+...+ ++++++.. .-+.-.++...+..+.++. + T Consensus 175 ~~~~~~~~~~~~~~G~s~le~~~~~i~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~l~~~~~~~~~~~~~~~~~---~ 249 (437) T protein:vir:52 175 LNANDAPLSDNDIWGVSDLEKIIDVLKRFDSASVNVGDLIFESKID--IFKIAGLSDKIAAGMENEVASVISAVQE---I 249 (437) T ss_pred ecCccCCCccccccCCchHHHHHHHHHHHHHHHHHHHHHHHHcCCC--ceecchHHHHhcCCcHHHHHHHHHHHHH---h Confidence 9765555555667799999999999999999888888877665443 35554310 0011113333333333332 2 Q ss_pred cccccccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHH Q lcl|NC_012530. 308 INGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASK 386 (559) Q Consensus 308 ~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~ 386 (559) .+.+++.++..+ -+|..++.+ .+ +.+...+....||++++||..+|.-...++.++. .....|.-...+... T Consensus 250 -~~~~~~~~~d~~-~~~e~~~~~~sg--l~~~l~~~~~~iaaa~~iP~t~L~G~s~~Glasg---e~D~~~yyd~i~~~Q 322 (437) T protein:vir:52 250 -KSATNSLLLDAE-NEYDRKELTFTG--LKDLLTEFRNAVAGAADMPVTILFGQSVSGLASG---DEDIQNYHEAIRRLQ 322 (437) T ss_pred -cCCCceEEEcCC-cceEEEecCcCC--HHHHHHHHHHHHHHHhcCchhhhcCcCccccccc---HHHHHHHHHHHHHHH Confidence 234566677554 566666532 33 3456677888999999999988843333333211 111222222333333 Q ss_pred HHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHH-------HHHHHc-CCCCHHHHHHHhC----CCC Q lcl|NC_012530. 387 SKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKS-------VQLELQ-TATTVNDYREKQG----LPK 454 (559) Q Consensus 387 ~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~-------~~~~~~-~~~T~NE~R~~~g----l~p 454 (559) ..-|+|+++++-..|-+..+... ...+.|+|+.+...+.++++++ +..++. |+++++|+|+++. ++. T Consensus 323 e~~l~p~le~l~~~i~~~~~g~~-~~~~~~~f~pL~~~s~kekae~~~~~a~a~~~~~~~g~i~~~e~r~~L~~~g~~~~ 401 (437) T protein:vir:52 323 ETRLRPIFEIIDPLICNELFGGL-PADWWFEFVPLTTVKQEQQINMLNTFATAANTLIQNGVLNEYQIANELRESGLFAN 401 (437) T ss_pred HHHHHHHHHHHHHHHHHHhcCCC-CCcceEEeCCcCCcCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCC Confidence 35688888888777766655432 3469999999988887777665 444454 5689999999873 233 Q ss_pred CCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccc Q lcl|NC_012530. 455 IAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 455 i~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (559) ++..|.. ... ....+......++.. +..++..+.++ T Consensus 402 i~~~~~~-------------~~~---------~~~~~~~~~~~~~~~-~~~~~~~~~~~ 437 (437) T protein:vir:52 402 ISAEHIE-------------ELK---------NADEFAGNFEEPEKM-EGAQVQNSEDQ 437 (437) T ss_pred CCccccc-------------ccc---------CCCCCCCccCCCCCC-CCCCCCCCCCC Confidence 3322210 000 000000000000000 00011111111 No 107 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.90 E-value=8.9e-23 Score=141.46 Aligned_cols=460 Identities=13% Similarity=0.142 Sum_probs=220.5 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccc--------cccccccc-----cccccccccccCC- Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTE--------PVDGNLMF-----STLEDTSIVPKPS- 66 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~--------~~~~~~~~-----~~~~~~~~~~~p~- 66 (559) |++|.+ .++-.....++ +.......-++++-.. +....... +.+...+. .++. T Consensus 25 ~~~~~~------~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~-~~~~~ 91 (537) T protein:vir:10 25 VGIFGA------GDDEKPFTRAQ------LVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTFSAYANP-NLSEG 91 (537) T ss_pred cCCCcc------cchhhHHHHHH------hhhhccCCCCCccCcccccccccccchhccccccchhhhhhhccc-cccch Confidence 444432 11111111111 1011111111211100 00000000 00000000 0000 Q ss_pred ---C--CCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhc Q lcl|NC_012530. 67 ---P--IAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERM 141 (559) Q Consensus 67 ---~--~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~ 141 (559) + ...-.-.+++..|..++.++++|+++|+...+- ++++...+.++..++..+ .++..+.+. T Consensus 92 ~~~~~~~~~~~~~~l~a~Y~~~~l~r~iVd~~A~d~~r~-----------~~~i~~~~~~~~~~~~~~---~l~~~~~~l 157 (537) T protein:vir:10 92 LVLWYAQQAFIGHQMCALIATHWLVNKACSQMPRDAMRK-----------GYKIISDDGNELDPKDAK---FIDRYDRAF 157 (537) T ss_pred hhhhccccCCccHHHHHHHHhCchhhhhhhhhhHHhhcC-----------CceeecCCcccccHHHHH---HHHHHHHHh Confidence 0 011122457777889999999999999987532 345544444433333332 334444333 Q ss_pred CCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEE---C-------------CCCcEEEEEEecCceEEEEecC----c-c Q lcl|NC_012530. 142 GVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTY---D-------------SNGRLSHTRMVDPTTIYFANDE----H-G 200 (559) Q Consensus 142 ~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~r---d-------------~~G~~~~L~~l~p~~V~~~~~~----~-g 200 (559) +. +.-|.+. +...-++|.+++++.- | ..|.+..|.+|+|..|.+.... + - T Consensus 158 ~~--------~~~l~~a-~~~~rlyG~~~i~i~v~~~D~~~~~~Pl~~~~i~kg~~k~l~vidp~~~~~~~~~~~~~dp~ 228 (537) T protein:vir:10 158 NI--------KKHAIQF-VRKGRIFGIRIALFKVDSPDPYYYEKPFNIDGVMPGAYKGIVQIDPYWCAPLLDAQASSNPV 228 (537) T ss_pred hH--------HHHHHHH-HHhcccccceEEEEeecCcCCcccccccccccccccceeEEEEechhhcccccchhhhccCC Confidence 21 1224444 4444568998887753 2 1234567888888877753211 1 0 Q ss_pred cccccceEEEEEecCceeeeecccceEEEecccCCCcc---CCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceE Q lcl|NC_012530. 201 HRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDIL---SGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGI 277 (559) Q Consensus 201 ~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~---~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi 277 (559) ...+.-+.++++ .+ ..+.++.|||+..++.++.. .+++|+|.++.+...|.....+.......+...... + T Consensus 229 sp~fg~P~~y~v-~g---~~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~--v 302 (537) T protein:vir:10 229 SMHFYEPTYWLI-NG---KKYHRSHLAIYINDEVVDFLKPSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTKRQT--V 302 (537) T ss_pred ccccCCceeeee-cC---eEecceeEEEecCCCCchhhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCc--e Confidence 111122334433 22 35678899998766554432 235699999999999988888877777777665443 4 Q ss_pred EEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhCCCHHH Q lcl|NC_012530. 278 LLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVAMDPAE 356 (559) Q Consensus 278 l~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fgVPp~~ 356 (559) ++++.... -.++++ +.+.++...++.+|.+- .++..++.+|..++.+ .+ +-+......+.||.+.|||..+ T Consensus 303 ~k~~~~~~--l~~~~~---~~~r~~~~~~~r~n~g~-~~id~e~e~~e~~~~~lsg--l~~~l~~~~~~iAa~~~IP~t~ 374 (537) T protein:vir:10 303 LKVDAAQV--LANKQQ---FDETMSWWTATRDNYQV-RVVDKDNEDVVQIDTTLND--LDKVIMNQYQLVCAIARTPAPK 374 (537) T ss_pred eeechHHh--hcCHHH---HHHHHHHHHhhcCCcce-eEecCCCceeEEEeccCCC--HHHHHHHHHHHHHhhhCCCcee Confidence 45543211 122333 33333333333444443 4554544667766532 22 3466777888899999999996 Q ss_pred h-ccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHH--- Q lcl|NC_012530. 357 I-GMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKS--- 432 (559) Q Consensus 357 l-g~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~--- 432 (559) | |....+..++ +.....|.-...... +..|.|.+.++.+.|.+..+.. ...+.|+|+.|...|.++++++ T Consensus 375 L~G~sp~Glnat---Ge~D~~~yyd~I~~~-Qe~l~p~l~~l~~ll~~~~~~~--~~~~~i~f~pL~~~s~kEkAei~~~ 448 (537) T protein:vir:10 375 MLGTVPTGFNST---GDYEEASYHEECEST-QDDMRPLIDRHHQLVCRSHLRK--RIRVKVEFPPMDAPKESERADTFLK 448 (537) T ss_pred eccCCccccccc---hhHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhcCCC--CcceEEEeCCCCCCCHHHHHHHHHH Confidence 5 4322222111 111222222223322 3358999999988887766653 3468899999998888888775 Q ss_pred ----HHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCc Q lcl|NC_012530. 433 ----VQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPP 507 (559) Q Consensus 433 ----~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (559) ++.++. |+|++||+|+.++..|..|-+-+... +.. . +.+..... ........ .+..+ T Consensus 449 ~a~a~~~~~~~G~i~~~Evr~~L~~~~~~g~~~l~~~-----~~~----e--d~e~~~~~-~~~~~~~~------~~~~~ 510 (537) T protein:vir:10 449 KMQAAKLAFEMGAVDGVDVNEYLRMDPTLGFTSITPA-----MRP----T--DAEDIDVD-DEGKPVRI------IEDQP 510 (537) T ss_pred HHHHHHHHHHcCCCCHHHHHHHHhccCccccccccCC-----CCh----h--hhhcccCC-ccCCcCCC------CCCCC Confidence 455554 56899999999999876543322100 000 0 00000000 00000000 00011 Q ss_pred cccccchhccccccccccccccccccccccccccccccc Q lcl|NC_012530. 508 SSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQLKNKK 546 (559) Q Consensus 508 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~k~~~ 546 (559) ...+..+.+..+..-.+...+|. |-.+ T Consensus 511 ~~~~~~~~~~~~~~~~~~~~~~a------------~~~~ 537 (537) T protein:vir:10 511 APSEMFGATSSGESANDPRDSGA------------AFED 537 (537) T ss_pred CccccCCCCccccccCCCccCcc------------ccCC Confidence 11111111111111111111111 1111 No 108 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.89 E-value=2.9e-21 Score=133.19 Aligned_cols=473 Identities=12% Similarity=0.074 Sum_probs=229.6 Q ss_pred cccCCcch-------HHHH---HHHHHHHHHHhhhhccccccc----cccccc-ccc--ccc--cccccccccC--C--- Q lcl|NC_012530. 11 FYTDDPNA-------FFKH---IDSKIANDTASKALNGVDRAY----TEPVDG-NLM--FST--LEDTSIVPKP--S--- 66 (559) Q Consensus 11 ~~~~~~~~-------~~~~---~~~~~~~~~~~~~~~gr~~a~----~~~~~~-~~~--~~~--~~~~~~~~~p--~--- 66 (559) ..|-++-. -+-+ ++.+++....-. .-|+-+. ..|... ... +.. +...+...+. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~~ 78 (532) T protein:vir:94 1 MADTDPTPRPEITYATLQQAQRVDAKRATHTSLG--LATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRNALSFVE 78 (532) T ss_pred CCCCCCCCCcceehhhhhhHhhhhhhhhhhhhhh--hhhhhhhcccccccccccccccccccccccCccccccccccccc Confidence 33333321 1111 122222211111 1111000 011111 000 111 1111110000 0 Q ss_pred CCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCC Q lcl|NC_012530. 67 PIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYS 146 (559) Q Consensus 67 ~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~ 146 (559) +... .-.+++..|..+++++++|+++|+....- ++++......+..+... ..++..+.+.+ T Consensus 79 ~~~~-~~~~l~a~Y~~~~l~r~~Vd~~aed~~r~-----------~~~i~~~~~~~~~~~~~---~~i~~~~~~l~---- 139 (532) T protein:vir:94 79 ATSW-PGFPTLALLAQLPEYRTMHETPADECVRA-----------WGKITCSSKDELAADKA---TRITQKLEQYN---- 139 (532) T ss_pred cccc-chHHHHHHHHcCchhhhhhccchHHHhhC-----------CceEeeCCccccchHHH---HHHHHHHHhhh---- Confidence 1111 23366778889999999999999976532 34454443333333322 23333333321 Q ss_pred CChhhHHHHHHHHHHHHHHcCCcceEEEECC-------------------CCcEEEEEEecCceEEEEecCcc---cccc Q lcl|NC_012530. 147 PIRDDFTSFLRKLVRDTYTYDQVNYENTYDS-------------------NGRLSHTRMVDPTTIYFANDEHG---HRRT 204 (559) Q Consensus 147 ~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~-------------------~G~~~~L~~l~p~~V~~~~~~~g---~~~~ 204 (559) .++.+...+....++|.+++++.-.. .|.+.+|.+|+|..|.+...... ...+ T Consensus 140 -----v~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p~~l~~~~I~~g~~~~l~vld~~~v~p~~~~~~dp~sp~f 214 (532) T protein:vir:94 140 -----VRTLVRTVVIHDQAYGGAHVFPHLKMDGDSVPADAPLLLSPSFVQRGCLIGFATIEPMWLSPNAYNATDPTLPSF 214 (532) T ss_pred -----HHHHHHHHHHhhhcccceEEEEEeccCCccccccccccccccccccceeeEEEeechheeccccccccccccccc Confidence 23344444555567999988765321 23457899999998876432111 1112 Q ss_pred cceEEEEEecCceeeeecccceEEEecccCCCcc---CCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEec Q lcl|NC_012530. 205 RGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDIL---SGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVK 281 (559) Q Consensus 205 ~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~---~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~ 281 (559) .-+.+++...+ ..+.++.|||+..++.++.. ...+|.|.++.+...|.....+.............. ++++. T Consensus 215 g~P~~y~v~~g---~~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~--v~k~~ 289 (532) T protein:vir:94 215 YKPDSWIATSG---KKIHSSRIHTVVGRPVGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQFSMT--NLATD 289 (532) T ss_pred CCceeEEEccC---eeeccceEEEecCCCchhhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCc--eeeec Confidence 22334443322 35778899999877665533 234699999999999999888877777655543322 33443 Q ss_pred CccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhCCCHHHh-cc Q lcl|NC_012530. 282 PSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVAMDPAEI-GM 359 (559) Q Consensus 282 ~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fgVPp~~l-g~ 359 (559) . ...++.+..+.+.+.++....+.+|.+ +.++..+..+|+.++.+ .+ +.+........||.+.+||..+| |. T Consensus 290 ~---a~~ls~~~~~~~~~r~~~~~~~~~n~g-~~~id~~~e~~e~~~~~lsg--l~~~l~~~~~~iAaa~~IP~t~LfG~ 363 (532) T protein:vir:94 290 M---AQLLAPGGAQSLDARLQLFNLYRDNRN-IGALDKGTEEIQQTNTPLSG--LDSLQAQSQEQMAAVSHIPLVKLLGI 363 (532) T ss_pred h---HHhhcchhHHHHHHHHHHHHhhcCCcc-ceEEcCCCceeEEEecccCC--HHHHHHHHHHHHHhHhCCCeeeeecC Confidence 1 223444556777777765544444433 34554444566666532 22 35566777889999999999865 53 Q ss_pred ccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHH------ Q lcl|NC_012530. 360 QNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSV------ 433 (559) Q Consensus 360 ~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~------ 433 (559) . .++.++ ++.....|.-...+.....-|.|++..+-+.|.+..+... ...+.|+|+.|...+.++++++. T Consensus 364 s-p~Glns--tGe~D~~~yyd~I~s~Qe~~l~p~le~l~~~l~~s~~g~~-~~d~~~~f~pL~~~s~kEkAei~~~~a~a 439 (532) T protein:vir:94 364 T-PNGLNA--SSDGEIRVWYDFIAGYQATNLTPLMEWIIDLIQLSEYGQI-DPGLAWEWSPLMELDDKELAEVRQLNAST 439 (532) T ss_pred C-cccccc--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-CCCceEEeCCCCCCCHHHHHHHHHHHHHH Confidence 2 222221 1112233333444444445688999998888876655432 34699999999888888777654 Q ss_pred -HHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccc Q lcl|NC_012530. 434 -QLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSN 511 (559) Q Consensus 434 -~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (559) ..++. |.+++||+|++++..|..+.+....... .+ +........+... ..++.++.++......+ T Consensus 440 ~~~~~~~Gvi~~~Evr~~l~~~~~~~~~~~~~~~~--~~---------~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~ 506 (532) T protein:vir:94 440 DSTLMELGVIDAKMVQQRLAADPTSGYAGALGERD--EL---------DDVEEIAKQLMAA--ALNPPATAPQTPNPQPD 506 (532) T ss_pred HHHHHhcCCCCHHHHHHHHhcCCcccccccccccc--cc---------ccccchhhhhccc--ccCCCCCCCCCCCCCCC Confidence 44454 4589999999999999877543211100 00 0000000000000 00000000000000000 Q ss_pred cchhccccccccccccccccccccccccccccccchhhhhhccCC Q lcl|NC_012530. 512 SFQQNQEGYTGKDAKPSGKDNQQGVGKDGQLKNKKNTNSYKQGGS 556 (559) Q Consensus 512 ~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~k~~~~~~~~~~~~~ 556 (559) ...+ ++ +++....+|+.+ +++++- .+ T Consensus 507 ~~~d-----------~~--~~~~~~~~~~~~-~~~~~~-----~~ 532 (532) T protein:vir:94 507 SEDD-----------QT--DNQPDAQADPAQ-NDQPVG-----NR 532 (532) T ss_pred CCCC-----------CC--CCccCCCccccc-cCCCcC-----CC Confidence 0000 00 001111111110 111110 00 No 109 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.82 E-value=5.4e-19 Score=120.74 Aligned_cols=491 Identities=9% Similarity=0.044 Sum_probs=216.6 Q ss_pred Ccchhhhc--cccccCCcchHH------------------HHHHHHHHHHHHhhhhcccccccccccccccccccccccc Q lcl|NC_012530. 1 MGIFDRFR--TKFYTDDPNAFF------------------KHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTS 60 (559) Q Consensus 1 ~~~~~~~~--~~~~~~~~~~~~------------------~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~ 60 (559) .+-+-|.+ -+|.-+.+..-+ +++-.+++...+.+...+.-.....-.+.+.....+.... T Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~~~ 115 (862) T protein:vir:99 36 LDPLARTRQNWPVQKEKPNPIIRSVKDFPFVEISDSVNAKSVSGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIGAEGK 115 (862) T ss_pred cchHHhhcccCCcccccCCCCCCcccccccccccccccchhhhhhhhcchhhcchhhhhhhhhhhhcchhhhhhcccccc Confidence 22222222 123333332111 1111222222222111110000000000111111111100 Q ss_pred ccccC------CC---CCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecc-cccccChhHHHH Q lcl|NC_012530. 61 IVPKP------SP---IAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLK-NGDKPTKEQQKK 130 (559) Q Consensus 61 ~~~~p------~~---~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~-d~~~~~~~~~~~ 130 (559) ....+ .+ ..... .+++..|..+++++++|+++|+...+- +++|... +..+..++. T Consensus 116 ~s~y~~~~~~~~~~~~~~f~g-yql~alY~~~~larkiVd~pAeDatR~-----------g~~I~~~~d~~e~~~e~--- 180 (862) T protein:vir:99 116 QSSYAVPEALQDWYLSQGFIG-HQACALIAQHWLVDKACSLAGEDAIRN-----------GWHLKSLGEGEEIDEES--- 180 (862) T ss_pred ccccccchhccccccccCccc-HHHHHHHHhCchhhhhhhhhhHHHhhC-----------CceEeecCcccccCHHH--- Confidence 00000 00 01111 256778889999999999999987642 3344432 222333333 Q ss_pred HHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEE---CC-------------CCcEEEEEEecCceEEE Q lcl|NC_012530. 131 IDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTY---DS-------------NGRLSHTRMVDPTTIYF 194 (559) Q Consensus 131 ~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~r---d~-------------~G~~~~L~~l~p~~V~~ 194 (559) ...++..+.+.+. ++-+...+...-++|.+++++.- |. .|.+.+|.+|+|..+.+ T Consensus 181 ~~~ie~~~~rL~v---------~~~l~eair~~RLyGga~ililv~~~D~~~LsqPLn~e~I~kG~lkgl~vlDp~w~~p 251 (862) T protein:vir:99 181 LEKFKAIDVEFKV---------KENLIEFNRFKNVFGIRVAIFVVDSEDPDYYEKPFNPDGITPGSYRGISQIDPYWMMP 251 (862) T ss_pred HHHHHHHHHHhhH---------HHHHHHHHHhcccccceEEEEEecCcCchhhhcCcCcccccccceeEEEEechhhhcc Confidence 3345555544321 22233334444467877776542 21 24567888888887765 Q ss_pred Ee----cCc-ccccccceEEEEEecCceeeeecccceEEEecccCCCc---cCCcccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012530. 195 AN----DEH-GHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDI---LSGGYGLSELEMGLREFISHENTELFNDR 266 (559) Q Consensus 195 ~~----~~~-g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~---~~~~~G~Spl~~~~~~i~~~~~~~~~~~~ 266 (559) .. ..+ ....+.-+.++++ .+ ..+.++-||++.-.+.++. ...++|+|.++.+...|.....+...... T Consensus 252 ~~v~~~~~Dp~sp~yGkP~~y~I-~g---~~IH~SRliif~g~~vpd~lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~ 327 (862) T protein:vir:99 252 MLTAESTADPSSQFFYEPEFWII-SG---QKYHRSHLIIARGPQPADILKPTYIFGGIPLVQRIYERVYAAERTANEAPL 327 (862) T ss_pred cccccccccccccccCCceeeee-cC---eeeccceeEEecCCCchhhhhccCCccCccHHHHHHHHHHHHHHHHHHHHH Confidence 32 111 1111122333332 22 2456777777765444332 22346999999999999999999888888 Q ss_pred HHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeecccc-chhHHHHHHHHHHHH Q lcl|NC_012530. 267 FFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINI 345 (559) Q Consensus 267 ~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~ 345 (559) ++.+... -+++++.. ..+..+ +.+.+.++....+.+|.| +.++.. +-+|..++.+ .+ +-+........ T Consensus 328 Ll~ka~l--~v~ktd~l---~~l~~e--d~l~~r~~~~~~~rdN~G-i~liD~-eEe~e~ls~slSG--L~dll~~~~q~ 396 (862) T protein:vir:99 328 LAMNKRT--TAIHTDTA---KAIANE--DKFIQRLMFWVRYRDNHA-VKVLGT-DETMEQFDTSLAD--FDAVIMGQYQL 396 (862) T ss_pred HHHHhcc--ceeechhH---hhhccH--HHHHHHHHHHHhccCcce-eEEecC-CCceeEEecccCC--hHHHHHHHHHH Confidence 8877543 24455432 122222 234444443334444433 556644 3567766532 22 34556666778 Q ss_pred HHHHhCCCHHH-hccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhh Q lcl|NC_012530. 346 ICALVAMDPAE-IGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTR 424 (559) Q Consensus 346 Ia~~fgVPp~~-lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~ 424 (559) ||.+.+||..+ +|....|..++.+ ....|.-......-..-|.|+++++...+...+.. ...+.|+|+.|... T Consensus 397 IAaas~IP~tiLfGqspaGlnATGE---~D~~nYyD~I~s~QE~~L~P~LerL~~li~~~lg~---~~d~~ieFnpL~~~ 470 (862) T protein:vir:99 397 VASIAKTPATKLLGTAPKGFNSTGE---FETISYHEELESIQEHVYMPFLQRHYLISRLSLGI---QHEIDVVMEPVASM 470 (862) T ss_pred HHhhhCCCceeecccCcccccCchH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC---CCcceEEeCCCCCC Confidence 99999999985 4533223211111 12233333333333456889999998877655432 24699999999988 Q ss_pred hHHHHHHHH-------HHHHc-CCCCHHHHHHHh------CCCCCCCCCEeeccceeccccccccccccccccccccccc Q lcl|NC_012530. 425 SQQDKLKSV-------QLELQ-TATTVNDYREKQ------GLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQ 490 (559) Q Consensus 425 d~~~~~~~~-------~~~~~-~~~T~NE~R~~~------gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 490 (559) +.++++++. +.++. |.|+++|+|+++ |++.++..|..-.+.-. + +...+. T Consensus 471 sekEkAEi~kk~Aea~~~lv~sGvispdEvR~~L~~~~~~g~~~l~ded~E~d~~~~------------~--e~~~~~-- 534 (862) T protein:vir:99 471 TAQQQADLNKTKAEGGKVLIDGGVISPDEERNRIRDDKRSGYNRLTKEDAEETPGAS------------P--ENLAAY-- 534 (862) T ss_pred CHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCcCCCCCCcccccccCCCC------------c--cccccc-- Confidence 888887764 44554 558999999976 33333322211000000 0 000000 Q ss_pred ccccCCCCCCCCCCCCccccc----cchhccccccccccccccccccccccccccc--------cccchhhhhhccCCCC Q lcl|NC_012530. 491 LESALQNPSGTPPTLPPSSSN----SFQQNQEGYTGKDAKPSGKDNQQGVGKDGQL--------KNKKNTNSYKQGGSSK 558 (559) Q Consensus 491 ~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~--------k~~~~~~~~~~~~~~~ 558 (559) ...+.+...++.++..... .+.++.. ....++++..+....+++.. +...+.++++ .-+++ T Consensus 535 --e~~g~a~~~ap~de~~aga~~~~~e~d~~~----~p~~~~~~~g~~~~~t~~~~a~~p~~~~~~~~~~~~~~-e~~~~ 607 (862) T protein:vir:99 535 --QKAGAAQETASAKETQAGAAVTTAEGDQPN----VQMVPSMKPGQMVGPEVGITAPMPEDDAPVAGVVAKLA-ELQQA 607 (862) T ss_pred --ccCCcccccccccccccccCCccccCCccc----ccccCCCCCCCccccccccccCCCccccccCcccccch-hhhcC Confidence 0000011101100000000 0000000 00011112111111111100 0000111000 00111 Q ss_pred C Q lcl|NC_012530. 559 K 559 (559) Q Consensus 559 ~ 559 (559) + T Consensus 608 ~ 608 (862) T protein:vir:99 608 Q 608 (862) T ss_pred c Confidence 1 No 110 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.77 E-value=5.3e-18 Score=115.31 Aligned_cols=449 Identities=12% Similarity=0.041 Sum_probs=213.4 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcc----c---- Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFG----R---- 72 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~----~---- 72 (559) ||||||.-+.|. +....+++. .+ . .. .+|..-..+. ... ..+ +..+... . T Consensus 1 mn~~dr~i~~~s---P~~~~~R~~-ar--~----~~----~~y~aa~~~r---~~~----~~~-~~~s~~~~~~~~~~~l 58 (502) T protein:vir:79 1 MAILDDVIGVFS---PGWKAARLR-SR--A----VI----QAYEAVKTTR---THK----ARR-ENRTADQLSQYGAVSL 58 (502) T ss_pred CchHhhHHhhcC---hHHHHHHHh-hH--H----HH----hhccccCccc---ccC----CCC-CCCChHHHHHHHHHHH Confidence 999999944432 222222211 11 1 11 1121110000 000 000 1111010 1 Q ss_pred HHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecc--ccc-ccChhHHHHH-HHHHHHHHhcCCCCCCC Q lcl|NC_012530. 73 ITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLK--NGD-KPTKEQQKKI-DYAERYIERMGVDYSPI 148 (559) Q Consensus 73 ~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~--d~~-~~~~~~~~~~-~~~~~~L~~~~p~~~~~ 148 (559) ....++.+.+++++..+|+.+.+.|-.- .++.+..+ ..+ ....+..+++ .....|..++.. .. T Consensus 59 r~RaRdl~rNn~~a~~av~~~~~nvVG~----------ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~---~g 125 (502) T protein:vir:79 59 REQARYLDNNHDLVIGVFDKLEERVVGK----------NGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEV---TG 125 (502) T ss_pred HHHHHHHHhcChHHHHHHHHHHHhhccC----------CceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCc---cc Confidence 1133556778999999999888876421 12222221 111 1111111121 122344444322 34 Q ss_pred hhhHHHHHHHHHHHHHHcCCcceEEEECCC-------CcEEEEEEecCceEEEE------------ecCcccccccceEE Q lcl|NC_012530. 149 RDDFTSFLRKLVRDTYTYDQVNYENTYDSN-------GRLSHTRMVDPTTIYFA------------NDEHGHRRTRGKIY 209 (559) Q Consensus 149 ~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~-------G~~~~L~~l~p~~V~~~------------~~~~g~~~~~~~~y 209 (559) +.+|..+...+++.++..|.+|+.++++.. +.+..|..|+|++|..- .|..|.. ..| T Consensus 126 ~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~~~~g~~~~l~lq~iepd~l~~~~~~~~~i~~GVe~d~~Gr~----~aY 201 (502) T protein:vir:79 126 QFTRPMLERLMLRTWLRDGEVFAQMVSGRINSLTPSAGVHFWLEALEPDFIPMTSDESNRLNQGVFVDDWGRP----EKY 201 (502) T ss_pred cCCHHHHHHHHHHHHHhCCceEEEEeecccCccCCCcccceEEEEecchhcCCCCCCCCeeEeeeEECCCCce----EEE Confidence 568899999999999999999999877543 34679999999888532 2222221 122 Q ss_pred EEEe-c-----CceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCc Q lcl|NC_012530. 210 RQYI-D-----NKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPS 283 (559) Q Consensus 210 ~~~~-~-----~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ 283 (559) .... . ......+++++|||+...-++ ...-|+|.+..++..+.......+....--+=.+...++|+.+.+ T Consensus 202 ~i~~~hPgd~~~~~~~rvpA~~vlH~f~~~r~---gQ~RGis~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~ 278 (502) T protein:vir:79 202 LVYKSRPVSGRQMETKEVDAERMLHLKFVRRL---HQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDG 278 (502) T ss_pred EEeecCCCCCcccceeEechhheEEeecccCC---ccccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCC Confidence 2211 1 122346889999999753333 234599999888887766655555544444445566777765432 Q ss_pred cCCccCCHHHHHHHHHHHHHHhcCcccccc-cccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhCCCHHHhcccc Q lcl|NC_012530. 284 PSVTNTSMRALEDFKRHWTATSSGINGAYR-IPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVAMDPAEIGMQN 361 (559) Q Consensus 284 ~~~~~~~~e~~~~l~~~~~~~~~G~~nag~-~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~ 361 (559) .... .+.... .-....... ..|. ++.|..| .+++..+.+ ....|.+..+...+.||+.+|||-+.|--.- T Consensus 279 ~~~~---~~~~~~---~~~~~~~~l-~pG~i~~~L~pG-e~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~ 350 (502) T protein:vir:79 279 QSYE---PDGNGS---KENERELTI-QPGIIYDDLKPG-EEIGMVKSDRPNPNLETFRNGQLRAVAAGSRLSFSSTARNY 350 (502) T ss_pred cccc---cccCCC---CCccccccc-cCCccccccCCC-ceeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccc Confidence 1110 000000 000000011 1243 3445443 555554432 3457999999999999999999987773211 Q ss_pred ccccccccccc-hhhhhHHHHHHHHHHHHhhHHHHH-HHHHHHhhccccc--cC--ccceeeecc--hhhhhHHHHHHHH Q lcl|NC_012530. 362 RGGATGNKSNS-LNESNNQNKIDASKSKGLMPLLDM-IAKNLTNGIIRQI--LG--DNYMLEFVG--GDTRSQQDKLKSV 433 (559) Q Consensus 362 ~~~~~~~~~~~-~~~an~~~~~~~~~~~~l~P~~~~-ie~~ln~~L~~~~--~~--~~~~~~f~~--l~~~d~~~~~~~~ 433 (559) .++|++...+. ......+.....+....++|+..+ ++.++....++-. .. ..+.++|.+ ....|+...+++. T Consensus 351 s~nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~ 430 (502) T protein:vir:79 351 NGTYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAW 430 (502) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccChHHHHHHH Confidence 11111110000 000112223344566777776664 5666655554321 11 123445543 3345777777888 Q ss_pred HHHHcCC-CCHHHHHHHhCCCCCCCCCEeeccceeccccccccccccccccccccccccc-----ccCCCCCCCCCCCCc Q lcl|NC_012530. 434 QLELQTA-TTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLE-----SALQNPSGTPPTLPP 507 (559) Q Consensus 434 ~~~~~~~-~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~ 507 (559) ..++.+| +|+-++-++.|..|-+--++ +.. +.+ ...+.+.+. ...+..+.+.+..++ T Consensus 431 ~~~i~~Gl~t~~~~~a~~G~D~~~v~~q------------~a~----e~~-~~~~~Gl~~~~~~~~~~~~~~~~~~~~e~ 493 (502) T protein:vir:79 431 KIQIRGGAATESDWVRAGGRNPDDVKRR------------RKA----EID-ENRKLDLVFDTDPASDKGGSSAATKRQEP 493 (502) T ss_pred HHHHHcCCCCHHHHHHHcCCCHHHHHHH------------HHH----HHH-HHHHcCCCCCCCCCCCCCCCCCCCCCCCC Confidence 8888765 69999888889987432111 000 000 000001000 000000000000000 Q ss_pred cccccchhccccccccccccc Q lcl|NC_012530. 508 SSSNSFQQNQEGYTGKDAKPS 528 (559) Q Consensus 508 ~~~~~~~~~~~~~~~~~~~~~ 528 (559) .. + .+.+|. T Consensus 494 ~~-----------~-~~~~e~ 502 (502) T protein:vir:79 494 QH-----------T-DDQSEE 502 (502) T ss_pred CC-----------C-CCCCCC Confidence 00 0 000000 No 111 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=99.76 E-value=1.8e-16 Score=106.93 Aligned_cols=425 Identities=13% Similarity=0.084 Sum_probs=220.0 Q ss_pred hccc-ccccccccccccccccccc-cccccc----CCCCCcccHHHHHHHH-hhChHHHHHHHHHHHHHHhhhhHhhhhc Q lcl|NC_012530. 36 LNGV-DRAYTEPVDGNLMFSTLED-TSIVPK----PSPIAFGRITDVLRQY-SMNVVLNAIINTRANQVTEYAHRASTDD 108 (559) Q Consensus 36 ~~gr-~~a~~~~~~~~~~~~~~~~-~~~~~~----p~~~~~~~~~~~~~~~-~~~~~v~acv~~ia~~ia~~~~~~~~~~ 108 (559) ..-+ ++...++.+++.......+ +.+... |.-++... .++.+.. .+-+.|.+|+..|...|..++ T Consensus 1 ~~~~~~~~~p~~~~g~~~~~~~~~~~~~~~~~e~~~~lr~~~~-~~ly~~m~e~D~~i~s~l~~rk~av~~~~------- 72 (469) T protein:vir:10 1 MTERVKTAAPVSEAGYVFGSGVVDGWTVWDPFEQTPELQWPQS-VAVYSRMDNEDSRVTSLLEAISLPIRSTP------- 72 (469) T ss_pred CCCcccCCCCccchhhhhhcccccchhhccccccccccccccc-hHHHHHHHhhChHHHHHHHHHHHHHhcCC------- Confidence 1111 0111111222221111100 111110 10111111 2233333 358899999999999888544 Q ss_pred CCcceeeecccccccChhHHHHHHHHHHHHHhcCCCC--------CCChhhHHHHHHHHHHHHHHcCCcceEEEECC--- Q lcl|NC_012530. 109 NGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDY--------SPIRDDFTSFLRKLVRDTYTYDQVNYENTYDS--- 177 (559) Q Consensus 109 ~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~--------~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~--- 177 (559) |+|.+.+.+ ++ ..+.+...|..+.... ...+.+|++++..++.+.+.+|.++.+++|.. T Consensus 73 ----w~v~p~~~~---~e---~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~ 142 (469) T protein:vir:10 73 ----WRIRANGAS---DE---VTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQ 142 (469) T ss_pred ----ceEecCCCC---HH---HHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccc Confidence 556443221 11 2223333333221110 01134678888888888888999999999864 Q ss_pred --CCc--EEEEEEecCceEE-EEecCccccc-cc------ceEEEEEecCceeeeecccceEEEecccCCCccCCccccc Q lcl|NC_012530. 178 --NGR--LSHTRMVDPTTIY-FANDEHGHRR-TR------GKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLS 245 (559) Q Consensus 178 --~G~--~~~L~~l~p~~V~-~~~~~~g~~~-~~------~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~S 245 (559) +|. +..|.+.|+.++. ...+.++... .. ......+..+.....+++...|++++++.+ ..+||.| T Consensus 143 ~~dG~~~~~~l~~rp~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lp~~k~i~~~~~~~~---g~p~g~g 219 (469) T protein:vir:10 143 SPDGRFWLRKLAPRPQWTISKFNVAPDGGLESIEQIAPPARTRGSLYVANIAPPEIPVNRLVVYTRNKRP---GQWQGKS 219 (469) T ss_pred cCCCceeeeeeeecCcccceeeeeccCCceeeeeecCcccccccccccCCCCccccccCcEEEEEecCCC---CCcccch Confidence 343 6678888887663 3333333211 00 000011111222234556666777665544 3578999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeee Q lcl|NC_012530. 246 ELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFV 325 (559) Q Consensus 246 pl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~ 325 (559) .+..|......-....++...|...-+.|--|.+++. ..++++++.+.+...+...|.+ ++ .|++.+ +++. T Consensus 220 Llr~~~~~~~fK~~~~~~w~~f~EryG~P~~vgky~~-----~a~~~ek~~l~~a~~~~~~g~~-a~--~iip~~-~~ie 290 (469) T protein:vir:10 220 ILRSAYKHWLLKDKLLRIEAATAERNGMGIPVGTASS-----ATDEDEVRKMAALARSVRGGIN-AG--VGLAQG-QILE 290 (469) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHcCCcceEEecCC-----CCCHHHHHHHHHHHHHHhcCCc-eE--EEccCC-ceEE Confidence 9999999999999999999999999888877776643 3567888888887776655543 22 345443 5555 Q ss_pred eccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhh Q lcl|NC_012530. 326 SMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNG 404 (559) Q Consensus 326 ~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~ 404 (559) -+.. .....|.+..++..++|+.+. ||-.-.. ...+++ ++..+. ........+.-.++.|+..||+. T Consensus 291 ~~ea~g~~~~~~~li~~~d~~Isk~i------LG~tlTs---~~~gGS--~a~~~v-h~ev~~d~~~sDa~~i~~tln~~ 358 (469) T protein:vir:10 291 LLGVSGNLPDIRRAIEGHDRSIALSG------LAHFLNL---DGKGGS--YALASV-LEDPFTQAVHAYATSICRIANQH 358 (469) T ss_pred EeecCCCchHHHHHHHHHHHHHHHHH------hcccccc---cCccch--hhHHHH-HHHHHHHHHHHHHHHHHHHHHHH Confidence 4442 233468889999999998875 4422111 111222 232222 23344567888999999999998 Q ss_pred ccccc-----cC--ccceeeecchhhhhHHHHHHHHHHHHcCCC------CHHHHHHHhCCCCCCCCCEeeccceecccc Q lcl|NC_012530. 405 IIRQI-----LG--DNYMLEFVGGDTRSQQDKLKSVQLELQTAT------TVNDYREKQGLPKIAGGDIILSAVYIQRLG 471 (559) Q Consensus 405 L~~~~-----~~--~~~~~~f~~l~~~d~~~~~~~~~~~~~~~~------T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~ 471 (559) |+.+. .. ...+|.|.... .+.+..++.++.++..|+ +.+.+|+.+|+|+-..++..+.+. .+.. T Consensus 359 li~~l~~lN~g~~~~~P~~~~~~~e-~~~~~~a~~i~~l~~~G~~~~~~~~~~~~~e~~gip~~~~~~~~~~~~--~~~~ 435 (469) T protein:vir:10 359 IIEDLVDINFGVDTPAPVLTFDPIG-SRQDLTAAAVKLLYDAGVFDDDPAVKRAIRQRFNLPSELNDTPSAEPE--EPAA 435 (469) T ss_pred HHHHHHHhcCCCCCCccEEEecCCC-CcHHHHHHHHHHHHhcCCccCccccHHHHHHHhCCCCCCCCcccccch--hccc Confidence 87631 11 12467776544 455667788877776443 457899999999766554432110 0000 Q ss_pred cccccccccccccccccccccccCCCCCCCCCCCCccccccchhccccccccccccccccccccccccccccccch Q lcl|NC_012530. 472 QQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQLKNKKN 547 (559) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~k~~~~ 547 (559) ....+.. +....+.+..+. ...+.++.....+++ T Consensus 436 ---------------~~~~~~~----~~~~~~~~~~~~-----------------------~~~~~~~~~~~l~da 469 (469) T protein:vir:10 436 ---------------VPNQSAA----PARTRSSGNADA-----------------------RARAPKADQGVLFDA 469 (469) T ss_pred ---------------CCCCCcc----ccccCCCCCccc-----------------------ccccCCChHHhhccC Confidence 0000000 000000000000 000011111111111 No 112 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.76 E-value=9.4e-18 Score=113.92 Aligned_cols=392 Identities=14% Similarity=0.134 Sum_probs=195.0 Q ss_pred ccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeee Q lcl|NC_012530. 37 NGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVR 116 (559) Q Consensus 37 ~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~ 116 (559) --|.-.|..-+. + +... ....-+........+...|..+.+++++|+++|+...+- +|+|. T Consensus 1 ~~~~D~~~n~~~-----g-g~~~--~~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~-----------g~~i~ 61 (422) T protein:vir:10 1 MVKTDSYANIFL-----G-GSDG--SEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAA-----------GFHID 61 (422) T ss_pred CccchhhHHHHc-----C-CCCC--ccccCcccccCHHHHHHHHHhChhhHHHHhhhhHHHhcC-----------Ccccc Confidence 000000111000 0 1000 011111223345667778889999999999999987532 23343 Q ss_pred cccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEE-C---------CCCcEEEEEE Q lcl|NC_012530. 117 LKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTY-D---------SNGRLSHTRM 186 (559) Q Consensus 117 ~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~r-d---------~~G~~~~L~~ 186 (559) ..+. +.+ ++.-+.+.+ .++-+...+....++|.+++.+.- + ..|.+..|.+ T Consensus 62 ~~~~-------~~~---~~~~~~~l~---------~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v 122 (422) T protein:vir:10 62 GIDD-------EPA---FWSRWDDLE---------MTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRV 122 (422) T ss_pred CCCH-------HHH---HHHHHHHhh---------HHHHHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEe Confidence 2211 111 112222211 233444555566678999888764 3 3567889999 Q ss_pred ecCceEEEEecCcc-c-ccccceEEEEEecCc--eeeeecccceEEEecccCCC---ccCCcccccHHHH-HHHHHHHHH Q lcl|NC_012530. 187 VDPTTIYFANDEHG-H-RRTRGKIYRQYIDNK--VRGSFTADEMGMFIRNPRSD---ILSGGYGLSELEM-GLREFISHE 258 (559) Q Consensus 187 l~p~~V~~~~~~~g-~-~~~~~~~y~~~~~~~--~~~~~~~~evi~~~~n~~~~---~~~~~~G~Spl~~-~~~~i~~~~ 258 (559) +++..|.+..-... . ..+.-+.++++..+. ....+.++.|||+...+.++ .....||.|++.. +.+.|.... T Consensus 123 ~d~~~i~~~~~~~dp~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~ 202 (422) T protein:vir:10 123 YDRTQVKVQTREENPRNARFGEPLTYRITTNESDMFYDVHYSRIHIIDGERIPNVMRRQNDGWGRSVLSSDILDSIKDYT 202 (422) T ss_pred eccccccchhcccCccccccCcceEEEEecCCCCcceeeccceeEEeCCCCchhhhcccCCcccchhHHHHHHHHHHHHH Confidence 99998876431111 0 112233444444332 22456777788885444332 2344579999986 678888888 Q ss_pred HHHHHHHHHHHhcCCCceEEEecCc---cCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeecccc-chhH Q lcl|NC_012530. 259 NTELFNDRFFTHGGTTKGILLVKPS---PSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQA-EDMQ 334 (559) Q Consensus 259 ~~~~~~~~~f~ng~~p~gil~~~~~---~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~-~D~q 334 (559) .+.......|..... -+++++.. ...+....+.++++. ......| +.+.+ ++.+++.++..++.+ .+ T Consensus 203 ~~~~~~~~l~~~~~~--~v~~~~~l~~~~~~~~~~~~~~~r~~--~~~~~~~--~~~~~-~l~~~~e~~e~~~~~lsg-- 273 (422) T protein:vir:10 203 NCERLATQLLKRKQQ--AVWKAKGLAELCDDSEGFGAARLRLA--QVDNNSG--VGQAI-GIDAESEEYSVLNSDIGG-- 273 (422) T ss_pred HHHHHHHHHHHHhcc--ccccchhHHHhcCCccchHHHHHHHH--HHHHhcC--Cccce-eEecCCcceEEEecccCC-- Confidence 888887776655433 23454431 111222222233322 2222333 33333 443444567766532 33 Q ss_pred HHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccc Q lcl|NC_012530. 335 FQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNY 414 (559) Q Consensus 335 f~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~ 414 (559) +.+........||++.+||..+|.=...++.++++ .....|.-...+..-..-|.|.+.++=..|- . ...+ T Consensus 274 l~~~~~~~~~~iaaa~~IP~t~L~G~s~~Glnatg--d~d~~~yyd~i~~~Qe~~l~p~l~~l~~~i~----~---s~~~ 344 (422) T protein:vir:10 274 IDAFLDKKFDRIVALSGIHEIILKNKNVGGVSSSQ--NTALETFHKLVDRKRNAELLPILEFLIPFIV----N---AEEW 344 (422) T ss_pred hHHHHHHHHHHHHhhhCCCeeeeccCCcccccccc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc----c---cCCc Confidence 45677888889999999999877322233332211 1111222233333333456676665543332 1 1368 Q ss_pred eeeecchhhhhHHHHHHHH-------HHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceeccccccccccccccccccc Q lcl|NC_012530. 415 MLEFVGGDTRSQQDKLKSV-------QLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQT 486 (559) Q Consensus 415 ~~~f~~l~~~d~~~~~~~~-------~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~ 486 (559) .|+|+.+...+.++++++. +.++. |.++++|+|+.+--.....| +.+ .+.+. .... T Consensus 345 ~~~f~pL~~~sekekaei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~---~~~-~~~~~------------~~~~ 408 (422) T protein:vir:10 345 SVEFNPLAQESSKDKAEILEKNVNSIAALIAAGAMDIDEARDTLRTIAPEVK---IND-GSVET------------EVTI 408 (422) T ss_pred EEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHhhhhccccc---CCC-CCCcc------------ccch Confidence 8999999888888777653 34444 55899999998743221111 000 00000 0000 Q ss_pred ccccccccCCCCCCCCCCCCcccc Q lcl|NC_012530. 487 RLTQLESALQNPSGTPPTLPPSSS 510 (559) Q Consensus 487 ~~~~~~~~~~~~~~~~~~~~~~~~ 510 (559) . ..+++++.+|... T Consensus 409 ~----------~~~~~~~~~~~~d 422 (422) T protein:vir:10 409 S----------ETSNDPLEVPTDD 422 (422) T ss_pred h----------hcCCCCCCCCCCC Confidence 0 0001111111111 No 113 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.75 E-value=2.1e-17 Score=111.97 Aligned_cols=403 Identities=13% Similarity=0.128 Sum_probs=197.5 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+||-+=..+ ...+.-+|...+.+.. +.....+....+.+...+...| T Consensus 1 ~~~~m~~~~~-------------------------~~~~~D~~~~~~~~~~-------g~~~~~~~~~~~~~~~~l~~~Y 48 (435) T protein:vir:79 1 MGVFMSDKVK-------------------------AITKEDGYNEIFGSKD-------GTFRPNAFYMQRAAFKALSQFY 48 (435) T ss_pred CCcccccccc-------------------------cchhhcchhhhhcccc-------cccccCcccCCcCCHHHHHHHH Confidence 7776543110 0001111111111000 0000111112233556677778 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) ..+.+++++|+++|+...+- +++|...+ ..+ .++..+.+.+ .++-+...+ T Consensus 49 ~~~~l~~~~Vd~~aed~~r~-----------g~~i~g~~-------~~~---~~~~~~~~l~---------~~~~l~~a~ 98 (435) T protein:vir:79 49 EEDGMARRIVDVIPEEMVTP-----------GFKVDGVK-------NEK---SFKSRWDELR---------LNAKIIDAL 98 (435) T ss_pred hcCchhhhhhccchHHhhcC-----------CceecCCC-------hHH---HHHHHHHHhh---------HHHHHHHHH Confidence 89999999999999886532 23332211 111 2233333321 123444445 Q ss_pred HHHHHcCCcceEEEE-C---------CCCcEEEEEEecCceEEEEecCcc--cccccceEEEEEecCc--eeeeecccce Q lcl|NC_012530. 161 RDTYTYDQVNYENTY-D---------SNGRLSHTRMVDPTTIYFANDEHG--HRRTRGKIYRQYIDNK--VRGSFTADEM 226 (559) Q Consensus 161 ~d~ll~Gna~~~i~r-d---------~~G~~~~L~~l~p~~V~~~~~~~g--~~~~~~~~y~~~~~~~--~~~~~~~~ev 226 (559) ....++|.+++.+.- + ..|.+..|.+++|..|++...... ...+.-+.++++..+. ....+.++.| T Consensus 99 ~~~rl~G~~~i~i~~~d~~~~~~Pl~~~g~i~~i~v~d~~~i~~~~~~~dp~sp~fg~P~~y~v~~~~~~~~~~iH~SRl 178 (435) T protein:vir:79 99 SWSRLFGGSAILAVVADNKMLKSPVKPGAQLEDIRVYDRYQITIHERETNARSVRYGEPKLYKISPGGDIPEFFVHYSRI 178 (435) T ss_pred HhhhccccEEEEEEecCCCCcccccccCCceeeEEeechhhccchhhccCCcccccCcceEEEEecCCCCCceEEcceeE Confidence 556678988887763 2 345677899999988865321110 0112234455544332 2346778888 Q ss_pred EEEecccCCCc---cCCcccccHH-HHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCc---cCCccCCHHHHHHHHH Q lcl|NC_012530. 227 GMFIRNPRSDI---LSGGYGLSEL-EMGLREFISHENTELFNDRFFTHGGTTKGILLVKPS---PSVTNTSMRALEDFKR 299 (559) Q Consensus 227 i~~~~n~~~~~---~~~~~G~Spl-~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~---~~~~~~~~e~~~~l~~ 299 (559) ||+...+.++. ....+|.|++ +.+.+.|.....+.......+...... +++++.. ...+....+..+++. T Consensus 179 i~~~g~~~p~~~~~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~~~--v~~~~~l~~~~~~~~~~~~~~~r~~- 255 (435) T protein:vir:79 179 CIIDGERVSNEKRRQNDGWGASILNKRLIEAIVDYNYCQELATQLLRRKQQA--VWKARDLALMCDDEEGRYAARLRLA- 255 (435) T ss_pred EEecCCcchhhhccccCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCc--cccchhHHHhhcCccchHHHHHHHH- Confidence 88865443322 2456799998 678898988888888877766443322 2344321 111111222222321 Q ss_pred HHHHHhcCcccccccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhH Q lcl|NC_012530. 300 HWTATSSGINGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNN 378 (559) Q Consensus 300 ~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~ 378 (559) .....++ +.+.+ ++.+++.++..++.+ .+ +.+........||.+.+||..+|.=...++.+++ +.....|. T Consensus 256 -~~~~~~~--~~~~~-~i~~~~e~~e~~~~~lsg--l~~~~~~~~~~iaaa~~IP~t~L~G~s~~glnst--gd~d~~~y 327 (435) T protein:vir:79 256 -QVDDESG--VGKAI-GIDATDEEYEVLNSDVSG--VPEFLQEKIDRIVALTGIHEIIIKNKNTGGVSAS--QNTALETF 327 (435) T ss_pred -HHHHhcC--CCCce-eEecCCcceEEEecccCC--HHHHHHHHHHHHHhhhCCCeeeeccCCccccccc--hhHHHHHH Confidence 1222333 22334 443444566666532 22 4566778888999999999977632222222211 11122233 Q ss_pred HHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHH-------HHHHc-CCCCHHHHHHHh Q lcl|NC_012530. 379 QNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSV-------QLELQ-TATTVNDYREKQ 450 (559) Q Consensus 379 ~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~-------~~~~~-~~~T~NE~R~~~ 450 (559) -...+..-...+.|.+.++=..+- . ...+.|+|+.+...+.++++++. +.++. |.++++|+|+.+ T Consensus 328 yd~i~~~Qe~~l~p~l~~l~~li~----~---s~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g~i~~~e~r~~L 400 (435) T protein:vir:79 328 YKLIDRKRVEDYKPILEFLLPFMI----S---ETEWSIEFEPLSVPSDKDKAEIMAKNVESVVKLKAEQAINLKETRDTL 400 (435) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhh----c---CCCCeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHH Confidence 333333333456677666543332 1 14689999999888888777654 33444 568999999877 Q ss_pred -CCCCCCC-CCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccc Q lcl|NC_012530. 451 -GLPKIAG-GDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 451 -gl~pi~g-GD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (559) ...|..| .+.. ... + +. +++.+++......+.+ T Consensus 401 ~~~~~~~~~~~~~-----~~~---~-----------~~-----------~~d~~~~~~~e~g~~~ 435 (435) T protein:vir:79 401 RSICPDLKIMDND-----NIE---L-----------PE-----------PEDLDPEPGQEGGLNK 435 (435) T ss_pred HHhccccCCCCcc-----ccc---C-----------Cc-----------cccCCCCCCCCCCCCC Confidence 3222111 0000 000 0 00 0000000000000000 No 114 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.73 E-value=5.4e-17 Score=109.77 Aligned_cols=393 Identities=13% Similarity=0.122 Sum_probs=193.5 Q ss_pred cccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeeccc Q lcl|NC_012530. 40 DRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKN 119 (559) Q Consensus 40 ~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d 119 (559) =|.+....+.. +-.+...+. ..|....+ +-.++...|..+++++++|+++|+...+- +++|...+ T Consensus 1 ~~~~~~d~~~~--~~~~~~~~~-~~~~~~~~-~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~-----------g~~i~g~~ 65 (427) T protein:vir:10 1 MKIVKHDGYND--IFNGGADGS-PKPFFMSD-ASYHVGSFYNDNATAKRIVDVIPEEMVTA-----------GFKMSGVK 65 (427) T ss_pred CCccccchHHH--HhhcCCCCc-ccCccccC-chHHHHHHHHcCchhhhhhccchHHhhcC-----------CccccCcc Confidence 11111111100 001111111 11222222 22356778889999999999999987532 23343211 Q ss_pred ccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEE----------CCCCcEEEEEEecC Q lcl|NC_012530. 120 GDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTY----------DSNGRLSHTRMVDP 189 (559) Q Consensus 120 ~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~r----------d~~G~~~~L~~l~p 189 (559) +..+ ++..+.+.+ .++-+...+...-++|.+++.+.- +..|.+..|.++++ T Consensus 66 -------~~~~---~~~~~~~l~---------~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~d~ 126 (427) T protein:vir:10 66 -------DEKE---FKSLWDSYK---------LDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVYDR 126 (427) T ss_pred -------HHHH---HHHHHHHhh---------HHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEech Confidence 1111 222222221 233444555566678999987742 34678899999999 Q ss_pred ceEEEEecCcc--cccccceEEEEEecCce--eeeecccceEEEecccCCC---ccCCcccccHHH-HHHHHHHHHHHHH Q lcl|NC_012530. 190 TTIYFANDEHG--HRRTRGKIYRQYIDNKV--RGSFTADEMGMFIRNPRSD---ILSGGYGLSELE-MGLREFISHENTE 261 (559) Q Consensus 190 ~~V~~~~~~~g--~~~~~~~~y~~~~~~~~--~~~~~~~evi~~~~n~~~~---~~~~~~G~Spl~-~~~~~i~~~~~~~ 261 (559) ..|++...... ...+.-+.++++..+.. ...+.++.+||+...+.++ .....||.|++. .+.+.|.....+. T Consensus 127 ~~~~~~~~~~dp~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~ 206 (427) T protein:vir:10 127 FAITVEKRVTNARSPRYGEPEIYKVSPGDNMQPYLIHHSRVFIADGERVAQQARKQNQGWGASVLNKSLIDAICDYDYCE 206 (427) T ss_pred hcccccccccCccccccCcceEEEEecCCCCcceEEccccEEEecCCCchhhhcccCCcccchhhhHHHHHHHHHHHHHH Confidence 88876322110 01122344555443332 2457778888886544332 234567999986 5678888887777 Q ss_pred HHHHHHHHhcCCCceEEEecCc---cCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeecccc-chhHHHH Q lcl|NC_012530. 262 LFNDRFFTHGGTTKGILLVKPS---PSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQA-EDMQFQS 337 (559) Q Consensus 262 ~~~~~~f~ng~~p~gil~~~~~---~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e 337 (559) ......|..... -++++++- ...+....+.++++. ......+ +.+.+.+. ..+-++..++.+ .+ +-+ T Consensus 207 ~~~~~l~~k~~~--~v~k~~~l~~~~~~~~~~~~~~~r~~--~~~~~~~--~~~~~~l~-~~~e~~e~~~~~lsg--l~~ 277 (427) T protein:vir:10 207 SLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLA--QVDDNSG--VGRAIGID-AETEEYDVLNSDISG--VPE 277 (427) T ss_pred HHHHHHHHHhcc--ccccchhHHHHhcCccchHHHHHHHH--HHHHhcC--cccceeee-cCCCceeEEecccCC--hHH Confidence 777776655432 23444321 111122222233322 2223333 23334333 333566665532 22 455 Q ss_pred HHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceee Q lcl|NC_012530. 338 WLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLE 417 (559) Q Consensus 338 ~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~ 417 (559) ........||++.+||..+|.=...++.+++ +.....|.-...+..-...|.|.+.++=+.|- . ...+.++ T Consensus 278 ~~~~~~~~iaaa~~IP~t~L~G~sp~Glnst--gd~D~~nyyd~i~~~Qe~~l~p~l~~l~~~i~----~---s~~~~~~ 348 (427) T protein:vir:10 278 FLSSKMDRIVSLSGIHEIIIKNKNVGGVSAS--QNTALETFYKLVDRKREEDYRPLLEFLLPFIV----D---EEEWSIE 348 (427) T ss_pred HHHHHHHHHHhhhCCCeeeeccCCccccccc--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----c---CCCcEEE Confidence 6777888999999999987732223322211 11122333333333333557777766543332 1 1368999 Q ss_pred ecchhhhhHHHHHHHH-------HHHHc-CCCCHHHHHHHh----CCCCCCCCCEeeccceecccccccccccccccccc Q lcl|NC_012530. 418 FVGGDTRSQQDKLKSV-------QLELQ-TATTVNDYREKQ----GLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQ 485 (559) Q Consensus 418 f~~l~~~d~~~~~~~~-------~~~~~-~~~T~NE~R~~~----gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~ 485 (559) |+.+...+.++++++. ..++. |.++++|+|+.+ +...+.+++.+ .. + +.. T Consensus 349 f~pL~~~s~kEkaei~~~~a~a~~~~~~~gvi~~~e~r~~L~~~~~~~~~~~~~~~-------~~---------e--~~~ 410 (427) T protein:vir:10 349 FEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNI-------NI---------R--EPE 410 (427) T ss_pred eCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhhhccccCCCCccc-------cc---------c--ccc Confidence 9999888888777653 34454 558999999877 23333222110 00 0 000 Q ss_pred cccccccccCCCCCCCCCCCCcccccc Q lcl|NC_012530. 486 TRLTQLESALQNPSGTPPTLPPSSSNS 512 (559) Q Consensus 486 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (559) .. .+.+++.++....+. T Consensus 411 ~~----------~e~~p~~~e~~~d~~ 427 (427) T protein:vir:10 411 ET----------TEPEPGLGEKLEDEN 427 (427) T ss_pred hh----------cCCCCCCCCCCCCCC Confidence 00 000000000000000 No 115 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.73 E-value=3.3e-17 Score=110.97 Aligned_cols=411 Identities=11% Similarity=0.066 Sum_probs=203.2 Q ss_pred HHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHh Q lcl|NC_012530. 25 SKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRA 104 (559) Q Consensus 25 ~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~ 104 (559) +...++.-.+...++--++..-..+.. ............+......+...+...|..+..++++|+++|+..-+ T Consensus 1 ~~~~~~a~~~~~~~~a~~~~~~~~~~g-~~~~~d~~~~~~~~~~~~~~~~~l~~lY~~~~l~r~iVd~~a~d~~r----- 74 (461) T protein:vir:80 1 MYSIDKAKQAKIDSKIVNRNDFMVGHG-KANSRDKLTRQTPGNGQKLDLKACENLYASNSIAMNIVDIISEDMVR----- 74 (461) T ss_pred CccchhhhhhhhhhhhhhhhHHHhhcC-CcchhhhhhccccCcccccCHHHHHHHHHhCCccchhhccchHHhhc----- Confidence 111111111111111111111111110 00000000000011111235566667788899999999999987642 Q ss_pred hhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEE-CCC----- Q lcl|NC_012530. 105 STDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTY-DSN----- 178 (559) Q Consensus 105 ~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~r-d~~----- 178 (559) .++++...++ + ..+.+..++.+.+ .++-+...+.+..++|.+|+++.- +.+ T Consensus 75 ------~g~~i~~~~~-----~---~~~~~~~~~~~l~---------~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~~~~ 131 (461) T protein:vir:80 75 ------AGWSLKTDNK-----E---MKKNIESKWRKLK---------TKDRFQKLYADKRLYGDGFLSIGVVSSNREQAD 131 (461) T ss_pred ------CCeeeecCCH-----H---HHHHHHHHHHHhh---------HHHHHHHHHHhhcccccEEEEEEeecCCccccC Confidence 1344443321 1 2233445554432 234455556667789999988753 211 Q ss_pred -------CcEEEEEEe---cCceEEE---EecCcccccccceEEEEEec-------------CceeeeecccceEEEecc Q lcl|NC_012530. 179 -------GRLSHTRMV---DPTTIYF---ANDEHGHRRTRGKIYRQYID-------------NKVRGSFTADEMGMFIRN 232 (559) Q Consensus 179 -------G~~~~L~~l---~p~~V~~---~~~~~g~~~~~~~~y~~~~~-------------~~~~~~~~~~evi~~~~n 232 (559) +.+.+|..| ++..|.+ ..+..+ ..+.-+.++++.. +.....+.+.-|||+... T Consensus 132 ~~~pl~~~~~~~~~~l~~~~~~~i~~~~~~~dp~s-p~fg~P~~y~i~~~~~~~~~~~~~~~~~~~~~iH~SRii~~~~~ 210 (461) T protein:vir:80 132 LSTAIDPKTIKSIPYINTFNTQKVTQLYLNQDMFS-EHFGEVEFFEVNRVSQLGEEILSGTTASTSEQIHRSRIIHEQGL 210 (461) T ss_pred ccCCcccccccceeEEEeccccccchhhhcccCcC-cccccceEEEEeccccccccccccccCccceEEccccEEEecCC Confidence 122233333 3333221 111111 1112233444322 223356788899998765 Q ss_pred cCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccc Q lcl|NC_012530. 233 PRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAY 312 (559) Q Consensus 233 ~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag 312 (559) +.++ ..+|.|.++.+...|.....+......+..+...+ +++++.. ..+..+....+.+.++...+ | . T Consensus 211 ~~~~---~~~G~S~le~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l---~~~~~~~~~~~~~~~~~~~~---~-~ 278 (461) T protein:vir:80 211 RFEG---ETKGRSIFESLYDIITVMDTSLWSVGQILYDFAFK--VYKTDDI---DALNKDDKANLTAMLDFMFR---T-E 278 (461) T ss_pred CCCc---cccCcchHHHHHHHHHHHHHHHHHHHHHHHHhCCC--ceecchH---HhhhchHHHHHHHHHHHhcC---C-c Confidence 5443 45799999999999999888888887777665443 4565532 22333444455555654432 2 2 Q ss_pred ccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhh Q lcl|NC_012530. 313 RIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLM 391 (559) Q Consensus 313 ~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~ 391 (559) .+.++.. +-++..++.+ .+ +.+........||.+-+||..+|.-...+..++ +.....|.-...+..-..-+. T Consensus 279 g~~~~d~-~e~~e~~~~~lsg--l~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~as---ge~D~~~yyd~i~~~qe~~l~ 352 (461) T protein:vir:80 279 ALAIIKG-DEQLTKESTNVSG--MKDLLDYGWDYLAGAVRMPKTVLKGQEAGTLTG---AQYDVMNYYARVSSIQENRLR 352 (461) T ss_pred eEEEEcC-CcceEEEecCcCC--HHHHHHHHHHHHhhhhcCCeeeeecccCCcccc---chHHHHHHHHHHHHHHHHHHH Confidence 3445543 3566666532 33 456778888899999999998764333333222 111223333334444445678 Q ss_pred HHHHHHHHHHHhhccccc-----cCccceeeecchhhhhHHHHHHH-------HHHHHc-CCCCHHHHHHHh-C---CCC Q lcl|NC_012530. 392 PLLDMIAKNLTNGIIRQI-----LGDNYMLEFVGGDTRSQQDKLKS-------VQLELQ-TATTVNDYREKQ-G---LPK 454 (559) Q Consensus 392 P~~~~ie~~ln~~L~~~~-----~~~~~~~~f~~l~~~d~~~~~~~-------~~~~~~-~~~T~NE~R~~~-g---l~p 454 (559) |++.++-..|-+..+... ....+.|+|+.+...+.++++++ +..++. |+++++|+|+.+ + +.| T Consensus 353 p~le~l~~~i~~s~~~~~~~~~p~~~~~~i~f~~L~~~s~kekAe~~~~~a~a~~~~~~~g~is~~e~r~~l~~~~~~~~ 432 (461) T protein:vir:80 353 PQLEYLTRLLMWASDDCGPSIDPDSFEWAIEFNPLWNLDSKTDAEVRKLTAEADQIYIVNGVLDPDEVKETRFGRFGLEN 432 (461) T ss_pred HHHHHHHHHHHHHhcccccccCccccceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhcCCCC Confidence 888888777765443311 12468899999998888888775 444554 568999999855 2 333 Q ss_pred CCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhcccc Q lcl|NC_012530. 455 IAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEG 519 (559) Q Consensus 455 i~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (559) ..+ +..+.. +..+....... .+.+ ++-+| T Consensus 433 ~~~---------~~~~~~-------~~~~~~~~~~~------~~~~--------------e~~~g 461 (461) T protein:vir:80 433 SSK---------FSGDSA-------EIDKLAKLVYD------AYAK--------------KNADG 461 (461) T ss_pred Ccc---------CCCCCc-------hhhhhhhhccc------cccc--------------cCCCC Confidence 210 000000 00000000000 0000 00000 No 116 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.72 E-value=1.3e-16 Score=107.74 Aligned_cols=500 Identities=10% Similarity=0.036 Sum_probs=209.7 Q ss_pred Ccchhhhccc------------cccCCcc------hHHHHHHHHHH--HHHHhhhhcccccccccccc--cccccc---- Q lcl|NC_012530. 1 MGIFDRFRTK------------FYTDDPN------AFFKHIDSKIA--NDTASKALNGVDRAYTEPVD--GNLMFS---- 54 (559) Q Consensus 1 ~~~~~~~~~~------------~~~~~~~------~~~~~~~~~~~--~~~~~~~~~gr~~a~~~~~~--~~~~~~---- 54 (559) |=-|+-..+. +....+. .+ .++-+.+. -+.-...+.-+-+-+..|.. ...... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ds~~~~~~ 79 (765) T protein:vir:96 1 MFKLSWIFGRKKDNAACSESAPEKVARIPQHDPLDPM-IKLGKIRGWNVEPEKAPVIRSVKDFLEPGLSVAMDSAYGDGP 79 (765) T ss_pred CceeeeecccccccccccccCchhhhhcCCCCCcccc-hhHHHHhhcccccccCCCCCCCCcccCcccceeccccccccc Confidence 2222211111 1111110 01 00000000 00000000000000111110 000000 Q ss_pred ----cccccccc----ccC--CCCC--cccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccc Q lcl|NC_012530. 55 ----TLEDTSIV----PKP--SPIA--FGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDK 122 (559) Q Consensus 55 ----~~~~~~~~----~~p--~~~~--~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~ 122 (559) ....++.. ... .+.. ...-.+++..|..+.+++++|+++|+..-+- +++|... .++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~gyql~alY~~~~l~rkiVd~pAeDa~R~-----------g~~I~~~-~~e 147 (765) T protein:vir:96 80 TPAAKAAAGGQNPYVVPTMLQDWYNSQGFIGYQACAIISQHWLVDKACSMSGEDAARN-----------GWELKSD-GRK 147 (765) T ss_pred cchHHHhhhccCccchhhHHHhhhcccCCccHHHHHHHHhCchhhhhhhcchHHhhcC-----------CceeecC-ccc Confidence 00000000 000 0000 0112467778889999999999999886532 3444332 222 Q ss_pred cChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEEC----------------CCCcEEEEEE Q lcl|NC_012530. 123 PTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYD----------------SNGRLSHTRM 186 (559) Q Consensus 123 ~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd----------------~~G~~~~L~~ 186 (559) ..++. ...++..+.+.+ .++.+...+...-++|.+|+.+.-+ ..|.+..|.. T Consensus 148 ~~~~~---~~~l~~~~~rl~---------v~~~l~ea~~~~RlyGga~i~i~i~~~D~~~l~~PL~~~~I~kg~~kgl~v 215 (765) T protein:vir:96 148 LSDEQ---SALIARRDMEFR---------VKDNLVELNRFKNVFGVRIALFVVESDDPDYYEKPFNPDGIAPGSYKGISQ 215 (765) T ss_pred cCHHH---HHHHHHHHHHhh---------HHHHHHHHHHHhhhceeeEEEEEecccCcchhhccccccccccceeeEEEE Confidence 22222 223333333321 2344455556666789888876432 1134567777 Q ss_pred ecCceEEEEecC----c-ccccccceEEEEEecCceeeeecccceEEEecccCCCcc---CCcccccHHHHHHHHHHHHH Q lcl|NC_012530. 187 VDPTTIYFANDE----H-GHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDIL---SGGYGLSELEMGLREFISHE 258 (559) Q Consensus 187 l~p~~V~~~~~~----~-g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~---~~~~G~Spl~~~~~~i~~~~ 258 (559) |+|..+.+.... + ....+.-+.++++ .+ ..+.++-|||+...+.++.. ...+|.|-++.+...|.... T Consensus 216 ldp~~~~~~~v~e~~~Dp~sp~fg~P~~y~i-~g---~~IH~SRli~~~g~~lpd~lk~~~~~~G~Svlq~~yd~I~~~~ 291 (765) T protein:vir:96 216 IDPYWAMPQLTAESTADPSAEHFYEPDFWII-SG---KKYHRSHLVVVRGPQPPDILKPTYIFGGIPLTQRIYERVYAAE 291 (765) T ss_pred echhhcccccchhccccccccccCcceeeee-cC---ceeccceEEEecCCCchhhhccccCccCccHHHHHHHHHHHHH Confidence 777766553211 1 0111112233332 22 24567778888655543322 23469999999999999998 Q ss_pred HHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeecccc-chhHHHH Q lcl|NC_012530. 259 NTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQA-EDMQFQS 337 (559) Q Consensus 259 ~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e 337 (559) .+......++..... -+++++..... ..++ .+++.++......+|- .+.++.. +-+|..++.+ .+ +-+ T Consensus 292 ~t~~~~a~Ll~k~~~--~v~k~~~~~~l--~~~~---~l~~r~~~~~~~r~n~-g~~~id~-ee~~e~~s~~lsg--l~d 360 (765) T protein:vir:96 292 RTANEAPLLAMSKRT--STIHVDVEKAI--ANED---AFNARLAFWIANRDNH-GVKVIGI-DETMEQFDTNLSD--FDS 360 (765) T ss_pred HHHHHHHHHHHHhcc--ceeeechHhhh--ccHH---HHHHHHHHHHHhcCCc-eeEEecC-CcceeEEecccCC--HHH Confidence 888888777766543 24555432111 1222 3444444443333343 3455544 3567766532 22 455 Q ss_pred HHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceee Q lcl|NC_012530. 338 WLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLE 417 (559) Q Consensus 338 ~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~ 417 (559) ......+.||.+.+||...|-=...++.+. ++.....|.-...+..-...|.|.++++-+.|-.. ......+.|+ T Consensus 361 ~l~~~~~~iAaas~IP~t~LfGqsp~GlnA--TGe~D~~nYyD~I~s~Qe~~l~p~le~L~~li~~s---~~i~~d~~i~ 435 (765) T protein:vir:96 361 VIMNQYQLVAAIAKTPATKLLGTSPKGFNA--TGEHETISYHEELESIQEHIFDPLLERHYLLLAKS---ESIDVQLEIV 435 (765) T ss_pred HHHHHHHHHHhhhCCCeeeeccCCcccccC--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---cCCCCcceEE Confidence 667778899999999986653222122211 11112223333333333356777777766555432 2223469999 Q ss_pred ecchhhhhHHHHHHH-------HHHHHc-CCCCHHHHHHHhCCCCCCCCCEe----------eccceecccccccccccc Q lcl|NC_012530. 418 FVGGDTRSQQDKLKS-------VQLELQ-TATTVNDYREKQGLPKIAGGDII----------LSAVYIQRLGQQEQIKQN 479 (559) Q Consensus 418 f~~l~~~d~~~~~~~-------~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~----------~~~~~~~~l~~~~~~~~~ 479 (559) |+.|...+.++++++ ++.++. |.++++|+|+++...|.-|.+-+ +.|.+...+.... . T Consensus 436 FnpL~~~sekEkAei~~k~Aea~~~~~~~Gvis~dEvR~~L~~~~~~g~~~l~d~~~e~~~~~~pe~~~~~~~~~----~ 511 (765) T protein:vir:96 436 WNPVDSTTSQQQAELNNKKAATDEIYINSGVVSPDEVRERLRDDPRSGYNRLTDDQAETEPGMSPENLAELEKAG----A 511 (765) T ss_pred eCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhccccCCCCCCCccccccccCCCccccccccCCC----c Confidence 999998888888775 344454 55899999999876654332211 0010110000000 0 Q ss_pred cccccccccccccccCCCCCCCCCCCCccccccchhccccccccccccccccccccccccccccccchhhhhhccCCCCC Q lcl|NC_012530. 480 EFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQLKNKKNTNSYKQGGSSKK 559 (559) Q Consensus 480 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 559 (559) +... ...........+...+...++...-+...+...+.+. .|.+.....+.+.....+ ... --++..+| T Consensus 512 ~~~~---~~~e~~~~~a~p~~~eg~~~~~~~~p~~~~p~~~~~~----~~~g~~~~~p~~~~p~~~-~~~--~~~~~~~~ 581 (765) T protein:vir:96 512 QSAK---AKGEAERAEAQAGAVEGAGDPVPAAPRGTKPLAKAAE----EGAGEAATPPSRPNPRAE-LRN--LLSDLLSK 581 (765) T ss_pred cccc---ccCccccccCCCCccCCCCcccccCCcccCCcccccc----ccCccccCcccccccccc-chh--cccchhhh Confidence 0000 0000000000000001111111111111111111111 011111100000000000 000 00011111 No 117 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=99.69 E-value=4.3e-15 Score=99.35 Aligned_cols=434 Identities=12% Similarity=0.098 Sum_probs=207.3 Q ss_pred HHHHhhhhccccccccccccccccccccccccccccCCC--CCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhh Q lcl|NC_012530. 29 NDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSP--IAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRAST 106 (559) Q Consensus 29 ~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~--~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~ 106 (559) +...++...|=.+..-..+... ...+..+.+..-+.+ ++ ....++.+..+.-+.|.+|+..|...|..+ T Consensus 1 ~~~~~~~~~gl~p~rl~~i~~~--~~~~~~~~~~~~~~~~Lr~-~~~~~ly~~m~~D~hi~s~l~~Rk~av~~~------ 71 (488) T protein:vir:95 1 MADITETQESLPPFRMGEVGSL--GLKVKNGRIYEEPRQALRF-PESIKTFQLMMRDPAVAASVNIIKMFVRKV------ 71 (488) T ss_pred CCCccccCCCCCHHHHHHHHHH--hhccccchhhccchhhhcc-cchHHHHHHHhhChHHHHHHHHHHHHHhcC------ Confidence 1111112222222110000000 000000111111111 11 112334455556789999999999998854 Q ss_pred hcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECC--------- Q lcl|NC_012530. 107 DDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDS--------- 177 (559) Q Consensus 107 ~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~--------- 177 (559) .|.|.+...........+..+.+..++.+.. .+|.+++..++ |.+.+|-++.+++|.. T Consensus 72 -----~w~v~p~~~~~~d~~~~~~a~~v~~~l~~~~-------~~~~~~i~~~l-da~~~G~s~~Eivw~~~~~~~~~~~ 138 (488) T protein:vir:95 72 -----NWRFVPPKGKEQDPKMLERADFFNSLMDDME-------HDWADFINSVM-SFCTYGFCVNEKVYKKRQGKKGKYQ 138 (488) T ss_pred -----CceEecCCCCchhHHHHHHHHHHHHHHhccC-------ccHHHHHHHHH-Hhhcccceeeeeeeecccccccccc Confidence 4566544333223333444455555554321 24556777765 6788999999999963 Q ss_pred ----CCc--EEEEEEecCceEE-EEecCccccccc---ceEEEEEe-------cCceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 178 ----NGR--LSHTRMVDPTTIY-FANDEHGHRRTR---GKIYRQYI-------DNKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 178 ----~G~--~~~L~~l~p~~V~-~~~~~~g~~~~~---~~~y~~~~-------~~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) +|. |..|.+.++.+++ ...+.++..... ........ .......++....|++++...+ .. T Consensus 139 ~~~~dg~~~~~~i~~Rpq~~~~~f~~d~d~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~kfi~~~~~~~~---g~ 215 (488) T protein:vir:95 139 SKFDDGLIGWAKLPIRNQSTLDKWYFDEDFRRVTGVRQNLRNVSHIAGAINLGERPLTRKLPRAKFMLFKYDDEY---GN 215 (488) T ss_pred ccccCCeeeeeeeeecCcccccceeeccCCCceeecccccccccccccccccccccccccccccceEEEeecCCC---Cc Confidence 232 4566666664432 222333321100 00000000 0011123455555666655543 35 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhc---Ccccccccccc Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSS---GINGAYRIPMI 317 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~---G~~nag~~~vl 317 (559) +||.+.+..|......-.....+...|...-+.|--+...+.... ...+++....+.+.+.+... +...+| .|+ T Consensus 216 p~g~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~-~~~~~~e~~~l~~a~~~i~~~~~~~~~ag--~ii 292 (488) T protein:vir:95 216 PEGRSPLLNAYVPWKYKVQIEEYEAVGVSRDLVGMPKIGLPPDYL-DENAEPEKKAFVQYCKTVVNDMIANDRAG--LIW 292 (488) T ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeEeeccCCC-CCcccHHHHHHHHHHHHHHHHhhccchhh--eee Confidence 789999999999998888888888888876545444444433222 22334444444444433221 111122 244 Q ss_pred cCC-ce-------eeeeccc--cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHH Q lcl|NC_012530. 318 TAE-DA-------KFVSMTQ--AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKS 387 (559) Q Consensus 318 ~~g-~~-------~~~~ls~--~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~ 387 (559) +.+ .+ ++..++. ..-..|.+..++.-++|+.+. ||-.-..+ ..+++ +++..+ ....... T Consensus 293 P~g~~~~~k~~~~e~~l~~~~~~~~~~~~~li~~~d~~Isk~i------LGqtLT~~--~~~~G--s~Al~~-vh~ev~~ 361 (488) T protein:vir:95 293 PRYIDPDTKEDIFEFSLVSRQGAKAYDTGSIIDRYSKQIMMAF------MSDVLAMG--QSKYG--SFSLAD-SKTSLLA 361 (488) T ss_pred ccccccccchhhhhhhccccccCCchhHHHHHHHHHHHHHHHH------hccccccc--cCcch--hhhHHH-HHHHHHH Confidence 333 11 2222322 122247777888888888765 44321111 11112 233222 2233445 Q ss_pred HHhhHHHHHHHHHHHhhccccc-----cC--ccceeeecchhhhhHHHHHHHHHHHHcCCC--C----HHHHHHHhCCCC Q lcl|NC_012530. 388 KGLMPLLDMIAKNLTNGIIRQI-----LG--DNYMLEFVGGDTRSQQDKLKSVQLELQTAT--T----VNDYREKQGLPK 454 (559) Q Consensus 388 ~~l~P~~~~ie~~ln~~L~~~~-----~~--~~~~~~f~~l~~~d~~~~~~~~~~~~~~~~--T----~NE~R~~~gl~p 454 (559) ..+.-.++.|+..||+.|+.+. .. ...+|.|......|.++.++.++.++..|+ + .+.+|+.+|+|+ T Consensus 362 ~i~~aDa~~i~~tln~~li~~l~~~Nfg~~~~~P~~~~~~~e~~Dl~~~ae~~~~L~~~G~~i~~~~~~~~i~e~~gip~ 441 (488) T protein:vir:95 362 MSVDILLKQIKNVINRDLVAQTYALNMWDDEEHVQITYDDIETPDLEAIGSYIQKTVAVGALEVDKELSNKLREHIGLPP 441 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccEEEecCcChhhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCC Confidence 6778899999999999887642 11 224788888888899999999998887654 3 356999999997 Q ss_pred CCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhcccccccccccccccccc Q lcl|NC_012530. 455 IAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQ 533 (559) Q Consensus 455 i~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 533 (559) -+.+.....+..-++- ............+.+..+.++ +.....+. ++ T Consensus 442 ~~~~e~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~------------~~~~a~~~-------~~ 488 (488) T protein:vir:95 442 ADESQPVSEKLSPNSQ-------------SRSGDGYKTAGEGTAKTPSAK------------DPSTANKA-------NK 488 (488) T ss_pred CCCCccccccCCCCCC-------------CCCCcccCCCcccCCcccccc------------cchhhhhc-------cC Confidence 6554433222100000 000000000000000000000 00000000 00 No 118 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=99.68 E-value=3.1e-15 Score=100.13 Aligned_cols=462 Identities=11% Similarity=0.093 Sum_probs=226.7 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccc-cccccccccccccccccccccCCCCCcccHH---HH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAY-TEPVDGNLMFSTLEDTSIVPKPSPIAFGRIT---DV 76 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~-~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~---~~ 76 (559) -.|||.+--++--..+. +.....+.+-.+.+ ..|..+ ..+..+...+ +......+. ++ T Consensus 2 ~~~~d~~g~p~~~~~~~------------~~~~~~~~~~~~~~~~~~~~g---ltp~~l~~il---~~a~~gd~~~~~~L 63 (528) T protein:vir:10 2 AAIVDIYGNPLRTQQLR------------KQQTAHLAGLAKEFANHPAKG---LTPAKLAHIL---IEAEQGHLQAQAEL 63 (528) T ss_pred CeeECCCCCcccccccc------------chhhhhhhhhhhhhcccCCCC---CCHHHHHHHH---HhhhCCCHHHHHHH Confidence 24555553333222211 11111111111000 001000 0000000000 001111222 22 Q ss_pred HHHHh-hChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHH Q lcl|NC_012530. 77 LRQYS-MNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSF 155 (559) Q Consensus 77 ~~~~~-~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f 155 (559) .+.+. +.+-|.+|+..|...|..+ .|.|.+.+.. +...++..+.+..+|.+. ..|..+ T Consensus 64 ~~~m~e~D~~i~s~l~~Rk~av~~~-----------~w~I~p~~~~--~~~~~~~a~~v~~~l~~~--------~~f~~~ 122 (528) T protein:vir:10 64 FMDMEERDAHLFAEMSKRKRAVLGL-----------DWTIEPPRNA--SAAEKADAEYLHELLLDL--------EGIEDL 122 (528) T ss_pred HHHHHhhChHHHHHHHHHHHHHhcC-----------CceEecCCCC--CHHHHHHHHHHHHHHhCC--------ccHHHH Confidence 22223 5788999999999998854 4555543222 233445556666666542 124556 Q ss_pred HHHHHHHHHHcCCcceEEEECCC-C--cEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecc Q lcl|NC_012530. 156 LRKLVRDTYTYDQVNYENTYDSN-G--RLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRN 232 (559) Q Consensus 156 ~~~~v~d~ll~Gna~~~i~rd~~-G--~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n 232 (559) +..++ +.+.+|.++.+++|... | .|..|.++++..+.+..+ +.. .+...........+++...+++++. T Consensus 123 i~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~~~~r~~~~f~~~~~--~~~-----~l~~~~~~~~g~~l~~~k~iv~~~~ 194 (528) T protein:vir:10 123 MLDCM-DGVGHGYSAIELDWSLQGREWLPQAFDHRPQSWFQLNPD--DQD-----ELRLRDNSIAGEVLQPFGWIMHKPR 194 (528) T ss_pred HHHHH-hhhhhcceeEEEEEeecCCceeEEEeeeecccceeeccC--CCc-----EEeccCCCCCceeecCCCeEEEeec Confidence 65544 56789999999998653 3 367899999987765332 211 1111111111223445554444544 Q ss_pred cCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccc Q lcl|NC_012530. 233 PRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAY 312 (559) Q Consensus 233 ~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag 312 (559) +.+ ..+||.+.+..|......-....++...|...-+.|--+.+++. ..++++++++.+.+.+..++ + T Consensus 195 ~~~---g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~-----~a~~~ek~~L~~al~~i~~~---~- 262 (528) T protein:vir:10 195 SRS---GYVARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLPIRLGKYPP-----GTPDEEKVTLLRAVTGLGHA---A- 262 (528) T ss_pred CCC---CCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCC-----CCCHHHHHHHHHHHHHHhhC---c- Confidence 433 35689999999999999999999999999999999987877753 35678888888888765432 1 Q ss_pred ccccccCC-ceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhh Q lcl|NC_012530. 313 RIPMITAE-DAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLM 391 (559) Q Consensus 313 ~~~vl~~g-~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~ 391 (559) ..|++.+ .+++...+...-..|.+..++..++|+.+. ||-.-.+....+++++ ++-.+. ........+. T Consensus 263 -~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i------LGqtlTs~~~~g~~gS--~Alg~v-h~~v~~di~~ 332 (528) T protein:vir:10 263 -AGIIPESMSIDFQEASKGSAEPFMAMMRWCDDSMSKAI------LGGTLTSQTSESGGGA--YALGQV-HNEVRHDLLA 332 (528) T ss_pred -EEEecCCceeEEeecCCCChhHHHHHHHHHHHHHHHHH------hhhhhhccccccccch--hhhHHH-HHHHHHHHHH Confidence 2345443 355555433222357888899999998875 4422111001111122 222211 2223455677 Q ss_pred HHHHHHHHHHHhhccccc-----c-----CccceeeecchhhhhHHHHHHHHHHHHcCC--CCHHHHHHHhCCCCCCCCC Q lcl|NC_012530. 392 PLLDMIAKNLTNGIIRQI-----L-----GDNYMLEFVGGDTRSQQDKLKSVQLELQTA--TTVNDYREKQGLPKIAGGD 459 (559) Q Consensus 392 P~~~~ie~~ln~~L~~~~-----~-----~~~~~~~f~~l~~~d~~~~~~~~~~~~~~~--~T~NE~R~~~gl~pi~gGD 459 (559) -.++.|+..||+.|+.+. . ....+|.|......|.+++++.++..+..| ++..++|+.+|+|.-..|+ T Consensus 333 aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~gip~p~~~e 412 (528) T protein:vir:10 333 ADARQLAATLSRDLLWPLLVLNRSGNLDARRAPRLVFDLKDRADLAAMATSLPPLVKLGVQVPVNWVQEQLGIPLPANGE 412 (528) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCCCcccHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCCc Confidence 888999999998876532 1 112467888888889999999998887644 5899999999998666666 Q ss_pred EeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCcc-ccccchhccccccccccccccccccccccc Q lcl|NC_012530. 460 IILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPS-SSNSFQQNQEGYTGKDAKPSGKDNQQGVGK 538 (559) Q Consensus 460 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 538 (559) .++.+....+..+. ....... .....+...... ..+...+-......++..+. T Consensus 413 ~~~~~~~~~~~~~~----------~~~~~~~------~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~---------- 466 (528) T protein:vir:10 413 AVLGDQAGAGIAQL----------SRRPGPR------IAALAQVIGPRYRDQEALDQVLASLPAQDMQNQ---------- 466 (528) T ss_pred ccccCCCccccccc----------Ccccccc------cccccccccccccccchHHHHHHHHHHHHHHHH---------- Confidence 54422111000000 0000000 000000000000 00000000000000000000 Q ss_pred cccccccchhhhhhccCCCCC Q lcl|NC_012530. 539 DGQLKNKKNTNSYKQGGSSKK 559 (559) Q Consensus 539 ~~~~k~~~~~~~~~~~~~~~~ 559 (559) ...-.+.....-+.+.+-- T Consensus 467 --~~~~l~~i~~~l~~~~s~e 485 (528) T protein:vir:10 467 --ADSLVAPLLDVISRGGSEA 485 (528) T ss_pred --HHHHHHHHHHHHHhcCCHH Confidence 0000000000011111110 No 119 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=99.67 E-value=1e-14 Score=97.32 Aligned_cols=440 Identities=12% Similarity=0.054 Sum_probs=225.3 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHH- Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQ- 79 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~- 79 (559) -.|||.+=.++--.... +.....+.+- .+++.+. ..+..+...+..+++. T Consensus 2 ~~~~d~~g~p~~~~~~~------------~~~~~~~~~~----~~~~~~~-------------~~~gltp~~l~~iL~~a 52 (512) T protein:vir:19 2 GRILDISGQPFDFDDEM------------QSRSDELAMV----MKRTQEH-------------PSSGVTPNRAAQMLRDA 52 (512) T ss_pred cceeCCCCCcccccccc------------ccccchhccc----chhhccc-------------cccCCCHHHHHHHHHHh Confidence 23444432222111110 0000000000 0000000 0011222222222222 Q ss_pred -----------H----hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCC Q lcl|NC_012530. 80 -----------Y----SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVD 144 (559) Q Consensus 80 -----------~----~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~ 144 (559) + ++.+-|.+|+..|...|..+ .|.|.+... .+...++..+.+..+|.... T Consensus 53 ~~gd~~~~~~L~~dm~~~D~hi~s~l~~Rk~av~~~-----------~w~I~p~~~--~~~~~~~~a~~v~~~l~~~~-- 117 (512) T protein:vir:19 53 ERGDLTAQADLAFDMEEKDTHLFSELSKRRLAIQAL-----------EWRIAPARD--ASAQEKKDADMLNEYLHDAA-- 117 (512) T ss_pred hCCCHHHHHHHHHHHHhhChHHHHHHHHHHHHHhCC-----------CceEecCCC--CCHHHHHHHHHHHHHHhcCC-- Confidence 1 24678888999888888754 455554321 23444555666777775421 Q ss_pred CCCChhhHHHHHHHHHHHHHHcCCcceEEEECC---CCcEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeee Q lcl|NC_012530. 145 YSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDS---NGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSF 221 (559) Q Consensus 145 ~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~---~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~ 221 (559) .|..++..|+ +.+.+|.++.+|+|.. ...|..|.+++|..+....+..+.. ++.. .......+ T Consensus 118 ------~f~~~~~~ll-dA~~~G~s~~Ei~w~~~~g~~~~~~~~~r~~~~f~~~~~~~~~l-----r~~~--~~~~G~~l 183 (512) T protein:vir:19 118 ------WFEDALFDAG-DAILKGYSMQEIEWGWLGKMRVPVALHHRDPALFCANPDNLNEL-----RLRD--ASYHGLEL 183 (512) T ss_pred ------CHHHHHHHHH-hhhhhcceeeeeEeeeeCCceeeeeeeeeccccceeccCCCcEE-----EecC--CCCCceee Confidence 2556776655 5778999999999853 3357789999998876544322221 1111 11111234 Q ss_pred cccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHH Q lcl|NC_012530. 222 TADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHW 301 (559) Q Consensus 222 ~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~ 301 (559) ++...|++++.+.+ ..+||.+.+..|......-....++...|...-+.|--+-+++. ..++++++++.+.+ T Consensus 184 ~~~k~i~~~~~~~~---g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~-----~a~~~ek~~L~~al 255 (512) T protein:vir:19 184 QPFGWFMHRAKSRT---GYVGTNGLVRTLIWPFIFKNYSVRDFAEFLEIYGLPMRVGKYPT-----GSTNREKATLMQAV 255 (512) T ss_pred cCCceEEEeccCCC---CCcccccHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeeEEecCC-----CCCHHHHHHHHHHH Confidence 55555555554443 35689999999999999999999999999999999977766643 35677888888887 Q ss_pred HHHhcCcccccccccccCC-ceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHH Q lcl|NC_012530. 302 TATSSGINGAYRIPMITAE-DAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQN 380 (559) Q Consensus 302 ~~~~~G~~nag~~~vl~~g-~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~ 380 (559) .+..++ + ..|++.+ .+++...+......|.+..++..++|+.+. ||-. .++..+++.+++..+. T Consensus 256 ~~~~~~---a--~~iiP~~~~ie~~ea~~~~~~~y~~li~~~d~~Isk~i------LGqt----lTs~~g~~Gs~a~~~v 320 (512) T protein:vir:19 256 MDIGRR---A--GGIIPMGMTLDFQSAADGQSDPFMAMIGWAEKAISKAI------LGGT----LTTEAGDKGARSLGEV 320 (512) T ss_pred HHHhhC---c--EEEecCCceEEEeecCCCCHHHHHHHHHHHHHHHHHHH------hhhh----hcccccccchhhHHHH Confidence 775332 2 2455444 344544333233458888999999999873 4432 1222122223343222 Q ss_pred HHHHHHHHHhhHHHHHHHHHHHhhccccc-----cC-----ccceeeecchhhhhHHHHHHHHHHHHcCC-CCHHHHHHH Q lcl|NC_012530. 381 KIDASKSKGLMPLLDMIAKNLTNGIIRQI-----LG-----DNYMLEFVGGDTRSQQDKLKSVQLELQTA-TTVNDYREK 449 (559) Q Consensus 381 ~~~~~~~~~l~P~~~~ie~~ln~~L~~~~-----~~-----~~~~~~f~~l~~~d~~~~~~~~~~~~~~~-~T~NE~R~~ 449 (559) ........+...++.|+..||+.|+.+. .. ..-+|.|......|.+..++.+.....|. ++..++|+. T Consensus 321 -h~ev~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~f~~~e~eDl~~~a~~~~~l~~G~~i~~~~i~e~ 399 (512) T protein:vir:19 321 -HDEVRREIRNADVGQLARSINRDLIYPLLALNSDSTIDINRLPGIVFDTSEAGDITALSDAIPKLAAGMRIPVSWIQEK 399 (512) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCCChhhHHHHHHHHHHHhcCCCCCHHHHHHH Confidence 2334556788899999999999887632 11 12467788788889888888887766554 699999999 Q ss_pred hCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhcccccccccccccc Q lcl|NC_012530. 450 QGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSG 529 (559) Q Consensus 450 ~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 529 (559) +|+|-...++..+.+.... . ..... ...... ....+.++..+... T Consensus 400 ~Gip~~~~~e~~~~~~~~~---~-------------~~~~~--------~~~~~~---~~~~~~~~~~d~~~-------- 444 (512) T protein:vir:19 400 LHIPQPVGDEAVFTIQPVV---P-------------DNGSQ--------KEAALS---AEDIPQEDDIDRMG-------- 444 (512) T ss_pred hCCCCCCCccccccCCCcc---c-------------ccccc--------cccccc---ccCCCchhhHhHHh-------- Confidence 9997544444332110000 0 00000 000000 00000000000000 Q ss_pred ccccccccccccccccchhhhhhccCCCCC Q lcl|NC_012530. 530 KDNQQGVGKDGQLKNKKNTNSYKQGGSSKK 559 (559) Q Consensus 530 ~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 559 (559) .....-.+.-...-..-...... .+-+- T Consensus 445 -~~~~~~~~~~~~~~~~i~~~~~~-~s~ee 472 (512) T protein:vir:19 445 -VSPEDWQRSVDPLLKPVIFSVLK-DGPEA 472 (512) T ss_pred -hhHHHHHHHHHHHHHHHHHHHHh-CCHHH Confidence 00000000000000000000000 00000 No 120 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.66 E-value=1.9e-16 Score=106.75 Aligned_cols=450 Identities=11% Similarity=0.032 Sum_probs=216.6 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCC-CCCccc------- Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPS-PIAFGR------- 72 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~-~~~~~~------- 72 (559) ||++||+-... .-++.+..... -.+|..-..+.. .. ++...|+ .+.... T Consensus 8 ~~~~dr~i~~~-------~~~~~~~~~~~----------~~~y~aa~~~r~--~~----~w~~~~~~~s~~~~i~~~~~~ 64 (505) T protein:vir:96 8 PSLAQRMVNWA-------WYRYVEPQKNA----------ARAFEAARRDRL--GK----AWLRRASRLSADEEIYADLAS 64 (505) T ss_pred cchhhcccchh-------hhhhHHHHHHh----------hhhcccccCCCc--cc----cccCCCCCCChHHHHHHHHHH Confidence 99999993211 11111111100 112221110000 00 0111111 111111 Q ss_pred -HHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecc--ccc-ccChhHHHHHHH-HHHHHHhcCCCCCC Q lcl|NC_012530. 73 -ITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLK--NGD-KPTKEQQKKIDY-AERYIERMGVDYSP 147 (559) Q Consensus 73 -~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~--d~~-~~~~~~~~~~~~-~~~~L~~~~p~~~~ 147 (559) ....++.+.+++++..+|+.+.++|-.- .|+.+... ... ..+++..+++.. ...|.+.++-+ .. T Consensus 65 lr~RaRdL~rNn~~a~~av~~~~~nvVG~----------~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D-~~ 133 (505) T protein:vir:96 65 LVQRAREQSINNPYAKRFYQLLKNNVIGP----------KGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCD-VT 133 (505) T ss_pred HHHHHHHHHhcChHHHHHHHHHHHHhcCC----------CcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcc-ee Confidence 1234556778999999999888887520 12222222 111 122222222222 23333322221 23 Q ss_pred ChhhHHHHHHHHHHHHHHcCCcceEEEECCCC-cEEEEEEecCceEEEEec---Ccccccc--------cc-eEEEEEec Q lcl|NC_012530. 148 IRDDFTSFLRKLVRDTYTYDQVNYENTYDSNG-RLSHTRMVDPTTIYFAND---EHGHRRT--------RG-KIYRQYID 214 (559) Q Consensus 148 ~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G-~~~~L~~l~p~~V~~~~~---~~g~~~~--------~~-~~y~~~~~ 214 (559) .+.+|.++...+++.++..|.+|+.+++...+ .+..|..|+|++|..-.+ .+|.... .- ..|..... T Consensus 134 g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~lqliepd~l~~~~n~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~ 213 (505) T protein:vir:96 134 GRYHFVTLLHLWMETLARDGEVLVREHRGYPNKWGYALQILECDRLDLNYNADLQNGNRIRMSIELDAWERPVAYHLLVN 213 (505) T ss_pred ccCCHHHHHHHHHHHHhhCCceEEEEeecCCCCcceEEEEechhhcCCCCCcccCCcCeEEeceEECCCCceEEEEEeec Confidence 44688899999999999999999988765433 467899999998853211 1111110 11 12221110 Q ss_pred ------------CceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecC Q lcl|NC_012530. 215 ------------NKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKP 282 (559) Q Consensus 215 ------------~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~ 282 (559) ......+++.+|||+...-++ ...-|+|.+..++..+.......+....--.=.+.-.++|+.+. T Consensus 214 hPgd~~~~~~~~~~~~~rvpa~~vlH~f~~~r~---gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~ 290 (505) T protein:vir:96 214 HPGDNSYCYHYAGQTYERVPADEIIHTFVPWRP---HQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDP 290 (505) T ss_pred CCCccccccccccccccccCHhHhhhhhcccCC---ccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCC Confidence 112234778899998643222 23459999988887776665555444444444555667777654 Q ss_pred ccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhCCCHHHh-ccc Q lcl|NC_012530. 283 SPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVAMDPAEI-GMQ 360 (559) Q Consensus 283 ~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fgVPp~~l-g~~ 360 (559) +.......+.. ...... -..|.++.|..| .+++.++.+ ....|.+..+...+.||+.+|||-+.| |.. T Consensus 291 ~~~~~~~~~~~--------~~~~~~-l~pG~i~~L~pG-e~i~~~~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~ 360 (505) T protein:vir:96 291 EAYDQPPEDDQ--------GEIVEE-VEAGTYQLLPYG-IRFKEHKIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDL 360 (505) T ss_pred ccCCCcccccc--------Cccccc-cCCceeeecCCC-CeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccc Confidence 32111111100 000111 124566666554 566665543 356899999999999999999998877 443 Q ss_pred ccccccccccc-chhhhhHHHHHHHHHHHHhhHHHHH-HHHHHHhhcccccc--C-ccceeeecch--hhhhHHHHHHHH Q lcl|NC_012530. 361 NRGGATGNKSN-SLNESNNQNKIDASKSKGLMPLLDM-IAKNLTNGIIRQIL--G-DNYMLEFVGG--DTRSQQDKLKSV 433 (559) Q Consensus 361 ~~~~~~~~~~~-~~~~an~~~~~~~~~~~~l~P~~~~-ie~~ln~~L~~~~~--~-~~~~~~f~~l--~~~d~~~~~~~~ 433 (559) +..+|++...+ .......+.....++...++|+... ++.++....++-.. . ....+.|.+- ...|+...+++. T Consensus 361 s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~ 440 (505) T protein:vir:96 361 EGVNFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAH 440 (505) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeeccCCccccChHHHHHHH Confidence 33322221110 0011122333445566788886664 66666655543211 1 1234555433 345888888888 Q ss_pred HHHHcCC-CCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCcccccc Q lcl|NC_012530. 434 QLELQTA-TTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNS 512 (559) Q Consensus 434 ~~~~~~~-~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (559) ..++.+| .|+-|+-++.|..|-+--++ +.. +.+ ...+.+...... +...... . T Consensus 441 ~~~i~~G~~t~~~~~a~~G~D~~~v~~q------------~a~----e~~-~~~~~Gl~~~~~--~~~~~~~---~---- 494 (505) T protein:vir:96 441 SESIKNRTRSRSSIIRAAGDDPEDVFDE------------IAW----EEQ-LMRDKGVNPTPP--EQESKDA---T---- 494 (505) T ss_pred HHHHHcCCCCHHHHHHHcCCCHHHHHHH------------HHH----HHH-HHHHcCCCCCCC--CCCCCCC---C---- Confidence 8888766 69998888889977432111 100 000 000011100000 0000000 0 Q ss_pred chhccccccccc Q lcl|NC_012530. 513 FQQNQEGYTGKD 524 (559) Q Consensus 513 ~~~~~~~~~~~~ 524 (559) ..+++...+++ T Consensus 495 -~~~~~~~~~d~ 505 (505) T protein:vir:96 495 -TDEEDDSASDD 505 (505) T ss_pred -CCCCCCCCCCC Confidence 00000000000 No 121 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=99.66 E-value=9.3e-15 Score=97.51 Aligned_cols=448 Identities=12% Similarity=0.110 Sum_probs=226.3 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccc-cccccccccccccccccccccCCCCCcccHHH---H Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAY-TEPVDGNLMFSTLEDTSIVPKPSPIAFGRITD---V 76 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~-~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~---~ 76 (559) =.|||++=.+|--... .+.....+.+-.+.+ ..|..+ ..+.....++ +......+.+ + T Consensus 2 ~~~~d~~g~p~~~~~~------------~~~~~~~~~~~~~~~~~~~~~g---ltp~~l~~iL---r~a~~gd~~~~~~L 63 (526) T protein:vir:99 2 AQIVDVYGNPIRTQQL------------REPQTSRLAGLAKEFAQHPAKG---LTPAKLARIL---VEAEQGNLQAQAEL 63 (526) T ss_pred CeeECCCCCccccccc------------cchhhhhhhhhhhhhcccCcCC---CCHHHHHHHH---HhhhCCCHHHHHHH Confidence 2345554222221111 111111111111000 000000 0000000000 0011112222 2 Q ss_pred HHHHh-hChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHH Q lcl|NC_012530. 77 LRQYS-MNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSF 155 (559) Q Consensus 77 ~~~~~-~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f 155 (559) .+... +.+.|.+|+..|...|..++ |.|.+.+.. +...++..+.+..+|.+. ..|..+ T Consensus 64 ~e~m~e~D~~i~s~l~~Rk~av~~~~-----------w~I~p~~~~--~~~~~~~a~~v~~~l~~~--------~~~~~~ 122 (526) T protein:vir:99 64 FMDMEERDAHLFAEMSKRKRAILGLD-----------WAVEPPRNA--SAAEKADADYLHELLLDL--------EGLEDL 122 (526) T ss_pred HHHHHhhChHHHHHHHHHHHHHhCCC-----------ceEecCCCC--CHHHHHHHHHHHHHHhcc--------cCHHHH Confidence 22222 47889999999999888544 555543221 233445556667776542 135567 Q ss_pred HHHHHHHHHHcCCcceEEEECCCC---cEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecc Q lcl|NC_012530. 156 LRKLVRDTYTYDQVNYENTYDSNG---RLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRN 232 (559) Q Consensus 156 ~~~~v~d~ll~Gna~~~i~rd~~G---~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n 232 (559) +..++ +.+.+|.++.+++|...| .|..|.+.++.++.+..+..... ++.. ....-..+++...|.+++. T Consensus 123 i~~~l-da~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~f~~~~~~~~~l-----~~~~--~~~~g~~l~~~k~i~~~~~ 194 (526) T protein:vir:99 123 LLDAL-DGIGHGYSCIELEWALQGREWMPLAFHHRPQSWFQLNPEDQNEL-----RLRD--NSPAGEALQPFGWIIHRPR 194 (526) T ss_pred HHHHH-HhhhhcceeEEEEEeecCCceeEEEeeeecccceeeccCCCcEE-----EecC--CCCCceeecCCCeEEEeec Confidence 76655 577899999999987543 36789999998877544322111 1111 1111123444444444444 Q ss_pred cCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccc Q lcl|NC_012530. 233 PRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAY 312 (559) Q Consensus 233 ~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag 312 (559) +.+ ..+||.+.+..|......-....++...|...-+.|--+.+++. ..++++++++.+.+.+..++ + T Consensus 195 ~~~---g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~-----~a~~~ek~~L~~av~~i~~d---~- 262 (526) T protein:vir:99 195 ARS---GYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPP-----GTADEEKATLLRAVTGLGHA---A- 262 (526) T ss_pred CCc---CCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCC-----CCCHHHHHHHHHHHHHHhhC---c- Confidence 433 35689999999999999999999999999999999987877653 34678888888887765332 2 Q ss_pred ccccccCC-ceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhh Q lcl|NC_012530. 313 RIPMITAE-DAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLM 391 (559) Q Consensus 313 ~~~vl~~g-~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~ 391 (559) ..|++.+ .+++...+...-..|.+..++..++|+.++ ||-.-.+....++++ +++..+.. .......+. T Consensus 263 -~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i------LGqtlTs~~~~g~~g--S~a~g~vh-~~v~~di~~ 332 (526) T protein:vir:99 263 -AGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAV------LGGTLTSTTSQSGGG--AFALGQVH-NEVRHDLLA 332 (526) T ss_pred -EEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHH------hhhhhccccccCcch--hhhHHHHH-HHHHHHHHH Confidence 2455443 345555433222358888899999998875 442211100111112 22222222 223445677 Q ss_pred HHHHHHHHHHHhhccccc-----c-----CccceeeecchhhhhHHHHHHHHHHHHcCC--CCHHHHHHHhCCCCCCCCC Q lcl|NC_012530. 392 PLLDMIAKNLTNGIIRQI-----L-----GDNYMLEFVGGDTRSQQDKLKSVQLELQTA--TTVNDYREKQGLPKIAGGD 459 (559) Q Consensus 392 P~~~~ie~~ln~~L~~~~-----~-----~~~~~~~f~~l~~~d~~~~~~~~~~~~~~~--~T~NE~R~~~gl~pi~gGD 459 (559) -.++.|+..||+.|+... . ....+|.|......|.+++++.++.++..| ++..++|+.+|+|.-..++ T Consensus 333 aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~Gip~~~~~e 412 (526) T protein:vir:99 333 SDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGLEIPSAWVYDKLGIPQPAKNE 412 (526) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCCccCHHHHHHHhCCCCCCCcc Confidence 888999999998886531 1 112467787778889999999998888654 6899999999997655555 Q ss_pred EeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhcccccccccccccccccccccccc Q lcl|NC_012530. 460 IILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKD 539 (559) Q Consensus 460 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 539 (559) .++.+...... ....... .....+. ... +. ...++.. | T Consensus 413 ~~l~~~~~~~~---------------------------~~~~~~~--~~~~~~~--------~~~--~~-~~~~~~~--d 450 (526) T protein:vir:99 413 PVLRSAAQPAI---------------------------LSRQHGQ--RVAALAT--------IVG--PR-YGDQQAL--D 450 (526) T ss_pred cccCCCCCCcc---------------------------ccccccc--ccccccc--------ccc--cc-CcchhhH--H Confidence 44321100000 0000000 0000000 000 00 0000000 0 Q ss_pred ccccccch-------------hhhhhccCCCCC Q lcl|NC_012530. 540 GQLKNKKN-------------TNSYKQGGSSKK 559 (559) Q Consensus 540 ~~~k~~~~-------------~~~~~~~~~~~~ 559 (559) .....+.. ..+.-+.+++-- T Consensus 451 ~~l~~~~~~~~~~~~~~~l~~i~~~l~~~~s~e 483 (526) T protein:vir:99 451 KALADLPAKDMQNQANDLLAPLLEAVNRGDSET 483 (526) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHH Confidence 00000000 011111111111 No 122 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=99.66 E-value=1.4e-14 Score=96.58 Aligned_cols=461 Identities=12% Similarity=0.102 Sum_probs=227.0 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccc-cccccccccccccccccccccCCCCCcccHHH---H Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAY-TEPVDGNLMFSTLEDTSIVPKPSPIAFGRITD---V 76 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~-~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~---~ 76 (559) -.|||++=-++--.. ..+.....+.+-.+.+ ..|..+ ..+.....++ +......+.+ + T Consensus 2 ~~~~d~~g~p~~~~~------------~~~~~~~~~~~~~~~~~~~~~~g---ltp~~l~~il---~~a~~gd~~~~~~L 63 (526) T protein:vir:79 2 AQIVDVYGNPIRPQQ------------LREPQTSRLAGLAKEFAQHPAKG---LTPAKLARIL---VEAEQGNLQAQAEL 63 (526) T ss_pred CeeeCCCCCccCccc------------cchhhhhhhhhhhhhcccCCCCC---cCHHHHHHHH---HHhhCCCHHHHHHH Confidence 245555522221111 1111111122211111 011100 0000000000 0011112222 2 Q ss_pred HHHHh-hChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHH Q lcl|NC_012530. 77 LRQYS-MNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSF 155 (559) Q Consensus 77 ~~~~~-~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f 155 (559) .+... +.+-|.+|+..|...|..++ |.|.+.... +...++..+.+..+|.+. ..|..+ T Consensus 64 ~edm~e~D~~i~s~l~~Rk~av~~~~-----------w~I~p~~~~--~~~~~~~a~~v~~~l~~~--------~~~~~~ 122 (526) T protein:vir:79 64 FMDMEERDAHLFAEMSKRKRAILGLD-----------WAVEPPRNA--SAAEKADADYLHELLLDL--------EGLEDL 122 (526) T ss_pred HHHHHhhChHHHHHHHHHHHHHhCCC-----------ceEecCCCC--ChHHHHHHHHHHHHHhcc--------cCHHHH Confidence 22222 46888999999998887544 555543221 233445556677776542 135567 Q ss_pred HHHHHHHHHHcCCcceEEEECCCC---cEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecc Q lcl|NC_012530. 156 LRKLVRDTYTYDQVNYENTYDSNG---RLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRN 232 (559) Q Consensus 156 ~~~~v~d~ll~Gna~~~i~rd~~G---~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n 232 (559) +..++ +.+.+|.++.+++|...| .|..|.+.++..+....+..... ++.. ....-..+++...|++++. T Consensus 123 i~~~l-dA~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~F~~~~~~~~~l-----~~~~--~~~~g~~l~~~k~iv~~~~ 194 (526) T protein:vir:79 123 LLDAL-DGIGHGYSCIELEWALQGREWMPLAFHHRPQSWFQLNPEDQNEL-----RLRD--NSPAGEALQPFGWIIHRPR 194 (526) T ss_pred HHHHH-hhhhhcceeEEEEEeecCCceeEEEeeeecccceEeccCCCcEE-----EecC--CCCCceeecCCceEEEeec Confidence 76655 467899999999987643 36789999998777543322111 1111 1111223455545544554 Q ss_pred cCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccc Q lcl|NC_012530. 233 PRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAY 312 (559) Q Consensus 233 ~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag 312 (559) +.+ ..+||.+.+..|......-....++...|...-+.|--+.+++. ..++++++++.+.+.+..++ + T Consensus 195 ~~~---g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~yG~P~~igky~~-----~a~~~ek~~L~~av~~i~~d---a- 262 (526) T protein:vir:79 195 ARS---GYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPP-----GTADEEKATLLRAVTGLGHA---A- 262 (526) T ss_pred CCc---CCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCC-----CCCHHHHHHHHHHHHHHhcC---c- Confidence 433 35689999999999999999899999999999899987777653 35677888888877766332 2 Q ss_pred ccccccCC-ceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhh Q lcl|NC_012530. 313 RIPMITAE-DAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLM 391 (559) Q Consensus 313 ~~~vl~~g-~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~ 391 (559) ..|++.+ .+++...+...-..|.+..++..++|+.+. ||-.-.+....++++ +++..+.. .......+. T Consensus 263 -~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i------LGqtlTs~~~~g~~g--S~a~g~vh-~~v~~di~~ 332 (526) T protein:vir:79 263 -AGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAV------LGGTLTSTTSQSGGG--AFALGQVH-NEVRHDILA 332 (526) T ss_pred -EEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHH------hhhhhccccccCcch--hhhhHHHH-HHHHHHHHH Confidence 2455443 345555433222358888899999998864 442211101111122 22322222 223455677 Q ss_pred HHHHHHHHHHHhhccccc-----cC-----ccceeeecchhhhhHHHHHHHHHHHHcCC--CCHHHHHHHhCCCCCCCCC Q lcl|NC_012530. 392 PLLDMIAKNLTNGIIRQI-----LG-----DNYMLEFVGGDTRSQQDKLKSVQLELQTA--TTVNDYREKQGLPKIAGGD 459 (559) Q Consensus 392 P~~~~ie~~ln~~L~~~~-----~~-----~~~~~~f~~l~~~d~~~~~~~~~~~~~~~--~T~NE~R~~~gl~pi~gGD 459 (559) -.++.|+..||+.|+... .. ...+|.|......|.+++++.++.++..| ++..++|+.+|+|....++ T Consensus 333 aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~gip~~~~~e 412 (526) T protein:vir:79 333 SDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGLEIPSAWVYDKLGIPQPAKNE 412 (526) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCCcCCHHHHHHHhCCCCCCCch Confidence 889999999999886532 11 12367777778889999999998888654 5889999999997555454 Q ss_pred EeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhcccccccccccccccccccccccc Q lcl|NC_012530. 460 IILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKD 539 (559) Q Consensus 460 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 539 (559) .++.|...... .. . ....... ..... ..+.. +.+...+.... +... . .... T Consensus 413 ~~l~~~~~~~~--------~~-------~-~~~~~~~--~~~~~-~~~~~--~~~~~~d~~l~--~~~~-~----~~~~- 463 (526) T protein:vir:79 413 PVLRPAAQPAI--------LS-------R-QHGQRVA--ALATI-VGPRY--GDQQALDKALA--DLPA-K----DMQN- 463 (526) T ss_pred hhccccCCccc--------cc-------c-ccccccc--ccccc-ccccC--chhhHHHHHHH--HHHH-H----HHHH- Confidence 43322110000 00 0 0000000 00000 00000 00000000000 0000 0 0000 Q ss_pred ccccccchhhhhhccCCCCC Q lcl|NC_012530. 540 GQLKNKKNTNSYKQGGSSKK 559 (559) Q Consensus 540 ~~~k~~~~~~~~~~~~~~~~ 559 (559) ....-.+...+.-+.+++-- T Consensus 464 ~~~~~~~~i~~~~~~~~s~e 483 (526) T protein:vir:79 464 QANDLLAPLLDAVNRGDSET 483 (526) T ss_pred HHHHHHHHHHHHHHhcCCHH Confidence 00000011111111222111 No 123 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=99.66 E-value=6.3e-15 Score=98.43 Aligned_cols=424 Identities=11% Similarity=0.072 Sum_probs=221.5 Q ss_pred HHHHHHHHHHhhhhccccccccccccccccccccccccccccCCC---CCcccHHHHHHHHhhChHHHHHHHHHHHHHHh Q lcl|NC_012530. 23 IDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSP---IAFGRITDVLRQYSMNVVLNAIINTRANQVTE 99 (559) Q Consensus 23 ~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~---~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~ 99 (559) +.+....+..-....++.. +.+-.+....+.++. .....+ ++.+..+..+-|.+|++.|...|.. T Consensus 1 v~~~~l~~e~at~~~~~d~-----------~~~~~~~l~~~~~~il~~a~~g~~-~~y~~l~~D~~i~s~l~~rk~av~~ 68 (488) T protein:vir:99 1 MEKPALGREIATSGDGRDI-----------TRPFISGLQVPNDSILQRRGGNDL-RVYEEILSDAQVKTVWGQRQLAVVS 68 (488) T ss_pred CCccchhHHHHHHHhhhhh-----------hccccCCCCCCChHHHHhhccCCH-HHHHHHhhChHHHHHHHHHHHHHhc Confidence 2221111111111111110 011011111111111 011112 2333445678999999999999985 Q ss_pred hhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCC- Q lcl|NC_012530. 100 YAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSN- 178 (559) Q Consensus 100 ~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~- 178 (559) + .|+|.+... ++..++..+.+..+|.++ .|..++..++ +.+.+|.++.+++|... T Consensus 69 ~-----------~w~i~p~~~---~~~~~~~ae~v~~~l~~~---------~~~~~l~~~l-da~~~G~s~~Ei~w~~~~ 124 (488) T protein:vir:99 69 R-----------EWKVEAGGD---RPIDQAAAEHLEQQLQRV---------GWDRVTSKML-FGVFYGYAVSELIYGRDD 124 (488) T ss_pred C-----------CceEEcCCC---ChHHHHHHHHHHHHHhCC---------CHHHHHHHHH-hhhhhcceeEEEEEeecC Confidence 4 456654332 334455556666666543 3556777766 56789999999999654 Q ss_pred C--cEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecc-cceEEEecccCCCccCCcccccHHHHHHHHHH Q lcl|NC_012530. 179 G--RLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTA-DEMGMFIRNPRSDILSGGYGLSELEMGLREFI 255 (559) Q Consensus 179 G--~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~-~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~ 255 (559) | .|..|.++|+.++.+. .++.. ++...........++. ...|++++.+.+ ..+||.|.+..|..... T Consensus 125 g~~~~~~l~~r~~~~f~~d--~~~~l-----~~~~~~~~~~g~~lp~~~~~i~~~~~~~~---g~p~g~gLl~~~~w~~~ 194 (488) T protein:vir:99 125 RYITLEAIKVRNRRRFRYD--QDGGL-----RLLTPNNMFEGEPCPAPYFWHFSTGADND---DEPYGLGLAHWLYWPVF 194 (488) T ss_pred CeeeEeeeeeecccceeec--CCCce-----EEeccCCCCCccccccCceEEEEeecCCC---CCcccchHHHHHHHHHH Confidence 3 3678999999877643 23221 1111111111122322 234443444433 35789999999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC-ceeeeeccccchhH Q lcl|NC_012530. 256 SHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE-DAKFVSMTQAEDMQ 334 (559) Q Consensus 256 ~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g-~~~~~~ls~~~D~q 334 (559) .-....++...|...-+.|--+-+++. ...+++.++++.+.+.+..+. + ..|++.+ .+++...+...-.. T Consensus 195 fK~~~~~~w~~f~E~yG~P~~igky~~----~~a~~~ek~~l~~av~~~~~~---~--~~viP~~~~ie~~ea~~~~~~~ 265 (488) T protein:vir:99 195 FKRNGIKFWLIFLDKFGMPTAVGRYDD----KTATPEDKAKLLAALHAIQTD---S--AIIMPAGMQAELLEAGRSGTAD 265 (488) T ss_pred HHHhhHHHHHHHHHHcCCceeeeecCC----CCCCHHHHHHHHHHHHHHhcC---c--EEEecCCceeEEeecCCCChHH Confidence 999999999999999999976666542 135677788887777665332 1 2344443 34555443322235 Q ss_pred HHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccc----- Q lcl|NC_012530. 335 FQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQI----- 409 (559) Q Consensus 335 f~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~----- 409 (559) |.+..++..++|+.+. ||-.- ++. .++.+++..+... ......+...++.|+..||+.|+.+. T Consensus 266 ~~~li~~~d~~Isk~i------LGqtl----ts~-~~~Gs~a~~~vh~-~v~~d~~~aDa~~i~~tln~~li~~l~~~N~ 333 (488) T protein:vir:99 266 YKTLHDTMDATIAKVG------LGQVA----STQ-GTPGRLGNDDLQA-DVRLDLVKADADLICESFNLGPARWLTEWNF 333 (488) T ss_pred HHHHHHHHHHHHHHHH------hhhhh----ccc-ccccchhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCc Confidence 8888999999998873 44322 111 1112233333222 34456788899999999998876532 Q ss_pred cC-ccceeeecchhhhhHHHHHHHHHHHHc-CC--CCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccc Q lcl|NC_012530. 410 LG-DNYMLEFVGGDTRSQQDKLKSVQLELQ-TA--TTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQ 485 (559) Q Consensus 410 ~~-~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~--~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~ 485 (559) .. ....|.|......|.+++++.++..+. +| ++..++|+.+|+|+-..++....+... T Consensus 334 ~~~~~p~~~~~~~e~edl~~~a~~~~~l~~~~G~~i~~~~i~e~~Gip~~~~~~~~~~~~~~------------------ 395 (488) T protein:vir:99 334 PGAQPPRVYRVIEEPEDITAKAERDEKVFRMSGFRPTRGYVQETYGVEVESTQAEATAPTPS------------------ 395 (488) T ss_pred CCcCCceeEecCCCcccHHHHHHHHHHHHhhcCCCCCHHHHHHHcCCCCcccccccccCCCc------------------ Confidence 11 224567777788899999999988876 35 578889999999875544332111000 Q ss_pred cccccccccCCCCCCCCCCCCccccccchhccccccccccccccccccccccccccccccchhhhhhccCCCCC Q lcl|NC_012530. 486 TRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQLKNKKNTNSYKQGGSSKK 559 (559) Q Consensus 486 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 559 (559) ... .....+. ++. ....+.. + . .-+..-..-.+...+.-+...+-- T Consensus 396 -------~~~--~~~~~~~-~~~-----~~~~~~~------~-----~--~~~~~~~~~~~~i~~~l~~a~s~e 441 (488) T protein:vir:99 396 -------TEF--AEGDQPS-DPA-----AAMAPQL------A-----E--AMQPVVGNWTTQLRTLIEQASSLE 441 (488) T ss_pred -------ccC--CCCCCCC-Cch-----HHHHHHH------H-----H--HHHHHHHHHHHHHHHHHHhcCCHH Confidence 000 0000000 000 0000000 0 0 000000001111222222111111 No 124 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.65 E-value=6.7e-16 Score=103.77 Aligned_cols=485 Identities=11% Similarity=0.014 Sum_probs=220.6 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCC-------CcccH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPI-------AFGRI 73 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~-------~~~~~ 73 (559) ||||||+-..|. +....+++. .+ ... .+|..-..+.. ......+... ...-. T Consensus 1 Mn~iDr~i~~~s---P~~a~~R~~----ar---~~~----~~y~aa~~~r~-------~~~~~~~~s~~~~i~~~~~~lr 59 (548) T protein:vir:95 1 MNLIDRLLEPLA---PELVARRLA----AR---EAI----QAYEAARPGRT-------HKAKRQPLGADTSLQKSAVSMR 59 (548) T ss_pred CchHHhHhhhcc---hHHHHHHHH----hH---HHh----ccccccCcccc-------ccccCCCCChHHHHHHHHHHHH Confidence 999999966652 221212111 01 010 12221111100 0000111110 01111 Q ss_pred HHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccc-cChhHHHHHHH-HHHHHHhcCCCCCCChhh Q lcl|NC_012530. 74 TDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDK-PTKEQQKKIDY-AERYIERMGVDYSPIRDD 151 (559) Q Consensus 74 ~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~-~~~~~~~~~~~-~~~~L~~~~p~~~~~~~~ 151 (559) ...++.+.+++++..+|+.+.++|-.- .|.++.-+....+. ...+-.+.++. ...|..++.. ..+.+ T Consensus 60 ~RaRdL~rNn~~a~~av~~~~~nvVG~--------~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~---~g~~~ 128 (548) T protein:vir:95 60 EQCRKLDEDHDLVTGLLDRLEERVVGG--------SGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPET---SGELT 128 (548) T ss_pred HHHHHHHhcChHHHHHHHHHHHhccCc--------cccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccc---cccCC Confidence 234456778999999999988776420 01111111111111 11111112222 2344444332 23568 Q ss_pred HHHHHHHHHHHHHHcCCcceEEEECCC-------CcEEEEEEecCceEEEEecCcccccccc---------eEEEEEec- Q lcl|NC_012530. 152 FTSFLRKLVRDTYTYDQVNYENTYDSN-------GRLSHTRMVDPTTIYFANDEHGHRRTRG---------KIYRQYID- 214 (559) Q Consensus 152 ~~~f~~~~v~d~ll~Gna~~~i~rd~~-------G~~~~L~~l~p~~V~~~~~~~g~~~~~~---------~~y~~~~~- 214 (559) |..+...+++.++..|.+++.+.+... ..+..|..|+|++|..-.+..+.....+ ..|..... T Consensus 129 f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~~~g~~~~~~lqliepd~l~~~~~~~~~~i~~GIE~D~~Grp~aY~i~~~h 208 (548) T protein:vir:95 129 RPQVERLMCRTWLRDGEGLAQKLMGRVPNYTFATSVPFALELLEPDYLPFSYNNLSKGIVQGIERDTWRRKRAYHLLKDH 208 (548) T ss_pred HHHHHHHHHHHHHhCCceEEEeeecccccccCCcccceEEEEechhhcCCCCCCCCCceeeeeEECCCCceEEEEEeecC Confidence 889999999999999999998887532 2367899999998853222222111111 11222111 Q ss_pred ---------CceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccC Q lcl|NC_012530. 215 ---------NKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPS 285 (559) Q Consensus 215 ---------~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~ 285 (559) ......+++.+|||+...-+. ...-|+|.+..++..+......+.....--+=.+...++|+.+.+.. T Consensus 209 Pgd~~~~~~~~~~~rvpA~~VlHif~~~r~---gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~~ 285 (548) T protein:vir:95 209 PGNLQTLGGSLAVKRVEAERIIHIAYRKRI---GQNRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPDS 285 (548) T ss_pred CCcccccccccceeeechhHheecccccCC---ccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCcc Confidence 112345889999998643222 23459999888887776665555444444444455567776543211 Q ss_pred CccCCHHHHHHHHHHHHHHhcCcccccc-cccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhCCCHHHhcccccc Q lcl|NC_012530. 286 VTNTSMRALEDFKRHWTATSSGINGAYR-IPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVAMDPAEIGMQNRG 363 (559) Q Consensus 286 ~~~~~~e~~~~l~~~~~~~~~G~~nag~-~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~ 363 (559) .. .+. ....- ...-.. ..|. ++.|..| .+++.++.+ ....|.+..+...+.||+.+|||-+.|--.-. T Consensus 286 ~~---~~~---~~~~~-~~~~~~-~pG~iv~~L~pG-e~i~~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~s- 355 (548) T protein:vir:95 286 YT---VEP---GKDRK-NRTIPI-APGMVFDDLEPG-EDVGMIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAYD- 355 (548) T ss_pred cc---CCC---Ccccc-cccccc-cCCccccccCCC-ceeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccc- Confidence 10 000 00000 000001 1233 3444443 455554432 34689999999999999999999877732111 Q ss_pred ccccccccchhhhhH-----------HHHHHHHHHHHhhHHHHH-HHHHHHhhcccc--ccC--ccceeeecch--hhhh Q lcl|NC_012530. 364 GATGNKSNSLNESNN-----------QNKIDASKSKGLMPLLDM-IAKNLTNGIIRQ--ILG--DNYMLEFVGG--DTRS 425 (559) Q Consensus 364 ~~~~~~~~~~~~an~-----------~~~~~~~~~~~l~P~~~~-ie~~ln~~L~~~--~~~--~~~~~~f~~l--~~~d 425 (559) .|||++ +.....++...++|+..+ ++.++....++- +.. ..+.+++.+- ...| T Consensus 356 ---------~nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iD 426 (548) T protein:vir:95 356 ---------GTYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWIN 426 (548) T ss_pred ---------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccccC Confidence 133333 223334556677785554 566665544431 111 1244555443 3457 Q ss_pred HHHHHHHHHHHHcCC-CCHHHHHHHhCCCCCCCCCEee------ccceecccccccccccccccccccccccccccCCCC Q lcl|NC_012530. 426 QQDKLKSVQLELQTA-TTVNDYREKQGLPKIAGGDIIL------SAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNP 498 (559) Q Consensus 426 ~~~~~~~~~~~~~~~-~T~NE~R~~~gl~pi~gGD~~~------~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (559) +...+++....+.++ .|.-|+-++.|..|-+--+++. .-..+. +..............+....+.....+.. T Consensus 427 P~Kea~A~~~~i~~Gl~T~~~~~a~~G~D~~ev~~q~a~E~~~~~~~GL~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (548) T protein:vir:95 427 PMHEANAWELLVKAGFADEAEVARARGRDPRELKKSRETEIKANRAAGLV-FSSDAYHQLVKSGMDPVEAVQKVYLGVGK 505 (548) T ss_pred hHHHHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCC-CCCcccccccccccCCCCchhhhcccccc Confidence 777788888888765 5998888888987743111000 000000 00000000000000000000000000000 Q ss_pred C-CCCCCCCccccccchhccccccccccccccccccccccccccccccch Q lcl|NC_012530. 499 S-GTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQLKNKKN 547 (559) Q Consensus 499 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~k~~~~ 547 (559) . ......++.+.....-...++.+-+. ..+-+.|||..+-+. T Consensus 506 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~ 548 (548) T protein:vir:95 506 MLTADEARELVNRYGAGLPVPGPDFPNE-------SNNGGADGQPSNPDP 548 (548) T ss_pred ccccchhHHhhccCCCCCcCCCCCCCcc-------cccCCCCCCCCCCCC Confidence 0 00000000111122222222322211 123355665555554 No 125 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=99.65 E-value=3e-15 Score=100.20 Aligned_cols=416 Identities=12% Similarity=0.035 Sum_probs=216.9 Q ss_pred HHHHHHHHHHhhhhccccc-ccc----cccccccc---ccccccccccccCCC--CCcccHHHHHHHHhhChHHHHHHHH Q lcl|NC_012530. 23 IDSKIANDTASKALNGVDR-AYT----EPVDGNLM---FSTLEDTSIVPKPSP--IAFGRITDVLRQYSMNVVLNAIINT 92 (559) Q Consensus 23 ~~~~~~~~~~~~~~~gr~~-a~~----~~~~~~~~---~~~~~~~~~~~~p~~--~~~~~~~~~~~~~~~~~~v~acv~~ 92 (559) ++|+ .+|.....+. +.. .|.++... ....+.+.+.....+ .++..+ ++.+..+..+-|.+|+.. T Consensus 1 m~k~-----~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~~~-~ly~~m~~D~hi~s~l~~ 74 (448) T protein:vir:79 1 MAKR-----GRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKDGL-LVYHKMLSDGTVKNALNY 74 (448) T ss_pred CCCC-----CCCCccccCcccccccccchhhhhhhhhhcccccccccccchhHhhccccch-HHHHHHhhChHHHHHHHH Confidence 1111 1111111111 111 11111110 111111111111111 111122 334445567899999999 Q ss_pred HHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceE Q lcl|NC_012530. 93 RANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYE 172 (559) Q Consensus 93 ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~ 172 (559) |...|..+ .|.|.+.. .++.+.+..+.+..+|..+... ..+..|.+++..++ +.+.+|.++++ T Consensus 75 Rk~av~~~-----------~w~v~p~~---~~~~~~~~ae~v~~~l~~~~~~--~~~~~f~~~~~~~l-da~~~G~s~~E 137 (448) T protein:vir:79 75 IFGRIRSA-----------KWYVEPAS---TDPEDIAIAAFIHAQLGIDDAS--VGKYPFGRLFAIYE-NAYIYGMAAGE 137 (448) T ss_pred HHHHHhcC-----------CceEecCC---CCHHHHHHHHHHHHHhhhhhhh--hccCCHHHHHHHHH-HhhhhcceeEE Confidence 99988854 45664322 2344455556666666543322 12345667776654 57789999999 Q ss_pred EEEC--CCCc--EEEEEEecCceEE-EEecCcccccccceEEEEEec-------CceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 173 NTYD--SNGR--LSHTRMVDPTTIY-FANDEHGHRRTRGKIYRQYID-------NKVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 173 i~rd--~~G~--~~~L~~l~p~~V~-~~~~~~g~~~~~~~~y~~~~~-------~~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) ++|. .+|. +..|.+.++.++. ...+.++.. .+..... +.....++..-++|+.+ +++ .. T Consensus 138 ivw~~~~~g~~~~~~l~~r~~~~~~~f~~~~d~~l-----~~~~~~~~~~~~~~~~~~~~lP~~~~i~~~~-~~~---g~ 208 (448) T protein:vir:79 138 IVLTLGADGKLILDKIVPIHPFNIDEVLYDEEGGP-----KALKLSGEVKGGSQFVSGLEIPIWKTVVFLH-NDD---GS 208 (448) T ss_pred EEeeecCCCceecccccccCCccccceeeecCCce-----EEeecCCcccccccCCCccccccceEEEEec-Ccc---CC Confidence 9985 3565 4467777776543 222223221 1111110 01111235566777643 333 35 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE 320 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g 320 (559) +||.+.+..|......-....++...|...-+.|--|.+++.+. ..+++.++.+.+...+...|.+ ++ .|++.+ T Consensus 209 p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga---~~~~~~~~~l~~av~~i~~g~~-a~--~iiP~~ 282 (448) T protein:vir:79 209 FTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSV---RQGTKQWEAAKEIVKNFVQKPR-HG--IILPDD 282 (448) T ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCceEEEecCCCC---CcCHHHHHHHHHHHHHHhcCCc-eE--EEecCC Confidence 78999999999999999999999999999999887777776432 2345666777666655444432 32 345444 Q ss_pred ceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_012530. 321 DAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAK 399 (559) Q Consensus 321 ~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~ 399 (559) +++.-+.. ..-..+.+..++..++|+.+. ||-.- ++. .++.+++...........+.+.-.+++|+. T Consensus 283 -~~ie~~ea~~~~~~~~~~i~~~d~~Isk~i------LGqtl----Ts~-~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~ 350 (448) T protein:vir:79 283 -WKFDTVDLKSAMPDAIPYLTYHDAGIARAL------GIDFN----TVQ-LNMGVQAINIGEFVSLTQQTIISLQREFAS 350 (448) T ss_pred -ceEEEEecCCCcccHHHHHHHHHHHHHHHH------hhhhh----ccc-cccchhhhhhhhHHHHHHHHHHHHHHHHHH Confidence 55444432 222346678888888998765 44221 111 111122222222223345667788899999 Q ss_pred HHHhhccccc-----c-Ccc-ceeeecchhhhhHHHHHHHHHHHHcCCC-CHHHHHHHhCCC-CCCCCCEeeccceeccc Q lcl|NC_012530. 400 NLTNGIIRQI-----L-GDN-YMLEFVGGDTRSQQDKLKSVQLELQTAT-TVNDYREKQGLP-KIAGGDIILSAVYIQRL 470 (559) Q Consensus 400 ~ln~~L~~~~-----~-~~~-~~~~f~~l~~~d~~~~~~~~~~~~~~~~-T~NE~R~~~gl~-pi~gGD~~~~~~~~~~l 470 (559) .||+.|+.+. . ... -.|.|......|.++.++.+...+.... .-+-+|+.+|+| |.++ +.+..+. T Consensus 351 tln~~li~~l~~lNfg~~~~~P~~~f~~~e~~Dl~~~a~~~~~l~~~~~~~~~~~~~~~~~p~~~~~-~~~~a~~----- 424 (448) T protein:vir:79 351 AVNLYLIPKLVLPNWPSATRFPRLTFEMEERNDFSAAANLMGMLINAVKDSEDIPTELKALIDALPS-KMRRALG----- 424 (448) T ss_pred HHHHHHHHHHHHhcCCCcCCCcEEEecCCChHHHHHHHHHhhhhhccchhhHHHHHHhhcCCCCCCC-ccccccC----- Confidence 9999887642 1 222 3778887788888888888877664443 334468888887 3333 2111100 Q ss_pred ccccccccccccccccccccccccCCCCCCCCCCCCccccccchhcccc Q lcl|NC_012530. 471 GQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEG 519 (559) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (559) .. ....+....+.++...-+.... T Consensus 425 -~~------------------------~~~~~~~~~~~~~~~~~~~~~~ 448 (448) T protein:vir:79 425 -VV------------------------DEVREAVRQPADSRYLYTRRRR 448 (448) T ss_pred -CC------------------------CcccccccCCccccchhhcccC Confidence 00 0001111122222223333222 No 126 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=99.58 E-value=5e-14 Score=93.49 Aligned_cols=418 Identities=12% Similarity=0.042 Sum_probs=212.5 Q ss_pred HHHHHHHHHHhhhhcccc-ccc----cccccccc--cc-cccccccccccCCC--CCcccHHHHHHHHhhChHHHHHHHH Q lcl|NC_012530. 23 IDSKIANDTASKALNGVD-RAY----TEPVDGNL--MF-STLEDTSIVPKPSP--IAFGRITDVLRQYSMNVVLNAIINT 92 (559) Q Consensus 23 ~~~~~~~~~~~~~~~gr~-~a~----~~~~~~~~--~~-~~~~~~~~~~~p~~--~~~~~~~~~~~~~~~~~~v~acv~~ 92 (559) ++|+ .+|.....+ ++. ..+.++.. .+ ...+.+.+..++.+ .++. ..++.+..+..+-|.+|+.. T Consensus 1 m~kk-----~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~-~~~ly~~m~~D~hi~s~l~~ 74 (448) T protein:vir:77 1 MAKR-----GRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKD-GLLVYHKMLSDGTVKNALNY 74 (448) T ss_pred CCCC-----CCCCcccCCcccccchhhhhhhccchhhhcccccccccccchhHhhcccc-chHHHHHHhhChHHHHHHHH Confidence 2211 111111110 000 01111111 00 01111111111111 1121 22344445667899999999 Q ss_pred HHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceE Q lcl|NC_012530. 93 RANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYE 172 (559) Q Consensus 93 ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~ 172 (559) |...|..+ .|.|.+... ++.+++....+..+|..+.. ...+..|.+++..|+ +.+.+|.++++ T Consensus 75 Rk~av~~~-----------~w~v~p~~~---~~~d~~~ae~v~~~l~~~~~--~~~~~~f~~~i~~~l-da~~~G~s~~E 137 (448) T protein:vir:77 75 IFGRIRSA-----------KWYVEPAST---DPEDIAIAAFIHAQLGIDDA--SVGKYPFGRLFAIYE-NAYIYGMAAGE 137 (448) T ss_pred HHHHHhcC-----------CceEecCCC---CHHHHHHHHHHHHHhhchhh--hhccCCHHHHHHHHH-HhhhhcceeEE Confidence 99988854 455643222 23444555566666654322 122345777887765 78899999999 Q ss_pred EEEC--CCCc--EEEEEEecCceEE-EEecCcccccccceEEEEEecC-------ceeeeecccceEEEecccCCCccCC Q lcl|NC_012530. 173 NTYD--SNGR--LSHTRMVDPTTIY-FANDEHGHRRTRGKIYRQYIDN-------KVRGSFTADEMGMFIRNPRSDILSG 240 (559) Q Consensus 173 i~rd--~~G~--~~~L~~l~p~~V~-~~~~~~g~~~~~~~~y~~~~~~-------~~~~~~~~~evi~~~~n~~~~~~~~ 240 (559) ++|. .+|. +..|.+.++.+++ ...+.++.. .+...... .....++..-++|+.+ ..+ .. T Consensus 138 ivw~~~~dg~~~~~~l~~r~~~~~~~f~~~~~~~l-----~~~~~~~~~~~~~~~~~~~~lP~~~~i~~~~-~~~---g~ 208 (448) T protein:vir:77 138 IVLTLGADGKLILDKIVPIHPFNIDEVLYDEEGGP-----KALKLSGEVKGGSQFVNGLEIPIWKTVVFLH-NDD---GS 208 (448) T ss_pred EEEeecCCCceeeccccccCCCccceeeeecCCce-----EEEecCCcccccccCCCccccccceEEEEec-CCc---CC Confidence 9985 3565 4467777776543 222333321 11111110 1112335566777643 322 35 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE 320 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g 320 (559) +||.+.+..|......-....++...|.+.-+.|--|.+++.+. ..+++.++.+.+...+...|. +++ .|++.+ T Consensus 209 p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga---~~~~~~~~~l~~av~~i~~g~-~a~--~iiP~g 282 (448) T protein:vir:77 209 FTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSV---RQGTKQWEAAKEIVKNFVQKP-RHG--IILPDD 282 (448) T ss_pred cccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeEEecCCCC---CCCHHHHHHHHHHHHHHhcCC-ceE--EEecCC Confidence 78999999999999999999999999999999998777775432 234566677766665544443 232 345444 Q ss_pred ceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_012530. 321 DAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAK 399 (559) Q Consensus 321 ~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~ 399 (559) +++.-+.. ..-..+.+..++.-++|+.+. ||-.- ++. .++...+...........+.+.-.++.|+. T Consensus 283 -~~ie~~ea~~~~~~~~~~i~~~d~~Isk~i------LGqtl----Ts~-~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~ 350 (448) T protein:vir:77 283 -WKFDTVDLKSAMPDAIPYLTYHDAGIARAL------GIDFN----TVQ-LNMGVQAVNIGEFVSLTQQTIISLQREFAS 350 (448) T ss_pred -ceEEEEecCCCccCHHHHHHHHHHHHHHHH------hcccc----ccc-cccchhhhhhhhHHHHHHHHHHHHHHHHHH Confidence 44443332 222346677888888998875 33211 111 111122333322223455667788899999 Q ss_pred HHHhhccccc-----c-Ccc-ceeeecchhhhhHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceeccccc Q lcl|NC_012530. 400 NLTNGIIRQI-----L-GDN-YMLEFVGGDTRSQQDKLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQ 472 (559) Q Consensus 400 ~ln~~L~~~~-----~-~~~-~~~~f~~l~~~d~~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~ 472 (559) .||+.|+.+. . ... -.|.|......|.++.++.+...+ +-+|+.+|+|.-.+++... . T Consensus 351 tln~~Li~~l~~lNfg~~~~~P~~~f~~~e~eDl~~~a~~~~~l~------~~~~~~~~ip~~~~~~~~~-------~-- 415 (448) T protein:vir:77 351 AVNLYLIPKLVLPNWPGATRFPRLTFEMEERNDFSAAANLMGMLI------NAVKDSEDIPTELKALIDA-------L-- 415 (448) T ss_pred HHHHHHHHHHHHhcCCCCCCCCEEEecCCChhhHHHHHHHhHHHH------HHHHHHhcCCccCCcCCCC-------C-- Confidence 9999887632 1 222 377888778888888888776654 5689999997421111000 0 Q ss_pred ccccccccccccccccccccccCCCCCCCCCCCCccccccchhcccc Q lcl|NC_012530. 473 QEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEG 519 (559) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (559) +...... ....+.+.+...+|.+.....+.... T Consensus 416 ------------~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~r~~~ 448 (448) T protein:vir:77 416 ------------PSKMRRA--LGVVDEVREAVRQPADSRYLYTRRRR 448 (448) T ss_pred ------------chhcccc--cCCCCCCCchhhcchhhHHHHhhhcC Confidence 0000000 00000111111111111111111111 No 127 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.57 E-value=2.5e-14 Score=95.18 Aligned_cols=460 Identities=10% Similarity=0.020 Sum_probs=204.8 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCC-------CCcccH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSP-------IAFGRI 73 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~-------~~~~~~ 73 (559) |+.=+-+ +++. +...+..... .....++. + ...++.+.... ....-. T Consensus 1 ~~~~~~~-------~~~~--~~~~~~~~~~--~~~a~~~~-----~----------~~~~w~~~~~s~~~~i~~~~~~lr 54 (530) T protein:vir:38 1 MKIPSLV-------GPDG--KTSLREYAGY--HGGGGGFG-----G----------QLRGWNPPSESADAALLPNYSRGN 54 (530) T ss_pred Cccceee-------cCcc--ccchHHHhhh--hcccCCCC-----C----------cccccccCCCCHHHHHHHHHHHHH Confidence 4332222 0110 0000000000 00000000 0 00001110000 000011 Q ss_pred HHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeeccccc---ccCh-hHHHHHHHHHH----HHHhcCCC- Q lcl|NC_012530. 74 TDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGD---KPTK-EQQKKIDYAER----YIERMGVD- 144 (559) Q Consensus 74 ~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~---~~~~-~~~~~~~~~~~----~L~~~~p~- 144 (559) ...++.+.+++++..||+.+.+.|- |.|+.+..+-.. ..+. ...+-.+.++. |.++++.. T Consensus 55 ~RaRdl~rNn~~a~~av~~~~~nvV-----------G~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~ 123 (530) T protein:vir:38 55 ARADDLVRNNGYAANAVQLHQDHIV-----------GSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGI 123 (530) T ss_pred HHHHHHHhcChHHHHHHHHHHHHhh-----------CCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEE Confidence 2345567789999999999988875 334444332110 0111 11122223333 33332211 Q ss_pred CCCChhhHHHHHHHHHHHHHHcCCcceEEEECCC-C--cEEEEEEecCceEEEEec-Cccccccc---------ceEEEE Q lcl|NC_012530. 145 YSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSN-G--RLSHTRMVDPTTIYFAND-EHGHRRTR---------GKIYRQ 211 (559) Q Consensus 145 ~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~-G--~~~~L~~l~p~~V~~~~~-~~g~~~~~---------~~~y~~ 211 (559) ....+.+|.++.+.+++.++..|.+++.+.+... | .+..|..|+|+.|....+ .+|..... -..|+. T Consensus 124 D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i 203 (530) T protein:vir:38 124 DAERKRTFTMMIREGVAMHAFNGELCVQATWDSDSTRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGVKINDSGAALGYYV 203 (530) T ss_pred eeeccCCHHHHHHHHHHHHhhCCceEEEeeeccCCCCccceEEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEE Confidence 1234568899999999999999999999887643 3 257899999988753211 11211111 112222 Q ss_pred Eec--C-c---------eeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEE Q lcl|NC_012530. 212 YID--N-K---------VRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILL 279 (559) Q Consensus 212 ~~~--~-~---------~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~ 279 (559) +.. . . .....++.+|||+...-++ ...-|+|.+..++..+.......+....--+=.+.-.++|+ T Consensus 204 ~~~~~~~~~~~~~~~~~~~~~v~a~~vlH~f~~~r~---gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~ 280 (530) T protein:vir:38 204 SDDGYPGWMAQNWTYIPRELPGGRPSFIHVFEPMED---GQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIE 280 (530) T ss_pred eeccCCCccccccceeeeeeccChhHeEeeccccCC---CcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeee Confidence 211 0 0 0122556689998643222 33459999988887776665555544444444455566665 Q ss_pred ecCccCCc------cCCHHHHHHHHHHHHH---HhcC---cccccccccccCCceeeeecccc-chhHHHHHHHHHHHHH Q lcl|NC_012530. 280 VKPSPSVT------NTSMRALEDFKRHWTA---TSSG---INGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINII 346 (559) Q Consensus 280 ~~~~~~~~------~~~~e~~~~l~~~~~~---~~~G---~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~I 346 (559) .+.+.... ....+....+.....+ .+.+ .=+.|.++.|..| .+++..+.+ ....|.+..+...+.| T Consensus 281 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pG-e~i~~~~p~~p~~~~~~f~~~~lr~i 359 (530) T protein:vir:38 281 SELDTQSAMDFILGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLLPG-DSLNLQSAQDTDNGYSTFEQSLLRYI 359 (530) T ss_pred ccCCccccccccccCCcccccccccccchhhhhcccccceeccCceeeecCCC-CeeeeeCCCCCCCCHHHHHHHHHHHH Confidence 43321000 0000001111111100 0000 0123555555444 555555433 3468999999999999 Q ss_pred HHHhCCCHHHh-ccccccccccccccch-hhhhHHHHHHHHHHHHhhHHHHH-HHHHHHhhccccccCcc---------- Q lcl|NC_012530. 347 CALVAMDPAEI-GMQNRGGATGNKSNSL-NESNNQNKIDASKSKGLMPLLDM-IAKNLTNGIIRQILGDN---------- 413 (559) Q Consensus 347 a~~fgVPp~~l-g~~~~~~~~~~~~~~~-~~an~~~~~~~~~~~~l~P~~~~-ie~~ln~~L~~~~~~~~---------- 413 (559) |+.+|||-+.| |..+..+|++...+.. .....+.....+...-++|+..+ ++.++....++-..+.. T Consensus 360 aaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~ 439 (530) T protein:vir:38 360 AAGLGVSYEQLSRNYSQMSYSTARASANESWAYFMGRRKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAW 439 (530) T ss_pred HhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhh Confidence 99999998877 5443333222111000 01122333334445555665554 66666665554211111 Q ss_pred ceeeecc--hhhhhHHHHHHHHHHHHcCC-CCHHHHHHHhCCCCCCCCCEeeccceeccccccccccccccccccccccc Q lcl|NC_012530. 414 YMLEFVG--GDTRSQQDKLKSVQLELQTA-TTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQ 490 (559) Q Consensus 414 ~~~~f~~--l~~~d~~~~~~~~~~~~~~~-~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 490 (559) ..+++.+ ....|+...+++....+.++ .|+-++-++.|..|-+--+ ++.. +.... .+.+. T Consensus 440 ~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G~D~~~v~~------------q~a~----e~~~~-~~~Gl 502 (530) T protein:vir:38 440 GNANWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRGDDYQEIFA------------QQVR----ESMER-RAAGL 502 (530) T ss_pred hceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHH------------HHHH----HHHHH-HHcCC Confidence 2234432 33457887788888888766 5999888888987743111 1000 00000 00000 Q ss_pred ccccCCCCCCCCCCCCccccccchhcccccccc Q lcl|NC_012530. 491 LESALQNPSGTPPTLPPSSSNSFQQNQEGYTGK 523 (559) Q Consensus 491 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 523 (559) ... ......+.. ....+..+.+++.++. T Consensus 503 ~~~-~~~~~~~~~----~~~~~~~~~~d~~~~a 530 (530) T protein:vir:38 503 NPP-AWAAAAFEA----GVKKSNEEEQDGARAA 530 (530) T ss_pred CCC-CCcccccCC----CCCCCCCCCCCCCCCC Confidence 000 000000000 0000000000000000 No 128 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.57 E-value=1.5e-14 Score=96.45 Aligned_cols=457 Identities=11% Similarity=0.081 Sum_probs=203.7 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhcccccccccccccccccccc-ccccccccCCCCC---cccH--- Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTL-EDTSIVPKPSPIA---FGRI--- 73 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~-~~~~~~~~p~~~~---~~~~--- 73 (559) |+-..|- +..++..+....... . ..+|. + ....+ ...++.+...... .... T Consensus 2 ~~~~~r~------------~~~~a~~~~~~~~~~---~-~~~y~----g--A~~~~r~~~~w~~~~~s~~~~~~~~~~~l 59 (553) T protein:vir:63 2 TKVTVRK------------LSEVTSGRPEQSASL---G-GGGLE----G--ASRLSRETVSWNPSLRSPDALINPLKRIA 59 (553) T ss_pred cchhhhh------------hcccccccchhhhhh---h-ccccc----c--cccCCCcccccccCCCChHHHHHHHHHHH Confidence 1111111 011111111000000 0 00110 0 00000 0011111111100 0011 Q ss_pred -HHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccc----cChhH-HHHHHHH----HHHHHhcCC Q lcl|NC_012530. 74 -TDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDK----PTKEQ-QKKIDYA----ERYIERMGV 143 (559) Q Consensus 74 -~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~----~~~~~-~~~~~~~----~~~L~~~~p 143 (559) ...++.+.+++++..+|+.+.++|- |.|+....+.... .+.+. .+-.+.+ ..|.++++. T Consensus 60 r~RaRdL~rNn~~a~~av~~~~~nvV-----------G~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~ 128 (553) T protein:vir:63 60 DARGRDMADNDGFTNGAVGYQRDSIV-----------GAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLAC 128 (553) T ss_pred HHHHHHHHhcChHHHHHHHHHHHhhc-----------cCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccc Confidence 1334567789999999998888875 3344444332111 11111 1112223 333333221 Q ss_pred C-CCCChhhHHHHHHHHHHHHHHcCCcceEEEECCC-C--cEEEEEEecCceEEEEecC-cccccccc---------eEE Q lcl|NC_012530. 144 D-YSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSN-G--RLSHTRMVDPTTIYFANDE-HGHRRTRG---------KIY 209 (559) Q Consensus 144 ~-~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~-G--~~~~L~~l~p~~V~~~~~~-~g~~~~~~---------~~y 209 (559) . ......+|..+...+++.++..|.+++.+.+... | .+..|..|+|++|..-.+. +|.....+ ..| T Consensus 129 ~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~~~~~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GVE~d~~Gr~vaY 208 (553) T protein:vir:63 129 YIDNAAISTFTGLIRLGVVGYVKTGEVLATAEWDRAANRPYATCFQMVSTDRLSNPYQQLDTPTLRRGVQYDKRGRPQGY 208 (553) T ss_pred eeeccccCCHHHHHHHHHHHHHhCCceEEEeeeccCCCCcccceEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEE Confidence 1 1234568899999999999999999998876543 2 2568899999988542221 22111111 122 Q ss_pred EEEe-cCc----------------eeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_012530. 210 RQYI-DNK----------------VRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGG 272 (559) Q Consensus 210 ~~~~-~~~----------------~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~ 272 (559) ..+. ..+ ....+++.+|||+...-++ ...-|+|.+..++..+......++....--.=.+ T Consensus 209 ~i~~~hPgd~~~~~~~~~~~~r~~~~~~v~a~~vlH~f~~~r~---gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A 285 (553) T protein:vir:63 209 WIQVAHPGDLYQMAPDMYKWKFVQQSKPWGRRQVIHILEPREP---DQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINA 285 (553) T ss_pred EeeccCCCccccccccccceeeeccccccChhHheecccccCC---CcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhh Confidence 2111 000 1124678899998643222 2345999998888777665555544444444445 Q ss_pred CCceEEEecCccCCccCCHHHHHHHH----------------HHHHHHhcCc----ccccccccccCCceeeeecccc-c Q lcl|NC_012530. 273 TTKGILLVKPSPSVTNTSMRALEDFK----------------RHWTATSSGI----NGAYRIPMITAEDAKFVSMTQA-E 331 (559) Q Consensus 273 ~p~gil~~~~~~~~~~~~~e~~~~l~----------------~~~~~~~~G~----~nag~~~vl~~g~~~~~~ls~~-~ 331 (559) .-.++|+.+.+ . ....+.+. +.....+.|. =+.|.++.|..| .+++.++.+ . T Consensus 286 ~~a~fi~~~~~---~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pG-e~i~~~~p~~p 358 (553) T protein:vir:63 286 SYAAAIESELP---P---EFIHSQMSGGSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPHLFPG-TKLNLKPMGTP 358 (553) T ss_pred hheeeeecCCC---h---hhhhhhcccccccccccccccccccccccccccccceeecCceeeecCCC-CeeeecCCCCC Confidence 55667764321 0 11111110 0000111111 123556555444 455554433 3 Q ss_pred hhHHHHHHHHHHHHHHHHhCCCHHHh-ccccccccccccccc-hhhhhHHHHHHHHHHHHhhHHHHH-HHHHHHhhccc- Q lcl|NC_012530. 332 DMQFQSWLNYLINIICALVAMDPAEI-GMQNRGGATGNKSNS-LNESNNQNKIDASKSKGLMPLLDM-IAKNLTNGIIR- 407 (559) Q Consensus 332 D~qf~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~~~~~~~~-~~~an~~~~~~~~~~~~l~P~~~~-ie~~ln~~L~~- 407 (559) ...|.+..+...+.||+.+|||-+.| |..+..+|++...+. ......+.....|....++|+..+ ++.++-...++ T Consensus 359 ~~~~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~ 438 (553) T protein:vir:63 359 GGVGSEFEASLNRHLASAFGMSYEEFTRDFSKANYSSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPM 438 (553) T ss_pred CCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccC Confidence 46899999999999999999998876 443333222111100 001112333345666677786555 55555544332 Q ss_pred -cccCc-----------cceeeecch--hhhhHHHHHHHHHHHHcCC-CCHHHHHHHhCCCCCCCCCEeeccceeccccc Q lcl|NC_012530. 408 -QILGD-----------NYMLEFVGG--DTRSQQDKLKSVQLELQTA-TTVNDYREKQGLPKIAGGDIILSAVYIQRLGQ 472 (559) Q Consensus 408 -~~~~~-----------~~~~~f~~l--~~~d~~~~~~~~~~~~~~~-~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~ 472 (559) ..... ...+++.+- ...|+...+++....+.+| .|+-|+-++.|..|-+--++ T Consensus 439 p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~t~~~~~a~~G~D~~~v~~q------------ 506 (553) T protein:vir:63 439 PPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQAAVMRIDAGLSTYEREIARLGGDFRKSFAQ------------ 506 (553) T ss_pred CCcccchhhcchhhhhhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHH------------ Confidence 11111 112344432 3457777788888888766 69998888889877431111 Q ss_pred ccccccccccccccccccc-----cccCCC--CCCCCCCCCccccccchhcccccccccccc Q lcl|NC_012530. 473 QEQIKQNEFQRQQTRLTQL-----ESALQN--PSGTPPTLPPSSSNSFQQNQEGYTGKDAKP 527 (559) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~-----~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (559) +.. +.+ ...+.+.+ ....+. ..+.++..++. ....++++ | T Consensus 507 ~a~----e~~-~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~-------e 553 (553) T protein:vir:63 507 RAR----EDA-LLKKYGLTFNLSAKRSLGDGRDAATGIAEDPA---AAQTSQQG-------E 553 (553) T ss_pred HHH----HHH-HHHHcCCCCCCCCccccCCCcccCCCCCCCCC---CCCccccc-------C Confidence 000 000 00000000 000000 00000000000 00000000 0 No 129 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.54 E-value=5.4e-14 Score=93.32 Aligned_cols=470 Identities=9% Similarity=-0.006 Sum_probs=206.9 Q ss_pred cccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHH Q lcl|NC_012530. 11 FYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAII 90 (559) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv 90 (559) +.+-.+.. +.-++.-...+....-..|- +...+.. ..|.+... +....-......-....++.+.+++++..|| T Consensus 1 ~~~p~~~~-~~~~~~~~~~~~~~~y~~~a--~~~~~~~--~~w~p~~~-s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av 74 (533) T protein:vir:34 1 MKTPTIPT-LLGPDGMTSLREYAGYHGGG--SGFGGQL--RSWNPPSE-SVDAALLPNFTRGNARADDLVRNNGYAANAI 74 (533) T ss_pred CCCchhhh-hhcccccchHHHHHhhhhcc--CCCCCcc--cccccCCC-CHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Confidence 22221111 11111100000000000000 0000000 00000000 0000000000111123445667899999999 Q ss_pred HHHHHHHHhhhhHhhhhcCCcceeeeccccc---ccChh-HHHHHHHH----HHHHHhcCCC-CCCChhhHHHHHHHHHH Q lcl|NC_012530. 91 NTRANQVTEYAHRASTDDNGMGYQVRLKNGD---KPTKE-QQKKIDYA----ERYIERMGVD-YSPIRDDFTSFLRKLVR 161 (559) Q Consensus 91 ~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~---~~~~~-~~~~~~~~----~~~L~~~~p~-~~~~~~~~~~f~~~~v~ 161 (559) +.+.+.|- |.|+.+..+-.. ..+.+ ..+-.+.+ ..|.+.++-. ....+.+|.++...+++ T Consensus 75 ~~~~~nvV-----------G~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r 143 (533) T protein:vir:34 75 QLHQDHIV-----------GSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVA 143 (533) T ss_pred HHHHHHhh-----------CCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHH Confidence 99988874 334444433110 01111 11112222 3344443221 12344688999999999 Q ss_pred HHHHcCCcceEEEECCC-C--cEEEEEEecCceEEEEec-Ccccccccc---------eEEEEEec--Cc---------- Q lcl|NC_012530. 162 DTYTYDQVNYENTYDSN-G--RLSHTRMVDPTTIYFAND-EHGHRRTRG---------KIYRQYID--NK---------- 216 (559) Q Consensus 162 d~ll~Gna~~~i~rd~~-G--~~~~L~~l~p~~V~~~~~-~~g~~~~~~---------~~y~~~~~--~~---------- 216 (559) .++..|.+|+.+.+... | .+..|..|+|+.|..-.+ .+|.....+ ..|..+.. ++ T Consensus 144 ~~~~dGE~f~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~ 223 (533) T protein:vir:34 144 MHAFNGELFVQATWDTSSSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIP 223 (533) T ss_pred HHHhCCceEEEeeeccCCCCccceEEEEechhhcCCCCCCCCCCceEeeeEECCCCCeEEEEEeecCCCCccccccceee Confidence 99999999999887643 2 257899999988753221 111111111 12222211 11 Q ss_pred eeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCC------ccCC Q lcl|NC_012530. 217 VRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSV------TNTS 290 (559) Q Consensus 217 ~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~------~~~~ 290 (559) .....++.+|||+...-++ ...-|+|.+..++..+.......+....--+=.+.-.++|+.+.+... +... T Consensus 224 ~~~~v~a~~VlH~f~~~r~---gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~ 300 (533) T protein:vir:34 224 RELPGGRASFIHVFEPVED---GQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGANS 300 (533) T ss_pred eeeccChhHeeeeccccCC---CcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeecCCCcccccccccCCCc Confidence 0122457789998643222 234599999888877766655555444444445555667764422100 0000 Q ss_pred HHHHHHHHH---HHHHHhcCc---ccccccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhCCCHHHh-ccccc Q lcl|NC_012530. 291 MRALEDFKR---HWTATSSGI---NGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVAMDPAEI-GMQNR 362 (559) Q Consensus 291 ~e~~~~l~~---~~~~~~~G~---~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fgVPp~~l-g~~~~ 362 (559) .+..+.+.. .-...+.+. =+.|.++.|..| .+++.++.+ ....|.+..+...+.||+.+|||-+.| |..+. T Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pG-e~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~ 379 (533) T protein:vir:34 301 QEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPG-DSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQ 379 (533) T ss_pred ccccccccccchhhhhccCcceeeccCceeeecCCC-CeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhccc Confidence 111111111 111111111 123556555444 555555433 346899999999999999999998877 43333 Q ss_pred cccccccccc-hhhhhHHHHHHHHHHHHhhHHHHH-HHHHHHhhccccccCc----------cceeeecc--hhhhhHHH Q lcl|NC_012530. 363 GGATGNKSNS-LNESNNQNKIDASKSKGLMPLLDM-IAKNLTNGIIRQILGD----------NYMLEFVG--GDTRSQQD 428 (559) Q Consensus 363 ~~~~~~~~~~-~~~an~~~~~~~~~~~~l~P~~~~-ie~~ln~~L~~~~~~~----------~~~~~f~~--l~~~d~~~ 428 (559) .+|++...+. .....++.....+....++|+..+ ++.++....++-..+. ...++|.+ ....|+.. T Consensus 380 ~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~K 459 (533) T protein:vir:34 380 MSYSTARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLK 459 (533) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccChHH Confidence 3222211100 001112333444566667777665 5656655444311110 12344443 34457777 Q ss_pred HHHHHHHHHcCC-CCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCc Q lcl|NC_012530. 429 KLKSVQLELQTA-TTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPP 507 (559) Q Consensus 429 ~~~~~~~~~~~~-~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (559) .+++....+.++ +|+-|+-++.|..|-+--++ +.. +.. ...+.+......+........ T Consensus 460 e~~a~~~~i~~G~~s~~~~~a~~G~D~~ev~~q------------~a~----e~~-~~~~~gl~~~~~~~~~~~s~~--- 519 (533) T protein:vir:34 460 EVQEAVMLIEAGLSTYEKECAKRGDDYQEIFAQ------------QVR----ETM-ERRAAGLKPPAWAAAAFESGL--- 519 (533) T ss_pred HHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHH------------HHH----HHH-HHHhcCCCCCCCCCcCccCCC--- Confidence 788888888766 69999888889977432111 000 000 000000000000000000000 Q ss_pred cccccchhccccccccccccccc Q lcl|NC_012530. 508 SSSNSFQQNQEGYTGKDAKPSGK 530 (559) Q Consensus 508 ~~~~~~~~~~~~~~~~~~~~~g~ 530 (559) ..+ ........+++ T Consensus 520 -~~~--------~~~~~~~~~~~ 533 (533) T protein:vir:34 520 -RQS--------TEEEKSDSRAA 533 (533) T ss_pred -CCC--------CCCCcccCCCC Confidence 000 00000000011 No 130 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=99.54 E-value=1.3e-13 Score=91.28 Aligned_cols=386 Identities=9% Similarity=0.064 Sum_probs=202.7 Q ss_pred HHHHHHHHHHhhhhccccccccccccccccccccccccccccCCC-C--CcccHH---HHHHHHh-hChHHHHHHHHHHH Q lcl|NC_012530. 23 IDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSP-I--AFGRIT---DVLRQYS-MNVVLNAIINTRAN 95 (559) Q Consensus 23 ~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~-~--~~~~~~---~~~~~~~-~~~~v~acv~~ia~ 95 (559) +.+.+.+. -.....|+-++. .. ..+...++. .|-+ . ...+.. ++.+-++ ..+.|.+|+..|.. T Consensus 1 ~~~~~~~~--p~~~~~~~~~~~------~~-~~~~~~g~~-~~D~~lr~~gg~~~~~~~l~~~m~e~D~~v~s~l~~Rk~ 70 (446) T protein:vir:98 1 MNMEVRNA--PTPAIRRRTIYA------ME-HLGLATSYL-SEDGGYKRAGKPTYQQLSAWDEAAQTEPIIAQGLDSIAL 70 (446) T ss_pred CcccccCC--Cchhhhhhhhhc------cc-cchhhcccC-CcchHhhhcCCChHHHHHHHHHHHhcchHHHHHHHHHHH Confidence 22222221 111111222211 10 011111111 1111 0 011121 2222333 47899999999999 Q ss_pred HHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEE Q lcl|NC_012530. 96 QVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTY 175 (559) Q Consensus 96 ~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~r 175 (559) .|.++. |.|.+. .++..+.+..+|.... ++++...+.|.+.+|.++.|++| T Consensus 71 av~~~~-----------w~V~p~--------~~~~a~~v~~~l~~~~----------~~~~~~~~ldai~~G~s~~Eivw 121 (446) T protein:vir:98 71 SVLNKV-----------GPYQHG--------DKRIKKFIDDQLRNRA----------KTWISHCVKSIMTYGFSLSEQIY 121 (446) T ss_pred HhhcCC-----------ceecCc--------cHHHHHHHHHHHhhcC----------chhHHHHHHHHHhhCceeeeEEE Confidence 888554 555432 1234455667765531 23445557799999999999998 Q ss_pred CCC-C--cEE----EEEEecCceEEEEecCcccccccc---------eEEE------------EEecCceeeeecccceE Q lcl|NC_012530. 176 DSN-G--RLS----HTRMVDPTTIYFANDEHGHRRTRG---------KIYR------------QYIDNKVRGSFTADEMG 227 (559) Q Consensus 176 d~~-G--~~~----~L~~l~p~~V~~~~~~~g~~~~~~---------~~y~------------~~~~~~~~~~~~~~evi 227 (559) ... | .|. .+....|..++...+.++...... ..+. .....+....++....+ T Consensus 122 ~~~~g~~~p~~~~d~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~iP~~kfi 201 (446) T protein:vir:98 122 AHGARDNMPATVLDDIVNYHPLQVMLIANDNGRIVDGDTVTASQYKSGYWVPLPPYRIGDPPKKVDVVGSHVRLPSHKRL 201 (446) T ss_pred eecccccccchhhccccccccccceeeeccCCccccccccchhhcccccccCcccchhhhhhhhcccCcccccccccceE Confidence 632 2 111 122223333333333222211100 0000 00011112235667778 Q ss_pred EEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCC-HH---HHHHHHHHHHH Q lcl|NC_012530. 228 MFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTS-MR---ALEDFKRHWTA 303 (559) Q Consensus 228 ~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~-~e---~~~~l~~~~~~ 303 (559) ++++.+.++ .+||.|.+..|...........++...|...-+.|--+.+++.+.+..+.+ ++ +-+...+.+.. T Consensus 202 ~~~~~~~~~---~p~G~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~ 278 (446) T protein:vir:98 202 FINYNTKGN---NPWGTSCLTSVLDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAED 278 (446) T ss_pred EEEecCCCC---CccccchHHHHHHHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHH Confidence 888766543 478999999999999999999999999999999998888887654433221 11 11222233333 Q ss_pred HhcCc-cccc-ccccc-cCCceeeeeccc--cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhH Q lcl|NC_012530. 304 TSSGI-NGAY-RIPMI-TAEDAKFVSMTQ--AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNN 378 (559) Q Consensus 304 ~~~G~-~nag-~~~vl-~~g~~~~~~ls~--~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~ 378 (559) ++... .+++ -+|.+ ...++++.-++. ..-..|.+..++..++|+++.....--+|-.. +.++|..-++ T Consensus 279 av~~~~~da~~ii~~~~~P~g~eie~~ea~~~~~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~------~~~GS~ala~- 351 (446) T protein:vir:98 279 ALRRLSTDSGLVLTQLSKEQPVQVGALTTGNNFSDSFERAISLCDNNMLMGMGIPNLLVQNRE------TTFGTGRASE- 351 (446) T ss_pred HHHhccccceeeeecccCCCCceEEeeccccCChhhHHHHHHHHHHHHHHHHhcccccccccc------cccchhhhHH- Confidence 33221 1222 22111 133466655542 22234888899999999998755433332111 1122222222 Q ss_pred HHHHHHHHHHHhhHHHHHHHHHHHhhcccccc-----Cc-------cceeeecchhhhhHHHHHHHHHHHHcCCC-CH-- Q lcl|NC_012530. 379 QNKIDASKSKGLMPLLDMIAKNLTNGIIRQIL-----GD-------NYMLEFVGGDTRSQQDKLKSVQLELQTAT-TV-- 443 (559) Q Consensus 379 ~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~-----~~-------~~~~~f~~l~~~d~~~~~~~~~~~~~~~~-T~-- 443 (559) ... ....+-++..+++|++.||+.|+.+.- .. .-.++|.....+|.+..++.++.++..|+ ++ T Consensus 352 -vh~-~V~~d~~~aDa~~i~~tln~~Li~~l~~lNf~~~~~~~~~~~~~~~~~~~e~eDl~~~a~~~~~L~~~G~~~p~~ 429 (446) T protein:vir:98 352 -IQL-ELFDGKINSIFDTVIHAFTEQVIGNLIRLNFDPALYPLASNTGYITRLPGRATDLAALVEAIKQMHDMGFLVDGD 429 (446) T ss_pred -HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccceeccCChhhHHHHHHHHHHHHhCCcccccc Confidence 222 234456778999999999998865321 10 01234555567888899999988887553 43 Q ss_pred -HHHHHHhCCCCCCCCCE Q lcl|NC_012530. 444 -NDYREKQGLPKIAGGDI 460 (559) Q Consensus 444 -NE~R~~~gl~pi~gGD~ 460 (559) +.+|+.+|+|+-.. |+ T Consensus 430 ~~~ire~~giP~~~~-~~ 446 (446) T protein:vir:98 430 KDHIRSITGLPDAIS-ST 446 (446) T ss_pred HHHHHHHhCcCCCCC-CC Confidence 45999999976421 22 No 131 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=99.51 E-value=2.2e-12 Score=84.48 Aligned_cols=435 Identities=12% Similarity=0.101 Sum_probs=216.6 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCC--CCcccHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSP--IAFGRITDVLR 78 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~--~~~~~~~~~~~ 78 (559) =+||+-- ++++.- .++.+....+.+. +.....-|. .+...+++.+ ..+....++.+ T Consensus 3 ~~i~~~~-g~p~~~------~~~~~~~~~~ia~-----~~~~~~~~~----------~~~~~~~~~~iLr~~~~~~~~y~ 60 (491) T protein:vir:10 3 KGLWVSP-TEFVTF------GEPDKSLSSQIAT-----RARSIDFFA----------LGMYLPNPDPVLKALGKDIRVYR 60 (491) T ss_pred CceeCCC-CCccCc------ccCChHHHHHHHh-----hhccccccc----------ccCCccchHHHHHhcCCCHHHHH Confidence 2344433 112211 1111111111111 111110110 0111111100 00001112333 Q ss_pred HHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHH Q lcl|NC_012530. 79 QYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRK 158 (559) Q Consensus 79 ~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~ 158 (559) ..+..+.|.+|+..|...|... .|.|...+.+ .+..+.+..+|.++ .|..++.. T Consensus 61 ~m~~D~~i~s~l~~Rk~av~~~-----------~w~i~~~~~~------~~~~e~v~e~l~~~---------~~~~~l~~ 114 (491) T protein:vir:10 61 ELRADAHVGGCVRRRKAAVKAL-----------EWGLDRGKAK------SRVAKSIADVFADL---------DLSRIVTE 114 (491) T ss_pred HHhhChHHHHHHHHHHHHHhCC-----------CcEEecCCCC------HHHHHHHHHHHhcC---------CHHHHHHH Confidence 4457889999999999988754 4556443221 12334566666543 35567777 Q ss_pred HHHHHHHcCCcceEEEECCCC---cEEEEEEecCceEEEEecCcccccccceEEEEEecC-ceeeeecccceEEEecccC Q lcl|NC_012530. 159 LVRDTYTYDQVNYENTYDSNG---RLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDN-KVRGSFTADEMGMFIRNPR 234 (559) Q Consensus 159 ~v~d~ll~Gna~~~i~rd~~G---~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~-~~~~~~~~~evi~~~~n~~ 234 (559) ++ +.+.+|.++.+++|...| .|..|.++|+.++.+.. ++.. ++.. .++ .....+++...|++++.+. T Consensus 115 ~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~d~--~~~l-----~~~~-~~~~~~g~~l~~~k~i~~~~~~~ 185 (491) T protein:vir:10 115 ML-DAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFVYDP--ENQL-----RFRS-KDHWMQGEELPARKFLVPRQEAT 185 (491) T ss_pred HH-HhhhhcceeEEEEEeecCCeeEEEEeeeecccceeecc--CCce-----EEec-CCCCCCcceecCCCEEEEEecCC Confidence 66 678899999999997543 36689999998876532 2322 2221 122 1223455666666665544 Q ss_pred CCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccc Q lcl|NC_012530. 235 SDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRI 314 (559) Q Consensus 235 ~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~ 314 (559) +. .+||.+.+..|......-....++...|...-+.|--+.+++. ..++++++++.+.+.+..+. + . T Consensus 186 ~~---~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~-----~a~~~ek~~l~~al~~~~~~---a--~ 252 (491) T protein:vir:10 186 YL---NPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPR-----SASDGEKNLLLDCLEDMVQD---A--V 252 (491) T ss_pred CC---CcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEecCC-----CCCHHHHHHHHHHHHHHhcC---c--E Confidence 33 4789999999999999999999999999999999987777653 35677888888877775332 1 2 Q ss_pred ccccCC-ceeeeeccc-cchh-HHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhh Q lcl|NC_012530. 315 PMITAE-DAKFVSMTQ-AEDM-QFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLM 391 (559) Q Consensus 315 ~vl~~g-~~~~~~ls~-~~D~-qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~ 391 (559) .|++.+ .+++...+. .... -|.+..++..++|+.+. ||-. .++..+++ ++..+.. .......+. T Consensus 253 ~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~i------LGqt----lTt~~~gs--~a~~~vh-~~v~~di~~ 319 (491) T protein:vir:10 253 AVVPDDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIAL------LGQN----QTTEATST--RASAQAG-LEVTDDIRD 319 (491) T ss_pred EEecCCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHH------hhhh----cccCcccc--hhHHHHH-HHHHHHHHH Confidence 355443 344544332 2223 37888889989998763 4532 22222232 2322222 223455667 Q ss_pred HHHHHHHHHHHhhccccc------cCccceeeecchhhhhHHHHHHHHHHHHcCC--CCHHHHHHHhCCCCCCCCCEeec Q lcl|NC_012530. 392 PLLDMIAKNLTNGIIRQI------LGDNYMLEFVGGDTRSQQDKLKSVQLELQTA--TTVNDYREKQGLPKIAGGDIILS 463 (559) Q Consensus 392 P~~~~ie~~ln~~L~~~~------~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~~--~T~NE~R~~~gl~pi~gGD~~~~ 463 (559) -.++.|+..||+ |+.+. .....+|.|.... .+.+.+++.++..+..| ++..++|+.+|+|+-+.++.+. T Consensus 320 ~D~~~i~~tln~-li~~l~~~N~~~~~~p~f~~~~~~-e~~~~~a~~~~~L~~~G~~i~~~~i~e~~Gip~~~~~~~~~- 396 (491) T protein:vir:10 320 GDKAVVSEAMNM-LIRWICDLNFDGADRPVFDMWEQE-QVDEIQAGRDQKLTQAGARFTPAYFKRAYNLQDGDLDERPL- 396 (491) T ss_pred HHHHHHHHHHHH-HHHHHHHhcCCCCCcceEEecCcC-chhHHHHHHHHHHHhCCCcCCHHHHHHHhCCCCCCcCcccc- Confidence 788888888885 55421 1234567765432 33366788887777654 5889999999998654443221 Q ss_pred cceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhcccccccccccccccccccccccccccc Q lcl|NC_012530. 464 AVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQLK 543 (559) Q Consensus 464 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~k 543 (559) + .... . .... ....+....+.. .. .............+ .-.. T Consensus 397 ~-------~~~~--~----------~~~~-----~~~~~~~~~~~~-~~-d~~~~~~~~~~~~~------------~~~~ 438 (491) T protein:vir:10 397 P-------VSAV--D----------TVGA-----ASFAEFEAPDQD-AL-DAALNTLSARDLNA------------DAQA 438 (491) T ss_pred c-------cCCC--C----------Cccc-----ccccccCCCCCC-ch-HHHHHHHHHHHHHH------------HHHH Confidence 0 0000 0 0000 000000000000 00 00000000000000 0000 Q ss_pred ccchhhhhhccCCCCC Q lcl|NC_012530. 544 NKKNTNSYKQGGSSKK 559 (559) Q Consensus 544 ~~~~~~~~~~~~~~~~ 559 (559) -.....+.-+...+-- T Consensus 439 ~~~~i~~~l~~~~s~~ 454 (491) T protein:vir:10 439 LVAPLLKRIANGASAD 454 (491) T ss_pred HHHHHHHHHHhcCCHH Confidence 0011111111111111 No 132 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.49 E-value=9.1e-14 Score=92.07 Aligned_cols=437 Identities=9% Similarity=0.023 Sum_probs=200.1 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCC----CCcccHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSP----IAFGRITDV 76 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~----~~~~~~~~~ 76 (559) ||++++= ++- +... ....... .+|..-..+.... .+.. ..|.. ....-.... T Consensus 1 m~~~~~~---~~a--~~~~---~~~~~~~-----------~~y~aa~~~~~~~---~~~~--~s~d~~~~~~~~~lr~Ra 56 (495) T protein:vir:10 1 MNMTPSG---YQS--LASG---LLVPVGA-----------SAYEGASGGHRWQ---DIGD--YGPDTAVASGIQTLRARS 56 (495) T ss_pred CCccccc---ccc--cchh---hhhHHHh-----------hhhhccccCcccC---CCCC--CChhHHHHHHHHHHHHHH Confidence 9999983 221 1111 1111100 0111000000000 0000 00110 001111234 Q ss_pred HHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHH-HHHHHHHHhcCCCCCCChhhHHHH Q lcl|NC_012530. 77 LRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKI-DYAERYIERMGVDYSPIRDDFTSF 155 (559) Q Consensus 77 ~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~-~~~~~~L~~~~p~~~~~~~~~~~f 155 (559) ++.+.+++++..||+.+.+.|- |.|+....+..+ ++..+++ .....|..++.. ..+.+|..+ T Consensus 57 Rdl~rNn~~a~~av~~~~~~vV-----------G~Gi~p~~~~~~---~~~~~~ie~~w~~wa~~~D~---~g~~~f~~l 119 (495) T protein:vir:10 57 HHNVRNNPWATNAVATWVAAAV-----------GNGLTPRWRMKE---QELRQELQELWGDWVNEADF---DEVQSFYGL 119 (495) T ss_pred HHHHhcChHHHHHHHHHHHhhc-----------CCCcccccCCch---HHHHHHHHHHHHHhhcCccc---ccccCHHHH Confidence 5567789999999998888874 233433333222 1112222 223444444422 345688899 Q ss_pred HHHHHHHHHHcCCcceEEEECC--CC--cEEEEEEecCceEEE-Eec---Cccccccc---------ceEEEEEe-cC-- Q lcl|NC_012530. 156 LRKLVRDTYTYDQVNYENTYDS--NG--RLSHTRMVDPTTIYF-AND---EHGHRRTR---------GKIYRQYI-DN-- 215 (559) Q Consensus 156 ~~~~v~d~ll~Gna~~~i~rd~--~G--~~~~L~~l~p~~V~~-~~~---~~g~~~~~---------~~~y~~~~-~~-- 215 (559) ...+++.++..|.+|+.+.+.. .| .+..|..|+|++|.. ... .+|..... ...|.... .. T Consensus 120 q~l~~r~~~~dGE~f~~~~~~~~~~g~~~~~~lqliepd~l~~~~~~~~~~~g~~i~~GIe~d~~Gr~vaY~i~~~hpgd 199 (495) T protein:vir:10 120 QALVVRTVINSGEAFVIKKPRPLSEGLSVPLQLQIIEPDMLASDIPDETLPSGGYVKGGIRFSNGGKRKAYCFYRNHPAE 199 (495) T ss_pred HHHHHHHHHhCCceEEEEeecccCCCCccceEEEEechhhcCCCCCCCCCCCCCEEEeceEECCCCceEEEEEeecCCCc Confidence 9999999999999999877643 33 468999999999852 111 12211111 11222211 11 Q ss_pred -------ceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCC-c Q lcl|NC_012530. 216 -------KVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSV-T 287 (559) Q Consensus 216 -------~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~-~ 287 (559) .....+++++|||+ +..+. ...-|+|.+..+. .+......++....--+=.+...++|+.+.+... . T Consensus 200 ~~~~~~~~~~~rvpA~~vlH~-f~~r~---gQ~RGis~la~i~-~l~~l~~y~dael~~a~i~A~~~~fi~~~~~~~~~~ 274 (495) T protein:vir:10 200 SSLIGDPVDTVWIKAEHVLHV-TVLTV---RSDAGAPWFQLLL-RLNELDQYEDAELVRKKTAALFAAFIQEATADSTGG 274 (495) T ss_pred ccccccccceeeechhheEec-cccCC---CcccCcchhHHHH-HHHHhhHHHHHHHHHHHHhhhheeeeecCCCccccc Confidence 11244788999998 43332 2345888665433 2333222222222222333455666654322110 0 Q ss_pred cCC-HHHHHHHHHHHHHHhcCcccccccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhCCCHHHh-ccccccc Q lcl|NC_012530. 288 NTS-MRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVAMDPAEI-GMQNRGG 364 (559) Q Consensus 288 ~~~-~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~ 364 (559) ... ....+. -.....+ -+.|.++.|..| .+++.++.+ .-..|.+..+...+.||+.+|||-+.| |..+..+ T Consensus 275 ~~~~~~~~~~----~~~~~~~-l~pG~i~~L~pG-e~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~n 348 (495) T protein:vir:10 275 PTIGQPKRSK----GGKRITG-LNPGTLQYLQPG-QEVKFSNPADVGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVN 348 (495) T ss_pred cccCcccccc----Cccccee-cCCceeeecCCC-CeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccccccc Confidence 000 000000 0000011 124566666544 555555432 245789999999999999999999877 4444333 Q ss_pred cccccccchhhhhHHHHHH--------HHHHHHhhHHHHH-HHHHHHhhccc--cccCc---cceeeecc--hhhhhHHH Q lcl|NC_012530. 365 ATGNKSNSLNESNNQNKID--------ASKSKGLMPLLDM-IAKNLTNGIIR--QILGD---NYMLEFVG--GDTRSQQD 428 (559) Q Consensus 365 ~~~~~~~~~~~an~~~~~~--------~~~~~~l~P~~~~-ie~~ln~~L~~--~~~~~---~~~~~f~~--l~~~d~~~ 428 (559) |++ .++...++.+ .++...++|+..+ ++.++....++ .+... ...++|.+ ....|+.. T Consensus 349 YSS------~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~K 422 (495) T protein:vir:10 349 YSS------IRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLK 422 (495) T ss_pred HHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCchhhhHhhhccccccCCccccChHH Confidence 322 2222222222 2344556675554 55565554332 22111 12344443 34458888 Q ss_pred HHHHHHHHHcCC-CCHHHHHHHhCCCCCCCCCEeeccceeccccccccccccccccccccccccc-----ccCCCCCCCC Q lcl|NC_012530. 429 KLKSVQLELQTA-TTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLE-----SALQNPSGTP 502 (559) Q Consensus 429 ~~~~~~~~~~~~-~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~ 502 (559) .+++....+.+| +|+-++-++.|..|-+--+ ++.. + .+...+.+... ...++...++ T Consensus 423 e~~A~~~~i~~G~~s~~~~~a~~G~D~~~v~~------------q~a~----e-~~~~~~~Gl~~~~~p~~~~~~~~~~~ 485 (495) T protein:vir:10 423 KHLADLGDVRAGFAPISDKQAERGYDMEELFD------------MISD----A-NQLIDEYDLRLDSDPRYVNGSGAEQK 485 (495) T ss_pred HHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHH------------HHHH----H-HHHHHHcCCCCCCCCCcCCCccCCCC Confidence 888888888765 6999888888997743111 1000 0 00001111100 0000000000 Q ss_pred CCCCccccccchhcccccccc Q lcl|NC_012530. 503 PTLPPSSSNSFQQNQEGYTGK 523 (559) Q Consensus 503 ~~~~~~~~~~~~~~~~~~~~~ 523 (559) +..++.+ +++ T Consensus 486 ~~~~~~~-----------~~e 495 (495) T protein:vir:10 486 SVMEAAL-----------NNE 495 (495) T ss_pred CCCCCCC-----------CCC Confidence 0000000 000 No 133 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=99.48 E-value=5.8e-13 Score=87.66 Aligned_cols=409 Identities=11% Similarity=0.039 Sum_probs=181.1 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |- +++ .+. +|-.+...+..+..........| -.......+..+. .| ....+.++...| T Consensus 1 ~~--~~~--~~~---~~~~~~~~~~~~~rd~l~~~~~g-----------lg~~r~~~~~~~g-~~---~~~~~~~l~~~Y 58 (449) T protein:vir:10 1 MT--DKL--TLA---VNHALNDARMARARMGLMVPTMG-----------LDNKRHSAWCEYG-FP---ELVTYENLYSLY 58 (449) T ss_pred Cc--hhh--HHH---HhhhcchhHHHHHHHHHHHHHhc-----------CCcccchhhhhcC-Cc---ccCCHHHHHHHH Confidence 10 110 000 11111111111111111111111 1111011111110 01 134566777777 Q ss_pred hhChHHHHHHHHHHHHHH-hhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVT-EYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKL 159 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia-~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~ 159 (559) ..+.+.++||+.+++..- ..+.+ ...+.....+....-...++.++.+ .-|..+.+.+ T Consensus 59 r~~~ia~~iVd~~~d~~~~~~~~i------------~~g~~~~~~~~~~~~e~~~~~l~~~---------~~~~~l~ea~ 117 (449) T protein:vir:10 59 RRGGIAHGAVEKLVGKCWQTNPEI------------IEGDDADDSEDETSWEKKSKQVFTN---------RLWRSFAEAD 117 (449) T ss_pred hcCchhHHHHHhhhhhhhhcCccc------------ccCccccchhhhHHHHHHHHHHHHH---------HHHHHHHHHH Confidence 889999999999998542 11111 1111111011111111122222221 1122233343 Q ss_pred HHHHHHcCCcceEE-EECC---------CCcEEEEEEecCceEEEEe---cCcccccccceEEEEEec---C--ceeeee Q lcl|NC_012530. 160 VRDTYTYDQVNYEN-TYDS---------NGRLSHTRMVDPTTIYFAN---DEHGHRRTRGKIYRQYID---N--KVRGSF 221 (559) Q Consensus 160 v~d~ll~Gna~~~i-~rd~---------~G~~~~L~~l~p~~V~~~~---~~~g~~~~~~~~y~~~~~---~--~~~~~~ 221 (559) -+ ..++|-+++++ ++|+ .+.+..|.|+....|.+.. +... ..+.-+.|+++.. + .....+ T Consensus 118 ~~-~rl~Gga~i~i~v~d~~~l~~Pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s-p~yg~P~~y~v~~~~~g~~~~~~~i 195 (449) T protein:vir:10 118 RR-RLVGRYAGILLHIRDEKDWNLPATKGRGLQKVSVSWAGSLKVAEWDTGINS-KTYGQPKLWKYTERLPNGSSRRVDI 195 (449) T ss_pred Hh-hhccCcEEEEEEecCCCCCCcccccCcceeeEEeeccccCChhhhhcCCCC-CCCCCceEEEEeeeccCCCccceee Confidence 33 34567666655 3332 2356777777765555421 1111 1122344554331 1 112235 Q ss_pred cccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHH-HHHHHHHHHhcCCCceEE--------EecCccCCccCCHH Q lcl|NC_012530. 222 TADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENT-ELFNDRFFTHGGTTKGIL--------LVKPSPSVTNTSMR 292 (559) Q Consensus 222 ~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~-~~~~~~~f~ng~~p~gil--------~~~~~~~~~~~~~e 292 (559) .++-|+|+...| .-|.|-|+.+.+.+.....+ ..+...+++|-.+-..+. .+.... +...++ T Consensus 196 H~SRl~~~~~~~-------~~g~~~L~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~~~l~~~~--~~~~e~ 266 (449) T protein:vir:10 196 HPDRVFILGDYS-------EDAIGFLEPAYNAFVSLEKVEGGSGESFLKNAARQLNVNFEKEIDFTNLASLY--GVSIDE 266 (449) T ss_pred ccceeEeecCCC-------CCChhHHHHHHHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhHHh--hCCchH Confidence 566666653211 22778888887765333322 223333333321111110 000000 111233 Q ss_pred HHHHHHHHHHHHhcCcccccccccccCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhcccccccccccccc Q lcl|NC_012530. 293 ALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSN 371 (559) Q Consensus 293 ~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~ 371 (559) ..+++.+......+|.+ . .++..+ -+++.++. ..+ +.+.......+||++-+||...|-=...++.++++ T Consensus 267 ~~~~~~~~~~~~~~~~~---~-~~i~~~-~d~~~~~~~~sg--l~d~l~~~~q~iaaa~~IP~t~L~Gqsp~glnst~-- 337 (449) T protein:vir:10 267 LQDKFNEVAGEINRGND---V-LMTTQG-ATVTPLVTSVAD--PTATYNVNLQTAAAGVDIPTRILIGNQQAERSSTE-- 337 (449) T ss_pred HHHHHHHHHHHHhccch---h-eeecCC-cceEEEecccCC--hhHHHHHHHHHHHHHhCCCeeeeeccCccccccch-- Confidence 34455555554444443 2 233333 34555543 233 33455666778999999997776544445554332 Q ss_pred chhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHH-------HHH-cC---C Q lcl|NC_012530. 372 SLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQ-------LEL-QT---A 440 (559) Q Consensus 372 ~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~-------~~~-~~---~ 440 (559) ...|....... .+.-|+|.++++-+.|-+.-+... ...+.|+|+.|...+.++++++.+ ..+ .| . T Consensus 338 --D~~nyyd~i~~-~Q~~l~p~le~l~~~l~~s~~g~~-~~d~~i~f~pL~~~t~kEkAei~k~~A~a~~~~~~ag~~~~ 413 (449) T protein:vir:10 338 --DQKYFNARCQS-RRVDLSFEIEDFCDKLIELKIIDA-VAKKAVIWDDLNEQTGTEKLTNAKTMGEINQTMLGSGDNPA 413 (449) T ss_pred --hHHHHHHHHHH-HHHhhhHHHHHHHHHHHHhhcCCC-CCceeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHccccCC Confidence 23444444433 234588988888777755444322 236999999999999998877654 223 23 5 Q ss_pred CCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccch Q lcl|NC_012530. 441 TTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQ 514 (559) Q Consensus 441 ~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (559) ++++|+|+.+|++|..+.+.+ . .++++. +...++. . T Consensus 414 ~~~~EiR~~~~~~~~~~~~~~-------------~----------------------e~~de~-~~~~d~~--a 449 (449) T protein:vir:10 414 FSREEIRTAAGYDNDDEEPLG-------------E----------------------EDGDEE-DKATDSA--A 449 (449) T ss_pred cCHHHHHHHhcccCCCCCCCC-------------C----------------------CCCccc-cccCCcC--C Confidence 799999999999886431100 0 000000 0000000 0 No 134 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=99.46 E-value=1.5e-12 Score=85.48 Aligned_cols=497 Identities=10% Similarity=0.060 Sum_probs=222.5 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) +|-||-- +...-++.-+ |.+.+-- .-+.. .+...++-...++|+....+... .....+..+. ..+... T Consensus 52 ~~~~~~~--~~~~~~~~~~---~~~~~~~---~~~~~--~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~F~Gy-~~la~l 119 (698) T protein:vir:10 52 LNALDAA--PVAEPSPSLR---LARQFEV---DVSNY--TPRERRAASYALDFNGTSMDALS-FVTSSGFPGF-PTLVLL 119 (698) T ss_pred ccccccc--cccCCCcccc---cccccee---ccccC--Cccccchhhhhhcccccccccch-hhhccCcchH-HHHHHH Confidence 4444432 2222222222 2222211 11111 12222222233444433333221 1111223333 356677 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhh-----cCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTD-----DNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSF 155 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~-----~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f 155 (559) ++.|-+++|+.++++...+- |.-... .+-.|+.+....... .+-.+++.++.-+.+.+ .++- T Consensus 120 aQ~~eyr~~~~~ia~e~~R~-w~~~~~~~~e~~~~~g~~~~~~~~~~---~d~dqi~~L~~e~erl~---------V~~~ 186 (698) T protein:vir:10 120 AQLPEYRAMHEVLADECIRT-WGEAIGGTKEKADTSGLAAGGNAAST---SDGDQLKQINDEIERLR---------IRDA 186 (698) T ss_pred hhccchhhHHHHHHHHhhcc-cceeccccchhhhhhccccccccccc---ccHHHHHHHHHHHHHHH---------HHHH Confidence 88999999999999887643 211100 000011111111111 11234555666665542 2233 Q ss_pred HHHHHHHHHHcCCcceEEEEC-----------------CCCcEEEEEEecCceEEEEecCcc---cccccceEEEEEecC Q lcl|NC_012530. 156 LRKLVRDTYTYDQVNYENTYD-----------------SNGRLSHTRMVDPTTIYFANDEHG---HRRTRGKIYRQYIDN 215 (559) Q Consensus 156 ~~~~v~d~ll~Gna~~~i~rd-----------------~~G~~~~L~~l~p~~V~~~~~~~g---~~~~~~~~y~~~~~~ 215 (559) +...+..--++|-+.+++.-+ ..|.+..|.+|+|..|.+...... ...++-+.|+++. + T Consensus 187 l~eai~~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGL~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~-G 265 (698) T protein:vir:10 187 VRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMI-G 265 (698) T ss_pred HHHHHHhcccccceEEEEEeecCccccccccccccccccCccceeeeeecccccccchhhhccchhhccCCCceEEEe-c Confidence 333344444567666555321 235567799999998887431110 0112234455443 2 Q ss_pred ceeeeecccceEEEecccCCCccCCc---ccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecC--ccCCccCC Q lcl|NC_012530. 216 KVRGSFTADEMGMFIRNPRSDILSGG---YGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKP--SPSVTNTS 290 (559) Q Consensus 216 ~~~~~~~~~evi~~~~n~~~~~~~~~---~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--~~~~~~~~ 290 (559) . .+.+.-++.+...|.++..-.. +|+|-++.+...|.....+..........- ...++. ++- .+.++ .. T Consensus 266 ~---~IH~SRL~~~vg~pvpd~LKp~y~f~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~-~~~~l~-~dla~aL~~g-~~ 339 (698) T protein:vir:10 266 S---EVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQF-SVSGIL-MDLAQALTPG-AN 339 (698) T ss_pred c---eecceeEEEecCCCchhhhcchhccCCccHHHHHHHHHHHHHHHhhhHHHHHHHh-hHHHHH-HHHHHhcCCh-hh Confidence 2 3455556656555655544333 399999999999888777766666555432 222221 110 01111 11 Q ss_pred HHHHHHHHHHHHHHhcCcccccccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhCCCHHHhcccccccccccc Q lcl|NC_012530. 291 MRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNK 369 (559) Q Consensus 291 ~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~ 369 (559) .+... |-++-+++++ |-| +.+|..+.-+|...+.+ . -+-+......+.||.+-+||..+|--....+.+.++ T Consensus 340 ~~l~~--R~eli~~~Rs--n~G-~~llDk~~Eefeq~st~lS--GLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATG 412 (698) T protein:vir:10 340 VDLSM--RAELINRYRD--NRN-ILFLDKATEEFFQFNTPLS--GLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASS 412 (698) T ss_pred HHHHH--HHHHHHHhcC--ccc-eEEEecCCcceEEEecCcC--CHHHHHHHHHHHHHhhhcCchhhhhccCCcccCccc Confidence 12112 3344455554 333 34554334566666532 2 233455556678999999998777555555553322 Q ss_pred ccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHH-------HH-cCCC Q lcl|NC_012530. 370 SNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQL-------EL-QTAT 441 (559) Q Consensus 370 ~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~-------~~-~~~~ 441 (559) .....|.-+.....-..-|+|.++++-+.|-+.+|... ...+.|+|+.|...+.++++++.++ ++ .|.+ T Consensus 413 --E~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~i-dp~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI 489 (698) T protein:vir:10 413 --EGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAV-DPSIKWQWNALRELDDLEVAEARYKQAQSDVLYVQEQVI 489 (698) T ss_pred --hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-CCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCC Confidence 22344555555556667899999998888877777654 3469999999999999998887532 23 3668 Q ss_pred CHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhcccccc Q lcl|NC_012530. 442 TVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYT 521 (559) Q Consensus 442 T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 521 (559) +++|+|.++.-.|--+ | .-.++..+.....++...+.... ...+..+..+...+.+ .-.+.. T Consensus 490 ~~~evr~rL~~d~~s~----Y----~~~~d~~d~p~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~------~~~~~~ 551 (698) T protein:vir:10 490 RPDQVAARLNTEPDGP----Y----AGKLDANDDPGAPADDDIDGVLT----YVQRMAEGGDTGAPTA------PGGARA 551 (698) T ss_pred CHHHHHHHHhccCCCc----c----ccccCCcccCCCCCCCcchHHHh----hhcCCcCCCCcccccc------cccccC Confidence 9999999998765321 0 00011000000000000000000 0000000000000000 000111 Q ss_pred cccccccc---ccccccccccccccccchhhhhhccCCCCC Q lcl|NC_012530. 522 GKDAKPSG---KDNQQGVGKDGQLKNKKNTNSYKQGGSSKK 559 (559) Q Consensus 522 ~~~~~~~g---~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 559 (559) +...-+.. +-|...+....+...-..+--.-. ++.| T Consensus 552 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~giv~~--~g~~ 590 (698) T protein:vir:10 552 GATAPPAAANVNANANPREAGAQDAAMRAAGIVFR--AGDK 590 (698) T ss_pred CCCCCcccccccCCCCccccCcccceeeEEEEEEE--cCCe Confidence 11110000 000111111111000000000000 1223 No 135 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=99.41 E-value=8.1e-12 Score=81.40 Aligned_cols=495 Identities=10% Similarity=0.073 Sum_probs=223.4 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) +|-||-- +.+.-++.-+ |.+.+--+..-. .++..++-...++|+........ .....+..+. ..+... T Consensus 52 ~~~~~~~--~~~~~~~~~~---~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~F~Gy-~~la~l 119 (695) T protein:vir:36 52 LNALDAA--PVVEPSPSLR---LARQFEVDVSNY-----TPRERRAASYALDFNGTSMDALS-FVTSSGFPGF-PTLVLL 119 (695) T ss_pred ccccccc--cccCCCcccc---cceeceeccccc-----Cccccchhhhhhcccccccccch-hhhccCcchH-HHHHHH Confidence 5555543 2232333222 222221111111 12222222233444433333221 1111223333 356677 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhh-----hcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRAST-----DDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSF 155 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~-----~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f 155 (559) ++.|-+++|+.++++...+- |.-.. +.+..|+.+...+..+. +-.+++.++.-+.+.+ .++- T Consensus 120 aQ~~eyr~~~~~ia~e~~R~-w~~~~~~~~e~~~~~g~~~~~~~~~~~---d~dqik~L~~e~erL~---------V~~~ 186 (695) T protein:vir:36 120 AQLPEYRAMHEVLADECIRT-WGEAIGGTKEKADTSGLAAGGNAASTS---DGDQLKQINDEIERLR---------IRDA 186 (695) T ss_pred hhccchhhHHHHHHHHhhcc-cceecccchhhhhhccccccccccccC---chHHHHHHHHHHHHHH---------HHHH Confidence 88999999999999987643 21110 00011111111111111 1234555666655532 2333 Q ss_pred HHHHHHHHHHcCCcceEEEEC-----------------CCCcEEEEEEecCceEEEEecC----cccccccceEEEEEec Q lcl|NC_012530. 156 LRKLVRDTYTYDQVNYENTYD-----------------SNGRLSHTRMVDPTTIYFANDE----HGHRRTRGKIYRQYID 214 (559) Q Consensus 156 ~~~~v~d~ll~Gna~~~i~rd-----------------~~G~~~~L~~l~p~~V~~~~~~----~g~~~~~~~~y~~~~~ 214 (559) +...+..--++|-+.+++.-+ ..|.+..|.+|+|..|.+.... .+- .++-+.|+++. T Consensus 187 l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~sp-dfgkP~~y~V~- 264 (695) T protein:vir:36 187 VRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVAD-DFYKPSTWWMI- 264 (695) T ss_pred HHHHHHhhccccceEEEEEeccCccccccccccccccccCcceeeeEeecccccccchhhhccchhh-ccCCCceEEEe- Confidence 333344445577776555322 2356777999999998874311 111 12234444443 Q ss_pred CceeeeecccceEEEecccCCCccCC---cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecC--ccCCccC Q lcl|NC_012530. 215 NKVRGSFTADEMGMFIRNPRSDILSG---GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKP--SPSVTNT 289 (559) Q Consensus 215 ~~~~~~~~~~evi~~~~n~~~~~~~~---~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--~~~~~~~ 289 (559) +. .+.+.-++.|.-.|.++..-. .+|+|-.+.+...|.....+..........- ...++ +++- .+.++ . T Consensus 265 G~---kIH~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~-~v~~l-k~dla~aL~~g-~ 338 (695) T protein:vir:36 265 GT---EVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQF-SVSGI-LMDLAQALMPG-A 338 (695) T ss_pred ce---EEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhh-hHHHH-HHHHHHhhcCh-h Confidence 22 345555655555555554322 3499999999988888777766666555432 22222 1110 01111 1 Q ss_pred CHHHHHHHHHHHHHHhcCcccccccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccc Q lcl|NC_012530. 290 SMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGN 368 (559) Q Consensus 290 ~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~ 368 (559) ..+... |-++-+++++ |-| +.+|..+.-+|...+.+ . -+-+......+.||.+-+||..+|--....+.+.+ T Consensus 339 ~~~l~~--R~eli~~~Rs--n~G-~~llDk~~Eefeq~stslS--GLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNAT 411 (695) T protein:vir:36 339 NVDLSM--RAELINRYRD--NRN-ILFLDKATEEFFQFNTPLS--GLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNAS 411 (695) T ss_pred HHHHHH--HHHHHHHhcC--ccc-eEEEecCCcceEEEecccC--CHHHHHHHHHHHHHhhhcCchhhhhccCccccccc Confidence 112122 3344455554 333 34555334566665532 2 23344455667899999999877755555555332 Q ss_pred cccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHH-------HH-cCC Q lcl|NC_012530. 369 KSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQL-------EL-QTA 440 (559) Q Consensus 369 ~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~-------~~-~~~ 440 (559) + .....|.-+.....-..-|+|.++++-+.|-+.+|... ...+.|+|+.|...+.++++++.++ ++ .+. T Consensus 412 G--E~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~i-dpdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gv 488 (695) T protein:vir:36 412 S--EGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAV-DPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQV 488 (695) T ss_pred c--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-CCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcC Confidence 2 22344555555566677899999998888877776653 3469999999999999988887532 33 366 Q ss_pred CCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhccccc Q lcl|NC_012530. 441 TTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGY 520 (559) Q Consensus 441 ~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (559) ++++|+|.++.-+|--+ +...++..+......+...... ..-.++.++.++. .+..+ .. T Consensus 489 I~~~evr~rL~~d~~s~--------Y~~~~D~~d~p~~~~~~~~~~~---~~~~~~~~~~~~~---~~~~~-------~~ 547 (695) T protein:vir:36 489 IRPDQVAARLNTEPDGP--------YAGKLDANDDPGVPADDDIDGV---LTYVQRLAEGGDT---GAPGG-------AR 547 (695) T ss_pred CCHHHHHHHHhcCCCcc--------cccccccccCCCcCccchhhhh---HhhhcCccccccc---CCCCc-------cc Confidence 89999999998876321 0000110000000000000000 0000000000000 00000 00 Q ss_pred cccccccccccccccc---cc----------------ccccccc-c-hhhhhhccCCCCC Q lcl|NC_012530. 521 TGKDAKPSGKDNQQGV---GK----------------DGQLKNK-K-NTNSYKQGGSSKK 559 (559) Q Consensus 521 ~~~~~~~~g~~~~~~~---~~----------------~~~~k~~-~-~~~~~~~~~~~~~ 559 (559) .+....++.....-+. +. ||..=.. + .++=.--|||=+. T Consensus 548 ~g~~~~~~v~~~~~~~~~~~ag~~~~~~~aag~v~~~~g~vLl~kr~~g~W~lPgG~vE~ 607 (695) T protein:vir:36 548 AGATAPPTVANVNANVNPREAGAQDAAMRAAGAVYVVDGKVLLMKRPAGDWGLPAGKVEG 607 (695) T ss_pred ccccCCCcccccccccCccccCCCCccceeeEEEEEeCCEEEEEEecCCCccCCccccCC Confidence 1111101100000000 00 1100000 0 0011111222111 No 136 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=99.41 E-value=2.8e-11 Score=78.43 Aligned_cols=434 Identities=11% Similarity=0.084 Sum_probs=219.3 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCC--CCcccHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSP--IAFGRITDVLR 78 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~--~~~~~~~~~~~ 78 (559) =+||+.--.++...... +.....++.+.+.+... .+.+..+.+.+ ..+....++.+ T Consensus 3 ~~i~~~~g~~~~~~~~~------------~~~~~~ia~~~~~~~~~----------~~~~~~p~~~~il~~~~~~~~~y~ 60 (491) T protein:vir:79 3 KGLWVSPTEFVKFGEPD------------KSLSSQIATRARSIDFF----------ALGMYLPNPDPVLKALGKDIRVYR 60 (491) T ss_pred CeeeCCCCCcccccccc------------hhHHHHHhhhccccccc----------cccccCcchhHHHhhccCCHHHHH Confidence 24554443222221111 11111122222222110 01111111110 00111123444 Q ss_pred HHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHH Q lcl|NC_012530. 79 QYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRK 158 (559) Q Consensus 79 ~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~ 158 (559) ..+..+.|.+|+..|...|... .|.|.+.+.+ .+..+.+..+|.++ .|..++.. T Consensus 61 ~m~~D~~i~s~l~~Rk~av~~~-----------~w~i~~~~~~------~~~a~~i~e~l~~~---------~~~~~i~~ 114 (491) T protein:vir:79 61 ELRADAHVGGCVRRRKAAVKAL-----------EWGLDRGKAK------SRVAKSIADVFADL---------DLSRIATE 114 (491) T ss_pred HHhhChHHHHHHHHHHHHHhCC-----------CcEEecCCCC------HHHHHHHHHHHhcC---------CHHHHHHH Confidence 4567899999999999998854 4555543321 12234556666543 25567766 Q ss_pred HHHHHHHcCCcceEEEECCC-C--cEEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCC Q lcl|NC_012530. 159 LVRDTYTYDQVNYENTYDSN-G--RLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRS 235 (559) Q Consensus 159 ~v~d~ll~Gna~~~i~rd~~-G--~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~ 235 (559) |+ +.+.+|.++.+++|... | .|..|.++|+.++.+. .++.. ++....+......+++...|++++.+.+ T Consensus 115 ~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~d--~~~~l-----~l~~~~~~~~g~~lp~~k~i~~~~~~~~ 186 (491) T protein:vir:79 115 ML-DAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFVYD--PENQL-----RFRSKEHWVQGEELPARKFLVPRQEATY 186 (491) T ss_pred HH-HhhhhcceeEEEEEeecCCeeeEEeeeeecccceeec--cCCce-----EEeecCCCCCceeecCCCeEEEEecCCC Confidence 55 67789999999998654 3 3568999999888643 23321 2222222222234566666666665544 Q ss_pred CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccc Q lcl|NC_012530. 236 DILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIP 315 (559) Q Consensus 236 ~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~ 315 (559) + .+||.+.+..|......-....++...|...-+.|--+.+++. ..+++.++++.+.+.+..+. + .. T Consensus 187 g---~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~~-----~a~~~ek~~l~~al~~~~~~---a--~~ 253 (491) T protein:vir:79 187 L---NPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPR-----SASDAETNLLLDRLEDMVQD---A--VA 253 (491) T ss_pred C---CcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCC-----CCCHHHHHHHHHHHHHHhcC---e--EE Confidence 3 4789999999999999999999999999999999987777653 35677888887777765332 2 23 Q ss_pred cccCC-ceeeeeccc-cchh-HHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhH Q lcl|NC_012530. 316 MITAE-DAKFVSMTQ-AEDM-QFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMP 392 (559) Q Consensus 316 vl~~g-~~~~~~ls~-~~D~-qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P 392 (559) |++.+ .+++...+. .... .|.+..++..++|+.+. ||-. .++..+++ ++..+... ......+.- T Consensus 254 viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~i------LGqt----lTt~~~gs--~a~~~vh~-~v~~~i~~~ 320 (491) T protein:vir:79 254 VIPDDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIAL------LGQN----QTTEATST--RASAQAGL-EVTDDIRDG 320 (491) T ss_pred EecCCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHH------hhhh----hccCcccc--hhhHHHHH-HHHHHHHHH Confidence 55443 345554332 2223 37888888889998865 5532 22222222 33332222 234556777 Q ss_pred HHHHHHHHHHhhccccc------cCccceeeecchhhhh-HHHHHHHHHHHHcCC--CCHHHHHHHhCCCCCCCCCEeec Q lcl|NC_012530. 393 LLDMIAKNLTNGIIRQI------LGDNYMLEFVGGDTRS-QQDKLKSVQLELQTA--TTVNDYREKQGLPKIAGGDIILS 463 (559) Q Consensus 393 ~~~~ie~~ln~~L~~~~------~~~~~~~~f~~l~~~d-~~~~~~~~~~~~~~~--~T~NE~R~~~gl~pi~gGD~~~~ 463 (559) .+..|+..||+ |+.+. ....++|.|.. ..+ .+.+++.++..+..| ++..++|+.+|+|+.+.++.++. T Consensus 321 D~~~i~~tln~-li~~l~~~N~~~~~~p~f~~~e--~ee~~~~~a~~~~~L~~~G~~i~~~~~~e~~Gip~~~~~e~~~~ 397 (491) T protein:vir:79 321 DKAIVVEAMNM-LIRWICDLNFDGAARPVFDMWE--QEQVDEIQAGRDEKLTRAGARFTPAYFKRAYNLQDGDLDERPLP 397 (491) T ss_pred HHHHHHHHHHH-HHHHHHHhcCCCCCcceEeecC--cCchhHHHHHHHHHHHhCCCccCHHHHHHHhCCCCCCCCccccC Confidence 88888888885 55431 12235555543 233 245677787777644 58999999999987655443220 Q ss_pred cceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhccccccccccccccccccc-cccccccc Q lcl|NC_012530. 464 AVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQ-GVGKDGQL 542 (559) Q Consensus 464 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~-~~~~~~~~ 542 (559) +. . ... .......+.... .++. .+. . ..... ..-+..-. T Consensus 398 ~~------~---------------~~~----~~~~~~~~~~~~---------~~~~---~d~--~-~~~~~~~~~~~~~~ 437 (491) T protein:vir:79 398 VS------A---------------VDA----VGAASFAEFEAP---------DQDA---LDA--A-LNALSARDLNADAQ 437 (491) T ss_pred cC------c---------------ccc----cccccccccCCC---------CCcc---hHH--H-HHHHHHHHHHHHHH Confidence 00 0 000 000000000000 0000 000 0 00000 00000000 Q ss_pred cccchhhhhhccCCCCC Q lcl|NC_012530. 543 KNKKNTNSYKQGGSSKK 559 (559) Q Consensus 543 k~~~~~~~~~~~~~~~~ 559 (559) .-.....+.-+.+.+-- T Consensus 438 ~~~~~i~~~l~~~~s~~ 454 (491) T protein:vir:79 438 ALVAPLLKRIANGASAD 454 (491) T ss_pred HHHHHHHHHHHhcCCHH Confidence 00011111111111111 No 137 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=99.37 E-value=3.3e-11 Score=78.05 Aligned_cols=495 Identities=10% Similarity=0.065 Sum_probs=221.7 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) +|-||-- +...-++.-+ |.+.+-- .-+.. .+...++-...++|+........ .....+..+. ..+... T Consensus 52 ~~~~~~~--~~~~~~~~~~---~~~~~~~---~~~~~--~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~F~Gy-~~la~l 119 (695) T protein:vir:78 52 LNALDAA--PVAEPSPSLR---LARQFEV---DVSNY--TPRERRAASYALDFNGTSMDALS-FVTSSGFPGF-PTLVLL 119 (695) T ss_pred ccccccc--cccCCCcccc---cceecee---ccccC--Cccccchhhhhhcccccccccch-hhhccCcchH-HHHHHH Confidence 4444432 2222222212 2222111 11111 12222222233444433333221 1111223333 356677 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhh-----cCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTD-----DNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSF 155 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~-----~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f 155 (559) ++.|-+++|+.++++...+- |.-... .+-.|+.+....... .+-.+++.++.-+.+.+ .++- T Consensus 120 aQ~~eyr~~~~~ia~e~~R~-w~~~~~~~~e~~~~~g~~~~~~~~~~---~d~dqi~~L~~e~erL~---------V~~~ 186 (695) T protein:vir:78 120 AQLPEYRAMHEVLADECIRT-WGEAIGGTKEKADTSGLAAGGNAAST---SDGDQLKQINDEIERLR---------IRDA 186 (695) T ss_pred hhccchhhHHHHHHHHhhcc-cceeccccchhhhhhccccccccccc---ccHHHHHHHHHHHHHHH---------HHHH Confidence 88999999999999987643 211100 000011111111111 11234555666655542 2333 Q ss_pred HHHHHHHHHHcCCcceEEEEC-----------------CCCcEEEEEEecCceEEEEec----CcccccccceEEEEEec Q lcl|NC_012530. 156 LRKLVRDTYTYDQVNYENTYD-----------------SNGRLSHTRMVDPTTIYFAND----EHGHRRTRGKIYRQYID 214 (559) Q Consensus 156 ~~~~v~d~ll~Gna~~~i~rd-----------------~~G~~~~L~~l~p~~V~~~~~----~~g~~~~~~~~y~~~~~ 214 (559) +...+..--++|-+.+++.-+ ..|.+..|.+|+|..|.+... ..+- .++-+.|+++. T Consensus 187 l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~sp-dfgkP~~y~V~- 264 (695) T protein:vir:78 187 VRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVAD-DFYKPSTWWMI- 264 (695) T ss_pred HHHHHHhhccccceEEEEEeccCccccccccccccccccCcceeeeEeecccccccchhhhccchhh-ccCCCceEEEe- Confidence 333344445577776555322 235667799999999887431 1111 12234444443 Q ss_pred CceeeeecccceEEEecccCCCccCC---cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecC--ccCCccC Q lcl|NC_012530. 215 NKVRGSFTADEMGMFIRNPRSDILSG---GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKP--SPSVTNT 289 (559) Q Consensus 215 ~~~~~~~~~~evi~~~~n~~~~~~~~---~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--~~~~~~~ 289 (559) +. .+.+.-++.|.-.|.++..-. .+|+|-.+.+...|.....+..........- ...++ +++- .+.++ . T Consensus 265 G~---kIH~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~-~v~~l-k~dla~~L~~g-~ 338 (695) T protein:vir:78 265 GT---EVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQF-SVSGI-LMDLAQALMPG-A 338 (695) T ss_pred ce---EEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhh-hhHHH-HHHHHHhhcCh-h Confidence 22 345555555555555554322 3499999999998888877766666655432 22222 1110 01111 1 Q ss_pred CHHHHHHHHHHHHHHhcCcccccccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccc Q lcl|NC_012530. 290 SMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGN 368 (559) Q Consensus 290 ~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~ 368 (559) ..+... |-++-+++++ |-| +.+|..+.-+|...+.+ . -+-+......+.||.+-+||..+|--....+.+.+ T Consensus 339 ~~~l~~--R~eli~~~Rs--n~G-~~llDk~~Eefeq~stslS--GLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNAT 411 (695) T protein:vir:78 339 NVDLSM--RAELINRYRD--NRN-ILFLDKATEEFFQFNTPLS--GLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNAS 411 (695) T ss_pred HHHHHH--HHHHHHHhcC--ccc-eEEEecCCcceEEEecccC--CHHHHHHHHHHHHHhhhcCchhhhhccCCcccccc Confidence 111112 3344455554 333 34555334566665532 2 23344455667899999999877755555555332 Q ss_pred cccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHH-------HH-cCC Q lcl|NC_012530. 369 KSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQL-------EL-QTA 440 (559) Q Consensus 369 ~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~-------~~-~~~ 440 (559) + .....|.-+.....-..-|+|.++++-+.|-+.+|... ...+.|+|+.|...+.++++++.++ ++ .+. T Consensus 412 G--E~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~i-dpdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gv 488 (695) T protein:vir:78 412 S--EGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAV-DPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQV 488 (695) T ss_pred c--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-CCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcC Confidence 2 22344555555666677899999998888877776654 3469999999999999988887532 33 366 Q ss_pred CCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhccccc Q lcl|NC_012530. 441 TTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGY 520 (559) Q Consensus 441 ~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (559) ++++|+|.++.-+|--+ +...++..+......+...... ..-.++.++.++. .+..+ .. T Consensus 489 I~~~evr~rL~~d~~s~--------Y~~~~D~~d~p~~~~~~~~~~~---~~~~~~~~~~~~~---~~~~~-------~~ 547 (695) T protein:vir:78 489 IRPDQVAARLNTEPDGP--------YAGKLDANDDPGVPADDDIDGV---LTYVQRLAEGGDT---GAPGG-------AR 547 (695) T ss_pred CCHHHHHHHHhcCCCcc--------cccccccccCCCcCccchhhhh---HhhhcCccccccc---CCCCC-------CC Confidence 89999999998876321 0000110000000000000000 0000000000000 00000 00 Q ss_pred cccccccccccccccc---cc----------------ccccccc-c-hhhhhhccCCCCC Q lcl|NC_012530. 521 TGKDAKPSGKDNQQGV---GK----------------DGQLKNK-K-NTNSYKQGGSSKK 559 (559) Q Consensus 521 ~~~~~~~~g~~~~~~~---~~----------------~~~~k~~-~-~~~~~~~~~~~~~ 559 (559) .|....++-.....++ +. ||..=.. + .++=.--|||=+. T Consensus 548 ~g~~~~~~~~~~~~~~~~~~ag~~~~~~~aag~v~~~~g~vLl~kr~~g~W~lPgG~vE~ 607 (695) T protein:vir:78 548 AGATAPPTVANVNANVKPREAGAQDAAMRAAGAVYVVDGKVLLMKRPAGDWGLPAGKVEG 607 (695) T ss_pred CCCCCCCceeeeeccccccccCCCCcccceeEEEEEeCCEEEEEEecCCCccCCccccCC Confidence 0000000000000001 00 1100000 0 0000111121111 No 138 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=99.34 E-value=7e-11 Score=76.27 Aligned_cols=498 Identities=10% Similarity=0.063 Sum_probs=221.4 Q ss_pred CcchhhhccccccC-------CcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccH Q lcl|NC_012530. 1 MGIFDRFRTKFYTD-------DPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRI 73 (559) Q Consensus 1 ~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~ 73 (559) |++--- |++.+++ ++..- -+|.+.+..+..-...+.++ +-...++|+....+.. ......+..+. T Consensus 41 ~~~~~~-~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~l-~~~~~~~F~Gy 112 (694) T protein:vir:10 41 VPADFA-RRGALNALDAAPVAEPSPS-LRLARQFEVDVSNYTPRERR-----AASYALDFNGTSMDAL-SFVTSSGFPGF 112 (694) T ss_pred ccCCcc-ccccchhhcccccCCCCcc-hhhhhhccccccCCCccccc-----hhhhhhccCcccccch-hhhhccCcchH Confidence 544211 1111111 11111 22333332222222222221 1112233333332221 11111223333 Q ss_pred HHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhh-----cCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCC Q lcl|NC_012530. 74 TDVLRQYSMNVVLNAIINTRANQVTEYAHRASTD-----DNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPI 148 (559) Q Consensus 74 ~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~-----~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~ 148 (559) ..+...++.|-+++|+.++++...+- |.-... .+-.|+.+....... .+-.+++.++.-+.+.+ T Consensus 113 -~~la~laQ~~eyr~~~~~ia~e~~R~-w~~~~~~~~e~~~~~g~~~~~~~~~~---~d~dqi~~L~~e~erl~------ 181 (694) T protein:vir:10 113 -PTLVLLAQLPEYRAMHEVLADECIRT-WGEAIGGTKEKADTSGLAAGGNAAST---SDGDQLKQINDEIERLR------ 181 (694) T ss_pred -HHHHHHhhccchhhHHHHHHHHhhcc-cceeccccchhhhhhccccccccccc---ccHHHHHHHHHHHHHHH------ Confidence 35667788999999999999887643 211100 000011111111111 11234555666655542 Q ss_pred hhhHHHHHHHHHHHHHHcCCcceEEEEC-----------------CCCcEEEEEEecCceEEEEecC----cccccccce Q lcl|NC_012530. 149 RDDFTSFLRKLVRDTYTYDQVNYENTYD-----------------SNGRLSHTRMVDPTTIYFANDE----HGHRRTRGK 207 (559) Q Consensus 149 ~~~~~~f~~~~v~d~ll~Gna~~~i~rd-----------------~~G~~~~L~~l~p~~V~~~~~~----~g~~~~~~~ 207 (559) .++-+...+..--++|-+.+++.-+ ..|.+..|.+|+|..|.+.... .+- .++-+ T Consensus 182 ---V~~~l~eaik~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~sp-dfgkP 257 (694) T protein:vir:10 182 ---IRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVAD-DFYKP 257 (694) T ss_pred ---HHHHHHHHHHhhccccceEEEEEeecCccccccccccccccccCcceeeeEeecccccccchhhhccchhh-ccCCC Confidence 2333333344445577776655321 2356777999999998874311 111 12234 Q ss_pred EEEEEecCceeeeecccceEEEecccCCCccCC---cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecC-- Q lcl|NC_012530. 208 IYRQYIDNKVRGSFTADEMGMFIRNPRSDILSG---GYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKP-- 282 (559) Q Consensus 208 ~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~---~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-- 282 (559) .|+++. +. .+.+.-++.|.-.|.++..-. .+|+|-.+.+...|.....+..........- ...++ +++- T Consensus 258 ~~y~V~-G~---~IH~SRL~~f~g~plPd~LKp~y~~~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~-~v~~l-k~dla~ 331 (694) T protein:vir:10 258 STWWMI-GT---EVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQF-SVSGI-LMDLAQ 331 (694) T ss_pred ceEEEe-ce---EEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhh-hhHHH-HHHHHH Confidence 444443 22 345555555555555554322 3499999999998888777766666655432 22222 1110 Q ss_pred ccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhCCCHHHhcccc Q lcl|NC_012530. 283 SPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVAMDPAEIGMQN 361 (559) Q Consensus 283 ~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~ 361 (559) .+.++ ...+... |-++-+++++ |-| +.+|..+.-+|...+.+ . -+-+......+.||.+-+||..+|--.. T Consensus 332 ~L~~g-~~~~l~~--R~eli~~~Rs--n~G-~~llDk~~Eefeq~stslS--GLddVi~qf~q~VAgaa~IPltkLfGqS 403 (694) T protein:vir:10 332 ALMPG-ANVDLSM--RAELINRYRD--NRN-ILFLDKATEEFFQFNTPLS--GLDALQAQAQEQMSAVSHIPLIKLLGIT 403 (694) T ss_pred hhcCh-hHHHHHH--HHHHHHHhcC--ccc-eEEEecCCcceEEEecccC--CHHHHHHHHHHHHHhhhcCchhhhhccC Confidence 01111 1111112 3344455554 333 34555334566665532 2 2334445566789999999987775555 Q ss_pred ccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHH------ Q lcl|NC_012530. 362 RGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQL------ 435 (559) Q Consensus 362 ~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~------ 435 (559) .++.+.++ .....|.-+.....-..-|+|.++++-+.|-+.+|... ...+.|+|+.|...+.++++++.++ T Consensus 404 PkGlNATG--E~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~i-dp~i~~~fnPL~qmtd~EkAeI~~k~A~~d~ 480 (694) T protein:vir:10 404 PTGLNASS--EGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAV-DPSIKWQWNALRELDDLEVAESRYKQAQSDV 480 (694) T ss_pred cccccccc--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-CCcceEEeCCCCCcCHHHHHHHHhhhhHHHH Confidence 55553322 22344555555666677899999998888877776653 3469999999999998988887532 Q ss_pred -HH-cCCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccc Q lcl|NC_012530. 436 -EL-QTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 436 -~~-~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (559) ++ .+.++++|+|.++.-+|--+ +...++..+......+...... ..-.++.++.++. .+..+ T Consensus 481 ~~~~~gvI~~~evr~rL~~d~~s~--------Y~~~~D~~d~p~~~~~~~~~~~---~~~~~~~~~~~~~---~~~~~-- 544 (694) T protein:vir:10 481 LYVQEQVIRPDQVAARLNTEPDGP--------YAGKLDANDDPGVPADDDIDGV---LTYVQRLAEGGDT---GAPGG-- 544 (694) T ss_pred HHHHhcCCCHHHHHHHHhcCCCcc--------cccccccccCCCcCccchhhhh---HhhhcCccccccc---CCCCc-- Confidence 33 36689999999998876321 0000110000000000000000 0000000000000 00000 Q ss_pred hhccccccccccccccccccccc---cc----------------ccccccc--chhhhhhccCCCCC Q lcl|NC_012530. 514 QQNQEGYTGKDAKPSGKDNQQGV---GK----------------DGQLKNK--KNTNSYKQGGSSKK 559 (559) Q Consensus 514 ~~~~~~~~~~~~~~~g~~~~~~~---~~----------------~~~~k~~--~~~~~~~~~~~~~~ 559 (559) ...+....++.....-+. +. ||..=.. ..++=.--|||=+. T Consensus 545 -----~~~g~~~~~~v~~~~~~~~~~~ag~~~~~~~~ag~v~~~~g~vLl~kr~~g~W~lPgG~vE~ 606 (694) T protein:vir:10 545 -----ARAGATAPPTVANVNANVNPREAGAQDAAMRAAGAVYVVDGKVLLMKRPAGDWGLPAGKVEG 606 (694) T ss_pred -----ccccccCCCcccccccccCccccCCCCccceeeEEEEEeCCEEEEEEecCCCccCCccccCC Confidence 000110001100000000 00 1100000 00111111222111 No 139 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=99.16 E-value=1.1e-09 Score=69.75 Aligned_cols=328 Identities=16% Similarity=0.121 Sum_probs=160.9 Q ss_pred ceEEEECCC-C--cEEEEEEecCceEE-EEecCcccccccceEEEEEe-cCceeeeecccceEEEecccCCCccCCcccc Q lcl|NC_012530. 170 NYENTYDSN-G--RLSHTRMVDPTTIY-FANDEHGHRRTRGKIYRQYI-DNKVRGSFTADEMGMFIRNPRSDILSGGYGL 244 (559) Q Consensus 170 ~~~i~rd~~-G--~~~~L~~l~p~~V~-~~~~~~g~~~~~~~~y~~~~-~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~ 244 (559) +.||+|... | .|..|.+.++.++. ...+.++... ...+.. .+.....++....|++++.+.+ ..+||. T Consensus 1 v~Eivw~~~~g~~~~~~l~~r~~~~~~~f~~~~~~~l~----~~~~~~~~g~~~~~lp~~kfi~~~~~~~~---g~p~G~ 73 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLAWRPPRTISRFDVAPDGGLV----AIEQWGVFGKATVRIPVDRLVVFVNEREG---ANWLGQ 73 (355) T ss_pred CeEEEEEeeCCeEEEeeeeecCccceeeeeeccCCcee----EEEecCCCCCCcceeccCCEEEEEeCCCC---CCccch Confidence 788988654 4 36788888988664 3344444321 222222 2222334566667776665543 347899 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccC--------CHHHHHHHHHHHHHHhcCccccccccc Q lcl|NC_012530. 245 SELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNT--------SMRALEDFKRHWTATSSGINGAYRIPM 316 (559) Q Consensus 245 Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~--------~~e~~~~l~~~~~~~~~G~~nag~~~v 316 (559) +.+..|......-....++...|...-+.|--+.+.+.+....+. +.+.++.+.........|.. + ..| T Consensus 74 gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~~~~~i~~g~~-a--~~i 150 (355) T protein:vir:78 74 SLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGLQLAKEFRAGEA-A--GGY 150 (355) T ss_pred hhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHHHHHHHhhCCcc-e--eEe Confidence 999999999999999999999999876554444444433221111 12223333333333323422 2 235 Q ss_pred ccCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHH Q lcl|NC_012530. 317 ITAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLD 395 (559) Q Consensus 317 l~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~ 395 (559) ++.+ +++.-+.. .....|.+..++..++|+.++. |-.-... ...++++ ++-.+. ......+.+.-.+. T Consensus 151 ip~g-~~ie~~ea~g~~~~~~~~i~~~d~~Isk~iL------GqtlTs~-~~~~gGS--~Alg~v-h~~v~~~~~~aD~~ 219 (355) T protein:vir:78 151 IPHG-ANFTLTGVQGKLPEMDGPIRYHDEQIARAVL------AHFLTLG-GDKSTGS--YALGDT-FASFFTGSLNAVMK 219 (355) T ss_pred ecCC-ceEEEeecCCCcccHHHHHHHHHHHHHHHHh------hhhhccc-cCCccch--hhHHHH-HHHHHHHHHHHHHH Confidence 5444 55555542 3334577888889999988763 3211100 0011122 222222 22344567778889 Q ss_pred HHHHHHHhhccccc-----cC--ccceeeecchhhhhHHHHHHHHHHHHcCCC-CH-----HHHHHHhCCCCCCCCCEee Q lcl|NC_012530. 396 MIAKNLTNGIIRQI-----LG--DNYMLEFVGGDTRSQQDKLKSVQLELQTAT-TV-----NDYREKQGLPKIAGGDIIL 462 (559) Q Consensus 396 ~ie~~ln~~L~~~~-----~~--~~~~~~f~~l~~~d~~~~~~~~~~~~~~~~-T~-----NE~R~~~gl~pi~gGD~~~ 462 (559) .|++.||+.|+... .. ...+|.|.... .+.++.++.++.++..|+ .+ +.+|+.+|+|+-+.++... T Consensus 220 ~i~~~ln~~li~~l~~lN~~~~~~~P~~~~~~~~-~~~~~~a~~~~~l~~~G~~~~~~~~~~~~~e~~gip~p~~~~~~~ 298 (355) T protein:vir:78 220 HIADVTQQHVVEDLVDQNWGPEEPAPRLVPAQLG-KEQPVTAEAIRALVECGAFTADPELEKDLRARYGLPAPAERDDGA 298 (355) T ss_pred HHHHHHHHHHHHHHHHhcCCCCCCCCEEEecCcC-hhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCCccc Confidence 99999998886531 11 12366776544 344567888888776543 32 4589999998655554432 Q ss_pred ccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhccccccccccccccccccccccccc Q lcl|NC_012530. 463 SAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDG 540 (559) Q Consensus 463 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 540 (559) .+... +... .......... ....+.+.......+.....+..-.. .-+ ..--|..|| T Consensus 299 ~~~~~-~~~~--------~~~~~~~~~~-------~~~~~~~a~~~~a~~~~~~~~~~~~~-~~~----~~~~~~~~~ 355 (355) T protein:vir:78 299 DAAAA-KAAG--------RRRAKRLPGQ-------RQGAALPSRSPRADPPRRRGPLRRRP-RHP----AHRRCAPDG 355 (355) T ss_pred CCccc-cccc--------cccccccCCc-------cccccccccCCCCCChhhhHHHHHHh-hcc----ccCCCCCCC Confidence 21100 0000 0000000000 00000000000000000000000000 000 012233333 No 140 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=99.07 E-value=1.1e-09 Score=69.61 Aligned_cols=482 Identities=12% Similarity=0.117 Sum_probs=197.1 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccc--cc-cccc-cccccc--cccc-cccccccccCCCCCcccH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGV--DR-AYTE-PVDGNL--MFST-LEDTSIVPKPSPIAFGRI 73 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr--~~-a~~~-~~~~~~--~~~~-~~~~~~~~~p~~~~~~~~ 73 (559) |--|-.+ ...+.....+...+.++|- ++ +.+. ++-.+. ++.. +....+. .. ...+. T Consensus 1 ~~~~~~w-------------~~~de~~~~~~~~~~~~~~~~p~~~dG~s~i~~~~~~~~~~~~~~~~~~-gg---~~~n~ 63 (533) T protein:vir:58 1 MPSLEKY-------------KKLNEAVNFTNFLSPMYGMGAPHGAGGSSMIPINMYHPFATAGYASRFY-GG---IEFNR 63 (533) T ss_pred CCCcchh-------------hhhhHHHHHHHhhchhhcccCccCCCCCccccCCCCcchhhhhhhhhhh-cc---ccccH Confidence 5444443 3333333333333333332 11 1111 111010 0000 0001110 00 11232 Q ss_pred HHHH----HHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCCh Q lcl|NC_012530. 74 TDVL----RQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIR 149 (559) Q Consensus 74 ~~~~----~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~ 149 (559) .++. ..++.+|.|..+|..|++.+. ..+.+....+|.+.+. +.....+.++..+.. T Consensus 64 ~eLI~~YR~ma~~~pEVd~AideIvneai------v~d~~~~pV~v~l~~~-e~s~~iK~kI~~lld------------- 123 (533) T protein:vir:58 64 FFLYDMYDRMDYTDPLISTVLDIIADECT------IPNENGNIVDVVTKDI-ELAKAILSYLDYVIN------------- 123 (533) T ss_pred HHHHHHHHHhhccCcchhhHHHhhhceee------EecCCCceeEeecccc-cccHHHHHHHHHHhc------------- Confidence 3333 333467999999999988753 3344444455554332 233333333322111 Q ss_pred hhHHHHHHHHHHHHHHcCCcceEEEEC-CCCcEEEEEEecCceEEEEecCcccccc--cceEEEEEecCceeeeecccce Q lcl|NC_012530. 150 DDFTSFLRKLVRDTYTYDQVNYENTYD-SNGRLSHTRMVDPTTIYFANDEHGHRRT--RGKIYRQYIDNKVRGSFTADEM 226 (559) Q Consensus 150 ~~~~~f~~~~v~d~ll~Gna~~~i~rd-~~G~~~~L~~l~p~~V~~~~~~~g~~~~--~~~~y~~~~~~~~~~~~~~~ev 226 (559) |..--..+++.+++.|..|..++-+ ..+-+.+|.+|||.+|+.+....+...+ .+..|.....+......+.+.| T Consensus 124 --f~~~~~~~fR~WYVDGriy~Hkiik~~k~GI~elr~lDPr~i~~vr~~~t~~eyyvy~~~~~~~~s~~~~~kI~~daI 201 (533) T protein:vir:58 124 --IEKNAYPIIRNMIKYGDMFLHILEKGSDGTIEKFQVVSPYIFSKRYNPETDTWYYVITDVYRNVVSGYFNEDIPEEDV 201 (533) T ss_pred --chhhhhHHHHhhhhcceeEEEeccCCcccchhhheecCCeeeEEEEeeccceEEEeecccccccccCccccccchhhe Confidence 1111233455667899999988643 4556889999999999887754332110 0111111112223345677888 Q ss_pred EEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhc Q lcl|NC_012530. 227 GMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSS 306 (559) Q Consensus 227 i~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~ 306 (559) +|+.... ....+.|++|-|..|...+....-.+...--|=-.-+.-+-|+-++-+..+..-.++-+..+...+++.+- T Consensus 202 ~y~~SGl--~d~~~~~iisyLhkAiKp~NQLkmiEDAlVIYRisRAPeRRvFYIDVGNlpk~KAeqYl~~im~k~kNklv 279 (533) T protein:vir:58 202 IHFSHKI--DTNFFPYGRSYLESARAIWNQLRLMEDALMLYRVVRSVDRRVFYVDVGNVPPDKINEYLTNIAMQYKRDYW 279 (533) T ss_pred eeeeecc--ccCCCCceehhhhHHHHHHHHHHHHHHHHHHHhhcCChhheEEEEeecCCCccCHHHHHHHHHHhcccceE Confidence 8886533 23356788899999987777766666655544333333344665555444433333333444444443321 Q ss_pred Cccccccc----------ccccC---------CceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHHhcccccccccc Q lcl|NC_012530. 307 GINGAYRI----------PMITA---------EDAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATG 367 (559) Q Consensus 307 G~~nag~~----------~vl~~---------g~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~ 367 (559) =..+.|.+ .++++ -+.++..|.- ..+--++-.+|..+.+.++++||.+.|+.... + T Consensus 280 YDa~TGev~ddrk~m~~~sMlEDyWLpRReGgrgTEI~TLpG-g~lgemeDV~YF~kkLy~ALnVP~sRl~~e~~--f-- 354 (533) T protein:vir:58 280 VRNNQNQFLGIDNYFSIESILKDYFIPRRGDRRAVEIDILQG-SKVDLAEDVEYMLNRLISALKVPKAFIGYEGD--V-- 354 (533) T ss_pred EeccCCeEeeccchhhhhhhHhhhcccccCCCccceeeecCC-CCCCcHHHHHHHHHHHHHHhCCCeeecCCCCC--C-- Confidence 11122222 12211 1234443331 22445677888999999999999999986442 1 Q ss_pred ccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhcccccc--CccceeeecchhhhhHH-------HHHHHHHHHHc Q lcl|NC_012530. 368 NKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQIL--GDNYMLEFVGGDTRSQQ-------DKLKSVQLELQ 438 (559) Q Consensus 368 ~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~--~~~~~~~f~~l~~~d~~-------~~~~~~~~~~~ 438 (559) +.++.+ +-.+ .+ ....|.-+..++.+.|.+.|+.... ...+.|+|.....-.+. .|+.++..+ . T Consensus 355 gr~~eI---tRDE-iK--F~KFI~rLR~rF~~ll~~qLilk~iit~eew~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~~-d 427 (533) T protein:vir:58 355 NAKNTL---ATQD-IK--FNNTIKRIQGFFVEELERMVRMNKEFADQDFRLVMNRSNSIVEGERFAVIEQRIGIAERL-K 427 (533) T ss_pred ccchhh---hHHH-HH--HHHHHHHHHHHHHHHHhcccccccCcchhheeeeeeccchHHHHHHHHHHHHHHHHHHHh-c Confidence 111111 1122 22 3445677888888888888764321 23456666544332222 233322211 1 Q ss_pred CCCCHHHHH-HHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhcc Q lcl|NC_012530. 439 TATTVNDYR-EKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQ 517 (559) Q Consensus 439 ~~~T~NE~R-~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 517 (559) .++.-.=|| ..|.+.- |. ..+....+.+........+.. ..+...-...+...++.+... T Consensus 428 pyvgk~yi~k~ILr~td----ei------------~~q~e~ie~E~~~~~~~~~~~---~~e~~~~~~~~~~~~p~~~~~ 488 (533) T protein:vir:58 428 GWVREDWIYSNILQIPY----DL------------KPQEEVAEAAGGGGLFDTGGF---GEETTPADFLGERGSPIESPR 488 (533) T ss_pred chhhHHHHHHHHhcCCh----hh------------hHHHHHHHHhhcCCCCCCCCc---ccccCCcccCccccCcccCCC Confidence 111111111 1222210 00 000000000000000000000 000000000000000000000 Q ss_pred ccccccccccccccccccccccccccccchhhhhhccCCCCC Q lcl|NC_012530. 518 EGYTGKDAKPSGKDNQQGVGKDGQLKNKKNTNSYKQGGSSKK 559 (559) Q Consensus 518 ~~~~~~~~~~~g~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 559 (559) ...... .|..-....+..-..++.- --...+-|.+++ T Consensus 489 ~~~~~~----~~~~~~~~~~~~~~~~~a~-~~~~~~~g~~~~ 525 (533) T protein:vir:58 489 GRTEFD----FGTEGGEELGGELNLGGAF-EEFEEETGGGEE 525 (533) T ss_pred ChhhHh----cccCCcccccccccccccc-hhhhhhcCCccc Confidence 000000 0000000000000000000 000011122222 No 141 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=99.07 E-value=1e-09 Score=69.82 Aligned_cols=382 Identities=12% Similarity=0.014 Sum_probs=168.5 Q ss_pred ccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHh Q lcl|NC_012530. 61 IVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIER 140 (559) Q Consensus 61 ~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~ 140 (559) +++ ... ...+..+++. ....+...||+++++.+- ..+|.. .|.. ..+ .+.+++.+ T Consensus 1 ~l~--~~~-~~~~~~~~~~-~v~n~~~~ivd~~~~~l~-----------~~gf~~--~d~~-----~~~---~~~~i~~~ 55 (434) T protein:vir:98 1 MLP--KNA-EQAFLDFQRK-ARTNFCGLIANASVHRLL-----------ALGVTG--PDGE-----PDT---RASRWWQA 55 (434) T ss_pred CCC--CCc-cHHHHHhhhh-hhccchHHHHHHHHhhhc-----------cCceec--CCCc-----hHH---HHHHHHHh Confidence 221 111 1122222222 234567777877766432 123332 1211 111 22233322 Q ss_pred cCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCc-----E-EEEEEecCceEEEEecCcccccccceEEEEEec Q lcl|NC_012530. 141 MGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGR-----L-SHTRMVDPTTIYFANDEHGHRRTRGKIYRQYID 214 (559) Q Consensus 141 ~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~-----~-~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~ 214 (559) ..|......+..+.+++|.+|..+.++..+. + ..+.+++|..|.++.|...........|+.... T Consensus 56 ---------N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~ 126 (434) T protein:vir:98 56 ---------NRLDSRQKLVWRMAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHNDI 126 (434) T ss_pred ---------cChhHHHHHHHHHHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEecc Confidence 1234566678889999999999998765442 2 237788999988887754332222222222111 Q ss_pred C-ceee-e---------------------------------------ecccceEEEecccCCCccCCcccccHHHHHHHH Q lcl|NC_012530. 215 N-KVRG-S---------------------------------------FTADEMGMFIRNPRSDILSGGYGLSELEMGLRE 253 (559) Q Consensus 215 ~-~~~~-~---------------------------------------~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~ 253 (559) + .... . +..==|++|..|+.. ...|.|-++..... T Consensus 127 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~----~~~g~sd~e~vi~l 202 (434) T protein:vir:98 127 DGFGYARVFFDDTSFPYRTRERTGARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDL----GEDPEPEFAGVLDI 202 (434) T ss_pred CCceEEEEEEeCcEEEEEEeeccccccccccccceecccccccccCCCCccceEEeccCCCc----CcCCcchhhhHHHH Confidence 1 0000 0 000013444433321 12577877777766 Q ss_pred HHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccccchh Q lcl|NC_012530. 254 FISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQAEDM 333 (559) Q Consensus 254 i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~~D~ 333 (559) ++....+..-......-.+.|.-+|. +. ...+...+. ......++-... ..+++.+++.++.++..+.. .++ T Consensus 203 iDa~~~~~s~~~~~~~~~a~p~~~i~--G~-~~~~~~~~~-~~~~~~~~~~~~---~~~~i~~~~~~~~~~~q~~~-~~~ 274 (434) T protein:vir:98 203 QDRVNLGILNRMAASRFSGFRQKWIK--GH-KFAKRTDPA-TGMTVVDQPFVP---SPSAVWASEGENTQFGQLDA-TDL 274 (434) T ss_pred HHHHHHHHHHHHHHHHHhcchhhhhc--CC-Ccccccccc-cccchhhhhhhc---cccccccCCCCCceEEEecC-cch Confidence 66666555544444444455544442 11 111111111 111111221111 12345555556677766643 233 Q ss_pred -HHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHH---HHHHHHHHHHhhHHHHHHHHHHHhhccc-c Q lcl|NC_012530. 334 -QFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQ---NKIDASKSKGLMPLLDMIAKNLTNGIIR-Q 408 (559) Q Consensus 334 -qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~---~~~~~~~~~~l~P~~~~ie~~ln~~L~~-~ 408 (559) .|++..+..+..|+..=++|++.+|....+.+ + .....-+.... +..+..+...|+-++..+. .+.. . T Consensus 275 ~~~~~~l~~~i~~~~~~~~~p~~~~~~~~~n~S-g-~Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~-----~~~g~~ 347 (434) T protein:vir:98 275 SGFLKEHASDVRDMLTISQTPTYLYATDLVNIS-A-DTIGALDILHVAKVREHIASFSEGLESVLALAA-----AQAGVP 347 (434) T ss_pred HHHHHHHHHHHHHHhcccCCCHHHhccccCChH-H-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhcCCC Confidence 37888889999999999999999984211110 0 00000000000 0011111112222221111 1111 1 Q ss_pred ccCccceeeecchhhhhHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceeccccccccccccccccccccc Q lcl|NC_012530. 409 ILGDNYMLEFVGGDTRSQQDKLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRL 488 (559) Q Consensus 409 ~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 488 (559) .....+.+.|......+..+.++++.++...+++..-+++++|+++-+ +..+ .+. .+.+....+. T Consensus 348 ~~~~~~~v~w~~~~~~s~~~~ada~~kl~~~g~~~e~~~~~lg~~~~e----------~~r~---~~e--~~~~~~~~~~ 412 (434) T protein:vir:98 348 EDYTEAEVRWANPAHVTMAVKADAATKLKSIGYPLDVIAEELDESPAR----------VRRI---VAG--AASQALLAAS 412 (434) T ss_pred hhheeeeEEecCCCCCCHHHHHHHHHHHHhcCCcHHHHHHhCCCCHHH----------HHHH---HHH--HHHHHHHHHh Confidence 123357778888888899999998888776567777788888875421 0111 100 0000000000 Q ss_pred ccccccCCCCCCCCCCCCccccccchhccccccccccccccccccccccccc Q lcl|NC_012530. 489 TQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDG 540 (559) Q Consensus 489 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 540 (559) .. ...+.+.... ++. ++...|| T Consensus 413 ~~--~~~~~~~~g~---~~~-------------------------~~~~~dg 434 (434) T protein:vir:98 413 LL--PAPGAPSAGN---VPD-------------------------SGGAVDG 434 (434) T ss_pred hh--ccCCCCCCCC---CCc-------------------------ccCCCCC Confidence 00 0000000000 000 0000111 No 142 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=99.05 E-value=1.5e-09 Score=69.02 Aligned_cols=430 Identities=10% Similarity=0.044 Sum_probs=172.6 Q ss_pred Ccchhhhccc---------cccCCcc-hHHHHHHHHHHHHH------Hhhhhcccccccccccccccccccccccccccc Q lcl|NC_012530. 1 MGIFDRFRTK---------FYTDDPN-AFFKHIDSKIANDT------ASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPK 64 (559) Q Consensus 1 ~~~~~~~~~~---------~~~~~~~-~~~~~~~~~~~~~~------~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~ 64 (559) |++++-++.. -+++.++ +.+..+-+....+. +..=-.|++... . .. ... T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i----~-----~~------~~~ 70 (481) T protein:vir:10 6 INNINTKFSPLANDDFVVSDLAELLKEENLRNFISRHQTEQVPRLEMLESYYLNRNTDI----L-----AG------ERR 70 (481) T ss_pred eehhchhcccccCceeeeecchhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc----c-----cC------ccc Confidence 8888866522 2222222 22222222211110 010011111100 0 00 000 Q ss_pred CC-CCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCC Q lcl|NC_012530. 65 PS-PIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGV 143 (559) Q Consensus 65 p~-~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p 143 (559) +. ...+.+. + ...++...+|+..+.-+. |.+..+...+ .+..+.+..++... T Consensus 71 ~~~~~~~~~~----k--i~~n~~~~ivd~~~~~l~-----------g~~~~~~~~d--------~~~~~~l~~~~~~n-- 123 (481) T protein:vir:10 71 LQKYGDKADH----R--AVHNYAKYVSRFIVGYLT-----------GNPITITHQD--------NQTNDKIIELNDLN-- 123 (481) T ss_pred cccccccccc----e--eecchHHHHHHHHHhhhc-----------cCCceEecCC--------hhHHHHHHHHHHhc-- Confidence 00 0000000 0 123455556665554332 1122222211 11223455555431 Q ss_pred CCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccc-cccceEEEEEecCc-----e Q lcl|NC_012530. 144 DYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHR-RTRGKIYRQYIDNK-----V 217 (559) Q Consensus 144 ~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~-~~~~~~y~~~~~~~-----~ 217 (559) .|..+...+..+.+++|.+|..+.++.+|++ .+..++|..+.++.+..+.. ....++|+...+.. . T Consensus 124 -------~~~~~~~~~~~~~~~~G~~~~~~~~d~dg~~-~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~~~ 195 (481) T protein:vir:10 124 -------DADEVNSDLALNLSIYGRAYEIVYRDFEDRD-TFKVLDPKSTFVVYDQTLDKKVVAGVRYFEKQDKDKVPVQH 195 (481) T ss_pred -------ChhHHHHHHHHHHHhcCeEEEEEEeCCCCeE-EEEEEcccceEEEEcCCCCCceEEEEEEEEEeeCCCceEEE Confidence 2345677888899999999999999999886 47889999998887765421 22223333222211 1 Q ss_pred eeeecccceEEEecc-----------------cCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEe Q lcl|NC_012530. 218 RGSFTADEMGMFIRN-----------------PRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLV 280 (559) Q Consensus 218 ~~~~~~~evi~~~~n-----------------~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~ 280 (559) ...+..+.+.++... |.-......+|.|.++.+...+........-..+.+...+.|-.++.- T Consensus 196 ~~~y~~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g 275 (481) T protein:vir:10 196 VEVYTTDKIYYIEIKGGTYHRVEEVEHYYNDVPIIEYLNDQFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIG 275 (481) T ss_pred EEEEecCeEEEEEecCCceeecccccccCCceeEEEeecCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeec Confidence 123344444433221 100011123466766555555544333333333334444455555432 Q ss_pred cCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc-cC--CceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHHh Q lcl|NC_012530. 281 KPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI-TA--EDAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAEI 357 (559) Q Consensus 281 ~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl-~~--g~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~l 357 (559) .. ..+.+..+.++..- .+.. ..+.... .+ ++++|..... ....+.+..+...+.|...-++|.... T Consensus 276 ~~-----~~~~~~~~~~~~~~--~~~~---~~~~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~ 344 (481) T protein:vir:10 276 NV-----DLDSEDAKAFRDAN--MIHL---EPGTNANGSEGKAEVKYVYKQY-DVAGVEAYKKRLQNDIHKYTNTPDLND 344 (481) T ss_pred Cc-----CCCccchhhhhhcc--ceec---cccccccCCCCCcceeEEeecC-CHHHHHHHHHHHHHHHHHHhCCccccc Confidence 11 11222223332210 0000 0011111 12 2334433222 345577778888999999999997655 Q ss_pred ccccccccccccccchhhhhHHH---HHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHH Q lcl|NC_012530. 358 GMQNRGGATGNKSNSLNESNNQN---KIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQ 434 (559) Q Consensus 358 g~~~~~~~~~~~~~~~~~an~~~---~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~ 434 (559) +-...+. ++ ......++.... ..+..+..+|+=++..+...++..-........+.+.|......+..+.++.+. T Consensus 345 ~~~~~n~-Sg-~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~a~~~~ 422 (481) T protein:vir:10 345 EQFSGVQ-SG-ESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLTGLKQHNYAELTITFTPNLPKSMMESINAFN 422 (481) T ss_pred ccccccc-HH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccceeeEEeCCCCCcCHHHHHHHHH Confidence 4322111 00 000000011111 111222233333333333222221111222345788888888888888888876 Q ss_pred HHHcCCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccch Q lcl|NC_012530. 435 LELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQ 514 (559) Q Consensus 435 ~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (559) .+ .|+++.-.+.+++++ ++ |. ...+..+.. +.............+ +..+...++++++ T Consensus 423 kl-~g~is~et~~~~l~~--i~--d~------~~E~~ri~~------E~~~~~~~~~~~~~~--~~~~~~~~~dd~~--- 480 (481) T protein:vir:10 423 AL-SGGVSESTRLSLLDF--ID--NP------KEELEKMQE------EEAQREKQADKRGYG--EAFENHLNVDDSN--- 480 (481) T ss_pred HH-hccCChHHHHHhCCC--CC--CH------HHHHHHHHH------HHHHHHhhhhhccCC--ccCCCCCCCCCCC--- Confidence 65 466787667776654 21 10 001111100 000000000000000 0000000011111 Q ss_pred hcccccccccccccc Q lcl|NC_012530. 515 QNQEGYTGKDAKPSG 529 (559) Q Consensus 515 ~~~~~~~~~~~~~~g 529 (559) | T Consensus 481 --------------g 481 (481) T protein:vir:10 481 --------------G 481 (481) T ss_pred --------------C Confidence 1 No 143 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=99.00 E-value=4e-09 Score=66.65 Aligned_cols=420 Identities=8% Similarity=0.015 Sum_probs=169.5 Q ss_pred CcchhhhccccccCCcchHHHHH------HHHHHHHHHhhh-------------hccccccccccccccccccccccccc Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHI------DSKIANDTASKA-------------LNGVDRAYTEPVDGNLMFSTLEDTSI 61 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~-------------~~gr~~a~~~~~~~~~~~~~~~~~~~ 61 (559) .|+|+|=.++-.+++ .+..| .++.+.+...+- -.|++....++. . .. . .. T Consensus 2 ~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r~~-----~-~~-~-~~ 70 (474) T protein:vir:95 2 FNIIRMPWDKPYGEE---VVEQLKPQFETQEEMIIRLIDDHRKQLDKITVGQRYYDKDNDIVKQMK-----K-VD-V-YG 70 (474) T ss_pred cceeecCCCCchhhH---HHHhhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccCchhcccc-----c-cc-c-cc Confidence 344444333222211 11111 111111111110 111111110000 0 00 0 00 Q ss_pred cccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhc Q lcl|NC_012530. 62 VPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERM 141 (559) Q Consensus 62 ~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~ 141 (559) . ...... ..-...+....+|+..+.-+. |.+..+...+ .+..+.+..|+.+ T Consensus 71 ~---~~~~~~------~~ki~~n~~~~Ivd~~~~~l~-----------g~p~~~~~~d--------~~~~~~l~~~~~n- 121 (474) T protein:vir:95 71 N---IDYDKP------DWRITTNFHQNLVDQKVSYVA-----------SKPVTYSCED--------ESVLKIIHDVLDT- 121 (474) T ss_pred c---cccccc------cceeccchHHHHHHHHHhhhc-----------cCCceeccCc--------hHHHHHHHHHHhc- Confidence 0 000000 000113444455555544332 2222222111 1122334444431 Q ss_pred CCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcc-cccccceEEEEEecCceeee Q lcl|NC_012530. 142 GVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHG-HRRTRGKIYRQYIDNKVRGS 220 (559) Q Consensus 142 ~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g-~~~~~~~~y~~~~~~~~~~~ 220 (559) .|......+..+.+.+|.+|..+.++.+|++. +..++|..+.++.+... ......++|+.......... T Consensus 122 ---------~~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~~-i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~ 191 (474) T protein:vir:95 122 ---------RWDNKLIDILTATSNKGIDWLQVYINENGEMK-LFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEF 191 (474) T ss_pred ---------cHHHHHHHHHHHHhhcCcEEEEEEecCCCceE-EEEEcccceEEEEcCCCCCceEEEEEEEEEcCeeEEEE Confidence 23344556778899999999999899888764 77889999988776431 12222344443333333334 Q ss_pred ecccceEEEecc----------------------cCC-----CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_012530. 221 FTADEMGMFIRN----------------------PRS-----DILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGT 273 (559) Q Consensus 221 ~~~~evi~~~~n----------------------~~~-----~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~ 273 (559) +..+.+.++... +.. .......|.|-++-+...++....+..-..+.+...+. T Consensus 192 y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~ 271 (474) T protein:vir:95 192 WTDTTVTYYVLENGGLIPDYYYGANHIQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVE 271 (474) T ss_pred EeCCeEEEEEEcCCccccccccCcccccccccccCCCccceEeecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 444444433211 000 00112347777766666666555444444445555566 Q ss_pred CceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc-CCceeeeeccccchhHHHHHHHHHHHHHHHHhCC Q lcl|NC_012530. 274 TKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT-AEDAKFVSMTQAEDMQFQSWLNYLINIICALVAM 352 (559) Q Consensus 274 p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~-~g~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgV 352 (559) |-.++. +.. .. ..+.+.... ..+++..+. +++++|..... ....+....+...+.|+..-++ T Consensus 272 p~lv~~--g~~----~~--~~~~~~~~~--------~~~~~i~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~ 334 (474) T protein:vir:95 272 LIYILK--GYE----GQ--DLEEFMRGL--------KYYKAINVDGDGGVETIQVEV-PVSSTKEYIDLMRAYIMEFGQG 334 (474) T ss_pred ceeeee--cCC----cc--cchhhhhhh--------hccceeeccCCCceeEEeecC-CHHHHHHHHHHHHHHHHHHhCC Confidence 654442 211 11 111122111 112322232 33444443332 3445677778888999998899 Q ss_pred CHHHhccccccccccccccch--hhhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHH Q lcl|NC_012530. 353 DPAEIGMQNRGGATGNKSNSL--NESNN---QNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQ 427 (559) Q Consensus 353 Pp~~lg~~~~~~~~~~~~~~~--~~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~ 427 (559) |..--+ +..++.++... -+... .+..+..+..+|+-++..|...+.. ......+.+.|+.....|.. T Consensus 335 p~~~~~----~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~----~~d~~~i~v~f~~~~p~d~~ 406 (474) T protein:vir:95 335 VDFQTD----KFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNL----KMDVKDIEISFNFNRMMNDA 406 (474) T ss_pred cccccc----cccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CcccceeeEEeccCCCcCHH Confidence 853211 11111110000 00011 1122234445555555555443321 22345577888877788888 Q ss_pred HHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCc Q lcl|NC_012530. 428 DKLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPP 507 (559) Q Consensus 428 ~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (559) +.++.+.. .|.|+...+..+++. ++. . ...+..+.. +................+..+...++ T Consensus 407 e~a~~~~~--~g~iS~et~i~~l~~--v~d--~------~~E~~ri~~------E~~~~~~~~~~~~~~~~d~~~~~~~~ 468 (474) T protein:vir:95 407 EQSQIIAQ--SQYLSRETLVKSSPL--VDD--Y------KAELERIEQ------EQMEYNKQLPNLDDGGADGAQQQERS 468 (474) T ss_pred HHHHHHHh--cCCCchHHHHHhCCC--CCC--H------HHHHHHHHH------HHHHHHhcccccccccCCCCcCCCCC Confidence 77776544 366888777777654 211 0 011111111 11111111111001011111111111 Q ss_pred cccccc Q lcl|NC_012530. 508 SSSNSF 513 (559) Q Consensus 508 ~~~~~~ 513 (559) .+.+++ T Consensus 469 ~~~~~~ 474 (474) T protein:vir:95 469 NDKESE 474 (474) T ss_pred ccCCCC Confidence 111111 No 144 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=98.97 E-value=8.6e-09 Score=64.80 Aligned_cols=437 Identities=11% Similarity=0.014 Sum_probs=176.5 Q ss_pred Ccch----hhh--ccccccCCcchHHHHHHHHHHHHHHhhh-----hccccccccccccccccccccccccccccCCCCC Q lcl|NC_012530. 1 MGIF----DRF--RTKFYTDDPNAFFKHIDSKIANDTASKA-----LNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIA 69 (559) Q Consensus 1 ~~~~----~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~ 69 (559) |-=. |.| |..-++++-.+.+.+|.+.+..+...-. -.|++.. ...+.... T Consensus 1 ~~~~~~~~~~~~~~~~~l~~~e~~~i~~L~~~~~~~~~r~~~l~~YY~G~~~i-------------------~~~~~~~p 61 (504) T protein:vir:99 1 MTEETTSASKFTFRIPELNDDVVDKVNGLYQQLVDRTPRNLLRASFYDGKYAI-------------------RQIGNLIP 61 (504) T ss_pred CCccCCcccccccccCCCCHHHHHHHHHHHHHHHHHhHHHHHHHHHHhccccc-------------------hhcccccc Confidence 3221 222 1122333333445555554433221111 1122110 00111000 Q ss_pred cccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCCh Q lcl|NC_012530. 70 FGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIR 149 (559) Q Consensus 70 ~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~ 149 (559) .++++......+..-||+++++.+. ..||.+- +..+ ....+..+... T Consensus 62 ----~~~~~~~~v~n~~~~iVd~~a~rl~-----------~~Gf~~~--d~~~-------~~~~l~~i~~~--------- 108 (504) T protein:vir:99 62 ----PEYLRTATVLGWSAKAVDTLARRCN-----------LESFVWP--DGDY-------GSIGGPDVWDE--------- 108 (504) T ss_pred ----HHHHHHhhccCcHHHHHHHHHhhhc-----------cceeeCC--CCCh-------hhHHHHHHHHh--------- Confidence 1222333445666677777776542 1234321 1111 11123333332 Q ss_pred hhHHHHHHHHHHHHHHcCCcceEEEECCCCcEE-EEEEecCceEEEEecCcccccccceEEEEEecC-ce--eeeecccc Q lcl|NC_012530. 150 DDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLS-HTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDN-KV--RGSFTADE 225 (559) Q Consensus 150 ~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~-~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~-~~--~~~~~~~e 225 (559) ..+......+..+.++||.+|+.|..+.+|.+. .+.+++|..+.++.|...........|+..... .. ...+.++. T Consensus 109 N~ld~~~~~~~~~a~iyG~af~~v~~~~d~~~~~~I~~~sP~~~~~iyD~~~~~~~~a~~~~~~d~~g~~~~~~~y~~~~ 188 (504) T protein:vir:99 109 NFFATKANNAMVSSLIHGPAFLINTEGGAGEPDSLIHVKSAMQATGEWNSRRNAMDSLLSITSRDAEGHPTGIALYEDGV 188 (504) T ss_pred cChhhHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceeEEEEeCCCCceeEEEEEEEecCCCeEEEEEEEcCCc Confidence 123345667888999999999999988888764 577889999987776543322222222211111 11 11223333 Q ss_pred ------------------------eEEEecccCCCccCCcccccHH----HHHHHHHHHHHHHHHHHHHHHHhcCCCceE Q lcl|NC_012530. 226 ------------------------MGMFIRNPRSDILSGGYGLSEL----EMGLREFISHENTELFNDRFFTHGGTTKGI 277 (559) Q Consensus 226 ------------------------vi~~~~n~~~~~~~~~~G~Spl----~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi 277 (559) |+++..++. ....+|.|.| ..+.+++...+.-......||.. |.-+ T Consensus 189 ~~~~~~~~~~~~~~~~~~~~~gvPvV~~~n~~~---~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~---p~r~ 262 (504) T protein:vir:99 189 TVTADMDDDGDWHADVRTHKLGVPVEVLPYKPR---EDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSF---PQLI 262 (504) T ss_pred EEEEEEcCCceeeeccccCCCCcceEEeccccc---CccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcc---hhhh Confidence 333332222 1345677743 44555555444444444555544 2222 Q ss_pred EEecCccCCccCCHHHHHHHHHHHHHHhcC---cccccccccccCCceeeeeccccchhH-HHHHHHHHHHHHHHHhCCC Q lcl|NC_012530. 278 LLVKPSPSVTNTSMRALEDFKRHWTATSSG---INGAYRIPMITAEDAKFVSMTQAEDMQ-FQSWLNYLINIICALVAMD 353 (559) Q Consensus 278 l~~~~~~~~~~~~~e~~~~l~~~~~~~~~G---~~nag~~~vl~~g~~~~~~ls~~~D~q-f~e~~~~~~~~Ia~~fgVP 353 (559) |. + ..+.+...+.- .-...|+..... ........+....+.++..+.. .+++ |++..+..+..|+..=++| T Consensus 263 i~--G-~~~~~~~~~d~-~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~q~~~-~~l~~~~~~l~~~i~~~a~~t~~P 337 (504) T protein:vir:99 263 LL--G-ADAKNFRNKDG-SMKPAWQIALARVFALPDDEDEPDAARARADVKQFPA-SSPQPHIEMLEQIAMMFSGETSIP 337 (504) T ss_pred hc--c-CCccccccccc-cccchhhhhhhhhhcCCCccccccccCccceeeecCC-CChHHHHHHHHHHHHHHHhhhCCC Confidence 21 0 00111100000 001122222111 1111111111122345544432 3454 8899999999999999999 Q ss_pred HHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhh------ccc-----cccCccceeeecchh Q lcl|NC_012530. 354 PAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNG------IIR-----QILGDNYMLEFVGGD 422 (559) Q Consensus 354 p~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~------L~~-----~~~~~~~~~~f~~l~ 422 (559) ++.+|+......+ ++. -+........ ..+.-..+.+...|.+. +.. ......+++.|.... T Consensus 338 ~~~lG~~~~~n~s---Sa~----Ai~~~~~~L~-~ka~~k~~~f~~~l~~~~rla~~~~~~~~~~~~~~~~~~v~w~d~~ 409 (504) T protein:vir:99 338 VESLGFSNRANPT---SAD----AYIASREDLI-AEAEGATDDWSPAFRRSMIRALAIKNGLDRIPPEWKTIDSKFRSPL 409 (504) T ss_pred HHHhccccccccc---HHH----HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceeEecCCC Confidence 9999976432111 000 0111111111 11111112222222211 111 122345677788888 Q ss_pred hhhHHHHHHHHHHHHc-CCC--CH-HHHHHHhCCCCCCCCCEeeccceeccccccccccccccccccccccccc-----c Q lcl|NC_012530. 423 TRSQQDKLKSVQLELQ-TAT--TV-NDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLE-----S 493 (559) Q Consensus 423 ~~d~~~~~~~~~~~~~-~~~--T~-NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-----~ 493 (559) ..+..+.++++.+++. +.+ .+ .-+.+++|+.|-+ +..+...... +......+... . T Consensus 410 ~~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg~~~~e----------i~r~~~e~~~-----~~~~~~~~~l~~~~~~~ 474 (504) T protein:vir:99 410 YLSKAAQADAGAKMLGAGPEWLKETEVGLELLGLTPQQ----------AKRALAERRR-----ASSVSIIEALNRRQQEA 474 (504) T ss_pred ccCHHHHHHHHHHHHhhccccccchHHHHhhcCCCHHH----------HHHHHHHHHH-----HhhHHHHHHHhcccCCC Confidence 8888899988877665 332 22 3355677875432 0000000000 00000001000 0 Q ss_pred cCCCCCCCCCCCCccccccchhccccccccccccccccccccccccc Q lcl|NC_012530. 494 ALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDG 540 (559) Q Consensus 494 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 540 (559) ..+...+.++..++ ... +..+....+.++| T Consensus 475 ~~~~~~~~~~~~e~-----a~~------------~~~~~~~~p~~~~ 504 (504) T protein:vir:99 475 ATAGEDQDQGAGEP-----PAN------------EPPAALGRPTLVG 504 (504) T ss_pred CCCCCCCCcCCCCC-----CCC------------CCCccCCCcccCC Confidence 00101111111111 110 1111122223333 No 145 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=98.97 E-value=8.7e-09 Score=64.77 Aligned_cols=423 Identities=9% Similarity=0.031 Sum_probs=174.3 Q ss_pred Ccchhh------------hccccccCCcchHHHHHHHHHHHH-----HHhhhhccccccccccccccccccccccccccc Q lcl|NC_012530. 1 MGIFDR------------FRTKFYTDDPNAFFKHIDSKIAND-----TASKALNGVDRAYTEPVDGNLMFSTLEDTSIVP 63 (559) Q Consensus 1 ~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~ 63 (559) =|++-. |+..+-.+.+.+.|..+-.....+ .+..=-.|++....++... .. .... T Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~----~~----~~~~ 80 (483) T protein:vir:12 9 GNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPV----DA----TGAV 80 (483) T ss_pred CceeecCcchhhhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc----cc----cccc Confidence 122211 222222222223332222111111 0111112332222111000 00 0000 Q ss_pred cCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCC Q lcl|NC_012530. 64 KPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGV 143 (559) Q Consensus 64 ~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p 143 (559) .+ .+.+ .+ ...+....+|+..+.-+- |.+..+...+ ....+.+..|+.+ T Consensus 81 ~~---~~~~----~k--i~~n~~k~Ivd~~~~~l~-----------G~p~~~~~~d--------~~~~~~l~~~~~n--- 129 (483) T protein:vir:12 81 DP---LKPD----DR--MITNFHANLVDQKVSYIV-----------GKPIAFKHTD--------DEVVKRIDEVLGN--- 129 (483) T ss_pred cc---cccc----cc--cccchHHHHHHHHhhhhc-----------ccCceeccCC--------hHHHHHHHHHHhc--- Confidence 00 0000 00 123455556665554432 2222221111 1122334444421 Q ss_pred CCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcc-cccccceEEEEEecCceeeeec Q lcl|NC_012530. 144 DYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHG-HRRTRGKIYRQYIDNKVRGSFT 222 (559) Q Consensus 144 ~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g-~~~~~~~~y~~~~~~~~~~~~~ 222 (559) .+......+..+.+.+|.+|..+.+|.+|++. +..++|..+.++.+... ......++|+..........+. T Consensus 130 -------~~~~~~~~~~~~~~~~G~~y~~v~~d~d~~~~-i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~y~ 201 (483) T protein:vir:12 130 -------RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFK-LFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWD 201 (483) T ss_pred -------cHHHHHHHHHHHHhhCCeEEEEEEEcCCCceE-EEEEcccceEEEEcCCCCCceEEEEEEEEeecceEEEEEe Confidence 12334455677889999999999999998864 88899999988876332 2222234444333333333333 Q ss_pred ccceEEEec----------------------ccCC-----CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_012530. 223 ADEMGMFIR----------------------NPRS-----DILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTK 275 (559) Q Consensus 223 ~~evi~~~~----------------------n~~~-----~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~ 275 (559) ...+.|+.. |+.. .......|.|-++.....++....+..-..+.+...+.|- T Consensus 202 ~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~ 281 (483) T protein:vir:12 202 KVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELT 281 (483) T ss_pred cCeEEEEEEeCCeeeecccccccccccccccCCCCccceEEecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 333333321 1000 0001224667776666666555544444445555556665 Q ss_pred eEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc-cCCceeeeeccccchhHHHHHHHHHHHHHHHHhCCCH Q lcl|NC_012530. 276 GILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI-TAEDAKFVSMTQAEDMQFQSWLNYLINIICALVAMDP 354 (559) Q Consensus 276 gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl-~~g~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp 354 (559) .+++ +. +.+....++..+. .+++..+ .+++++|..... .+..+....+...+.|+..-++|. T Consensus 282 lv~~--g~------~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~ 344 (483) T protein:vir:12 282 YVLT--NY------DDQELPEFKRLLR--------YYGAIKVSDNGGVDTIQVEV-PVENSKKYLDELYQKIMLFGQAVD 344 (483) T ss_pred eeee--cC------CcccchhHHHhhh--------hccccccCCCCcceEEeecC-CHHHHHHHHHHHHHHHHHHhCCCC Confidence 5543 21 1111112222211 1122222 234455544332 344566777788888988888886 Q ss_pred HHhccccccccccccccchh---hhh---HHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHH Q lcl|NC_012530. 355 AEIGMQNRGGATGNKSNSLN---ESN---NQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQD 428 (559) Q Consensus 355 ~~lg~~~~~~~~~~~~~~~~---~an---~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~ 428 (559) .-.+-. +++.++... +.. -....+..+..+|+-+++.|...+.. ......+.+.|+.....+..+ T Consensus 345 ~~~~~~-----~~n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~~----~~~~~~i~v~f~~~~p~~~~~ 415 (483) T protein:vir:12 345 FSSDKF-----GSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI----KGEHKDVDISFNYNKVANTEL 415 (483) T ss_pred CCcccc-----ccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC----CCccceeeEEeCCCCCCCHHH Confidence 432211 111111100 011 11222334455555555555444332 223456788898888888888 Q ss_pred HHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCcc Q lcl|NC_012530. 429 KLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPS 508 (559) Q Consensus 429 ~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (559) .++.+..+ .|+|+..-+.++++.- + |. ...+..+.+ +........+......++....+.++. T Consensus 416 ~a~~~~kl-~GiiS~et~~~~~~~v--~--d~------~~E~~ri~~------E~~~~~~~~~~~~~~~~d~~~~~~~~~ 478 (483) T protein:vir:12 416 QVQTAQQS-MGIVSHETVLENHPFV--E--DL------QAELERIEQ------EQMEYNKQLPNLDDGGADGAQQQERSN 478 (483) T ss_pred HHHHHHHH-hccCchHHHHHhCCCC--C--CH------HHHHHHHHH------HHHHHHhhcccccccccCCcccCCCCC Confidence 88887765 4678887777777542 1 10 011111111 111000001000000000000000000 Q ss_pred ccccc Q lcl|NC_012530. 509 SSNSF 513 (559) Q Consensus 509 ~~~~~ 513 (559) +.+.+ T Consensus 479 ~~e~e 483 (483) T protein:vir:12 479 NKESE 483 (483) T ss_pred cccCC Confidence 00000 No 146 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=98.96 E-value=9.8e-09 Score=64.49 Aligned_cols=423 Identities=9% Similarity=0.034 Sum_probs=174.3 Q ss_pred Ccchhhhc-cccccCCcchHHHHHHHHHHHHHHh---------hhhccccccccccccccccccccccccccccCCCCCc Q lcl|NC_012530. 1 MGIFDRFR-TKFYTDDPNAFFKHIDSKIANDTAS---------KALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAF 70 (559) Q Consensus 1 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~ 70 (559) |..-..++ ..|+.+-=.+.+..+-+........ .=-.|++....++... .......+ .+ T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~--------~~~~~~~~---~~ 73 (472) T protein:vir:93 5 QPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPV--------DATGAVDP---LK 73 (472) T ss_pred CCcchhhhhceeeecCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchh--------hccccccc---cc Confidence 33322222 2233331111222222222222111 1111222111111000 00000000 00 Q ss_pred ccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChh Q lcl|NC_012530. 71 GRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRD 150 (559) Q Consensus 71 ~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~ 150 (559) .+. + ...++...+|+..+.-+- |.+..+...+ ....+.+..|+.+ T Consensus 74 ~~~----r--i~~n~~~~ivd~~~~~l~-----------g~~~~~~~~d--------~~~~~~l~~~~~n---------- 118 (472) T protein:vir:93 74 PDD----R--MITNFHANLVDQKVSYIV-----------GKPIAFKHTD--------DEVVKRIDEVLGN---------- 118 (472) T ss_pred ccc----c--cccchHHHHHHHHhhhhc-----------ccCeeeccCC--------hHHHHHHHHHHhc---------- Confidence 000 0 123555566666554432 2222222111 1122334444421 Q ss_pred hHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCc-ccccccceEEEEEecCceeeeecccceEEE Q lcl|NC_012530. 151 DFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEH-GHRRTRGKIYRQYIDNKVRGSFTADEMGMF 229 (559) Q Consensus 151 ~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~-g~~~~~~~~y~~~~~~~~~~~~~~~evi~~ 229 (559) .+......+..+.+.+|.+|..+..+.+|++. +..++|..+.++.+.. .......++|+...+......+....+.++ T Consensus 119 ~~~~~~~~~~~~~~~~G~~~~~v~~d~d~~~~-i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~ 197 (472) T protein:vir:93 119 RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFK-LFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYY 197 (472) T ss_pred cHHHHHHHHHHHHhhcCeEEEEEEECCCCceE-EEEEcccceEEEEcCCCCCceEEEEEEEEeecceeEEEEecCeEEEE Confidence 12344556678899999999999999888764 7789999999887532 222222333333322222222333333222 Q ss_pred ec----------------------ccCC-----CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecC Q lcl|NC_012530. 230 IR----------------------NPRS-----DILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKP 282 (559) Q Consensus 230 ~~----------------------n~~~-----~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~ 282 (559) .. |+.. ......+|.|-++.+...++....+..-..+.+...+.|-.++. + T Consensus 198 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~--g 275 (472) T protein:vir:93 198 VYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLT--N 275 (472) T ss_pred EEecCeeeecccccccccccccccCCCCCcceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEee--c Confidence 11 1000 00112357777777666666555555555555666667755553 2 Q ss_pred ccCCccCCHHHHHHHHHHHHHHhcCcccccccccc-cCCceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHHhcccc Q lcl|NC_012530. 283 SPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI-TAEDAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAEIGMQN 361 (559) Q Consensus 283 ~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl-~~g~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~ 361 (559) . +.+....+...+. .+++..+ .+++++|.... ..+..+....+...+.|+..-++|..-.+-.. T Consensus 276 ~------~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~ 340 (472) T protein:vir:93 276 Y------DDQELPEFKRLLR--------YYGAIKVSDNGGVDTIQVE-VPVENSKKYLDELYQKIMLFGQAVDFSSDKFG 340 (472) T ss_pred C------CcccchhhHHHHh--------hccccccCCCCcceeEeec-CCHHHHHHHHHHHHHHHHHHhCCCCCCccccc Confidence 1 1111112222111 1122323 23445554332 24556777888888999999999864432211 Q ss_pred ccccccccccchh---hhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHH Q lcl|NC_012530. 362 RGGATGNKSNSLN---ESNN---QNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQL 435 (559) Q Consensus 362 ~~~~~~~~~~~~~---~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~ 435 (559) ++.++... +... .+..+..+...|+-+++.|...++. ......+.+.|......+..+.++.+.. T Consensus 341 -----~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~----~~~~~~i~v~f~~~~p~~~~~~~~~~~k 411 (472) T protein:vir:93 341 -----SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI----KGEHKDVDISFNYNKVANTELQVQTAQQ 411 (472) T ss_pred -----cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CcccceeeEEeCCCCCCCHHHHHHHHHH Confidence 11111100 0101 1112233344444444444433321 1234567888888888888888887766 Q ss_pred HHcCCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchh Q lcl|NC_012530. 436 ELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQ 515 (559) Q Consensus 436 ~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 515 (559) + .|+++.--+.++++.-+ |. ...+..+.. +...............++..++..++.+.+ ++ T Consensus 412 ~-~giis~et~l~~l~~~~----d~------~~E~~ri~~------E~~~~~~~~~~~~~~~~d~~~~~~~~~~~~--~e 472 (472) T protein:vir:93 412 S-MGIVSHETVLENHPFVE----DL------QAELERIEQ------EQMEYNKQLPNLDDGGADGAQQQERSNNKE--SE 472 (472) T ss_pred H-hccCchHHHHHhCCCCC----CH------HHHHHHHHH------HHHHHHHhccCcCcccCCCCCCCCCCCccc--CC Confidence 4 46678777777665421 10 011111111 110000000000000011000000000000 00 No 147 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=98.95 E-value=1.1e-08 Score=64.29 Aligned_cols=429 Identities=11% Similarity=0.077 Sum_probs=172.7 Q ss_pred hccccc----cCCcchHHHHHHHHHHHHHHh-----hhhccccccccccccccccccccccccccccCCCCCcccHHHHH Q lcl|NC_012530. 7 FRTKFY----TDDPNAFFKHIDSKIANDTAS-----KALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVL 77 (559) Q Consensus 7 ~~~~~~----~~~~~~~~~~~~~~~~~~~~~-----~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~ 77 (559) .-+++- .++....+.+|.+.+..+... .=-.|++.- ...+... .. .++ T Consensus 1 ~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~r~~~~~~Yy~G~~~i-------------------~~~~~~~-~~---~~~ 57 (485) T protein:vir:10 1 MTAPLPGQEEIEDPAIARDEMVSAFEDSTQNLKTNTSYYEAERRP-------------------EAIGVTV-PI---QMQ 57 (485) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc-------------------hhcCCCC-Ch---hhh Confidence 111111 122222333344333322211 111222110 0000000 00 111 Q ss_pred HHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHH Q lcl|NC_012530. 78 RQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLR 157 (559) Q Consensus 78 ~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~ 157 (559) +......+...||++.++.+- ..+|.+ .+.+ + ....+.+++.. ..+..+.. T Consensus 58 ~~~~~~n~~~~ivd~~~~~l~-----------~~g~~~--~~~~----~---~~~~~~~i~~~---------N~~d~~~~ 108 (485) T protein:vir:10 58 SLLAHVGYPRLYVDSIAERQA-----------VEGFRF--GDAD----E---ADEELWQWWQA---------NNLDIEAP 108 (485) T ss_pred hhhhhcCcHHHHHHHHHhhhc-----------ccceec--CCCc----h---hHHHHHHHHHh---------cCHhHHHH Confidence 111223566667776665431 123332 1111 1 11223344432 13445677 Q ss_pred HHHHHHHHcCCcceEEEECCCCc-------EEEEEEecCceEEEEecCcccccccceEEEEEecCce---eeeecccc-- Q lcl|NC_012530. 158 KLVRDTYTYDQVNYENTYDSNGR-------LSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKV---RGSFTADE-- 225 (559) Q Consensus 158 ~~v~d~ll~Gna~~~i~rd~~G~-------~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~---~~~~~~~e-- 225 (559) .+..+++++|.+|..+.++..+. ...+.+++|..+.++.+..........++++...+.. ...+..+. T Consensus 109 ~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~~~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~ 188 (485) T protein:vir:10 109 LGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRVEPPTRMYAEIDPRIGRVSKAIRVAYDAEGNEIQAATLYTPNDIF 188 (485) T ss_pred HHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEEEccceeEEEEcCCCCceeEEEEEEEeeCCCeEEEEEEEeCCeEE Confidence 88889999999999988875432 2247888999988877654332221222221111111 11122222 Q ss_pred -----------------------eEEEecccCCCccCCcccccHHH----HHHHHHHHHHHHHHHHHHHHHhcCCCceEE Q lcl|NC_012530. 226 -----------------------MGMFIRNPRSDILSGGYGLSELE----MGLREFISHENTELFNDRFFTHGGTTKGIL 278 (559) Q Consensus 226 -----------------------vi~~~~n~~~~~~~~~~G~Spl~----~~~~~i~~~~~~~~~~~~~f~ng~~p~gil 278 (559) |+++..++. ..+.+|.|.++ .+.+++...+.-......||. .|--+| T Consensus 189 ~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~---~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a---~p~~~i 262 (485) T protein:vir:10 189 GWYRVENEWQEWFNNPHGLGVVPVVPIPNRTR---LSDLYGTSEITPELRSMTDAAARILMLMQATAELMG---VPQRLI 262 (485) T ss_pred EEEEcCCceEEeccccCCCCcccEEEeccccc---cCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhc---chHHHH Confidence 233333221 23457887654 344444444433333334443 343333 Q ss_pred EecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccccchhH-HHHHHHHHHHHHHHHhCCCHHHh Q lcl|NC_012530. 279 LVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQAEDMQ-FQSWLNYLINIICALVAMDPAEI 357 (559) Q Consensus 279 ~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~~D~q-f~e~~~~~~~~Ia~~fgVPp~~l 357 (559) .- . ...+...+. +.-...|+.. .+++..++.++.++..+.. .+++ |++..+..+..|+.+=++|++.+ T Consensus 263 ~G--~-~~~~~~~~~-~~~~~~~~~~------~~~i~~~~~~d~k~~q~~~-~~~~~~~~~l~~~i~~~~~~~~~p~~~f 331 (485) T protein:vir:10 263 FG--I-KPEEIGVDP-ETGQTLFDAY------LARILAFEDAEGKIQQFSA-AELANFTNALDQIAKQVAAYTGLPPQYL 331 (485) T ss_pred hc--C-Ccccccccc-cccchhhhhc------ccceeccCCCCceEEeecc-cchHHHHHHHHHHHHHHhcccCCCHHHh Confidence 21 0 000000000 0001112211 2344455556677766543 2343 78888888999999999999999 Q ss_pred ccccccccccccccchhhhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHH Q lcl|NC_012530. 358 GMQNRGGATGNKSNSLNESNN---QNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQ 434 (559) Q Consensus 358 g~~~~~~~~~~~~~~~~~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~ 434 (559) |....+..++ .......... .+..+..+...|+-++..+.. +....-.......+.+.|......+..+.++++. T Consensus 332 g~~~~n~~Sg-~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~-~~~~~~~~~~~~~i~v~w~~~~~~~~~~~ada~~ 409 (485) T protein:vir:10 332 STAADNPASA-EAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYR-MMKGGDVPPDMLRMETVWRDPSTPTYAAKADAAS 409 (485) T ss_pred ccccCchhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HhCCCCCcccceeeeEEecCCCCCCHHHHHHHHH Confidence 8533221110 0000111111 111222233334333333221 1111000112245778888888888899998887 Q ss_pred HHHc-C--CCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccC-CCCCCCCCCCCcccc Q lcl|NC_012530. 435 LELQ-T--ATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESAL-QNPSGTPPTLPPSSS 510 (559) Q Consensus 435 ~~~~-~--~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 510 (559) +++. | +++..-+++.+|+.+-+ ...+....+... .......+...... +.++++++. T Consensus 410 kl~~ag~~~~s~et~~~~lg~~~~~----------~~~~~~~~ee~~---~~~~~~~~~~~~~~~~~~~~~~~~------ 470 (485) T protein:vir:10 410 KLYNGGTGVIPRERARKDMGYSIAE----------REEMRRWDEEEA---AMGLGLIGTMVDPNPTVPGSPSPA------ 470 (485) T ss_pred HHHhccccCCCHHHHHHhCCCCHhH----------HHHHHHHHHHHH---HHHHHHHHHhhccCCCCCCCCCcc------ Confidence 7764 3 36777788888875421 011110000000 00000000000000 000000000 Q ss_pred ccchhcccccccccccccccccccccccccc Q lcl|NC_012530. 511 NSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQ 541 (559) Q Consensus 511 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 541 (559) +++......++.||+ T Consensus 471 ----------------~~~~~~~~~~~~~~~ 485 (485) T protein:vir:10 471 ----------------PAPKPAALESGGDAA 485 (485) T ss_pred ----------------ccccCcCCCCCCCCC Confidence 000000111222222 No 148 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=98.94 E-value=1.2e-08 Score=64.10 Aligned_cols=433 Identities=11% Similarity=0.065 Sum_probs=171.4 Q ss_pred CcchhhhccccccCCcc--hHHHHHHHHHHHHHHhhh-----hccccccccccccccccccccccccccccCCCCCcccH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPN--AFFKHIDSKIANDTASKA-----LNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRI 73 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~-----~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~ 73 (559) |- ++.+..++.-+ +-+..|.+.+-.+..... -.|+++- ...+... + T Consensus 1 ~~----~~~~~~~e~~~~~~~~~~l~~~~~~~~~r~~~l~~YY~G~~~i-------------------~~~~~~~---~- 53 (486) T protein:vir:42 1 MT----APLPGMEEIEDPAVVREEMISAFEDASKDLASNTSYYDAERRP-------------------EAIGVTV---P- 53 (486) T ss_pred CC----CCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcc-------------------hhccccc---c- Confidence 21 12222322221 224444444333221111 1122100 0000000 0 Q ss_pred HHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHH Q lcl|NC_012530. 74 TDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFT 153 (559) Q Consensus 74 ~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~ 153 (559) ..+++.-....+..-||+.+++.+- ..||.+- +... ....+.+++.. ..+. T Consensus 54 ~~~~~~~~v~n~~~~iVd~~~~~l~-----------~~g~~~~--~~~~-------~~~~~~~i~~~---------N~~d 104 (486) T protein:vir:42 54 REMQQLLAHVGYPRLYVDSVAERQA-----------VEGFRLG--DADE-------ADEELWQWWQA---------NNLD 104 (486) T ss_pred hhHhhhhhccchHHHHHHHHHhhhc-----------ccceecC--CCch-------hHHHHHHHHHh---------cChh Confidence 0111111234566666666655431 1233321 1110 11123333332 1233 Q ss_pred HHHHHHHHHHHHcCCcceEEEECCCCc-------EEEEEEecCceEEEEecCcccccccceEEEEEecCce---eeeecc Q lcl|NC_012530. 154 SFLRKLVRDTYTYDQVNYENTYDSNGR-------LSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKV---RGSFTA 223 (559) Q Consensus 154 ~f~~~~v~d~ll~Gna~~~i~rd~~G~-------~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~---~~~~~~ 223 (559) .....+..+++++|.+|+.+.++..|. ...+.+++|..+.++.+..........+|++..++.. ...+.. T Consensus 105 ~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~ 184 (486) T protein:vir:42 105 IEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVEPPTRMHAEIDPRINRVSKAIRVAYDKEGNEIQAATLYTP 184 (486) T ss_pred HHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEecccceEEEEeCCCCCeEEEEEEEEecCCCeEEEEEEEcC Confidence 456678889999999999988765432 2357788999988877644322222222221111111 111222 Q ss_pred cceE-------------------------EEecccCCCccCCcccccHHH----HHHHHHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_012530. 224 DEMG-------------------------MFIRNPRSDILSGGYGLSELE----MGLREFISHENTELFNDRFFTHGGTT 274 (559) Q Consensus 224 ~evi-------------------------~~~~n~~~~~~~~~~G~Spl~----~~~~~i~~~~~~~~~~~~~f~ng~~p 274 (559) +.++ +++.++ ...+.+|.|.++ .+.+++...+.-..-...+| +.| T Consensus 185 ~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~~~n~~---~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~---a~p 258 (486) T protein:vir:42 185 METIGWFRADGEWAEWFNVPHGLGVVPVVPLPNRT---RLSDLYGTSEITPELRSMTDAAARILMLMQATAELM---GVP 258 (486) T ss_pred CcEEEEEecCCcEEeecceecCCCCceEEEecccc---ccCCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhh---cch Confidence 2222 222222 123457887654 33444444433222222333 334 Q ss_pred ceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccccchhH-HHHHHHHHHHHHHHHhCCC Q lcl|NC_012530. 275 KGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQAEDMQ-FQSWLNYLINIICALVAMD 353 (559) Q Consensus 275 ~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~~D~q-f~e~~~~~~~~Ia~~fgVP 353 (559) .-+|.- . .......+. ++-...|+. ..+++.+++.++.++..+.. .+++ |++..+..+..++..=++| T Consensus 259 ~~~i~G--~-~~~~~~~~~-~~~~~~~~~------~~~~~~~~~~~~~~~~q~~~-~~~e~~~~~l~~~i~~~s~~~~~p 327 (486) T protein:vir:42 259 QRLIFG--I-KPEEIGVDS-ETGQTLFDA------YLARILAFEDAEGKIQQFSA-AELANFTNALDQIAKQVAAYTGLP 327 (486) T ss_pred HHHhhc--C-Ccccccccc-ccccchhhh------hhchhcccCCCCceEEeecc-cCHHHHHHHHHHHHHHHhcccCCC Confidence 333321 0 000000000 000111221 12345555556677766542 2443 7888888999999999999 Q ss_pred HHHhccccccccccccccchhhhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHH Q lcl|NC_012530. 354 PAEIGMQNRGGATGNKSNSLNESNN---QNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKL 430 (559) Q Consensus 354 p~~lg~~~~~~~~~~~~~~~~~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~ 430 (559) ++.+|....+..++ ......+... .+..+..+...|.-++..+....+..-. ......+.+.|......+..+.+ T Consensus 328 ~~~fg~~~~n~~Sg-~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~-~~d~~~i~v~w~~~~~~s~~~~a 405 (486) T protein:vir:42 328 PQYLSTAADNPASA-EAIRAAESRLIKKVERKNLMFGGAWEEAMRIAYRIMKGGDV-PPDMLRMETVWRDPSTPTYAAKA 405 (486) T ss_pred HHHhccccCchhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-cccceeeeEEecCCCCCCHHHHH Confidence 99998533221110 0000011111 1122223333444444333222211101 11234577888888888888888 Q ss_pred HHHHHHHc---CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCc Q lcl|NC_012530. 431 KSVQLELQ---TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPP 507 (559) Q Consensus 431 ~~~~~~~~---~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (559) +++.++++ +.++..-+++.+|+-+-+ ...+..+.+... .......+......+..+..+....+ T Consensus 406 d~~~kl~~~~~g~~s~et~~~~lg~~~d~----------~~e~~~~~~e~~---~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (486) T protein:vir:42 406 DAATKLYGNGQGVIPRERARIDMGYSVKE----------REEMRRWDEEEA---AMGLGLLGTMVDADPTVPGSPSPTAP 472 (486) T ss_pred HHHHHHHhcccCCCCHHHHHhcCCCChhH----------HHHHHHHHHHHH---HHHHHHHHHhhcCCCCCCCCCCCCCC Confidence 88876654 335666677777763321 011111000000 00000000000001001110110011 Q ss_pred cccccchhccccccccccccccccccccccccccccccchhhhhhccCCC Q lcl|NC_012530. 508 SSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQLKNKKNTNSYKQGGSS 557 (559) Q Consensus 508 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~k~~~~~~~~~~~~~~ 557 (559) ..+++.. ++ +||.| T Consensus 473 ~~~~~~~----------------------------------~~--~~~~~ 486 (486) T protein:vir:42 473 PKPQPAI----------------------------------ES--SGGDA 486 (486) T ss_pred CCCCccc----------------------------------CC--CCCCC Confidence 1111111 11 11111 No 149 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=98.87 E-value=2.4e-08 Score=62.36 Aligned_cols=435 Identities=11% Similarity=0.027 Sum_probs=170.8 Q ss_pred CcchhhhccccccCCcc-hH-HHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPN-AF-FKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLR 78 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~ 78 (559) |- -.+ -.++.++ +. +++|.+.+......-. +-..|-+ |.-. +...+.... .++.+ T Consensus 1 ~~---~~~--~~~~~~~~~~~~~~l~~~~~~~~~rl~---~l~~Yy~---G~~~--------i~~~~~~~~----~~~~~ 57 (484) T protein:vir:77 1 MT---SPL--QKQENVDPEKAREEMLNLFTERTQDLG---DNTAYYE---SERR--------PDAVGVTVP----QQMQK 57 (484) T ss_pred CC---Ccc--cccCCCCHHHHHHHHHHHHHHHHHHHH---HHHHHHh---cccc--------chhcccccc----hhHHh Confidence 22 111 1223333 22 2333333222211110 1111100 0000 000000000 11222 Q ss_pred HHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHH Q lcl|NC_012530. 79 QYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRK 158 (559) Q Consensus 79 ~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~ 158 (559) ....+.+...||+..++.+- ..||.+- +.. .....+..+... ..+...... T Consensus 58 ~~~~~n~~~~ivd~~~~~l~-----------~~g~~~~--~~~-------~~~~~l~~i~~~---------N~~d~~~~~ 108 (484) T protein:vir:77 58 LLAHVGYPRLYIDAIAARQE-----------LEGFRLG--GAD-------KADEQLWDWWQA---------NDLDIESTL 108 (484) T ss_pred hhhhcCcHHHHHHHHHhhhc-----------cCceecC--Ccc-------hhHHHHHHHHHh---------cCHhHHHHH Confidence 22345666677776665432 1233321 111 111233344332 123456677 Q ss_pred HHHHHHHcCCcceEEEECCCCcE-------EEEEEecCceEEEEecCcccccccceEEEEEecCcee---eeecccc--- Q lcl|NC_012530. 159 LVRDTYTYDQVNYENTYDSNGRL-------SHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVR---GSFTADE--- 225 (559) Q Consensus 159 ~v~d~ll~Gna~~~i~rd~~G~~-------~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~---~~~~~~e--- 225 (559) +..+.+++|.+|..+.++..|.+ ..|.+++|..+.++.+..........+|+...+++.. ..|..+. T Consensus 109 ~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~~~D~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~ 188 (484) T protein:vir:77 109 GHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVEPPTNLYAQIDPRTRQVMRAIRAIEDEEGNEVIGATLYLPNNTVI 188 (484) T ss_pred HHHHHhhcCceEEEEecCCCCcccccccccceEEEeccceeEEEecCCCCceEEEEEEEEeecCCcEEEEEEEecCeEEE Confidence 88899999999999998887753 2477889999887776532221111111111111100 0111111 Q ss_pred ----------------------eEEEecccCCCccCCcccccHHH----HHHHHHHHHHHHHHHHHHHHHhcCCCceEEE Q lcl|NC_012530. 226 ----------------------MGMFIRNPRSDILSGGYGLSELE----MGLREFISHENTELFNDRFFTHGGTTKGILL 279 (559) Q Consensus 226 ----------------------vi~~~~n~~~~~~~~~~G~Spl~----~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~ 279 (559) |++|..++ ...+.+|.|.++ .+.+++.....-..-...||. .|.-+|. T Consensus 189 ~~~~~~~~~~~~~~~~~~g~vPvv~f~N~~---~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a---~p~~~i~ 262 (484) T protein:vir:77 189 WNREDGQWVQVANVAHNLEMVPVIPIPNRT---RLSDLYGTTEITPELRSVTDAAARTLMLMQATAELMG---VPQRLLF 262 (484) T ss_pred EEecCCceEeeccccCCCCCcceEEecccc---ccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhh---hhHHHHh Confidence 23443222 223446877654 334444443333333334443 3433332 Q ss_pred ecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccccchh-HHHHHHHHHHHHHHHHhCCCHHHhc Q lcl|NC_012530. 280 VKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQAEDM-QFQSWLNYLINIICALVAMDPAEIG 358 (559) Q Consensus 280 ~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~~D~-qf~e~~~~~~~~Ia~~fgVPp~~lg 358 (559) + ....+...+. ..-...|+. ..+++.+++.++.++..+.. .++ -|++..+..+..|+.+=++|++.+| T Consensus 263 --G-~~~~~~~~~~-~~~~~~~~~------~~~~~~~~~~~~~~~~q~~~-~~~e~~~~~l~~~i~~~s~~~~~p~~~fg 331 (484) T protein:vir:77 263 --G-VKGEELGVDP-ETGQTLFDA------YLARILAFEDHESKAQQFSA-AELRNFVDALDALDRKAAAYTGLPPYYLS 331 (484) T ss_pred --C-CCcchhcccc-cccchhhhh------hhhhhcccCCCCceeEeecC-CChHHHHHHHHHHHHHHhcccCCCHHHhc Confidence 1 0011110000 000111221 12345556666677766643 233 3888888899999999999999998 Q ss_pred cccccccccccccchhhhhHH---HHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHH Q lcl|NC_012530. 359 MQNRGGATGNKSNSLNESNNQ---NKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQL 435 (559) Q Consensus 359 ~~~~~~~~~~~~~~~~~an~~---~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~ 435 (559) ....+..++ ......+.... +..+..+...|.-++..+....+..- .......+.+.|......+..+.++.+.+ T Consensus 332 ~~~~n~~Sg-~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~~~~~~-~~~~~~~i~v~w~~~~~~s~~~~ad~~~k 409 (484) T protein:vir:77 332 FSSENPASA-EAIRSSESRLVKTVERKNKIFGGAWEQAMRVAYKVMNGGD-IPPEYYRMESIWRDPSTPTYAAKADAATK 409 (484) T ss_pred cccCcchHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC-cccccccceEEecCCCCCCHHHHHHHHHH Confidence 533221110 00000011100 11111222222222222211111100 01112356778877778888888887776 Q ss_pred HHc-C--CCCHHHHHHHhCCCCCCCCCEeeccceeccccccccccccccccccc-ccccccccCCCCCCCCCCCCccccc Q lcl|NC_012530. 436 ELQ-T--ATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQT-RLTQLESALQNPSGTPPTLPPSSSN 511 (559) Q Consensus 436 ~~~-~--~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (559) ++. | +++..-+++++|+-+-+ ...+...........+.... ......+..++++.++++....+ T Consensus 410 l~~~g~gi~s~et~~~~l~~~~~~----------~~e~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 477 (484) T protein:vir:77 410 LYNNGQGVIPKERARIDMGYSITE----------REEMRKWDEEEQAQGLGLMGTMFGTDPSGGGNPDNPETPEPQPN-- 477 (484) T ss_pred HHhccCCCCCHHHHHhcCCCChhH----------HHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCCCCcccccCC-- Confidence 654 3 45777788888874321 01111111000000000000 00000011111111111000000 Q ss_pred cchhcccccccccccccc Q lcl|NC_012530. 512 SFQQNQEGYTGKDAKPSG 529 (559) Q Consensus 512 ~~~~~~~~~~~~~~~~~g 529 (559) .+...+| T Consensus 478 -----------~~~~~~~ 484 (484) T protein:vir:77 478 -----------PAEEAAA 484 (484) T ss_pred -----------CccccCC Confidence 0000000 No 150 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=98.86 E-value=2.5e-08 Score=62.25 Aligned_cols=423 Identities=9% Similarity=0.033 Sum_probs=173.2 Q ss_pred Ccchhhh--------ccccccCC----cchHHHHHHHHHHHHH-----Hhhhhccccccccccccccccccccccccccc Q lcl|NC_012530. 1 MGIFDRF--------RTKFYTDD----PNAFFKHIDSKIANDT-----ASKALNGVDRAYTEPVDGNLMFSTLEDTSIVP 63 (559) Q Consensus 1 ~~~~~~~--------~~~~~~~~----~~~~~~~~~~~~~~~~-----~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~ 63 (559) =|++-.| ..-|+.+. ..+.|..+-.....+. +..=-.|++....++... ...... T Consensus 18 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~--------~~~~~~ 89 (492) T protein:vir:97 18 GNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPV--------DATGAV 89 (492) T ss_pred CceeeccchhhhhHhhhcccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccc--------cccccc Confidence 1222221 11122221 1122222211111111 111112332221111000 000000 Q ss_pred cCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCC Q lcl|NC_012530. 64 KPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGV 143 (559) Q Consensus 64 ~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p 143 (559) .+ .+.+ .+ ...++...||+..+.-+- |.+..+...+ ....+.+..|+.+ T Consensus 90 ~~---~~~~----~r--i~~n~~k~Ivd~~~~yl~-----------g~p~~~~~~d--------~~~~~~l~~~~~n--- 138 (492) T protein:vir:97 90 DP---LKPD----DR--MITNFHANLVDQKVSYIV-----------GKPIAFKHTD--------DEVVKRIDEVLGN--- 138 (492) T ss_pred cc---cccc----cc--cccchHHHHHHHHhhhhc-----------ccCceeccCc--------hHHHHHHHHHHhc--- Confidence 00 0000 00 113555556665554432 1222221111 1122334444421 Q ss_pred CCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcc-cccccceEEEEEecCceeeeec Q lcl|NC_012530. 144 DYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHG-HRRTRGKIYRQYIDNKVRGSFT 222 (559) Q Consensus 144 ~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g-~~~~~~~~y~~~~~~~~~~~~~ 222 (559) .+......+..+++.+|.+|..+.++.+|++ .+..++|..+.++.+... .......+|+...+......+. T Consensus 139 -------~~~~~~~~~~~~~~~~G~a~~~v~~d~dg~~-~~~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~~y~ 210 (492) T protein:vir:97 139 -------RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWD 210 (492) T ss_pred -------cHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEeeccceeEEEEe Confidence 1233445677889999999999999988875 477899999988876432 1222233343333332333333 Q ss_pred ccceEEEec----------------------ccCCC-----ccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_012530. 223 ADEMGMFIR----------------------NPRSD-----ILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTK 275 (559) Q Consensus 223 ~~evi~~~~----------------------n~~~~-----~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~ 275 (559) ...+.++.. |+... ......|.|-++.....++....+..-..+.+...+.|- T Consensus 211 ~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~ 290 (492) T protein:vir:97 211 KVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELT 290 (492) T ss_pred cCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccce Confidence 333333321 11000 001124777777766666666555555556666666665 Q ss_pred eEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc-CCceeeeeccccchhHHHHHHHHHHHHHHHHhCCCH Q lcl|NC_012530. 276 GILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT-AEDAKFVSMTQAEDMQFQSWLNYLINIICALVAMDP 354 (559) Q Consensus 276 gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~-~g~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp 354 (559) .++. +. +.+....++..+.. .++..+. +++++|..... .+..+....+...+.|+..-++|. T Consensus 291 l~~~--g~------~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~L~~~I~~~s~~p~ 353 (492) T protein:vir:97 291 YVLK--NY------DDQELPEFKRLLRY--------YGAIKVSDNGGVDTIQVEV-PVENSKKYLDELYQKIMLFGQAVD 353 (492) T ss_pred eeee--cC------CcccchhHHHHHhh--------ccceecCCCCcceeEeccC-CHHHHHHHHHHHHHHHHHHhCCCC Confidence 5543 21 11111222222111 1222332 33455543222 345567778888889999888885 Q ss_pred HHhccccccccccccccchh---hhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHH Q lcl|NC_012530. 355 AEIGMQNRGGATGNKSNSLN---ESNN---QNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQD 428 (559) Q Consensus 355 ~~lg~~~~~~~~~~~~~~~~---~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~ 428 (559) .-.+-. +++.++... +... .......+..+|+.+++.|...++. ......+.+.|+.....+..+ T Consensus 354 ~~~~~~-----~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~----~~~~~~i~v~f~~~~p~~~~e 424 (492) T protein:vir:97 354 FSSDKF-----GSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI----KGEHKDVDISFNYNKVANTEL 424 (492) T ss_pred CCcccc-----ccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC----CcccceeeEEecCCCCCCHHH Confidence 332211 111111100 0111 1122233444555555555443331 223456788888888888888 Q ss_pred HHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCcc Q lcl|NC_012530. 429 KLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPS 508 (559) Q Consensus 429 ~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (559) .++.+..+ .|.++..-+.++++.-+ |. ...+..+.+ +........+....+..+...+. . T Consensus 425 ~a~~~~kl-~G~iS~et~l~~l~~v~----d~------~~Eleri~~------E~~~~~~~~~~~~~~~~~~~~~~---~ 484 (492) T protein:vir:97 425 QVQTAQQS-MGIVSHETVLENHPFVE----DL------QAELERIEQ------EQTEYNKQLPNLDDGGADSAQQQ---E 484 (492) T ss_pred HHHHHHHH-hccCchHHHHHhCCCCC----CH------HHHHHHHHH------HHHHHHHhhhccccCCCCCCccc---c Confidence 88877665 46678777777775421 10 011111111 10000000000000000000000 0 Q ss_pred ccccchhc Q lcl|NC_012530. 509 SSNSFQQN 516 (559) Q Consensus 509 ~~~~~~~~ 516 (559) .++++.++ T Consensus 485 ~~~~~~~e 492 (492) T protein:vir:97 485 RSNNKESE 492 (492) T ss_pred cccccccC Confidence 00000000 No 151 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=98.85 E-value=2.4e-08 Score=62.36 Aligned_cols=409 Identities=12% Similarity=0.078 Sum_probs=165.7 Q ss_pred cccCCcchHHHHHHHHHHHHHHhhh-----hccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChH Q lcl|NC_012530. 11 FYTDDPNAFFKHIDSKIANDTASKA-----LNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVV 85 (559) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~-----~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~ 85 (559) +..+++++-+..|.+.+..+..... -.|++. +. + .+. ..|. .+.. ++......+ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~-----i~--------~-~~~-~~~~-----~~~~-~~~~~~~n~ 59 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAP-----LP--------E-LTR-NTSA-----AWRS-FQREARTNW 59 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-----hh--------h-cCc-ccCh-----hhch-hhhhhhcch Confidence 5555666565555555433222111 112210 00 0 000 0111 1111 111122345 Q ss_pred HHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHH Q lcl|NC_012530. 86 LNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYT 165 (559) Q Consensus 86 v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll 165 (559) ...||+..++.+- |.++.+...+..+. ...+.+++... .+..+...+..+++. T Consensus 60 ~~~ivd~~~~~l~-----------~~g~~~~~~~d~~~-------~~~~~~~~~~n---------~~d~~~~~~~~~a~~ 112 (456) T protein:vir:79 60 GLMVRDSVADRII-----------PNGITVGGSADSDL-------ALRARRIWRDN---------RMDSVCKQWVKYGLD 112 (456) T ss_pred HHHHHHHHHhhhc-----------cCCeecCCCCCccH-------HHHHHHHHHhc---------ChhHHHHHHHHHHhh Confidence 6667766665443 33444322221111 11233343321 233466678889999 Q ss_pred cCCcceEEEECCCCcEEEEEEecCceEEEEecCccc-ccccceEEEEEecCceee--eeccc------------------ Q lcl|NC_012530. 166 YDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGH-RRTRGKIYRQYIDNKVRG--SFTAD------------------ 224 (559) Q Consensus 166 ~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~-~~~~~~~y~~~~~~~~~~--~~~~~------------------ 224 (559) +|.+|..+.++.+|.+ .+..++|..+.++.+.... ......+|+...++.... .+..+ T Consensus 113 ~G~a~~~~~~~edg~~-~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 191 (456) T protein:vir:79 113 FGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRR 191 (456) T ss_pred cCeeEEEEeeCCCCce-EEEEeccceeEEEEcCCCCCceEEEEEEEEecCCceeEEEEEcCCceEEEEEEEEeeccccce Confidence 9999999888889987 4788899998887764321 111122222111111000 00000 Q ss_pred -------------ceEEEec-ccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCC Q lcl|NC_012530. 225 -------------EMGMFIR-NPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTS 290 (559) Q Consensus 225 -------------evi~~~~-n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~ 290 (559) ++-|... -|. -......|+|-++.....++....+..-........+.|--++.-.. ...... T Consensus 192 ~~~~~~~~~~~~~~~~~~~~~~pv-v~~~N~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~-~~~~~~- 268 (456) T protein:vir:79 192 LVTRISDSWVPVGDAVVTGSPPPV-VVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSE-HRLPKV- 268 (456) T ss_pred eeeccCCceeecccccCCCCceeE-EEecCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCC-cccccc- Confidence 0001000 000 00112246666666555444433222221122222222222221000 000000 Q ss_pred HHHHHH--HHHHHHHHhcCcccccccccccCCceeeeeccccchh-HHHHHHHHHHHHHHHHhCCCHHHhcccccccccc Q lcl|NC_012530. 291 MRALED--FKRHWTATSSGINGAYRIPMITAEDAKFVSMTQAEDM-QFQSWLNYLINIICALVAMDPAEIGMQNRGGATG 367 (559) Q Consensus 291 ~e~~~~--l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~~D~-qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~ 367 (559) ++.-.. ....|... .+.+ +...++.++..+.. .++ .|++..+..+..|+..=++|++.+|....+.+ + T Consensus 269 d~~g~~i~~~~~~~~~------~~~~-~~~~~~~~~~q~~~-~~~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~N~S-g 339 (456) T protein:vir:79 269 DENGNAIDYASIFEAA------PGAL-WELPPGVDIWESQT-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQS-A 339 (456) T ss_pred cccccccchhhhhhhh------cccc-ccCCCCcceeeecc-cChHHHHHHHHHHHHHHHhhcCCChhHhcccccCcH-H Confidence 000000 11122221 1222 22244566655543 233 38889999999999999999999985322211 0 Q ss_pred ccccchhhhhHH---HHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHcCC-CCH Q lcl|NC_012530. 368 NKSNSLNESNNQ---NKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQTA-TTV 443 (559) Q Consensus 368 ~~~~~~~~an~~---~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~~-~T~ 443 (559) .+...-+...+ +..+..+..+|+-++..+. .+........+++.|......+..+.++++.+++..| ++. T Consensus 340 -~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~-----~~~g~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~G~~~~ 413 (456) T protein:vir:79 340 -EGAHNIEKGFLFKCEDRLSIAKIGLEAILVKAL-----QIEGESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWA 413 (456) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhcCCCccccceEEeCCCCCcCHHHHHHHHHHHHhcCCChH Confidence 00000011111 0111122222222222221 1111222345778888888888889999887776655 566 Q ss_pred HHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccc Q lcl|NC_012530. 444 NDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSN 511 (559) Q Consensus 444 NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (559) .-+++.+|+.|-+ + ... +.+....+.+.. .+++-+ .+ ++..+- T Consensus 414 ~~~~~~lg~~~~~---i-------~~~---------e~~r~~~e~~~~---~~~~~~-~~--~~~~~~ 456 (456) T protein:vir:79 414 SIRRNILNYNADQ---I-------KQD---------DLDRAREQITLF---AGNPVQ-RP--QEDGSR 456 (456) T ss_pred HHHHhcCCCCHHH---H-------HHH---------HHHHHHHHHHHH---hhhHhh-cC--CCCCCC Confidence 6666777775421 0 000 000000000000 000000 00 000000 No 152 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=98.80 E-value=4.1e-08 Score=61.07 Aligned_cols=416 Identities=10% Similarity=0.021 Sum_probs=175.6 Q ss_pred chhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhh Q lcl|NC_012530. 3 IFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSM 82 (559) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~ 82 (559) +|..| +.++..++.+- ..=-.|++.....+-.... . .++ ..+ .. T Consensus 1 ~~~~~--------~~~~~~r~~~l------~~yy~g~~~~~~~~~~~~~---~-------~~~--~~k----------i~ 44 (440) T protein:vir:95 1 MLAAF--------LGSQKQRLAIL------ASYAQGDNFSILSGHRRLD---D-------EKA--DYR----------VR 44 (440) T ss_pred ChhhH--------HHHHHHHHHHH------HHHhccCCccccccccccc---c-------cCC--cce----------ee Confidence 33333 33333333222 1112344332111100000 0 000 000 11 Q ss_pred ChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHH Q lcl|NC_012530. 83 NVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRD 162 (559) Q Consensus 83 ~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d 162 (559) ......+|+..+.-+. |.+..+...+. ...+....+..++.+. .+......+..+ T Consensus 45 ~n~~~~ivd~~~~~l~-----------g~~~~~~~~~~-----~~~~~~~~l~~~~~~n---------~~~~~~~~~~~~ 99 (440) T protein:vir:95 45 HKWGGYISSFATGYVI-----------GNPVSIGVMEG-----GSADQLSTIKDIEWQN---------DINALNSDLAFD 99 (440) T ss_pred cchHHHHHHhhhhhee-----------ccCceEeeCCC-----ccHHHHHHHHHHHHhc---------CHhHHHHHHHHH Confidence 2334444444433321 11112211111 1122233344444331 233455677889 Q ss_pred HHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCccc-ccccceEEEEEecCceeeeecccceEEEec---------- Q lcl|NC_012530. 163 TYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGH-RRTRGKIYRQYIDNKVRGSFTADEMGMFIR---------- 231 (559) Q Consensus 163 ~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~-~~~~~~~y~~~~~~~~~~~~~~~evi~~~~---------- 231 (559) .+++|.+|..+.++.+|+|. +..++|..+.++.+..+. .....++|+...+......+..+.++++.. T Consensus 100 ~~~~G~a~~~~~~d~~~~~~-i~~~~p~~~~~~~d~~~~~~~~~~i~~~~~~~~~~~~vyt~~~~~~~~~~~~~~~~~~~ 178 (440) T protein:vir:95 100 ASVYGRAYEYHFRDKDKVDR-VVLISPLEMFVIRDLTVEQNIIAAVHLPIYADKVNMTVYTKDKVITYKPYSNNSVRLVV 178 (440) T ss_pred HhhcCeEEEEEEecCCCceE-EEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEecCCccceee Confidence 99999999999999988864 777899999998876542 222233333333332233344444443321 Q ss_pred -----ccCC-----CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHH Q lcl|NC_012530. 232 -----NPRS-----DILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHW 301 (559) Q Consensus 232 -----n~~~-----~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~ 301 (559) |+.. .......|.|-++.....+.....+.....+.....+.|-.+++-. ......+.++...++..- T Consensus 179 ~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~lida~~~~~s~~~~~~~~~~~~~~v~~g~--~~~~~~~~e~~~~~~~~~ 256 (440) T protein:vir:95 179 DDVKKHSYNDVPVVEWWNNRFRMGDYESEISLIDAYDAGQSDTANYMSDLNDAMLLVKGD--LDGIKLSPEDAAKMKDAN 256 (440) T ss_pred cceeeccCceeeEEEeeCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcceeeeecc--cccCCCCccchhhhhhcc Confidence 1100 0111224666666655555554444444444445555565554321 112233455554544321 Q ss_pred HHHhcCcccccccccccCCceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhH--- Q lcl|NC_012530. 302 TATSSGINGAYRIPMITAEDAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNN--- 378 (559) Q Consensus 302 ~~~~~G~~nag~~~vl~~g~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~--- 378 (559) .-.. .........-..++++|.....+ +..+....+...+.|+..-++|..-.+-...+. ++ .....-++.. T Consensus 257 ~~~~--~~~~~~~~~~~~~~~~~lt~~~~-~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~-Sg-~Al~~~~~~l~~k 331 (440) T protein:vir:95 257 MLFL--KTGISTTGQQTTADASYIYKQYD-VNGTEAYKNRLANDIHRFSRIPNLDDDRFNSTS-SG-IALLYKMIGLEQV 331 (440) T ss_pred ceec--ccccccccCCCCcceeEEeecCC-HHHHHHHHHHHHHHHHHHhCCcccccccccccc-hH-HHHHHHHHHHHHH Confidence 1111 00000001111233444433322 344667788888999999999974433211110 00 0000111111 Q ss_pred HHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCC Q lcl|NC_012530. 379 QNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQTATTVNDYREKQGLPKIAGG 458 (559) Q Consensus 379 ~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gG 458 (559) ....+.++...|.-++..|...++..--.......+.+.|......+..+.++.+.++ .|.|+.--+.++++.- T Consensus 332 ~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~i~f~~~~p~~~~~~ad~~~kl-~g~iS~et~~~~l~~~----- 405 (440) T protein:vir:95 332 RKDKETYFTKALRRRYELISNIHKAINGPVIEANKLTFTFHPNIPQDVWTEIKAYIEA-GGEISQETLMENASFT----- 405 (440) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccceEEeCCCCCCCHHHHHHHHHHH-hccCcHHHHHHhCCCC----- Confidence 1112244455555555555544432211222334578889888888999999887765 4667776666665431 Q ss_pred CEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccc Q lcl|NC_012530. 459 DIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSN 511 (559) Q Consensus 459 D~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (559) |. + ..+..+. .++....... ....++.+. .+.+.| T Consensus 406 d~---~---~E~~ri~------~E~~~~~~~~-~~~~~~~~~-----~~~~~e 440 (440) T protein:vir:95 406 DY---K---TEHSRIL------KQGGSSDLEI-GQIVGDADV-----GQADTE 440 (440) T ss_pred Cc---H---HHHHHHH------HHHHHhhhhH-HhhccCCCC-----CCcCCC Confidence 10 0 1111111 1111110000 000010000 000000 No 153 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=98.80 E-value=4.3e-08 Score=60.99 Aligned_cols=432 Identities=9% Similarity=0.017 Sum_probs=179.8 Q ss_pred CcchhhhccccccCCcch-HHHHHHHHHH---HHHHh--hhhcc---ccccccccccccc-cccccccccccccCCCCCc Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNA-FFKHIDSKIA---NDTAS--KALNG---VDRAYTEPVDGNL-MFSTLEDTSIVPKPSPIAF 70 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~---~~~~~--~~~~g---r~~a~~~~~~~~~-~~~~~~~~~~~~~p~~~~~ 70 (559) |+|. +|...+-..++.. .|..+-+... .+-.+ .--.| ..+...++-.... .+..+. ... ... .+ T Consensus 1 ~~~~-~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~--~~~-~~~--~~ 74 (474) T protein:vir:94 1 MTLY-KLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGG--NVR-RLD--VS 74 (474) T ss_pred CchH-HHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcc--ccc-ccc--cC Confidence 7653 5555565555553 2222211110 01000 00001 0111111100000 000000 000 000 00 Q ss_pred ccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChh Q lcl|NC_012530. 71 GRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRD 150 (559) Q Consensus 71 ~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~ 150 (559) .+. + ..++....+|+..+.-+. |.+..+...+... ...+....+.+|+.+ . T Consensus 75 ~~~----k--i~~n~~~~ivd~~~~yl~-----------g~pv~~~~~~~~~---~~e~~~~~l~~~~~~---------n 125 (474) T protein:vir:94 75 VNN----K--LNNSFDSEIVDTRVGYLH-----------GVPVTYDLDENAE---KNEKLKKFITNFAIR---------N 125 (474) T ss_pred ccc----c--cccchHHHHHHhHhhhee-----------ccceeEeeCCCCc---chHHHHHHHHHHHhh---------c Confidence 000 0 012344444444443222 2222222222111 111222334444433 1 Q ss_pred hHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCc------eeeeeccc Q lcl|NC_012530. 151 DFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNK------VRGSFTAD 224 (559) Q Consensus 151 ~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~------~~~~~~~~ 224 (559) .+......+..+++.+|.+|..+.++.+|++ .+..++|..+.++.+..+... ..++|+...+.. ....+... T Consensus 126 ~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~-~~~~i~p~~~~~v~d~~~~~~-~~i~~~~~~~~~~~~~~~~~~~y~~~ 203 (474) T protein:vir:94 126 SVDDEDSEIGKMAAICGYGARLAYIDTNGDI-RIKNIDPYNVIFVGDNILEPT-YSLRYFYEKDDDNGTDYVYAEFYDNA 203 (474) T ss_pred CHhHHHHHHHHHHhhcCeEEEEEEeCCCCee-EEEEEcccceEEEEcCCCceE-EEEEEEEEeeCCCceEEEEEEEEcCc Confidence 2345667778899999999999989988875 578899999988887665432 233333322111 01122222 Q ss_pred ceEEEecc-------------cCC-----CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCC Q lcl|NC_012530. 225 EMGMFIRN-------------PRS-----DILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSV 286 (559) Q Consensus 225 evi~~~~n-------------~~~-----~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~ 286 (559) .+.++... +.. .......|.|-++.....+.....+..-..+.+...+.|-.+++ +. T Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~--g~--- 278 (474) T protein:vir:94 204 YYYVFRGEGIDALQEVGRYEHLFDYNPLFGVPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLR--GM--- 278 (474) T ss_pred eEEEEeecCCCcccccccccCCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc--cC--- Confidence 22222211 000 01112346666666555555544444444444444445544442 21 Q ss_pred ccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhCCCHHHhcccccccc Q lcl|NC_012530. 287 TNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGA 365 (559) Q Consensus 287 ~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~ 365 (559) +++++....++ ..+. .++.+++.++.-++.+ .+..+....+...+.|...-++|..-.+-... T Consensus 279 -~~~~~~~~~~~-----------~~~~-i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~--- 342 (474) T protein:vir:94 279 -GMSEEMIQETQ-----------KSGA-FELFDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNG--- 342 (474) T ss_pred -CCCchhhhhhh-----------hcce-eEecCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccc--- Confidence 22333322221 1122 3333333444444422 23456677788888998888888644321111 Q ss_pred ccccccch--hhhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhcc--ccccCccceeeecchhhhhHHHHHHHHHHHHc Q lcl|NC_012530. 366 TGNKSNSL--NESNN---QNKIDASKSKGLMPLLDMIAKNLTNGII--RQILGDNYMLEFVGGDTRSQQDKLKSVQLELQ 438 (559) Q Consensus 366 ~~~~~~~~--~~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~--~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~ 438 (559) +.++... -++.. ....+..+..+|+-.++.|...++.+-. .+.....+.+.|......|..+.++++..+ . T Consensus 343 -n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl-~ 420 (474) T protein:vir:94 343 -NVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINL-K 420 (474) T ss_pred -cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHH-h Confidence 1111110 00111 1222345555666665555555543211 122234578889888888999999888765 4 Q ss_pred CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccc Q lcl|NC_012530. 439 TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 439 ~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (559) |+++..-+.++++.- + |. ...+..+. .+........+....++.++ .+...++. T Consensus 421 g~iS~et~~~~l~~v--~--d~------~~E~eri~------~E~~e~~~~~~~~~~~~~~~-----~~~~~~s~ 474 (474) T protein:vir:94 421 GQVSERTRLGQSQLV--D--DV------DYELDEME------KESLEFNDKLPDIDEGDAND-----KSQNNQSE 474 (474) T ss_pred ccCchHHHHHhCCCC--C--CH------HHHHHHHH------HHHHHHHhhcccccCCCcCC-----CCccccCC Confidence 668887788877642 1 10 01111111 11111111111111111111 11111111 No 154 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=98.80 E-value=4.3e-08 Score=60.99 Aligned_cols=432 Identities=9% Similarity=0.017 Sum_probs=179.8 Q ss_pred CcchhhhccccccCCcch-HHHHHHHHHH---HHHHh--hhhcc---ccccccccccccc-cccccccccccccCCCCCc Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNA-FFKHIDSKIA---NDTAS--KALNG---VDRAYTEPVDGNL-MFSTLEDTSIVPKPSPIAF 70 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~---~~~~~--~~~~g---r~~a~~~~~~~~~-~~~~~~~~~~~~~p~~~~~ 70 (559) |+|. +|...+-..++.. .|..+-+... .+-.+ .--.| ..+...++-.... .+..+. ... ... .+ T Consensus 1 ~~~~-~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~--~~~-~~~--~~ 74 (474) T protein:vir:10 1 MTLY-KLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGG--NVR-RLD--VS 74 (474) T ss_pred CchH-HHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcc--ccc-ccc--cC Confidence 7653 5555565555553 2222211110 01000 00001 0111111100000 000000 000 000 00 Q ss_pred ccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChh Q lcl|NC_012530. 71 GRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRD 150 (559) Q Consensus 71 ~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~ 150 (559) .+. + ..++....+|+..+.-+. |.+..+...+... ...+....+.+|+.+ . T Consensus 75 ~~~----k--i~~n~~~~ivd~~~~yl~-----------g~pv~~~~~~~~~---~~e~~~~~l~~~~~~---------n 125 (474) T protein:vir:10 75 VNN----K--LNNSFDSEIVDTRVGYLH-----------GVPVTYDLDENAE---KNEKLKKFITNFAIR---------N 125 (474) T ss_pred ccc----c--cccchHHHHHHhHhhhee-----------ccceeEeeCCCCc---chHHHHHHHHHHHhh---------c Confidence 000 0 012344444444443222 2222222222111 111222334444433 1 Q ss_pred hHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEEecCc------eeeeeccc Q lcl|NC_012530. 151 DFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNK------VRGSFTAD 224 (559) Q Consensus 151 ~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~------~~~~~~~~ 224 (559) .+......+..+++.+|.+|..+.++.+|++ .+..++|..+.++.+..+... ..++|+...+.. ....+... T Consensus 126 ~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~-~~~~i~p~~~~~v~d~~~~~~-~~i~~~~~~~~~~~~~~~~~~~y~~~ 203 (474) T protein:vir:10 126 SVDDEDSEIGKMAAICGYGARLAYIDTNGDI-RIKNIDPYNVIFVGDNILEPT-YSLRYFYEKDDDNGTDYVYAEFYDNA 203 (474) T ss_pred CHhHHHHHHHHHHhhcCeEEEEEEeCCCCee-EEEEEcccceEEEEcCCCceE-EEEEEEEEeeCCCceEEEEEEEEcCc Confidence 2345667778899999999999989988875 578899999988887665432 233333322111 01122222 Q ss_pred ceEEEecc-------------cCC-----CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCC Q lcl|NC_012530. 225 EMGMFIRN-------------PRS-----DILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSV 286 (559) Q Consensus 225 evi~~~~n-------------~~~-----~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~ 286 (559) .+.++... +.. .......|.|-++.....+.....+..-..+.+...+.|-.+++ +. T Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~--g~--- 278 (474) T protein:vir:10 204 YYYVFRGEGIDALQEVGRYEHLFDYNPLFGVPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLR--GM--- 278 (474) T ss_pred eEEEEeecCCCcccccccccCCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc--cC--- Confidence 22222211 000 01112346666666555555544444444444444445544442 21 Q ss_pred ccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhCCCHHHhcccccccc Q lcl|NC_012530. 287 TNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGA 365 (559) Q Consensus 287 ~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~ 365 (559) +++++....++ ..+. .++.+++.++.-++.+ .+..+....+...+.|...-++|..-.+-... T Consensus 279 -~~~~~~~~~~~-----------~~~~-i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~--- 342 (474) T protein:vir:10 279 -GMSEEMIQETQ-----------KSGA-FELFDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNG--- 342 (474) T ss_pred -CCCchhhhhhh-----------hcce-eEecCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccc--- Confidence 22333322221 1122 3333333444444422 23456677788888998888888644321111 Q ss_pred ccccccch--hhhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhcc--ccccCccceeeecchhhhhHHHHHHHHHHHHc Q lcl|NC_012530. 366 TGNKSNSL--NESNN---QNKIDASKSKGLMPLLDMIAKNLTNGII--RQILGDNYMLEFVGGDTRSQQDKLKSVQLELQ 438 (559) Q Consensus 366 ~~~~~~~~--~~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~--~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~ 438 (559) +.++... -++.. ....+..+..+|+-.++.|...++.+-. .+.....+.+.|......|..+.++++..+ . T Consensus 343 -n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl-~ 420 (474) T protein:vir:10 343 -NVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINL-K 420 (474) T ss_pred -cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHH-h Confidence 1111110 00111 1222345555666665555555543211 122234578889888888999999888765 4 Q ss_pred CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccc Q lcl|NC_012530. 439 TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 439 ~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (559) |+++..-+.++++.- + |. ...+..+. .+........+....++.++ .+...++. T Consensus 421 g~iS~et~~~~l~~v--~--d~------~~E~eri~------~E~~e~~~~~~~~~~~~~~~-----~~~~~~s~ 474 (474) T protein:vir:10 421 GQVSERTRLGQSQLV--D--DV------DYELDEME------KESLEFNDKLPDIDEGDAND-----KSQNNQSE 474 (474) T ss_pred ccCchHHHHHhCCCC--C--CH------HHHHHHHH------HHHHHHHhhcccccCCCcCC-----CCccccCC Confidence 668887788877642 1 10 01111111 11111111111111111111 11111111 No 155 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=98.79 E-value=4.8e-08 Score=60.70 Aligned_cols=440 Identities=10% Similarity=0.018 Sum_probs=173.0 Q ss_pred Ccchhhhc-----cccccCC-------cchHHHH-HHHHH-HHHHHhhhhccccccccccccccccccccccccccccCC Q lcl|NC_012530. 1 MGIFDRFR-----TKFYTDD-------PNAFFKH-IDSKI-ANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPS 66 (559) Q Consensus 1 ~~~~~~~~-----~~~~~~~-------~~~~~~~-~~~~~-~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~ 66 (559) --+|.+.- -...+.+ +...+.+ ..+.. ..+.+..=-.|++.....+... .. .. + T Consensus 18 ~~~~~~~~~~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~-----~~---~~--~-- 85 (512) T protein:vir:97 18 NYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR-----KE---EY--M-- 85 (512) T ss_pred eeeeccccccccccCchhhhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcc-----cc---cc--c-- Confidence 11111110 0001111 1111111 11100 0111222222333221111000 00 00 0 Q ss_pred CCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCC Q lcl|NC_012530. 67 PIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYS 146 (559) Q Consensus 67 ~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~ 146 (559) +..+ ........+|+..+.-+. |.+..+...+ ....+.+..|+.. T Consensus 86 ~~~k----------i~~n~~k~Ivd~~~~yl~-----------g~p~~~~~~d--------~~~~~~l~~~~~~------ 130 (512) T protein:vir:97 86 ADNR----------VAHDYASYISDFINGYFL-----------GNPIQCQDDD--------KDVLEAIEAFNDL------ 130 (512) T ss_pred Ccce----------eecchHHHHHHHHhhhhc-----------ccCceeccCC--------hHHHHHHHHHHhh------ Confidence 0001 012333444544443332 1222222111 1122334444432 Q ss_pred CChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcc-cccccceEEEEEec--Cc------e Q lcl|NC_012530. 147 PIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHG-HRRTRGKIYRQYID--NK------V 217 (559) Q Consensus 147 ~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g-~~~~~~~~y~~~~~--~~------~ 217 (559) ..+......+..+++++|.+|..+.++.+|++. +..++|..+.++.++.. ......++|+.... +. . T Consensus 131 ---n~~~~~~~~~~~~~~i~G~ay~~vy~ded~~~~-i~~~~p~~~~~iyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~ 206 (512) T protein:vir:97 131 ---NDVESHNRSLGLDLSIYGKAYELMIRNQDDETR-LYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFT 206 (512) T ss_pred ---cCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceE-EEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEE Confidence 123456667788999999999999999888754 78899999998877543 22223344433211 10 1 Q ss_pred eeeecccceEEEecc----------------------cCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_012530. 218 RGSFTADEMGMFIRN----------------------PRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTK 275 (559) Q Consensus 218 ~~~~~~~evi~~~~n----------------------~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~ 275 (559) ...+..+.+.++... |.-.......|.|-++.+...++....+..-..+.+...+.|- T Consensus 207 ~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~ 286 (512) T protein:vir:97 207 VDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAM 286 (512) T ss_pred EEEEeCCcEEEEEecCCCcccccccccccccccCcccceEeecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCce Confidence 123444444443211 0000111235777777777777666665555555556666665 Q ss_pred eEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccc-cccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhCCC Q lcl|NC_012530. 276 GILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIP-MITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVAMD 353 (559) Q Consensus 276 gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~-vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fgVP 353 (559) .++.-.. ..+.+.....+....-...........+ +-.+++.++.-++.+ .+..+....+...+.|+..-++| T Consensus 287 lv~~G~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p 361 (512) T protein:vir:97 287 LLIKGNL-----NLDPVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTP 361 (512) T ss_pred eeeecCc-----cCCchhhhhhhhcccccccccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCc Confidence 5553211 1122222222211111111111111111 112333444444432 23345667778888898888998 Q ss_pred HHHhccccccccccccccchhh--h---hHHHHHHHHHHHHhhHHHHHHHHHHHhhcc--ccccCccceeeecchhhhhH Q lcl|NC_012530. 354 PAEIGMQNRGGATGNKSNSLNE--S---NNQNKIDASKSKGLMPLLDMIAKNLTNGII--RQILGDNYMLEFVGGDTRSQ 426 (559) Q Consensus 354 p~~lg~~~~~~~~~~~~~~~~~--a---n~~~~~~~~~~~~l~P~~~~ie~~ln~~L~--~~~~~~~~~~~f~~l~~~d~ 426 (559) ..-.+-... +.++..+-+ . +-....+..+..+|+-.+..|...+...-- .......+.+.|......+. T Consensus 362 ~~~~~~~~g----n~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~~~ 437 (512) T protein:vir:97 362 NMKDDNFSG----TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSL 437 (512) T ss_pred ccCcccccc----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCH Confidence 754432211 111100100 0 011122233444444444444443332111 11122357888888888888 Q ss_pred HHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccc-cccccCCCCCCCCCCC Q lcl|NC_012530. 427 QDKLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLT-QLESALQNPSGTPPTL 505 (559) Q Consensus 427 ~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 505 (559) .+.++.+..+ .|+++.--+.++++. ++ |. ...+..+.... +....... .........++.+++ T Consensus 438 ~e~~~~~~kl-~giiS~et~~~~l~~--v~--d~------~~E~eri~~E~----~~~~~~~~~~~~~~~~~~~~~~~~- 501 (512) T protein:vir:97 438 IEELKAYIDS-GGKISQTTLMSLFSF--FQ--DP------ELEVKKIEEDE----KESIKKAQKGIYKDPRDINDDEQD- 501 (512) T ss_pred HHHHHHHHHH-hccCchHHHHHhCCC--CC--CH------HHHHHHHHHHH----HHHHHHHhhcccCCCCCCCCCCCC- Confidence 8888877665 366788777777654 21 10 01111111110 00000000 000000000000000 Q ss_pred Cccccccchhccc Q lcl|NC_012530. 506 PPSSSNSFQQNQE 518 (559) Q Consensus 506 ~~~~~~~~~~~~~ 518 (559) ...+...++.. T Consensus 502 --~~~~~~~~~~~ 512 (512) T protein:vir:97 502 --DDTKDTVDKKE 512 (512) T ss_pred --CCccccccccC Confidence 00000000000 No 156 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=98.76 E-value=6.1e-08 Score=60.13 Aligned_cols=423 Identities=9% Similarity=0.016 Sum_probs=171.7 Q ss_pred Ccchhhhc--------cccccCCcchHHHHHHHHHHHHHHh---------hhhccccccccccccccccccccccccccc Q lcl|NC_012530. 1 MGIFDRFR--------TKFYTDDPNAFFKHIDSKIANDTAS---------KALNGVDRAYTEPVDGNLMFSTLEDTSIVP 63 (559) Q Consensus 1 ~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~ 63 (559) =|++-.|- .-|+.+.-.+....+-+....+..+ .=-.|++....++.. . ...... T Consensus 18 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~I~~~~~~----~----~~~~~~ 89 (492) T protein:vir:94 18 GNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKP----V----DATGAV 89 (492) T ss_pred CceeecCccchhhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc----c----cccccc Confidence 22222221 0011111111112222222211111 001122211111100 0 000000 Q ss_pred cCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCC Q lcl|NC_012530. 64 KPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGV 143 (559) Q Consensus 64 ~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p 143 (559) ...+.+. + ...++...+|+..+.-+-+ .+..+...+ ....+.+..|+.+ T Consensus 90 ---~~~~~~~----r--i~~n~~k~Ivd~~~~yl~G-----------~p~~~~~~d--------~~~~~~l~~~~~n--- 138 (492) T protein:vir:94 90 ---DPLKPDD----R--MITNFHANLVDQKVSYIVG-----------KPIAFKHTD--------DEVVKRIDEVLGN--- 138 (492) T ss_pred ---ccccccc----c--cccchHHHHHHHHHhhhcc-----------cCceeccCc--------hHHHHHHHHHHhc--- Confidence 0000000 0 1235555666655544322 122221111 1222344444421 Q ss_pred CCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcc-cccccceEEEEEecCceeeeec Q lcl|NC_012530. 144 DYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHG-HRRTRGKIYRQYIDNKVRGSFT 222 (559) Q Consensus 144 ~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g-~~~~~~~~y~~~~~~~~~~~~~ 222 (559) .+......+..+.+.+|.+|..+..|.+|+|. +..++|..+.++.+..- ......++|+..........+. T Consensus 139 -------~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~-~~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~~y~ 210 (492) T protein:vir:94 139 -------RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFK-LFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWD 210 (492) T ss_pred -------cHHHHHHHHHHHHhhCCeEEEEEEecCCCceE-EEEEcccceEEEEcCCCCCceEEEEEEEeeccceeEEEEe Confidence 12345566778899999999999999888864 77899999988775332 1222233443333332233333 Q ss_pred ccceEEEec----------------------ccCC-----CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_012530. 223 ADEMGMFIR----------------------NPRS-----DILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTK 275 (559) Q Consensus 223 ~~evi~~~~----------------------n~~~-----~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~ 275 (559) ...+.++.. |+.. .......|.|-++.....++....+..-..+.+...+.|- T Consensus 211 ~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~ 290 (492) T protein:vir:94 211 KVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELT 290 (492) T ss_pred cCeEEEEEEecCeeeeccccccccccccccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 333333321 1110 0001124777777766666666655555556666666675 Q ss_pred eEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc-CCceeeeeccccchhHHHHHHHHHHHHHHHHhCCCH Q lcl|NC_012530. 276 GILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT-AEDAKFVSMTQAEDMQFQSWLNYLINIICALVAMDP 354 (559) Q Consensus 276 gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~-~g~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp 354 (559) .++. +. +.+....++..+. .+++..+. +++++|..... .+..+....+...+.|+..-++|. T Consensus 291 lv~~--g~------~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~ 353 (492) T protein:vir:94 291 YVLK--NY------DDQELPEFKRLLR--------YYGAIKVSDNGGVDTIQVEV-PVENSKKYLDELYQKIMLFGQAVD 353 (492) T ss_pred eeee--cC------CcccchhhHHHHh--------hccceecCCCCcceeEeccC-CHHHHHHHHHHHHHHHHHHhCCcC Confidence 5553 21 1111112222221 11222332 33455543222 233456667777888888888885 Q ss_pred HHhccccccccccccccchh---hhh---HHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHH Q lcl|NC_012530. 355 AEIGMQNRGGATGNKSNSLN---ESN---NQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQD 428 (559) Q Consensus 355 ~~lg~~~~~~~~~~~~~~~~---~an---~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~ 428 (559) .-.+-. +++.++... +.. -.......+..+|+-+++.|...++. ......+.+.|+.....+..+ T Consensus 354 ~~~~~~-----~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~----~~~~~~i~v~f~~~~p~~~~e 424 (492) T protein:vir:94 354 FSSDKF-----GSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI----KGEHKDVDISFNYNKVANTEL 424 (492) T ss_pred CCcccc-----ccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC----CcccceeeEEecCCCCCCHHH Confidence 322211 111111100 000 01112233344444444444433321 123456788898888888888 Q ss_pred HHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCcc Q lcl|NC_012530. 429 KLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPS 508 (559) Q Consensus 429 ~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (559) .++.+... .|+++..-++++++.-+-+ ...+..+.+ +........+.. .+....+++.++ . T Consensus 425 ~~~~~~kl-~giiS~et~~~~l~~v~d~----------~~E~eri~~------E~~~~~~~~~~~-~~~~~~~~~~~~-~ 485 (492) T protein:vir:94 425 QVQTAQQS-MGIVSHETVLENHPFVEDL----------QAELERIEQ------EQMEYNKQLPNL-DDGGADSAQQQE-R 485 (492) T ss_pred HHHHHHHH-hccCchHHHHHhCCCCCCH----------HHHHHHHHH------HHHHHHhhcccc-ccccCCCCcccc-C Confidence 88877665 3667877777777542210 011111111 000000000000 000000000000 0 Q ss_pred ccccchh Q lcl|NC_012530. 509 SSNSFQQ 515 (559) Q Consensus 509 ~~~~~~~ 515 (559) +.+.+.+ T Consensus 486 ~~~~e~e 492 (492) T protein:vir:94 486 SNNKESE 492 (492) T ss_pred CccccCC Confidence 0000000 No 157 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=98.76 E-value=6.2e-08 Score=60.10 Aligned_cols=443 Identities=10% Similarity=0.059 Sum_probs=172.4 Q ss_pred Ccchhhhccc------cccCCcc---hHHHHHHHHHHHHH-------Hhhhhcccccccccccccccccccccccccccc Q lcl|NC_012530. 1 MGIFDRFRTK------FYTDDPN---AFFKHIDSKIANDT-------ASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPK 64 (559) Q Consensus 1 ~~~~~~~~~~------~~~~~~~---~~~~~~~~~~~~~~-------~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~ 64 (559) -+|+.++++. |+-+.++ +....+-++....+ +..=-.|++.....+.. ....... ... T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~YY~g~~~i~~~~~~-----~~~~~~~-~~~ 75 (503) T protein:vir:59 2 ADIYPLGKTHTEELNEIIVESAKEIAEPDTTMIQKLIDEHNPEPLLKGVRYYMCENDIEKKRRT-----YYDAAGQ-QLV 75 (503) T ss_pred cccccCChhhHHhHHHhhhhhhhhccchhHHHHHHHHHhhcHHHHHHHHHHhccccchhhccch-----hcccccc-ccc Confidence 3455544421 1112211 11111111111111 11111222221111100 0000000 000 Q ss_pred CCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCC Q lcl|NC_012530. 65 PSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVD 144 (559) Q Consensus 65 p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~ 144 (559) .. ...+. + ...++...+|+..+.-+. |.+..+...+ .+..+.+..|+.+ T Consensus 76 ~~--~~~~~----r--i~~n~~~~ivd~~~~yl~-----------g~~~~~~~~d--------~~~~~~l~~~~~n---- 124 (503) T protein:vir:59 76 DD--TKTNN----R--TSHAWHKLFVDQKTQYLV-----------GEPVTFTSDN--------KTLLEYVNELADD---- 124 (503) T ss_pred cc--ccccc----e--eecchHHHHHHHHHhhhh-----------cCCeeeccCc--------HHHHHHHHHHHhc---- Confidence 00 00000 0 123455566666665443 2222222111 1112233333321 Q ss_pred CCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcc-cccccceEEEEEecCc-----ee Q lcl|NC_012530. 145 YSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHG-HRRTRGKIYRQYIDNK-----VR 218 (559) Q Consensus 145 ~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g-~~~~~~~~y~~~~~~~-----~~ 218 (559) .+......+..+.+.+|.+|..+.+|.+|++. +..++|..+.++.++.. ......++|+...... .. T Consensus 125 ------~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~-i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~ 197 (503) T protein:vir:59 125 ------DFDDILNETVKNMSNKGIEYWHPFVDEEGEFD-YVIFPAEEMIVVYKDNTRRDILFALRYYSYKGIMGEETQKA 197 (503) T ss_pred ------CHHHHHHHHHHHHhhCCeEEEEEeecCCCceE-EEEEccceeEEEEeCCCCCceEEEEEEEEEecCCCceEEEE Confidence 23456666788999999999999999988865 88899999988776532 2222233333322211 11 Q ss_pred eeecccceEEEecc-------------------------------cCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012530. 219 GSFTADEMGMFIRN-------------------------------PRSDILSGGYGLSELEMGLREFISHENTELFNDRF 267 (559) Q Consensus 219 ~~~~~~evi~~~~n-------------------------------~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~ 267 (559) ..+.+..+.++... |.........|.|-++.+...++....+..-..+. T Consensus 198 evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~~~nn~~~~sd~~~~~~liDa~d~~~s~~~~~ 277 (503) T protein:vir:59 198 ELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQAIGWGRVPIIPFKNNEEMVSDLKFYKDLIDNYDSITSSTMDS 277 (503) T ss_pred EEEeCCcEEEEEEcCCcccccccccccccccceeecceeccCCccceEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHH Confidence 23333333332210 00011122347776666666666555444444455 Q ss_pred HHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc-cCCceeeeeccccchhHHHHHHHHHHHHH Q lcl|NC_012530. 268 FTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI-TAEDAKFVSMTQAEDMQFQSWLNYLINII 346 (559) Q Consensus 268 f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl-~~g~~~~~~ls~~~D~qf~e~~~~~~~~I 346 (559) +...+.|-.++. +.. ..+ ..+ +...+. .+++..+ ..++++|.....+. ..+....+...+.| T Consensus 278 ~~~~~~~~~v~~--g~~-~~~-~~~----~~~~~~--------~~~~~~~~~~~~~~~l~~~~~~-~~~~~~~~~l~~~i 340 (503) T protein:vir:59 278 FSDFQQIVYVLK--NYD-GEN-PKE----FTANLR--------YHSVIKVSGDGGVDTLRAEIPV-DSAAKELERIQDEL 340 (503) T ss_pred HHHhcCCeeEee--cCC-ccc-cch----hhhhhh--------cccceeccCCCcceeEeccCCH-HHHHHHHHHHHHHH Confidence 566666655543 211 111 111 111111 1122223 23345554333332 33455555666666 Q ss_pred HHHhCCCHHHhccccccccccccccchhhhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhcccc-ccCccceeeecchh Q lcl|NC_012530. 347 CALVAMDPAEIGMQNRGGATGNKSNSLNESNN---QNKIDASKSKGLMPLLDMIAKNLTNGIIRQ-ILGDNYMLEFVGGD 422 (559) Q Consensus 347 a~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~-~~~~~~~~~f~~l~ 422 (559) ...-++|..-.+... +..++. ....-.... .......+...|+-++..|...++..--.. .....+.+.|.... T Consensus 341 ~~~s~~p~~~~~~~~-~~~Sg~-Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~~i~i~f~~~~ 418 (503) T protein:vir:59 341 YKSAQAVDNSPETIG-GGATGP-ALENLYALLDLKANMAERKIRAGLRLFFWFFAEYLRNTGKGDFNPDKELTMTFTRTR 418 (503) T ss_pred HHHhcccCCCccccc-ccccHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccceeEEeCCCC Confidence 666666532111100 100000 000000111 112233444555555555544444321111 12235788899899 Q ss_pred hhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCC Q lcl|NC_012530. 423 TRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGT 501 (559) Q Consensus 423 ~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (559) ..|..+.++.+..++. |+|+...+.++++. ++ | + ...+..+.+ +........ . ....+ T Consensus 419 p~d~~~~~~~~~kl~~~GiiS~et~l~~l~~--v~--d----~--~~E~~ri~~------E~~~~~~~~--~---~~~~~ 477 (503) T protein:vir:59 419 IQNDSEIVQSLVQGVTGGIMSKETAVARNPF--VQ--D----P--EEELARIEE------EMNQYAEMQ--G---NLLDD 477 (503) T ss_pred CCCHHHHHHHHHHHHhCCCCchHHHHHhCCC--CC--C----H--HHHHHHHHH------HHHHHHhhh--c---cccCc Confidence 9999999998888875 55788778877654 22 1 0 011111111 000000000 0 00000 Q ss_pred CCCCCccccccchhccccccccccccccccccccccccc Q lcl|NC_012530. 502 PPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDG 540 (559) Q Consensus 502 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 540 (559) ++ .. ..++++++..+ .+.+.++|+.. T Consensus 478 ~~----~~--~~~~~~~~~~~-------~~~~~~~g~~~ 503 (503) T protein:vir:59 478 EG----GD--DDLEEDDPNAG-------AAESGGAGQVS 503 (503) T ss_pred cC----CC--CCCCcCCCCCC-------cccCCCCCCcC Confidence 00 00 00000000000 00011111111 No 158 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=98.75 E-value=6.5e-08 Score=59.97 Aligned_cols=423 Identities=9% Similarity=0.017 Sum_probs=168.7 Q ss_pred CcchhhhccccccCCcc-----------hHHHHHHHHHHHHH-----Hhhhhcccccccccccccccccccccccccccc Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPN-----------AFFKHIDSKIANDT-----ASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPK 64 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~-----~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~ 64 (559) .++.++=.++-.++++- +.|..+-.....+. +..=-.|++....++.. ..........+ T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~-----~~~~~~~~~~~ 76 (474) T protein:vir:95 2 INIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYK-----QDLHGNIDYTK 76 (474) T ss_pred cccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccch-----hhhcccccccc Confidence 34444433333332222 22222211111110 11111222211111100 00000000000 Q ss_pred CCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCC Q lcl|NC_012530. 65 PSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVD 144 (559) Q Consensus 65 p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~ 144 (559) + .++ ...+....+|+..+.-+.+ .+..+...+ .+..+.+..|+.+ T Consensus 77 ~--~~k----------i~~n~~k~Iv~~~~~yl~g-----------~p~~~~~~~--------~~~~~~l~~~~~n---- 121 (474) T protein:vir:95 77 P--DWR----------ITTNFHQNLVDQKVSYVAG-----------KPVTYAHDD--------DKVLDVIHQVLDT---- 121 (474) T ss_pred c--ccc----------cccchHHHHHHhhhhhhcc-----------cCceeccCC--------hHHHHHHHHHHhc---- Confidence 0 000 0123344445444433322 122221111 1122334444421 Q ss_pred CCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcc-cccccceEEEEEecCceeeeecc Q lcl|NC_012530. 145 YSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHG-HRRTRGKIYRQYIDNKVRGSFTA 223 (559) Q Consensus 145 ~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g-~~~~~~~~y~~~~~~~~~~~~~~ 223 (559) .+......+..+++.+|.+|..+.++.+|.+ .+..++|..+.++.+..- ......++++..........+.. T Consensus 122 ------~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~ 194 (474) T protein:vir:95 122 ------RWDNKLIDILTAASNKGIDWLQVYINEDGEL-KLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTA 194 (474) T ss_pred ------cHHHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeC Confidence 2345566678899999999999999988876 477789999988875431 12222233333222222333444 Q ss_pred cceEEEecc----------------------c-----CCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCce Q lcl|NC_012530. 224 DEMGMFIRN----------------------P-----RSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKG 276 (559) Q Consensus 224 ~evi~~~~n----------------------~-----~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~g 276 (559) ..+.++... + .-.......|.|-++.....++....+..-..+.+...+.|-. T Consensus 195 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~l 274 (474) T protein:vir:95 195 ETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIY 274 (474) T ss_pred CeEEEEEEcCCceeeccccccccccCcccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchh Confidence 444433210 0 0000112346666666555555544444444444455555544 Q ss_pred EEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc-cCCceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHH Q lcl|NC_012530. 277 ILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI-TAEDAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPA 355 (559) Q Consensus 277 il~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl-~~g~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~ 355 (559) +++ +. ..+....+...+. ..++..+ .+++++|..... .+..+....+...+.|...-++|.. T Consensus 275 v~~--g~------~~~~~~~~~~~~~--------~~~~i~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~ 337 (474) T protein:vir:95 275 ILR--GY------EGEDLSEFMEGLK--------YYKAINVSSDGGVETIQVEV-PVASTKEYLDMMRAYIVEFGQGVDF 337 (474) T ss_pred hhc--CC------Ccccccchhhhhh--------ccceeeccCCCceeEEeccC-CHHHHHHHHHHHHHHHHHHhCCcCc Confidence 442 21 1111111222221 1223223 234455544332 3445666777788888888888853 Q ss_pred Hhccccccccccccccchh--hhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHH Q lcl|NC_012530. 356 EIGMQNRGGATGNKSNSLN--ESNN---QNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKL 430 (559) Q Consensus 356 ~lg~~~~~~~~~~~~~~~~--~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~ 430 (559) -.. +..++.++...- +... ....+..+...|+-+++.|...+.. ......+.+.|+.....+..+.+ T Consensus 338 ~~~----~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~----~~d~~~i~i~f~~~~p~~~~e~a 409 (474) T protein:vir:95 338 QTD----KFGSATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKI----KLDAKEIEITFNFNVMVNDLEQS 409 (474) T ss_pred ccc----ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CcccceeeEEecCCCccCHHHHH Confidence 221 111111110000 0111 1112234444555555544432221 12345678888888888888887 Q ss_pred HHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCcccc Q lcl|NC_012530. 431 KSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSS 510 (559) Q Consensus 431 ~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (559) +.+.. .|+|+.--++++++.- + |. ...+..+. .+.............+.++..+++.++.+. T Consensus 410 ~~~~~--~giiS~et~~~~lp~v--~--D~------~~E~eri~------~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 471 (474) T protein:vir:95 410 QIGAQ--SQYLSKETLVRHHPWV--D--DP------KAELERLD------EEQLELNKQLPNLDDGGADGAQQQQQSENN 471 (474) T ss_pred HHHHH--cCCCChHHHHHhCCCC--C--CH------HHHHHHHH------HHHHHHHhhccccccccCCCCCCcCCCCcc Confidence 76543 4778888888777542 1 10 01111111 011111011111111111111111111111 Q ss_pred ccc Q lcl|NC_012530. 511 NSF 513 (559) Q Consensus 511 ~~~ 513 (559) +++ T Consensus 472 e~~ 474 (474) T protein:vir:95 472 QSK 474 (474) T ss_pred ccC Confidence 111 No 159 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=98.75 E-value=6.5e-08 Score=59.97 Aligned_cols=423 Identities=9% Similarity=0.017 Sum_probs=168.7 Q ss_pred CcchhhhccccccCCcc-----------hHHHHHHHHHHHHH-----Hhhhhcccccccccccccccccccccccccccc Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPN-----------AFFKHIDSKIANDT-----ASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPK 64 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~-----~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~ 64 (559) .++.++=.++-.++++- +.|..+-.....+. +..=-.|++....++.. ..........+ T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~-----~~~~~~~~~~~ 76 (474) T protein:vir:96 2 INIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYK-----QDLHGNIDYTK 76 (474) T ss_pred cccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccch-----hhhcccccccc Confidence 34444433333332222 22222211111110 11111222211111100 00000000000 Q ss_pred CCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCC Q lcl|NC_012530. 65 PSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVD 144 (559) Q Consensus 65 p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~ 144 (559) + .++ ...+....+|+..+.-+.+ .+..+...+ .+..+.+..|+.+ T Consensus 77 ~--~~k----------i~~n~~k~Iv~~~~~yl~g-----------~p~~~~~~~--------~~~~~~l~~~~~n---- 121 (474) T protein:vir:96 77 P--DWR----------ITTNFHQNLVDQKVSYVAG-----------KPVTYAHDD--------DKVLDVIHQVLDT---- 121 (474) T ss_pred c--ccc----------cccchHHHHHHhhhhhhcc-----------cCceeccCC--------hHHHHHHHHHHhc---- Confidence 0 000 0123344445444433322 122221111 1122334444421 Q ss_pred CCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcc-cccccceEEEEEecCceeeeecc Q lcl|NC_012530. 145 YSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHG-HRRTRGKIYRQYIDNKVRGSFTA 223 (559) Q Consensus 145 ~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g-~~~~~~~~y~~~~~~~~~~~~~~ 223 (559) .+......+..+++.+|.+|..+.++.+|.+ .+..++|..+.++.+..- ......++++..........+.. T Consensus 122 ------~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~ 194 (474) T protein:vir:96 122 ------RWDNKLIDILTAASNKGIDWLQVYINEDGEL-KLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTA 194 (474) T ss_pred ------cHHHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeC Confidence 2345566678899999999999999988876 477789999988875431 12222233333222222333444 Q ss_pred cceEEEecc----------------------c-----CCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCce Q lcl|NC_012530. 224 DEMGMFIRN----------------------P-----RSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKG 276 (559) Q Consensus 224 ~evi~~~~n----------------------~-----~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~g 276 (559) ..+.++... + .-.......|.|-++.....++....+..-..+.+...+.|-. T Consensus 195 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~l 274 (474) T protein:vir:96 195 ETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIY 274 (474) T ss_pred CeEEEEEEcCCceeeccccccccccCcccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchh Confidence 444433210 0 0000112346666666555555544444444444455555544 Q ss_pred EEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc-cCCceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHH Q lcl|NC_012530. 277 ILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI-TAEDAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPA 355 (559) Q Consensus 277 il~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl-~~g~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~ 355 (559) +++ +. ..+....+...+. ..++..+ .+++++|..... .+..+....+...+.|...-++|.. T Consensus 275 v~~--g~------~~~~~~~~~~~~~--------~~~~i~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~ 337 (474) T protein:vir:96 275 ILR--GY------EGEDLSEFMEGLK--------YYKAINVSSDGGVETIQVEV-PVASTKEYLDMMRAYIVEFGQGVDF 337 (474) T ss_pred hhc--CC------Ccccccchhhhhh--------ccceeeccCCCceeEEeccC-CHHHHHHHHHHHHHHHHHHhCCcCc Confidence 442 21 1111111222221 1223223 234455544332 3445666777788888888888853 Q ss_pred Hhccccccccccccccchh--hhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHH Q lcl|NC_012530. 356 EIGMQNRGGATGNKSNSLN--ESNN---QNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKL 430 (559) Q Consensus 356 ~lg~~~~~~~~~~~~~~~~--~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~ 430 (559) -.. +..++.++...- +... ....+..+...|+-+++.|...+.. ......+.+.|+.....+..+.+ T Consensus 338 ~~~----~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~----~~d~~~i~i~f~~~~p~~~~e~a 409 (474) T protein:vir:96 338 QTD----KFGSATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKI----KLDAKEIEITFNFNVMVNDLEQS 409 (474) T ss_pred ccc----ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CcccceeeEEecCCCccCHHHHH Confidence 221 111111110000 0111 1112234444555555544432221 12345678888888888888887 Q ss_pred HHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCcccc Q lcl|NC_012530. 431 KSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSS 510 (559) Q Consensus 431 ~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (559) +.+.. .|+|+.--++++++.- + |. ...+..+. .+.............+.++..+++.++.+. T Consensus 410 ~~~~~--~giiS~et~~~~lp~v--~--D~------~~E~eri~------~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 471 (474) T protein:vir:96 410 QIGAQ--SQYLSKETLVRHHPWV--D--DP------KAELERLD------EEQLELNKQLPNLDDGGADGAQQQQQSENN 471 (474) T ss_pred HHHHH--cCCCChHHHHHhCCCC--C--CH------HHHHHHHH------HHHHHHHhhccccccccCCCCCCcCCCCcc Confidence 76543 4778888888777542 1 10 01111111 011111011111111111111111111111 Q ss_pred ccc Q lcl|NC_012530. 511 NSF 513 (559) Q Consensus 511 ~~~ 513 (559) +++ T Consensus 472 e~~ 474 (474) T protein:vir:96 472 QSK 474 (474) T ss_pred ccC Confidence 111 No 160 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=98.75 E-value=4.8e-08 Score=60.70 Aligned_cols=413 Identities=10% Similarity=0.058 Sum_probs=174.8 Q ss_pred Ccchhhhccccc--------------cCC--cc---hHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccc Q lcl|NC_012530. 1 MGIFDRFRTKFY--------------TDD--PN---AFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSI 61 (559) Q Consensus 1 ~~~~~~~~~~~~--------------~~~--~~---~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~ 61 (559) |++++|.+.=|. .+. |+ +++.++++-...-. |+... .-++. T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~------g~~~~------------~~~~~-- 60 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYK------SDWDS------------VLYLN-- 60 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhc------CCCCC------------ccccc-- Confidence 999998741110 010 11 11111111111000 00000 00000 Q ss_pred cccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhc Q lcl|NC_012530. 62 VPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERM 141 (559) Q Consensus 62 ~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~ 141 (559) ..+.... +..........+++..|+-|..-+ -.|... ..+..+.+.++|.. T Consensus 61 -~~~~~~~--------~~~~slnl~~~i~~~~A~lv~~e~-----------~~i~~~--------d~~~~~~l~~il~~- 111 (500) T protein:vir:98 61 -TDGETKK--------RDLNHLPIARTAAKKIASLVFNEQ-----------AEIKVD--------DDAANEFISETLKN- 111 (500) T ss_pred -CCCCccc--------CceeecchHHHHHHHHhhhhcCCc-----------ceEecC--------ChHHHHHHHHHHhh- Confidence 0000000 000111333445555554443211 011111 11223344444432 Q ss_pred CCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccc-------------cceE Q lcl|NC_012530. 142 GVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRT-------------RGKI 208 (559) Q Consensus 142 ~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~-------------~~~~ 208 (559) ..|...+...+.+.+..|.+++-+..|. |. +.+..++|..+.++....+.+.. ...+ T Consensus 112 --------n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~-~~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~ 181 (500) T protein:vir:98 112 --------DRFNKNFERYLESCLALGGLAMRPYVDG-DK-VRVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKTINGKEVY 181 (500) T ss_pred --------ccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cc-eEEEEEcCCeeEEEEEcCCCeEEEEEEEEEeeeecCCceE Confidence 2456667778888899999999888874 44 34777888888775433222110 1111 Q ss_pred EEE-----EecCce-e---eeec--------------------ccc----------eEEEecccC--CCccCCcccccHH Q lcl|NC_012530. 209 YRQ-----YIDNKV-R---GSFT--------------------ADE----------MGMFIRNPR--SDILSGGYGLSEL 247 (559) Q Consensus 209 y~~-----~~~~~~-~---~~~~--------------------~~e----------vi~~~~n~~--~~~~~~~~G~Spl 247 (559) |.. ..++.. . ..|. +.+ ..|++ +|. .-..+.++|+|.+ T Consensus 182 yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~p~f~~~~-~~~~N~~~~~sp~G~S~~ 260 (500) T protein:vir:98 182 YTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTRPIFTYLK-TPGMNNKDINSPLGLSIF 260 (500) T ss_pred EEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEeccCCCccEEEec-CCccccccCCCccCCchh Confidence 110 001110 0 0000 011 11222 221 1122467899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCceE-----EEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCce Q lcl|NC_012530. 248 EMGLREFISHENTELFNDRFFTHGGTTKGI-----LLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDA 322 (559) Q Consensus 248 ~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi-----l~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~ 322 (559) .-+...|......-.-..+-|+-|.. ..+ |........++..+...-.+. +..|.+ +..-.+++. T Consensus 261 ~~~~~lid~lD~~~s~~~~e~~~g~~-~i~v~~~~l~~~~~~~~g~~~~~~~~d~~---~~~~~~------~~~~~~~~~ 330 (500) T protein:vir:98 261 DNAKTTIDFINTTYDEFMWEVKMGQR-RVAVPESLTALTVRTTDGDVVPRPRFESD---QNVYIR------MGGRDLDSS 330 (500) T ss_pred hhhHHHHHHHHHHHHHHHHHHHhCcc-eeeechHHhcccCCCCCccccCCcccCCC---cceEEE------cCCCCCcCc Confidence 99998888777666656666776533 222 211111000000000000000 000110 011112223 Q ss_pred eeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccch---hhhhHHHHHHHHHHHHhhHHHHHHH Q lcl|NC_012530. 323 KFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSL---NESNNQNKIDASKSKGLMPLLDMIA 398 (559) Q Consensus 323 ~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~---~~an~~~~~~~~~~~~l~P~~~~ie 398 (559) .++.++. -.+-++.+..+...++|+...|++|..+|+...+..++.+..+. .+... ...+..++.+|.-++..|- T Consensus 331 ~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~-~~~~~~~~~al~~lv~~il 409 (500) T protein:vir:98 331 AIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMR-NSIVALVEQSLKELVISIF 409 (500) T ss_pred ceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHH Confidence 3444432 14568889999999999999999999999876554333221110 11111 1233445566666666654 Q ss_pred HHHHh-hcccc--ccCccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHh-CCCCCCCCCEeeccceecccccc Q lcl|NC_012530. 399 KNLTN-GIIRQ--ILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQ-GLPKIAGGDIILSAVYIQRLGQQ 473 (559) Q Consensus 399 ~~ln~-~L~~~--~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~-gl~pi~gGD~~~~~~~~~~l~~~ 473 (559) ...+. .++.. .....+.++|+.....|..+.++....++. |.|+.-+++.++ |++.-+ ....+ T Consensus 410 ~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~~g~~eee------------a~~~l 477 (500) T protein:vir:98 410 EIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKVLNVTEEK------------AQEIA 477 (500) T ss_pred HHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhcCCCCHHH------------HHHHH Confidence 43321 12221 123346778887778888887777766665 568999887544 442100 00000 Q ss_pred cccccccccccccccccccccCCCCCCCCCCCCccccccchh Q lcl|NC_012530. 474 EQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQ 515 (559) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 515 (559) . ... .+.....+. ..+. .+..++ T Consensus 478 ~---~i~-~E~~~~~~~----------~~~~-----~~~~g~ 500 (500) T protein:vir:98 478 A---EIN-TGIVDEINQ----------QRTD-----THLYGE 500 (500) T ss_pred H---HHH-HhccccCCC----------CCcc-----ccccCC Confidence 0 000 000000000 0000 000000 No 161 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=98.75 E-value=4.8e-08 Score=60.70 Aligned_cols=413 Identities=10% Similarity=0.058 Sum_probs=174.8 Q ss_pred Ccchhhhccccc--------------cCC--cc---hHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccc Q lcl|NC_012530. 1 MGIFDRFRTKFY--------------TDD--PN---AFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSI 61 (559) Q Consensus 1 ~~~~~~~~~~~~--------------~~~--~~---~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~ 61 (559) |++++|.+.=|. .+. |+ +++.++++-...-. |+... .-++. T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~------g~~~~------------~~~~~-- 60 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYK------SDWDS------------VLYLN-- 60 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhc------CCCCC------------ccccc-- Confidence 999998741110 010 11 11111111111000 00000 00000 Q ss_pred cccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhc Q lcl|NC_012530. 62 VPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERM 141 (559) Q Consensus 62 ~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~ 141 (559) ..+.... +..........+++..|+-|..-+ -.|... ..+..+.+.++|.. T Consensus 61 -~~~~~~~--------~~~~slnl~~~i~~~~A~lv~~e~-----------~~i~~~--------d~~~~~~l~~il~~- 111 (500) T protein:vir:30 61 -TDGETKK--------RDLNHLPIARTAAKKIASLVFNEQ-----------AEIKVD--------DDAANEFISETLKN- 111 (500) T ss_pred -CCCCccc--------CceeecchHHHHHHHHhhhhcCCc-----------ceEecC--------ChHHHHHHHHHHhh- Confidence 0000000 000111333445555554443211 011111 11223344444432 Q ss_pred CCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccc-------------cceE Q lcl|NC_012530. 142 GVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRT-------------RGKI 208 (559) Q Consensus 142 ~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~-------------~~~~ 208 (559) ..|...+...+.+.+..|.+++-+..|. |. +.+..++|..+.++....+.+.. ...+ T Consensus 112 --------n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~-~~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~ 181 (500) T protein:vir:30 112 --------DRFNKNFERYLESCLALGGLAMRPYVDG-DK-VRVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKTINGKEVY 181 (500) T ss_pred --------ccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cc-eEEEEEcCCeeEEEEEcCCCeEEEEEEEEEeeeecCCceE Confidence 2456667778888899999999888874 44 34777888888775433222110 1111 Q ss_pred EEE-----EecCce-e---eeec--------------------ccc----------eEEEecccC--CCccCCcccccHH Q lcl|NC_012530. 209 YRQ-----YIDNKV-R---GSFT--------------------ADE----------MGMFIRNPR--SDILSGGYGLSEL 247 (559) Q Consensus 209 y~~-----~~~~~~-~---~~~~--------------------~~e----------vi~~~~n~~--~~~~~~~~G~Spl 247 (559) |.. ..++.. . ..|. +.+ ..|++ +|. .-..+.++|+|.+ T Consensus 182 yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~p~f~~~~-~~~~N~~~~~sp~G~S~~ 260 (500) T protein:vir:30 182 YTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTRPIFTYLK-TPGMNNKDINSPLGLSIF 260 (500) T ss_pred EEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEeccCCCccEEEec-CCccccccCCCccCCchh Confidence 110 001110 0 0000 011 11222 221 1122467899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCceE-----EEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCce Q lcl|NC_012530. 248 EMGLREFISHENTELFNDRFFTHGGTTKGI-----LLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDA 322 (559) Q Consensus 248 ~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi-----l~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~ 322 (559) .-+...|......-.-..+-|+-|.. ..+ |........++..+...-.+. +..|.+ +..-.+++. T Consensus 261 ~~~~~lid~lD~~~s~~~~e~~~g~~-~i~v~~~~l~~~~~~~~g~~~~~~~~d~~---~~~~~~------~~~~~~~~~ 330 (500) T protein:vir:30 261 DNAKTTIDFINTTYDEFMWEVKMGQR-RVAVPESLTALTVRTTDGDVVPRPRFESD---QNVYIR------MGGRDLDSS 330 (500) T ss_pred hhhHHHHHHHHHHHHHHHHHHHhCcc-eeeechHHhcccCCCCCccccCCcccCCC---cceEEE------cCCCCCcCc Confidence 99998888777666656666776533 222 211111000000000000000 000110 011112223 Q ss_pred eeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccch---hhhhHHHHHHHHHHHHhhHHHHHHH Q lcl|NC_012530. 323 KFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSL---NESNNQNKIDASKSKGLMPLLDMIA 398 (559) Q Consensus 323 ~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~---~~an~~~~~~~~~~~~l~P~~~~ie 398 (559) .++.++. -.+-++.+..+...++|+...|++|..+|+...+..++.+..+. .+... ...+..++.+|.-++..|- T Consensus 331 ~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~-~~~~~~~~~al~~lv~~il 409 (500) T protein:vir:30 331 AIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMR-NSIVALVEQSLKELVISIF 409 (500) T ss_pred ceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHH Confidence 3444432 14568889999999999999999999999876554333221110 11111 1233445566666666654 Q ss_pred HHHHh-hcccc--ccCccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHh-CCCCCCCCCEeeccceecccccc Q lcl|NC_012530. 399 KNLTN-GIIRQ--ILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQ-GLPKIAGGDIILSAVYIQRLGQQ 473 (559) Q Consensus 399 ~~ln~-~L~~~--~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~-gl~pi~gGD~~~~~~~~~~l~~~ 473 (559) ...+. .++.. .....+.++|+.....|..+.++....++. |.|+.-+++.++ |++.-+ ....+ T Consensus 410 ~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~~g~~eee------------a~~~l 477 (500) T protein:vir:30 410 EIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKVLNVTEEK------------AQEIA 477 (500) T ss_pred HHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhcCCCCHHH------------HHHHH Confidence 43321 12221 123346778887778888887777766665 568999887544 442100 00000 Q ss_pred cccccccccccccccccccccCCCCCCCCCCCCccccccchh Q lcl|NC_012530. 474 EQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQ 515 (559) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 515 (559) . ... .+.....+. ..+. .+..++ T Consensus 478 ~---~i~-~E~~~~~~~----------~~~~-----~~~~g~ 500 (500) T protein:vir:30 478 A---EIN-TGIVDEINQ----------QRTD-----THLYGE 500 (500) T ss_pred H---HHH-HhccccCCC----------CCcc-----ccccCC Confidence 0 000 000000000 0000 000000 No 162 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=98.74 E-value=6.9e-08 Score=59.83 Aligned_cols=443 Identities=11% Similarity=-0.017 Sum_probs=165.5 Q ss_pred Ccchhhhccc-------cccCCc-c---hHHHHHHHH----HH--HHHHhhhhccccccccccccccccccccccccccc Q lcl|NC_012530. 1 MGIFDRFRTK-------FYTDDP-N---AFFKHIDSK----IA--NDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVP 63 (559) Q Consensus 1 ~~~~~~~~~~-------~~~~~~-~---~~~~~~~~~----~~--~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~ 63 (559) =||=.||... +..+.. - +++..+-.. .. .+.+..=-.|++.....+. .... .. T Consensus 15 ~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~-----~~~~-----~~ 84 (511) T protein:vir:99 15 GNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELT-----RRKE-----EY 84 (511) T ss_pred hhhhhhhhhhhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccC-----cccc-----cc Confidence 2222233211 010110 0 111111110 00 0111211223332211110 0000 00 Q ss_pred cCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCC Q lcl|NC_012530. 64 KPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGV 143 (559) Q Consensus 64 ~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p 143 (559) ++ ..+ ........+|+..+.-+. |.+..+...+ .+..+.+..|+..- T Consensus 85 ~~--~~k----------i~~n~~k~Iv~~~~~yl~-----------g~p~~~~~~d--------~~~~~~l~~~~~~n-- 131 (511) T protein:vir:99 85 MA--DNR----------VAHDYASYISDFINGYFL-----------GNPIQYQDDD--------KDVLEAIEAFNDLN-- 131 (511) T ss_pred cC--cce----------eecchHHHHHHHHHhhhc-----------ccCceeecCc--------hHHHHHHHHHHhhc-- Confidence 00 001 012333344444443322 1111221111 11223455554431 Q ss_pred CCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCccc-ccccceEEEEEe--cCc---- Q lcl|NC_012530. 144 DYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGH-RRTRGKIYRQYI--DNK---- 216 (559) Q Consensus 144 ~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~-~~~~~~~y~~~~--~~~---- 216 (559) .+......+..+++++|.+|.++.+|.+|++ .+..++|..+.++.+.... .....++|+... ++. T Consensus 132 -------~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~-~i~~~~p~~~~~vyd~~~~~~~~~~vr~~~~~~~~~~~~~~ 203 (511) T protein:vir:99 132 -------DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDE 203 (511) T ss_pred -------CHhHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEccceeEEEEcCCCCCceEEEEEEEEeeecccCccce Confidence 2344666778899999999999999988875 4788999999988775431 222233433221 100 Q ss_pred --eeeeecccceEEEecccC----------------------CCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_012530. 217 --VRGSFTADEMGMFIRNPR----------------------SDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGG 272 (559) Q Consensus 217 --~~~~~~~~evi~~~~n~~----------------------~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~ 272 (559) ....+.++.+.+++.... -.......|.|.++.+...|+....+..-..+.+...+ T Consensus 204 ~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~ 283 (511) T protein:vir:99 204 VFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLN 283 (511) T ss_pred EEEEEEEeCCcEEEEEecCCccccccccccccccCCCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhh Confidence 112344444444322100 00011124666666665555544444444344444444 Q ss_pred CCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhC Q lcl|NC_012530. 273 TTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVA 351 (559) Q Consensus 273 ~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fg 351 (559) .|-.++.-.. ..+.+....+++.-.-......-.....+-..++.+...++.+ .+..+....+...+.|+..-+ T Consensus 284 ~~~lv~~G~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~ 358 (511) T protein:vir:99 284 DAMLLIKGNL-----NLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTN 358 (511) T ss_pred chhhhhccCc-----ccCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhC Confidence 4443332111 1111222111110000000000000001111223344444422 234566777888889999989 Q ss_pred CCHHHhccccccccccccccchhhhh---HHHHHHHHHHHHhhHHHHHHHHHHHhhcc--ccccCccceeeecchhhhhH Q lcl|NC_012530. 352 MDPAEIGMQNRGGATGNKSNSLNESN---NQNKIDASKSKGLMPLLDMIAKNLTNGII--RQILGDNYMLEFVGGDTRSQ 426 (559) Q Consensus 352 VPp~~lg~~~~~~~~~~~~~~~~~an---~~~~~~~~~~~~l~P~~~~ie~~ln~~L~--~~~~~~~~~~~f~~l~~~d~ 426 (559) +|..-.+-...+ .++. ....-+.. -....+..+..+|.-.++.|...+...-- .......+.+.|......+. T Consensus 359 ~P~~~~~~~~gn-~Sg~-Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n~ 436 (511) T protein:vir:99 359 TPNMKDDNFSGT-QSGE-AMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDVSKDFNTVRYVYNRNLPKSL 436 (511) T ss_pred Cccccccccccc-chHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCH Confidence 987543211111 0000 00000000 01111233334444444444333332110 11122357888888888888 Q ss_pred HHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCC Q lcl|NC_012530. 427 QDKLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLP 506 (559) Q Consensus 427 ~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (559) .+.++.+..+ .|+++.--+.++++. ++ |. ...+..+............. .........+..+.++. T Consensus 437 ~e~~~~~~kl-~GiiS~et~l~~l~~--v~--D~------~~E~~ri~~E~~~~~~~~~~---~~~~~~~~~~~~~~~~~ 502 (511) T protein:vir:99 437 IEELKAYIDS-GGKISQTTLMSLFSF--FQ--DP------ELEVKKIEEDEKESIKKAQK---NMYQDPRNINDDEQDDS 502 (511) T ss_pred HHHHHHHHHH-hccCCHHHHHHhCCC--CC--CH------HHHHHHHHHHHHHHHHHHhh---cccccCCCCCCCCCCCC Confidence 8888887665 366788777777643 21 10 01111111111100000000 00000000000000000 Q ss_pred ccccccchhccccccccccccc Q lcl|NC_012530. 507 PSSSNSFQQNQEGYTGKDAKPS 528 (559) Q Consensus 507 ~~~~~~~~~~~~~~~~~~~~~~ 528 (559) +... ++.++ T Consensus 503 ~~~~-------------~d~~e 511 (511) T protein:vir:99 503 TKDS-------------IDKKE 511 (511) T ss_pred CcCc-------------ccccC Confidence 0000 00000 No 163 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=98.74 E-value=7.2e-08 Score=59.76 Aligned_cols=440 Identities=10% Similarity=-0.023 Sum_probs=173.0 Q ss_pred Ccchhhhccc-------cccCCcc----hHHHHHHHHH------HHHHHhhhhccccccccccccccccccccccccccc Q lcl|NC_012530. 1 MGIFDRFRTK-------FYTDDPN----AFFKHIDSKI------ANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVP 63 (559) Q Consensus 1 ~~~~~~~~~~-------~~~~~~~----~~~~~~~~~~------~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~ 63 (559) =+|=.||... +..+... +.+..+-... ..+.+..=-.|+++....+.. T Consensus 15 ~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~--------------- 79 (511) T protein:vir:93 15 GNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR--------------- 79 (511) T ss_pred hhhhhhhhhhhCCcccccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCc--------------- Confidence 1111122211 1101100 1111111100 011122222333322111100 Q ss_pred cCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCC Q lcl|NC_012530. 64 KPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGV 143 (559) Q Consensus 64 ~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p 143 (559) . +.....+. + ...+....+|+..+.-+. |.+..+...+ ....+.+..|+.. T Consensus 80 ~-~~~~~~~~----k--i~~n~~k~Iv~~~~~yl~-----------g~p~~~~~~d--------~~~~~~l~~~~~~--- 130 (511) T protein:vir:93 80 R-KEEYMADN----R--VAHDYASYISDFINGYFL-----------GNPIQYQDDD--------KDVLEVIEAFNDL--- 130 (511) T ss_pred C-cccccCcc----e--eecchHHHHHHHHhhhhc-----------ccCeeeccCC--------hHHHHHHHHHHhh--- Confidence 0 00000000 0 112334444444443322 2222222111 1122334444433 Q ss_pred CCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCccc-ccccceEEEEEec--Cc---- Q lcl|NC_012530. 144 DYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGH-RRTRGKIYRQYID--NK---- 216 (559) Q Consensus 144 ~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~-~~~~~~~y~~~~~--~~---- 216 (559) ..+......+..+++++|.+|..+.++.+|++. +..++|..+.++.+.... .....++|+.... +. T Consensus 131 ------n~~~~~~~~~~~~~~~~G~ay~~vy~de~~~~~-i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~~~~~~~~ 203 (511) T protein:vir:93 131 ------NDVESHNRSLGLDLSIYGKAYELMIRNQDDETR-LYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDE 203 (511) T ss_pred ------cCHhHHHHHHHHHHHhcCeeEEEEEeCCCCceE-EEEEccceeEEEEcCCCCCceEEEEEEEEeeeccccccce Confidence 123456677888999999999999999888764 788999999988775432 2223334433211 10 Q ss_pred --eeeeecccceEEEeccc----------------------CCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_012530. 217 --VRGSFTADEMGMFIRNP----------------------RSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGG 272 (559) Q Consensus 217 --~~~~~~~~evi~~~~n~----------------------~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~ 272 (559) ....+.++.+.++.... .-.......|.|-++.+...++....+..-..+.+...+ T Consensus 204 ~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~ 283 (511) T protein:vir:93 204 VFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLN 283 (511) T ss_pred EEEEEEEeCCcEEEEEecCCCccccccccccccccCCCccceEEecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhh Confidence 11234444444432110 000011234777777776666666555555555566556 Q ss_pred CCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhC Q lcl|NC_012530. 273 TTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVA 351 (559) Q Consensus 273 ~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fg 351 (559) .|-.++.-... .+.+.....+....-......-+....+-..++.+...++.+ .+..+....+...+.|...-+ T Consensus 284 ~~~lv~~G~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~ 358 (511) T protein:vir:93 284 DAMLLIKGNLN-----LDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTN 358 (511) T ss_pred CcceeeecCcc-----cCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhC Confidence 66555432111 112222111111000000000000000111223344444422 234456677888889999889 Q ss_pred CCHHHhccccccccccccccchhh--h---hHHHHHHHHHHHHhhHHHHHHHHHHHhhccc--cccCccceeeecchhhh Q lcl|NC_012530. 352 MDPAEIGMQNRGGATGNKSNSLNE--S---NNQNKIDASKSKGLMPLLDMIAKNLTNGIIR--QILGDNYMLEFVGGDTR 424 (559) Q Consensus 352 VPp~~lg~~~~~~~~~~~~~~~~~--a---n~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~--~~~~~~~~~~f~~l~~~ 424 (559) +|..-.+-...+ .++..+-+ . +-....+.++..+|.-.++.|...++..--. ......+.+.|...... T Consensus 359 ~P~~~~~~~~~n----~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~p~ 434 (511) T protein:vir:93 359 TPNMKDDNFSGT----QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPK 434 (511) T ss_pred Cccccccccccc----chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccccceEEeCCCCCC Confidence 987543321111 11100100 0 0111223445555665555555444332111 11233578888888888 Q ss_pred hHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceeccccccccccccccccccccccccc-ccCCCCCCCCC Q lcl|NC_012530. 425 SQQDKLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLE-SALQNPSGTPP 503 (559) Q Consensus 425 d~~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 503 (559) +..+.++.+..+ .|.|+.--++++++.-+ |. ...+..+... ............ ......++.++ T Consensus 435 n~~e~~~~~~kl-~g~iS~et~~~~l~~v~----d~------~~E~~ri~~E----~~~~~~~~~~~~~~~~~~~~~~~~ 499 (511) T protein:vir:93 435 SLIEELKAYIDS-GGKISQTTLMSLFSFFQ----DP------ELEVKKIEED----EKESIKKAQKGIYKDPRDINDDEQ 499 (511) T ss_pred CHHHHHHHHHHH-hccCchHHHHHhCCCCC----CH------HHHHHHHHHH----HHHHHHHHhhhcccCCCCCCCCCC Confidence 888888877665 46678877777764421 10 0111111110 000000000000 00000000000 Q ss_pred CCCccccccchhccc Q lcl|NC_012530. 504 TLPPSSSNSFQQNQE 518 (559) Q Consensus 504 ~~~~~~~~~~~~~~~ 518 (559) +++ .+...++.+ T Consensus 500 ~~~---~~~~~~~~~ 511 (511) T protein:vir:93 500 DDD---TKDTVDKKE 511 (511) T ss_pred CCc---ccccccccC Confidence 000 000000000 No 164 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=98.73 E-value=8e-08 Score=59.49 Aligned_cols=382 Identities=10% Similarity=0.047 Sum_probs=151.7 Q ss_pred CCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcce-----eeec--ccccccChhHHHHHHHHHHHHHh Q lcl|NC_012530. 68 IAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGY-----QVRL--KNGDKPTKEQQKKIDYAERYIER 140 (559) Q Consensus 68 ~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~-----~v~~--~d~~~~~~~~~~~~~~~~~~L~~ 140 (559) .+.- ...++..+++.+....+.+...-.+-.+.-.. .+.. ++..-....-..-.+.+..+| . T Consensus 1 ~~~~----------~~~~i~~l~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l-~ 69 (441) T protein:vir:80 1 MNSD----------ELALIEGMYDRIQRLSSWHCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVDALEERL-D 69 (441) T ss_pred CCcc----------HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHHHHHhhh-c Confidence 0000 01123333333333333221111111111000 0000 000000000000111111111 0 Q ss_pred cCCCCCC---------ChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccc-eEEE Q lcl|NC_012530. 141 MGVDYSP---------IRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRG-KIYR 210 (559) Q Consensus 141 ~~p~~~~---------~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~-~~y~ 210 (559) +...... ...++..+...+..+++++|.+|..+.+|.+|.+ .+.+++|..|.++.+......... .+|+ T Consensus 70 ~~g~~~~d~~~l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~~~~~ 148 (441) T protein:vir:80 70 WLGWTNGDGYGLDGVYAANRLATASCDVHLDALIFGLSFVAIIPHGDGTV-SVRPQSPKNCTGKFSADGSRLDAGLVVQQ 148 (441) T ss_pred cccccCCChHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeCCCCce-EEEEEccceEEEEEeCCCCceeEEEEEEE Confidence 0100000 0123456677788899999999999999999987 478999999988776543222111 1111 Q ss_pred EEecCc-eeeeecccc--------------------------eEEEecccCCCccCCcccccHHH----HHHHHHHHHHH Q lcl|NC_012530. 211 QYIDNK-VRGSFTADE--------------------------MGMFIRNPRSDILSGGYGLSELE----MGLREFISHEN 259 (559) Q Consensus 211 ~~~~~~-~~~~~~~~e--------------------------vi~~~~n~~~~~~~~~~G~Spl~----~~~~~i~~~~~ 259 (559) ...+.. ....+..+. |+|+..++. ....+|.|.|. .+.+++...+. T Consensus 149 ~~~~~~~~~~vy~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~---~~~~~G~s~l~~~v~~liDa~~~~~s 225 (441) T protein:vir:80 149 TCDPEVVEAELLLPDVIVQVERRGSREWVEVDRIPNVLGAVPLVPIVNRRR---TSRIDGRSEITRSIRAYTDEAVRTLL 225 (441) T ss_pred EecCceEEEEEEecCeEEEEEEcCCcceeeccccccCCCceeEEEeecccc---CCccCCcccchhhHHHHHHHHHHHHH Confidence 110000 011122222 233332222 23456877553 34444444444 Q ss_pred HHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccC----CceeeeeccccchhH- Q lcl|NC_012530. 260 TELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITA----EDAKFVSMTQAEDMQ- 334 (559) Q Consensus 260 ~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~----g~~~~~~ls~~~D~q- 334 (559) -......+|. .|--+|. +. . +++... ..|+.. .+++..++. +..++..+.. .+++ T Consensus 226 ~~~~~~~~~~---~~~~~i~--G~-~---~~~~~~----~~~~~~------~~~i~~~~~~~~~~~~~~~~~~~-~~~~~ 285 (441) T protein:vir:80 226 GQSVNRDFYA---YPQRWVT--GV-S---ADEFSQ----PGWVLS------MASVWAVDKDDDGDTPNVGSFPV-NSPTP 285 (441) T ss_pred HHHHHHHhhc---Cceeeee--cC-C---cccccc----chhhhc------ccccccCCCCCCCCcceeEecCc-cchHH Confidence 3333334444 4543442 21 1 111111 112111 122222221 1234433332 3444 Q ss_pred HHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHH---HHHHHHHHHHhhHHHHHHHHHHHhhccccccC Q lcl|NC_012530. 335 FQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQ---NKIDASKSKGLMPLLDMIAKNLTNGIIRQILG 411 (559) Q Consensus 335 f~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~---~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~ 411 (559) |++..+..+..|+..-++|+..+|....+..++ .....-+.... +..+..+..+|+-.+..+...++...-..... T Consensus 286 ~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg-~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~~ 364 (441) T protein:vir:80 286 YSDQMRLLAQLTAGEAAVPERYFGFITSNPPSG-EALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKALDSRVDEADFF 364 (441) T ss_pred HHHHHHHHHHHHhcccCCCHHHhccCCCcchHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccc Confidence 788888999999999999999998644321111 00001111111 11112222233333332322222111111112 Q ss_pred ccceeeecchhhhhHHHHHHHHHHHHcCC-C--CHHHHHHHhCCCCCCCCCEeeccceeccccccccccccccccccccc Q lcl|NC_012530. 412 DNYMLEFVGGDTRSQQDKLKSVQLELQTA-T--TVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRL 488 (559) Q Consensus 412 ~~~~~~f~~l~~~d~~~~~~~~~~~~~~~-~--T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 488 (559) ..+.+.|......+..+.++.+.+++.++ + +..-++..+|+.+-+ +..+.. ...+.++..... T Consensus 365 ~~i~~~f~~~~~~~~~e~ad~~~kl~~~g~~~~s~~~~~~~l~~~~~e-------------~~~~~~-e~~e~~~~~~~~ 430 (441) T protein:vir:80 365 GDVGLRWRDASTPTRAATADAVTKLVGAGILPADSRTVLEMLGLDDVQ-------------VEAVMR-HRAESSDPLAVL 430 (441) T ss_pred eeeeEEeCCCCCcCHHHHHHHHHHHHhcCcccccHHHHHHhCCCCHHH-------------HHHHHH-HHHHHHHHHHHH Confidence 35678888888899999998887776544 3 334467777764321 111110 000000000000 Q ss_pred ccccccCCCCCCCCCCCC Q lcl|NC_012530. 489 TQLESALQNPSGTPPTLP 506 (559) Q Consensus 489 ~~~~~~~~~~~~~~~~~~ 506 (559) .. .....++.- T Consensus 431 ~~-------~~~~~~~~~ 441 (441) T protein:vir:80 431 AG-------AISRQTNEV 441 (441) T ss_pred hh-------hhhcccccC Confidence 00 000000000 No 165 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=98.72 E-value=8.3e-08 Score=59.41 Aligned_cols=436 Identities=10% Similarity=0.042 Sum_probs=181.4 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhh----------hhccccc-cccccccccccccccccccccccCCCCC Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASK----------ALNGVDR-AYTEPVDGNLMFSTLEDTSIVPKPSPIA 69 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~gr~~-a~~~~~~~~~~~~~~~~~~~~~~p~~~~ 69 (559) ++-...|......+.....+.. -++...++... =-.|++. .+..+ .+ .... T Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~-i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~~~~----------------~~-~~~~ 84 (502) T protein:vir:48 23 RESRIRYRADNLEELMVNNWEL-LKNFINHHKLRQAPRIQELLDYARGENHDVLKSG----------------RR-KDNE 84 (502) T ss_pred hhHHhhhcccchhhhccccHHH-HHHHHHHHHHHHHHHHHHHHHHhcCCCccccccc----------------cc-cccc Confidence 4444455555544433333222 22222222211 1112210 00000 00 0000 Q ss_pred cccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCCh Q lcl|NC_012530. 70 FGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIR 149 (559) Q Consensus 70 ~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~ 149 (559) ..+ .+ ....+...+|+..+.-+. |.+..+...+.. ......+.+..++.. T Consensus 85 ~~~----~k--i~~n~~k~Ivd~~~~yl~-----------g~p~~~~~~d~~----~~~~~~~~l~~~~~~--------- 134 (502) T protein:vir:48 85 MAD----KR--AVHNYGRMISKFKTGYLA-----------GNPIRVEYDDNE----DNSQNDDAIKRIGRI--------- 134 (502) T ss_pred ccc----ce--eecchHHHHHHHHhhhhc-----------ccCeeEecCCcc----chhHHHHHHHHHHhh--------- Confidence 000 00 113444445555544332 222233322211 011112223333322 Q ss_pred hhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcc-cccccceEEEEEecCc----eeeeeccc Q lcl|NC_012530. 150 DDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHG-HRRTRGKIYRQYIDNK----VRGSFTAD 224 (559) Q Consensus 150 ~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g-~~~~~~~~y~~~~~~~----~~~~~~~~ 224 (559) ..|......+..+++.+|.+|..+.++.+|.+. +..++|..+.++.+... .....+++|+...... ....+..+ T Consensus 135 N~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~-i~~~~p~~~~~vydd~~~~~~~~~ir~~~~~~~~~~~~~~~iyt~~ 213 (502) T protein:vir:48 135 NDIDTHNRNLIRDLSQTGRAYEVIYRSEYDETR-IKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNQ 213 (502) T ss_pred cCHhHHHHHHHHHHhhcCeEEEEEEeCCCCceE-EEEEcccceEEEEcCCCCCceEEEEEEEEEeecCCcEEEEEEEeCC Confidence 134457778888999999999999999888754 77889999988876532 1222233333221111 11233333 Q ss_pred ceEEEecc----------------cCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCcc Q lcl|NC_012530. 225 EMGMFIRN----------------PRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTN 288 (559) Q Consensus 225 evi~~~~n----------------~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~ 288 (559) .+.++... |.-.......|+|.++.+...|+....+..-..+.+...+.|-.++.-... . T Consensus 214 ~i~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~----~ 289 (502) T protein:vir:48 214 HIYTLDASDSFNEISVTPHAFGTVPITEFLNNADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLA----L 289 (502) T ss_pred eEEEEEeCCceeeccceecCCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcc----c Confidence 33322210 100011233577778777777766666666666666666666555532211 1 Q ss_pred CCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccccch-hHHHHHHHHHHHHHHHHhCCCHHHhcccccccccc Q lcl|NC_012530. 289 TSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQAED-MQFQSWLNYLINIICALVAMDPAEIGMQNRGGATG 367 (559) Q Consensus 289 ~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~~D-~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~ 367 (559) ...+....++... ........+ .-..+++.++.-++.+.+ ..+....+...+.|+..-++|+...+-... T Consensus 290 ~~~~~~~~~~~~~-~~~~~~~~~---~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~----- 360 (502) T protein:vir:48 290 PQGMQASDMKRTR-LMQLKPPKS---ADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNHFSG----- 360 (502) T ss_pred ccccchhhhhhcc-eeecccccc---ccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccccc----- Confidence 1112222222110 000000000 000112233443443222 334556788889999998999754432111 Q ss_pred ccccchhh---hh---HHHHHHHHHHHHhhHHHHHHHHHHHhhcc-ccccCccceeeecchhhhhHHHHHHHHHHHHcCC Q lcl|NC_012530. 368 NKSNSLNE---SN---NQNKIDASKSKGLMPLLDMIAKNLTNGII-RQILGDNYMLEFVGGDTRSQQDKLKSVQLELQTA 440 (559) Q Consensus 368 ~~~~~~~~---an---~~~~~~~~~~~~l~P~~~~ie~~ln~~L~-~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~~ 440 (559) +.++.... .. -....+..+..+|+-.+..+...++..-- .......+.+.|......|..+.++++..+ .|. T Consensus 361 n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl-~g~ 439 (502) T protein:vir:48 361 NASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESRLKITFTPNLPKSLYEQVSILNDL-GGQ 439 (502) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHH-hcc Confidence 11111110 00 01122244555555555555544443211 122334578899988889999998887766 466 Q ss_pred CCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCC----CCCCCCCCCCccccccchhc Q lcl|NC_012530. 441 TTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQ----NPSGTPPTLPPSSSNSFQQN 516 (559) Q Consensus 441 ~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~ 516 (559) ++..-+.++++. ++ |. ...+..+.. ++.+........... ......++..+.+. T Consensus 440 iS~et~l~~l~~--v~--D~------~~E~~ri~~------E~~~~~~~~~~~~~~~~~~~~~d~~~e~~~~~~------ 497 (502) T protein:vir:48 440 VSQETALSLSGL--VE--NP------TEELDKINE------ESSKIDFKGYPSYFYDNVGKYTDEVKETHTDDF------ 497 (502) T ss_pred CcHHHHHHhCCC--CC--CH------HHHHHHHHH------HHHhhhhhcccccccccccccCCCccCCCCcCc------ Confidence 788777777654 21 10 011111110 000000000000000 00000000000000 Q ss_pred cccccccccccc Q lcl|NC_012530. 517 QEGYTGKDAKPS 528 (559) Q Consensus 517 ~~~~~~~~~~~~ 528 (559) +...+ T Consensus 498 -------~~~~~ 502 (502) T protein:vir:48 498 -------ERVYE 502 (502) T ss_pred -------CCCCC Confidence 00000 No 166 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=98.72 E-value=8.3e-08 Score=59.39 Aligned_cols=440 Identities=10% Similarity=-0.015 Sum_probs=171.0 Q ss_pred Ccchhhhccc-------cccCC-cc---hHHHHH-HHHH---H--HHHHhhhhccccccccccccccccccccccccccc Q lcl|NC_012530. 1 MGIFDRFRTK-------FYTDD-PN---AFFKHI-DSKI---A--NDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVP 63 (559) Q Consensus 1 ~~~~~~~~~~-------~~~~~-~~---~~~~~~-~~~~---~--~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~ 63 (559) =||=.||... +..+. .- +.+..+ .+-. . .+.+..=-.|++.....+... . .. . T Consensus 15 ~~~~~~~~~~~n~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~-----~---~~--~ 84 (511) T protein:vir:10 15 GNINYLFNDEANVVYTYDGTESDLLQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR-----K---EE--Y 84 (511) T ss_pred hhhhhhhhhhhcCCccCchhhhhcccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcc-----c---cc--c Confidence 1221222210 00000 00 111111 1000 0 011111122332221111000 0 00 0 Q ss_pred cCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCC Q lcl|NC_012530. 64 KPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGV 143 (559) Q Consensus 64 ~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p 143 (559) ++ ..+ ........+|+..+.-+. |.+..+...+ ....+.+..|+... T Consensus 85 ~~--~~k----------i~~n~~k~Iv~~~~~yl~-----------g~p~~~~~~d--------~~~~~~l~~~~~~n-- 131 (511) T protein:vir:10 85 MA--DNR----------VAHDYASYISDFINGYFL-----------GNPIQYQDDD--------KDVLEAIEAFNDLN-- 131 (511) T ss_pred cC--cce----------eecchHHHHHHHHhhhhc-----------ccCceeecCc--------hHHHHHHHHHHhhc-- Confidence 00 000 112333444544443322 1122222111 11223344444331 Q ss_pred CCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCccc-ccccceEEEEEec--Cc---- Q lcl|NC_012530. 144 DYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGH-RRTRGKIYRQYID--NK---- 216 (559) Q Consensus 144 ~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~-~~~~~~~y~~~~~--~~---- 216 (559) .+......+..+++++|.+|..+.+|.+|.+ .+..++|..+.++.+.... .....++|+.... +. T Consensus 132 -------~~~~~~~~~~~~~~i~G~ay~~vy~dedg~~-~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~ 203 (511) T protein:vir:10 132 -------DVESHNRSLGLDLSIYGKAYEIMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDE 203 (511) T ss_pred -------CHHHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEccceeEEEEcCCCCCceEEEEEEEEeeecccCccce Confidence 2344566778899999999999999988875 4778899999988775542 2222333433211 00 Q ss_pred --eeeeecccceEEEecc----------------------cCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_012530. 217 --VRGSFTADEMGMFIRN----------------------PRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGG 272 (559) Q Consensus 217 --~~~~~~~~evi~~~~n----------------------~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~ 272 (559) ....+.++.+.++... |.-.......|.|-++-+...|+....+..-..+.+...+ T Consensus 204 ~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~ 283 (511) T protein:vir:10 204 VFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLN 283 (511) T ss_pred EEEEEEEeCCcEEEEEecCCCcccccccccccccccCcceeEEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhh Confidence 0123344444433211 0000011124777777666666655555544455556556 Q ss_pred CCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhC Q lcl|NC_012530. 273 TTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVA 351 (559) Q Consensus 273 ~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fg 351 (559) .|-.++.-... .+.+.....++...-............+-.+++.+...++.+ .+..+....+...+.|+..-+ T Consensus 284 ~~~lv~~g~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~ 358 (511) T protein:vir:10 284 DAMLLIKGNLN-----LDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTN 358 (511) T ss_pred Cceeeeecccc-----CCchhhccchhccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhC Confidence 66555432111 122222221111000000000001111112223344444432 334566777888889988889 Q ss_pred CCHHHhccccccccccccccchh--hhh---HHHHHHHHHHHHhhHHHHHHHHHHHhhcc--ccccCccceeeecchhhh Q lcl|NC_012530. 352 MDPAEIGMQNRGGATGNKSNSLN--ESN---NQNKIDASKSKGLMPLLDMIAKNLTNGII--RQILGDNYMLEFVGGDTR 424 (559) Q Consensus 352 VPp~~lg~~~~~~~~~~~~~~~~--~an---~~~~~~~~~~~~l~P~~~~ie~~ln~~L~--~~~~~~~~~~~f~~l~~~ 424 (559) +|..-.+-.. ++.++..+- +.. -....+.++..+|.-.++.|...+...-- .......+.+.|...... T Consensus 359 ~P~~~~~~~~----~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~i~f~~~~p~ 434 (511) T protein:vir:10 359 TPNMKDDNFS----GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPK 434 (511) T ss_pred Cccccccccc----ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccccceeeEEeCCCCCc Confidence 9874332111 111110000 000 01222344445555555555444433211 112234578889888889 Q ss_pred hHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccc-cccccCCCCCCCCC Q lcl|NC_012530. 425 SQQDKLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLT-QLESALQNPSGTPP 503 (559) Q Consensus 425 d~~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 503 (559) |..+.++.+..+. |.++.--+.++++. ++ |. ...+..+.... +....... .........++.++ T Consensus 435 d~~~~~~~~~kl~-G~iS~et~~~~l~~--v~--d~------~~E~~ri~~E~----~~~~~~~~~~~~~~~~~~~~~~~ 499 (511) T protein:vir:10 435 SLIEELKAYIDSG-GKISQTTLMSLFSF--FQ--DP------ELEVKKIEEDE----KESIKKAQKGIYKDPRDINDDEQ 499 (511) T ss_pred CHHHHHHHHHHHh-ccCcHHHHHHhCCC--CC--CH------HHHHHHHHHHH----HHHHHHHhhhcccCCCCCCCCCC Confidence 9999988877663 66787667777643 22 10 01111111110 00000000 00000000000000 Q ss_pred CCCccccccchhcccccccccccccccccccccccc Q lcl|NC_012530. 504 TLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKD 539 (559) Q Consensus 504 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 539 (559) +++ +... .+ +++ T Consensus 500 ~~~-~~~~--~~---------------------~~~ 511 (511) T protein:vir:10 500 DDD-TKDT--VD---------------------KKE 511 (511) T ss_pred CCc-ccCc--cc---------------------ccC Confidence 000 0000 00 000 No 167 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=98.72 E-value=8.6e-08 Score=59.31 Aligned_cols=441 Identities=10% Similarity=-0.027 Sum_probs=172.0 Q ss_pred Ccchhhhccc-------ccc-CCcc---hHHHHHHHHH----H--HHHHhhhhccccccccccccccccccccccccccc Q lcl|NC_012530. 1 MGIFDRFRTK-------FYT-DDPN---AFFKHIDSKI----A--NDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVP 63 (559) Q Consensus 1 ~~~~~~~~~~-------~~~-~~~~---~~~~~~~~~~----~--~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~ 63 (559) =+|=.||... +.. .+.. +.+..+-..- . .+.+..=-.|++.....+.. ... . . T Consensus 15 ~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~-----~~~---~--~ 84 (511) T protein:vir:96 15 GNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR-----RKE---E--Y 84 (511) T ss_pred hhhhhhhhhhhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCc-----Ccc---c--c Confidence 1111222211 000 0000 1111111000 0 01111111233221111000 000 0 0 Q ss_pred cCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCC Q lcl|NC_012530. 64 KPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGV 143 (559) Q Consensus 64 ~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p 143 (559) ++ ..+ ........+|+..+.-+. |.+..+...+ ....+.+..++..- T Consensus 85 ~~--~~k----------i~~n~~k~Iv~~~~~yl~-----------g~p~~~~~~~--------~~~~~~l~~~~~~n-- 131 (511) T protein:vir:96 85 MA--DNR----------VAHDYASYISDFINGYFL-----------GNPIQYQDDD--------KDVLEAIEAFNDLN-- 131 (511) T ss_pred cC--cce----------eecchHHHHHHHHHhhhc-----------cCCceeecCc--------hHHHHHHHHHHhhc-- Confidence 00 000 012333344444443322 2222222111 11223344554331 Q ss_pred CCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCccc-ccccceEEEEEec--Cc---- Q lcl|NC_012530. 144 DYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGH-RRTRGKIYRQYID--NK---- 216 (559) Q Consensus 144 ~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~-~~~~~~~y~~~~~--~~---- 216 (559) .|......+..+++++|.+|..+.+|.+|.+ .+.+++|..+.++.+.... .....++|+.... +. T Consensus 132 -------~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~-~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~ 203 (511) T protein:vir:96 132 -------DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDE 203 (511) T ss_pred -------CHHHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccccccce Confidence 2345667778899999999999999988875 4788999999988765432 1222333433211 00 Q ss_pred --eeeeecccceEEEeccc----------------------CCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_012530. 217 --VRGSFTADEMGMFIRNP----------------------RSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGG 272 (559) Q Consensus 217 --~~~~~~~~evi~~~~n~----------------------~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~ 272 (559) ....+.++.+.++.... .-.......|+|-++-+...|+....+..-..+.+...+ T Consensus 204 ~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~ 283 (511) T protein:vir:96 204 VFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLN 283 (511) T ss_pred EEEEEEEeCCcEEEEEecCCCcccccccccccccccCCceeeEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhh Confidence 01234444444332110 000011225777777766666655555555555556555 Q ss_pred CCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhC Q lcl|NC_012530. 273 TTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVA 351 (559) Q Consensus 273 ~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fg 351 (559) .|-.++.-... .+.+.....++.-.-.......+....+-.+++.++..++.+ .+..+....+...+.|...-+ T Consensus 284 ~~~lv~~g~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~ 358 (511) T protein:vir:96 284 DAMLLIKGNLN-----LDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTN 358 (511) T ss_pred CceeeeecCcc-----CCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhC Confidence 66555432111 111111111110000000000000001111223344444432 234567778888899999999 Q ss_pred CCHHHhccccccccccccccchhh--h---hHHHHHHHHHHHHhhHHHHHHHHHHHhhccc--cccCccceeeecchhhh Q lcl|NC_012530. 352 MDPAEIGMQNRGGATGNKSNSLNE--S---NNQNKIDASKSKGLMPLLDMIAKNLTNGIIR--QILGDNYMLEFVGGDTR 424 (559) Q Consensus 352 VPp~~lg~~~~~~~~~~~~~~~~~--a---n~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~--~~~~~~~~~~f~~l~~~ 424 (559) +|..-.+-... +.++..+-+ . +-.......+..+|.-.++.|...+..+--. ......+.+.|...... T Consensus 359 ~p~~~~~~~~~----n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~ 434 (511) T protein:vir:96 359 TPNMKDDNFSG----TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPK 434 (511) T ss_pred Ccccccccccc----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccccccceEEeCCCCCC Confidence 98754322111 111100000 0 0111222444555555555555444432111 11223578888888888 Q ss_pred hHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCC Q lcl|NC_012530. 425 SQQDKLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPT 504 (559) Q Consensus 425 d~~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (559) |..+.++.+..+ .|+|+.-.+.++++.-+-+ ...+..+................. .....+.++++.+ T Consensus 435 n~~e~~~~~~kl-~G~iS~et~l~~l~~v~D~----------~~E~~ri~~E~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 502 (511) T protein:vir:96 435 SLIEELKAYIDS-GGKISQTTLMSLFSFFQDP----------ELEVKKIEEDEKESIKKAQKGIYK-DPRDINDDEQDDD 502 (511) T ss_pred CHHHHHHHHHHH-hccCChHHHHHhCCCCCCH----------HHHHHHHHHHHHHHHHHHhhcccc-CCCCCCCCCCCCc Confidence 888888877665 4668887787777542210 011111111100000000000000 0000000000000 Q ss_pred CCccccccchhc Q lcl|NC_012530. 505 LPPSSSNSFQQN 516 (559) Q Consensus 505 ~~~~~~~~~~~~ 516 (559) .++...+ ++ T Consensus 503 ~~~~~~~---~~ 511 (511) T protein:vir:96 503 TKDTVDK---KE 511 (511) T ss_pred ccccccc---cC Confidence 0000000 00 No 168 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=98.69 E-value=1e-07 Score=58.87 Aligned_cols=433 Identities=11% Similarity=0.048 Sum_probs=165.0 Q ss_pred hhhhccccccCCcchHH-HHHHHHHHHHHHh-----hhhccccccccccccccccccccccccccccCCCCCcccHHHHH Q lcl|NC_012530. 4 FDRFRTKFYTDDPNAFF-KHIDSKIANDTAS-----KALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVL 77 (559) Q Consensus 4 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~-----~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~ 77 (559) |---+-+.-+.+++... ..|.+....+... .=-.|+++- ...+.... .++. T Consensus 1 ~~~~i~~~~~~~~~~~~~~~L~~~~~~~~~r~~~~~~YY~G~~~i-------------------~~~~~~~~----~~~~ 57 (485) T protein:vir:24 1 MTAPLPGQEEIADPAIARDEMVSAFEDQNQNLRSNTSYYEAERRP-------------------EAIGVTVP----VQMQ 57 (485) T ss_pred CCCCCCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCch-------------------hhcCcccc----hhhh Confidence 11111111122222211 1122222211111 111112110 00000000 0111 Q ss_pred HHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHH Q lcl|NC_012530. 78 RQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLR 157 (559) Q Consensus 78 ~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~ 157 (559) +......+...+|+..++.+. +.||.+- +..+ ....+..++.. ..+..+.. T Consensus 58 ~~~~~~n~~~~ivd~~~~~l~-----------~~g~~~~--~~~~-------~~~~l~~i~~~---------N~~d~~~~ 108 (485) T protein:vir:24 58 SLLAHVGYPRLYVDSIAERQA-----------VEGFRLG--DADE-------ADEELWQWWQA---------NNLDIEAP 108 (485) T ss_pred hhhhccchHHHHHHHHhhhhc-----------cCceecC--CCch-------hHHHHHHHHHh---------cChhHHHH Confidence 111223455555555544331 2234321 1111 11223344332 12345667 Q ss_pred HHHHHHHHcCCcceEEEECCCCcE-------EEEEEecCceEEEEecCcccccccceEEEEEecCce---eeeecccc-- Q lcl|NC_012530. 158 KLVRDTYTYDQVNYENTYDSNGRL-------SHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKV---RGSFTADE-- 225 (559) Q Consensus 158 ~~v~d~ll~Gna~~~i~rd~~G~~-------~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~---~~~~~~~e-- 225 (559) .+..+++++|.+|..+.++..+.+ ..+.+++|..+.++.+..........+++...++.. ...|..+. T Consensus 109 ~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~i~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~ 188 (485) T protein:vir:24 109 LGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIRVEPPTRMYAEIDPRIGRPAKAIRVAYDAEGNEIQAATLYTPNETF 188 (485) T ss_pred HHHHHHhhcCceEEEEecCCcccccccCCCcceEEEeccceeEEEeeCCcCceeEEEEEEEeecCCeEEEEEEEcCCcEE Confidence 888899999999999988765432 257889999988877654322211111111111110 01122222 Q ss_pred -----------------------eEEEecccCCCccCCcccccHHH----HHHHHHHHHHHHHHHHHHHHHhcCCCceEE Q lcl|NC_012530. 226 -----------------------MGMFIRNPRSDILSGGYGLSELE----MGLREFISHENTELFNDRFFTHGGTTKGIL 278 (559) Q Consensus 226 -----------------------vi~~~~n~~~~~~~~~~G~Spl~----~~~~~i~~~~~~~~~~~~~f~ng~~p~gil 278 (559) |++|+.++. ..+.+|.|.++ .+.+++...+.-..-...+| +.|--+| T Consensus 189 ~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~---~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~---a~p~~~i 262 (485) T protein:vir:24 189 GWFRAEGEWVEWFSDPHGLGAVPVVPLPNRTR---LSDLYGTSEITPELRSMTDAAARILMLMQATAELM---GVPQRLI 262 (485) T ss_pred EEEecCCceEeecccccCCCcccEEEeccCcc---cCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhh---cchhhhh Confidence 233332222 23457888664 23344433333222233333 3444443 Q ss_pred EecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccccchh-HHHHHHHHHHHHHHHHhCCCHHHh Q lcl|NC_012530. 279 LVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQAEDM-QFQSWLNYLINIICALVAMDPAEI 357 (559) Q Consensus 279 ~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~~D~-qf~e~~~~~~~~Ia~~fgVPp~~l 357 (559) . +. .......+. +.-...|+. ..+++.+++.++.++..+.. .++ .+++..+..+..++..=++|+..+ T Consensus 263 ~--G~-~~~~~~~~~-~~~~~~~~~------~~~~i~~~~~~~~~~~q~~~-~~~e~~~~~l~~~i~~~s~~~~~p~~~f 331 (485) T protein:vir:24 263 F--GI-KPEEIGVDP-ETGQTLFDA------YLARILAFEDAEGKIQQFSA-AELANFTNALDQIAKQVAAYTGLPPQYL 331 (485) T ss_pred c--cC-Ccccccccc-ccccchhhh------cccceeccCCCCceEEeecc-cchHHHHHHHHHHHHHHhcccCCCHHHh Confidence 2 10 000000000 000111221 12344555556677765543 233 377788888888888889999999 Q ss_pred ccccccccccccccchhhhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHH Q lcl|NC_012530. 358 GMQNRGGATGNKSNSLNESNN---QNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQ 434 (559) Q Consensus 358 g~~~~~~~~~~~~~~~~~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~ 434 (559) |....+..++ .....-+... .+..+..+..+|+-++..+....+. .-.......+.+.|......+..+.++.+. T Consensus 332 g~~~~n~~Sg-~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~-~~~~~d~~~i~v~f~~~~~~s~~~~ad~~~ 409 (485) T protein:vir:24 332 STAADNPASA-EAIRAAESRLIKKVERKNAIFGGAWEEAMRLAYRLMKG-GDVPPDMLRMETVWRDPSTPTYAAKADAAT 409 (485) T ss_pred ccccCcchHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CCCccccceeeEEecCCCCCCHHHHHHHHH Confidence 8533221110 0000011111 1112223333444444433222111 101112346778887777788888888776 Q ss_pred HHHc-C--CCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccc Q lcl|NC_012530. 435 LELQ-T--ATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSN 511 (559) Q Consensus 435 ~~~~-~--~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (559) +++. | .++..-+++++|+.+-+ ...+.......... .....+..... .+...+.+...+ T Consensus 410 kl~~~g~~~~s~et~~~~l~~~~d~----------~~e~~~~~ee~~~~---~~~~~~~~~~~-----~~~~~~~~~~~e 471 (485) T protein:vir:24 410 KLYGNGQGVIPRERARKDMGYSIAE----------REEMRRWDEEEAAM---GLGLLGTMVDA-----DPTVPGSPNPTP 471 (485) T ss_pred HHHhcccccCCHHHHHhhCCCCHhH----------HHHHHHHHHHHhhh---hhhHHHhhccc-----CCCCCCCCCCCC Confidence 6653 3 35666677777774321 00111100000000 00000000000 000000000000 Q ss_pred cchhcccccccccccccccccccccccccc Q lcl|NC_012530. 512 SFQQNQEGYTGKDAKPSGKDNQQGVGKDGQ 541 (559) Q Consensus 512 ~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 541 (559) .++.+ ....+.|++ T Consensus 472 -~~~~~---------------~~~~~~~~a 485 (485) T protein:vir:24 472 -APKPQ---------------PAIEGGDSA 485 (485) T ss_pred -CCCCc---------------cCCCCCCCC Confidence 00000 000111111 No 169 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=98.69 E-value=1.1e-07 Score=58.79 Aligned_cols=440 Identities=10% Similarity=-0.015 Sum_probs=169.6 Q ss_pred Ccchhhhccc-------cccCCcc----hHHHHHHHHHH------HHHHhhhhccccccccccccccccccccccccccc Q lcl|NC_012530. 1 MGIFDRFRTK-------FYTDDPN----AFFKHIDSKIA------NDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVP 63 (559) Q Consensus 1 ~~~~~~~~~~-------~~~~~~~----~~~~~~~~~~~------~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~ 63 (559) =+|=.||... +..+... +.+..+-.... .+.+..=-.|+++-...+. .... . . T Consensus 15 ~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~-----~~~~---~--~ 84 (511) T protein:vir:96 15 GNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELT-----RRKE---E--Y 84 (511) T ss_pred hhhhhhhhhhhCCcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccC-----cccc---c--c Confidence 2222233211 0001100 11111111000 0111111223322111100 0000 0 0 Q ss_pred cCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCC Q lcl|NC_012530. 64 KPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGV 143 (559) Q Consensus 64 ~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p 143 (559) + +.++ ....+..-+|+..+.-+. |.+..+...+ ....+.+..++..- T Consensus 85 ~--~~~k----------i~~n~~k~Iv~~~~~yl~-----------g~p~~~~~~d--------~~~~~~l~~~~~~n-- 131 (511) T protein:vir:96 85 M--ADNR----------VAHDYASYISDFINGYFL-----------GNPIQYQDDD--------KDVLEAIEAFNDLN-- 131 (511) T ss_pred c--Ccce----------eecchHHHHHHHHhhhhc-----------ccCceeecCc--------hHHHHHHHHHHhhc-- Confidence 0 0011 012333444444443322 1122222111 11223444554331 Q ss_pred CCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCccc-ccccceEEEEEec--Cc---- Q lcl|NC_012530. 144 DYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGH-RRTRGKIYRQYID--NK---- 216 (559) Q Consensus 144 ~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~-~~~~~~~y~~~~~--~~---- 216 (559) .+..+...+..+++++|.+|..+.+|.+|++ .+..++|..+.++.+.... .....++|+.... +. T Consensus 132 -------~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~-~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~ 203 (511) T protein:vir:96 132 -------DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPIDKTDEDE 203 (511) T ss_pred -------ChhHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccce Confidence 2334566778889999999999999988875 4788999999988875432 2223334433211 10 Q ss_pred --eeeeecccceEEEeccc----------------------CCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_012530. 217 --VRGSFTADEMGMFIRNP----------------------RSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGG 272 (559) Q Consensus 217 --~~~~~~~~evi~~~~n~----------------------~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~ 272 (559) ....+.++.+.++.... .-.......|.|-++-+...|+....+..-..+.+...+ T Consensus 204 ~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~ 283 (511) T protein:vir:96 204 VFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLN 283 (511) T ss_pred EEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhh Confidence 11234444444432210 000111224777676666666655544444444444445 Q ss_pred CCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCc--ccccccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHH Q lcl|NC_012530. 273 TTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGI--NGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICAL 349 (559) Q Consensus 273 ~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~--~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~ 349 (559) .|-.+++-... .+.+.........--..... -.........+++++| ++.+ .+..+....+...+.|+.. T Consensus 284 ~~~lv~~G~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--l~~~~~~~~~e~~~~~L~~~I~~~ 356 (511) T protein:vir:96 284 DAMLLIKGNLN-----LDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGY--IYKQYDVQGTEAYKDRLNSDIHMF 356 (511) T ss_pred cchhheecCcc-----CCchhhcccccccceeccccceeccccccCCCCcceeE--EeecCCHHHHHHHHHHHHHHHHHH Confidence 55444432111 12222221111000000000 0000001111223444 3322 2345667778888999999 Q ss_pred hCCCHHHhccccccccccccccchhhhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhcc--ccccCccceeeecchhhh Q lcl|NC_012530. 350 VAMDPAEIGMQNRGGATGNKSNSLNESNN---QNKIDASKSKGLMPLLDMIAKNLTNGII--RQILGDNYMLEFVGGDTR 424 (559) Q Consensus 350 fgVPp~~lg~~~~~~~~~~~~~~~~~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~--~~~~~~~~~~~f~~l~~~ 424 (559) -++|..-.+-...+ .++ .....-+... ....+..+..+|.-.+..|...+...-- .......+.+.|...... T Consensus 357 s~~P~~~~~~~~~n-~Sg-~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~ 434 (511) T protein:vir:96 357 TNTPNMKDDNFSGT-QSG-EAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPK 434 (511) T ss_pred hCCccccccccccc-cHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCc Confidence 99997544322111 000 0000000111 1222334455555555555444432211 112234578889888888 Q ss_pred hHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceeccccccccccccccccccccccc-ccccCCCCCCCCC Q lcl|NC_012530. 425 SQQDKLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQ-LESALQNPSGTPP 503 (559) Q Consensus 425 d~~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 503 (559) +..+.++.+..+. |+|+..-+.++++. ++ |. ...+..+... .+........ ........++.++ T Consensus 435 n~~e~~d~~~kl~-G~iS~et~l~~l~~--v~--d~------~~El~ri~~E----~~~~~~~~~~~~~~~~~~~~~~~~ 499 (511) T protein:vir:96 435 SLIEELKAYIDSG-GKISQTTLMSLFSF--FQ--DP------ELEVKKIEED----EKESIKKAQKGIYKDPRDINDDEQ 499 (511) T ss_pred CHHHHHHHHHHHh-ccCChHHHHHhCCC--CC--CH------HHHHHHHHHH----HHHHHHHHhhccccCCCCCCCCCC Confidence 9888888877663 66787667766543 22 10 0111111111 0000000000 0000000001010 Q ss_pred CCCccccccchhcccccccccccccccccccccccc Q lcl|NC_012530. 504 TLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKD 539 (559) Q Consensus 504 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 539 (559) ++ +.+...++ ++ T Consensus 500 ~~---~~~~~~~e---------------------~~ 511 (511) T protein:vir:96 500 DD---DTKDTVDK---------------------KE 511 (511) T ss_pred CC---CccCcccc---------------------cC Confidence 00 00000000 00 No 170 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=98.69 E-value=1.1e-07 Score=58.79 Aligned_cols=440 Identities=10% Similarity=-0.015 Sum_probs=169.6 Q ss_pred Ccchhhhccc-------cccCCcc----hHHHHHHHHHH------HHHHhhhhccccccccccccccccccccccccccc Q lcl|NC_012530. 1 MGIFDRFRTK-------FYTDDPN----AFFKHIDSKIA------NDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVP 63 (559) Q Consensus 1 ~~~~~~~~~~-------~~~~~~~----~~~~~~~~~~~------~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~ 63 (559) =+|=.||... +..+... +.+..+-.... .+.+..=-.|+++-...+. .... . . T Consensus 15 ~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~-----~~~~---~--~ 84 (511) T protein:vir:78 15 GNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELT-----RRKE---E--Y 84 (511) T ss_pred hhhhhhhhhhhCCcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccC-----cccc---c--c Confidence 2222233211 0001100 11111111000 0111111223322111100 0000 0 0 Q ss_pred cCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCC Q lcl|NC_012530. 64 KPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGV 143 (559) Q Consensus 64 ~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p 143 (559) + +.++ ....+..-+|+..+.-+. |.+..+...+ ....+.+..++..- T Consensus 85 ~--~~~k----------i~~n~~k~Iv~~~~~yl~-----------g~p~~~~~~d--------~~~~~~l~~~~~~n-- 131 (511) T protein:vir:78 85 M--ADNR----------VAHDYASYISDFINGYFL-----------GNPIQYQDDD--------KDVLEAIEAFNDLN-- 131 (511) T ss_pred c--Ccce----------eecchHHHHHHHHhhhhc-----------ccCceeecCc--------hHHHHHHHHHHhhc-- Confidence 0 0011 012333444444443322 1122222111 11223444554331 Q ss_pred CCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCccc-ccccceEEEEEec--Cc---- Q lcl|NC_012530. 144 DYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGH-RRTRGKIYRQYID--NK---- 216 (559) Q Consensus 144 ~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~-~~~~~~~y~~~~~--~~---- 216 (559) .+..+...+..+++++|.+|..+.+|.+|++ .+..++|..+.++.+.... .....++|+.... +. T Consensus 132 -------~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~-~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~ 203 (511) T protein:vir:78 132 -------DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPIDKTDEDE 203 (511) T ss_pred -------ChhHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccce Confidence 2334566778889999999999999988875 4788999999988875432 2223334433211 10 Q ss_pred --eeeeecccceEEEeccc----------------------CCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_012530. 217 --VRGSFTADEMGMFIRNP----------------------RSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGG 272 (559) Q Consensus 217 --~~~~~~~~evi~~~~n~----------------------~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~ 272 (559) ....+.++.+.++.... .-.......|.|-++-+...|+....+..-..+.+...+ T Consensus 204 ~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~ 283 (511) T protein:vir:78 204 VFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLN 283 (511) T ss_pred EEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhh Confidence 11234444444432210 000111224777676666666655544444444444445 Q ss_pred CCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCc--ccccccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHH Q lcl|NC_012530. 273 TTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGI--NGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICAL 349 (559) Q Consensus 273 ~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~--~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~ 349 (559) .|-.+++-... .+.+.........--..... -.........+++++| ++.+ .+..+....+...+.|+.. T Consensus 284 ~~~lv~~G~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--l~~~~~~~~~e~~~~~L~~~I~~~ 356 (511) T protein:vir:78 284 DAMLLIKGNLN-----LDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGY--IYKQYDVQGTEAYKDRLNSDIHMF 356 (511) T ss_pred cchhheecCcc-----CCchhhcccccccceeccccceeccccccCCCCcceeE--EeecCCHHHHHHHHHHHHHHHHHH Confidence 55444432111 12222221111000000000 0000001111223444 3322 2345667778888999999 Q ss_pred hCCCHHHhccccccccccccccchhhhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhcc--ccccCccceeeecchhhh Q lcl|NC_012530. 350 VAMDPAEIGMQNRGGATGNKSNSLNESNN---QNKIDASKSKGLMPLLDMIAKNLTNGII--RQILGDNYMLEFVGGDTR 424 (559) Q Consensus 350 fgVPp~~lg~~~~~~~~~~~~~~~~~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~--~~~~~~~~~~~f~~l~~~ 424 (559) -++|..-.+-...+ .++ .....-+... ....+..+..+|.-.+..|...+...-- .......+.+.|...... T Consensus 357 s~~P~~~~~~~~~n-~Sg-~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~ 434 (511) T protein:vir:78 357 TNTPNMKDDNFSGT-QSG-EAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPK 434 (511) T ss_pred hCCccccccccccc-cHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCc Confidence 99997544322111 000 0000000111 1222334455555555555444432211 112234578889888888 Q ss_pred hHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceeccccccccccccccccccccccc-ccccCCCCCCCCC Q lcl|NC_012530. 425 SQQDKLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQ-LESALQNPSGTPP 503 (559) Q Consensus 425 d~~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 503 (559) +..+.++.+..+. |+|+..-+.++++. ++ |. ...+..+... .+........ ........++.++ T Consensus 435 n~~e~~d~~~kl~-G~iS~et~l~~l~~--v~--d~------~~El~ri~~E----~~~~~~~~~~~~~~~~~~~~~~~~ 499 (511) T protein:vir:78 435 SLIEELKAYIDSG-GKISQTTLMSLFSF--FQ--DP------ELEVKKIEED----EKESIKKAQKGIYKDPRDINDDEQ 499 (511) T ss_pred CHHHHHHHHHHHh-ccCChHHHHHhCCC--CC--CH------HHHHHHHHHH----HHHHHHHHhhccccCCCCCCCCCC Confidence 9888888877663 66787667766543 22 10 0111111111 0000000000 0000000001010 Q ss_pred CCCccccccchhcccccccccccccccccccccccc Q lcl|NC_012530. 504 TLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKD 539 (559) Q Consensus 504 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 539 (559) ++ +.+...++ ++ T Consensus 500 ~~---~~~~~~~e---------------------~~ 511 (511) T protein:vir:78 500 DD---DTKDTVDK---------------------KE 511 (511) T ss_pred CC---CccCcccc---------------------cC Confidence 00 00000000 00 No 171 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=98.68 E-value=1.2e-07 Score=58.55 Aligned_cols=392 Identities=11% Similarity=0.103 Sum_probs=150.7 Q ss_pred cCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHh---------------------------------hhhHhhhhcC- Q lcl|NC_012530. 64 KPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTE---------------------------------YAHRASTDDN- 109 (559) Q Consensus 64 ~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~---------------------------------~~~~~~~~~~- 109 (559) .++ ...+|..+++.+...... +|..++.... T Consensus 1 ~~t---------------~~d~i~~L~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~ 65 (480) T protein:vir:78 1 MTT---------------YHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSD 65 (480) T ss_pred CCC---------------HHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccccchhhhhhhhhcchHHHHHHHHHh Confidence 000 012233333333333332 2322222111 Q ss_pred ---CcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEE------CCCCc Q lcl|NC_012530. 110 ---GMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTY------DSNGR 180 (559) Q Consensus 110 ---g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~r------d~~G~ 180 (559) ..+|.+. + .....+.+.+++.. ..+......+..+.+++|.+|..+.+ |.+|. T Consensus 66 ~l~~~g~~~~--~-------d~~~~~~l~~i~~~---------N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~ 127 (480) T protein:vir:78 66 RLDIEGFRIS--E-------DSEGLEELWNWWQA---------NDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGI 127 (480) T ss_pred hhccCceecC--C-------CchhHHHHHHHHHh---------cCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCe Confidence 1111110 0 00011122222211 12334566788899999999988765 34565 Q ss_pred EEEEEEecCceEEEEecCccc-ccccceEEEEEecCce----eeeecccce----------------------------- Q lcl|NC_012530. 181 LSHTRMVDPTTIYFANDEHGH-RRTRGKIYRQYIDNKV----RGSFTADEM----------------------------- 226 (559) Q Consensus 181 ~~~L~~l~p~~V~~~~~~~g~-~~~~~~~y~~~~~~~~----~~~~~~~ev----------------------------- 226 (559) + .+.+++|..|.++.+.... ......+|+...+... ...+.++.+ T Consensus 128 ~-~i~~~~p~~~~~i~D~~~~~~~~~~i~~~~~~d~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv 206 (480) T protein:vir:78 128 P-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV 206 (480) T ss_pred e-EEEEEcccceEEEEcCCCccceEEEEEEEEeecCCcceEEEEEEeCCeEEEEEecCCCcccccccccccccCCCCcce Confidence 5 4788999999988875422 1111222221111110 112222222 Q ss_pred EEEecccCCCccCCcccccHHH----HHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHH Q lcl|NC_012530. 227 GMFIRNPRSDILSGGYGLSELE----MGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWT 302 (559) Q Consensus 227 i~~~~n~~~~~~~~~~G~Spl~----~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~ 302 (559) +||..++. ..+.+|.|-|+ .+.+++...+.-..-...+|. .|--+|. + ....+...+.. ...|. T Consensus 207 v~f~n~~~---~~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a---~p~~~i~--G-~~~~~~~~~~~---~~~~~ 274 (480) T protein:vir:78 207 VPLTNDPR---LGNRYGRSEISPELRKVTDAASRTLMNLQSASQILG---TPLRVIS--G-VTTDELTNDGE---NTTLD 274 (480) T ss_pred EEeecccc---cCCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhc---chhhhhh--C-CCccccccccc---cchhh Confidence 23322221 23456877554 334444444443333344443 3433332 1 11111111100 11121 Q ss_pred HHhcCcccccccccccCCceeeeeccccchh-HHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHH-- Q lcl|NC_012530. 303 ATSSGINGAYRIPMITAEDAKFVSMTQAEDM-QFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQ-- 379 (559) Q Consensus 303 ~~~~G~~nag~~~vl~~g~~~~~~ls~~~D~-qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~-- 379 (559) .. .+++..+.+++.++..+.. .++ .|++..+..+..|+..=++|++.+|....+.+++ .....-+...+ T Consensus 275 ~~------~~~~~~~~~~~~~~~~~~~-~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg-~Al~~~~~~l~~k 346 (480) T protein:vir:78 275 IY------YGRILTLASEAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASA-EAIIATDSRIVKM 346 (480) T ss_pred hh------hhhhccCCCCCceEEecCc-cCHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHH-HHHHHHHHHHHHH Confidence 11 1334455555677766543 234 3788889999999999999999998432111100 00000011100 Q ss_pred -HHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHc-C--CCCHHHHHHHhCCCCC Q lcl|NC_012530. 380 -NKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-T--ATTVNDYREKQGLPKI 455 (559) Q Consensus 380 -~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~--~~T~NE~R~~~gl~pi 455 (559) +..+..+...|.-.+..+....... . ......+.+.|......+..+.++...+++. + .++..-+++++|+.+- T Consensus 347 ~~~~~~~f~~~l~~~~rl~~~~~~~~-~-~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d 424 (480) T protein:vir:78 347 AERKGRIFGGAWERAMRIAMQIMGRE-V-TEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTAT 424 (480) T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCC-c-cccceeeeEEecCCCCCCHHHHHHHHHHHHHhcccCCCHHHHHhcCCCCHh Confidence 0111111222222222222111100 0 1122356777877677777777776665553 3 2466566888887542 Q ss_pred CCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhcccccccccccccccccccc Q lcl|NC_012530. 456 AGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQG 535 (559) Q Consensus 456 ~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 535 (559) + ...+.... ..+.+........+ ..+.++. ...+...+.. .+..+ ..++ T Consensus 425 ~----------~~e~~~~~---~~~~~~~~~~~~~~--~~~~~~~---~~~~~~~~~~---------~~~~~----~~~~ 473 (480) T protein:vir:78 425 Q----------REQMRDWD---KQETEDMIDTLYST--TKAQADA---TPKPTVTETK---------TETQT----SPSG 473 (480) T ss_pred H----------HHHHHHHH---HHHHHHHHHHhhcc--ccCCCcc---ccCCCCCCCC---------CccCC----Cccc Confidence 1 01111000 00000000000000 0000000 0000000000 00000 0111 Q ss_pred ccccccc Q lcl|NC_012530. 536 VGKDGQL 542 (559) Q Consensus 536 ~~~~~~~ 542 (559) -+.++.+ T Consensus 474 ~~~~~~~ 480 (480) T protein:vir:78 474 FNRTKTR 480 (480) T ss_pred CCCcCCC Confidence 1111111 No 172 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=98.67 E-value=1.3e-07 Score=58.37 Aligned_cols=424 Identities=8% Similarity=0.043 Sum_probs=176.4 Q ss_pred Ccchhhhcccc--------------ccCC--cc---hHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccc Q lcl|NC_012530. 1 MGIFDRFRTKF--------------YTDD--PN---AFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSI 61 (559) Q Consensus 1 ~~~~~~~~~~~--------------~~~~--~~---~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~ 61 (559) |++++|.+.-| |++. |. +++.++++-...-. |+..- ..+. T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~------g~~~~------------~~~~--- 59 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQ------SKWDD------------VQYK--- 59 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhc------CCccc------------cccc--- Confidence 99998874111 1111 10 11111111100000 00000 0000 Q ss_pred cccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhc Q lcl|NC_012530. 62 VPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERM 141 (559) Q Consensus 62 ~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~ 141 (559) .. ..+. ..+..........+++..|+-|..=+ -.|... ..+..+.+..+|.. T Consensus 60 -~~-----~~~~--~~~~~~slnl~~~i~~~~A~lv~~e~-----------~~i~v~--------d~~~~~~l~~~l~~- 111 (522) T protein:vir:47 60 -NT-----DGDI--KSRPMNHLPIARTASKKIASLVYNEQ-----------ATITTK--------NEILQKFLDDMLTN- 111 (522) T ss_pred -cc-----Ccch--hcccceecchHHHHHHHHhhhhcCCc-----------ceeecC--------ChHHHHHHHHHHhh- Confidence 00 0000 00111122344455555555543211 011111 11223344444432 Q ss_pred CCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCccccc-------------ccceE Q lcl|NC_012530. 142 GVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRR-------------TRGKI 208 (559) Q Consensus 142 ~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~-------------~~~~~ 208 (559) ..|...+...+...+..|.+++-+.+|. |. +.+-.+++..+.|+....+.+. ....+ T Consensus 112 --------n~f~~~~~~~~e~a~a~G~~a~k~~~d~-~~-~~i~~v~ad~~~P~~~~~~~~~e~a~~~~~~~~~~~~~~~ 181 (522) T protein:vir:47 112 --------DRFNKNFERYLESCLALGGLAMRPYIDG-DK-VRVAFIQAPVFFPLESNTQDVSSAAILTKTIKSEGRKNVY 181 (522) T ss_pred --------cchHHHHHHHHHHhhccCCEEEEEEEcC-Cc-eEEEEEcCCceEEEEEcCCceEEEEEEEEEEeecccceeE Confidence 2355667777888888999888888874 43 4566777777776432211110 01111 Q ss_pred EEE-----------------------------Eec------Ccee--eee-----cccc----------eEEEecc-cCC Q lcl|NC_012530. 209 YRQ-----------------------------YID------NKVR--GSF-----TADE----------MGMFIRN-PRS 235 (559) Q Consensus 209 y~~-----------------------------~~~------~~~~--~~~-----~~~e----------vi~~~~n-~~~ 235 (559) |.. +.. |..+ ..+ .+.+ .+||+.+ +.. T Consensus 182 yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~ 261 (522) T protein:vir:47 182 YTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRVNLSELDKYKNLEPVTVFENLSRPLFTYLKTPGMNN 261 (522) T ss_pred EEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCccccccccccccCCCCceEeCCCCcceEEEecCCcccc Confidence 110 000 0000 000 0111 1133211 111 Q ss_pred CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCC----CceEEEecCccCCccCC-HHHHHHHHHHHHHHhcCccc Q lcl|NC_012530. 236 DILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGT----TKGILLVKPSPSVTNTS-MRALEDFKRHWTATSSGING 310 (559) Q Consensus 236 ~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~----p~gil~~~~~~~~~~~~-~e~~~~l~~~~~~~~~G~~n 310 (559) -..+.++|+|.+.-+...|......-.-..+-|+-|-. |..+|........+... ....+.- +..|.+.+ T Consensus 262 ~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~~~~~fd~~----~~~f~~~~- 336 (522) T protein:vir:47 262 KDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQRRVIVPEHLTQRQYQRPDGTIDFRPRFDVE----QNVYMQIG- 336 (522) T ss_pred cccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccceeecchHHhccCCCCCCcccccccccCcc----cceEeecC- Confidence 12245789999999998887776665555566676643 22222221111111000 0000000 01122111 Q ss_pred ccccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhh--HHHHHHHHHH Q lcl|NC_012530. 311 AYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESN--NQNKIDASKS 387 (559) Q Consensus 311 ag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an--~~~~~~~~~~ 387 (559) .-..++-+++.++.. .+-++....+...+.|+...|++|..+|+...+..+..+..+..... .....+..++ T Consensus 337 -----~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~~kTAtEi~s~~~~~~~t~~~~~~~~~ 411 (522) T protein:vir:47 337 -----GSSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQGMKTATEIVSENSDTYQMRSSIVALVE 411 (522) T ss_pred -----CCCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 000112234444422 56788999999999999999999999988665443322211111000 1122445666 Q ss_pred HHhhHHHHHHHHHHHh-hcccc--ccCccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHh-CCCCCCCCCEee Q lcl|NC_012530. 388 KGLMPLLDMIAKNLTN-GIIRQ--ILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQ-GLPKIAGGDIIL 462 (559) Q Consensus 388 ~~l~P~~~~ie~~ln~-~L~~~--~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~-gl~pi~gGD~~~ 462 (559) .+|.-++..|....+. .++.. .....+.|.|+.....|..+.++.....+. |.|++-+++.++ |+.. + +. T Consensus 412 ~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~~~~~~~~~~v~aG~~s~e~~i~~~~g~~e-e--ea-- 486 (522) T protein:vir:47 412 QSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAELDYWAKMVAAGFSTKKRAIGKTLNISG-V--EA-- 486 (522) T ss_pred HHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCh-H--HH-- Confidence 6777666666544432 12221 123457778888788888887777766664 568999987653 4321 0 00 Q ss_pred ccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhcccccccccccccc Q lcl|NC_012530. 463 SAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSG 529 (559) Q Consensus 463 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 529 (559) ...+. ... ++... ..+. + .+--+ .+ .+....++ +.| T Consensus 487 -------~~el~---ri~--~E~~~-~~~~-------~--~~~~~--~~----~~~~~~~d---~~~ 522 (522) T protein:vir:47 487 -------EKELN---AIN--SELLP-MNDA-------E--LAIYG--MH----DQNEEKAD---DKG 522 (522) T ss_pred -------HHHHH---HHH--Hhhcc-CCCC-------C--CCCCC--CC----CcccccCC---CCC Confidence 00000 000 00000 0000 0 00000 00 00000000 111 No 173 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=98.65 E-value=1.4e-07 Score=58.15 Aligned_cols=432 Identities=9% Similarity=0.017 Sum_probs=168.3 Q ss_pred ccccccCCcc-hHHHH-HHHHHHHHHH---------hhhhccccccccccccccccccccccccccccCCCCCcccHHHH Q lcl|NC_012530. 8 RTKFYTDDPN-AFFKH-IDSKIANDTA---------SKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDV 76 (559) Q Consensus 8 ~~~~~~~~~~-~~~~~-~~~~~~~~~~---------~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~ 76 (559) ...|=++++. +.++. |....+.... ..=-.|++. +...+... ....... T Consensus 1 ~~~~p~~~l~~~~~~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~-------------------i~~~~~~~-~~~~~~~ 60 (479) T protein:vir:99 1 MIDLPDEDLSSEGLAKYLETKVFPKMNTECERLDDFEAWTKNGQE-------------------VPDLATRH-KNKEREV 60 (479) T ss_pred CccCCcccCChhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcCCc-------------------cccccccc-CChhHHH Confidence 3334444444 22222 1111111111 111112210 00011101 1111112 Q ss_pred HHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHH Q lcl|NC_012530. 77 LRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFL 156 (559) Q Consensus 77 ~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~ 156 (559) ++....+.+...||+..++.+- ..+|.+. +.. . ...+..++.. | .+.... T Consensus 61 ~~~~~~~n~~~~iVd~~~~~l~-----------~~gf~~~--d~~-----~---~~~~~~i~~~---N------~~d~~~ 110 (479) T protein:vir:99 61 LQQLSRKPWMGLMVNSFAQQLI-----------VDGYRKT--GTN-----E---NAKGWDTWRL---N------QMDKQQ 110 (479) T ss_pred HHHHhhcCcHHHHHHHHHhhcc-----------cccccCC--Cch-----h---hHHHHHHHHh---c------ChhHHH Confidence 2222334566667776655331 1223221 111 1 1123333332 1 223455 Q ss_pred HHHHHHHHHcCCcceEEEE-----CCCCcEEEEEEecCceEEEEecCcccccccceEEEEE-----------------ec Q lcl|NC_012530. 157 RKLVRDTYTYDQVNYENTY-----DSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQY-----------------ID 214 (559) Q Consensus 157 ~~~v~d~ll~Gna~~~i~r-----d~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~-----------------~~ 214 (559) ..+..+++++|.+|.++.+ |..|.+ .+..++|..+.++.++...... ..+++.. .. T Consensus 111 ~~~~~~a~~~G~af~~v~~~~~~~d~~g~~-~i~~~~p~~~~~iydd~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 188 (479) T protein:vir:99 111 FWLNRAVLTFGYAFIKVTSGISPLDGTTVA-RIKCIDPRDAFAIWEDPYWDEW-PKYLLERQPNGQYWWWTEEDYSIFEF 188 (479) T ss_pred HHHHHHHhhcCceEEEEecCCCCcCCCCce-EEEEechhheEEEecCCcccce-eeEEEeecCceeEEEEecceEEEEEe Confidence 6778889999999988764 344544 4777899988876543322110 0001000 00 Q ss_pred Cceeeee---c---ccc--eEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCC Q lcl|NC_012530. 215 NKVRGSF---T---ADE--MGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSV 286 (559) Q Consensus 215 ~~~~~~~---~---~~e--vi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~ 286 (559) +.....+ . ... |++|+.++.. ..+|.|.++.....++....+..-....+.-.+.|--+|.-- .. T Consensus 189 ~~~~~~~~~~~~h~~g~vPvv~f~n~~~~----~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~---~~ 261 (479) T protein:vir:99 189 KQGKFIYRETVSHDYGHIPFVRYVNVMDL----RGVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWATGL---ML 261 (479) T ss_pred cCCceeeccccccCCCCcceEEeecCCCc----CcCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhcCC---Cc Confidence 0000000 0 011 3455544432 236888777766666665555444444444445565444311 01 Q ss_pred ccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccccchh-HHHHHHHHHHHHHHHHhCCCHHHhcccccccc Q lcl|NC_012530. 287 TNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQAEDM-QFQSWLNYLINIICALVAMDPAEIGMQNRGGA 365 (559) Q Consensus 287 ~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~~D~-qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~ 365 (559) .+..... ...|.. ..+++..+.+++.++..+.. .++ .+++..+..+..|+.+=++|++.+|.....+. T Consensus 262 ~~~~~~~----~~~~~~------~~~~i~~~~~~~~~~~q~~~-~~~~~~~~~l~~~i~~i~~~t~~p~~~~g~~~n~Sg 330 (479) T protein:vir:99 262 PEGANAD----QEKMRF------AQESMLISQNEKASFGAIPA-APLDGLLNAYKESLLEFLALAQLPPHIAGQIVNVAA 330 (479) T ss_pred ccccccc----hhcccc------ccccceeecCCCceEEEecc-cchHHHHHHHHHHHHHHhccCCCCHHHcccccchHH Confidence 1100000 011111 12344455566677765542 333 37788888888999989999999986432110 Q ss_pred ccccccchhhhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHc-CCC Q lcl|NC_012530. 366 TGNKSNSLNESNN---QNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TAT 441 (559) Q Consensus 366 ~~~~~~~~~~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~ 441 (559) ......+... .+..+..+..+|.-++..+-...+.. .......+.+.|......+..+.++...+++. |++ T Consensus 331 ---~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~~~~~--~~~~~~~i~~~w~~~~~~s~~~~ad~~~kl~~ag~i 405 (479) T protein:vir:99 331 ---DALAAGTRQTMQKLFEKQATWKASHNQTMRLVNKIEGRT--EEATDLDFTITWQDVTIQSLAQFADAWAKMVESLKI 405 (479) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCC--ccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCC Confidence 0000111111 11111222223333333322111100 01112245666766666777888877776654 567 Q ss_pred CHHHHHHHh-CCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhccccc Q lcl|NC_012530. 442 TVNDYREKQ-GLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGY 520 (559) Q Consensus 442 T~NE~R~~~-gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (559) +...+.+++ |+.+-+ ...+... .... ...............+.+.....++...+..+ T Consensus 406 s~et~l~~l~gv~~~~----------~e~~~~~---~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------ 464 (479) T protein:vir:99 406 PAEGVWDMIPNLDQST----------VNGWKEI---YDRE--GDFGKYMRKLQNGPDPAEQRGGPNGATNMQQA------ 464 (479) T ss_pred CHHHHHHhcCCCCHHH----------HHHHHHH---HHHH--HHHHHHHHHHhcccCcccccCCCCCCCCCCCC------ Confidence 777676665 654310 0001000 0000 00000000000000000000000000000000 Q ss_pred ccccccccccccccccccccc Q lcl|NC_012530. 521 TGKDAKPSGKDNQQGVGKDGQ 541 (559) Q Consensus 521 ~~~~~~~~g~~~~~~~~~~~~ 541 (559) .+..+.....++.|. T Consensus 465 ------~~~~~~~~~~~~~~~ 479 (479) T protein:vir:99 465 ------NNKTGEPASLNKSGA 479 (479) T ss_pred ------CCCCcchhccCCCCC Confidence 000111112222222 No 174 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=98.64 E-value=1.5e-07 Score=57.97 Aligned_cols=421 Identities=8% Similarity=0.060 Sum_probs=171.0 Q ss_pred Ccchhhhcccc----ccCCcchHHHHHH----HHHHHHHH--hhhhccccccccccccccccccccccccccccCCCCCc Q lcl|NC_012530. 1 MGIFDRFRTKF----YTDDPNAFFKHID----SKIANDTA--SKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAF 70 (559) Q Consensus 1 ~~~~~~~~~~~----~~~~~~~~~~~~~----~~~~~~~~--~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~ 70 (559) =+|..+||.-| ....+.+.+.+.. .+...+.. ..=-.|+......+ .. .....+....+ T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~-----~~------~~~~~~~~~~~ 71 (496) T protein:vir:38 3 NQIIAGVKGVMRRMGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNL-----NY------EHNGNPVNRRQ 71 (496) T ss_pred hHHHHHHHHHHHHhccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcc-----hh------ccCCCccccce Confidence 13333333111 1111111111100 00000000 00011111110000 00 00000000000 Q ss_pred ccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChh Q lcl|NC_012530. 71 GRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRD 150 (559) Q Consensus 71 ~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~ 150 (559) ........+++..|+-+..=| ..|... ..+..+.+.+++.. . T Consensus 72 ----------~~~n~~k~i~~~~a~~l~~~p-----------~~i~~~--------d~~~~e~l~~~~~~---------n 113 (496) T protein:vir:38 72 ----------LSMNLPKVTAKYMSKLLFNEK-----------VKINID--------DKAAEEFVLNVLKT---------N 113 (496) T ss_pred ----------eecchHHHHHHHHhhhhhCCc-----------ceEeeC--------ChHHHHHHHHHHhc---------c Confidence 112333445555554433211 112111 12233344455432 2 Q ss_pred hHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccc---------cceEEEEE-----ecCc Q lcl|NC_012530. 151 DFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRT---------RGKIYRQY-----IDNK 216 (559) Q Consensus 151 ~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~---------~~~~y~~~-----~~~~ 216 (559) .|...+..++.+.+.+|.+|+.+..|.+|.+. +..++|..+.++....+.... .+..|+.. .++. T Consensus 114 ~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~-i~~v~~~~~~P~~~~~~~~~~~~f~~~~~~~~~~y~~le~h~~~~~~ 192 (496) T protein:vir:38 114 GFTKNMERYIEYGEAMGGFVIKVYHDGNKNVK-VSFATADCMYPLSNDSENVDECVIANSFHKNNKYYTLLEWNEWQGDV 192 (496) T ss_pred CHHHHHHHHHHHHhhhCcEEEEEEEcCCCcEE-EEEEcccceEEEEecCCcEEEEEEEEEEEeCCeEEEEEEEEEEeCce Confidence 35666777888999999999999999888754 778889988876555443210 11111110 0000 Q ss_pred e------------------ee------------eecccceEEEe--cccC--CCccCCcccccHHHHHHHHHHHHHHHHH Q lcl|NC_012530. 217 V------------------RG------------SFTADEMGMFI--RNPR--SDILSGGYGLSELEMGLREFISHENTEL 262 (559) Q Consensus 217 ~------------------~~------------~~~~~evi~~~--~n~~--~~~~~~~~G~Spl~~~~~~i~~~~~~~~ 262 (559) . +. .+..-+...|. .+|. ......++|+|.++-+...++....+.. T Consensus 193 ~~I~~~~y~~~~~~~~g~~v~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s 272 (496) T protein:vir:38 193 YTVTTELYQSDDPNELGTKVSLTLLFDDIEPVVPLPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFD 272 (496) T ss_pred EEEEEEEEecCCccccCccccccccccccccceeecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHH Confidence 0 00 00000111111 1221 1123456899999988888877766655 Q ss_pred HHHHHHHhcCCCceEE-----EecCccCCccCCHHHHHHHHHHHHHHhcCcccccccc-ccc-CCceeeeecccc-chhH Q lcl|NC_012530. 263 FNDRFFTHGGTTKGIL-----LVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIP-MIT-AEDAKFVSMTQA-EDMQ 334 (559) Q Consensus 263 ~~~~~f~ng~~p~gil-----~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~-vl~-~g~~~~~~ls~~-~D~q 334 (559) -..+-|..| .+..++ ....... +. .. . . +.......+.. ... +++..++.++.. ..-+ T Consensus 273 ~~~~~~~~~-~~~i~v~~~~l~~~~~~~-g~-~~---~----~----~~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~e~ 338 (496) T protein:vir:38 273 SYYQEFKLG-KKKVLVPSSFVKTAVNLD-GS-TT---Q----Y----FDSTDEAFFLYQGDQDDNGKAIKDISVEIRSTE 338 (496) T ss_pred HHHHHHhhc-ccceecchHHhhccCCCC-Cc-cc---c----C----CCCccceEEEeecCCCcccccceeeccccCHHH Confidence 555556654 333222 1000000 00 00 0 0 00000000000 011 111223333211 2346 Q ss_pred HHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhH--HHHHHHHHHHHhhHHHHHHHHHHHhhcc-c--cc Q lcl|NC_012530. 335 FQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNN--QNKIDASKSKGLMPLLDMIAKNLTNGII-R--QI 409 (559) Q Consensus 335 f~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~--~~~~~~~~~~~l~P~~~~ie~~ln~~L~-~--~~ 409 (559) +.+..+...+.|+..-|+||..+|+...+..++.+..+...... .......+..+|..++..+-+..+.... . .. T Consensus 339 ~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~g~~~ 418 (496) T protein:vir:38 339 FIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKFIEAYSGEVV 418 (496) T ss_pred HHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCC Confidence 78888888899999999999999986544433222111110000 1123344555666666655443332111 1 12 Q ss_pred cCccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHh-CCCCCCCCCEeeccceecccccccccccccccccccc Q lcl|NC_012530. 410 LGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQ-GLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTR 487 (559) Q Consensus 410 ~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~-gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 487 (559) ....+.|.|+.....|..+.++....++. |.|+.-.++..+ |....+ .....+..+ .... T Consensus 419 ~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~GiiS~et~l~~~~~~~d~e-------------a~~el~ri~-----~E~~ 480 (496) T protein:vir:38 419 ELDTITVDFDDSIAQDEDTTINRYTNAKNQGMIPLKIALQRAWNITEAE-------------ADEWAEMLA-----KEKQ 480 (496) T ss_pred CccceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCChHH-------------HHHHHHHHH-----Hhhh Confidence 23457888888888888888877777664 567877776543 332100 000000000 0000 Q ss_pred cccccccCCCCCCCCCC Q lcl|NC_012530. 488 LTQLESALQNPSGTPPT 504 (559) Q Consensus 488 ~~~~~~~~~~~~~~~~~ 504 (559) ...+....++... +++ T Consensus 481 ~~~~~~d~~~~~~-~~e 496 (496) T protein:vir:38 481 AEMPNNDMNGIFG-EEE 496 (496) T ss_pred ccCccccccCCCC-CCC Confidence 0001111110000 000 No 175 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=98.64 E-value=1.6e-07 Score=57.87 Aligned_cols=351 Identities=11% Similarity=0.058 Sum_probs=156.2 Q ss_pred ccccccccccccccccccccccCCCCCcccHHHHHHHHhh--ChH------HHHHHHHHHHHHHhhhhHhhhhcCC---- Q lcl|NC_012530. 43 YTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSM--NVV------LNAIINTRANQVTEYAHRASTDDNG---- 110 (559) Q Consensus 43 ~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~--~~~------v~acv~~ia~~ia~~~~~~~~~~~g---- 110 (559) .+...++..- -......+-...+..|+. .++ +..-..-....+..+|..++.+... T Consensus 1 ~~~~~i~~L~------------~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~ 68 (409) T protein:vir:94 1 MTEKGIGYLR------------FKLSVHKRRAEMRYDQYAMKYVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVF 68 (409) T ss_pred CCHHHHHHHH------------HHHHHHhHHHHHHHHHhcccCchhhcChhhhHHHHHHHhhhcchhHHHHHHhHhhccc Confidence 1111111100 000011111122223332 222 1111222223444555555544322 Q ss_pred cceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCc Q lcl|NC_012530. 111 MGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPT 190 (559) Q Consensus 111 ~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~ 190 (559) .||.. .+. .+..+... ..+......+..+.+++|.+|+.+..+.+|+| .+.+++|. T Consensus 69 ~Gf~~--------~d~------~l~~i~~~---------N~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~-~i~~~sp~ 124 (409) T protein:vir:94 69 REFEN--------DDF------TVNEIFEE---------NNPDIFFDSAVLSSLIASCSFTYISKGENDAV-RLQVIEAV 124 (409) T ss_pred CcccC--------Cch------HHHHHHHh---------cChhHHHHHHHHHHHHhcceeEEEecCCCCce-EEEEeccc Confidence 12210 000 12222221 12334556778899999999999999989986 57889999 Q ss_pred eEEEEecCcccccccceEEEEEecCc-e--eeeecccc----------------------eEEEecccCCCccCCccccc Q lcl|NC_012530. 191 TIYFANDEHGHRRTRGKIYRQYIDNK-V--RGSFTADE----------------------MGMFIRNPRSDILSGGYGLS 245 (559) Q Consensus 191 ~V~~~~~~~g~~~~~~~~y~~~~~~~-~--~~~~~~~e----------------------vi~~~~n~~~~~~~~~~G~S 245 (559) .+.++.|...........+..-.... . ...+.+++ +++|..++. ..+.+|.| T Consensus 125 ~~~~i~D~~~~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~g~vPvV~f~n~~~---~~~~~G~s 201 (409) T protein:vir:94 125 NATGIIDPITGLLTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGHPLLVPIIHRPD---AVRPFGRS 201 (409) T ss_pred eEEEEEecCCCceeeeEEEEEecCCCceEEEEEEecCcEEEEEecCceeEeeeCCCCCcceEEeccccc---cccccCcc Confidence 98887776433222111111100000 0 01122222 233333322 23567877 Q ss_pred H----HHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEE-ecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc-- Q lcl|NC_012530. 246 E----LEMGLREFISHENTELFNDRFFTHGGTTKGILL-VKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT-- 318 (559) Q Consensus 246 p----l~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~-~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~-- 318 (559) . +..+.+++...+.-......||.+ |.-++. ++. + .+..+.++.... ++..++ T Consensus 202 ~I~e~v~~l~da~~r~~~~~~~~~e~~a~---pqr~i~G~d~-----d--~~~~~~~~~~~~----------~i~~~~~d 261 (409) T protein:vir:94 202 RITRSGMYWQSNAKRTLERADVTAEFYSF---PQKYVTGLSD-----D--AEPMETWKATVS----------SMLQFTKD 261 (409) T ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHhcC---hhheeEecCC-----C--CcccchhhhhHH----------HhhcCCCC Confidence 5 445555555555555555566554 433332 211 1 111222322221 222221 Q ss_pred --CCceeeeeccccchhH-HHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhh----HHHHHHHHHHHHhh Q lcl|NC_012530. 319 --AEDAKFVSMTQAEDMQ-FQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESN----NQNKIDASKSKGLM 391 (559) Q Consensus 319 --~g~~~~~~ls~~~D~q-f~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an----~~~~~~~~~~~~l~ 391 (559) +.+.++..+.. .+++ |++..+..+..+|..=++|++.+|....+.+++. .....+.. ++.. +......++ T Consensus 262 ~dg~~~~v~q~~~-~~l~~~~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~-Al~a~~~~L~~~a~~k-~~~fg~~~~ 338 (409) T protein:vir:94 262 EDGDKPTLGQFTQ-PSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVE-AIKASHENLRLAGRKA-QRSLGAGLL 338 (409) T ss_pred CCCCCceEEecCC-CChhHHHHHHHHHHHHHhhhcCCCHHHhccccCchhHHH-HHHHHHHHHHHHHHHH-HHHHHHHHH Confidence 12345554432 3454 8999999999999999999999996543211110 00000000 0011 111112222 Q ss_pred HHHHHHHHHHHhhc-cccccCccceeeecchh---hhhHHHHHHHHHHHHcCC--C-CHHHHHHHhCCCCCC Q lcl|NC_012530. 392 PLLDMIAKNLTNGI-IRQILGDNYMLEFVGGD---TRSQQDKLKSVQLELQTA--T-TVNDYREKQGLPKIA 456 (559) Q Consensus 392 P~~~~ie~~ln~~L-~~~~~~~~~~~~f~~l~---~~d~~~~~~~~~~~~~~~--~-T~NE~R~~~gl~pi~ 456 (559) -++..+- ++.... -.+.+....++.|.... ..+..+.++.+.+++..+ + .-+-+++++|+..-+ T Consensus 339 ~~~rla~-~i~~~~~~~~~~~~~~~v~W~p~~~~~~~~~a~~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~~d 409 (409) T protein:vir:94 339 NVAYLAA-CLRDDAPYLREQFRKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIEGGE 409 (409) T ss_pred HHHHHHH-HHhCCCCccccccccceEEeccCCCcchHHHHHHHHHHHHHHHhcccccchhHHHHHcCCCCCC Confidence 2222111 111000 00112235677777443 334455666676666543 3 457889999997644 No 176 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=98.63 E-value=1.6e-07 Score=57.78 Aligned_cols=440 Identities=11% Similarity=0.061 Sum_probs=182.0 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhh----------hhccccccccccccccccccccccccccccCCCCCc Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASK----------ALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAF 70 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~ 70 (559) +.-...|.....++.....+..| ++....+... =-.|++.....+ ........ T Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~l-~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~~----------------~~~~~~~~ 84 (501) T protein:vir:27 22 RESRIRYRADNLEELMVNNWELL-KNFINHHKLRQAPRIQELLDYARGENHDVLQF----------------GRRKDREM 84 (501) T ss_pred hhHHHhhccccccccccccHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCcccccc----------------CccCcccc Confidence 56666666666555443332222 2222222110 011211000000 00000000 Q ss_pred ccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChh Q lcl|NC_012530. 71 GRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRD 150 (559) Q Consensus 71 ~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~ 150 (559) .+. + ...++...+|+..+.-+. |.+..+...+.. . .......+.+++.. . T Consensus 85 ~~~----k--i~~n~~k~Ivd~~~~yl~-----------g~p~~~~~~d~~--~--~~~~~~~l~~~~~~---------n 134 (501) T protein:vir:27 85 ADK----R--AVHNYGRMISKFKTGYLA-----------GNPIRVEYDDND--N--NSQNDDTIKRIGRI---------N 134 (501) T ss_pred ccc----e--eccchHHHHHHHHhhhhc-----------ccCeeEecCCcc--c--hHHHHHHHHHHHHh---------c Confidence 000 0 123455556655554432 222222222211 1 11112223333332 1 Q ss_pred hHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcc-cccccceEEEEEecCc----eeeeecccc Q lcl|NC_012530. 151 DFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHG-HRRTRGKIYRQYIDNK----VRGSFTADE 225 (559) Q Consensus 151 ~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g-~~~~~~~~y~~~~~~~----~~~~~~~~e 225 (559) .|..+...+..+++++|.+|..+.++.+|+|. +..++|..+.++.+... .....+++|+...... ....+..+. T Consensus 135 ~~~~~~~~~~~~~~~~G~a~~~vy~ded~~~~-i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~~~~vyt~~~ 213 (501) T protein:vir:27 135 DIDSHNRTLIRDLSQTGRAYEVIYRNEYDETR-IKRLNPLETFVIYDNSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNEH 213 (501) T ss_pred ChhHHHHHHHHHHhhCCeEEEEEEeCCCCceE-EEEEccceeEEEecCCCCCceEEEEEEEEeeecCCcEEEEEEEeCCe Confidence 34457778888999999999999999888764 67889999988876542 2222233333321111 111233333 Q ss_pred eEEEec-----------c-----cCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccC Q lcl|NC_012530. 226 MGMFIR-----------N-----PRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNT 289 (559) Q Consensus 226 vi~~~~-----------n-----~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~ 289 (559) +.++.. | |.-.......|.|.++.+...++....+..-..+.+...+.|-.++.-... .. T Consensus 214 v~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~----~~ 289 (501) T protein:vir:27 214 IYTLDASDDFNEISVTTHAFGTVPITEFLNNVDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLA----LP 289 (501) T ss_pred EEEEEeCCceeeccccccCCCcccEEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcc----CC Confidence 322211 1 111111223577777777766666665555555555555555555432111 11 Q ss_pred CHHHHHHHHHHHHHHhcCcccccccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccc Q lcl|NC_012530. 290 SMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGN 368 (559) Q Consensus 290 ~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~ 368 (559) ..+....++... .-.....+ .+....++.++..++.. .+..+....+...+.|+.+-++|..-.+-.. ++ T Consensus 290 ~~~~~~~~~~~~---~~~~~~~~-~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-----~n 360 (501) T protein:vir:27 290 KGMQASDMKRTR---LMQLKPPK-SADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSDTNFS-----GN 360 (501) T ss_pred cccchhhhhhcC---ceeecccc-cccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccCccccc-----cC Confidence 122222222110 00000000 01112233444444432 2334566678888899999999864433211 11 Q ss_pred cccchhh---h---hHHHHHHHHHHHHhhHHHHHHHHHHHhhcc-ccccCccceeeecchhhhhHHHHHHHHHHHHcCCC Q lcl|NC_012530. 369 KSNSLNE---S---NNQNKIDASKSKGLMPLLDMIAKNLTNGII-RQILGDNYMLEFVGGDTRSQQDKLKSVQLELQTAT 441 (559) Q Consensus 369 ~~~~~~~---a---n~~~~~~~~~~~~l~P~~~~ie~~ln~~L~-~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~~~ 441 (559) .++.... . +-.......+...|+-++..+...++..-- .......+.+.|......+..+.++.+..+ .|.+ T Consensus 361 ~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~v~f~~~~p~n~~e~ad~~~kl-~g~i 439 (501) T protein:vir:27 361 TSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGL-GGQV 439 (501) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHH-hccC Confidence 1111010 0 011122244455555555555444432211 112234578889888888988888877665 4667 Q ss_pred CHHHHHHHhCCCCCCCCCEeeccceecccccccccccc-cccccccccccccccCCCCCCCCCCCCccccccchhccccc Q lcl|NC_012530. 442 TVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQN-EFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGY 520 (559) Q Consensus 442 T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (559) +..-+.++++. ++ |. ...+..+...... +.+......+. ..+...+..++..++ ++++.. T Consensus 440 S~et~l~~l~~--v~--D~------~~E~eri~~E~~e~~~~~~~~~~~~---~~~~~~d~~~~~~~d------~~e~~~ 500 (501) T protein:vir:27 440 SQETALSLSGL--VE--SP------NEELDKINKEVSEIDFKGYSNDFNE---HVGKYTDEVKETHTD------DFERAY 500 (501) T ss_pred cHHHHHHhCCC--CC--CH------HHHHHHHHHHHHhhhHhhhcCcccc---ccccccCCCCCCccc------cccccC Confidence 87767776543 21 10 0111111110000 00000000000 000000000000001 111111 Q ss_pred c Q lcl|NC_012530. 521 T 521 (559) Q Consensus 521 ~ 521 (559) + T Consensus 501 ~ 501 (501) T protein:vir:27 501 E 501 (501) T ss_pred C Confidence 1 No 177 >protein:vir:8654 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817773;genbank:gi:29566205;genbank:GeneID:1259465 Probab=98.61 E-value=1.8e-08 Score=63.10 Aligned_cols=482 Identities=12% Similarity=0.078 Sum_probs=209.7 Q ss_pred HHHHHHHHHHHhhhhcccccc---------cc---ccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHH Q lcl|NC_012530. 22 HIDSKIANDTASKALNGVDRA---------YT---EPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAI 89 (559) Q Consensus 22 ~~~~~~~~~~~~~~~~gr~~a---------~~---~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~ac 89 (559) |...+ -.+.-|.|. .. .++ ..+...+--++++..+ ..|. .+-...+.-.+.++-. T Consensus 1 ma~~~-------lr~~rrpk~~p~~~r~~al~aas~~i-~~p~~~~~ks~~~~~~--~~WQ---~eAW~~~d~v~Elry~ 67 (629) T protein:vir:86 1 MAPTS-------LRIVRRPKSEPVSTRQRALVAASQPV-ENPGKAFRKAMGSSTR--TDWQ---EDAWKAYDAVGELRYY 67 (629) T ss_pred CCccc-------eeeeecCCCCChhhhhhhhhhhhhcc-ccccchhhhhcCCCch--hhhh---HHHHHHHHhhhhHHHH Confidence 11111 111112221 10 000 0000000000111111 0111 1112222336777778 Q ss_pred HHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCc Q lcl|NC_012530. 90 INTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQV 169 (559) Q Consensus 90 v~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna 169 (559) |.-|+++++..-+++..-....+.... .+ .+.......+......+..- +..-.++++.+..++-+-|.+ T Consensus 68 vgW~~~s~Sr~rL~as~idpDtg~ptg-----~i-~e~~~~~~~v~~~v~~i~gG----~lgqa~lLkr~~~~ltV~GE~ 137 (629) T protein:vir:86 68 VGWRSSSASRVRLIASAIDPDTGLPTG-----SI-DEDDRVGARVQQIVNQIAGG----ALGQAQLIKRVVEQLTVAGET 137 (629) T ss_pred hhhhhhhhceeeeEeeeecCCCCCCcc-----cc-CCCchhHHHHHHHHHhhcCC----hhhHHHHHHHHHhheecccce Confidence 888888888654433221111111000 01 11111112233334443322 222357999999999999999 Q ss_pred ceEEEEC------CCCcEE-EEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcc Q lcl|NC_012530. 170 NYENTYD------SNGRLS-HTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGY 242 (559) Q Consensus 170 ~~~i~rd------~~G~~~-~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~ 242 (559) |+.+.-- ..|.++ +++.|-++-|+-. .+ ..-+..-.+.........++++..++|.+.. ..+ T Consensus 138 wiv~~~~~~~~~d~~~~~~~eW~~vt~~ei~~~---~~------~~~i~lP~g~~~e~~~~~d~l~RiW~P~Prr--~~e 206 (629) T protein:vir:86 138 WVAILFTDKSRLDSNGNPVPEWLALTPEEVRAS---EK------KTIIELPTGDKHEFRDGLDGMFRVWNPRARR--ARE 206 (629) T ss_pred EEEEeecCCCccCCCCcchhhheeechHHhhhc---cC------ceeeEcCCCCcceeeCCCceEEEeeCCCccc--ccC Confidence 9877632 233333 3344444443311 11 1123333455555566677777667776532 345 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCcc-----------------CCHHHHHHHHHHHH--- Q lcl|NC_012530. 243 GLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTN-----------------TSMRALEDFKRHWT--- 302 (559) Q Consensus 243 G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~-----------------~~~e~~~~l~~~~~--- 302 (559) --||+.+++..+.-.........+..+.-.+-.|||.++...+-+. ...-+.+.|.+.|- T Consensus 207 ~DSpvra~l~~l~Ei~~lt~~i~aaakSRL~gnGvlflP~e~slP~~~~p~~~n~pg~~~p~~~~~pa~~~l~~~l~q~a 286 (629) T protein:vir:86 207 PDSPVRANLDSLKEIVRTTKTIANASKSRLIGNGVVFVPHEMSLPSMNAPVASNKPGAPAPPILGTPAVQQLQELLFQVA 286 (629) T ss_pred CcchhHHHHHHHHHHHHhhhHHHHHHHHHHhhCceeeeccCcccCccCCCCCCCCCCcccccccccchHHHHHHHHHHHH Confidence 6799999888887776666665555544444556665544322211 00113344444443 Q ss_pred -HHhcCcc-cccccccccCC------ceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHH-hccccccccccccccch Q lcl|NC_012530. 303 -ATSSGIN-GAYRIPMITAE------DAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAE-IGMQNRGGATGNKSNSL 373 (559) Q Consensus 303 -~~~~G~~-nag~~~vl~~g------~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~-lg~~~~~~~~~~~~~~~ 373 (559) .++...+ .+--|||+..+ .++...+.+.-+.--+.+|+..+..||....|||.. ||+..++..|+. +. T Consensus 287 ~tAi~De~S~aA~vPiia~~P~E~i~~i~hlkf~~ei~e~aiktR~daI~RlA~glDippE~LLGlGsd~NHWsA-Wq-- 363 (629) T protein:vir:86 287 QTAYDDEDSMAALIPMFAAAPGELIKNVTHLKFDNQVTEVAIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSA-WQ-- 363 (629) T ss_pred hhhhcCCCCccceeeeeEeechHHhcCeeEEeecCchhHHHHhhHHHHHHHHHhccCCchhhheeccCCccceEE-EE-- Confidence 3333222 23456776332 244444444455567889999999999999999875 566433333321 00 Q ss_pred hhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccc---c---Cccceeeecc-hhhhhHHHHHHHHHHHHcCCCCHHHH Q lcl|NC_012530. 374 NESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQI---L---GDNYMLEFVG-GDTRSQQDKLKSVQLELQTATTVNDY 446 (559) Q Consensus 374 ~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~---~---~~~~~~~f~~-l~~~d~~~~~~~~~~~~~~~~T~NE~ 446 (559) -+. .-++-.|.|.+..|+++|++.+|.+. + -.+|.+-|+. .+..+.....++....-+|.||-... T Consensus 364 --I~d-----edvrlHI~P~l~~ic~AlT~~~Lrp~Le~eGiDp~kYvvW~DaS~Lt~dPd~~deA~~a~drGAIt~eAl 436 (629) T protein:vir:86 364 --IGD-----EDVRLHILPPVEMLCEAITNQVLRTVLMREGIDPNAYVVWHDASQLTVDPDKTDEARDAFDRGAITAEAM 436 (629) T ss_pred --ecc-----cceeeecchHHHHHHHHHHhhHHHHHHHHhCCCHHHhEeeecCcccccCCCCcHHHHHHHHcCCcCHHHH Confidence 011 12345699999999999999887632 2 2468888874 45555544445555555788999999 Q ss_pred HHHhCCCCCCCCCEeeccceeccccc-ccccccc--c---ccccccccccccccCCCCCCCCCCCCccccccchhccccc Q lcl|NC_012530. 447 REKQGLPKIAGGDIILSAVYIQRLGQ-QEQIKQN--E---FQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGY 520 (559) Q Consensus 447 R~~~gl~pi~gGD~~~~~~~~~~l~~-~~~~~~~--~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (559) |+.+|+.--.|=|..-.-...+-+.. +.+.... . ..........+.+...-+...++.. +. +..+.. T Consensus 437 rk~lGf~eD~~yd~tt~E~~~~~a~d~V~~~P~Li~~~a~l~~~~a~~~~P~~~~~~pp~~e~~~---~d----E~sga~ 509 (629) T protein:vir:86 437 VKMLGLADDTVYDFTTPEGWAQWARDRVGQDPNLLPTLAVLIPELADVEFPTPTVALPPAEEQDG---DE----EASGAS 509 (629) T ss_pred HHHhcCccccccCCCchHHHHHHHHHhhhhCcchhhhhhhhhhhhcccccCccCCCCCccccCCC---cc----cccCCC Confidence 99999854333221100000000000 0000000 0 0000000111100000000000000 00 000000 Q ss_pred cccccccccccccccccccccccccchh-----------hhhhccCCCCC Q lcl|NC_012530. 521 TGKDAKPSGKDNQQGVGKDGQLKNKKNT-----------NSYKQGGSSKK 559 (559) Q Consensus 521 ~~~~~~~~g~~~~~~~~~~~~~k~~~~~-----------~~~~~~~~~~~ 559 (559) + +++-......++++..+.-.+. .+-...||.-. T Consensus 510 ~-----~~ep~te~d~~~~~a~~aa~~~~~~a~V~llv~RALelAGkR~r 554 (629) T protein:vir:86 510 R-----REEPDTEDDAGTDDSDQASLDSRETAMVEALVFRALELAGKRSR 554 (629) T ss_pred c-----CCCCCCCCCCcccccCCCCCCCcHHHHHHHHHHHHHHhcCCcCC Confidence 0 0000001011111111111110 01112233212 No 178 >protein:vir:99088 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655692;genbank:gi:109521770;genbank:GeneID:4157810 Probab=98.58 E-value=2e-08 Score=62.75 Aligned_cols=482 Identities=12% Similarity=0.078 Sum_probs=209.2 Q ss_pred HHHHHHHHHHHhhhhcccccc---------cc---ccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHH Q lcl|NC_012530. 22 HIDSKIANDTASKALNGVDRA---------YT---EPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAI 89 (559) Q Consensus 22 ~~~~~~~~~~~~~~~~gr~~a---------~~---~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~ac 89 (559) |...+ -.+.-|.|. .. .++ ..+...+--++++..+ ..|. .+-...+.-.+.++-. T Consensus 1 ma~~~-------lr~~rrpk~~p~~~r~~al~aas~~i-~~p~~~~~ks~~~~~~--~~WQ---~eAW~~~d~v~Elry~ 67 (629) T protein:vir:99 1 MAPTS-------LRIVRRPKSEPVSTRQRALVAASQPV-ENPGKAFRKAMGSSTR--TDWQ---DDAWKAYDAVGELRYY 67 (629) T ss_pred CCccc-------eeeeecCCCCChhhhhhhhhhhhhcc-cccchhhhhhcCCCch--hhhh---HHHHHHHHhhhhHHHH Confidence 11111 111112221 10 000 0000000000111111 0111 1112222336777778 Q ss_pred HHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCc Q lcl|NC_012530. 90 INTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQV 169 (559) Q Consensus 90 v~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna 169 (559) |.-|+++++..-+++..-....+.... .+ .+.......+......+..- +..-.++++.+..++-+-|.+ T Consensus 68 vgW~~~s~Sr~rL~as~idpDtg~ptg-----~i-~e~~~~~~~v~~~v~~i~gG----~lgqa~lLkr~~~~ltV~GE~ 137 (629) T protein:vir:99 68 VGWRSSSASRVRLIASAIDPDTGLPTG-----SI-DEDDRVGARVQQIVNQIAGG----ALGQAQLIKRVVEQLTVAGET 137 (629) T ss_pred hhhhhhhhceeeeEeeeecCCCCCCcc-----cc-CCCchhHHHHHHHHHhhcCC----hhhHHHHHHHHHhheecccce Confidence 888888888654433221111111000 01 11111112233334443322 222357999999999999999 Q ss_pred ceEEEEC------CCCcEE-EEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcc Q lcl|NC_012530. 170 NYENTYD------SNGRLS-HTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGY 242 (559) Q Consensus 170 ~~~i~rd------~~G~~~-~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~ 242 (559) |+.+.-- ..|.++ +++.|-++-|+-. .+ ..-+..-.+.........++++..++|.+.. ..+ T Consensus 138 wiv~~~~~~~~~d~~~~~~~eW~~vt~~ei~~~---~~------~~~i~lP~g~~~e~~~~~d~l~RiW~P~Prr--~~e 206 (629) T protein:vir:99 138 WVAILFTDKSRLDSNGNPVPEWLALTPEEVRAS---EK------KTIIELPTGDKHEFRDGLDGMFRVWNPRARR--ARE 206 (629) T ss_pred EEEEeecCCCccCCCCcchhhheeechHHhhhc---cC------ceeEEcCCCCccceeCCCceEEEeeCCCccc--ccC Confidence 9877632 233333 3334444443311 11 1123333455555556677777667776532 345 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCcc-----------------CCHHHHHHHHHHHH--- Q lcl|NC_012530. 243 GLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTN-----------------TSMRALEDFKRHWT--- 302 (559) Q Consensus 243 G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~-----------------~~~e~~~~l~~~~~--- 302 (559) --||+.+++..+.-.........+..+.-.+-.|||.++...+-+. ...-+.+.|.+.|- T Consensus 207 ~DSpvra~l~~l~Ei~~lt~~i~aaakSRL~gnGvlflP~e~slP~~~~p~~~n~pg~~~p~~~~~pa~~~l~~~l~q~a 286 (629) T protein:vir:99 207 PDSPVRANLDSLKEIVRTTKTIANASKSRLIGNGVVFVPHEMSLPSMNAPVASNKPGAPAPPILGTPAVQQLQELLFQVA 286 (629) T ss_pred CcchhHHHHHHHHHHHHhhhHHHHHHHHHHhhCceeEeccCcccCccCCCCCCCCCCcccccccccchHHHHHHHHHHHH Confidence 6799999888887776666665555554445556665544332211 00113344444443 Q ss_pred -HHhcCcc-cccccccccCC------ceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHH-hccccccccccccccch Q lcl|NC_012530. 303 -ATSSGIN-GAYRIPMITAE------DAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAE-IGMQNRGGATGNKSNSL 373 (559) Q Consensus 303 -~~~~G~~-nag~~~vl~~g------~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~-lg~~~~~~~~~~~~~~~ 373 (559) .++...+ .+--|||+..+ .++...+.+.-+.--+.+|+..+..||....|||.. ||+..++..|+. +. T Consensus 287 ~tAi~De~S~aA~vPiia~~P~E~i~~i~hlkf~~ei~e~aiktR~daI~RlA~glDippE~LLGlGsd~NHWsA-Wq-- 363 (629) T protein:vir:99 287 QTAYDDEDSMAALIPMFAAAPGELIKNVTHLKFDNQVTEVAIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSA-WQ-- 363 (629) T ss_pred hhhhcCCCCccceeeeeEeechHHhcCeeEEeecCchhHHHHhhHHHHHHHHHhccCCchhhheeccCCccceEE-EE-- Confidence 3333222 23456776332 244444444455567889999999999999999875 566433333321 00 Q ss_pred hhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccc---c---Cccceeeecc-hhhhhHHHHHHHHHHHHcCCCCHHHH Q lcl|NC_012530. 374 NESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQI---L---GDNYMLEFVG-GDTRSQQDKLKSVQLELQTATTVNDY 446 (559) Q Consensus 374 ~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~---~---~~~~~~~f~~-l~~~d~~~~~~~~~~~~~~~~T~NE~ 446 (559) -+. .-++-.|.|.+..|+++|++.+|.+. + -.+|.+-|+. .+..+.....++....-+|.||-... T Consensus 364 --I~d-----edvrlHI~P~l~~ic~AlT~~~Lrp~Le~eGiDp~kYvvW~DaS~Lt~dPd~~deA~~a~drGAIt~eAl 436 (629) T protein:vir:99 364 --IGD-----EDVRLHILPPVEMLCEAITNQVLRTVLMREGIDPNAYVVWHDASQLTVDPDKTDEARDAFDRGAITAEAM 436 (629) T ss_pred --ecc-----cceeeecchhHHHHHHHHHhhHHHHHHHHhCCCHHHhEeeecCcccccCCCCcHHHHHHHHcCCccHHHH Confidence 011 12345699999999999999887632 2 2468888874 44555444445555555788999999 Q ss_pred HHHhCCCCCCCCCEeeccceeccccc-ccccccc--c---ccccccccccccccCCCCCCCCCCCCccccccchhccccc Q lcl|NC_012530. 447 REKQGLPKIAGGDIILSAVYIQRLGQ-QEQIKQN--E---FQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGY 520 (559) Q Consensus 447 R~~~gl~pi~gGD~~~~~~~~~~l~~-~~~~~~~--~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (559) |+.+|+.--.|=|..-.-...+-+.. +.+.... . ..........+.+...-+...++.. +. +..+.. T Consensus 437 rk~lGf~eD~~yd~tt~E~~~~~a~d~V~~~P~Li~~~a~l~~~~a~~~~P~~~~~~pp~~e~~~---~d----E~sga~ 509 (629) T protein:vir:99 437 VKMLGLADDTVYDFTTPEGWAQWARDRVGQDPNLLPTLAVLIPELADVEFPTPTVALPPAEEQDG---DE----EASGAS 509 (629) T ss_pred HHHhcCccccccCCCchHHHHHHHHHhhhhCcchhhhhhhhhhhhcccccCccCCCCCccccCCC---cc----cccCCC Confidence 99999854332221100000000000 0000000 0 0000000111100000000000000 00 000000 Q ss_pred cccccccccccccccccccccccccchh-----------hhhhccCCCCC Q lcl|NC_012530. 521 TGKDAKPSGKDNQQGVGKDGQLKNKKNT-----------NSYKQGGSSKK 559 (559) Q Consensus 521 ~~~~~~~~g~~~~~~~~~~~~~k~~~~~-----------~~~~~~~~~~~ 559 (559) + +++-......++++..+.-.+. .+-...||.-. T Consensus 510 ~-----~~ep~te~d~~~~~a~~aa~~~~~~a~V~llv~RALelAGkR~r 554 (629) T protein:vir:99 510 R-----REEPDTEDDAGTDDSDQASLDSRETAMVEALVFRALELAGKRSR 554 (629) T ss_pred c-----CCCCCCCCCCcccccCCCCCCCcHHHHHHHHHHHHHHhcCCcCC Confidence 0 0000001011111111111110 01112233212 No 179 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=98.56 E-value=2.8e-07 Score=56.53 Aligned_cols=432 Identities=12% Similarity=0.057 Sum_probs=166.7 Q ss_pred ccccCCcc--hHHHHHHHHHHHHHHhhh-----hccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhh Q lcl|NC_012530. 10 KFYTDDPN--AFFKHIDSKIANDTASKA-----LNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSM 82 (559) Q Consensus 10 ~~~~~~~~--~~~~~~~~~~~~~~~~~~-----~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~ 82 (559) =-.++.++ +-+..|.+.+..+..... -.|++. ....+... .. .+.+.-.. T Consensus 1 ~~~~~~~d~~~~i~~L~~~~~~~~~r~~~~~~Yy~g~~~-------------------i~~~~~~~-~~---~~~~~~~~ 57 (488) T protein:vir:23 1 MAETESIDPEKLRDQLLDAFENKQNELKSSKAYYDAERR-------------------PDAIGLAV-PL---DMRKYLAH 57 (488) T ss_pred CCcccCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------------------hhhcCccc-ch---hhhhhhhh Confidence 01122222 223444444433322111 111110 00001100 00 11111123 Q ss_pred ChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeeccccccc-ChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHH Q lcl|NC_012530. 83 NVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKP-TKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVR 161 (559) Q Consensus 83 ~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~-~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~ 161 (559) ..+...||+.+++.+-. .||.+-....... ...+.+....+..++.. ..+......+.. T Consensus 58 ~n~~~~ivd~~a~~l~~-----------~Gf~~~~~~~~~~~~~~d~~~~~~l~~i~~~---------N~~~~~~~~~~~ 117 (488) T protein:vir:23 58 VGYPRTYVDAIAERQEL-----------EGFRIPSANGEEPESGGENDPASELWDWWQA---------NNLDIEATLGHT 117 (488) T ss_pred cchHHHHHHHHHHhhhc-----------cceeccCCcccccccccchhHHHHHHHHHHh---------cChhHHHHHHHH Confidence 45666777777654421 1222211111100 01111222334444332 123456677888 Q ss_pred HHHHcCCcceEEEECC--------CCcEEEEEEecCceEEEEecCcccccccceEEEEEecCcee---eeecccce---- Q lcl|NC_012530. 162 DTYTYDQVNYENTYDS--------NGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVR---GSFTADEM---- 226 (559) Q Consensus 162 d~ll~Gna~~~i~rd~--------~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~---~~~~~~ev---- 226 (559) +.+++|.+|+.+.++. .|.+ .+.+++|..+.++.+........+.+|++..++... ..+..+.+ T Consensus 118 ~a~i~G~a~~~v~~~~~~~~~~~~~~~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~ 196 (488) T protein:vir:23 118 DALIYGTAYITISMPDPEVDFDVDPEVP-LIRVEPPTALYAEVDPRTRKVLYAIRAIYGADGNEIVSATLYLPDTTMTWL 196 (488) T ss_pred HHhhcCceEEEEecCCcccccCCCCCcc-eEEEeccceeEEEEecCCCceEEEEEEEEecCCCcEEEEEEEecCcEEEEE Confidence 9999999999876642 2332 367889998887776432222222222222222111 12222222 Q ss_pred ---------------------EEEecccCCCccCCcccccHHH----HHHHHHHHHHHHHHHHHHHHHhcCCCceEEEec Q lcl|NC_012530. 227 ---------------------GMFIRNPRSDILSGGYGLSELE----MGLREFISHENTELFNDRFFTHGGTTKGILLVK 281 (559) Q Consensus 227 ---------------------i~~~~n~~~~~~~~~~G~Spl~----~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~ 281 (559) ++|+.++. ..+.+|.|-|+ .+.+++...+.-..-...||. .|.-+|. T Consensus 197 ~~~~~~~~~~~~~h~~g~vPvv~f~n~~~---~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a---~p~~~i~-- 268 (488) T protein:vir:23 197 RAEGEWEAPTSTPHGLEMVPVIPISNRTR---LSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMA---IPQRLIF-- 268 (488) T ss_pred ecCCceEeccccccCCCCcceEEeccccc---cCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhh---hHHHHHh-- Confidence 33332221 23457887654 333343333332222333333 2322221 Q ss_pred CccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC-ceeeeeccccchh-HHHHHHHHHHHHHHHHhCCCHHHhcc Q lcl|NC_012530. 282 PSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE-DAKFVSMTQAEDM-QFQSWLNYLINIICALVAMDPAEIGM 359 (559) Q Consensus 282 ~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g-~~~~~~ls~~~D~-qf~e~~~~~~~~Ia~~fgVPp~~lg~ 359 (559) +. ...+...+. +.-...|+.. .+++.+++.| +.++..+.. .++ .|++..+-.+..|+..=++|++.+|. T Consensus 269 G~-~~~~~~~~~-~~~~~~~~~~------~~~v~~~~~g~~~~~~q~~~-~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~ 339 (488) T protein:vir:23 269 GA-KPEELGINA-ETGQRMFDAY------MARILAFEGGEGAHAEQFSA-AELRNFVDALDALDRKAASYSGLPPQYLSS 339 (488) T ss_pred CC-Ccccccccc-cccchhhhhh------hhhhccCCCCCCceeEecCC-CChHHHHHHHHHHHHHHhcccCCCHHHhcc Confidence 10 000000000 0001112221 1234444433 356655542 233 37888888899999999999999975 Q ss_pred ccccccccccccchhhhhHH---HHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHH Q lcl|NC_012530. 360 QNRGGATGNKSNSLNESNNQ---NKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLE 436 (559) Q Consensus 360 ~~~~~~~~~~~~~~~~an~~---~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~ 436 (559) ...+..++ ......+.... +..+..+...|.-++..+...++..-. ......+.+.|......+..+.++.+.++ T Consensus 340 ~~~n~~Sg-~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~-~~~~~~i~v~f~~~~~~s~~~~ada~~kl 417 (488) T protein:vir:23 340 SSDNPASA-EAIKAAESRLVKKVERKNKIFGGAWEQAMRLAYKMVKGGDI-PTEYYRMETVWRDPSTPTYAAKADAAAKL 417 (488) T ss_pred ccCcchHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-chhhccceEEecCCCCCCHHHHHHHHHHH Confidence 33221110 00011111111 111122222333333333221111000 11224577788777777888888877666 Q ss_pred HcC---CCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccc Q lcl|NC_012530. 437 LQT---ATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 437 ~~~---~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (559) +++ .++..-+++++|+-+-+ ...+....+....................+.+. +.++.+..+.++. T Consensus 418 ~~~g~~~~s~et~~~~l~~~~d~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~e~~ 486 (488) T protein:vir:23 418 FANGAGLIPRERGWVDMGYTIVE----------REQMRQWLEQDQKQGLGLIGSLYGASTPEGKPG-EAPVGEPPAPEPD 486 (488) T ss_pred HhcccccCCHHHHHHhCCCCchH----------HHHHHHHHHHHHHHHHHHHHHHhccCCCcccCC-CCCCCCCCCCCCC Confidence 543 36777788888874321 011111000000000000000000000000000 0011111111111 Q ss_pred hh Q lcl|NC_012530. 514 QQ 515 (559) Q Consensus 514 ~~ 515 (559) .. T Consensus 487 ~a 488 (488) T protein:vir:23 487 AA 488 (488) T ss_pred CC Confidence 11 No 180 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=98.55 E-value=3e-07 Score=56.38 Aligned_cols=410 Identities=11% Similarity=0.050 Sum_probs=163.8 Q ss_pred cccCCcchHHHHHHHHHHHHHH-----hhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChH Q lcl|NC_012530. 11 FYTDDPNAFFKHIDSKIANDTA-----SKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVV 85 (559) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~-----~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~ 85 (559) .-...+++.+..|......+.. ..=-.|+++- ...++.. ...+.. +..-..+.+ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i-------------------~~~~~~~-~~~~~~-~~~k~~~n~ 59 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPL-------------------PELTRNT-SAAWRS-FQREARTNW 59 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-------------------hhcCccc-Chhhhh-hhhhhhcch Confidence 2222333334444333222211 1111222210 0001100 001111 111123456 Q ss_pred HHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHH Q lcl|NC_012530. 86 LNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYT 165 (559) Q Consensus 86 v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll 165 (559) ...||+..++.+- +.++.+...+..+. . ..+.+++.. ..+..+...+..++++ T Consensus 60 ~~~ivd~~~~~l~-----------~~~~~~~~~~d~~~----~---~~~~~i~~~---------N~~d~~~~~~~~~a~i 112 (456) T protein:vir:10 60 GLMVRDSVADRII-----------PNGITVGGSADSDL----A---LRARRIWRD---------NRMDSVCKQWVKYGLD 112 (456) T ss_pred HHHHHHHHHhhhc-----------cCCeecCCCCCcch----H---HHHHHHHHh---------cChhhHHHHHHHHHhh Confidence 6667776665443 23444322211111 1 123333332 1223455667889999 Q ss_pred cCCcceEEEECCCCcEEEEEEecCceEEEEecCccccc-ccceEEEEEecCceee------------------------- Q lcl|NC_012530. 166 YDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRR-TRGKIYRQYIDNKVRG------------------------- 219 (559) Q Consensus 166 ~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~-~~~~~y~~~~~~~~~~------------------------- 219 (559) +|.+|..+..+.+|.+. +..++|..+.++.+...... ....+|+...++.... T Consensus 113 ~G~ay~~v~~d~~g~~~-i~~~~p~~~~~i~d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 191 (456) T protein:vir:10 113 FGESYLTCWRRDDGTAT-ITADSPETMVVSVDPLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRR 191 (456) T ss_pred cCeeEEEEeeCCCCceE-EEEEccceeEEEEcCCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccce Confidence 99999999998888764 67889999888776543211 1112222211111100 Q ss_pred --eecccc------eEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCH Q lcl|NC_012530. 220 --SFTADE------MGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSM 291 (559) Q Consensus 220 --~~~~~e------vi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~ 291 (559) ...... .-|+..-|.--......|+|.++.....++....+..-........+.|--++.-.. ......+ T Consensus 192 ~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~--~~~~~~d 269 (456) T protein:vir:10 192 LVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTE--HGLPNVD 269 (456) T ss_pred eeeecCCceeeccccCCCCCceeEEEecCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccC--ccccccc Confidence 000000 000000000000112347777776666555544333322222222223322221100 0000000 Q ss_pred HHHHH--HHHHHHHHhcCcccccccccccCCceeeeeccccchhH-HHHHHHHHHHHHHHHhCCCHHHhccccccccccc Q lcl|NC_012530. 292 RALED--FKRHWTATSSGINGAYRIPMITAEDAKFVSMTQAEDMQ-FQSWLNYLINIICALVAMDPAEIGMQNRGGATGN 368 (559) Q Consensus 292 e~~~~--l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~~D~q-f~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~ 368 (559) +.-.. ....|+.. .+.+..+ .++.++..+.. .+++ |++..+..+..|+++=++|++.+|....+.+ + T Consensus 270 ~~g~~~~~~~~~~~~------~~~~~~~-~~~~~~~q~~~-~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~S-g- 339 (456) T protein:vir:10 270 ENGNAIDYASIFEAA------PGALWEL-PPGVDIWESQA-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQS-A- 339 (456) T ss_pred ccccccchhhhhhhh------ccccccC-CCCcceEEecc-cChhHHHHHHHHHHHHHHhccCCChHHhcccccChH-H- Confidence 11011 11123221 1222223 34466655543 3444 8899999999999999999999985322111 0 Q ss_pred cccchhhhhHHH---HHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHcCC-CCHH Q lcl|NC_012530. 369 KSNSLNESNNQN---KIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQTA-TTVN 444 (559) Q Consensus 369 ~~~~~~~an~~~---~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~~-~T~N 444 (559) .+...-+..... ..+..+..+|+-+++.+. .+........+.+.|......+..+.++++.++...+ ++.. T Consensus 340 ~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~-----~~~g~~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~ 414 (456) T protein:vir:10 340 EGAHNIEKGFLFKCEDRLSIAKIGLEAILVKAL-----QIEGESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWAS 414 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHH Confidence 000000111110 011111122222222111 1111222345778888888888888888887766544 5666 Q ss_pred HHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccc Q lcl|NC_012530. 445 DYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSN 511 (559) Q Consensus 445 E~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (559) -+++++|+.|-+ +. ..+.+....+.+.... +-...+.++ .+. T Consensus 415 ~~~~~lg~~~~~-------------i~------~~e~er~~~e~~~~~~--~~~~~~~~~----~~~ 456 (456) T protein:vir:10 415 IRRNILNYNADQ-------------IK------QDDLDRAREQITLFAG--NPVQRPQED----GSR 456 (456) T ss_pred HHHhhCCCCHHH-------------HH------HHHHHHHHHHHHHHhh--hhhhcCCCC----CCC Confidence 667777775421 00 0000000000000000 000000000 000 No 181 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=98.55 E-value=3e-07 Score=56.38 Aligned_cols=410 Identities=11% Similarity=0.050 Sum_probs=163.8 Q ss_pred cccCCcchHHHHHHHHHHHHHH-----hhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChH Q lcl|NC_012530. 11 FYTDDPNAFFKHIDSKIANDTA-----SKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVV 85 (559) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~-----~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~ 85 (559) .-...+++.+..|......+.. ..=-.|+++- ...++.. ...+.. +..-..+.+ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i-------------------~~~~~~~-~~~~~~-~~~k~~~n~ 59 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPL-------------------PELTRNT-SAAWRS-FQREARTNW 59 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-------------------hhcCccc-Chhhhh-hhhhhhcch Confidence 2222333334444333222211 1111222210 0001100 001111 111123456 Q ss_pred HHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHH Q lcl|NC_012530. 86 LNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYT 165 (559) Q Consensus 86 v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll 165 (559) ...||+..++.+- +.++.+...+..+. . ..+.+++.. ..+..+...+..++++ T Consensus 60 ~~~ivd~~~~~l~-----------~~~~~~~~~~d~~~----~---~~~~~i~~~---------N~~d~~~~~~~~~a~i 112 (456) T protein:vir:10 60 GLMVRDSVADRII-----------PNGITVGGSADSDL----A---LRARRIWRD---------NRMDSVCKQWVKYGLD 112 (456) T ss_pred HHHHHHHHHhhhc-----------cCCeecCCCCCcch----H---HHHHHHHHh---------cChhhHHHHHHHHHhh Confidence 6667776665443 23444322211111 1 123333332 1223455667889999 Q ss_pred cCCcceEEEECCCCcEEEEEEecCceEEEEecCccccc-ccceEEEEEecCceee------------------------- Q lcl|NC_012530. 166 YDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRR-TRGKIYRQYIDNKVRG------------------------- 219 (559) Q Consensus 166 ~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~-~~~~~y~~~~~~~~~~------------------------- 219 (559) +|.+|..+..+.+|.+. +..++|..+.++.+...... ....+|+...++.... T Consensus 113 ~G~ay~~v~~d~~g~~~-i~~~~p~~~~~i~d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 191 (456) T protein:vir:10 113 FGESYLTCWRRDDGTAT-ITADSPETMVVSVDPLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRR 191 (456) T ss_pred cCeeEEEEeeCCCCceE-EEEEccceeEEEEcCCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccce Confidence 99999999998888764 67889999888776543211 1112222211111100 Q ss_pred --eecccc------eEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCH Q lcl|NC_012530. 220 --SFTADE------MGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSM 291 (559) Q Consensus 220 --~~~~~e------vi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~ 291 (559) ...... .-|+..-|.--......|+|.++.....++....+..-........+.|--++.-.. ......+ T Consensus 192 ~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~--~~~~~~d 269 (456) T protein:vir:10 192 LVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTE--HGLPNVD 269 (456) T ss_pred eeeecCCceeeccccCCCCCceeEEEecCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccC--ccccccc Confidence 000000 000000000000112347777776666555544333322222222223322221100 0000000 Q ss_pred HHHHH--HHHHHHHHhcCcccccccccccCCceeeeeccccchhH-HHHHHHHHHHHHHHHhCCCHHHhccccccccccc Q lcl|NC_012530. 292 RALED--FKRHWTATSSGINGAYRIPMITAEDAKFVSMTQAEDMQ-FQSWLNYLINIICALVAMDPAEIGMQNRGGATGN 368 (559) Q Consensus 292 e~~~~--l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~~D~q-f~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~ 368 (559) +.-.. ....|+.. .+.+..+ .++.++..+.. .+++ |++..+..+..|+++=++|++.+|....+.+ + T Consensus 270 ~~g~~~~~~~~~~~~------~~~~~~~-~~~~~~~q~~~-~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~S-g- 339 (456) T protein:vir:10 270 ENGNAIDYASIFEAA------PGALWEL-PPGVDIWESQA-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQS-A- 339 (456) T ss_pred ccccccchhhhhhhh------ccccccC-CCCcceEEecc-cChhHHHHHHHHHHHHHHhccCCChHHhcccccChH-H- Confidence 11011 11123221 1222223 34466655543 3444 8899999999999999999999985322111 0 Q ss_pred cccchhhhhHHH---HHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHcCC-CCHH Q lcl|NC_012530. 369 KSNSLNESNNQN---KIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQTA-TTVN 444 (559) Q Consensus 369 ~~~~~~~an~~~---~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~~-~T~N 444 (559) .+...-+..... ..+..+..+|+-+++.+. .+........+.+.|......+..+.++++.++...+ ++.. T Consensus 340 ~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~-----~~~g~~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~ 414 (456) T protein:vir:10 340 EGAHNIEKGFLFKCEDRLSIAKIGLEAILVKAL-----QIEGESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWAS 414 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHH Confidence 000000111110 011111122222222111 1111222345778888888888888888887766544 5666 Q ss_pred HHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccc Q lcl|NC_012530. 445 DYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSN 511 (559) Q Consensus 445 E~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (559) -+++++|+.|-+ +. ..+.+....+.+.... +-...+.++ .+. T Consensus 415 ~~~~~lg~~~~~-------------i~------~~e~er~~~e~~~~~~--~~~~~~~~~----~~~ 456 (456) T protein:vir:10 415 IRRNILNYNADQ-------------IK------QDDLDRAREQITLFAG--NPVQRPQED----GSR 456 (456) T ss_pred HHHhhCCCCHHH-------------HH------HHHHHHHHHHHHHHhh--hhhhcCCCC----CCC Confidence 667777775421 00 0000000000000000 000000000 000 No 182 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=98.54 E-value=3.3e-07 Score=56.13 Aligned_cols=424 Identities=10% Similarity=0.030 Sum_probs=179.0 Q ss_pred Ccchhhhc----------cccccCCcc-hHHHHHHHHHHH------HHHhhhhccccccccccccccccccccccccccc Q lcl|NC_012530. 1 MGIFDRFR----------TKFYTDDPN-AFFKHIDSKIAN------DTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVP 63 (559) Q Consensus 1 ~~~~~~~~----------~~~~~~~~~-~~~~~~~~~~~~------~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~ 63 (559) |.+|..=| .+ .++++. +.+..+-+.... +.+..=-.|++..... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~i~~~------------------ 61 (470) T protein:vir:99 1 MKDINYGRDKVTGNSSFIFP-KGEKLTSNELLGFIAYNETVLKPRYRENMKLYLGKHKILTA------------------ 61 (470) T ss_pred CccccCCcccccCCceEEeC-CCCCcCHHHHHHHHHHHHHhhHHHHHHHHHHhccccccccC------------------ Confidence 88885433 22 223333 222222211111 1111112233211111 Q ss_pred cCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCC Q lcl|NC_012530. 64 KPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGV 143 (559) Q Consensus 64 ~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p 143 (559) +......+. + ...+....+|+..+.-+- |.+..+...+. .+....+..++.. T Consensus 62 -~~~~~~~~~----k--i~~n~~~~Ivd~~~~~l~-----------g~p~~~~~~~d-------~~~~~~l~~~~~~--- 113 (470) T protein:vir:99 62 -PEKETGADN----R--IVVNSAKYVVDVYNGYFC-----------GIEPKLALLND-------SSKIDEIARWNRQ--- 113 (470) T ss_pred -cccccCCcc----e--eecchHHHHHHHHhhhhc-----------cCCeeEeeCCc-------hhHHHHHHHHHHh--- Confidence 000001010 0 123445555655544332 12222222211 1122344455443 Q ss_pred CCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccc-cccceEEEEEecCce----e Q lcl|NC_012530. 144 DYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHR-RTRGKIYRQYIDNKV----R 218 (559) Q Consensus 144 ~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~-~~~~~~y~~~~~~~~----~ 218 (559) ..+......+..+.+.+|.+|..+.++.+|++ .+..++|..+.++.+..+.. ....++|+....+.. . T Consensus 114 ------n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~-~i~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~~~~ 186 (470) T protein:vir:99 114 ------ENFFDTINEISKQCDIFGRSIASIYQGEDARP-HLMYSSPNHAFIIYDDTVQRQPLAFVHYQIDNSNNWTDAYG 186 (470) T ss_pred ------cCHhHHHHHHHHHHHhcCeeEEEEEeCCCCeE-EEEEEccceeEEEEcCCCCcceEEEEEEEEEecCCeeEEEE Confidence 13455677888899999999999999988886 47889999999888765432 111222222111111 1 Q ss_pred eeecccceEEEec--------------c-----cCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEE Q lcl|NC_012530. 219 GSFTADEMGMFIR--------------N-----PRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILL 279 (559) Q Consensus 219 ~~~~~~evi~~~~--------------n-----~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~ 279 (559) ..+..+.+.++.. | |........+|.|-++.+...++....+..-..+.+...+.|-.++. T Consensus 187 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~ 266 (470) T protein:vir:99 187 VIQYADKFYKFKGYDIEEDTNAAGYAINPYGLVPAVEFFENEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMI 266 (470) T ss_pred EEEecCeEEEEEecccccccccccccccCCCccceEeecCCCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeee Confidence 1122222222110 1 11111123457777777666666666555555555666666765553 Q ss_pred ecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc-cCCceeeeecccc-chhHHHHHHHHHHHHHHHHhCCCHHHh Q lcl|NC_012530. 280 VKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI-TAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVAMDPAEI 357 (559) Q Consensus 280 ~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl-~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fgVPp~~l 357 (559) -- ..+.+ +.-+.+ ..+... ....++-. .+++.++..++.+ .+..+....+...+.|+..-++|++.. T Consensus 267 g~--~~~~~---~~g~~~-~~~~~~-----~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~ 335 (470) T protein:vir:99 267 GF--KLPED---DEGNPK-FDFKNN-----RVLYVSQLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNIQD 335 (470) T ss_pred cC--Ccccc---cccchh-hhhhhc-----ceeeecCCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCccccc Confidence 21 11111 111111 111111 01111111 1122333334422 233456678888999999999997543 Q ss_pred ccccccccccccccchhhhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHH Q lcl|NC_012530. 358 GMQNRGGATGNKSNSLNESNN---QNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQ 434 (559) Q Consensus 358 g~~~~~~~~~~~~~~~~~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~ 434 (559) +-.. +..++ .....-+... .+..+..+..+|+-++..+...+...--.......+.+.|......+..+.++.+. T Consensus 336 ~~~~-~n~Sg-~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~ 413 (470) T protein:vir:99 336 KNFA-GNSSG-VALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFNNKQDQELWSELDFKFTRNLPEDMASAIDNAK 413 (470) T ss_pred cccc-cCchH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccceEEeCCCCCcCHHHHHHHHH Confidence 2111 10000 0000000111 11122344455555555554444433222223446788898888888888888877 Q ss_pred HHHcCCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccch Q lcl|NC_012530. 435 LELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQ 514 (559) Q Consensus 435 ~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (559) .+ .|+++...++++++.- | + ...+..+.+.. ......... .....+.... ++ .+ T Consensus 414 kl-~giis~et~l~~l~~v-----d----~--~~E~eri~~E~----~~~~~~~~~---~~~~~d~~~~--d~-----~~ 467 (470) T protein:vir:99 414 NA-EGIVSKKTQLGMIPDI-----E----P--DAEMKQIAKEK----ADAIKQTQQ---LSMPIDILKR--DN-----NA 467 (470) T ss_pred HH-hccCCHHHHHHhCCCC-----C----H--HHHHHHHHHHH----HHHHHHHHh---hcCCCCcCCC--CC-----Cc Confidence 65 3678887777766431 1 0 00111111100 000000000 0000000000 00 00 Q ss_pred hcc Q lcl|NC_012530. 515 QNQ 517 (559) Q Consensus 515 ~~~ 517 (559) +++ T Consensus 468 ee~ 470 (470) T protein:vir:99 468 EEE 470 (470) T ss_pred cCC Confidence 000 No 183 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=98.47 E-value=5.2e-07 Score=55.04 Aligned_cols=440 Identities=11% Similarity=0.030 Sum_probs=175.5 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHH----------hhhhccccc-cccccccccccccccccccccccCCCCC Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTA----------SKALNGVDR-AYTEPVDGNLMFSTLEDTSIVPKPSPIA 69 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~gr~~-a~~~~~~~~~~~~~~~~~~~~~~p~~~~ 69 (559) -.-...|....+.+..... ..+-++...++. ..=-.|++. .+..+ ...+ .. T Consensus 22 ~~~~~~~~~~~~~~~~~~~-~~~i~~~i~~~~~~~~~r~~~~~~yY~g~~~~i~~~~---------------~~~~--~~ 83 (501) T protein:vir:96 22 RESRIRYRADNLEELMVNN-WELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSG---------------RRKD--NE 83 (501) T ss_pred hhHHhhhcccccccccCCh-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccCcc---------------ccCc--cc Confidence 1112233333333222222 111122222111 111122211 00000 0000 00 Q ss_pred cccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCCh Q lcl|NC_012530. 70 FGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIR 149 (559) Q Consensus 70 ~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~ 149 (559) ..+ .+ ...++...+|+..+.-+. |.+..+...+.. ........+.+++.. T Consensus 84 ~~~----~r--i~~n~~k~Ivd~~~~yl~-----------g~p~~~~~~~~~----~~~~~~~~l~~~~~~--------- 133 (501) T protein:vir:96 84 MAD----KR--AVHNYGRMISKFKTGYLA-----------GNPIRVEYDDND----DNSQNDDAIKRIGRI--------- 133 (501) T ss_pred ccc----ce--eecchHHHHHHHHhhhhc-----------ccCeeEeeCCcc----chhHHHHHHHHHHHh--------- Confidence 000 00 123455556655554332 222233322211 111122233444332 Q ss_pred hhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcc-cccccceEEEEEecC--c--eeeeeccc Q lcl|NC_012530. 150 DDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHG-HRRTRGKIYRQYIDN--K--VRGSFTAD 224 (559) Q Consensus 150 ~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g-~~~~~~~~y~~~~~~--~--~~~~~~~~ 224 (559) ..|......+..+++.+|.+|..+.++.+|.+. +..++|..+.++.+... .....+++|+..... . ....+.++ T Consensus 134 n~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~-i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~vyt~~ 212 (501) T protein:vir:96 134 NDLDSLNRTLIRDLSQTGRAYEVIYRSEYDETR-IKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQSAKDVVEIYTDE 212 (501) T ss_pred cCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceE-EEEEccceeEEEEcCCCCCceEEEEEEEEeecCCCcEEEEEEEcCC Confidence 134456777888999999999999999888764 77899999998877542 222223333322111 1 11123333 Q ss_pred ceEEEec-----------c-----cCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCcc Q lcl|NC_012530. 225 EMGMFIR-----------N-----PRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTN 288 (559) Q Consensus 225 evi~~~~-----------n-----~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~ 288 (559) .+.++.. | |.-.......|.|.++.+...++....+..-..+.+...+.|-.++.-... . . T Consensus 213 ~i~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~--~-~ 289 (501) T protein:vir:96 213 HIYTLDASDDFNEISVTTHAFGTVPITEYLNNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLA--L-P 289 (501) T ss_pred cEEEEeeCCCceeccccccCCCccceEEecCCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccc--c-C Confidence 3322211 0 100111223577877777666666665555555566666666555532111 1 1 Q ss_pred CCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhCCCHHHhcccccccccc Q lcl|NC_012530. 289 TSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATG 367 (559) Q Consensus 289 ~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~ 367 (559) ..+....++.. ..-..... ..+-....+.+..-++.+ .+..+....+...+.|+..-++|..-.+-...+ T Consensus 290 -~~~~~~~~~~~---~~~~~~~~-~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n---- 360 (501) T protein:vir:96 290 -KGMQASDMKRT---RLMQLKPP-KSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFSGN---- 360 (501) T ss_pred -cccchhhhhhc---Ceeeeccc-ccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCccccccc---- Confidence 11111222110 00000000 001111222333334322 233466677888888988889986554422111 Q ss_pred ccccchhh---h---hHHHHHHHHHHHHhhHHHHHHHHHHHhhcc-ccccCccceeeecchhhhhHHHHHHHHHHHHcCC Q lcl|NC_012530. 368 NKSNSLNE---S---NNQNKIDASKSKGLMPLLDMIAKNLTNGII-RQILGDNYMLEFVGGDTRSQQDKLKSVQLELQTA 440 (559) Q Consensus 368 ~~~~~~~~---a---n~~~~~~~~~~~~l~P~~~~ie~~ln~~L~-~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~~ 440 (559) .++.... . +-....+..+..+|+-++..+...++..-- .......+.+.|......+..+.++.+..+ .|+ T Consensus 361 -~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~ad~~~kl-~g~ 438 (501) T protein:vir:96 361 -TSGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGL-GGQ 438 (501) T ss_pred -chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccceEEeCCCCCcCHHHHHHHHHHH-hcc Confidence 1111000 0 111122244455555555555444432211 112234578889988889999988887766 366 Q ss_pred CCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhccccc Q lcl|NC_012530. 441 TTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGY 520 (559) Q Consensus 441 ~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (559) ++..-+.++++. ++ |. ...+..+..... + .......+......+.......+...++.| T Consensus 439 iS~et~~~~l~~--v~--D~------~~E~~ri~~E~~-~-~~~~~~~~~~~~~~~~~~~~~~e~~~d~~e--------- 497 (501) T protein:vir:96 439 VSQETALSLSGL--VE--SP------NEELDKINKEMS-E-IDFKGYSNDFNEHVGKYTDEVKETHTDDFE--------- 497 (501) T ss_pred CchHHHHHhCCC--CC--CH------HHHHHHHHHHHH-H-hhccccccchhhcccccCCcCCCCCCCccc--------- Confidence 787667776643 21 10 011111110000 0 000000000000000000000000000000 Q ss_pred cccccccc Q lcl|NC_012530. 521 TGKDAKPS 528 (559) Q Consensus 521 ~~~~~~~~ 528 (559) +-++ T Consensus 498 ----~~~~ 501 (501) T protein:vir:96 498 ----REYE 501 (501) T ss_pred ----cccC Confidence 0000 No 184 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=98.46 E-value=3.1e-07 Score=56.30 Aligned_cols=449 Identities=12% Similarity=0.095 Sum_probs=183.5 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhh---hcccccccccccccc----ccccccccccccccCCCCCcccH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKA---LNGVDRAYTEPVDGN----LMFSTLEDTSIVPKPSPIAFGRI 73 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~gr~~a~~~~~~~~----~~~~~~~~~~~~~~p~~~~~~~~ 73 (559) ||.|+=|---+..|+. ...+..... .....-..+...+.. ..++..+...+..-+...+...| T Consensus 1 ~~~~~lf~f~~~~d~~----------~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eL 70 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQN----------EYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDL 70 (516) T ss_pred CCchHhcccccchhhh----------HHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHH Confidence 9998877221111111 011111111 011111111111110 11111111111111111222233 Q ss_pred HHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHH Q lcl|NC_012530. 74 TDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFT 153 (559) Q Consensus 74 ~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~ 153 (559) ...-+..+.+|.|..+|.-|.+.+. ..+.+....++.+.+.. .....+.++..--..+.+... ... T Consensus 71 I~~YR~ma~~pEvd~Av~eIVneai------v~d~~~~pV~l~L~~~~-~s~~ik~kI~eeF~~Il~ll~-F~~------ 136 (516) T protein:vir:10 71 INTYRQLINNPEVERAVANIVNEAI------VYERGHKVVSLDLDDTD-FGSNVKEKILEEFDEVCRLLD-ASR------ 136 (516) T ss_pred HHHHHHHhhccchhhHHHHhhccee------EecCCCceEEEEecccC-cchHHHHHHHHHHHHHHHHhc-cch------ Confidence 3333455778999999998888753 34455555666664332 334444444333333333211 111 Q ss_pred HHHHHHHHHHHHcCCcceEEEEC-CCCcEEEEEEecCceEEEEe-----cCcccccccc-eEEEEEe-------cCce-- Q lcl|NC_012530. 154 SFLRKLVRDTYTYDQVNYENTYD-SNGRLSHTRMVDPTTIYFAN-----DEHGHRRTRG-KIYRQYI-------DNKV-- 217 (559) Q Consensus 154 ~f~~~~v~d~ll~Gna~~~i~rd-~~G~~~~L~~l~p~~V~~~~-----~~~g~~~~~~-~~y~~~~-------~~~~-- 217 (559) --..+++.+++.|..|..++-| +..-+.+|..|||.+|+.++ +..|....++ ..|+.+. .++. T Consensus 137 -~~~~~fR~WYVDgRi~fhKiid~~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~ 215 (516) T protein:vir:10 137 -KLDTLFRRWYVDSRIFFHKIMPNPKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIF 215 (516) T ss_pred -hhhHHHhhhhhcceEEEEEEecCccccceeeeeeCCcceeeEeeecccccccchhhhhhhheeeeccCcccccccccee Confidence 1223455567889999986665 34459999999999987643 2222211111 0111111 1110 Q ss_pred ----eeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHH Q lcl|NC_012530. 218 ----RGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRA 293 (559) Q Consensus 218 ----~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~ 293 (559) ...+ +.+.|++.+.-+-+...+.+ +|-|..|...+....-.+...-=|==.-|.-+-|.-++-+..|..-.++- T Consensus 216 ~~~~~ikI-~~dAI~y~hSGL~d~~~~~i-~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqY 293 (516) T protein:vir:10 216 EPNTRIKI-PRSAVVYASSGLMDCSDRGI-IGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEY 293 (516) T ss_pred CCCcceee-chhheeeecccceeCCCCce-eeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHH Confidence 1122 33444444322223333333 56678888777776666665544433334344455554443332211111 Q ss_pred HHHHHHHHHHHh-----cC-cccccc-ccccc--------CC-ceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHHh Q lcl|NC_012530. 294 LEDFKRHWTATS-----SG-INGAYR-IPMIT--------AE-DAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAEI 357 (559) Q Consensus 294 ~~~l~~~~~~~~-----~G-~~nag~-~~vl~--------~g-~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~l 357 (559) +..+-..+++.+ .| ..+..+ ..+++ +| |.++..|.-...+--++-..|..+.+.++++||.+.| T Consensus 294 l~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl 373 (516) T protein:vir:10 294 VNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMDDVRWFNKKLYEALRIPLSRI 373 (516) T ss_pred HHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccc Confidence 122222222111 01 001001 11221 11 2334333222334446677788889999999999999 Q ss_pred ccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhc----c-----ccccC----ccceeeecchhhh Q lcl|NC_012530. 358 GMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGI----I-----RQILG----DNYMLEFVGGDTR 424 (559) Q Consensus 358 g~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L----~-----~~~~~----~~~~~~f~~l~~~ 424 (559) +...........++. -+-++. -....|.-+..++...|...| + ++.++ ..+.|+|.....- T Consensus 374 ~~e~~~~~~~Gr~~E---ItRDEi---KF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f 447 (516) T protein:vir:10 374 PRDDGGMVIGGQDTA---ITRDEL---DFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYY 447 (516) T ss_pred cCCCCceeeccccch---hhHHHH---HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchH Confidence 765443322222222 222221 112234444455444443333 2 22222 3467777644332 Q ss_pred h-------HHHHHHHHHHHH---cCCCCHHHHHH-HhCCCCCCCCCEeeccceecccccccccccccccccccccccccc Q lcl|NC_012530. 425 S-------QQDKLKSVQLEL---QTATTVNDYRE-KQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLES 493 (559) Q Consensus 425 d-------~~~~~~~~~~~~---~~~~T~NE~R~-~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 493 (559) . ...|+.++..+- ...++.+=||+ .|.+.-.+ +.. +..+...+...... T Consensus 448 ~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDee-------------i~~--e~k~I~~E~~~~~~----- 507 (516) T protein:vir:10 448 TELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMTEEQ-------------IAQ--EEKQIEQEAGIKRF----- 507 (516) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhh-------------HHH--HHHHHHHhhhCCCC----- Confidence 2 334444443332 22356665554 44443211 000 00000000000000 Q ss_pred cCCCCCCCCCCCCccccccc Q lcl|NC_012530. 494 ALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 494 ~~~~~~~~~~~~~~~~~~~~ 513 (559) .+|.+.+.+ T Consensus 508 -----------~~p~~~~~f 516 (516) T protein:vir:10 508 -----------QNPENEDDF 516 (516) T ss_pred -----------CCCCccccC Confidence 001111111 No 185 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=98.46 E-value=3.1e-07 Score=56.30 Aligned_cols=449 Identities=12% Similarity=0.095 Sum_probs=183.5 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhh---hcccccccccccccc----ccccccccccccccCCCCCcccH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKA---LNGVDRAYTEPVDGN----LMFSTLEDTSIVPKPSPIAFGRI 73 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~gr~~a~~~~~~~~----~~~~~~~~~~~~~~p~~~~~~~~ 73 (559) ||.|+=|---+..|+. ...+..... .....-..+...+.. ..++..+...+..-+...+...| T Consensus 1 ~~~~~lf~f~~~~d~~----------~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eL 70 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQN----------EYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDL 70 (516) T ss_pred CCchHhcccccchhhh----------HHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHH Confidence 9998877221111111 011111111 011111111111110 11111111111111111222233 Q ss_pred HHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHH Q lcl|NC_012530. 74 TDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFT 153 (559) Q Consensus 74 ~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~ 153 (559) ...-+..+.+|.|..+|.-|.+.+. ..+.+....++.+.+.. .....+.++..--..+.+... ... T Consensus 71 I~~YR~ma~~pEvd~Av~eIVneai------v~d~~~~pV~l~L~~~~-~s~~ik~kI~eeF~~Il~ll~-F~~------ 136 (516) T protein:vir:10 71 INTYRQLINNPEVERAVANIVNEAI------VYERGHKVVSLDLDDTD-FGSNVKEKILEEFDEVCRLLD-ASR------ 136 (516) T ss_pred HHHHHHHhhccchhhHHHHhhccee------EecCCCceEEEEecccC-cchHHHHHHHHHHHHHHHHhc-cch------ Confidence 3333455778999999998888753 34455555666664332 334444444333333333211 111 Q ss_pred HHHHHHHHHHHHcCCcceEEEEC-CCCcEEEEEEecCceEEEEe-----cCcccccccc-eEEEEEe-------cCce-- Q lcl|NC_012530. 154 SFLRKLVRDTYTYDQVNYENTYD-SNGRLSHTRMVDPTTIYFAN-----DEHGHRRTRG-KIYRQYI-------DNKV-- 217 (559) Q Consensus 154 ~f~~~~v~d~ll~Gna~~~i~rd-~~G~~~~L~~l~p~~V~~~~-----~~~g~~~~~~-~~y~~~~-------~~~~-- 217 (559) --..+++.+++.|..|..++-| +..-+.+|..|||.+|+.++ +..|....++ ..|+.+. .++. T Consensus 137 -~~~~~fR~WYVDgRi~fhKiid~~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~ 215 (516) T protein:vir:10 137 -KLDTLFRRWYVDSRIFFHKIMPNPKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIF 215 (516) T ss_pred -hhhHHHhhhhhcceEEEEEEecCccccceeeeeeCCcceeeEeeecccccccchhhhhhhheeeeccCcccccccccee Confidence 1223455567889999986665 34459999999999987643 2222211111 0111111 1110 Q ss_pred ----eeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHH Q lcl|NC_012530. 218 ----RGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRA 293 (559) Q Consensus 218 ----~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~ 293 (559) ...+ +.+.|++.+.-+-+...+.+ +|-|..|...+....-.+...-=|==.-|.-+-|.-++-+..|..-.++- T Consensus 216 ~~~~~ikI-~~dAI~y~hSGL~d~~~~~i-~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqY 293 (516) T protein:vir:10 216 EPNTRIKI-PRSAVVYASSGLMDCSDRGI-IGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEY 293 (516) T ss_pred CCCcceee-chhheeeecccceeCCCCce-eeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHH Confidence 1122 33444444322223333333 56678888777776666665544433334344455554443332211111 Q ss_pred HHHHHHHHHHHh-----cC-cccccc-ccccc--------CC-ceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHHh Q lcl|NC_012530. 294 LEDFKRHWTATS-----SG-INGAYR-IPMIT--------AE-DAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAEI 357 (559) Q Consensus 294 ~~~l~~~~~~~~-----~G-~~nag~-~~vl~--------~g-~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~l 357 (559) +..+-..+++.+ .| ..+..+ ..+++ +| |.++..|.-...+--++-..|..+.+.++++||.+.| T Consensus 294 l~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl 373 (516) T protein:vir:10 294 VNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMDDVRWFNKKLYEALRIPLSRI 373 (516) T ss_pred HHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccc Confidence 122222222111 01 001001 11221 11 2334333222334446677788889999999999999 Q ss_pred ccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhc----c-----ccccC----ccceeeecchhhh Q lcl|NC_012530. 358 GMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGI----I-----RQILG----DNYMLEFVGGDTR 424 (559) Q Consensus 358 g~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L----~-----~~~~~----~~~~~~f~~l~~~ 424 (559) +...........++. -+-++. -....|.-+..++...|...| + ++.++ ..+.|+|.....- T Consensus 374 ~~e~~~~~~~Gr~~E---ItRDEi---KF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f 447 (516) T protein:vir:10 374 PRDDGGMVIGGQDTA---ITRDEL---DFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYY 447 (516) T ss_pred cCCCCceeeccccch---hhHHHH---HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchH Confidence 765443322222222 222221 112234444455444443333 2 22222 3467777644332 Q ss_pred h-------HHHHHHHHHHHH---cCCCCHHHHHH-HhCCCCCCCCCEeeccceecccccccccccccccccccccccccc Q lcl|NC_012530. 425 S-------QQDKLKSVQLEL---QTATTVNDYRE-KQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLES 493 (559) Q Consensus 425 d-------~~~~~~~~~~~~---~~~~T~NE~R~-~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 493 (559) . ...|+.++..+- ...++.+=||+ .|.+.-.+ +.. +..+...+...... T Consensus 448 ~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDee-------------i~~--e~k~I~~E~~~~~~----- 507 (516) T protein:vir:10 448 TELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMTEEQ-------------IAQ--EEKQIEQEAGIKRF----- 507 (516) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhh-------------HHH--HHHHHHHhhhCCCC----- Confidence 2 334444443332 22356665554 44443211 000 00000000000000 Q ss_pred cCCCCCCCCCCCCccccccc Q lcl|NC_012530. 494 ALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 494 ~~~~~~~~~~~~~~~~~~~~ 513 (559) .+|.+.+.+ T Consensus 508 -----------~~p~~~~~f 516 (516) T protein:vir:10 508 -----------QNPENEDDF 516 (516) T ss_pred -----------CCCCccccC Confidence 001111111 No 186 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=98.45 E-value=5.9e-07 Score=54.72 Aligned_cols=412 Identities=12% Similarity=0.069 Sum_probs=157.3 Q ss_pred cccccccccccccCCCC--CcccHHHHHHHH-hhChHHHHHHHHHHHHHHhhhhHhhhhcCC--cceeee---------- Q lcl|NC_012530. 52 MFSTLEDTSIVPKPSPI--AFGRITDVLRQY-SMNVVLNAIINTRANQVTEYAHRASTDDNG--MGYQVR---------- 116 (559) Q Consensus 52 ~~~~~~~~~~~~~p~~~--~~~~~~~~~~~~-~~~~~v~acv~~ia~~ia~~~~~~~~~~~g--~~~~v~---------- 116 (559) +++.. ..|..- +..-+..+.... .....+..+|+.....+..+-....+-.+. ...+.+ T Consensus 1 ~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~ 74 (474) T protein:vir:94 1 MFNII------RMPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDY 74 (474) T ss_pred Ccccc------cccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhcccccccc Confidence 12111 111110 100111111000 111233344443333333221111111100 000000 Q ss_pred ccccccc-ChhHHHHHHHHHHHHHhcCCCCCCC------------hhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEE Q lcl|NC_012530. 117 LKNGDKP-TKEQQKKIDYAERYIERMGVDYSPI------------RDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSH 183 (559) Q Consensus 117 ~~d~~~~-~~~~~~~~~~~~~~L~~~~p~~~~~------------~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~ 183 (559) .+...++ ....+.-.+....||..-.+..... ...|......+..+++.+|.+|..+.++.+|++ . T Consensus 75 ~~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~-~ 153 (474) T protein:vir:94 75 DKPDWRITTNFHQNLVDQKVSYVASKPVTYSCEDENVLKVIHDVLDTRWDNKLIDILTATSNKGIDWLQVYINENGEM-K 153 (474) T ss_pred ccCcceeecchHHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEecCCCee-E Confidence 0000000 0111111111112221100000000 012334455677889999999999999988875 4 Q ss_pred EEEecCceEEEEecCcc-cccccceEEEEEecCceeeeecccceEEEec----------------------c-----cCC Q lcl|NC_012530. 184 TRMVDPTTIYFANDEHG-HRRTRGKIYRQYIDNKVRGSFTADEMGMFIR----------------------N-----PRS 235 (559) Q Consensus 184 L~~l~p~~V~~~~~~~g-~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~----------------------n-----~~~ 235 (559) +..++|..+.++.+... ......++|+..........+..+.+.+++. | |.- T Consensus 154 i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv 233 (474) T protein:vir:94 154 LFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFI 233 (474) T ss_pred EEEEcccceEEEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCCccceE Confidence 77789999988876432 1222233333322222222333333332221 0 000 Q ss_pred CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccc Q lcl|NC_012530. 236 DILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIP 315 (559) Q Consensus 236 ~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~ 315 (559) ......+|.|-++.+...++....+..-..+.+...+.|-.++. +.. ++ ..+.+...+ ...++. T Consensus 234 ~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~--g~~--~~----~~~~~~~~~--------~~~~~i 297 (474) T protein:vir:94 234 AFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILK--GYE--GE----DLEEFMRGL--------KYYKAI 297 (474) T ss_pred EecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee--cCC--cc----cchhhhhhh--------hcccee Confidence 01112357777776666666655554444555555555655543 211 11 111122111 112332 Q ss_pred cc-cCCceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccch--hhhh---HHHHHHHHHHHH Q lcl|NC_012530. 316 MI-TAEDAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSL--NESN---NQNKIDASKSKG 389 (559) Q Consensus 316 vl-~~g~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~--~~an---~~~~~~~~~~~~ 389 (559) .+ .+++++|.....+ ...+....+...+.|...-++|..-.+ +..++.++... -+.. -.......+..+ T Consensus 298 ~~~~~~~~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~~p~~~~~----~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~ 372 (474) T protein:vir:94 298 NVDGDGGVETIQVEVP-VSSTKEYIDLMRVYIMEFGQGVDFQTD----KFGSAPSGIALKFLYGNLDLKANKLKNKATVA 372 (474) T ss_pred eccCCCceeEEeecCC-HHHHHHHHHHHHHHHHHHhCccccCcc----ccccccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33 2334555433332 334556667777888888888742211 11111100000 0000 111222344555 Q ss_pred hhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceecc Q lcl|NC_012530. 390 LMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQR 469 (559) Q Consensus 390 l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~ 469 (559) |+-++..|...++. ......+.+.|+.....+..+.++.+.. .|+|+..-++++++. ++ |. ... T Consensus 373 l~~~~~li~~~~~~----~~d~~~i~v~f~~~~p~~~~e~a~~~~~--~g~iS~et~l~~l~~--v~--D~------~~E 436 (474) T protein:vir:94 373 IQELISFIIDFNNL----KTDVKDIEISFNFNRMMNDAEQSQIIAQ--SQYLSRETLVKSSPL--VD--DY------KAE 436 (474) T ss_pred HHHHHHHHHHHhCC----CcccceeeEEeccCcccCHHHHHHHHHH--cCCCCHHHHHHhCCC--CC--CH------HHH Confidence 55555555443321 1233457788887777777777766544 367888778877654 21 10 011 Q ss_pred cccccccccccccccccccccccccCCCCCCCCCCCCccccccc Q lcl|NC_012530. 470 LGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 470 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (559) +..+.. +........+.......+....+..+.+.+++ T Consensus 437 ~eri~~------E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:94 437 LERIEQ------EQMEYNKQLPNLDDGGADGAQQQEGSNNKESE 474 (474) T ss_pred HHHHHH------HHHHHHhhccccCCCCCCCcccCCCCcccccC Confidence 111111 01000000100000001110111111111111 No 187 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=98.45 E-value=5.9e-07 Score=54.72 Aligned_cols=412 Identities=12% Similarity=0.069 Sum_probs=157.3 Q ss_pred cccccccccccccCCCC--CcccHHHHHHHH-hhChHHHHHHHHHHHHHHhhhhHhhhhcCC--cceeee---------- Q lcl|NC_012530. 52 MFSTLEDTSIVPKPSPI--AFGRITDVLRQY-SMNVVLNAIINTRANQVTEYAHRASTDDNG--MGYQVR---------- 116 (559) Q Consensus 52 ~~~~~~~~~~~~~p~~~--~~~~~~~~~~~~-~~~~~v~acv~~ia~~ia~~~~~~~~~~~g--~~~~v~---------- 116 (559) +++.. ..|..- +..-+..+.... .....+..+|+.....+..+-....+-.+. ...+.+ T Consensus 1 ~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~ 74 (474) T protein:vir:97 1 MFNII------RMPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDY 74 (474) T ss_pred Ccccc------cccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhcccccccc Confidence 12111 111110 100111111000 111233344443333333221111111100 000000 Q ss_pred ccccccc-ChhHHHHHHHHHHHHHhcCCCCCCC------------hhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEE Q lcl|NC_012530. 117 LKNGDKP-TKEQQKKIDYAERYIERMGVDYSPI------------RDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSH 183 (559) Q Consensus 117 ~~d~~~~-~~~~~~~~~~~~~~L~~~~p~~~~~------------~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~ 183 (559) .+...++ ....+.-.+....||..-.+..... ...|......+..+++.+|.+|..+.++.+|++ . T Consensus 75 ~~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~-~ 153 (474) T protein:vir:97 75 DKPDWRITTNFHQNLVDQKVSYVASKPVTYSCEDENVLKVIHDVLDTRWDNKLIDILTATSNKGIDWLQVYINENGEM-K 153 (474) T ss_pred ccCcceeecchHHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEecCCCee-E Confidence 0000000 0111111111112221100000000 012334455677889999999999999988875 4 Q ss_pred EEEecCceEEEEecCcc-cccccceEEEEEecCceeeeecccceEEEec----------------------c-----cCC Q lcl|NC_012530. 184 TRMVDPTTIYFANDEHG-HRRTRGKIYRQYIDNKVRGSFTADEMGMFIR----------------------N-----PRS 235 (559) Q Consensus 184 L~~l~p~~V~~~~~~~g-~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~----------------------n-----~~~ 235 (559) +..++|..+.++.+... ......++|+..........+..+.+.+++. | |.- T Consensus 154 i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv 233 (474) T protein:vir:97 154 LFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFI 233 (474) T ss_pred EEEEcccceEEEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCCccceE Confidence 77789999988876432 1222233333322222222333333332221 0 000 Q ss_pred CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccc Q lcl|NC_012530. 236 DILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIP 315 (559) Q Consensus 236 ~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~ 315 (559) ......+|.|-++.+...++....+..-..+.+...+.|-.++. +.. ++ ..+.+...+ ...++. T Consensus 234 ~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~--g~~--~~----~~~~~~~~~--------~~~~~i 297 (474) T protein:vir:97 234 AFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILK--GYE--GE----DLEEFMRGL--------KYYKAI 297 (474) T ss_pred EecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee--cCC--cc----cchhhhhhh--------hcccee Confidence 01112357777776666666655554444555555555655543 211 11 111122111 112332 Q ss_pred cc-cCCceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccch--hhhh---HHHHHHHHHHHH Q lcl|NC_012530. 316 MI-TAEDAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSL--NESN---NQNKIDASKSKG 389 (559) Q Consensus 316 vl-~~g~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~--~~an---~~~~~~~~~~~~ 389 (559) .+ .+++++|.....+ ...+....+...+.|...-++|..-.+ +..++.++... -+.. -.......+..+ T Consensus 298 ~~~~~~~~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~~p~~~~~----~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~ 372 (474) T protein:vir:97 298 NVDGDGGVETIQVEVP-VSSTKEYIDLMRVYIMEFGQGVDFQTD----KFGSAPSGIALKFLYGNLDLKANKLKNKATVA 372 (474) T ss_pred eccCCCceeEEeecCC-HHHHHHHHHHHHHHHHHHhCccccCcc----ccccccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33 2334555433332 334556667777888888888742211 11111100000 0000 111222344555 Q ss_pred hhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceecc Q lcl|NC_012530. 390 LMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQR 469 (559) Q Consensus 390 l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~ 469 (559) |+-++..|...++. ......+.+.|+.....+..+.++.+.. .|+|+..-++++++. ++ |. ... T Consensus 373 l~~~~~li~~~~~~----~~d~~~i~v~f~~~~p~~~~e~a~~~~~--~g~iS~et~l~~l~~--v~--D~------~~E 436 (474) T protein:vir:97 373 IQELISFIIDFNNL----KTDVKDIEISFNFNRMMNDAEQSQIIAQ--SQYLSRETLVKSSPL--VD--DY------KAE 436 (474) T ss_pred HHHHHHHHHHHhCC----CcccceeeEEeccCcccCHHHHHHHHHH--cCCCCHHHHHHhCCC--CC--CH------HHH Confidence 55555555443321 1233457788887777777777766544 367888778877654 21 10 011 Q ss_pred cccccccccccccccccccccccccCCCCCCCCCCCCccccccc Q lcl|NC_012530. 470 LGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 470 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (559) +..+.. +........+.......+....+..+.+.+++ T Consensus 437 ~eri~~------E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:97 437 LERIEQ------EQMEYNKQLPNLDDGGADGAQQQEGSNNKESE 474 (474) T ss_pred HHHHHH------HHHHHHhhccccCCCCCCCcccCCCCcccccC Confidence 111111 01000000100000001110111111111111 No 188 >protein:vir:102426 Length: 631 # NCBI annotation: gp11 # Family: family:all:2798 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655288;genbank:gi:109521851;genbank:GeneID:4157741 Probab=98.45 E-value=1.3e-07 Score=58.37 Aligned_cols=474 Identities=14% Similarity=0.081 Sum_probs=210.3 Q ss_pred HHHHHHHHHHHhhhhccccc----ccccc-ccccccc-cccccccccccCCC-CCccc-HHHHHHHHhhChHHHHHHHHH Q lcl|NC_012530. 22 HIDSKIANDTASKALNGVDR----AYTEP-VDGNLMF-STLEDTSIVPKPSP-IAFGR-ITDVLRQYSMNVVLNAIINTR 93 (559) Q Consensus 22 ~~~~~~~~~~~~~~~~gr~~----a~~~~-~~~~~~~-~~~~~~~~~~~p~~-~~~~~-~~~~~~~~~~~~~v~acv~~i 93 (559) +.+.+ +-.+.-|.| +-.+- +..+-++ .++...... .+ +.+.. ..+-...+.-.+.++-.|.-| T Consensus 1 ~~a~~------~lr~~rrpkg~~~a~~r~L~aAs~~~~dpg~~~~~~---~g~~~~~~WQ~eAW~~~d~v~Elry~vgW~ 71 (631) T protein:vir:10 1 MAATQ------SLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKS---TGISRNSDWQTDAWEAVDLVGELRYYVGWR 71 (631) T ss_pred CCccc------ceeeeecCCCCCccchhhhhhhhccccchhhhhhhh---cCCcccchhhHHHHHHHHhhhhHHHHhhhh Confidence 11100 111112222 11110 0000011 111110000 01 00111 011222223346777778888 Q ss_pred HHHHHhhhhHhhhhcCCcceeeecccccccChhHHH---HHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcc Q lcl|NC_012530. 94 ANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQK---KIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVN 170 (559) Q Consensus 94 a~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~---~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~ 170 (559) +++++..-++...-....+ .++....+ ....+.+...+...- +..-.++++.+..++-+-|.+| T Consensus 72 ~~s~sr~rL~as~idpDtg---------~ptg~iee~~~~~~~v~~~~~~i~gG----~lgQ~~llkrl~~~ltV~GE~w 138 (631) T protein:vir:10 72 ASSCSRCRLVASELDENTG---------LPTGGISEDNTEGERVREIVSKIADG----TLGQAALTKRVVECLTVPGELW 138 (631) T ss_pred hhhhceeeeEeeeeccCCC---------CCccccccCCchhHHHHHHHHhcCCC----cchHHHHHHHHHhheecccceE Confidence 8888865443322111100 11111110 112233333332221 1222579999999999999999 Q ss_pred eEEE-ECCC-------C--c-EEEEEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccC Q lcl|NC_012530. 171 YENT-YDSN-------G--R-LSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILS 239 (559) Q Consensus 171 ~~i~-rd~~-------G--~-~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~ 239 (559) +.+. |..+ | + .-+++++....|+.....+| ..+. ...+..-......|+++..++|.+.. T Consensus 139 iv~l~~p~~~~~~~pd~~~r~~~~W~~vt~~ei~~~~~g~g------~~v~-lp~g~~h~~~~~~D~l~RiW~P~prr-- 209 (631) T protein:vir:10 139 IVILTRPVKGAPAQPDGSVRTRQEWYAVSKEEIKKSNKGSG------TNIV-LPTGEEHEFVKGTDIIFRVWIPKPRK-- 209 (631) T ss_pred EEEEeccCcCCCCCcccccccccceeeccHHHHhcccCccc------ceee-cCCCCccceecCCceEEEeeCCCccc-- Confidence 8764 2222 1 2 23555556555543322222 1222 22233333344557777777776543 Q ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCcc----------------CCHHHHHHHHHHHH- Q lcl|NC_012530. 240 GGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTN----------------TSMRALEDFKRHWT- 302 (559) Q Consensus 240 ~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~----------------~~~e~~~~l~~~~~- 302 (559) ..+--||+.+++..+.-.........+..+.-.+-.|||.++...+-+. ...-+.+.|.+.+- T Consensus 210 ~~e~dSpvra~l~~l~Ei~~~t~~i~aaakSRl~gnGvlflP~els~P~~~~~~~~~~g~~v~~~~g~pa~~~l~~~l~q 289 (631) T protein:vir:10 210 ASEPDSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQ 289 (631) T ss_pred ccCCcchhHHHHHHHHHHHHhhhHHHHHHHHHHhhCceeEeccccccCCCCCCCCCcCCccCCccccchhHHHHHHHHHH Confidence 3456799999988888777766666665555555567777765543320 11113344443332 Q ss_pred ---HHhcCc-ccccccccccCC------ceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHH-hcccccccccccccc Q lcl|NC_012530. 303 ---ATSSGI-NGAYRIPMITAE------DAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAE-IGMQNRGGATGNKSN 371 (559) Q Consensus 303 ---~~~~G~-~nag~~~vl~~g------~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~-lg~~~~~~~~~~~~~ 371 (559) .++... ..+--+||+..+ .++...+.+.-+.--+.+|+..+..||....|||.. ||+..++..|+. +. T Consensus 290 ~a~tai~De~S~aA~vPii~~~p~E~i~~i~hlkf~~ei~e~aiktR~daI~RlA~glDi~pE~LLGlGsd~NHWsA-Wq 368 (631) T protein:vir:10 290 VAETAVEDEDSQAAFIPVIAGVPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSA-WQ 368 (631) T ss_pred HHhhhhcCCCCccceeeeeEeechHHhcCeeEEeecCchhHHHHhhHHHHHHHHHhccCCchhhheeccCCccceEE-EE Confidence 222221 123456776332 244444444455567889999999999999999875 566433333321 00 Q ss_pred chhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccc---c---Cccceeeecc-hhhhhHHHHHHHHHHHHcCCCCHH Q lcl|NC_012530. 372 SLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQI---L---GDNYMLEFVG-GDTRSQQDKLKSVQLELQTATTVN 444 (559) Q Consensus 372 ~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~---~---~~~~~~~f~~-l~~~d~~~~~~~~~~~~~~~~T~N 444 (559) -+. .-++-.|.|.+..|+++|++.+|.+. + -.+|.+-|+. .+..+.....++.+..-+|.||-. T Consensus 369 ----I~d-----edVrlHI~P~l~lic~AlT~q~Lrp~Le~eGvDp~kYvvW~DaS~Lt~dPdr~deA~qa~drGAIt~e 439 (631) T protein:vir:10 369 ----ISD-----EDVQLHIAPVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDPSQLTIDPDKSDEAKFAYENGAINGE 439 (631) T ss_pred ----ecc-----cceeeecchHHHHHHHHHHhhHHHHHHHHhCCCHHHhEeeecCcccccCCCCcHHHHHHHHcCCcCHH Confidence 011 12345699999999999999887642 2 2468888874 445554444455555557889999 Q ss_pred HHHHHhCCCCCCCCC------------------EeeccceecccccccccccccccccccccccccccCCCCCCCCCCCC Q lcl|NC_012530. 445 DYREKQGLPKIAGGD------------------IILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLP 506 (559) Q Consensus 445 E~R~~~gl~pi~gGD------------------~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (559) ..|+.+|+.--.+=| .-+.|. +.++.. .+-..... +.+.......++..++ T Consensus 440 Alrk~lGf~eDd~yd~~t~e~~~~~a~~av~~dpaLip~-lApl~~--------~~~~~v~~--P~~~a~~~~g~ed~~~ 508 (631) T protein:vir:10 440 ALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPM-LAPLIA--------GVLKQIEF--PQQQAIDSGGNEDTSD 508 (631) T ss_pred HHHHHhcCchhcccCcCchHHHHHHHHHHhhcccCcchh-hHHHHH--------HHhhhccC--CCCCCCCCCCCCcccc Confidence 999999995433222 111110 011100 00000011 1111111111111100 Q ss_pred ccccccchhccccccccccccccccccccccccccccccchhh-----hhhccCCCCC Q lcl|NC_012530. 507 PSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQLKNKKNTN-----SYKQGGSSKK 559 (559) Q Consensus 507 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~k~~~~~~-----~~~~~~~~~~ 559 (559) ....++.....+... ......+.+. ....+. +-...||--+ T Consensus 509 ~~~~~~g~~epdt~d----------~~p~~~~a~~--~~~iv~llv~RALelAGkRl~ 554 (631) T protein:vir:10 509 ADDLDDGEQEPDTED----------DDDGTQKAGL--ETGIVDLMVDRALELVGKRRR 554 (631) T ss_pred ccccccCCCCCCCCC----------CCCccccccc--hHHHHHHHHHHHHHhhcchhc Confidence 000000000101100 0001111111 001111 0111222212 No 189 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=98.42 E-value=7.6e-07 Score=54.15 Aligned_cols=421 Identities=10% Similarity=0.028 Sum_probs=170.6 Q ss_pred Ccch--hhhccccccCCc-chHHHHHHHHHHH-----HHHhhhhccccccccccccccccccccccccccccCCCCCccc Q lcl|NC_012530. 1 MGIF--DRFRTKFYTDDP-NAFFKHIDSKIAN-----DTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGR 72 (559) Q Consensus 1 ~~~~--~~~~~~~~~~~~-~~~~~~~~~~~~~-----~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 72 (559) |+.= ..|.-+ .++++ .+.|..+-...-. +.+.+=-.|++.....+. ....+.+ T Consensus 1 ~~~~~~~~~~~p-~d~~~~~~~l~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~------------------~~~~~~~ 61 (453) T protein:vir:39 1 MKYKPPKLMTFP-KDEPITNEVVTKFMEKHRLEVARYEYLKNMYRGIMAIDAEPT------------------KDLWKPD 61 (453) T ss_pred CeecCCcceEcC-CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhhccCchhcCCC------------------ccccCcc Confidence 4332 122211 22222 2223333222111 111111223322111110 0000000 Q ss_pred HHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhH Q lcl|NC_012530. 73 ITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDF 152 (559) Q Consensus 73 ~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~ 152 (559) .+ ...++...+|+..+.-+.. .+..+...+ ....+.+.+++..- .+ T Consensus 62 ----~k--i~~n~~~~ivd~~~~~l~g-----------~~~~~~~~d--------~~~~~~l~~i~~~N---------~~ 107 (453) T protein:vir:39 62 ----NR--LTVNFTKYIVDTFTGYFNG-----------IPVKKSHSD--------KETLSKLQEFDNLN---------DM 107 (453) T ss_pred ----ce--eecchHHHHHHHHhhhhcc-----------cCceeccCC--------hHHHHHHHHHHHhc---------Ch Confidence 11 1134555566655544321 122221111 11223455554431 23 Q ss_pred HHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCccc-ccccceEEEEEecCc-eeeeecccceEEEe Q lcl|NC_012530. 153 TSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGH-RRTRGKIYRQYIDNK-VRGSFTADEMGMFI 230 (559) Q Consensus 153 ~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~-~~~~~~~y~~~~~~~-~~~~~~~~evi~~~ 230 (559) ......+..+.+.+|.+|..+.++.+|.+. +..++|..+.++.++... .....++|+...+.. ....+.++.+.++. T Consensus 108 ~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~-i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~~yt~~~i~~~~ 186 (453) T protein:vir:39 108 EDEESELAKMACIYGRAFELLYQNEETQTN-VIYNTPENMFMVYDDTIKQEPLFAVRYGYDDDYKLYGEVYTKETTYALN 186 (453) T ss_pred hHHHHHHHHHHhhcCeEEEEEEecCCCceE-EEEEcccceEEEecCCCCCeEEEEEEEEEeCCeEEEEEEEeCCeEEEEE Confidence 446677788999999999999999988764 667899999888765332 111112222111100 01122333322222 Q ss_pred cc------------cCC-----CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHH Q lcl|NC_012530. 231 RN------------PRS-----DILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRA 293 (559) Q Consensus 231 ~n------------~~~-----~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~ 293 (559) .. +.. .......|.|.++.....++....+..-..+.+...+.|-.++. +. .++++. T Consensus 187 ~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~--g~----~~~~~~ 260 (453) T protein:vir:39 187 GTMGFYNMTEQAPNPFDDLPVVEFYFNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFL--GA----AVEEED 260 (453) T ss_pred ecCCceeeecccccCCCceeEEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeee--cC----CCCchh Confidence 11 000 01112357777766666555544444444444555556655553 21 233333 Q ss_pred HHHHHHHHHHHhcCcccccccccccCCceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccch Q lcl|NC_012530. 294 LEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSL 373 (559) Q Consensus 294 ~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~ 373 (559) ...++.. ..-...+. ...-..++++|..... .+..+....+...+.|+..-++|..-.+ . .++.++..+ T Consensus 261 ~~~~~~~---~~~~~~~~--~~~~~~~~~~~lt~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~--~---~gn~Sg~Al 329 (453) T protein:vir:39 261 LKNIRSN---RVINYYGE--SSEAKNVDVKFLEKPD-SDSQTENLLDRLTKLIFQTTMVANISDE--S---FGSSSGVSL 329 (453) T ss_pred hhhhhhc---ceeeecCC--CCCCCCCceeEEeecC-CHHHHHHHHHHHHHHHHHHhCCcccccc--c---ccCChHHHH Confidence 3333221 10000000 0011223344443222 2345566778888888888888843211 1 111111111 Q ss_pred --hhhh---HHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHcCCCCHHHHHH Q lcl|NC_012530. 374 --NESN---NQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQTATTVNDYRE 448 (559) Q Consensus 374 --~~an---~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~~~T~NE~R~ 448 (559) .+.. -....+..+..+|+.++..+...++..- .......+.+.|......|..+.++++..+ .|.|+.--+.+ T Consensus 330 ~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~-~~~~~~~i~v~f~~~~p~~~~~~a~~~~kl-~g~is~et~l~ 407 (453) T protein:vir:39 330 AYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVS-NKEAWKDIEYTFTRNEPKDIKEQAETANIL-MGITSQETALS 407 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-CccccccceEEeCCCCCcCHHHHHHHHHHH-hccCChHHHHH Confidence 0111 1112234455566666665554443221 112234578889888888888888887655 46678877777 Q ss_pred HhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccc Q lcl|NC_012530. 449 KQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSN 511 (559) Q Consensus 449 ~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (559) +++.-+-+ ...+..+.+... +.............+..++.++ .+.| T Consensus 408 ~l~~v~D~----------~~E~~ri~~E~~---~~~~~~~~~~~~~~~~~~~~~~----~~~e 453 (453) T protein:vir:39 408 VISVIPDV----------QAEMEKIKKEEA---STAIFDKDKQPSEKGTDTVVPE----TNEE 453 (453) T ss_pred hCCCCCCH----------HHHHHHHHHHHH---HHHHHHHhccCCCCCCCCCCCC----cCCC Confidence 77542110 011111111000 0000000000000000000000 0000 No 190 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=98.41 E-value=7.8e-07 Score=54.08 Aligned_cols=392 Identities=12% Similarity=0.118 Sum_probs=146.8 Q ss_pred cCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHh---------------------------------hhhHhhhhcC- Q lcl|NC_012530. 64 KPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTE---------------------------------YAHRASTDDN- 109 (559) Q Consensus 64 ~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~---------------------------------~~~~~~~~~~- 109 (559) .++ ...+|..+++.+.+.... +|..++.... T Consensus 1 ~~t---------------~~~~i~~L~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~ 65 (480) T protein:vir:78 1 MTT---------------YHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSD 65 (480) T ss_pred CCC---------------HHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHhhhhhhcchHHHHHHHHHh Confidence 000 011223333333333332 3322222111 Q ss_pred ---CcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEE------CCCCc Q lcl|NC_012530. 110 ---GMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTY------DSNGR 180 (559) Q Consensus 110 ---g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~r------d~~G~ 180 (559) ..+|.+. + +.+..+.+.+++.. ..+......+..+.+++|.+|..+.+ |.+|. T Consensus 66 ~l~~~g~~~~--~-------d~~~~~~l~~i~~~---------N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~ 127 (480) T protein:vir:78 66 RLDIEGFRIS--E-------DSEGLEELWNWWQA---------NDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGI 127 (480) T ss_pred hhccCceecC--C-------CchhHHHHHHHHHh---------cCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCe Confidence 1111110 0 00011122222211 12334566788899999999988775 34555 Q ss_pred EEEEEEecCceEEEEecCccc-ccccceEEEEEecCce----eeeecccc-----------------------------e Q lcl|NC_012530. 181 LSHTRMVDPTTIYFANDEHGH-RRTRGKIYRQYIDNKV----RGSFTADE-----------------------------M 226 (559) Q Consensus 181 ~~~L~~l~p~~V~~~~~~~g~-~~~~~~~y~~~~~~~~----~~~~~~~e-----------------------------v 226 (559) +. +.+++|..|.++.+.... ......+|+...+... ...+.++. | T Consensus 128 ~~-i~~~~p~~~~~~~D~~~~~~~~~~i~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv 206 (480) T protein:vir:78 128 PL-IRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV 206 (480) T ss_pred eE-EEEEcccceEEEEcCCCccceEEEEEEEEeecCCCceEEEEEEeCCeEEEEEecCCCccccccccccccCCCCCcce Confidence 54 778899999888775321 1111122211111100 01122222 2 Q ss_pred EEEecccCCCccCCcccccHHH----HHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHH Q lcl|NC_012530. 227 GMFIRNPRSDILSGGYGLSELE----MGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWT 302 (559) Q Consensus 227 i~~~~n~~~~~~~~~~G~Spl~----~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~ 302 (559) ++|+.++. ..+.+|.|-|+ .+.+++...+.-..-...+| +.|--+|. + ....+...+.. ...|. T Consensus 207 v~f~n~~~---~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~---a~p~~~i~--G-~~~~~~~~~~~---~~~~~ 274 (480) T protein:vir:78 207 VPLTNDPR---LGNRYGRSEISPELRKVTDAASRTLMNLQSASQIL---GTPLRVIS--G-VTTDELTNDGE---NTTLD 274 (480) T ss_pred EEeecccc---cCCccCcccchhhHHHHHHHHHHHHHHHHHHHHhh---cchhhhhh--c-CCccccccccc---cchhh Confidence 33332221 23357877654 33444444333333333333 34443432 1 11111111100 11122 Q ss_pred HHhcCcccccccccccCCceeeeeccccchhH-HHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHH- Q lcl|NC_012530. 303 ATSSGINGAYRIPMITAEDAKFVSMTQAEDMQ-FQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQN- 380 (559) Q Consensus 303 ~~~~G~~nag~~~vl~~g~~~~~~ls~~~D~q-f~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~- 380 (559) .. .+.+..+.+++.++..+.. .+++ |++..+..+..|+..=++|+..+|....+..++ .....-+..... T Consensus 275 ~~------~~~~~~~~~~~~~~~~~~~-~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg-~Alk~~~~~l~~k 346 (480) T protein:vir:78 275 IY------YGRILTLASEAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASA-EAIIATDSRIVKM 346 (480) T ss_pred hh------hhhhccCCCCCceEEecCc-cCHHHHHHHHHHHHHHHhcccCCChHHhccccCcchHH-HHHHHHHHHHHHH Confidence 11 1233444455667766543 2343 788888899999999999999998533211100 000000111100 Q ss_pred --HHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHc-C--CCCHHHHHHHhCCCCC Q lcl|NC_012530. 381 --KIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-T--ATTVNDYREKQGLPKI 455 (559) Q Consensus 381 --~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~--~~T~NE~R~~~gl~pi 455 (559) ..+..+...|.-.+..+........ ......+.+.|......+..+.++...+.+. + .++..-+++.+|+.+- T Consensus 347 a~~~~~~f~~~l~~~~~l~~~~~g~~~--~~~~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d 424 (480) T protein:vir:78 347 AERKGRIFGGAWERAMRIAMQIMGREV--TEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTAT 424 (480) T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCCc--cccceeeeEEecCCCCCCHHHHHHHHHHHHHhccccCCHHHHHhcCCCCHh Confidence 0111111122222222211111000 1112346777876666677777766655543 3 2566667888887542 Q ss_pred CCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhcccccccccccccccccccc Q lcl|NC_012530. 456 AGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQG 535 (559) Q Consensus 456 ~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 535 (559) + ...+.... . ++........... ...++.....+..++..++.+ + ..++ T Consensus 425 ~----------~~~~~~~~---~---e~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~---------~----~~~~ 473 (480) T protein:vir:78 425 Q----------REQMRDWD---K---QETEDMIDTLYST--TKAQADATPKPTVTETKTETQ---------T----SPSG 473 (480) T ss_pred H----------HHHHHHHH---H---HHHHHHHHHhhcc--ccccCCCCCCCCCCCCCCccc---------c----ccCC Confidence 1 01111100 0 0000000000000 000000000000000000000 0 1111 Q ss_pred ccccccc Q lcl|NC_012530. 536 VGKDGQL 542 (559) Q Consensus 536 ~~~~~~~ 542 (559) .+.++-+ T Consensus 474 ~~~~~~~ 480 (480) T protein:vir:78 474 FNRTKTR 480 (480) T ss_pred CCcccCC Confidence 1111110 No 191 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=98.39 E-value=8.8e-07 Score=53.78 Aligned_cols=411 Identities=11% Similarity=0.073 Sum_probs=176.2 Q ss_pred Ccchhhhcccc---------------ccC--Ccc---hHHHHHHHHHHHHHHhhhhcccccccccccccccccccccccc Q lcl|NC_012530. 1 MGIFDRFRTKF---------------YTD--DPN---AFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTS 60 (559) Q Consensus 1 ~~~~~~~~~~~---------------~~~--~~~---~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~ 60 (559) |++++|...-| ++| .|. +++.+++.-+..- .|+..-+... . . T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y------~g~~~~l~~~-----~--~----- 62 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYY------MDDFKQVTHK-----N--S----- 62 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHh------cCCCcccccc-----c--c----- Confidence 99999864111 111 110 1222221111100 0110000000 0 0 Q ss_pred ccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHh Q lcl|NC_012530. 61 IVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIER 140 (559) Q Consensus 61 ~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~ 140 (559) ...+.... .........+++..|+-|..=| -.|... ..+..+.+.+++.. T Consensus 63 -~~~~~~~~----------~~slnl~~~i~~~~A~ll~~e~-----------~~i~~~--------d~~~~e~l~~i~~~ 112 (505) T protein:vir:79 63 -YGDTQKHE----------LQSVNVTKLASAKLASLIFNEQ-----------CQVTVS--------DETANDFLDDVFQQ 112 (505) T ss_pred -CCCccccc----------eeecchHHHHHHHHHhhhcCCC-----------ceeecC--------ChHHHHHHHHHHHh Confidence 00000000 0111233445555554443211 011111 12233344454433 Q ss_pred cCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccc-------------cce Q lcl|NC_012530. 141 MGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRT-------------RGK 207 (559) Q Consensus 141 ~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~-------------~~~ 207 (559) ..|...+...+.+.+..|.+++.+..|. |. +.+..++|..+.++..+.+.... ... T Consensus 113 ---------n~f~~~~~~~~e~a~a~G~~~~k~~~D~-~~-~~i~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~ 181 (505) T protein:vir:79 113 ---------NDFYTTFEEKLEEWIALGSGCVRPYVDS-GK-IKLAWATADQVYPLQADTNQVNELAIASRTTEVENHRTI 181 (505) T ss_pred ---------ccHHHHHHHHHHHHhhcCCeEEEEEEeC-Cc-eEEEEEcCCeeEEEEEcCCCeEEEEEEEEEEEecCCcce Confidence 2355667778888999999999888874 44 35777888887775433332210 001 Q ss_pred EEEE-----------------Eec------Ccee--------------eee---cccceEEEecccCC--CccCCccccc Q lcl|NC_012530. 208 IYRQ-----------------YID------NKVR--------------GSF---TADEMGMFIRNPRS--DILSGGYGLS 245 (559) Q Consensus 208 ~y~~-----------------~~~------~~~~--------------~~~---~~~evi~~~~n~~~--~~~~~~~G~S 245 (559) +|.. +.. |..+ ..+ ...-..|++ +|.. .....++|+| T Consensus 182 ~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g~~~p~f~~~~-~~~~N~~~~~splG~S 260 (505) T protein:vir:79 182 YYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQYEGLEPQVKITGLKHPLFAFYR-NKGANNKNFTSPMGMS 260 (505) T ss_pred EEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhcccccccCcceeecCCCcceEEEec-CCcccccccCCccCCc Confidence 1100 000 0000 000 001122333 2222 2224568999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCC----ceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCc Q lcl|NC_012530. 246 ELEMGLREFISHENTELFNDRFFTHGGTT----KGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAED 321 (559) Q Consensus 246 pl~~~~~~i~~~~~~~~~~~~~f~ng~~p----~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~ 321 (559) .+.-+...|......-.-..+-|+.|... ..+|....... +. ..... . ..+.+...........+++ T Consensus 261 ~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~-~~-~~~~~---~----~~fd~~~~~y~~~~~~~~~ 331 (505) T protein:vir:79 261 LIDNSYTVIDAINRTHDQFVDEVKKGQRRLIVPAEWLKTGSSYG-GQ-ASETH---P----PMFDPDETVYQAMYGDASE 331 (505) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcccCCCC-cc-ccccc---c----cCCCccceeeeeccCCCCC Confidence 99999988887776666666667766432 22222211100 00 00000 0 0000000111101111222 Q ss_pred eeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccch---hhhhHHHHHHHHHHHHhhHHHHHH Q lcl|NC_012530. 322 AKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSL---NESNNQNKIDASKSKGLMPLLDMI 397 (559) Q Consensus 322 ~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~---~~an~~~~~~~~~~~~l~P~~~~i 397 (559) ..++.++. -.+.++.+..+...+.|+...|+++..+|+...+..++.+..+. .++. ....+..++.+|..++..| T Consensus 332 ~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei~s~~~~l~~t-~~~~~~~~~~al~~li~~i 410 (505) T protein:vir:79 332 VGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQTATEVVTNNSQTYQT-RSSYITQVEKTIKALTYAI 410 (505) T ss_pred CceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccchHHHHHHHHhHHHHH-HHHHHHHHHHHHHHHHHHH Confidence 33444442 14567888889999999999999999999876554333222111 1111 1223344566666666666 Q ss_pred HHHHHhhcccc---------ccCccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHh-CCCCCCCCCEeeccce Q lcl|NC_012530. 398 AKNLTNGIIRQ---------ILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQ-GLPKIAGGDIILSAVY 466 (559) Q Consensus 398 e~~ln~~L~~~---------~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~-gl~pi~gGD~~~~~~~ 466 (559) -.......+.. .....+.|.|+.....|..+.++.....+. |.|++-+++... |+.. + +. T Consensus 411 ~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~v~~Gi~s~e~~l~~~~~~~e-e--ea------ 481 (505) T protein:vir:79 411 LELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQESKRAADLQAVQAQVMPKKQFLMRNYGLDE-E--EA------ 481 (505) T ss_pred HHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhcCCCCh-H--HH------ Confidence 54333221111 012246678888888888887777766665 557888877653 3321 0 00 Q ss_pred ecccccccccccccccccccccccccccCCC Q lcl|NC_012530. 467 IQRLGQQEQIKQNEFQRQQTRLTQLESALQN 497 (559) Q Consensus 467 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (559) ...+ .....+.. ..........++ T Consensus 482 ---~~el---~ri~~E~~-~~~p~~~~~gg~ 505 (505) T protein:vir:79 482 ---DEWL---AQIDAENS-TAEPEFNQFGGD 505 (505) T ss_pred ---HHHH---HHHHHhcc-ccCCCchhccCC Confidence 0000 00000000 000000000100 No 192 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=98.37 E-value=9.8e-07 Score=53.54 Aligned_cols=425 Identities=10% Similarity=0.076 Sum_probs=173.1 Q ss_pred Ccchhhhcccc--------------ccC--Ccc---hHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccc Q lcl|NC_012530. 1 MGIFDRFRTKF--------------YTD--DPN---AFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSI 61 (559) Q Consensus 1 ~~~~~~~~~~~--------------~~~--~~~---~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~ 61 (559) ||+++|.+.=| +.+ +|. ++..+|++-...- .|+..-.. T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y------~g~~~~~~----------------- 57 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQY------EGDYPQVE----------------- 57 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHh------cCCCcccc----------------- Confidence 99999875111 011 111 2222222211110 11111000 Q ss_pred cccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeeccccc--c-cChhHHHHHHHHHHHH Q lcl|NC_012530. 62 VPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGD--K-PTKEQQKKIDYAERYI 138 (559) Q Consensus 62 ~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~--~-~~~~~~~~~~~~~~~L 138 (559) .+ ... +....+ ......+...+++.+|+-|..=+ -.|...+.+ + .........+.+.+.+ T Consensus 58 -~~--~~~--~~~~~~-~~~sl~~~~~i~~~~A~Ll~~e~-----------~~i~v~d~~~~~~~~~~~~~~~e~l~~i~ 120 (517) T protein:vir:98 58 -YI--NSQ--GKIQER-DYMTLNLRKLSADVLSGLVFNEQ-----------CEVYVSDAKDEEKKDNSFKTAHEFIQHVF 120 (517) T ss_pred -cc--ccc--cccccc-ceeecCcHHHHHHHhhhhhcCCc-----------ceEEecccccccccccchhHHHHHHHHHH Confidence 00 000 000000 00111222334444444432111 112222211 1 0111122233344444 Q ss_pred HhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCccccc-------------cc Q lcl|NC_012530. 139 ERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRR-------------TR 205 (559) Q Consensus 139 ~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~-------------~~ 205 (559) .. ..|...++..+.+.+..|.+++-+.+|. |. +.+.++++..+.+.....+.+. .. T Consensus 121 ~~---------n~f~~~~~~~~e~a~a~G~~a~k~~~d~-~~-~~I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~~~~ 189 (517) T protein:vir:98 121 QH---------NKFIKNLSDYLEPTFALGGLTVRPYVDN-GE-IEFSWALANAFYPLRSNSNGISEGVMKSVTTKVIGNK 189 (517) T ss_pred Hh---------ccHHHHHHHHHHHHhhhCCEEEEEEEeC-Ce-eEEEEEcCCeeEEEEecCCCeEEEEEEEEEEEeecCC Confidence 32 2355566677888888999999888874 33 3477788888776433222111 01 Q ss_pred ceEEEEE----------------------ecC------ceee--ee---cccce----------EEEecccCCC--ccCC Q lcl|NC_012530. 206 GKIYRQY----------------------IDN------KVRG--SF---TADEM----------GMFIRNPRSD--ILSG 240 (559) Q Consensus 206 ~~~y~~~----------------------~~~------~~~~--~~---~~~ev----------i~~~~n~~~~--~~~~ 240 (559) ..+|.+. ..+ ..+. .+ .+.++ .|+ .+|..+ ..+. T Consensus 190 ~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v~L~~~~e~l~~~~~~~g~~~Plf~y~-~~p~~N~~~~~s 268 (517) T protein:vir:98 190 TVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEGEIGKRIPLEELYEGMQEKTYIQGLSRPLFNYL-KPSGFNNINPHS 268 (517) T ss_pred ceEEEEEEEEecCceeccCCcEEEEEEEEecCCCccccccccccccccCCCcceeECCCCcceEEEe-cCCcccccccCC Confidence 1112111 000 0000 00 01111 122 222221 2246 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCC----ceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccc Q lcl|NC_012530. 241 GYGLSELEMGLREFISHENTELFNDRFFTHGGTT----KGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPM 316 (559) Q Consensus 241 ~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p----~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~v 316 (559) ++|+|.+.-+...+......-.....-|+-|-.. ..+|....... +.... ..| -+.....+..- T Consensus 269 plG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~~i~vp~~~l~~~~~~~-g~~~~-------~~~----d~~~~~y~~~~ 336 (517) T protein:vir:98 269 PLGLGITDNSVSTLKKINDTYDQFWWEIKMGQRTVFVSDVMLRTVPDES-GMPPP-------QVF----DPDVNVYKSIR 336 (517) T ss_pred CCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCcceecChhhhccccCCC-CcccC-------CCC----Ccccceeeecc Confidence 7899999988888877766655555666765432 11121110000 00000 000 00000000000 Q ss_pred ccCCceeeeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhh--HHHHHHHHHHHHhhHH Q lcl|NC_012530. 317 ITAEDAKFVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESN--NQNKIDASKSKGLMPL 393 (559) Q Consensus 317 l~~g~~~~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an--~~~~~~~~~~~~l~P~ 393 (559) ...++-.++.++. -.+-++.+..+...+.|+...|++|..+|+...+..+..+..+.+... .....+..+..+|.-+ T Consensus 337 ~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~~kTATEi~s~~~~~~~t~~~~~~~~~~aL~~l 416 (517) T protein:vir:98 337 MGTDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRSMKTATEIVSENDLTYRTRNDHVYEVEQFIKGL 416 (517) T ss_pred CCCCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0111122333332 145688999999999999999999999998776554332221111110 0111223344444444 Q ss_pred HHHHHHHHHh-hccccc--cCccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHh-CCCCCCCCCEeeccceec Q lcl|NC_012530. 394 LDMIAKNLTN-GIIRQI--LGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQ-GLPKIAGGDIILSAVYIQ 468 (559) Q Consensus 394 ~~~ie~~ln~-~L~~~~--~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~-gl~pi~gGD~~~~~~~~~ 468 (559) +..|-..... .|+... ....+.++|+.....|..+.++....++. |.|++-+++.++ |+..-+ .+. T Consensus 417 v~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~D~~~~~~~~~~~v~aG~ms~~~~i~~~~g~~eee-A~~-------- 487 (517) T protein:vir:98 417 VISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQDRSALLRFYGQAKTFGFIPTVEAIQRIFKVPKKT-AEQ-------- 487 (517) T ss_pred HHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHHhCCCChHH-HHH-------- Confidence 4443322211 223221 12346788888888898888887776665 568999976554 653210 000 Q ss_pred ccccccccccccccccccccccccccCCCCCCCCCCCCccccccc Q lcl|NC_012530. 469 RLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 469 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (559) -+.. . +++....+ +. ....+..++...+.+ T Consensus 488 e~~~------i--~~E~~~~~-~~------~~~~~~~~~~~gd~e 517 (517) T protein:vir:98 488 WLEE------I--RKDQIELD-PV------TISQRAQKRMFGDEE 517 (517) T ss_pred HHHH------H--HHhccccC-CC------CccccccCCCCCCCC Confidence 0000 0 00000000 00 000000011110000 No 193 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=98.37 E-value=9.8e-07 Score=53.53 Aligned_cols=350 Identities=12% Similarity=0.069 Sum_probs=157.4 Q ss_pred ccccccccccccccccccccccCCCCCcccHHHHHHHHhh--ChH------HHHHHHHHHHHHHhhhhHhhhhcCC---- Q lcl|NC_012530. 43 YTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSM--NVV------LNAIINTRANQVTEYAHRASTDDNG---- 110 (559) Q Consensus 43 ~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~--~~~------v~acv~~ia~~ia~~~~~~~~~~~g---- 110 (559) .+...++.. ........+....+..|+. .++ +..-.+...+.+..+|..++.+... T Consensus 1 ~~~~~i~~L------------~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~ 68 (409) T protein:vir:16 1 MTEKGIGYL------------RFKLSVHKRRAEMRYEQYAMKHVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVF 68 (409) T ss_pred CCHHHHHHH------------HHHHHHHhHHHHHHHHHHhccCchhhcchhhhHHHHHHHhhhcChhHHHHHHhHhhccc Confidence 111111100 0000011111122223332 221 2222222233455566655554322 Q ss_pred cceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCc Q lcl|NC_012530. 111 MGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPT 190 (559) Q Consensus 111 ~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~ 190 (559) .||.. .+. .+..++.. ..+......+..+.|++|.+|+.+..+.+|.| .+.+++|. T Consensus 69 ~Gf~~--------~d~------~l~~i~~~---------N~ld~~~~~~~~~al~yG~sf~~v~~~~dg~~-~i~~~sP~ 124 (409) T protein:vir:16 69 REFEN--------DDF------TVNEIFEE---------NNPDIFFDSTVLSALIASCSFTYISKGENDAV-RLQVIEAT 124 (409) T ss_pred ccccC--------cch------HHHHHHHh---------cChhHHHHHHHHHHHHhCceeEEEecCCCCce-EEEEEccc Confidence 12210 000 12223221 12334566778899999999999999888875 58889999 Q ss_pred eEEEEecCcccccccceEEEEEe-cCcee--eeecccc----------------------eEEEecccCCCccCCccccc Q lcl|NC_012530. 191 TIYFANDEHGHRRTRGKIYRQYI-DNKVR--GSFTADE----------------------MGMFIRNPRSDILSGGYGLS 245 (559) Q Consensus 191 ~V~~~~~~~g~~~~~~~~y~~~~-~~~~~--~~~~~~e----------------------vi~~~~n~~~~~~~~~~G~S 245 (559) .+.++.|...........+.... .+... ..+.+++ +++|..++. ..+.+|.| T Consensus 125 ~~~~i~D~~~~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~---~~~~~G~s 201 (409) T protein:vir:16 125 NATGIIDPITGLLTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGNPLLVPIIHRPD---AVRPFGRS 201 (409) T ss_pred ceEEEeecccccceeeeEEEEecCCCceEEEEEEecCcEEEEEecCccccceecCCCCcceEEeccccc---ccccCCcc Confidence 88887765433322211111110 01100 0112222 334433322 23457877 Q ss_pred H----HHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEE-ecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc-- Q lcl|NC_012530. 246 E----LEMGLREFISHENTELFNDRFFTHGGTTKGILL-VKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT-- 318 (559) Q Consensus 246 p----l~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~-~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~-- 318 (559) . +..+.+++...+.-......||.+ |.-++. ++. +. +..+ .|+... +++..++ T Consensus 202 eI~~~v~~l~da~~r~~~~~~~~~e~~a~---pqr~i~G~d~-----d~--~~~~----~~~~~~------~~i~~~~~d 261 (409) T protein:vir:16 202 RITRSGMYWQSNAKRTLERADVTAEFYSF---PQKYVTGLSD-----DA--EPME----TWKATV------SSMLQFTKD 261 (409) T ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHhcC---hhheeEecCC-----CC--Cccc----hhhhhh------hHhhccCCC Confidence 4 555566666665555556666654 444442 211 11 1111 233221 2233332 Q ss_pred --CCceeeeeccccchhH-HHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHH------HHHHHHHHH Q lcl|NC_012530. 319 --AEDAKFVSMTQAEDMQ-FQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQN------KIDASKSKG 389 (559) Q Consensus 319 --~g~~~~~~ls~~~D~q-f~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~------~~~~~~~~~ 389 (559) +.+.++..+.. .+++ |++..+..+..+|..=++|++.+|....+-+++ ....+-.+. ..+...... T Consensus 262 ~~g~~~~v~q~~~-~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa----~Ai~a~~~~L~~ka~~k~~~fg~~ 336 (409) T protein:vir:16 262 EDGDKPTLGQFTQ-PSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSV----EAIKASHENLRLAGRKAQRSLGAG 336 (409) T ss_pred CCCCCceEEecCC-CChhHHHHHHHHHHHHHhhhcCCCHHHcccccCchhHH----HHHHHHHHHHHHHHHHHHHHHHHH Confidence 12345555542 3454 899999999999999999999999654321110 000000000 000111111 Q ss_pred hhHHHHHHHHHHHhhccccccCccceeeecchh---hhhHHHHHHHHHHHHcC--CC-CHHHHHHHhCCCCCC Q lcl|NC_012530. 390 LMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGD---TRSQQDKLKSVQLELQT--AT-TVNDYREKQGLPKIA 456 (559) Q Consensus 390 l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~---~~d~~~~~~~~~~~~~~--~~-T~NE~R~~~gl~pi~ 456 (559) ++-++..+-......=-.+.+....++.|.... ..+..+.++++.+++.. ++ .-+-+++++|+..-+ T Consensus 337 l~~~~rla~~~~~~~~~~~~~~~~~~v~W~~~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~v~~~~~g~~~~d 409 (409) T protein:vir:16 337 LLNVAYLAACLRDDVPYLREQFSKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIKGAE 409 (409) T ss_pred HHHHHHHHHHHhcCCCccchhhccceEEecCCCCcchhhHHHHHHHHHHHHhhcccccchhHHHHhccCCCCC Confidence 111111111110100000111234566776443 44566777777776654 34 346679999996543 No 194 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=98.36 E-value=1.1e-06 Score=53.36 Aligned_cols=433 Identities=13% Similarity=0.108 Sum_probs=162.0 Q ss_pred Ccchhhhc---cccccCCcc-hHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHH Q lcl|NC_012530. 1 MGIFDRFR---TKFYTDDPN-AFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDV 76 (559) Q Consensus 1 ~~~~~~~~---~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~ 76 (559) .+.+.+=. ..|-+++++ +.+..+..+......... .|-+-+.+--.+. .. ....+.... ..+... T Consensus 5 ~~~~~~~~~~~~~~p~~~~~~~~~~~l~~~l~~~~~~~~--~rl~~l~~YY~G~------~~--~~~~~~~~~-~~~~~~ 73 (501) T protein:vir:25 5 VDVIADAPAADVEFPEDSMSREQLGALVADMWRLHISER--QWLDRIYEYTKGL------RG--RPEVPEGAS-DEVKEL 73 (501) T ss_pred chhhhccCcccccCCcccCChHHHHHHHHHHHHHHHHHH--HHHHHHHHHHhcC------CC--chhccccCC-hhhhhh Confidence 23333222 112233322 222333333333222111 0100000000000 00 000010000 011111 Q ss_pred HHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHH Q lcl|NC_012530. 77 LRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFL 156 (559) Q Consensus 77 ~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~ 156 (559) ..-..+.+...||++.++.+- ..||.+. +.. . ...+..++.. ..+.... T Consensus 74 -~~~~v~n~~~~ivd~~a~~l~-----------~~gf~~~--d~~--~------~~~l~~i~~~---------N~~d~~~ 122 (501) T protein:vir:25 74 -AKLSVKNVLSLVRDSFAQNLS-----------VVGYRNA--LAK--E------NDPAWEMWQR---------NRMDARQ 122 (501) T ss_pred -HhhhhcChHHHHHHHHHhhhc-----------ccceecC--Ccc--c------hHHHHHHHHh---------cChhHHH Confidence 111223466667766655331 1233321 111 0 1122333321 1233455 Q ss_pred HHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEe-cCccc-ccccceEEEEEecCc----eeeeecccc----- Q lcl|NC_012530. 157 RKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFAN-DEHGH-RRTRGKIYRQYIDNK----VRGSFTADE----- 225 (559) Q Consensus 157 ~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~-~~~g~-~~~~~~~y~~~~~~~----~~~~~~~~e----- 225 (559) ..+..+++++|.+|+.+.++..|. .+..++|..|.++. +.... ......+|+...... ....+.+.. T Consensus 123 ~~~~~~a~i~G~ay~~v~~de~~~--~i~~~sp~~~~~iy~D~~~~~~~~~ai~~~~~~~~~~~~~~~~~y~~~~~~~~~ 200 (501) T protein:vir:25 123 AEVHRPALTYGASYVTVTPTDEGP--VFRTRSPRQILAVYADPSVDAWPQYALETWVAQKDAKPHRRGVLYDDTYMYELD 200 (501) T ss_pred HHHHHHHhhcCceEEEEecCCCCC--eEEEeccccEEEEEecCCCCcceeEEEEEEeeccccCcceeEEEecCeeEEEEe Confidence 677889999999999999988874 35567898887654 32211 111122221111100 000011000 Q ss_pred -----------------------------------------eEEEecccCCCccCCcccccHHHHHH---HHHHHHHHHH Q lcl|NC_012530. 226 -----------------------------------------MGMFIRNPRSDILSGGYGLSELEMGL---REFISHENTE 261 (559) Q Consensus 226 -----------------------------------------vi~~~~n~~~~~~~~~~G~Spl~~~~---~~i~~~~~~~ 261 (559) |+++..++ . ...+|.|-++... +++.....-. T Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~--~--~~~~g~sdie~v~~l~Da~~~~~s~~ 276 (501) T protein:vir:25 201 LGEVVLGDAGGGQATQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNGR--D--ADDMIVGEVAPLILLQQAINSVNFDR 276 (501) T ss_pred cCceeeeeccccccccccccccccccccccccccCCccceeeEeccCcc--c--cCccccchhhhhHHHHHHHHHHHHHH Confidence 22222111 1 1235777555444 4444433333 Q ss_pred HHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccccchhH-HHHHHH Q lcl|NC_012530. 262 LFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQAEDMQ-FQSWLN 340 (559) Q Consensus 262 ~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~~D~q-f~e~~~ 340 (559) .....||. .|.-+|. + ... +..+. |+. ..+++.++++++.++..+. ..+++ |++..+ T Consensus 277 ~~~~e~~a---~p~~~i~--G-~~~-----~~~~~----~~~------~~~~i~~~~~~~~~~~q~~-~~~~~~~~~~l~ 334 (501) T protein:vir:25 277 LIVSRFGA---NPQRVIS--G-WTG-----SKAEV----LKA------SALRVWTFEDPEVKAQAFP-PASVEPYNLILE 334 (501) T ss_pred HHHHHhhc---cHHHHHh--C-CCC-----Cccch----hhh------cccceeccCCCCceEEEec-ccChHHHHHHHH Confidence 33334443 3433321 1 111 11111 221 1234556665667766554 23555 889999 Q ss_pred HHHHHHHHHhCCCHHHhccccccccccccccchhhhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceee Q lcl|NC_012530. 341 YLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNN---QNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLE 417 (559) Q Consensus 341 ~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~ 417 (559) ..+..|+..=++|++.+|....+.+ + .......... .+..+..+...|.-+++.+....... .......+.+. T Consensus 335 ~~i~~i~~~s~~P~~~~~~~~~N~S-g-~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~~~~--~~~~~~~i~v~ 410 (501) T protein:vir:25 335 EMLQHVAMVAQISPAQVTGKMINVS-A-EALAAAEANQQRKLAAKRESFGESWEQLLRLAAEMDDDP--DTAADSGAEVL 410 (501) T ss_pred HHHHHHHhhcCCChhhhccccCChH-H-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--ccccceeeeEE Confidence 9999999999999999984332211 0 0000000000 00111111222222222211111100 01122356777 Q ss_pred ecchhhhhHHHHHHHHHHHHcCCCCHHHHHH-HhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCC Q lcl|NC_012530. 418 FVGGDTRSQQDKLKSVQLELQTATTVNDYRE-KQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQ 496 (559) Q Consensus 418 f~~l~~~d~~~~~~~~~~~~~~~~T~NE~R~-~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 496 (559) |......+..+.++++.++...+++.-.+.. +.|+.+-+ ...+..... .+.............. T Consensus 411 w~~~~~~s~~~~ada~~kl~~~gis~et~~~~~~g~~~~~----------ie~~~~~~~-----e~~~~~~~~~~~~~~~ 475 (501) T protein:vir:25 411 WRDTEARSFGAVVDGITKLASAGIPIEHLLSMVPGMTQQT----------IQAIKDSLR-----GGEVKSLVDKLLSNEP 475 (501) T ss_pred ecCCCCCCHHHHHHHHHHHHhcCCCHHHHHHHcCCCCHHH----------HHHHHHHHH-----HHhHHHHHHHhhccCc Confidence 8888888999999988777665676544443 34664311 000100000 0000001111100000 Q ss_pred CCCCCCCCCCccccccchhccccccccccccccc Q lcl|NC_012530. 497 NPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGK 530 (559) Q Consensus 497 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 530 (559) .+..+.+.. .+.+....+......|+ T Consensus 476 ~~~~~~~~~--------~~~~~~~~~~~~~~~g~ 501 (501) T protein:vir:25 476 APVPPPPPQ--------AAAQALNEGGVNGNGGA 501 (501) T ss_pred CCCCCCCCC--------CCccccccccCCCCCCC Confidence 000000000 00000000000001111 No 195 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=98.36 E-value=1.1e-06 Score=53.26 Aligned_cols=414 Identities=9% Similarity=0.069 Sum_probs=169.3 Q ss_pred Ccchhhhccc------------cc-cCCcc---hHHHHHHHHHHHHHHhhhhcccccccccccccccccccccccccccc Q lcl|NC_012530. 1 MGIFDRFRTK------------FY-TDDPN---AFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPK 64 (559) Q Consensus 1 ~~~~~~~~~~------------~~-~~~~~---~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~ 64 (559) =||..++|.- ++ ..+|. +++.++++-..- -.|+......+ .. . .. T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~------Y~g~~~~~~~~-----~~----~----~~ 63 (499) T protein:vir:80 3 NQIIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRL------YQGNYAEWHNL-----NY----E----HN 63 (499) T ss_pred hHHHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHH------hcCCcchhhcc-----cc----c----cC Confidence 1222222210 11 11111 122222221110 01111110000 00 0 00 Q ss_pred CCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCC Q lcl|NC_012530. 65 PSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVD 144 (559) Q Consensus 65 p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~ 144 (559) +.+... ..........+|+..|+-+..=| -.+... ..+..+.+..++.. T Consensus 64 ~~~~~~--------~~~s~n~~~~iv~~~a~~l~~ep-----------~~i~~~--------d~~~~e~l~~~~~~---- 112 (499) T protein:vir:80 64 GNPVNR--------RQLSMNLPKVTAKYMSKLLFNEK-----------VKINID--------DETAEEFVLNVLKT---- 112 (499) T ss_pred CCcccc--------ceeecchHHHHHHHHHHhhhCCc-----------ceEeeC--------CHHHHHHHHHHHhh---- Confidence 000000 00111233345555554433211 112111 12233334444432 Q ss_pred CCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccc---------cceEEEEE--- Q lcl|NC_012530. 145 YSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRT---------RGKIYRQY--- 212 (559) Q Consensus 145 ~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~---------~~~~y~~~--- 212 (559) ..|...+..++...+.+|.+|+.+..|.+|++. +..++|..+.++..+.|.... .+..|... T Consensus 113 -----n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~-i~~v~a~~~~Pi~~d~~~~~~~~f~~~~~~~~~~y~~lE~h 186 (499) T protein:vir:80 113 -----NGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVK-VSFATADCMYPLSNDSENVDECLIANSFHKNNKYYKLLEWN 186 (499) T ss_pred -----ccHHHHHHHHHHHHhhcCcEEEEEEECCCCcEE-EEEEcCCceEEEEecCCCeEEEEEEEEEeecCeEEEEEEEE Confidence 235566777788889999999999999888764 788899998876554443210 01111100 Q ss_pred -----------ec------------Cceee------eecc-------cc--eEEEecccCC--CccCCcccccHHHHHHH Q lcl|NC_012530. 213 -----------ID------------NKVRG------SFTA-------DE--MGMFIRNPRS--DILSGGYGLSELEMGLR 252 (559) Q Consensus 213 -----------~~------------~~~~~------~~~~-------~e--vi~~~~n~~~--~~~~~~~G~Spl~~~~~ 252 (559) +. |..+. .+.+ .. ++|++ +|.. -..+.++|+|.++-+.. T Consensus 187 ~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~p~f~~~~-~~~~N~~~~~splG~S~~~~~~~ 265 (499) T protein:vir:80 187 EWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVVPLPSLTRPTFIYIK-PNIANNKNLTSPLGISVYANALD 265 (499) T ss_pred EecccceeeEEEEEEEEeccCccccCcccchhhhccCcCCceeecCCCccceEeec-CCccccccCCCccCCchHhhHHH Confidence 00 00000 0000 00 22332 2211 12345679999998888 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCceEE-----EecCccCCccCCHHHHHHHHHHHHHHhcCcccccc-cccccC-Cceeee Q lcl|NC_012530. 253 EFISHENTELFNDRFFTHGGTTKGIL-----LVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYR-IPMITA-EDAKFV 325 (559) Q Consensus 253 ~i~~~~~~~~~~~~~f~ng~~p~gil-----~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~-~~vl~~-g~~~~~ 325 (559) .|......-.-..+-|..|. ...++ ...... .+.... . +.......+ +..... ++-.++ T Consensus 266 lid~lD~~~s~~~~e~~~~~-~~i~v~~~~l~~~~~~-~g~~~~--------~----~~~~~~~~~~~~~~~~~~~~~i~ 331 (499) T protein:vir:80 266 TLKTLDLMFDSYYQEFKLGK-KKVLVPSSFVKTAVNL-DGSTTQ--------Y----FDSTDEAFFLYQGEQDDNGKAIK 331 (499) T ss_pred HHHHHHHHHHHHHHHHHhcc-cceecchhhhhccCCC-CCCccc--------C----CCcccceeeEeeccCCCCcCcee Confidence 88877766665556676653 22222 111000 000000 0 000000000 111111 111233 Q ss_pred eccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhh--HHHHHHHHHHHHhhHHHHHHHHHHH Q lcl|NC_012530. 326 SMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESN--NQNKIDASKSKGLMPLLDMIAKNLT 402 (559) Q Consensus 326 ~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an--~~~~~~~~~~~~l~P~~~~ie~~ln 402 (559) .++. -.+-++.+..+...++|....|++|..+|+...+..++.+..+..... ........++.+|..++..|-...+ T Consensus 332 ~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~ 411 (499) T protein:vir:80 332 DISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGK 411 (499) T ss_pred EecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3331 133457778888889999999999999997654433222211100000 0111223344455555544433322 Q ss_pred hhcccc---ccCccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHh-CCCCCCCCCEeeccceecccccccccc Q lcl|NC_012530. 403 NGIIRQ---ILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQ-GLPKIAGGDIILSAVYIQRLGQQEQIK 477 (559) Q Consensus 403 ~~L~~~---~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~-gl~pi~gGD~~~~~~~~~~l~~~~~~~ 477 (559) ...+.. .....+.|.|+.....|..+.++.....+. |.|+.-.++... |.+- +..+ ..+.. T Consensus 412 ~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d-~ea~-----------~el~~-- 477 (499) T protein:vir:80 412 LIKAYDGDTVELDTITVDFDDSIAQDEDTTINRYTTAKNQGMIPLKIALQRAWNITE-AEAD-----------EWAEM-- 477 (499) T ss_pred HhccccCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHhhcCCCCh-HHHH-----------HHHHH-- Confidence 111111 123457888888888888888877776664 568988876543 4321 0000 00000 Q ss_pred cccccccccccccccccCCCCCCCCCCCCccccccc Q lcl|NC_012530. 478 QNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (559) .. +++ ....+ +++.....++.+ T Consensus 478 -i~--~E~-~~~~~----------~~d~~g~~ge~e 499 (499) T protein:vir:80 478 -LA--KEK-QAEIP----------NNDMTGIFGEEE 499 (499) T ss_pred -HH--HHh-hcCCC----------CCCccccCCCCC Confidence 00 000 00000 000000001100 No 196 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=98.35 E-value=1.1e-06 Score=53.23 Aligned_cols=449 Identities=12% Similarity=0.116 Sum_probs=181.8 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhh---hccccccccccccccc----cccccccccccccCCCCCcccH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKA---LNGVDRAYTEPVDGNL----MFSTLEDTSIVPKPSPIAFGRI 73 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~gr~~a~~~~~~~~~----~~~~~~~~~~~~~p~~~~~~~~ 73 (559) ||.|+=| +|... .++ ....+..... .....-..+...+... .++..+...+..-+...+-..+ T Consensus 1 ~~~~~lf--~f~~~-~d~-------~~~~~~~~~~~~s~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~d~~~~~~~~~~L 70 (516) T protein:vir:10 1 MKFLDLF--KFWDR-VDQ-------NEYDERLKQGHESIATPKKDDGATEIEAREGESSYNALMQQFFGIDNNISGTKDL 70 (516) T ss_pred CCchHhc--ccccc-hhh-------HHHHhhhcCCCCcccCCCCccCceeeecCcccccccceeeeeecccCccccHHHH Confidence 9998877 33211 011 0011111111 0111111111111111 1111111111111111222233 Q ss_pred HHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHH Q lcl|NC_012530. 74 TDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFT 153 (559) Q Consensus 74 ~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~ 153 (559) ...-+..+.+|.|..+|.-|.+.+. ..+.+....++.+.+.. .....+.++..--..+.+... ... T Consensus 71 I~~YR~ma~~pEvd~Av~eIvneai------v~d~~~~pV~l~l~~~e-~s~sik~kI~eeF~~Il~ll~-F~~------ 136 (516) T protein:vir:10 71 INTYRQLTNNPEVERAVANIVNEAV------VYEKGHKVVSLDLDDTE-FSSSIKDKILEEFDEICRLLD-ASR------ 136 (516) T ss_pred HHHHHHhhhccchhHHHHHhhccee------EecCCCceEEEEecccc-cchHHHHHHHHHHHHHHHHhc-cch------ Confidence 3333445778999999998888753 34455555556554432 344444444333333333211 111 Q ss_pred HHHHHHHHHHHHcCCcceEEEEC-CCCcEEEEEEecCceEEEEec-----Ccccccccce-EEEEEecCce--------- Q lcl|NC_012530. 154 SFLRKLVRDTYTYDQVNYENTYD-SNGRLSHTRMVDPTTIYFAND-----EHGHRRTRGK-IYRQYIDNKV--------- 217 (559) Q Consensus 154 ~f~~~~v~d~ll~Gna~~~i~rd-~~G~~~~L~~l~p~~V~~~~~-----~~g~~~~~~~-~y~~~~~~~~--------- 217 (559) --..+++.+++.|..|..++-| +..-+.+|..|||.+|+.++- ..|..-.++. .|+.+..+.. T Consensus 137 -~~~~~fR~WYVDgRi~fhKiid~~k~GI~elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~ 215 (516) T protein:vir:10 137 -KLDTLFRRWYIDSRIFFHKIMPNPKEGIVELRRLDPRHVEYYREIVTSDVGGTSVVKGYREFFVYTTGNEGYAYNGRLF 215 (516) T ss_pred -hhhHHHHhhhhcceEEEEEEecCcccceeeeeeeCCcceeeEEeeecccCcchhhhhceeeeeeeecCccceecccccc Confidence 1223345567889999986665 344599999999999876442 2221111111 1111111111 Q ss_pred ----eeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHH Q lcl|NC_012530. 218 ----RGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRA 293 (559) Q Consensus 218 ----~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~ 293 (559) ...+ +.+.|++.+.-.-+...+.+ +|-|..|...+......+...-=|==.-|.-+-|.-++-+..|..-.++- T Consensus 216 ~~~~~ikI-~~daI~y~hSGl~d~~~~~i-~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqY 293 (516) T protein:vir:10 216 EPNTRIKI-PRSAIVYAHSGLQDCSDRGI-VGYLHNAVKPANQLKLLEDALVIYRITRAPERRVFYIDVGNMPNRKATEY 293 (516) T ss_pred CCCCceec-chhheeeeecCcccCCCCce-eceehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHH Confidence 1112 34445544322222222222 46677777777766666665544433334344555555443332211111 Q ss_pred HHHHHHHHHHHh-----cC-cccccc-ccccc--------CC-ceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHHh Q lcl|NC_012530. 294 LEDFKRHWTATS-----SG-INGAYR-IPMIT--------AE-DAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAEI 357 (559) Q Consensus 294 ~~~l~~~~~~~~-----~G-~~nag~-~~vl~--------~g-~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~l 357 (559) +..+-..+++.+ .| ..+..+ ..+++ +| |.++..|.-...+--++-..|..+.+.++++||.+.| T Consensus 294 l~~iM~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~SRl 373 (516) T protein:vir:10 294 VNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVTSLPGAQTMGEMDDVRWFNKKLYEALRIPLSRM 373 (516) T ss_pred HHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccc Confidence 122222222211 01 011111 11221 11 2334433322334446677788889999999999999 Q ss_pred ccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHH----HHHHHhhc-----cccccC----ccceeeecchhhh Q lcl|NC_012530. 358 GMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMI----AKNLTNGI-----IRQILG----DNYMLEFVGGDTR 424 (559) Q Consensus 358 g~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~i----e~~ln~~L-----~~~~~~----~~~~~~f~~l~~~ 424 (559) +...........++ +-+-++.. ....|.-+..++ -+.|.+.| +++.++ ..+.|+|.....- T Consensus 374 ~~e~~~~~~~Gr~~---EItRDEiK---F~KFI~rLR~rFs~lF~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f 447 (516) T protein:vir:10 374 PRDDGGMVIGGQDM---AITRDELD---FRKFIVQLQHNFEEIFLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYY 447 (516) T ss_pred cCCCCceeeccccc---hhhHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchH Confidence 75544332222222 22222211 122233344433 33444433 233332 3466776544322 Q ss_pred h-------HHHHHHHHHHHH---cCCCCHHHHHH-HhCCCCCCCCCEeeccceecccccccccccccccccccccccccc Q lcl|NC_012530. 425 S-------QQDKLKSVQLEL---QTATTVNDYRE-KQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLES 493 (559) Q Consensus 425 d-------~~~~~~~~~~~~---~~~~T~NE~R~-~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 493 (559) . ...|+.++..+- ...++.+=||+ .|.+.-.+ +.. +..+...+..... T Consensus 448 ~ElKe~Eil~~Rl~~l~~~dpyvGky~s~~yi~k~ILr~tDee-------------i~~--~~k~I~~E~~~~~------ 506 (516) T protein:vir:10 448 TELKDIETLRQRVDALSQIEPYVGKYVSHDYVMKNILQMTDEQ-------------IAQ--EEKQIEKEANVKR------ 506 (516) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhH-------------HHH--HHHHHHHhhhCCC------ Confidence 2 334444443332 22356665654 44443211 000 0000000000000 Q ss_pred cCCCCCCCCCCCCccccccc Q lcl|NC_012530. 494 ALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 494 ~~~~~~~~~~~~~~~~~~~~ 513 (559) . . +|.+.+.+ T Consensus 507 -~---~------~p~~e~~f 516 (516) T protein:vir:10 507 -F---Q------NPENEDDF 516 (516) T ss_pred -C---C------CCCccccC Confidence 0 0 01111111 No 197 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=98.35 E-value=1.1e-06 Score=53.17 Aligned_cols=448 Identities=10% Similarity=0.094 Sum_probs=182.5 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhh--hccccccccccccc--ccccc----ccccccccccCC---CCC Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKA--LNGVDRAYTEPVDG--NLMFS----TLEDTSIVPKPS---PIA 69 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~gr~~a~~~~~~~--~~~~~----~~~~~~~~~~p~---~~~ 69 (559) ||.|.=|- .+.++.--....+.+.. ...+-.+...|... ..... .....++....- ... T Consensus 1 ~~~~~~~~----------~l~~~~~~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~y~~~e~~ 70 (524) T protein:vir:98 1 MNFLGFGN----------VLSFFKNFAREDEIELEQQLKNDTGSVAPPKNNDGAYEIETDLNNQKYAGVFQQFYSGQDPA 70 (524) T ss_pred CCCcchhh----------HHHHhhhhhhhhhhhHhhhhcCCcccccCCCCCCCceeecCCCCcceecceeeeeccccccc Confidence 66655441 11221111111111111 11111111111110 00000 000111110000 001 Q ss_pred cccHHHHH---HHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCC Q lcl|NC_012530. 70 FGRITDVL---RQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYS 146 (559) Q Consensus 70 ~~~~~~~~---~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~ 146 (559) ..+..++. +..+.+|.|..+|.-|.+.+. ..+.+....++.+.+.. .....+.++..--..+.+... .. T Consensus 71 ~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaI------v~~~~~~pV~l~L~~~~-~s~~iK~kI~eeF~~Il~ll~-F~ 142 (524) T protein:vir:98 71 IQNKEQLINTYRGIMSYPEVENAVSEIIDDAI------VNEQGKDIITMDLAKTN-FSKAIQDKIVEEFDNVLNIYD-FD 142 (524) T ss_pred cchHHHHHHHHHHHhhccchhhHHHhhhccee------EecCCCceEEEEecccc-cchHHHHHHHHHHHHHHHHhc-cc Confidence 11233443 445678999999998888753 34455555666664433 344444444433333333311 11 Q ss_pred CChhhHHHHHHHHHHHHHHcCCcceEEEECCCCc--EEEEEEecCceEEEEec------Ccc-cccccceEEEEEecC-- Q lcl|NC_012530. 147 PIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGR--LSHTRMVDPTTIYFAND------EHG-HRRTRGKIYRQYIDN-- 215 (559) Q Consensus 147 ~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~--~~~L~~l~p~~V~~~~~------~~g-~~~~~~~~y~~~~~~-- 215 (559) . --..+++.+++.|..|+.++-|.+.. +.+|..|||.+|+.++. +.| .+.....-|+.+..+ T Consensus 143 ~-------~~~~~fR~WYVDgRi~fhkiid~~~~kGI~ELr~lDPr~i~~vr~~~~~~~~~~~~v~~~~~e~f~Y~~~~~ 215 (524) T protein:vir:98 143 N-------MGARLFRDWYVDSRIYFHKIMHKDESKGIRELRQLDPRCMELIRESITETLDGGVKVFRGYREFFVYSAPKA 215 (524) T ss_pred h-------hhhHHHhhhhhcceeEEEEEEcCCCCcceeeeeeeCCccceeeeeccccccccchhhccceeeeeeeccCCC Confidence 1 12234556678999999999765443 99999999999976541 112 111111122222210 Q ss_pred -----------ceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCcc Q lcl|NC_012530. 216 -----------KVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSP 284 (559) Q Consensus 216 -----------~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~ 284 (559) +....++.+.|+|...... +...+. +|-|..|...+....-.+...-=|==.-|.-+-|.-++-+. T Consensus 216 ~~~~~g~~~~~~~~ikI~~dAIvy~hSGL~-d~~~~i--isyLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGn 292 (524) T protein:vir:98 216 GYTYNGQIYQANQKIKIPRSAIVYAHSGLE-DCSNNI--IGYLHRAVKPANQLRLLEDAMVIYRITRAPERRVFYIDVGQ 292 (524) T ss_pred ccccccceecCCCceeechhheeeeccCcc-cCCCCe--eeehhHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCC Confidence 0112355666777644332 222222 35577777777766666665544433334444555555443 Q ss_pred CCccCCHHHHHHHHHHHHHHh-----cC-ccc-ccccccccC--------C-ceeeeeccccchhHHHHHHHHHHHHHHH Q lcl|NC_012530. 285 SVTNTSMRALEDFKRHWTATS-----SG-ING-AYRIPMITA--------E-DAKFVSMTQAEDMQFQSWLNYLINIICA 348 (559) Q Consensus 285 ~~~~~~~e~~~~l~~~~~~~~-----~G-~~n-ag~~~vl~~--------g-~~~~~~ls~~~D~qf~e~~~~~~~~Ia~ 348 (559) .|..-.++-+..+-..+++.. .| ..+ ..-..+++. | |.++..|.-...+--++-..|..+.+.+ T Consensus 293 lPk~KAeqYl~~im~k~kNklvYDa~TGevrddrk~msMlEDyWLpRReGgrgTEItTLpggqnlgem~DV~YF~kkLy~ 372 (524) T protein:vir:98 293 MGGNKATQYVNNIAQGLKNRVVYDARTGTVKNQQNNLSMTEDYWLMRRDGKAITEVSTLPGGQNFSDMDDIKWFNRKLYE 372 (524) T ss_pred CCchhHHHHHHHHHHhcCceeEeeccCceeeccccccchhhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHH Confidence 332211111122222222110 11 011 111122211 1 2344433322334446677788889999 Q ss_pred HhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHh----hcc-----ccccC----ccce Q lcl|NC_012530. 349 LVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTN----GII-----RQILG----DNYM 415 (559) Q Consensus 349 ~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~----~L~-----~~~~~----~~~~ 415 (559) +++||.+.|+..+ ++++-+.++..++ ++. -....|.-+..++...|.. .|+ ++.++ ..+. T Consensus 373 aLnVP~sRl~~~~-~~f~~Gr~~EItR---DEi---KF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~ 445 (524) T protein:vir:98 373 ALRVPLSRMPRDD-GGMQIGGGGEITR---DEL---KFSKFIRTLQIQFSPVLSDPLKTNLIAKKIITEDEWEENVSKIS 445 (524) T ss_pred HhCCCceeccCCC-CccccccccchhH---HHH---HHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcce Confidence 9999999997643 2333222222222 111 1122344444444444433 322 22222 3467 Q ss_pred eeecchhhhhH-------HHHHHHHHHHH---cCCCCHHHHHH-HhCCCCCCCCCEeeccceeccccccccccccccccc Q lcl|NC_012530. 416 LEFVGGDTRSQ-------QDKLKSVQLEL---QTATTVNDYRE-KQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQ 484 (559) Q Consensus 416 ~~f~~l~~~d~-------~~~~~~~~~~~---~~~~T~NE~R~-~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~ 484 (559) |+|.....-.+ ..|+.++..+- ...++.+=||+ .|.+.-.+ +. .+..+...+.. T Consensus 446 ~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~ILr~tDee-------------i~--~~~k~I~~E~k 510 (524) T protein:vir:98 446 FVFQQDSYYAEVKDIEILERRLNLMSQVEGVVGKYVSHKYIMKEILRMSDED-------------ID--EQAKLIEEESK 510 (524) T ss_pred EEEeecchHHHHHHHHHHHHHHHHHHHhccccccccchHHHHHHHhccCHHH-------------HH--HHHHHHHHHHh Confidence 77765433333 33444333321 11345555543 33332110 00 00001000000 Q ss_pred ccccccccccCCCCCCCCCCCCccccccc Q lcl|NC_012530. 485 QTRLTQLESALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (559) ... ..+++. +.+.+ T Consensus 511 ~~~-------~~~p~~--------e~~~f 524 (524) T protein:vir:98 511 EER-------FKNPEA--------EEENF 524 (524) T ss_pred CCC-------CcCCcc--------ccccC Confidence 000 000010 11111 No 198 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=98.34 E-value=1.2e-06 Score=52.97 Aligned_cols=414 Identities=10% Similarity=0.008 Sum_probs=176.6 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhh-----hccccccccccccccccccccccccccccCCCCCcccHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKA-----LNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITD 75 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~ 75 (559) |==.+-+|.+=.+++...-+.+|.+.+..+...-. -.|++.- ...+... . .+ T Consensus 1 ~~~~~~~~~~gl~~~~~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~-------------------~~~~~~~-p---~~ 57 (474) T protein:vir:81 1 MIQQQTVRIPSLSNDENALINGLLAQIENLRWKNLLRTSYYENKRTI-------------------QYVGTLI-P---PQ 57 (474) T ss_pred CcCCCcCcCCCCChhHHHHHHHHHHHHHHHhhHHHHHHHHhccCCCh-------------------hhccccc-c---HH Confidence 43344444444455545555556555444332211 1122110 0001000 1 11 Q ss_pred HHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHH Q lcl|NC_012530. 76 VLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSF 155 (559) Q Consensus 76 ~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f 155 (559) ++.......+...||+.+++.+. -.||.+- +... .+ ..+..+... ..+... T Consensus 58 ~r~~~~v~nw~~~~Vd~~a~rl~-----------~~Gf~~~--d~~~-~~------~~l~~iw~~---------N~ld~~ 108 (474) T protein:vir:81 58 YFNLGLVLGWTGKAVDALARRCN-----------LEGFVWP--DGDL-DS------LGGTEVVDD---------NHLLSE 108 (474) T ss_pred HHHHHhhcChHHHHHHHHHhhhc-----------ccceECC--CCCc-cc------hHHHHHHHh---------cChhHH Confidence 22222346777788888887553 1234431 1110 00 112233322 122345 Q ss_pred HHHHHHHHHHcCCcceEEEECCCCcE-EEEEEecCceEEEEecCcccccccceEEEEE-ecCcee--eeecccc------ Q lcl|NC_012530. 156 LRKLVRDTYTYDQVNYENTYDSNGRL-SHTRMVDPTTIYFANDEHGHRRTRGKIYRQY-IDNKVR--GSFTADE------ 225 (559) Q Consensus 156 ~~~~v~d~ll~Gna~~~i~rd~~G~~-~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~-~~~~~~--~~~~~~e------ 225 (559) ...+..+.|+||.+|+.|.++.+|.+ ..+.+++|.++.++.|...........++.. .++... ..|.++. T Consensus 109 ~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp~~~~~~~D~~~~~~~~al~~~~~~~~g~~~~~~ly~~~~~~~~~~ 188 (474) T protein:vir:81 109 IDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDASEATGEWNRRRRGLNNLLSIIDKDKEGKVLSLALYLDNETVTAQR 188 (474) T ss_pred HHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceEEEEEeCCCCcceeeeEEEEEcCCCcEEEEEEEeCCcEEEEEE Confidence 66778899999999999998777764 4578899999887766543322111111110 011100 0111111 Q ss_pred -------------------eEEEecccCCCccCCcccccHH----HHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecC Q lcl|NC_012530. 226 -------------------MGMFIRNPRSDILSGGYGLSEL----EMGLREFISHENTELFNDRFFTHGGTTKGILLVKP 282 (559) Q Consensus 226 -------------------vi~~~~n~~~~~~~~~~G~Spl----~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~ 282 (559) |+++..++.. .+++|.|.| ..+.+++...+.-......||.. |.-++.- T Consensus 189 ~~~~~~w~~~~~~~~~gvPvV~~~n~~~~---~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~---pqr~i~G-- 260 (474) T protein:vir:81 189 DKATLKWQVDRDEHVYGVPAQVLPYKPAP---KRPFGQSRITKPMMGLQDAGVRELARREGHMDVFSY---PEFWLLG-- 260 (474) T ss_pred cCccceeeeccCCCCCCcceEEecccccc---cCcCCccccchhHHHHHHHHHHHHHHHHHHHHHhcc---hhheeec-- Confidence 3444443322 244677744 44555555444444445555554 3333311 Q ss_pred ccCCccCC---HHHHHHHHHHHHHHh--cCcccccccccccCCceeeeeccccchhH-HHHHHHHHHHHHHHHhCCCHHH Q lcl|NC_012530. 283 SPSVTNTS---MRALEDFKRHWTATS--SGINGAYRIPMITAEDAKFVSMTQAEDMQ-FQSWLNYLINIICALVAMDPAE 356 (559) Q Consensus 283 ~~~~~~~~---~e~~~~l~~~~~~~~--~G~~nag~~~vl~~g~~~~~~ls~~~D~q-f~e~~~~~~~~Ia~~fgVPp~~ 356 (559) ..+.+.. .+....++..+.... .+.. .+.+|- ..+.++-.+.. .+++ |++..+..+..||..=++|++. T Consensus 261 -~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~-d~~~~~--~~~~~~~q~~~-a~l~~~~~~l~~~~~~~a~~t~iP~~~ 335 (474) T protein:vir:81 261 -ADESALKNADGTIKSVWEARLGRIKGLPDDA-DADIPQ--LARADVKQFPA-ASPDAHWSDINGLAKLFAREASLPDTA 335 (474) T ss_pred -CChhhcccccccccchhhhhHHHHhcCCCcc-cccccc--cccccccccCC-CChhHHHHHHHHHHHHHHhhhCCCHHH Confidence 1111111 111122332222211 1111 111111 11234444432 3454 8999999999999999999999 Q ss_pred hcccc-ccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhc--ccc-------ccCccceeeecchhhhhH Q lcl|NC_012530. 357 IGMQN-RGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGI--IRQ-------ILGDNYMLEFVGGDTRSQ 426 (559) Q Consensus 357 lg~~~-~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L--~~~-------~~~~~~~~~f~~l~~~d~ 426 (559) ||+.. .+..++ ... ...++....-....-+-+-..+++.+-..+ ... .+...+++.|......+. T Consensus 336 lG~~~~~np~Sa----eAi-~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~~~v~W~d~~~~s~ 410 (474) T protein:vir:81 336 VAISGLSNPTSA----ESY-DASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKVAIDEIPDEWKSIDAKWRDPRYLSK 410 (474) T ss_pred hcccccccccHH----HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccchhhccceeEecCCCccCH Confidence 99753 211110 000 011111111111111111112222221111 011 112356677777778888 Q ss_pred HHHHHHHHHHHcCC--CC-HHHHHHHhCCCCCCCCCEeeccceeccccccccccccccccccccccccc---ccCCCCC Q lcl|NC_012530. 427 QDKLKSVQLELQTA--TT-VNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLE---SALQNPS 499 (559) Q Consensus 427 ~~~~~~~~~~~~~~--~T-~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~ 499 (559) .++++++.++...+ +- ..=+++++|+.|-+ +..+..... .+.......... ...++.+ T Consensus 411 a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~t~~~----------i~~~~~~~~-----~~~~~~~~~~l~~~~~~~~~aq 474 (474) T protein:vir:81 411 SAQADAGMKQLAAVPWLAETEVGLELIGLTPQQ----------ARRAMADKR-----RVQGRGTLQALIDRSNNGATAQ 474 (474) T ss_pred HHHHHHHHHHHhcccCCCcHHHHHhhcCCCHHH----------HHHHHHHHH-----HHhHHHHHHHHHhcCCCCCCCC Confidence 88888887776533 33 34467788886531 000000000 000000000000 0000000 No 199 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=98.33 E-value=1.3e-06 Score=52.85 Aligned_cols=442 Identities=8% Similarity=0.069 Sum_probs=188.2 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhh---cccccccccccccccccccccccc----c-cccCCCCCccc Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKAL---NGVDRAYTEPVDGNLMFSTLEDTS----I-VPKPSPIAFGR 72 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~gr~~a~~~~~~~~~~~~~~~~~~----~-~~~p~~~~~~~ 72 (559) |+...|. ++....+...... ....-..+...+......+...+. + ...+...+. . T Consensus 1 ~~~w~~~----------------de~~~~~~~~~~~~S~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~-e 63 (511) T protein:vir:56 1 MKFWTKE----------------EEQDIQKIEKNPVRSFSAPDNVDGAKEIHTNLLAPQLGHAIIPSDAQSEGTIPVK-E 63 (511) T ss_pred CCCccch----------------hhhhhhhhccCCcccccCCCCCCCceEEecccccceecceeccccccccCccchH-H Confidence 2222221 1111111111111 111111111111111111111111 1 111111111 2 Q ss_pred HHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhH Q lcl|NC_012530. 73 ITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDF 152 (559) Q Consensus 73 ~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~ 152 (559) +...-+..+.+|.|..+|.-|.+.+. ..+.+....++.+.+.. .....+.++..--..+.+... ... T Consensus 64 LI~~YR~ma~~pEvd~Av~eIvne~i------v~d~~~~pV~l~ld~~~-~s~~iK~kI~eeF~~Il~ll~-F~~----- 130 (511) T protein:vir:56 64 LIKSYRALAEYHEVDDAIQEIVDEAI------VYENDKEVVWLNLDNTD-FSENIKAKINEEFDRVVSLLQ-MRK----- 130 (511) T ss_pred HHHHHHHHhhccchhhHHHHhhccee------EecCCCceEEEEecccC-cchHHHHHHHHHHHHHHHHhc-cch----- Confidence 33333455678999999998888753 34455556666664433 444444444433333333311 111 Q ss_pred HHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEec-----Cccc-ccccceEEEEEecC----------- Q lcl|NC_012530. 153 TSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFAND-----EHGH-RRTRGKIYRQYIDN----------- 215 (559) Q Consensus 153 ~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~-----~~g~-~~~~~~~y~~~~~~----------- 215 (559) --..+++.+++.|..|..++-|...-+.+|..|||.+|+.++. .+|. +.....-|+.+... T Consensus 131 --~~~~~fR~WYVDgRi~fHkiid~k~GI~eLr~lDPr~i~~vr~i~~~~~~~~~v~~~~~ey~~Y~~~~~~~~~~~~~~ 208 (511) T protein:vir:56 131 --HGYKWFRKWYVDSRIYFHKILDKDNNIIELRPLNPMKMELVREIQKETIDGVEVVKGTLEYYVYKQSDYKMPSWMSAT 208 (511) T ss_pred --hhhHHHhhhhhcceEEEEEEeccccceeehhhcCcccchhhhhhhcccccccccccceeeeeEecCCCcccCcccccc Confidence 1223455667889999999888766799999999999876442 1121 11111122222211 Q ss_pred ---ceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHH Q lcl|NC_012530. 216 ---KVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMR 292 (559) Q Consensus 216 ---~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e 292 (559) .....++.+.|.|..........+.++.+|-|..|...+......+...-=|==.-|.-+-|.-++-+..|..-.++ T Consensus 209 ~~~~~~vkI~~daI~y~hSGL~d~~~~~g~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeq 288 (511) T protein:vir:56 209 NRAQTSFRIPKDAIVFAHSGLMRGCADDPYIIGYLDRAIKPANQLKMLEDALVIYRLARAPERRVFYVDVGNLPTQKAQQ 288 (511) T ss_pred cccccceeechhheeeecccceeccCCCCeeeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHH Confidence 11234667777666544333345666788889999888887777776655443333444455555544333221111 Q ss_pred HHHHHHHHHHHHh------cCcccccc-ccccc--------CC-ceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHH Q lcl|NC_012530. 293 ALEDFKRHWTATS------SGINGAYR-IPMIT--------AE-DAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAE 356 (559) Q Consensus 293 ~~~~l~~~~~~~~------~G~~nag~-~~vl~--------~g-~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~ 356 (559) -+..+-..+++.. +...+..+ ..+++ +| +.++..|.-...+--++-..|..+.+.++++||.+. T Consensus 289 Yl~~iM~k~kNklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kKLy~aLnVP~SR 368 (511) T protein:vir:56 289 YVNGIMQNVKNRVVYDTQTGQVKNTTNAMSMLEDYYLPRREGSKGTEVSTLPGGQSLGDIEDVLYFNRKLYKAMRIPTSR 368 (511) T ss_pred HHHHHHHhcCceEEEeccCceeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCccc Confidence 1122222221111 01111111 11221 11 233443332233445667778888999999999999 Q ss_pred hccccc-cccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhc----c-----ccccC----ccceeeecchh Q lcl|NC_012530. 357 IGMQNR-GGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGI----I-----RQILG----DNYMLEFVGGD 422 (559) Q Consensus 357 lg~~~~-~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L----~-----~~~~~----~~~~~~f~~l~ 422 (559) |+-.+. ++++.+.++..++ ++. -....|.-+..++...|...| + ++.++ ..+.|+|.... T Consensus 369 l~~e~q~~~f~~Gr~~EItR---DEi---KF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn 442 (511) T protein:vir:56 369 AASEDQTGGINFGQGAEITR---DEL---KFTKFVKRLQTKFETVITDPLKHQLIVNNIITEEEWDANHEKLYVVFNQDS 442 (511) T ss_pred ccCCCCccccccccchhhhH---HHH---HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecc Confidence 985543 2333222222221 111 112234445555544444333 2 22222 34677776544 Q ss_pred hhhHH-------HHHHHHHHHH--cC-CCCHHHHHH-HhCCCCCCCCCEeeccceecccccccccccccccccccccccc Q lcl|NC_012530. 423 TRSQQ-------DKLKSVQLEL--QT-ATTVNDYRE-KQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQL 491 (559) Q Consensus 423 ~~d~~-------~~~~~~~~~~--~~-~~T~NE~R~-~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 491 (559) .-.+. .|+.++..+- -| .++.+=+|+ .|.+.-.+ +. .+..+...+........+ T Consensus 443 ~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~~yi~k~ILr~tDee-------------i~--~~~k~I~~E~k~~~~~~~ 507 (511) T protein:vir:56 443 YFEEAKELEILNSRMNAMRDIQDYAGKYYSHKYIQKNILRLSDDQ-------------IT--AMQSEIDEEETNPRFQQD 507 (511) T ss_pred hHHHHHHHHHHHHHHHHHHHhcchhccccchHHHHHHHhccCHHH-------------HH--HHHHHHHHhhcCCCCCCc Confidence 33333 3333333221 01 135554443 33332110 00 000000000000000000 Q ss_pred cccCCCCCCCCCCCCccccccc Q lcl|NC_012530. 492 ESALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 492 ~~~~~~~~~~~~~~~~~~~~~~ 513 (559) . +.+ T Consensus 508 e------------------~~f 511 (511) T protein:vir:56 508 D------------------QGF 511 (511) T ss_pred c------------------cCC Confidence 0 000 No 200 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=98.32 E-value=1.4e-06 Score=52.69 Aligned_cols=461 Identities=11% Similarity=0.067 Sum_probs=170.9 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccc-cccCCCCCcccHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSI-VPKPSPIAFGRITDVLRQ 79 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~-~~~p~~~~~~~~~~~~~~ 79 (559) |-| ..+ .+++.+++..++..-...+ .+.+..=-.|++....++... .+..... .....+-++ . T Consensus 8 ~~~-~~~-~~~~~~~i~~~~~~~~~~~-~~~~~~YY~g~h~Il~r~~~~-----~~~~~~~~~d~~~~nnk--------i 71 (537) T protein:vir:78 8 KPI-DQL-GGLLNTEITTYMASNHIKW-AHIGENYYNQENDIEKSRIFY-----MNDKGQLREDNYASNVK--------I 71 (537) T ss_pred ccH-HHH-HHHHHHHHHHHHHHHHHHH-HHHHHHHhcccchhhhccccc-----ccccccccccccccccc--------c Confidence 111 111 1222233332221111111 112222233443333222110 0000000 000000000 0 Q ss_pred HhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHH Q lcl|NC_012530. 80 YSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKL 159 (559) Q Consensus 80 ~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~ 159 (559) ......-+|+..+.-+. |.+..+...+. ...+..+.+..++.+ .|......+ T Consensus 72 --~~nf~k~Ivd~~~~yl~-----------G~Pv~~~~~d~-----~~~e~~~~l~~~~~~----------~~~~~~~el 123 (537) T protein:vir:78 72 --SHGFFTELVDQLAQYLL-----------SNGVEVKVKDE-----DNTQLDEILQEYFDE----------DFQATIDTL 123 (537) T ss_pred --ccchHHHHHHHHhhhhc-----------ccCceeecCcc-----hhHHHHHHHHHHhhc----------cHHHHHHHH Confidence 01222223333332222 11112221111 111222333333321 223445566 Q ss_pred HHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEEE-ecC---------ceeeeecccceEEE Q lcl|NC_012530. 160 VRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQY-IDN---------KVRGSFTADEMGMF 229 (559) Q Consensus 160 v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~-~~~---------~~~~~~~~~evi~~ 229 (559) ..++..+|.+|.++.+|.+|.+. +..++|..+.++.++.+... ...+++.. ... .....++++.+.++ T Consensus 124 ~~~~s~~G~ay~~~y~de~~~~~-~~~i~p~~~~pv~d~~~~~~-~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y 201 (537) T protein:vir:78 124 VTNASKKGFEGIFARTTSEGKLK-FQTVDGLTLIPVFDDYGVLK-MIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYY 201 (537) T ss_pred HHHHhhcCeeEEEeeecCCCceE-EEEEccceeEEEEcCCCCce-eEEEEEeeeeccccccCcceEEEEEEEcCCcEEEE Confidence 77889999999999999998764 78889999988887665432 12222211 100 01223444444443 Q ss_pred eccc-------------------------------------------------CCCccCCcccccHHHHHHHHHHHHHHH Q lcl|NC_012530. 230 IRNP-------------------------------------------------RSDILSGGYGLSELEMGLREFISHENT 260 (559) Q Consensus 230 ~~n~-------------------------------------------------~~~~~~~~~G~Spl~~~~~~i~~~~~~ 260 (559) +... .-......+|+|-++.....|+....+ T Consensus 202 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~~ 281 (537) T protein:vir:78 202 IQDDEGVSTTYKLDEAYNPNPAPHVLAIEESTDADFEDTDGYQVLGRSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDVM 281 (537) T ss_pred EecCCcccccccccccccccccceeeeccccccccccccccccccccCCcceeEEEeccCccCCCchhhhHHHHHHHHHH Confidence 2110 000001124666666666666665555 Q ss_pred HHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc--CCceeeeeccccchhHHHHH Q lcl|NC_012530. 261 ELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT--AEDAKFVSMTQAEDMQFQSW 338 (559) Q Consensus 261 ~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~--~g~~~~~~ls~~~D~qf~e~ 338 (559) ..-.++.+..-+.|- +.+.+.. .+-..+ ++..++. .++..+. +++++|.....+.++ .... T Consensus 282 ~S~~an~~~~~~~~i--lvi~g~~--~~~~~~----~~~~l~~--------~~~i~v~~d~~~v~~l~~~~~~~~-~e~~ 344 (537) T protein:vir:78 282 NCFLSNNLQDFSEAI--YVVKGFS--GDSTDK----LRQNIKA--------KKMIGVNGDNAGMEIQTVSIPYEA-RKAK 344 (537) T ss_pred HHhhhhHHHHhcCce--eeeecCC--Cccchh----HHHHHhh--------cCceeecCCCCceeEEEecCCHHH-HHHH Confidence 555555555544444 4343321 111122 2222211 1222332 234555544443332 2334 Q ss_pred HHHHHHHHHHHhCCCHHHhccccccccccccccch--hhh---hHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCcc Q lcl|NC_012530. 339 LNYLINIICALVAMDPAEIGMQNRGGATGNKSNSL--NES---NNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDN 413 (559) Q Consensus 339 ~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~--~~a---n~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~ 413 (559) .+...+.|...-.+|. .... .. ++.++..+ -+. .-....+.++..+|+-.++.|...++.+-........ T Consensus 345 ld~L~~~I~~~s~~~~--~~~~--~~-gn~SGvAlk~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~~~~~~~d~~~ 419 (537) T protein:vir:78 345 MDIDVENIYRSGMGFN--STAV--GD-GNVTNVVIKSRYTLLAMKARKMETSLRKVLRWCADMVVSDIALRGLGEYDSND 419 (537) T ss_pred HHHHHHHHHHhcCCCC--Cccc--cc-cCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccce Confidence 5555556655433332 1111 11 11111010 001 1122233455666666666666555433222233456 Q ss_pred ceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceeccccccccccccccccccccccccc Q lcl|NC_012530. 414 YMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLE 492 (559) Q Consensus 414 ~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 492 (559) +.+.|+.-...+..+.++.+..+.. |+++..-+.+.+++ ++.- + ........................ T Consensus 420 i~i~f~~~~P~n~~e~a~~~~~l~~~giiS~eT~l~~~p~--vdd~-e--------~ek~~~ee~~~~~~~~~~~~~~~~ 488 (537) T protein:vir:78 420 ICFEIEPHVLANELDIATTRKTEAETEALKIGNIMTVAPR--IGDD-E--------TLKLIAEELDLDYNELKDALAEQD 488 (537) T ss_pred eeEEeccCCCCCHHHHHHHHHHHHhcCcchHHHHHHhCCC--CCCH-H--------HHHHHHHHHHhhhhhhhhhhhhhc Confidence 8889999899999999998877665 55788888877644 3310 0 000000000000000000000000 Q ss_pred ccCC-CCCCCCCCCCccccccchhccccccccccccccccccccccccccccccchhhhhhcc Q lcl|NC_012530. 493 SALQ-NPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQLKNKKNTNSYKQG 554 (559) Q Consensus 493 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~k~~~~~~~~~~~ 554 (559) .... .....++..++. ..+++..-++++++.|-.|. .+..|.. +..|- T Consensus 489 ~~~~~~~~~~~~~~~~~-----~~~~~~~~~d~~~~~~~~~~-~~~~~~~--------~~~~~ 537 (537) T protein:vir:78 489 AQSLDVSPDVQAMLDGL-----PVNANQPPVDPNQPVADPNV-VPPTDPN--------AVPQT 537 (537) T ss_pred ccccCcCcchhhhcCCC-----CCCCCCCCCCccCCCCCCCC-CCCCCCc--------cCCCC Confidence 0000 000000000000 01111111111111111110 0011111 00000 No 201 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=98.28 E-value=1.7e-06 Score=52.19 Aligned_cols=373 Identities=10% Similarity=0.029 Sum_probs=152.3 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcce-----eeeccccccc-ChhHHH----------------------HHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGY-----QVRLKNGDKP-TKEQQK----------------------KID 132 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~-----~v~~~d~~~~-~~~~~~----------------------~~~ 132 (559) +....|..||+..-..+..+-....+-.+.-.. +...+...++ ...... ... T Consensus 1 l~~~~l~~~i~~~~~~~~r~~~l~~yy~g~~~il~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~~~~~ 80 (429) T protein:vir:98 1 MTKDLLSELIQKHRSFNLSYSAYKQLYEGDHAILQQKQKEQYKPDNRLVVNFAKYIVDTFNGYFIGVPVQTSHENKQVSN 80 (429) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccCCCcceeecchHHHHHHHHhhhhcccCceeecCChHHHH Confidence 112223333333222222221111110000000 0000000000 000000 111 Q ss_pred HHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCccc-ccccceEEEE Q lcl|NC_012530. 133 YAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGH-RRTRGKIYRQ 211 (559) Q Consensus 133 ~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~-~~~~~~~y~~ 211 (559) .+..++.. ..+......+..+.+.+|.+|..+.++.+|+|. +..++|..+.++.+.... .....++|+. T Consensus 81 ~l~~~~~~---------n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~-~~~~~p~~~~~v~dd~~~~~~~~~i~~~~ 150 (429) T protein:vir:98 81 YLELLDGY---------NDQDDNNAELSKICSIYGHGYELVFNDENAEAG-ITYLTPLEAFIVYDDSIRQKPLFAVRYFY 150 (429) T ss_pred HHHHHHhh---------cCHhHHHHHHHHHHhhcCeEEEEEEecCCCcEE-EEEEcccceEEEEeCCCCCceEEEEEEEE Confidence 12222221 123345667788999999999999999999864 778899999887765432 1112222322 Q ss_pred EecCceeeeecccceEEEe-------------cc-----cCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_012530. 212 YIDNKVRGSFTADEMGMFI-------------RN-----PRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGT 273 (559) Q Consensus 212 ~~~~~~~~~~~~~evi~~~-------------~n-----~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~ 273 (559) .........+...+.++.. -| |.-......+|.|-++.+...++....+..-..+.+...+. T Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~ 230 (429) T protein:vir:98 151 NKGGVLEGSYSDASNITYFKDGEKGIEIGESEPHPFDGVPMIEYVENEERQSLLASVVTLINAFNKAISEKANDVEYFAD 230 (429) T ss_pred ecCceEEEEEEeCceEEEEEecCCceEecccccccCCccceEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 1111111111111111110 00 11011123357777777666666666555555555666666 Q ss_pred CceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC---ceeeeeccccc-hhHHHHHHHHHHHHHHHH Q lcl|NC_012530. 274 TKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE---DAKFVSMTQAE-DMQFQSWLNYLINIICAL 349 (559) Q Consensus 274 p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g---~~~~~~ls~~~-D~qf~e~~~~~~~~Ia~~ 349 (559) |-.+++ +. ..+++....++. +++..+..+ +.+...++.+. +..+....+...+.|+.. T Consensus 231 p~~~i~--g~----~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~ 292 (429) T protein:vir:98 231 AYLKIL--GA----ELDDETLKSLRD------------TRIINLKDTDAQQLTVEFLQKPDADATQEHLLDRLENLIFRT 292 (429) T ss_pred ceeeee--cC----CCCcchhhhHhh------------CceeeccCCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHH Confidence 765553 21 122332222211 122222221 12233333222 233555678889999999 Q ss_pred hCCCHHHhccccccccccccccch--hhhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhh Q lcl|NC_012530. 350 VAMDPAEIGMQNRGGATGNKSNSL--NESNN---QNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTR 424 (559) Q Consensus 350 fgVPp~~lg~~~~~~~~~~~~~~~--~~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~ 424 (559) -++|.. .....+ +.++... .++.. ....+..+..+|.-++..+...++..- .......+.+.|...... T Consensus 293 s~~p~~--~~~~~g---n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~-~~~d~~~i~v~f~~~~p~ 366 (429) T protein:vir:98 293 AMVANI--SDESFG---TASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSKI-GPKDWIGIKYKFTRNLPA 366 (429) T ss_pred hCcccc--Cccccc---cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-CccccccceEEeCCCCCc Confidence 999843 221111 1111000 01111 111123333344444444433332211 111223577889888888 Q ss_pred hHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCC Q lcl|NC_012530. 425 SQQDKLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPT 504 (559) Q Consensus 425 d~~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (559) |..+.++.+..+ .|+|+..-+.++++.-+-+ ...+..+.. ++...... ..+.....+.+ T Consensus 367 ~~~~~a~~~~kl-~g~is~et~~~~l~~v~d~----------~~E~~ri~~------E~~~~~~~----~~~~~~~~~~~ 425 (429) T protein:vir:98 367 NLLEESQIAGNL-AGIVSEETQVGVLSIVENP----------QKEIERKNS------DKSTLISR----QAGGLNGQNTT 425 (429) T ss_pred CHHHHHHHHHHH-hccCchHHHHHhCCCCCCH----------HHHHHHHHH------HHHHHHHH----HHhhhcCCCCC Confidence 988888877665 4667876677777542211 011111111 11100000 00011111111 Q ss_pred CCcc Q lcl|NC_012530. 505 LPPS 508 (559) Q Consensus 505 ~~~~ 508 (559) .+.+ T Consensus 426 ~~~~ 429 (429) T protein:vir:98 426 TILE 429 (429) T ss_pred CCCC Confidence 1111 No 202 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=98.26 E-value=2e-06 Score=51.88 Aligned_cols=422 Identities=9% Similarity=0.024 Sum_probs=171.5 Q ss_pred CcchhhhccccccCCcchHHHHHHHH-HHH--HHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSK-IAN--DTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVL 77 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~--~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~ 77 (559) -+++..+.-+.-..++.+.+..+-.+ +.. +.+.+=-.|++.....+.. ..+........-.+.++ T Consensus 9 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~-----~~~~~~~~~~~~~~~~k------- 76 (479) T protein:vir:79 9 TDLIKVQLKKESTINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRY-----YLLDGAKVDDFTKVNNK------- 76 (479) T ss_pred cceEeeccccCChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCcccccccc-----cccccccccccccCcce------- Confidence 12222221111111111111111111 111 1111112233322222110 00000000000000000 Q ss_pred HHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHH Q lcl|NC_012530. 78 RQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLR 157 (559) Q Consensus 78 ~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~ 157 (559) ..++....+|+..+.-+..-| ..+...+ + +....+..|+. + .|..... T Consensus 77 ---i~~~~~~~Ivd~~~~~l~g~p-----------~~~~~~~-----~---~~~~~~~~~~~----n------~~~~~~~ 124 (479) T protein:vir:79 77 ---AINNYHKLLVDQKVGYSVGNP-----------IVFNADD-----D---NLTKLLNDLLG----E------EFDDTIT 124 (479) T ss_pred ---eecchHHHHHHHHHhhhhcCC-----------ceeccCC-----H---HHHHHHHHHHh----c------CHHHHHH Confidence 123444555555544432211 1121111 1 11122333322 1 2345566 Q ss_pred HHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCccc-ccccceEEEEEec--Cc---eeeeecccceEEEec Q lcl|NC_012530. 158 KLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGH-RRTRGKIYRQYID--NK---VRGSFTADEMGMFIR 231 (559) Q Consensus 158 ~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~-~~~~~~~y~~~~~--~~---~~~~~~~~evi~~~~ 231 (559) .++.+.+.+|.+|..+..+.+|++. +..++|..+.++.+..+. .....++|+...+ +. ....+....+.|++. T Consensus 125 ~~~~~~~~~G~~~~~v~~d~~~~~~-i~~~~p~~~~~v~d~~~~~~~~~~ir~y~~~~~~~~~~~~~e~y~~~~i~~~~~ 203 (479) T protein:vir:79 125 ELYLNASNKGVEWLHPYINRKGEFK-YVIIPAEEAIPIWDSKRQRELVAFIRFYYIEDIDGNKIKRVEYYTENDITYFIE 203 (479) T ss_pred HHHHHHHhcCeEEEEEEeCCCCceE-EEEEccceeEEEEeCCCCCceEEEEEEEEEeecCCceEEEEEEEeCCcEEEEEe Confidence 7788899999999999999888865 788899999888765432 1222233332221 11 111233333333321 Q ss_pred c------------------------------------cCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_012530. 232 N------------------------------------PRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTK 275 (559) Q Consensus 232 n------------------------------------~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~ 275 (559) . |.-......+|.|-++.....++....+..-..+.+...+.|- T Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~ 283 (479) T protein:vir:79 204 RGNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKVPFIPFKNNEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVI 283 (479) T ss_pred cCCcccccccccccccccccccccccccccccCCCcccEEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCce Confidence 0 0000011234677777666666655555555555566666666 Q ss_pred eEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc-cCCceeeeeccccchhHHHHHHHHHHHHHHHHhCCCH Q lcl|NC_012530. 276 GILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI-TAEDAKFVSMTQAEDMQFQSWLNYLINIICALVAMDP 354 (559) Q Consensus 276 gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl-~~g~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp 354 (559) .++.- . ++...++....+ ..+++..+ .+++++|..... .+..+....+...+.|...-++|. T Consensus 284 ~v~~g--~--~~~~~~~~~~~~------------~~~~~i~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~ 346 (479) T protein:vir:79 284 YVLKE--Y--PGTSLQEFIDNI------------RYYKSIKVDGGGGVDKLEINI-PVEAKKELLDRLEKNIIIFGQGVN 346 (479) T ss_pred eeeec--C--Cccccccchhhh------------hhccceecCCCCcceEEeccC-CHHHHHHHHHHHHHHHHHHhCccc Confidence 55532 1 111222211111 11222223 234455544333 234456667777888888888886 Q ss_pred HHhccccccccccccccchhhhh---HHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHH Q lcl|NC_012530. 355 AEIGMQNRGGATGNKSNSLNESN---NQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLK 431 (559) Q Consensus 355 ~~lg~~~~~~~~~~~~~~~~~an---~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~ 431 (559) .-.+ ..+..++. .-..-++. .....+..+...|+-+++.+...++..-........+.+.|......+.++.++ T Consensus 347 ~~~~--~~gn~Sg~-Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~i~f~~~~p~~~~~~a~ 423 (479) T protein:vir:79 347 PESQ--NTGDKSGV-ALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKISGNKSYDYKTVQITFNHSMIINEAEKID 423 (479) T ss_pred cccc--cccchhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccceEEeCCCCCcCHHHHHH Confidence 4332 11111100 00000000 112233444555555555555444432112223345788888888888888888 Q ss_pred HHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccc Q lcl|NC_012530. 432 SVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSN 511 (559) Q Consensus 432 ~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (559) .+..+ .|+|+...+.++++. ++ |. ...+..+.. ++....... ....+..++.. .| T Consensus 424 ~~~kl-~g~iS~et~l~~l~~--v~--d~------~~E~~ri~~------E~~~~~~~~--~~~~~~~~~~~------~e 478 (479) T protein:vir:79 424 MAAKS-TGIVSDETIVSNHPW--VE--DV------NDELERLKK------QEDTQKEYD--DLIPNNQDGVI------DE 478 (479) T ss_pred HHHHH-hccCcHHHHHHhCCC--CC--CH------HHHHHHHHH------HHHHHHHHH--hccCcccCCCc------Cc Confidence 77665 466787777777654 21 10 011111110 000000000 00000000000 00 Q ss_pred c Q lcl|NC_012530. 512 S 512 (559) Q Consensus 512 ~ 512 (559) + T Consensus 479 ~ 479 (479) T protein:vir:79 479 T 479 (479) T ss_pred C Confidence 0 No 203 >protein:vir:97900 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655104;genbank:gi:109391854;genbank:GeneID:4157263 Probab=98.25 E-value=3.9e-07 Score=55.74 Aligned_cols=490 Identities=13% Similarity=0.072 Sum_probs=201.4 Q ss_pred HHHHHHHHHHHhhhhcccccccccccc-c--cccccccccccccccCCCCCccc-HHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_012530. 22 HIDSKIANDTASKALNGVDRAYTEPVD-G--NLMFSTLEDTSIVPKPSPIAFGR-ITDVLRQYSMNVVLNAIINTRANQV 97 (559) Q Consensus 22 ~~~~~~~~~~~~~~~~gr~~a~~~~~~-~--~~~~~~~~~~~~~~~p~~~~~~~-~~~~~~~~~~~~~v~acv~~ia~~i 97 (559) |...+ .+...+.+.+.. +-.+..+ . ++...++...+.-. -.+.+.. ..+-.+.+.-.+.++-.|.-|++++ T Consensus 1 ma~~~--lr~~rrpk~~p~-~~rr~~ltaAsq~~~~p~~~~kt~~--~~~ar~~WQ~eAW~~~d~v~Elry~vgW~~~s~ 75 (639) T protein:vir:97 1 MAATS--LRVVRRPKGSAP-AARRRSLTAASQLITDPQKQMKTSL--MGTARNEWQSEAWDFSESIGELSYYVSWRANSC 75 (639) T ss_pred CCccc--eeeeecCCCCCc-chhhHHHhhhhhccCCcccchhhhc--cccchhhhhhhhhhhhhhhhhHHHHhhhhhhhh Confidence 11111 011111111110 0000000 0 00000100000000 0000001 0111222223467777788888888 Q ss_pred HhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEE-EC Q lcl|NC_012530. 98 TEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENT-YD 176 (559) Q Consensus 98 a~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~-rd 176 (559) +..-++...-....+... .++..++.-..+.+.....+...-+ ..-.++++.+..++-+-|.+|+.++ |. T Consensus 76 sr~rL~as~idpDtg~Pt-----G~V~~E~d~~~~~v~~~v~~iagG~----lGqa~llkr~~~~ltV~GE~wi~~l~r~ 146 (639) T protein:vir:97 76 SRTTLIPSAIDPDTGLPT-----GEVDIEEDPDAQTVADYVKGIADGP----LGQAALIKRAVECMTVVGEVWIAVLIRQ 146 (639) T ss_pred ceeeeEeeeeccccCCCC-----CccccccccCcchHHHHHHhhcCcc----chHHHHHHHHHhheecccceEEEEEEec Confidence 865443321111111000 0000011001111222222222211 1224689999999999999997644 33 Q ss_pred CCC------cEEEEEEe-cCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHH Q lcl|NC_012530. 177 SNG------RLSHTRMV-DPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEM 249 (559) Q Consensus 177 ~~G------~~~~L~~l-~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~ 249 (559) .++ .+.+-|++ ...-|.. ...| ..-...-+|.........++++..++|.+.. ..+--||+.+ T Consensus 147 ~k~~~~~~~~~~~~W~vvs~~Ei~~--~~~~------~~~i~lPdG~~he~~~~~d~l~RvW~P~prr--~~e~dSpvra 216 (639) T protein:vir:97 147 EKDPVTGLAAPRARWYAVTREEIKS--KAGE------TAEISLPDGKTHEFNRDLDSLVRIWNPRPRK--ASQATSPVRA 216 (639) T ss_pred CccccCcccccccceeeeeHHHhcc--cCCC------eeEeecCCCCCccccCCCceEEEEeCCCccc--ccCCcchhHH Confidence 333 23433443 2222321 1111 1112222333332333446666667775532 3456799999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCH--------------------HHHHHHHHHHH----HHh Q lcl|NC_012530. 250 GLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSM--------------------RALEDFKRHWT----ATS 305 (559) Q Consensus 250 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~--------------------e~~~~l~~~~~----~~~ 305 (559) ++..+.-.........+..+.-.+-.|||.++...+.+.... -+.+.|...|- .++ T Consensus 217 ~l~~l~Ei~~~t~~i~aaakSRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai 296 (639) T protein:vir:97 217 CLETLREIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAM 296 (639) T ss_pred HHHHHHHHHHhhhHHHHHHHHHHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhh Confidence 888887777666666555555555556766655443332221 11333433332 333 Q ss_pred cCc-ccccccccccCC------ceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHH-hccccccccccccccchhhhh Q lcl|NC_012530. 306 SGI-NGAYRIPMITAE------DAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAE-IGMQNRGGATGNKSNSLNESN 377 (559) Q Consensus 306 ~G~-~nag~~~vl~~g------~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~-lg~~~~~~~~~~~~~~~~~an 377 (559) ... ..+--+||+... +++...+.+.-+.--+.+|+..+..||....|||.. ||+.+.+-.+...-+ T Consensus 297 ~De~S~aA~vPiia~~p~E~l~~ikhl~f~~ei~e~aiktR~daI~RlA~glDi~pE~LLGl~d~NHWsAWqI~------ 370 (639) T protein:vir:97 297 EDENSQAAYIPLVASVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIG------ 370 (639) T ss_pred cCCCCccceeeeeEeechHHhcCeeeeeecCchhHHHHhhHHHHHHHHHhccCCchhheeecccccceEEEEec------ Confidence 222 124456776432 233333434445567889999999999999999875 566433222211111 Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHhhccccc---c---Cccceeeecc-hhhhhHHHHHHHHHHHHcCCCCHHHHHHHh Q lcl|NC_012530. 378 NQNKIDASKSKGLMPLLDMIAKNLTNGIIRQI---L---GDNYMLEFVG-GDTRSQQDKLKSVQLELQTATTVNDYREKQ 450 (559) Q Consensus 378 ~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~---~---~~~~~~~f~~-l~~~d~~~~~~~~~~~~~~~~T~NE~R~~~ 450 (559) . .-++-.|.|.+..|+++|++.+|.+. + -.+|.+-|+. .+..+.....++.+..-+|.||-.-.|+.+ T Consensus 371 d-----edvrlHI~P~l~~icdAlT~~~Lrp~Le~eGvDp~kYvvW~DaS~Lt~dPd~~deA~qa~drGAIt~eAlR~~l 445 (639) T protein:vir:97 371 D-----EDVQLHIKPVMDLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSAALRRLL 445 (639) T ss_pred c-----cceeeecchhHHHHHHHHHhhHHHHHHHHhCCCHHHhEeeecCcccccCCCCcHHHHHHHHcCCccHHHHHHHh Confidence 1 12345699999999999999887642 2 2468888874 445555544555555557889999999999 Q ss_pred CCCCCCCCCEeeccc--------------eecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhc Q lcl|NC_012530. 451 GLPKIAGGDIILSAV--------------YIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQN 516 (559) Q Consensus 451 gl~pi~gGD~~~~~~--------------~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (559) |+---.|=|+.-.+. -+.-+.+.. .. ..+.....++.......++.+ .+.+...+. T Consensus 446 G~~edd~yd~~t~e~~~~~A~~~V~~~P~li~~~apl~----~P-~lq~~e~ptp~~a~~~a~~~~-----~~de~~ga~ 515 (639) T protein:vir:97 446 NVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYAPLL----SS-QLAGIEFPQPANAIESTREDE-----EDDEDSGAR 515 (639) T ss_pred ccccccCCCCCCcHHHHHHHHHHhcCCcchhhhhhhcc----Cc-cceecccCCCCCCCCCCCCCC-----CcccccCCC Confidence 985433222110000 000000000 00 000011111111111001000 000011111 Q ss_pred ccccccccccccccccccccccccccccc-----chhhhhhccCCCC-C Q lcl|NC_012530. 517 QEGYTGKDAKPSGKDNQQGVGKDGQLKNK-----KNTNSYKQGGSSK-K 559 (559) Q Consensus 517 ~~~~~~~~~~~~g~~~~~~~~~~~~~k~~-----~~~~~~~~~~~~~-~ 559 (559) +...-+.++... ......-+.-.+-. --..+-...||-- + T Consensus 516 ~~~ePdte~~~~---~~~a~~~~~~a~~v~a~~llv~RALelAGkRr~~ 561 (639) T protein:vir:97 516 QQREPQTEDERS---TEEAASLNDRAAYLVAERLLVNRALDLAGKRRFK 561 (639) T ss_pred CCcCCCcccccC---CccccCcCchhHHHHHHHHHHHHHHHhhcccccC Confidence 100000000000 00000000000000 0001112233321 1 No 204 >protein:vir:107517 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943786;genbank:gi:38638411;genbank:GeneID:2657197 Probab=98.25 E-value=3.9e-07 Score=55.74 Aligned_cols=490 Identities=13% Similarity=0.072 Sum_probs=201.4 Q ss_pred HHHHHHHHHHHhhhhcccccccccccc-c--cccccccccccccccCCCCCccc-HHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_012530. 22 HIDSKIANDTASKALNGVDRAYTEPVD-G--NLMFSTLEDTSIVPKPSPIAFGR-ITDVLRQYSMNVVLNAIINTRANQV 97 (559) Q Consensus 22 ~~~~~~~~~~~~~~~~gr~~a~~~~~~-~--~~~~~~~~~~~~~~~p~~~~~~~-~~~~~~~~~~~~~v~acv~~ia~~i 97 (559) |...+ .+...+.+.+.. +-.+..+ . ++...++...+.-. -.+.+.. ..+-.+.+.-.+.++-.|.-|++++ T Consensus 1 ma~~~--lr~~rrpk~~p~-~~rr~~ltaAsq~~~~p~~~~kt~~--~~~ar~~WQ~eAW~~~d~v~Elry~vgW~~~s~ 75 (639) T protein:vir:10 1 MAATS--LRVVRRPKGSAP-AARRRSLTAASQLITDPQKQMKTSL--MGTARNEWQSEAWDFSESIGELSYYVSWRANSC 75 (639) T ss_pred CCccc--eeeeecCCCCCc-chhhHHHhhhhhccCCcccchhhhc--cccchhhhhhhhhhhhhhhhhHHHHhhhhhhhh Confidence 11111 011111111110 0000000 0 00000100000000 0000001 0111222223467777788888888 Q ss_pred HhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEE-EC Q lcl|NC_012530. 98 TEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENT-YD 176 (559) Q Consensus 98 a~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~-rd 176 (559) +..-++...-....+... .++..++.-..+.+.....+...-+ ..-.++++.+..++-+-|.+|+.++ |. T Consensus 76 sr~rL~as~idpDtg~Pt-----G~V~~E~d~~~~~v~~~v~~iagG~----lGqa~llkr~~~~ltV~GE~wi~~l~r~ 146 (639) T protein:vir:10 76 SRTTLIPSAIDPDTGLPT-----GEVDIEEDPDAQTVADYVKGIADGP----LGQAALIKRAVECMTVVGEVWIAVLIRQ 146 (639) T ss_pred ceeeeEeeeeccccCCCC-----CccccccccCcchHHHHHHhhcCcc----chHHHHHHHHHhheecccceEEEEEEec Confidence 865443321111111000 0000011001111222222222211 1224689999999999999997644 33 Q ss_pred CCC------cEEEEEEe-cCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHH Q lcl|NC_012530. 177 SNG------RLSHTRMV-DPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEM 249 (559) Q Consensus 177 ~~G------~~~~L~~l-~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~ 249 (559) .++ .+.+-|++ ...-|.. ...| ..-...-+|.........++++..++|.+.. ..+--||+.+ T Consensus 147 ~k~~~~~~~~~~~~W~vvs~~Ei~~--~~~~------~~~i~lPdG~~he~~~~~d~l~RvW~P~prr--~~e~dSpvra 216 (639) T protein:vir:10 147 EKDPVTGLAAPRARWYAVTREEIKS--KAGE------TAEISLPDGKTHEFNRDLDSLVRIWNPRPRK--ASQATSPVRA 216 (639) T ss_pred CccccCcccccccceeeeeHHHhcc--cCCC------eeEeecCCCCCccccCCCceEEEEeCCCccc--ccCCcchhHH Confidence 333 23433443 2222321 1111 1112222333332333446666667775532 3456799999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCH--------------------HHHHHHHHHHH----HHh Q lcl|NC_012530. 250 GLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSM--------------------RALEDFKRHWT----ATS 305 (559) Q Consensus 250 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~--------------------e~~~~l~~~~~----~~~ 305 (559) ++..+.-.........+..+.-.+-.|||.++...+.+.... -+.+.|...|- .++ T Consensus 217 ~l~~l~Ei~~~t~~i~aaakSRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai 296 (639) T protein:vir:10 217 CLETLREIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAM 296 (639) T ss_pred HHHHHHHHHHhhhHHHHHHHHHHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhh Confidence 888887777666666555555555556766655443332221 11333433332 333 Q ss_pred cCc-ccccccccccCC------ceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHH-hccccccccccccccchhhhh Q lcl|NC_012530. 306 SGI-NGAYRIPMITAE------DAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAE-IGMQNRGGATGNKSNSLNESN 377 (559) Q Consensus 306 ~G~-~nag~~~vl~~g------~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~-lg~~~~~~~~~~~~~~~~~an 377 (559) ... ..+--+||+... +++...+.+.-+.--+.+|+..+..||....|||.. ||+.+.+-.+...-+ T Consensus 297 ~De~S~aA~vPiia~~p~E~l~~ikhl~f~~ei~e~aiktR~daI~RlA~glDi~pE~LLGl~d~NHWsAWqI~------ 370 (639) T protein:vir:10 297 EDENSQAAYIPLVASVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIG------ 370 (639) T ss_pred cCCCCccceeeeeEeechHHhcCeeeeeecCchhHHHHhhHHHHHHHHHhccCCchhheeecccccceEEEEec------ Confidence 222 124456776432 233333434445567889999999999999999875 566433222211111 Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHhhccccc---c---Cccceeeecc-hhhhhHHHHHHHHHHHHcCCCCHHHHHHHh Q lcl|NC_012530. 378 NQNKIDASKSKGLMPLLDMIAKNLTNGIIRQI---L---GDNYMLEFVG-GDTRSQQDKLKSVQLELQTATTVNDYREKQ 450 (559) Q Consensus 378 ~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~---~---~~~~~~~f~~-l~~~d~~~~~~~~~~~~~~~~T~NE~R~~~ 450 (559) . .-++-.|.|.+..|+++|++.+|.+. + -.+|.+-|+. .+..+.....++.+..-+|.||-.-.|+.+ T Consensus 371 d-----edvrlHI~P~l~~icdAlT~~~Lrp~Le~eGvDp~kYvvW~DaS~Lt~dPd~~deA~qa~drGAIt~eAlR~~l 445 (639) T protein:vir:10 371 D-----EDVQLHIKPVMDLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSAALRRLL 445 (639) T ss_pred c-----cceeeecchhHHHHHHHHHhhHHHHHHHHhCCCHHHhEeeecCcccccCCCCcHHHHHHHHcCCccHHHHHHHh Confidence 1 12345699999999999999887642 2 2468888874 445555544555555557889999999999 Q ss_pred CCCCCCCCCEeeccc--------------eecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhc Q lcl|NC_012530. 451 GLPKIAGGDIILSAV--------------YIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQN 516 (559) Q Consensus 451 gl~pi~gGD~~~~~~--------------~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (559) |+---.|=|+.-.+. -+.-+.+.. .. ..+.....++.......++.+ .+.+...+. T Consensus 446 G~~edd~yd~~t~e~~~~~A~~~V~~~P~li~~~apl~----~P-~lq~~e~ptp~~a~~~a~~~~-----~~de~~ga~ 515 (639) T protein:vir:10 446 NVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYAPLL----SS-QLAGIEFPQPANAIESTREDE-----EDDEDSGAR 515 (639) T ss_pred ccccccCCCCCCcHHHHHHHHHHhcCCcchhhhhhhcc----Cc-cceecccCCCCCCCCCCCCCC-----CcccccCCC Confidence 985433222110000 000000000 00 000011111111111001000 000011111 Q ss_pred ccccccccccccccccccccccccccccc-----chhhhhhccCCCC-C Q lcl|NC_012530. 517 QEGYTGKDAKPSGKDNQQGVGKDGQLKNK-----KNTNSYKQGGSSK-K 559 (559) Q Consensus 517 ~~~~~~~~~~~~g~~~~~~~~~~~~~k~~-----~~~~~~~~~~~~~-~ 559 (559) +...-+.++... ......-+.-.+-. --..+-...||-- + T Consensus 516 ~~~ePdte~~~~---~~~a~~~~~~a~~v~a~~llv~RALelAGkRr~~ 561 (639) T protein:vir:10 516 QQREPQTEDERS---TEEAASLNDRAAYLVAERLLVNRALDLAGKRRFK 561 (639) T ss_pred CCcCCCcccccC---CccccCcCchhHHHHHHHHHHHHHHHhhcccccC Confidence 100000000000 00000000000000 0001112233321 1 No 205 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=98.25 E-value=2.1e-06 Score=51.76 Aligned_cols=453 Identities=11% Similarity=0.122 Sum_probs=179.2 Q ss_pred Ccc--hhhhccccccCCcchHHHHHHHHHH-HHHHhhhhcccccccccccccccccc-ccccccccc-----cCCCCCcc Q lcl|NC_012530. 1 MGI--FDRFRTKFYTDDPNAFFKHIDSKIA-NDTASKALNGVDRAYTEPVDGNLMFS-TLEDTSIVP-----KPSPIAFG 71 (559) Q Consensus 1 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~gr~~a~~~~~~~~~~~~-~~~~~~~~~-----~p~~~~~~ 71 (559) |++ |+=| ++....-...+.+..+... +...-+...|....+. ...+. .++.+.... -+...+.. T Consensus 1 m~~~~L~~~--~~w~~~de~~~~~~~~~~~~S~~~p~~~Dga~e~~~-----~~~~~a~~~~g~~~~~~g~~e~~~~~~~ 73 (524) T protein:vir:10 1 MKFNVLSLF--APWAKMDERNFKDQEKEDLVSITAPKLDDGAREFEV-----SSNEAASPYNAAFQTIFGSYEPGMKTTR 73 (524) T ss_pred CCCchhhHh--hccccCcchhhhhhhccCCccccCccCCCCceeeee-----cccccccccceeeeehhcccccccchHH Confidence 766 5544 2222221122222222111 1111122222211110 00000 111111110 11112222 Q ss_pred cHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhh Q lcl|NC_012530. 72 RITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDD 151 (559) Q Consensus 72 ~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~ 151 (559) .+...-+..+.+|.|..+|.-|.+.+. ..+.+....++.+.+.. ..+..+.++..--..+.+... ... T Consensus 74 eLI~~YR~ma~~pEvd~Av~eIVneai------v~d~~~~pV~l~L~~~~-~s~~iK~kI~eeF~~Il~ll~-F~~---- 141 (524) T protein:vir:10 74 ELIDTYRNLMNNYEVDNAVSEIVSDAI------VYEDDTEVVALNLDKSK-FSPKIKNMMLDEFNDVLNHLS-FQR---- 141 (524) T ss_pred HHHHHHHHHhhccchhhHHHHhhccee------EecCCCceEEEEecCcC-cchHHHHHHHHHHHHHHHHhc-cch---- Confidence 233333455778999999998888753 34455556666664433 344444444333333333211 111 Q ss_pred HHHHHHHHHHHHHHcCCcceEEEECCC---CcEEEEEEecCceEEEEe-----cCcccccccc-eEEEEEecCce----- Q lcl|NC_012530. 152 FTSFLRKLVRDTYTYDQVNYENTYDSN---GRLSHTRMVDPTTIYFAN-----DEHGHRRTRG-KIYRQYIDNKV----- 217 (559) Q Consensus 152 ~~~f~~~~v~d~ll~Gna~~~i~rd~~---G~~~~L~~l~p~~V~~~~-----~~~g~~~~~~-~~y~~~~~~~~----- 217 (559) --..+++.+++.|..|+.++-|.. .-+.+|..|||.+|+.++ ...|.....+ .-|+.+..+.. T Consensus 142 ---~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~ 218 (524) T protein:vir:10 142 ---KGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREIITETEAGTKIVKGYKEYFIYDTAHESYACD 218 (524) T ss_pred ---hhhHHHhhheeeeEEEEEEEeeCCCccccceeeeeeCCccceeeeeeccCCCccchhhcchhhheeeccCccccccC Confidence 122345566788999999887632 348999999999986532 2222211111 11222211111 Q ss_pred --------eeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccC Q lcl|NC_012530. 218 --------RGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNT 289 (559) Q Consensus 218 --------~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~ 289 (559) ...++ .+.|++.+.-.-+. .+..=+|-|..|...+....-.+...-=|==.-|.-+-|.-++-+..|..- T Consensus 219 g~~~~~~~~ikI~-~dAI~y~hSGL~d~-~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~K 296 (524) T protein:vir:10 219 GRMYEAGTKIKIP-KAAIVYAHSGLVDC-CGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARK 296 (524) T ss_pred ccccCCCcceecc-hhheeeeeccceeC-CCCceeccchhhhHHHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchh Confidence 11222 34444433111111 111224557777777766666665554433333333445555444333221 Q ss_pred CHHHHHHHHHHHHHHh-----cC-cccc-ccccccc--------CC-ceeeeeccccchhHHHHHHHHHHHHHHHHhCCC Q lcl|NC_012530. 290 SMRALEDFKRHWTATS-----SG-INGA-YRIPMIT--------AE-DAKFVSMTQAEDMQFQSWLNYLINIICALVAMD 353 (559) Q Consensus 290 ~~e~~~~l~~~~~~~~-----~G-~~na-g~~~vl~--------~g-~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVP 353 (559) .++-+..+-..+++.+ .| ..+. .-..+++ +| +.++..|.-...+--++-..|..+.+.++++|| T Consensus 297 AeqYl~~im~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP 376 (524) T protein:vir:10 297 AAEHMQHVMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDVRWFRQALYMALRVP 376 (524) T ss_pred HHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCc Confidence 1111122222222111 01 0010 1111221 11 233333322233444666778888999999999 Q ss_pred HHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhc----c-----ccccC----ccceeeecc Q lcl|NC_012530. 354 PAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGI----I-----RQILG----DNYMLEFVG 420 (559) Q Consensus 354 p~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L----~-----~~~~~----~~~~~~f~~ 420 (559) .+.|.-...++.+.+.++..++ ++. -....|.-+..++...|...| + ++.++ ..+.|+|.. T Consensus 377 ~sRl~~d~~~~f~~gr~~EItR---DEi---kF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~ 450 (524) T protein:vir:10 377 LSRIPQDQQGGVMFDSGTSITR---DEL---TFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHR 450 (524) T ss_pred hhhcCCCCCccccccccchhhH---HHH---HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeee Confidence 9999433333333333322222 111 112334445555544444333 2 22222 346777765 Q ss_pred hhhhhHH-------HHHHHHHHHHc--C-CCCHHHHHH-HhCCCCCCCCCEeeccceecccccccccccccccccccccc Q lcl|NC_012530. 421 GDTRSQQ-------DKLKSVQLELQ--T-ATTVNDYRE-KQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLT 489 (559) Q Consensus 421 l~~~d~~-------~~~~~~~~~~~--~-~~T~NE~R~-~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 489 (559) ...-.+. .|+.++..+-- | .++.+=||+ .|.+.-.+ +. .+..+.+.+...... T Consensus 451 Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDee-------------i~--~~~k~I~~E~k~~~~- 514 (524) T protein:vir:10 451 DSYFTELKEAEILERRINMLTMAEPFIGKYISHRTAMKDILQMTDEE-------------IE--QEAKQIEEESKEARF- 514 (524) T ss_pred cchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHH-------------HH--HHHHHHHHHhhcCCC- Confidence 4433333 33333332211 1 134444443 33332110 00 000011100000000 Q ss_pred cccccCCCCCCCCCCCCccccccc Q lcl|NC_012530. 490 QLESALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 490 ~~~~~~~~~~~~~~~~~~~~~~~~ 513 (559) .++++. .+.+ T Consensus 515 ------~~~~~~--------~~~f 524 (524) T protein:vir:10 515 ------QDPDQE--------QEDF 524 (524) T ss_pred ------CCCchh--------hhcC Confidence 001110 0111 No 206 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=98.25 E-value=2.1e-06 Score=51.73 Aligned_cols=453 Identities=11% Similarity=0.123 Sum_probs=179.1 Q ss_pred Ccc--hhhhccccccCCcchHHHHHHHHHH-HHHHhhhhcccccccccccccccccc-ccccccccc-----cCCCCCcc Q lcl|NC_012530. 1 MGI--FDRFRTKFYTDDPNAFFKHIDSKIA-NDTASKALNGVDRAYTEPVDGNLMFS-TLEDTSIVP-----KPSPIAFG 71 (559) Q Consensus 1 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~gr~~a~~~~~~~~~~~~-~~~~~~~~~-----~p~~~~~~ 71 (559) |++ |+=| ++....-...+.+..+... +...-+...|-...+. ...+. .++.+.... -+...+.. T Consensus 1 m~~~~L~~~--~~w~~~de~~~~~~~~~~~~S~~~p~~~Dga~e~~~-----~~~~~a~~~~g~~~~~~g~~e~~~~~~~ 73 (524) T protein:vir:72 1 MKFNVLSLF--APWAKMDERNFKDQEKEDLVSITAPKLDDGAREFEV-----SSNEAASPYNAAFQTIFGSYEPGMKTTR 73 (524) T ss_pred CCCchhhHh--hccccCcchhhhhhhccCCccccCccCCCCceeeee-----cccccccccceeeeehhcccccccchHH Confidence 766 5544 2222221122222222111 1111122222211110 00000 111111110 11112222 Q ss_pred cHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhh Q lcl|NC_012530. 72 RITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDD 151 (559) Q Consensus 72 ~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~ 151 (559) .+...-+..+.+|.|..+|.-|.+.+. ..+.+....++.+.+.. ..+..+.++..--..+.+... ... T Consensus 74 eLI~~YR~ma~~pEvd~Av~eIVneai------v~d~~~~pV~l~L~~~~-~s~~iK~kI~eeF~~Il~ll~-F~~---- 141 (524) T protein:vir:72 74 ELIDTYRNLMNNYEVDNAVSEIVSDAI------VYEDDTEVVALNLDKSK-FSPKIKNMMLDEFSDVLNHLS-FQR---- 141 (524) T ss_pred HHHHHHHHHhhccchhhHHHHhhccee------EecCCCceEEEEecCcC-cchHHHHHHHHHHHHHHHHhc-cch---- Confidence 233333455778999999998888753 34455556666664433 344444444333333333211 111 Q ss_pred HHHHHHHHHHHHHHcCCcceEEEECCC---CcEEEEEEecCceEEEEe-----cCcccccccc-eEEEEEecCce----- Q lcl|NC_012530. 152 FTSFLRKLVRDTYTYDQVNYENTYDSN---GRLSHTRMVDPTTIYFAN-----DEHGHRRTRG-KIYRQYIDNKV----- 217 (559) Q Consensus 152 ~~~f~~~~v~d~ll~Gna~~~i~rd~~---G~~~~L~~l~p~~V~~~~-----~~~g~~~~~~-~~y~~~~~~~~----- 217 (559) --..+++.+++.|..|+.++-|.. .-+.+|..|||.+|+.++ ...|.....+ .-|+.+..+.. T Consensus 142 ---~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~ 218 (524) T protein:vir:72 142 ---KGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREIITETEAGTKIVKGYKEYFIYDTAHESYACD 218 (524) T ss_pred ---hhhHHHhhheeeeEEEEEEEEeCCCccccceeeeeeCCccceeeeeeccCCCccchhhcchhhheeeccCccccccC Confidence 122345566788999999887632 348999999999986532 2222211111 11222211111 Q ss_pred --------eeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccC Q lcl|NC_012530. 218 --------RGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNT 289 (559) Q Consensus 218 --------~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~ 289 (559) ...++ .+.|++.+.-.-+. .+..=+|-|..|...+....-.+...-=|==.-|.-+-|.-++-+..|..- T Consensus 219 g~~~~~~~~ikI~-~dAI~y~hSGL~d~-~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~K 296 (524) T protein:vir:72 219 GRMYEAGTKIKIP-KAAVVYAHSGLVDC-CGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARK 296 (524) T ss_pred ccccCCCcceecc-hhheeeeeccceeC-CCCceeccchhhhHhHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchh Confidence 11222 34444433111111 111224557777777766666665554433333333445555444333221 Q ss_pred CHHHHHHHHHHHHHHh-----cC-cccc-ccccccc--------CC-ceeeeeccccchhHHHHHHHHHHHHHHHHhCCC Q lcl|NC_012530. 290 SMRALEDFKRHWTATS-----SG-INGA-YRIPMIT--------AE-DAKFVSMTQAEDMQFQSWLNYLINIICALVAMD 353 (559) Q Consensus 290 ~~e~~~~l~~~~~~~~-----~G-~~na-g~~~vl~--------~g-~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVP 353 (559) .++-+..+-..+++.+ .| ..+. .-..+++ +| +.++..|.-...+--++-..|..+.+.++++|| T Consensus 297 AeqYl~~im~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP 376 (524) T protein:vir:72 297 AAEHMQHVMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDIRWFRQALYMALRVP 376 (524) T ss_pred HHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCc Confidence 1111122222222111 01 0010 1111221 11 233333322233444666778888999999999 Q ss_pred HHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhc----c-----ccccC----ccceeeecc Q lcl|NC_012530. 354 PAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGI----I-----RQILG----DNYMLEFVG 420 (559) Q Consensus 354 p~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L----~-----~~~~~----~~~~~~f~~ 420 (559) .+.|.-...++.+.+.++..++ ++. -....|.-+..++...|...| + ++.++ ..+.|+|.. T Consensus 377 ~sRl~~d~~~~f~~gr~~EItR---DEi---kF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~ 450 (524) T protein:vir:72 377 LSRIPQDQQGGVMFDSGTSITR---DEL---TFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHR 450 (524) T ss_pred hhhcCCCCCccccccccchhhH---HHH---HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeee Confidence 9999433333333333322222 111 112334445555544444333 2 22222 346777765 Q ss_pred hhhhhHH-------HHHHHHHHHHc--C-CCCHHHHHH-HhCCCCCCCCCEeeccceecccccccccccccccccccccc Q lcl|NC_012530. 421 GDTRSQQ-------DKLKSVQLELQ--T-ATTVNDYRE-KQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLT 489 (559) Q Consensus 421 l~~~d~~-------~~~~~~~~~~~--~-~~T~NE~R~-~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 489 (559) ...-.+. .|+.++..+-- | .++.+=||+ .|.+.-.+ +. .+..+.+.+...... T Consensus 451 Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDee-------------i~--~~~k~I~~E~k~~~~- 514 (524) T protein:vir:72 451 DSYFAELKEAEILERRINMLTMAEPFIGKYISHRTAMKDILQMTDEE-------------IE--QEAKQIEEESKEARF- 514 (524) T ss_pred cchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHH-------------HH--HHHHHHHHHhhcCCC- Confidence 4433333 33333332211 1 134444443 33332110 00 000011100000000 Q ss_pred cccccCCCCCCCCCCCCccccccc Q lcl|NC_012530. 490 QLESALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 490 ~~~~~~~~~~~~~~~~~~~~~~~~ 513 (559) .++++ +.+.+ T Consensus 515 ------~~~~~--------~~~~f 524 (524) T protein:vir:72 515 ------QDPDQ--------EQEDF 524 (524) T ss_pred ------CCCch--------hhhcC Confidence 00011 00111 No 207 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=98.22 E-value=2.4e-06 Score=51.39 Aligned_cols=427 Identities=12% Similarity=0.122 Sum_probs=175.3 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhcccccccccc-cccccc----ccccccccccccCCCCCcccHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEP-VDGNLM----FSTLEDTSIVPKPSPIAFGRITD 75 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~-~~~~~~----~~~~~~~~~~~~p~~~~~~~~~~ 75 (559) |++++|.. ..|+...........-+++..+.+.-..+ -..... +-.+....+...+....+ ... T Consensus 1 m~~~~~~k---------~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~--~~~ 69 (508) T protein:vir:15 1 MGLIQRIK---------DLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQASDGIK--KKR 69 (508) T ss_pred CChHHHHH---------HHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccCCCCc--ccc Confidence 99998873 11111111111100111111111100000 000000 000000000000000000 000 Q ss_pred HHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHH Q lcl|NC_012530. 76 VLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSF 155 (559) Q Consensus 76 ~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f 155 (559) ..........+++..|+-|..=+ -.+...+.. .....+.++|.. ..|+.. T Consensus 70 ---~~~sln~~~~i~~~~A~lv~~e~-----------~~i~v~~~~-------~~~e~l~~il~~---------n~f~~~ 119 (508) T protein:vir:15 70 ---LKNTINMAKTAARRIASVVFNEK-----------AEIHVKDNN-------EADKFLNDVLED---------NDFKNK 119 (508) T ss_pred ---ceeecchHHHHHHHHHhhhhCCC-----------ceEEeCCch-------HHHHHHHHHHHh---------ccHHHH Confidence 00111233445555555443211 112211111 111223444432 235556 Q ss_pred HHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCccccc-------------ccceEEEEE-----ecC-c Q lcl|NC_012530. 156 LRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRR-------------TRGKIYRQY-----IDN-K 216 (559) Q Consensus 156 ~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~-------------~~~~~y~~~-----~~~-~ 216 (559) +...+.+.+..|.+++-+..|.. . +.+.+++|..+.++....+.+. ....+|... .++ . T Consensus 120 ~~~~~e~a~a~G~~~~k~~~d~~-~-~~i~~v~ad~~~P~~~d~~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~ 197 (508) T protein:vir:15 120 FEEALEKGVALGGFAMRPYIDGN-H-IKIAWVRADQFYPLQSNTNDISEAAIASRTQRTESNQTKYYTLLEFHQWQDNGS 197 (508) T ss_pred HHHHHHHHhhcCceEEEEEEeCC-e-eEEEEEcCCeeEEEEEcCCCeEEEEEEEEEEeecCCCceEEEEEEEEEEecCcc Confidence 67778888999999998888753 3 4577788888776543333221 011112100 000 0 Q ss_pred ee---eeec----------------------ccc----------eEEEecccCC--CccCCcccccHHHHHHHHHHHHHH Q lcl|NC_012530. 217 VR---GSFT----------------------ADE----------MGMFIRNPRS--DILSGGYGLSELEMGLREFISHEN 259 (559) Q Consensus 217 ~~---~~~~----------------------~~e----------vi~~~~n~~~--~~~~~~~G~Spl~~~~~~i~~~~~ 259 (559) .. ..|. +.+ .+|++ +|.. ...+.++|+|.+.-+...+..... T Consensus 198 ~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~g~~~p~f~y~~-~~~~N~~~~~splG~S~~~~~~~lid~lD~ 276 (508) T protein:vir:15 198 YQITNELYKSDSPDIVGNQVPLSTLPVYKELAPQVTISGLQRPLFAYFK-TPGANNINIESPLGLGVVDNAKHVLDDIND 276 (508) T ss_pred eEEEEEEEecCCchhcCcccchhhcccccCCCcceEecCCCcceeEEec-CCccccccCCCCcCCchHhhhHHHHHHHHH Confidence 00 0000 001 11222 2211 122457899999999988888776 Q ss_pred HHHHHHHHHHhcCCCceEEE---ecCccCCccCCHHHHHHHHHHHHHHhcCcccccc-cccccCCceeeeecccc-chhH Q lcl|NC_012530. 260 TELFNDRFFTHGGTTKGILL---VKPSPSVTNTSMRALEDFKRHWTATSSGINGAYR-IPMITAEDAKFVSMTQA-EDMQ 334 (559) Q Consensus 260 ~~~~~~~~f~ng~~p~gil~---~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~-~~vl~~g~~~~~~ls~~-~D~q 334 (559) .-....+-|+.| .+..++. ++...... ..| .......+ +..-..++..++.++.. .+-+ T Consensus 277 ~~s~~~~e~~~~-~~~i~v~~~~l~~d~~~~-------~~~--------~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~ 340 (508) T protein:vir:15 277 THDQFIWEIRLG-QKHIAVQPGMLRFDDEHK-------PTF--------DTEQNVYVGVLSDDNNGLGVKDMTTPIRTVQ 340 (508) T ss_pred HHHHHHHHHHhc-ccceeechHHhcCCCCCc-------ccc--------CCCCeeEEeccCCCCCCCceeEeecccChHH Confidence 666666667654 4443331 11100000 000 00011111 00101112234444422 4567 Q ss_pred HHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhH--HHHHHHHHHHHhhHHHHHHHHHHHh-hcccc--- Q lcl|NC_012530. 335 FQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNN--QNKIDASKSKGLMPLLDMIAKNLTN-GIIRQ--- 408 (559) Q Consensus 335 f~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~--~~~~~~~~~~~l~P~~~~ie~~ln~-~L~~~--- 408 (559) +.+..+...+.|....|++|..+|+...+..++.+..+...... .......++.+|..++..|-...+. .++.. T Consensus 341 ~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~ 420 (508) T protein:vir:15 341 YKDAIDHFIKEFEVQIGLSTGTFSYSNDGVKTATEVVSNNSMTYQTRSSYLTMVEKAIDELCQSIFELANAGALFDDGKP 420 (508) T ss_pred HHHHHHHHHHHHHHHhCCCchhcccccCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 88999999999999999999999987655443322211111111 1123344555666665555443321 11111 Q ss_pred -------ccCccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHh-CCCCCCCCCEeeccceecccccccccccc Q lcl|NC_012530. 409 -------ILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQ-GLPKIAGGDIILSAVYIQRLGQQEQIKQN 479 (559) Q Consensus 409 -------~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~-gl~pi~gGD~~~~~~~~~~l~~~~~~~~~ 479 (559) .....+.|.|+.....|..+.++.....+. |.|++-+++... |+.. +.-+. -+.. T Consensus 421 ~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v~aGi~s~e~~i~~~~g~~d-eea~~--------el~r------- 484 (508) T protein:vir:15 421 LFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVLAIGALSKQTFLQRNYGMTD-EQAAE--------ELAK------- 484 (508) T ss_pred ccccccccCCcceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCh-HHHHH--------HHHH------- Confidence 112346678888888888888777766665 558998887653 4321 00000 0000 Q ss_pred cccccccccccccccCCCCCCCCCCCCccccc Q lcl|NC_012530. 480 EFQRQQTRLTQLESALQNPSGTPPTLPPSSSN 511 (559) Q Consensus 480 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (559) .+.+ .+..... +..-.+.++.++| T Consensus 485 ----i~~E--~~~~~~~--~~~~~~~~g~~ge 508 (508) T protein:vir:15 485 ----IQSE--APTDTFE--GGRSAILNGGDGE 508 (508) T ss_pred ----HHHh--ccccCcc--ccccccCCCCCCC Confidence 0000 0000000 0000001111111 No 208 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=98.21 E-value=1.5e-06 Score=52.49 Aligned_cols=452 Identities=10% Similarity=0.091 Sum_probs=181.5 Q ss_pred Cc---chhhhccccccCCcchHHHHHHHHHHHHHHhhhhcccccccccccccccc-c------cccccccccccCCCCCc Q lcl|NC_012530. 1 MG---IFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLM-F------STLEDTSIVPKPSPIAF 70 (559) Q Consensus 1 ~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~-~------~~~~~~~~~~~p~~~~~ 70 (559) |- -..+|.+.++.++=.+ +.+..+..+. +.....-..+...+.... . .+++.+++ -+...+. T Consensus 1 ~~~~~~~~~lf~f~~~~de~~-~~~~~~~~~~-----S~~~p~~~dGa~~I~~~~~~~~~~~~~q~~y~~~--e~~~~~~ 72 (524) T protein:vir:10 1 MANFNTILSFLKPWANEDEKE-YKQQINNNLE-----SVTAPKLDDGAREIETQEQNIPYNALMQQMFGSN--EPEVKNT 72 (524) T ss_pred CCchhhHHHHhhhhhcchhhh-hhhhhccCCC-----ccccCCCCCCceeeccCcccccchhhhhhhhhcc--cchhhhH Confidence 21 1222222233221111 1111111000 000001111111110000 0 00111111 1122222 Q ss_pred ccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChh Q lcl|NC_012530. 71 GRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRD 150 (559) Q Consensus 71 ~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~ 150 (559) ..+...-+..+.+|.|..+|.-|.+.+. ..+.+....++.+.+.. .....+.++..--..+.+... ... T Consensus 73 ~eLI~~YR~ma~~pEvd~Av~eIVneai------v~d~~~~pV~l~Ld~~~-~s~siK~kI~eeF~~Il~ll~-F~~--- 141 (524) T protein:vir:10 73 RELIDTYRNLMNNYEVDNAVQEIVSDAI------VYEDDKEVVALNLDGTD-FSQSIKDKILAEFSEVLNLLN-FQR--- 141 (524) T ss_pred HHHHHHHHHHhhccchhhHHHHhhccee------EecCCCceEEEEecccC-cchHHHHHHHHHHHHHHHHhc-cch--- Confidence 2333333455778999999998888753 34455556666664433 444444444443333333311 111 Q ss_pred hHHHHHHHHHHHHHHcCCcceEEEECC---CCcEEEEEEecCceEEEEe-----cCcccccccc-eEEEEEecC------ Q lcl|NC_012530. 151 DFTSFLRKLVRDTYTYDQVNYENTYDS---NGRLSHTRMVDPTTIYFAN-----DEHGHRRTRG-KIYRQYIDN------ 215 (559) Q Consensus 151 ~~~~f~~~~v~d~ll~Gna~~~i~rd~---~G~~~~L~~l~p~~V~~~~-----~~~g~~~~~~-~~y~~~~~~------ 215 (559) --..+++.+++.|..|..++-|. ..-+.+|..|||.+|+.++ ...|.....+ ..|+.+..+ T Consensus 142 ----~~~~~fR~WYVDgRi~fHkiid~~~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~~~~ 217 (524) T protein:vir:10 142 ----KGTDHFQRWYVDSRIFFHKIINPKKMKDGVQELRRLDPRQVQYIREIVTRMEDGVKIVDGYREFFVYDTGHESYCA 217 (524) T ss_pred ----hhhHHHhhheeeceEEEEEEeeCCCccccceeeeeeCCccceeeeeecccCcccchhhcchhhheeecCCCccccc Confidence 12234556678999999987763 2349999999999986532 2222211111 112222111 Q ss_pred -------ceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCcc Q lcl|NC_012530. 216 -------KVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTN 288 (559) Q Consensus 216 -------~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~ 288 (559) +....++.+.|+|......+. .+..=+|-|..|...+....-.+...-=|==.-|.-+-|.-++-+..|.. T Consensus 218 ~~~~~~~~~~ikI~~dAIvy~~SGL~d~--~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnlPk~ 295 (524) T protein:vir:10 218 DGRIYSAGTKVKIPRAAVVYAHSGLLDC--CGKNIIGYLQRAIKPANQLKLMEDAMVIYRITRAPDRRVFYIDTGNMPSR 295 (524) T ss_pred CcceecCCcceecchhheeeeccCcccC--CCCceeccchHhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCch Confidence 111235566676654322211 11122455777777777666666655444333333344554544333322 Q ss_pred CCHHHHHHHHHHHHHHh-----cC-cccccc-ccccc--------CC-ceeeeeccccchhHHHHHHHHHHHHHHHHhCC Q lcl|NC_012530. 289 TSMRALEDFKRHWTATS-----SG-INGAYR-IPMIT--------AE-DAKFVSMTQAEDMQFQSWLNYLINIICALVAM 352 (559) Q Consensus 289 ~~~e~~~~l~~~~~~~~-----~G-~~nag~-~~vl~--------~g-~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgV 352 (559) -.++-+..+-..+++.. .| ..+..+ ..+++ +| +.++..|.-...+--++-..|..+.+.++++| T Consensus 296 KAeqYl~~im~k~kNKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnV 375 (524) T protein:vir:10 296 KAAAQMQHIMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTMPGATGMSDMDDVLYFRTALYRALRI 375 (524) T ss_pred hHHHHHHHHHHhcCceeEEeccCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCC Confidence 11111111211111111 01 011111 11221 11 23344332223344466777888899999999 Q ss_pred CHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhc----c-----ccccC----ccceeeec Q lcl|NC_012530. 353 DPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGI----I-----RQILG----DNYMLEFV 419 (559) Q Consensus 353 Pp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L----~-----~~~~~----~~~~~~f~ 419 (559) |.+.|+-...++++.+.++..++. +. -....|.-+..++...|...| + ++.++ ..+.|+|. T Consensus 376 P~sRl~~e~~~~f~~gr~~EItRD---Ei---KF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~ 449 (524) T protein:vir:10 376 PESRIPSESNSGVMFDAGTAITRD---EL---KFAKWIRQLQNKFEEIFLDPLKTNLILKKIITEDEWEREINNIKVTFN 449 (524) T ss_pred CchhccCCCCccccccccchhhHH---HH---HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEee Confidence 999997655555544433333322 11 112234445555444443333 2 22222 34677776 Q ss_pred chhhhhHH-------HHHHHHHHHHc--C-CCCHHHHHH-HhCCCCCCCCCEeeccceeccccccccccccccccccccc Q lcl|NC_012530. 420 GGDTRSQQ-------DKLKSVQLELQ--T-ATTVNDYRE-KQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRL 488 (559) Q Consensus 420 ~l~~~d~~-------~~~~~~~~~~~--~-~~T~NE~R~-~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 488 (559) ....-.+. .|+.++..+-- | .++.+=||+ .|.+.-.+ +. .+..+.+.+.... T Consensus 450 ~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDee-------------i~--~~~k~I~~E~k~~-- 512 (524) T protein:vir:10 450 RDSYFSEMKDAEIMERRINMLTMAEPFIGKYISHQTAMKDFLQMTDEE-------------IN--QEAKQIEEESKEA-- 512 (524) T ss_pred ecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHH-------------HH--HHHHHHHHHhhcC-- Confidence 54433333 33333332211 1 134444443 33332110 00 0000111000000 Q ss_pred ccccccCCCCCCCCCCCCccccccc Q lcl|NC_012530. 489 TQLESALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 489 ~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (559) ...++++.+ +.+ T Consensus 513 -----~~~~~~~~~--------~~f 524 (524) T protein:vir:10 513 -----RFQNPDEEE--------EDF 524 (524) T ss_pred -----CCCCCChhh--------hcC Confidence 000011111 111 No 209 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=98.20 E-value=2.7e-06 Score=51.11 Aligned_cols=420 Identities=10% Similarity=0.010 Sum_probs=169.8 Q ss_pred CcchhhhccccccCCc-----------chHHHHHHHHHHH-----HHHhhhhcccccccccccccccccccccccccccc Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDP-----------NAFFKHIDSKIAN-----DTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPK 64 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~-----~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~ 64 (559) |--+++-.++...+.+ .+.|+.+-..... +.+..=-.|++....++.. .... .... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~-----~~~~---~~~~ 72 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYETQEEMIIRLINDHKPKIDDITVGERYYNHDPDVLRLAPK-----LDNK---GEID 72 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhccCCcchhccch-----hccc---cccc Confidence 4444333322222222 1222222111111 1111111133222111100 0000 0000 Q ss_pred CCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCC Q lcl|NC_012530. 65 PSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVD 144 (559) Q Consensus 65 p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~ 144 (559) + .+ ...+ ...+....+|+..+.-+..-| ..+... ..+..+.+..|+.+ T Consensus 73 ~---~~----~~~k--i~~n~~~~Ivd~~~~~l~g~p-----------~~~~~~--------d~~~~~~l~~~~~n---- 120 (474) T protein:vir:96 73 P---LK----PDWR--MFTNYHQNLVDQKVAYAVANP-----------VTFSSD--------DDKSLKTIQEVLNH---- 120 (474) T ss_pred c---cc----cchh--cccchHHHHHHhhhhhhcccC-----------ceeecC--------chHHHHHHHHHHhc---- Confidence 0 00 0001 113445555655554432212 122111 11223445555532 Q ss_pred CCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcc-cccccceEEEEEecCceeeeecc Q lcl|NC_012530. 145 YSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHG-HRRTRGKIYRQYIDNKVRGSFTA 223 (559) Q Consensus 145 ~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g-~~~~~~~~y~~~~~~~~~~~~~~ 223 (559) .+......+..+++.+|.+|..+.++.+|++. +..++|..+.++.+... ......++|+..........+.. T Consensus 121 ------~~~~~~~~~~~~~~~~G~~~~~~y~d~~~~~~-i~~~~p~~~~~v~d~~~~~~~~~~vr~~~~~~~~~~~~yt~ 193 (474) T protein:vir:96 121 ------KWDDKLVDILTAASNKGIEWLQPYIDENGEFK-TFRVPAEQAIPIWTNKERDTLKAFIRYYRLDGAERVEYWTD 193 (474) T ss_pred ------CHHHHHHHHHHHHHhcCeeEEEEEecCCCceE-EEEEcccceEEEEcCCCCCceEEEEEEEeecCceEEEEEeC Confidence 12234455667889999999999899888865 88899999998876432 12222333333322222222333 Q ss_pred cceEEEec--------------------------cc-----CCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_012530. 224 DEMGMFIR--------------------------NP-----RSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGG 272 (559) Q Consensus 224 ~evi~~~~--------------------------n~-----~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~ 272 (559) ..+.++.. |+ .-......+|.|-++.....++....+..-..+.+...+ T Consensus 194 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~ 273 (474) T protein:vir:96 194 SDVTYYEYQDGILIPDYYHGEEHIQSHYYVGNKRVSWGRVPFIPFKNNPQEMSDLFMYKTIIDAMDKRLSDTQNTFDEST 273 (474) T ss_pred CeEEEEEecCCceeeccccccccccccccccccccCCCceeEEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 33332211 00 000001234677776666666655555555556666666 Q ss_pred CCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhC Q lcl|NC_012530. 273 TTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVA 351 (559) Q Consensus 273 ~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fg 351 (559) .|-.++. +.. + +..+.+...+ ...++..+.+.+.+++.++.+ ....+....+...+.|+..-+ T Consensus 274 ~~~lv~~--g~~--~----~~~~~~~~~~--------~~~~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~ 337 (474) T protein:vir:96 274 ELIYILK--GYE--G----QDLDEFMRNL--------KYYKAINVDGDGSGVDTIQIEVPVQSSKEYLDMLRDYVIEFGQ 337 (474) T ss_pred cceeeee--cCC--c----ccccchhhhh--------hcCceEEecCCCCceeEEeecCChHHHHHHHHHHHHHHHHHhC Confidence 6654443 211 1 1111111111 123444454333334444322 233566777888899999999 Q ss_pred CCHHHhccccccccccccccc--hhhhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhH Q lcl|NC_012530. 352 MDPAEIGMQNRGGATGNKSNS--LNESNN---QNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQ 426 (559) Q Consensus 352 VPp~~lg~~~~~~~~~~~~~~--~~~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~ 426 (559) +|..-.+ +..++.++.. .-++.. ....+.++..+|+-+++.|...+. .......+.+.|+.....+. T Consensus 338 ~p~~~~~----~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~----~~~~~~~i~i~f~~~~p~~~ 409 (474) T protein:vir:96 338 GVDFQQD----KFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYK----LNIKVQDVEITFNFNVMVNE 409 (474) T ss_pred Ccccccc----ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC----CCcccceeeEEeccCCCcCH Confidence 9864321 1000100000 000111 111223344444444444433221 12223456778888888888 Q ss_pred HHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCC Q lcl|NC_012530. 427 QDKLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLP 506 (559) Q Consensus 427 ~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (559) .+.++.+.. .|.|+...++++++. ++.-+ ..+..+.+ +......... ...++..+..++++ T Consensus 410 ~e~~~~~~~--ag~iS~et~~~~~~~--v~d~~--------~E~~ri~~------E~~e~~~~~~-~~~~~~~~~~~d~~ 470 (474) T protein:vir:96 410 LEQSQIGVQ--SQYLSKETVVTNHPW--VDDPV--------AELERIEQ------DNIDFNKQLP-PLEGDANGRAQDNE 470 (474) T ss_pred HHHHHHHHh--cCCCchHHHHHhCCC--CCCHH--------HHHHHHHH------HHHHHHhccc-ccccccccccCCCc Confidence 877776543 467888888887653 22100 11111110 0000000000 00000000000000 Q ss_pred ccccccchhcccccccccccccccccc Q lcl|NC_012530. 507 PSSSNSFQQNQEGYTGKDAKPSGKDNQ 533 (559) Q Consensus 507 ~~~~~~~~~~~~~~~~~~~~~~g~~~~ 533 (559) + ++ . T Consensus 471 ------~-------------e~----~ 474 (474) T protein:vir:96 471 ------S-------------ET----N 474 (474) T ss_pred ------c-------------cC----C Confidence 0 00 0 No 210 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=98.18 E-value=3e-06 Score=50.84 Aligned_cols=440 Identities=10% Similarity=-0.014 Sum_probs=171.9 Q ss_pred Ccchhh--hccccccCC---cchHHHHHHHHHH-HHHHhhhhccccccccccccccccccccccccccccCCCCCcccHH Q lcl|NC_012530. 1 MGIFDR--FRTKFYTDD---PNAFFKHIDSKIA-NDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRIT 74 (559) Q Consensus 1 ~~~~~~--~~~~~~~~~---~~~~~~~~~~~~~-~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~ 74 (559) |.|+-+ |+..+-..+ ++..+.+..+..- .+.+..=-.|++.....+. ......+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~------------------~~~~~~~~- 61 (499) T protein:vir:10 1 MAVVIDKDLLDDVNEPNIEAINYAIRELQNRKKRLDKLSDYYNGKQEIEKHEF------------------DNATVEAA- 61 (499) T ss_pred CccchhhhHHhhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcCCc------------------CcCCCCcc- Confidence 776532 211111111 1222222221110 0111111222221111110 00000000 Q ss_pred HHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHH Q lcl|NC_012530. 75 DVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTS 154 (559) Q Consensus 75 ~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~ 154 (559) + ...+....+|+..+.-+.+ .+..+...+ .+..+.+.+++.. ..+.. T Consensus 62 ---k--i~~n~~~~Iv~~~~~~l~g-----------~p~~~~~~~--------~~~~~~l~~~~~~---------n~~~~ 108 (499) T protein:vir:10 62 ---N--VMVNHAKYITDMNVGFMTG-----------NPVKYVAEK--------GKNIDDILEVFNQ---------IDIHK 108 (499) T ss_pred ---e--eecchHHHHHHHHhhhhcc-----------cCceeecCC--------hhHHHHHHHHHhh---------cCHhH Confidence 0 0123444455555443321 111221111 1122334444433 12345 Q ss_pred HHHHHHHHHHHcCCcceEEEECCCCcE----------------EEEEEecCceEEEEecCccc-ccccceEEEEEecC-- Q lcl|NC_012530. 155 FLRKLVRDTYTYDQVNYENTYDSNGRL----------------SHTRMVDPTTIYFANDEHGH-RRTRGKIYRQYIDN-- 215 (559) Q Consensus 155 f~~~~v~d~ll~Gna~~~i~rd~~G~~----------------~~L~~l~p~~V~~~~~~~g~-~~~~~~~y~~~~~~-- 215 (559) +...+..+.+.+|.+|.++..+.+|.+ ..+..++|..+.++.+..+. .....++|+...+. T Consensus 109 ~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~~ 188 (499) T protein:vir:10 109 HDIELEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLTPNTELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDLEG 188 (499) T ss_pred HHHHHHHHHHhcCceEEEEEecccccccccccccccccccccceEEEEEcccceEEEecCCCCcceEEEEEEEEEeecCC Confidence 667778889999999999988887753 34777889888777665432 12223333332211 Q ss_pred c----eeeeecccceEEEec-----------------ccCC-----CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012530. 216 K----VRGSFTADEMGMFIR-----------------NPRS-----DILSGGYGLSELEMGLREFISHENTELFNDRFFT 269 (559) Q Consensus 216 ~----~~~~~~~~evi~~~~-----------------n~~~-----~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ 269 (559) . ....+.++.+.++.. |+.. .......|.|-++.+...++....+..-..+.+. T Consensus 189 ~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~ 268 (499) T protein:vir:10 189 NTNGYSITVYMPQRIVEYRTKTTMEVSANDPIVYDGENLFGAVPIIEFRNNEERQGDFEQLISLIDAYNLLQTDRISDKE 268 (499) T ss_pred CceEEEEEEEeCCeEEEEEecCCccccCcceecccccCCCCccceEEecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHH Confidence 1 112334444443321 0000 0111234666666655555555545444555555 Q ss_pred hcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc-cCCceeeeecccc-chhHHHHHHHHHHHHHH Q lcl|NC_012530. 270 HGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI-TAEDAKFVSMTQA-EDMQFQSWLNYLINIIC 347 (559) Q Consensus 270 ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl-~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia 347 (559) ..+.|-.+++ +... .+ ..+....+ +.+.+..+ .+++.+++.++.+ ....+....+...+.|. T Consensus 269 ~~~~~~lv~~--G~~~-~~-~~~~~~~~------------~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~ 332 (499) T protein:vir:10 269 AFVDALLVTF--GFGL-GD-DKDDIQRL------------KRGAIEAPPREEGADIEWLTKSFDETQVNLLSQSIENDIH 332 (499) T ss_pred HhcCceeeee--cCcc-cc-ccchhhhh------------hhcceeccCCCCCCcceEEeccCCHHHHHHHHHHHHHHHH Confidence 5566655553 2111 11 11111111 11222222 2233344444432 23345666777778888 Q ss_pred HHhCCCHHHhccccccccccccccch---hhhh---HHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecch Q lcl|NC_012530. 348 ALVAMDPAEIGMQNRGGATGNKSNSL---NESN---NQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGG 421 (559) Q Consensus 348 ~~fgVPp~~lg~~~~~~~~~~~~~~~---~~an---~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l 421 (559) ..-++|..--+ . ..++.++.. -++. -....+..+..+|.-++..+...++..- .......+.+.|... T Consensus 333 ~~s~~p~~~~~----~-~~gn~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~-~~~d~~~i~i~f~~~ 406 (499) T protein:vir:10 333 KISYVPNMNDE----K-FMGNVSGEAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTIVNIKG-ANDDASGCKISLVAN 406 (499) T ss_pred HHhCcccCCch----h-hcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-CccccccceEEeCCC Confidence 87777742111 0 000001000 0011 1122233444455555555544443221 111234578888888 Q ss_pred hhhhHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCC Q lcl|NC_012530. 422 DTRSQQDKLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGT 501 (559) Q Consensus 422 ~~~d~~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (559) ...+..+.++.+..+ .|+++.--++++++. +++ . ...+..+.+.. ++........ ....+++.. T Consensus 407 ~p~n~~e~~~~~~kl-~g~iS~et~~~~l~~--v~d--~------~~E~~ri~~E~----~~~~~~~~~~-~~~~~~~~~ 470 (499) T protein:vir:10 407 IPSNLSDVVNNVKNA-DGIIPRKYTYSWLPD--VDN--P------QDVIDEMNQQD----AETIKKNQEA-LRGQDPDRL 470 (499) T ss_pred CCCCHHHHHHHHHHH-hccCChHHHHHhCCC--CCC--H------HHHHHHHHHHH----HHHHHHHHhh-hccCCCCCC Confidence 888888888888765 566888778877654 221 0 01111111100 0000000000 000111111 Q ss_pred CCCCCccccccchhccccccccccccccccccccccccccc Q lcl|NC_012530. 502 PPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQL 542 (559) Q Consensus 502 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 542 (559) ...+.+... .+...++......+||.-.. T Consensus 471 ~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~ 499 (499) T protein:vir:10 471 ELEDKQDDS------------SENDKEAGSNHNQSHRTRAV 499 (499) T ss_pred CCCCCCccc------------CCCCCCCccccccCCCCCCC Confidence 110000000 00000111112223333222 No 211 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=98.17 E-value=3.2e-06 Score=50.72 Aligned_cols=450 Identities=9% Similarity=0.094 Sum_probs=180.2 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhcccccccccccccccccccccc-ccc-----cccCCCCCcccHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLED-TSI-----VPKPSPIAFGRIT 74 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~-~~~-----~~~p~~~~~~~~~ 74 (559) .++|+.+ ...++++. ++...++. .+.....-..+...+......+... +++ ..-+...+...+. T Consensus 5 l~~~~~~----~~~~~~~~----~~~~~~~~--~s~~~P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~~e~~~~~~~eLI 74 (521) T protein:vir:81 5 LKMLARW----ADFDNDKY----EEQIKDKA--ESIAAPKNNDGATEVEINDNLPASAWNSLTQQFYSTDQKISTTKQLV 74 (521) T ss_pred hhhhHhh----cCchhhhH----HhhhccCc--cccccCCCCCCceEecccCCCcceeecceeeeecccccchhhHHHHH Confidence 2333322 11111111 11100000 0000011111111111111111110 000 1111112222233 Q ss_pred HHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHH Q lcl|NC_012530. 75 DVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTS 154 (559) Q Consensus 75 ~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~ 154 (559) ..-+..+.+|.|..||.-|.+.+. ..+.+....++.+.+. +.....+.++..--..+.+... ... T Consensus 75 ~~YR~ma~~pEvd~Av~eIVneai------v~d~~~~pV~l~L~~~-~~s~~iK~kI~eeF~~Il~ll~-F~~------- 139 (521) T protein:vir:81 75 NTYRGLMNNHEVENAVQNIVNDAI------VFEEGHEVVSLNLEAT-GFSESVKERIHEEFKDLLNTIQ-FDR------- 139 (521) T ss_pred HHHHHHhhccchhhHHHHhhccee------EecCCCceEEEEeccc-ccchHHHHHHHHHHHHHHHHhc-cch------- Confidence 333455778999999998888753 3445555666666433 3444444444433333333311 111 Q ss_pred HHHHHHHHHHHcCCcceEEEECCC--CcEEEEEEecCceEEEEec-----Ccc-cccccceEEEEEecCc---------- Q lcl|NC_012530. 155 FLRKLVRDTYTYDQVNYENTYDSN--GRLSHTRMVDPTTIYFAND-----EHG-HRRTRGKIYRQYIDNK---------- 216 (559) Q Consensus 155 f~~~~v~d~ll~Gna~~~i~rd~~--G~~~~L~~l~p~~V~~~~~-----~~g-~~~~~~~~y~~~~~~~---------- 216 (559) --..+++.+++.|..|+.++-|.+ .-+.+|..|||.+|+.++- ..| .+.....-|+.+..+. T Consensus 140 ~~~~~fR~WYVDgRi~fhkiid~~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~ 219 (521) T protein:vir:81 140 RGQDMFRRWYVDSRIFFHKIIGKNPKDGIVELRQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVF 219 (521) T ss_pred hhhHHHhhhhhcceEEEEEEEcCCccccceeeeeeCCcceeeeeeecccccCccceecceeeeeeeecCCccccccceee Confidence 122345566789999999996644 4499999999999876432 111 1111111122221111 Q ss_pred ---eeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHH Q lcl|NC_012530. 217 ---VRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRA 293 (559) Q Consensus 217 ---~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~ 293 (559) ....+ +.+.|++.+.-.-+. +++.=+|-|..|...+....-.+...-=|==.-|.-+-|.-++-+..|..-.++- T Consensus 220 ~~~~~vkI-~~dAI~y~hSGl~d~-~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqY 297 (521) T protein:vir:81 220 SPNSRVKI-PRSAITYAHSGLMDC-DDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQH 297 (521) T ss_pred cCCcceee-chhheeeeeccceeC-CCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHH Confidence 11122 334444433111121 1222246688888777776666665554433334445555555443332221222 Q ss_pred HHHHHHHHHHH-----hcC-ccc-ccccccccC--------C-ceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHHh Q lcl|NC_012530. 294 LEDFKRHWTAT-----SSG-ING-AYRIPMITA--------E-DAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAEI 357 (559) Q Consensus 294 ~~~l~~~~~~~-----~~G-~~n-ag~~~vl~~--------g-~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~l 357 (559) +..+-..+++. ..| ..+ ..-+.+++. | +.++..|.-...+--++-..|..+.+.++++||.+.| T Consensus 298 l~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl 377 (521) T protein:vir:81 298 MNSVAQSFKNRVVYDASTGKLKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDIDDIRYFNRKLYEALRVPLSRS 377 (521) T ss_pred HHHHHHhcCceeEeecccccccccccccchhhhhcccccCCCcccceeecccCCCCChHHHHHHHHHHHHHHhCCccccc Confidence 22222222221 011 111 111122211 1 2334433222334446667788889999999999999 Q ss_pred ccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHh----hcc-----ccccC----ccceeeecchhhh Q lcl|NC_012530. 358 GMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTN----GII-----RQILG----DNYMLEFVGGDTR 424 (559) Q Consensus 358 g~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~----~L~-----~~~~~----~~~~~~f~~l~~~ 424 (559) +....+.++.+.++..++. +. -....|.-+..++...|.. .|+ ++.++ ..+.|+|.....- T Consensus 378 ~~e~~~~~~~Gr~~EItRD---Ei---KF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f 451 (521) T protein:vir:81 378 NLSDANMVIGGDGSEITRD---EL---EFSKFIRTRQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYY 451 (521) T ss_pred cCCCCcceeccccchhhHH---HH---HHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchH Confidence 7655555443333333322 11 1122344444444444433 322 22222 3467777654433 Q ss_pred hHH-------HHHHHHHHHHc--C-CCCHHHHHH-HhCCCCCCCCCEeeccceecccccccccccccccccccccccccc Q lcl|NC_012530. 425 SQQ-------DKLKSVQLELQ--T-ATTVNDYRE-KQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLES 493 (559) Q Consensus 425 d~~-------~~~~~~~~~~~--~-~~T~NE~R~-~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 493 (559) .+. .|+.++..+-- | .++.+=||+ .|.+.-.+ +. .+..+.+.+...... T Consensus 452 ~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~ILr~tDee-------------i~--~~~k~I~~E~~~~~~----- 511 (521) T protein:vir:81 452 TEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDILKYTDDQ-------------MD--TEKKQIEEEANDPRF----- 511 (521) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHH-------------HH--HHHHHHHHHhhCCCC----- Confidence 333 33333332211 1 134444443 33332110 00 000011000000000 Q ss_pred cCCCCCCCCCCCCccccccc Q lcl|NC_012530. 494 ALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 494 ~~~~~~~~~~~~~~~~~~~~ 513 (559) .++++ +-+.+ T Consensus 512 --~~p~~--------~~~~f 521 (521) T protein:vir:81 512 --KQTPD--------EIEDF 521 (521) T ss_pred --CCCcc--------cccCC Confidence 00110 11111 No 212 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=98.17 E-value=3.3e-06 Score=50.65 Aligned_cols=364 Identities=11% Similarity=0.098 Sum_probs=152.6 Q ss_pred ccccccccccccccccccccccCCCCCcccHHHHHHHHhh--ChH------HHHHHHHHHHHHHhhhhHhhhhcCC---- Q lcl|NC_012530. 43 YTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSM--NVV------LNAIINTRANQVTEYAHRASTDDNG---- 110 (559) Q Consensus 43 ~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~--~~~------v~acv~~ia~~ia~~~~~~~~~~~g---- 110 (559) .....+. .. ........+-...+..|+. .++ +....+.+.+.+..+|..++..... T Consensus 1 m~~~~i~-----~L-------~~~~~~~~~r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vd~~a~rl~~ 68 (422) T protein:vir:97 1 MNYMGMG-----YL-------RRKLALFKTGVDKRYRYYAMDDRDDTRSIVMPNNVREMYRSVLEWTAKGVDSLADRIIF 68 (422) T ss_pred CChHHHH-----HH-------HHHHHHHHHHHHHHHHHHhcCCChhhcCccccHHHHHHHHhhcchhHHHHHHHHhcccc Confidence 0000000 00 0000011111122233322 111 1122333334455566555544322 Q ss_pred cceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECC-CCcEEEEEEecC Q lcl|NC_012530. 111 MGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDS-NGRLSHTRMVDP 189 (559) Q Consensus 111 ~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~-~G~~~~L~~l~p 189 (559) .||.+ .| . .+..++.. ..+......+..+.|++|.+|+.|.++. .|.| .+.+++| T Consensus 69 ~Gf~~--~d------~------~l~~~w~~---------N~ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p-~i~~~sp 124 (422) T protein:vir:97 69 REFTN--DD------F------NAWEIFKA---------NNPDIFFDTAIQSALIASCCFVYIMPGAEDGLP-KMQVIEA 124 (422) T ss_pred ceeeC--Cc------h------hHHHHHHh---------cChHHHHHHHHHHHHHhcceeEEEeeCCCCCee-EEEEech Confidence 12211 00 0 11222221 1223455577889999999999999875 5665 5888999 Q ss_pred ceEEEEecCcccccccceEEEEEec-Ccee-eeeccc----------------------ceEEEecccCCCccCCccccc Q lcl|NC_012530. 190 TTIYFANDEHGHRRTRGKIYRQYID-NKVR-GSFTAD----------------------EMGMFIRNPRSDILSGGYGLS 245 (559) Q Consensus 190 ~~V~~~~~~~g~~~~~~~~y~~~~~-~~~~-~~~~~~----------------------evi~~~~n~~~~~~~~~~G~S 245 (559) .++.++.|...........++.... +... ..+..+ =|++|..++. ....+|.| T Consensus 125 ~~~~~i~D~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~---~~~~~G~s 201 (422) T protein:vir:97 125 SKATGILDPTTFLLTEGYAILESDSNGNPTLEAYFTDKDIWYYPKKGKPYNIKNPTGHPLLVPIIHRPD---AVRPFGRS 201 (422) T ss_pred hhEEEEEeCCCCcceeeEEEEEecCCCcEEEEEEEcCceEEEEcCCCccccccCCCCCcceEEecccCC---CccccCcc Confidence 9998887755433222211111111 1110 001111 1334443332 23457877 Q ss_pred HH----HHHHHHHHHHHHHHHHHHHHHHhcCCCceEEE-ecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccC- Q lcl|NC_012530. 246 EL----EMGLREFISHENTELFNDRFFTHGGTTKGILL-VKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITA- 319 (559) Q Consensus 246 pl----~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~-~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~- 319 (559) .| ..+.+++...+.-......||.. |.-+|. ++. +....+ .|+... +++..++. T Consensus 202 ~I~e~v~~l~da~~r~~~~~~~~~e~~a~---pqr~i~G~d~-------d~~~~~----~~~~~~------~~i~~~~~d 261 (422) T protein:vir:97 202 RITKAGMYHQKAAKRTLERAEVTAEFYSF---PQKYVLGMDP-------DAKPME----KWRATV------STLLEISKD 261 (422) T ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHhcc---hhhhhcccCc-------ccccCc----hhhhhh------hhhhccCCC Confidence 54 44445555444444444555544 433331 111 111111 232221 23333321 Q ss_pred ---CceeeeeccccchhH-HHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhh----HHHHHHHHHHHHhh Q lcl|NC_012530. 320 ---EDAKFVSMTQAEDMQ-FQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESN----NQNKIDASKSKGLM 391 (559) Q Consensus 320 ---g~~~~~~ls~~~D~q-f~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an----~~~~~~~~~~~~l~ 391 (559) ++.++..+.. .+++ |++..+..+..|+.+=++|++.+|....+.+++. +....... ++.. +......++ T Consensus 262 e~~~~~~v~q~~~-~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~-Ai~a~~~~L~~ka~~k-~~~fg~~l~ 338 (422) T protein:vir:97 262 EDGDKPTVGQFTT-ASMAPFMEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVE-SIKAAHENLRAAGRKA-QRSFSSGFL 338 (422) T ss_pred CCCCcceeeecCC-CChhHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHH-HHHHHHHHHHHHHHHH-HHHHHHHHH Confidence 2345555542 3454 8999999999999999999999997553211110 00000000 0111 111111222 Q ss_pred HHHHHHHHHHHhhcc-ccccCccceeeecchh---hhhHHHHHHHHHHHHcC--C-CCHHHHHHHhCCCCCCCCCEeecc Q lcl|NC_012530. 392 PLLDMIAKNLTNGII-RQILGDNYMLEFVGGD---TRSQQDKLKSVQLELQT--A-TTVNDYREKQGLPKIAGGDIILSA 464 (559) Q Consensus 392 P~~~~ie~~ln~~L~-~~~~~~~~~~~f~~l~---~~d~~~~~~~~~~~~~~--~-~T~NE~R~~~gl~pi~gGD~~~~~ 464 (559) -++..+. ++....- .+.......+.|.... ..+..+.++.+.+++.. + +...-+++++|+.+.+ .- T Consensus 339 ~~~rla~-~~~~~~~~~~~~~~~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~~~~~---~~--- 411 (422) T protein:vir:97 339 NVAYIAV-CLRDEFPYLRNQFMDTVIKWEPLFEADANMLTLVGDGAIKLNQAIPGFMDADVIRDLTGVKGAD---KP--- 411 (422) T ss_pred HHHHHHH-HHhcCCcccchhhccceEEEccCCCCChHHHHHHHHHHHHHHhhccccccHHHHHHHcCCCchh---HH--- Confidence 2222111 1111100 0112234567776443 33355556666555443 3 4677889999985431 10 Q ss_pred ceeccccccccccccc Q lcl|NC_012530. 465 VYIQRLGQQEQIKQNE 480 (559) Q Consensus 465 ~~~~~l~~~~~~~~~~ 480 (559) ...+.. ...+. T Consensus 412 --~~~~~~---~~~d~ 422 (422) T protein:vir:97 412 --IPAITE---VTTDG 422 (422) T ss_pred --HHHHHh---hhccC Confidence 000100 00000 No 213 >protein:vir:106027 Length: 629 # NCBI annotation: gp9 # Family: family:all:2798 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654906;genbank:gi:109392362;genbank:GeneID:4157055 Probab=98.16 E-value=2.4e-06 Score=51.40 Aligned_cols=482 Identities=11% Similarity=0.064 Sum_probs=197.5 Q ss_pred HHHHHHHHHHHhhhhcccccccc-ccccccccccccccccccccCCCCC---cccH-HHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_012530. 22 HIDSKIANDTASKALNGVDRAYT-EPVDGNLMFSTLEDTSIVPKPSPIA---FGRI-TDVLRQYSMNVVLNAIINTRANQ 96 (559) Q Consensus 22 ~~~~~~~~~~~~~~~~gr~~a~~-~~~~~~~~~~~~~~~~~~~~p~~~~---~~~~-~~~~~~~~~~~~v~acv~~ia~~ 96 (559) |...+ -.+.-|.|... +..+-..+ ++ .-.+....-+..+ +... .+-...+.-.+.++-.|.-|+++ T Consensus 1 ma~~~-------lrv~rrpk~~p~~r~l~aas-qp-~~P~~~~~~~~~g~~~~~~WQ~eAW~~~d~VgElryyvgW~~ss 71 (629) T protein:vir:10 1 MAAST-------LRVSRRPKGSPARRSLTAAS-QP-MEPGRTPSRQVAGTVVRTSWQNEAWECMDLVGELRYYVGWRASS 71 (629) T ss_pred CCccc-------eeEEecCCCccceeeecccc-CC-CCcchhhchhhhhhhhhhhhhHHHHHHHHhhhhHHHHhhhhhhh Confidence 11111 11111222110 00000000 00 0000000000000 0000 01111111224555556666777 Q ss_pred HHhhhhHhhhhcCCcceeeecccccccChhHHHH---HHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEE Q lcl|NC_012530. 97 VTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKK---IDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYEN 173 (559) Q Consensus 97 ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~---~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i 173 (559) ++..-++...-... ...++....+. ...+...+.+...-+ ..-.++++.+..++-+-|..|+.+ T Consensus 72 ~Sr~rL~as~idpD---------tg~ptg~i~ed~p~~~~v~~~v~~iagG~----lGqaqLlkr~~~~ltV~GE~~i~i 138 (629) T protein:vir:10 72 CSRVELIASELDPD---------TGKPTGGIRDDDPDGLRFLEIVKTMAGGP----LGQAQLQKRAAECLTVPGEHRICL 138 (629) T ss_pred heeeeEEEeeecCC---------CCCCccccccCchhHHHHHHHHHHhcCcc----chHHHHHHHHHhheeccCceEEEE Confidence 76543332211111 11111111111 111223333332221 122468999999999999999987 Q ss_pred EECCC----CcEEE-EEEecCceEEEEecCcccccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHH Q lcl|NC_012530. 174 TYDSN----GRLSH-TRMVDPTTIYFANDEHGHRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELE 248 (559) Q Consensus 174 ~rd~~----G~~~~-L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~ 248 (559) +--.. |-+.. .+.|...-|. .+|. +..-....++..-......|+++..++|.+.. ...--||+. T Consensus 139 l~~~~~~pd~~~r~~W~vVt~~Ei~----~kg~----g~~~i~lpdg~~he~~~~~D~l~RvW~P~Prr--~~e~DSpvr 208 (629) T protein:vir:10 139 LDQGDKNPDGSVRHNWYVVTNDEVK----NKGA----GKTDIELPDGTIHEYSKGRDVMFRVWNPRPRR--AKEPDSPVR 208 (629) T ss_pred eecCCCCCCcccccceeeecHHHhc----cccC----ceeEEEcCCCceeeeeCCCeeEEEeeCCCccc--ccCCcchhH Confidence 64333 33442 3333333332 1111 11122233343333345567777778776543 334669999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCc-------cCC----------HHHHHHHHHHH----HHHhcC Q lcl|NC_012530. 249 MGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVT-------NTS----------MRALEDFKRHW----TATSSG 307 (559) Q Consensus 249 ~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~-------~~~----------~e~~~~l~~~~----~~~~~G 307 (559) +++..+.-.....+...+..+.-.+-.|||.++...+-+ +.+ .-+.+.|...| ..++.. T Consensus 209 a~l~~lrEi~r~tk~i~~aakSRL~gnGvlflP~e~slp~~~ap~~~~~Pg~~~p~~~g~aa~d~l~~~l~q~a~aAi~D 288 (629) T protein:vir:10 209 ACLDSLREIIRTTKKIRNASKSRLIGNGVVFLPQELSLPRATAPVADNQPGAPVPIVDGVAAADELSNLLFQTAAAAVDD 288 (629) T ss_pred HHHHHHHHHHHhhhHhHHHHHhHHhhCceeEeccCcccccccCCCCCCCCcccccccCCCcchHHHHHHHHHHHHhhhcC Confidence 988888777666666555555444555666554433222 011 01223333333 233322 Q ss_pred c-ccccccccccCC------ceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHH-hccccccccccccccchhhhhHH Q lcl|NC_012530. 308 I-NGAYRIPMITAE------DAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAE-IGMQNRGGATGNKSNSLNESNNQ 379 (559) Q Consensus 308 ~-~nag~~~vl~~g------~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~-lg~~~~~~~~~~~~~~~~~an~~ 379 (559) . ..+--+||+... +++...+.+.-+.--+.+|+..+..+|....|||.. ||+..++..|+. + .-+. T Consensus 289 e~S~aA~vPiia~vP~E~l~~ikhLkf~~eite~~iktR~daI~RlAmglDispErLLGlGsd~NHWsA-W----qI~d- 362 (629) T protein:vir:10 289 EDSQAALIPLLATVPGEHLQKIFHLKIGNEITEVEIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSA-W----QIGD- 362 (629) T ss_pred CCCccceeeeEEeechHHhcCeeeeeecCchhHHHHhhHHHHHHHHHhccCCChhheeeccCCccceee-E----Eecc- Confidence 2 224456766332 233333333344556889999999999999999875 566433333321 0 0111 Q ss_pred HHHHHHHHHHhhHHHHHHHHHHHhhccccc---c---Cccceeeecc-hhhhhHHHHHHHHHHHHcCCCCHHHHHHHhCC Q lcl|NC_012530. 380 NKIDASKSKGLMPLLDMIAKNLTNGIIRQI---L---GDNYMLEFVG-GDTRSQQDKLKSVQLELQTATTVNDYREKQGL 452 (559) Q Consensus 380 ~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~---~---~~~~~~~f~~-l~~~d~~~~~~~~~~~~~~~~T~NE~R~~~gl 452 (559) .-++-.|.|.+..|+++|++.+|.+- + -.+|.+-|+. .+..|.....++....-+|.||-...|+.+|+ T Consensus 363 ----edvrlHI~P~l~~ic~Ait~~~Lrp~L~~eGiDp~~Yvvw~DaS~Lt~dPd~~deA~~a~drGaIt~eAlRr~lG~ 438 (629) T protein:vir:10 363 ----EDVQLHIKPVMEVLCAAIYREVLVATLRAEGIDPDRYVLWYDASGLTVDPDKTDEATAAKEQGAITHEAYRRYLGL 438 (629) T ss_pred ----cceeeecchHHHHHHHHHHhHHHHHHHHHhCCCHHHhEeeecCcccccCCCCcHHHHHHHHcCCccHHHHHHHhcc Confidence 12344689999999999998887531 2 2468888873 45555544445555555688999999999999 Q ss_pred CCCCCCC--Ee-----------eccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhcccc Q lcl|NC_012530. 453 PKIAGGD--II-----------LSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEG 519 (559) Q Consensus 453 ~pi~gGD--~~-----------~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (559) .--.+=| ++ ..+...++. ......+ .-.. ...+. +.......+..++++ ++++ T Consensus 439 ~~dd~y~~~t~~~~q~~A~~~v~~~P~Li~~---~apll~~-~l~~--i~~P~-p~~a~~~~~~~~~~~-------E~~~ 504 (629) T protein:vir:10 439 ADEDGYDLETLEGAQAWARDAIVADPSLIKV---LAPLLTD-ELAE--IDWPE-PPAALPPGEDDQADE-------EQDT 504 (629) T ss_pred ccccCCCcCCcHHHHHHHHHHhcCCCchhhh---hhhhcCC-cccc--ccccC-CCCcCCCCCcccCcc-------ccCC Confidence 5433211 11 000000000 0000000 0000 00000 000000000100111 1111 Q ss_pred ccccccccccc-cccccccccccccc--cchhhhhhccCCCC-C Q lcl|NC_012530. 520 YTGKDAKPSGK-DNQQGVGKDGQLKN--KKNTNSYKQGGSSK-K 559 (559) Q Consensus 520 ~~~~~~~~~g~-~~~~~~~~~~~~k~--~~~~~~~~~~~~~~-~ 559 (559) .......+.+. .....+....-.+. .--..+-...||-- + T Consensus 505 ~~~e~~~e~dA~~a~~~~~~aa~~~A~rllv~RALelAGkRl~~ 548 (629) T protein:vir:10 505 TGSEPSTEDDAEAAARISSVADMVLAERLLTVRALGLAGKRRVN 548 (629) T ss_pred CCCCcCCCcchhhcccCCchhhHHHHHHHHHHHHHHHccccccC Confidence 11000000000 00000000000000 00011112223221 1 No 214 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=98.11 E-value=4.3e-06 Score=50.02 Aligned_cols=450 Identities=9% Similarity=0.093 Sum_probs=182.1 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhcccccccccccccccccc-----ccccc-cccccCCCCCcccHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFS-----TLEDT-SIVPKPSPIAFGRIT 74 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~-----~~~~~-~~~~~p~~~~~~~~~ 74 (559) .++|..+ ...+.++. .+..... ..+.....-..+...+...... .+... .+..-+...+...+. T Consensus 5 l~~~~~~----~~~d~~~~----~e~~~~~--~~s~~~p~~~dGa~~i~~~~~~~~~~~~g~~~~~~~~e~~~~~~~eLI 74 (521) T protein:vir:65 5 LKMLARW----ADFDNDKY----EEQIKDK--AESIAAPKNNDGATEVEINDNSPASSWNSLTQQFYSTDQKISTTKQLV 74 (521) T ss_pred hhhhhhc----cCchhhHH----HhhhccC--CCcccCCCCCCCceeecccCCccccccccceeeeccccchhhhHHHHH Confidence 2222222 22222211 1110000 0011111111121111100000 01110 111111112222233 Q ss_pred HHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHH Q lcl|NC_012530. 75 DVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTS 154 (559) Q Consensus 75 ~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~ 154 (559) ..-+..+.+|.|..||.-|.+.+. ..+.+....++.+.+. +.....+.++..--..+.+... ... T Consensus 75 ~~YR~ma~~pEvd~Av~eIVneai------v~d~~~~pV~l~L~~~-~~s~~iK~kI~eeF~~Il~ll~-F~~------- 139 (521) T protein:vir:65 75 NTYRGLMNNHEVENAVQNIVNDAI------VFEEGHEVVSLNLEAT-GFSESVKERIHEEFKDLLNTIQ-FDR------- 139 (521) T ss_pred HHHHHHhhccchhhHHHHhhccee------EecCCCceEEEEeccc-ccchHHHHHHHHHHHHHHHHhc-cch------- Confidence 333445778999999998888753 3445555666666433 3444444444443333333311 111 Q ss_pred HHHHHHHHHHHcCCcceEEEECCC--CcEEEEEEecCceEEEEec-----Ccc-cccccceEEEEEecCc---------- Q lcl|NC_012530. 155 FLRKLVRDTYTYDQVNYENTYDSN--GRLSHTRMVDPTTIYFAND-----EHG-HRRTRGKIYRQYIDNK---------- 216 (559) Q Consensus 155 f~~~~v~d~ll~Gna~~~i~rd~~--G~~~~L~~l~p~~V~~~~~-----~~g-~~~~~~~~y~~~~~~~---------- 216 (559) --..+++.+++.|..|+.++-|.+ .-+.+|..|||.+|+.++- ..| .......-|+.+..+. T Consensus 140 ~~~~~fR~WYVDgRi~fhkiid~~pk~GI~ELr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~ 219 (521) T protein:vir:65 140 RGQDMFRRWYVDSRIFFHKIIGKNPKDGIVELRQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVF 219 (521) T ss_pred hhhHHHhhhhhcceeEEEEEEcCCccccceeeeeeCCcceeeeeeecccccCCcceecceeeeeeeecCCcceeccceee Confidence 122345566789999999996644 4499999999999876542 111 1111111222221111 Q ss_pred ---eeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHH Q lcl|NC_012530. 217 ---VRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRA 293 (559) Q Consensus 217 ---~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~ 293 (559) ....+ +.+.|++.+.-.-+ .+++.=+|-|..|...+....-.+...-=|==.-|.-+-|.-++-+..|..-.++- T Consensus 220 ~~~~~vkI-~~dAI~y~hSGl~d-~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqY 297 (521) T protein:vir:65 220 SPNSRVKI-PRSAITYAHSGLMD-CDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQH 297 (521) T ss_pred cCCcceee-chhheeeeecccee-CCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHH Confidence 11122 33444443311112 12222246688888777777666665554433334445555555443332221222 Q ss_pred HHHHHHHHHHH-----hcC-ccc-ccccccccC--------C-ceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHHh Q lcl|NC_012530. 294 LEDFKRHWTAT-----SSG-ING-AYRIPMITA--------E-DAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAEI 357 (559) Q Consensus 294 ~~~l~~~~~~~-----~~G-~~n-ag~~~vl~~--------g-~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~l 357 (559) +..+-..+++. ..| ..+ ..-+.+++. | |.++..|.-...+--++-..|..+.+.++++||.+.| T Consensus 298 l~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl 377 (521) T protein:vir:65 298 MNSVAQSFKNRVVYDASTGKLKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDIDDIRYFNRKLYEALRVPLSRS 377 (521) T ss_pred HHHHHHhcCceeEeecccccccccccccchhhhhcccccCCCCccceeecccCCCcChHHHHHHHHHHHHHHhCCCceec Confidence 22222222221 011 111 111122211 1 2344433222334446667788889999999999999 Q ss_pred ccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHh----hcc-----ccccC----ccceeeecchhhh Q lcl|NC_012530. 358 GMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTN----GII-----RQILG----DNYMLEFVGGDTR 424 (559) Q Consensus 358 g~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~----~L~-----~~~~~----~~~~~~f~~l~~~ 424 (559) +....+.++.+.++..++.-. -....|.-+..++...|.. .|+ ++.++ ..+.|+|.....- T Consensus 378 ~~e~~~~~~~gr~~EItRDEi------KF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f 451 (521) T protein:vir:65 378 NLSDANMVIGGDGSEITRDEL------EFSKFIRTLQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYY 451 (521) T ss_pred cCCCCcceeccccchhhHHHH------HHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEeeecchH Confidence 876665554433333332211 1122344444444444433 322 22222 3467777654433 Q ss_pred hHH-------HHHHHHHHHHc--C-CCCHHHHHH-HhCCCCCCCCCEeeccceecccccccccccccccccccccccccc Q lcl|NC_012530. 425 SQQ-------DKLKSVQLELQ--T-ATTVNDYRE-KQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLES 493 (559) Q Consensus 425 d~~-------~~~~~~~~~~~--~-~~T~NE~R~-~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 493 (559) .+. .|+.++..+-- | .++.+=||+ .|.+.-.+ +. .+..+...+...... T Consensus 452 ~ElKe~Eil~~R~~~l~~~dpyvGky~S~dyi~k~ILr~tDee-------------i~--~~~k~I~~E~~~~~~----- 511 (521) T protein:vir:65 452 TEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDILKYTDDQ-------------MD--TEKKQIEEEANDPRF----- 511 (521) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHH-------------HH--HHHHHHHHhhhCCCC----- Confidence 333 33333333211 1 235554543 33332110 00 000000000000000 Q ss_pred cCCCCCCCCCCCCccccccc Q lcl|NC_012530. 494 ALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 494 ~~~~~~~~~~~~~~~~~~~~ 513 (559) .+++ ++-+.+ T Consensus 512 --~~p~--------~~~~~f 521 (521) T protein:vir:65 512 --KQTP--------DEIEDF 521 (521) T ss_pred --CCCc--------ccccCC Confidence 0000 111111 No 215 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=98.11 E-value=4.4e-06 Score=49.97 Aligned_cols=397 Identities=10% Similarity=-0.002 Sum_probs=157.4 Q ss_pred ccccccccccccccccccccccccCCCCCcccHHHHHHH----Hh-hChHHHHHHHHHHHHHHhhhhHhhhhcCCcc--e Q lcl|NC_012530. 41 RAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQ----YS-MNVVLNAIINTRANQVTEYAHRASTDDNGMG--Y 113 (559) Q Consensus 41 ~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~----~~-~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~--~ 113 (559) ++. .. .+.......++.+. +. ...++..||+.....+..+-....+-.+.-. . T Consensus 1 ~~~-----------~~---------~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~ 60 (478) T protein:vir:10 1 MIS-----------IN---------WPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILD 60 (478) T ss_pred Ccc-----------cc---------CCCCchhHHHHHHHHhhccCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhc Confidence 100 00 00000001111111 10 1234445555444444332111111100000 0 Q ss_pred ee----------eccccccc-ChhHH----------------------HHHHHHHHHHHhcCCCCCCChhhHHHHHHHHH Q lcl|NC_012530. 114 QV----------RLKNGDKP-TKEQQ----------------------KKIDYAERYIERMGVDYSPIRDDFTSFLRKLV 160 (559) Q Consensus 114 ~v----------~~~d~~~~-~~~~~----------------------~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v 160 (559) +. ..+...++ ..... +..+.+..++. ..+......+. T Consensus 61 ~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~d~~~~~l~~~~~----------n~~~~~~~~~~ 130 (478) T protein:vir:10 61 APPKRDVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVANPVTFGVDNDKALKQIQHTLN----------HKWDDKLVDIL 130 (478) T ss_pred cccccccccccccccccceeccchHHHHHHHHHhhhccCCeeeecCChHHHHHHHHHHh----------cCHHHHHHHHH Confidence 00 00000000 00001 11112222221 12344556678 Q ss_pred HHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCccc-ccccceEEEEEecCceeeeecccceEEEec-------- Q lcl|NC_012530. 161 RDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGH-RRTRGKIYRQYIDNKVRGSFTADEMGMFIR-------- 231 (559) Q Consensus 161 ~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~-~~~~~~~y~~~~~~~~~~~~~~~evi~~~~-------- 231 (559) .+++.+|.+|..+..+.+|++ .+..++|..+.++.+.... ......+|+..........+..+.+.++.. T Consensus 131 ~~~~~~G~~~~~~~~d~~g~~-~~~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~ 209 (478) T protein:vir:10 131 TAASNKGIEWVQPYVDEEGEF-KTFRVPAEQAVPIWTNKERDELQAFIRVYELDGAERVEYWTKDDVTYYELKEGQLIPD 209 (478) T ss_pred HHHHhcCeEEEEEEecCCCee-EEEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCeeecc Confidence 889999999999999988876 4778899999888764321 122223333322222222333333332221 Q ss_pred ------------------c-----cCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCcc Q lcl|NC_012530. 232 ------------------N-----PRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTN 288 (559) Q Consensus 232 ------------------n-----~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~ 288 (559) | |.-......+|.|-++.....++....+..-..+.+...+.|-.++. +.. ..+ T Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~--g~~-~~~ 286 (478) T protein:vir:10 210 FYRSDDHIQPHYYQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILK--GYE-GED 286 (478) T ss_pred ccccccccccceecccccccCCccceEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeee--cCC-ccc Confidence 0 00001123357777666555555555444444445555555644442 211 111 Q ss_pred CCHHHHHHHHHHHHHHhcCcccccccccc---cCCceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHHhcccccccc Q lcl|NC_012530. 289 TSMRALEDFKRHWTATSSGINGAYRIPMI---TAEDAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGA 365 (559) Q Consensus 289 ~~~e~~~~l~~~~~~~~~G~~nag~~~vl---~~g~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~ 365 (559) . .+ ....+. .+++..+ .+++++|..... ....+....+...+.|...-++|..-.+- . T Consensus 287 ~-~~----~~~~~~--------~~~~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-----~ 347 (478) T protein:vir:10 287 M-KD----FMHNLK--------YYKAISVAGESGSGVDTIKVEV-PIDSVKEYTKMLRDYIIEFGQGVDFQQDK-----F 347 (478) T ss_pred c-ch----hhhhhh--------hcceEEecCCCCCcceEEeecC-ChHHHHHHHHHHHHHHHHHhCccccCccc-----c Confidence 1 11 111111 1122222 223455543322 34456677888888888888888532211 1 Q ss_pred ccccccchh---hhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHcC Q lcl|NC_012530. 366 TGNKSNSLN---ESNN---QNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQT 439 (559) Q Consensus 366 ~~~~~~~~~---~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~ 439 (559) +++.++... ++.. ....+.++..+|+-++..|...+. .......+.+.|+.....|..+.++.+... .| T Consensus 348 ~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~g----~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl-~g 422 (478) T protein:vir:10 348 GNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYR----LDVKVQDIEITFNFNVMVNELENSQIAMNS-TG 422 (478) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC----CCcccccceEEecCCCCCCHHHHHHHHHHH-hC Confidence 111111100 0001 111223344444444444433221 122334578888888888888888877654 56 Q ss_pred CCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccc Q lcl|NC_012530. 440 ATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 440 ~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (559) +|+...+++++++- +.-+ ..+..+.+ ++........ ....+... +.+.+..+.+++ T Consensus 423 ~iS~et~~~~l~~v--~D~~--------~E~~ri~~------E~~~~~~~~~-~~~~~~~~-~~~~~~~~~~~~ 478 (478) T protein:vir:10 423 LLSKETILSNHAWV--EDPV--------AEMERIEQ------ENIELNQQLP-DIEEGLNG-EQQRQSENNQPE 478 (478) T ss_pred CCChHHHHHhCCCC--CCHH--------HHHHHHHH------HHHHHHhhcc-ccccccCC-CCCCCCCCCCCC Confidence 78888888887652 1100 11111111 0000000000 00000000 000000000000 No 216 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=98.11 E-value=4.5e-06 Score=49.92 Aligned_cols=361 Identities=10% Similarity=0.036 Sum_probs=158.9 Q ss_pred hccccccccccccccccccccccccccccCC-CCC-cccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcce Q lcl|NC_012530. 36 LNGVDRAYTEPVDGNLMFSTLEDTSIVPKPS-PIA-FGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGY 113 (559) Q Consensus 36 ~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~-~~~-~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~ 113 (559) ++.-+..+... ..+..+-...+. +.. ...+....+ +...+...+|+.+++.+. -.|| T Consensus 1 l~~~~~r~~~~--------~~yY~g~~~~~~~~~~~p~~~~~~~~--~v~nw~~~~Vds~a~rl~-----------~~Gf 59 (410) T protein:vir:95 1 MNLYQSRVNLR--------YKHYAMQHYEAPTGITIPAHIRAKYQ--AVLGWAAKGVDSLADRLI-----------FRAF 59 (410) T ss_pred CCcchhhHHHH--------HHHhcCCCCccccchhccHHHHhHHH--hhcchhHHHHHHhHhhhc-----------cccc Confidence 22222111100 000111111100 000 001111111 234566667776666442 0123 Q ss_pred eeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEE Q lcl|NC_012530. 114 QVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIY 193 (559) Q Consensus 114 ~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~ 193 (559) .. ++. .+..+... .++......+..+.|++|.+|+.|..+.+|.| .+.+++|.++. T Consensus 60 ~~----~d~----------~l~~i~~~---------N~ld~~~~~~~~~al~~G~sf~~v~~~~d~~~-~i~~~sP~~~~ 115 (410) T protein:vir:95 60 AN----DDF----------NVTEIFDR---------NNPDIFFDSAILSALIGSCSFVYISKGEDDEV-RLQVIESSNAT 115 (410) T ss_pred cC----CCc----------hHHHHHhh---------cChHHHHHHHHHHHHHhCceeEEEecCCCCce-EEEEEcccceE Confidence 21 110 12233221 12344566778899999999999999888886 47899999998 Q ss_pred EEecCcccccccceEEEEEecCce---eeeecccce---------------------EEEecccCCCccCCccccc---- Q lcl|NC_012530. 194 FANDEHGHRRTRGKIYRQYIDNKV---RGSFTADEM---------------------GMFIRNPRSDILSGGYGLS---- 245 (559) Q Consensus 194 ~~~~~~g~~~~~~~~y~~~~~~~~---~~~~~~~ev---------------------i~~~~n~~~~~~~~~~G~S---- 245 (559) ++.|...........++....++. ...+.++.+ ++|..++. ....+|.| T Consensus 116 ~i~Dp~~~~~~~al~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~---l~~~~G~s~I~~ 192 (410) T protein:vir:95 116 GVIDPITGLLVEGYAVLARDDYNRPTLEAYFEPNATHFIPKDGEPYSVTNETGIPLLVPVIHRPD---AVRPFGRSRITR 192 (410) T ss_pred EEEeCCCCceEEEEEEEEecCCCeEEEEEEEeCCcEEEEeeCCccccccCCCCCcceEEeccccc---CCccCCccccch Confidence 887764333222222111111111 112222322 33332222 13457777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEE-ecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC---- Q lcl|NC_012530. 246 ELEMGLREFISHENTELFNDRFFTHGGTTKGILL-VKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE---- 320 (559) Q Consensus 246 pl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~-~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g---- 320 (559) |+..+.+++...+.-......||.+ |.-++. ++. +.+..+ .|+... +++..++.. T Consensus 193 ~v~~l~da~~r~~~~~~~~~e~~a~---pqr~i~G~d~-------d~~~~~----~~~~~~------~~i~~~~~~~~~~ 252 (410) T protein:vir:95 193 AGMYYQKYAKRTLERADITAEFYSW---PQKYILGLDP-------DAEPME----KWKATV------SSLLTISSSDKGV 252 (410) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhcc---hhheeeccCC-------CCCcCc----hhhhhh------hhheeccCCCCCC Confidence 4555666665555555556666654 433332 111 111111 233221 233333221 Q ss_pred ceeeeeccccchhH-HHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_012530. 321 DAKFVSMTQAEDMQ-FQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAK 399 (559) Q Consensus 321 ~~~~~~ls~~~D~q-f~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~ 399 (559) ..++..+. ..+++ |++..+..+..||..=++|++.+|....+.+++ .... ..++....-....-+-+-..+++ T Consensus 253 ~~~v~q~~-~~~l~~~~~~l~~l~~~~a~~s~lP~~~lg~~~~NpsSa----~Al~-a~~~~L~~ka~~k~~~fg~~l~~ 326 (410) T protein:vir:95 253 KPSVGQFT-TASMSPFTEQLRTAAAGFAGEMGLTLDDLGFVSDNPSSV----EAIK-ASHENLRLAGRKAQRSLGAGLLN 326 (410) T ss_pred cceEEecC-CCChHHHHHHHHHHHHHHhhhcCCCHHHhccccCchhHH----HHHH-HHHHHHHHHHHHHHHHHHHHHHH Confidence 24555443 23554 899999999999999999999999644321110 0000 01111111111111111111222 Q ss_pred HHHhh--ccc-----cccCccceeeec---chhhhhHHHHHHHHHHHHcC--CC-CHHHHHHHhCCCCCCCCCEeeccce Q lcl|NC_012530. 400 NLTNG--IIR-----QILGDNYMLEFV---GGDTRSQQDKLKSVQLELQT--AT-TVNDYREKQGLPKIAGGDIILSAVY 466 (559) Q Consensus 400 ~ln~~--L~~-----~~~~~~~~~~f~---~l~~~d~~~~~~~~~~~~~~--~~-T~NE~R~~~gl~pi~gGD~~~~~~~ 466 (559) .+-.. +.. +.......+.|. .....+..+.++++.++... ++ ...-+++++|+.+-+ + T Consensus 327 ~~rla~~i~~~~~~~~~~~~~~~v~W~p~~d~~~~s~a~~aDa~~Kl~~a~~g~~~~~~~~~~lg~~~~~---~------ 397 (410) T protein:vir:95 327 VAYVAACLRDEFRYTRSQFVRTAVKWEPLFEADANTMTMIGDGVVKLNQALPGYINAETIRDLTGIAGDM---S------ 397 (410) T ss_pred HHHHHHHHhcCCCCcccccceeeEEeeecCCcchhhHHHHHHHHHHHHHhccCCccHHHHHHhcCCChHH---H------ Confidence 11111 111 112233455565 34445667777777666542 44 556689999996431 0 Q ss_pred ecccccccccccccccccccc Q lcl|NC_012530. 467 IQRLGQQEQIKQNEFQRQQTR 487 (559) Q Consensus 467 ~~~l~~~~~~~~~~~~~~~~~ 487 (559) .... .+.+..... T Consensus 398 -~~~~-------~~e~~~~g~ 410 (410) T protein:vir:95 398 -AKPV-------VSEGGSNGE 410 (410) T ss_pred -HHHH-------HHHHHhCCC Confidence 0000 000111110 No 217 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=98.11 E-value=4.5e-06 Score=49.91 Aligned_cols=423 Identities=10% Similarity=0.036 Sum_probs=171.5 Q ss_pred Ccchh-hhccccccCCcc-----hHHHHHHHHHHH-HHHhhhhccccccccccccccccccccccccccccCCCCCcccH Q lcl|NC_012530. 1 MGIFD-RFRTKFYTDDPN-----AFFKHIDSKIAN-DTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRI 73 (559) Q Consensus 1 ~~~~~-~~~~~~~~~~~~-----~~~~~~~~~~~~-~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~ 73 (559) |+|+- |+..-..++++. ..+++....... +.+..=-.|++.....+. .+. .+.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~----------------~~~--~~~~~ 62 (453) T protein:vir:73 1 MNLKPIKLMTYSRDEEITDKVVNDFMKKHQEEVERYEYLGNMYKGIMEISSQKA----------------KDS--WKPDN 62 (453) T ss_pred CccccceeeeccccccCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcCCC----------------CCc--cCccc Confidence 77653 111112344443 222222221100 111111223332211110 000 01010 Q ss_pred HHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHH Q lcl|NC_012530. 74 TDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFT 153 (559) Q Consensus 74 ~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~ 153 (559) + ...++...+|+..+.-+- |.+..+...+ ....+.+..|+.. ..|. T Consensus 63 ----k--i~~n~~~~ivd~~~~~l~-----------g~~~~~~~~d--------~~~~~~l~~~~~~---------n~~~ 108 (453) T protein:vir:73 63 ----R--LTNNFAKYIVDTFVGYFN-----------GIPIKKTHDD--------KSVLEAMQLFDNL---------NDME 108 (453) T ss_pred ----e--eecchHHHHHHHhhhhhc-----------ccCceeecCC--------hHHHHHHHHHHHh---------cChh Confidence 1 112444445554443322 2222222111 1122334455433 1234 Q ss_pred HHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccc-cccceEEEEEecCce-eeeecccceEEEec Q lcl|NC_012530. 154 SFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHR-RTRGKIYRQYIDNKV-RGSFTADEMGMFIR 231 (559) Q Consensus 154 ~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~-~~~~~~y~~~~~~~~-~~~~~~~evi~~~~ 231 (559) .....+..+.+.+|.+|..+.++.+|.+. +..++|..+.++.++.... .....+|+...++.. ...+..+.++++.. T Consensus 109 ~~~~~~~~~~~~~G~~~~~v~~d~~~~~~-i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~ 187 (453) T protein:vir:73 109 DEESELAKIACVYGRAYELMYQNESTESE-VIYCSPLNVFMVYDDSIKQKPLFAVYYGFDEEGNLSGTVYTLLETISITG 187 (453) T ss_pred HHHHHHHHHHHhcCeEEEEEEeCCCCceE-EEEEcccceEEEEeCCCCceeEEEEEEEEecCceEEEEEEeCCeEEEEEe Confidence 46667888999999999999999998874 6778999988877654322 222222222222221 12334444433321 Q ss_pred c-----------------cCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHH Q lcl|NC_012530. 232 N-----------------PRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRAL 294 (559) Q Consensus 232 n-----------------~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~ 294 (559) . |.-.......|.|-++.+...++....+..-..+.....+.|-.++. +. .+.++.. T Consensus 188 ~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~--g~----~~~~~~~ 261 (453) T protein:vir:73 188 KAGEVKFGESTYNVYSDLPIVEYNFNEERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFL--GA----EVDEEDA 261 (453) T ss_pred cCCceEEccceeccCCceeEEEecCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeee--cC----CCCchhh Confidence 1 10011112357676766666665544444444444444455655553 21 1223333 Q ss_pred HHHHHHHHHHhcCcccccccccccCCceeeeecccc-chhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccch Q lcl|NC_012530. 295 EDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSL 373 (559) Q Consensus 295 ~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~ 373 (559) ..++..-- ........+..... .++.++.-++.+ .+..+....+...+.|+..-++|. ++...-+..++ ..... T Consensus 262 ~~~~~~~~-~~~~~~~~~~~~~~-~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~~gn~Sg-~Al~~ 336 (453) T protein:vir:73 262 KNIKDNRL-INFFDKNSNGQGTN-AAKVDVKFLDKPDSDVQTENLLNRLERSIFQFTMAAN--ISDENFGNSSG-VALAY 336 (453) T ss_pred hccccccc-cccccccccccccc-ccCceeEEeeecCCHHHHHHHHHHHHHHHHHHhCCcc--cCcccccCccH-HHHHH Confidence 33322100 00000001111111 122233333322 344566677888888988888884 22221111110 00000 Q ss_pred hhhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHcCCCCHHHHHHHh Q lcl|NC_012530. 374 NESNN---QNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQTATTVNDYREKQ 450 (559) Q Consensus 374 ~~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~~~T~NE~R~~~ 450 (559) -+... .+..+..+..+|.-++..+...++..- .......+.+.|+.....+..+.++.+..+. |+++..-+.+++ T Consensus 337 ~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~-~~~~~~~i~v~f~~~~p~~~~~~a~~~~k~~-giis~et~~~~~ 414 (453) T protein:vir:73 337 KLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNAS-NKDAWKDIEYTFTRNEPKDIKEQAETANILK-GITSEETALSVI 414 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-CccccccceEEeCCCCCCCHHHHHHHHHHHh-ccCcHHHHHHhC Confidence 11111 111223444555555555543333221 1122346788898888888999888876664 667776666666 Q ss_pred CCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhcccc Q lcl|NC_012530. 451 GLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEG 519 (559) Q Consensus 451 gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (559) +.-+-+ ...+..+.+ ++.... .. +..++...++. ++.+- T Consensus 415 ~~~~d~----------~~E~~ri~~------E~~~~~--~~-~~~~~~~~~~~-----------~~~~~ 453 (453) T protein:vir:73 415 SVIPDV----------QAEMEKIKK------KKLLQL--SL-TRTSNLVRMKQ-----------MRGNL 453 (453) T ss_pred CCCCCH----------HHHHHHHHH------HHHHHH--HH-HHhccCCcchh-----------hhcCC Confidence 542110 011111111 000000 00 00000000000 00000 No 218 >protein:vir:106491 Length: 646 # NCBI annotation: Pas4 # Family: family:all:2798 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024790;genbank:gi:48697405;genbank:GeneID:2846148 Probab=98.10 E-value=4.6e-06 Score=49.86 Aligned_cols=480 Identities=12% Similarity=0.065 Sum_probs=198.0 Q ss_pred CcchhhhccccccCCcchHH-HHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFF-KHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQ 79 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~ 79 (559) |-|| |-+ .++.+.+ +...+.+..-.++..+.-..++ ......+....|. .+-... T Consensus 1 ~~~~---rPk---~~p~~p~~~~~arrr~LtaAsa~l~~~~~~---------------~~kt~~~~~~~WQ---~eAW~~ 56 (646) T protein:vir:10 1 MALL---KPK---SAPPEPFGAEVARRIALAGATAQVDLGASS---------------SWKTWKFGNKDWQ---TEGWRL 56 (646) T ss_pred Cccc---CCC---CCCCCcccccccchhhhhhccccccCCCcc---------------eeecCCCcchhhh---HHHHHH Confidence 2222 111 1111100 0000000000000000000000 0000000000110 112222 Q ss_pred HhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHH Q lcl|NC_012530. 80 YSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKL 159 (559) Q Consensus 80 ~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~ 159 (559) +...+.++-.|.-|+++++..-++.. +|. ..+ .++... ..+.+......+.. ....-.++++.+ T Consensus 57 ~d~vpELry~vgW~~~a~SR~rL~as--------eid-dtG-~~tg~v--~~~~v~~iv~~~~G----g~~gQ~qlLkr~ 120 (646) T protein:vir:10 57 YDIIPEHHFLAGRIGDSVAQARLYVT--------EVD-DTG-EETGEV--QDERIKRLAAVPLG----TGSQRDDNLRLA 120 (646) T ss_pred HhhhhhHhhHhhhhhhhhceeeeeee--------eec-CCC-CCcCcc--chHHHHHHhhhhcc----chhhHHHHHHHH Confidence 23347777788888888887544332 222 111 111111 00112222222211 112224789999 Q ss_pred HHHHHHcCCcceEEE---E-CCCCcEEEEEEecCceEEEEecCcccccccceEEEEEec---CceeeeecccceEEEecc Q lcl|NC_012530. 160 VRDTYTYDQVNYENT---Y-DSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYID---NKVRGSFTADEMGMFIRN 232 (559) Q Consensus 160 v~d~ll~Gna~~~i~---r-d~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~---~~~~~~~~~~evi~~~~n 232 (559) ..++-+-|.+|+... . ..+++ ..++++-...|.. .|.. .-..... +.........++++..++ T Consensus 121 ~~~ltV~GE~wiv~~~~~~~~~~~~-~~W~vvt~~Ev~~----tg~~-----~~i~~p~~~~g~~~v~~~~~d~lvRiW~ 190 (646) T protein:vir:10 121 GLDLAVGGECWIVGEGAATSPEAAE-GSWFVVTGSAISR----TGDE-----IAVRRPQQRGGSKLVLVDGQDILIRCWR 190 (646) T ss_pred HhheecccceEEeeccccCCCCCCc-cceeeecHHHhcc----CCCe-----eeeecCccCCCCCcceecCCceEEEEec Confidence 999999999887531 1 11221 1244444444421 1111 1111111 333344556677766677 Q ss_pred cCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCc--cCCHHHHHHHHHHH----HHHhc Q lcl|NC_012530. 233 PRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVT--NTSMRALEDFKRHW----TATSS 306 (559) Q Consensus 233 ~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~--~~~~e~~~~l~~~~----~~~~~ 306 (559) |.+. ...+--||+.+++..+.-.........+..+.-.+-.|||.++.+.+-+ ...+-....|...| ..++. T Consensus 191 P~Pr--r~~epDSpvra~l~~l~Ei~~lt~~I~aaakSRL~GnGvLfvP~e~s~p~~~~~~a~~~~l~~~l~qaa~tAi~ 268 (646) T protein:vir:10 191 PHPN--DTDQADSFTRSAIVPLREIELLTKREFAELDSRLTGAGIMFLPEGVDFPRGEEDPAGLAGFMAYLQRAAAASMA 268 (646) T ss_pred CCcc--cccCCcchhHHHHHHHHHHHHhhhHhHHHHHHHHhcCceeeeccccccCCCCCCCcchhHHHHHHHHHHHhhhc Confidence 7553 2345679999998888877777666666555555566778776553322 11111223333333 23332 Q ss_pred Cc-ccccccccccCC---ce----eeeecc--ccchhHHHHHHHHHHHHHHHHhCCCHHH-hccccccccccccccchhh Q lcl|NC_012530. 307 GI-NGAYRIPMITAE---DA----KFVSMT--QAEDMQFQSWLNYLINIICALVAMDPAE-IGMQNRGGATGNKSNSLNE 375 (559) Q Consensus 307 G~-~nag~~~vl~~g---~~----~~~~ls--~~~D~qf~e~~~~~~~~Ia~~fgVPp~~-lg~~~~~~~~~~~~~~~~~ 375 (559) .. ..+--|||+..+ -+ +.+.++ ..-+.--+.+|+..+..||....|||.. ||+.+.+ .|+. +.- T Consensus 269 De~S~aA~vPiia~~P~E~i~~~~~ik~l~f~~eite~aiktR~daI~RlA~glDIppE~LLGlgd~N-HWtA-WqI--- 343 (646) T protein:vir:10 269 DQSRASAMVPIMATIPNEMMEHLDKIKPLTFWSELSAEITPMKDKAIARLASSAEIPGEVLTGIGDAN-HWTA-WLI--- 343 (646) T ss_pred CCCCccceeeeEEeeChHHHhhhhcceeeccCchhhHHHhhhHHHHHHHHHhccCCchhheeeccccc-eeee-eee--- Confidence 22 124456776432 11 233333 3334456889999999999999999875 5665433 3321 111 Q ss_pred hhHHHHHHHHHHHHhhHHHHHHHHHHHhhcccc---ccC----ccceeeecc-hhhhhHHHHHHHHHHHHcCCCCHHHHH Q lcl|NC_012530. 376 SNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQ---ILG----DNYMLEFVG-GDTRSQQDKLKSVQLELQTATTVNDYR 447 (559) Q Consensus 376 an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~---~~~----~~~~~~f~~-l~~~d~~~~~~~~~~~~~~~~T~NE~R 447 (559) +.+ -++ -|.|.+..|+++|++.+|.+ .++ .+|.+-|+. .+..+.....++.+..-+|.||-...| T Consensus 344 -~de-----~vr-HI~P~l~~ic~AlT~~~Lrp~Le~eGi~dp~kyvvW~DaS~Lt~~pd~~deA~qa~drGAIt~eAlr 416 (646) T protein:vir:10 344 -SDE-----GIR-WIRGYLGLIADALTRGFLRRALESMGVTNPERYAFAFDTSTLASKPNRLDEAIQLHERNLIKDEEVV 416 (646) T ss_pred -ccc-----cch-hhhhHHHHHHHHHHhhHHHHHHHHcCCCChhHeEEeecCcccccCCCCcHHHHHHHHcCCccHHHHH Confidence 111 233 49999999999999988753 122 368888874 445554444455555557889999999 Q ss_pred HHhCCCCCCCCCE--eeccceecccccccccccccccccccccccccccC-----CCCCCCCCCCCccccccchhccccc Q lcl|NC_012530. 448 EKQGLPKIAGGDI--ILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESAL-----QNPSGTPPTLPPSSSNSFQQNQEGY 520 (559) Q Consensus 448 ~~~gl~pi~gGD~--~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (559) +.+|+.--.+=+. ..+..-.........+ .-....+.....+..+. +..++++...+ ..+++.. T Consensus 417 k~~Gf~~dd~pt~~E~~~~~~~~~v~~~P~L--il~P~~qa~~~~P~~~~~~lpp~~~~~~dg~~~------~~e~~g~- 487 (646) T protein:vir:10 417 KAGAFSVDQMPTVQERAVQILLGLVKTQPDL--ILDPAIQAALGLPAVQSVGLPPTAAQRTDGDLD------DDESEGA- 487 (646) T ss_pred HHhcccccccCChHHHHHHHHHHHhcCCccc--cccchhhccccCCCcCccccCCcccccccCCCC------ChhhcCC- Confidence 9999853211010 0000000000000000 00000000011111000 00011110000 0011011 Q ss_pred ccccccccccccccccccc--ccccccch----------------------------hhhhhccCCCCC Q lcl|NC_012530. 521 TGKDAKPSGKDNQQGVGKD--GQLKNKKN----------------------------TNSYKQGGSSKK 559 (559) Q Consensus 521 ~~~~~~~~g~~~~~~~~~~--~~~k~~~~----------------------------~~~~~~~~~~~~ 559 (559) +++.+.++....+ +..+.-.+ ..+-...||-.. T Consensus 488 ------~~~~E~~~~pda~~~~a~~~~~~~r~~~~~~~~~~~~~p~a~~~aav~l~v~RAL~lAG~Rlr 550 (646) T protein:vir:10 488 ------PNGGEAPDQPDADEARAITAALDRRIALAARPVLALPSPEAVFNASAKLMILRALELAGGRLT 550 (646) T ss_pred ------CCCCccCCCCCCCccccccccccccchhhhhhhhccccchhHHHHHHHHHHHHHHHhcccccc Confidence 1111111111111 01000000 000001111111 No 219 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=98.09 E-value=4.8e-06 Score=49.75 Aligned_cols=416 Identities=12% Similarity=0.067 Sum_probs=170.1 Q ss_pred Ccc--hhhhccccccCCcc-hHHHHHHHHHHH-----HHHhhhhccccccccccccccccccccccccccccCCCCCccc Q lcl|NC_012530. 1 MGI--FDRFRTKFYTDDPN-AFFKHIDSKIAN-----DTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGR 72 (559) Q Consensus 1 ~~~--~~~~~~~~~~~~~~-~~~~~~~~~~~~-----~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 72 (559) |+. -..| .=+.++++. +.|..+-..... +.+.+=-.|++.....+. ....+.+ T Consensus 1 ~~~~~~~~~-~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~------------------~~~~~~~ 61 (452) T protein:vir:36 1 MKYKPPKLM-TFSKDEPITVEVVTKFMEKHKLEVARYEYLKNMYLGIMAIDDEPA------------------KDSWKPD 61 (452) T ss_pred CcccCceeE-EcCCccCCCHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcc------------------ccccCcc Confidence 321 1111 114455555 222222221111 111111223322111110 0000101 Q ss_pred HHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhH Q lcl|NC_012530. 73 ITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDF 152 (559) Q Consensus 73 ~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~ 152 (559) . + ...++...+|+..+.-+- |.+..+...+ ....+.+.+++.. ..| T Consensus 62 ~----k--i~~n~~~~ivd~~~~~l~-----------g~~~~~~~~d--------~~~~~~l~~~~~~---------n~~ 107 (452) T protein:vir:36 62 N----R--LAVNFTKYIVDTFTGYFN-----------GIPVKKSHSD--------KEILTKLQEFDNL---------NDM 107 (452) T ss_pred c----e--eecchHHHHHHHHhhhhc-----------ccCceeecCC--------hhHHHHHHHHHhh---------cCh Confidence 0 1 113444455555543332 2222222211 1112234444432 123 Q ss_pred HHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCccc-ccccceEEEEEecCc-eeeeecccceEEEe Q lcl|NC_012530. 153 TSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGH-RRTRGKIYRQYIDNK-VRGSFTADEMGMFI 230 (559) Q Consensus 153 ~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~-~~~~~~~y~~~~~~~-~~~~~~~~evi~~~ 230 (559) ......+..+.+.+|.+|..+.+|.+|++. +..++|..+.++.+.... .....++|+...++. ....+..+.+.++. T Consensus 108 ~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~-i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~ 186 (452) T protein:vir:36 108 EDEESELAKMACIYGRAFEFLYQDEDTQTN-VVYNSPENMFMVYDDTVKQEPLFAVRYGVDEDKKLQGEVYTLLETIKIS 186 (452) T ss_pred hHHHHHHHHHHHhcCeEEEEEEecCCCeeE-EEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEEecCeEEEEE Confidence 456667788899999999999999888764 777899999888765432 111122222211111 11223333322221 Q ss_pred c------------c-----cCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHH Q lcl|NC_012530. 231 R------------N-----PRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRA 293 (559) Q Consensus 231 ~------------n-----~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~ 293 (559) . | |.-.......|.|-++.....++....+..-..+.+...+.|-.++. +. .++++. T Consensus 187 ~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~--g~----~~~~~~ 260 (452) T protein:vir:36 187 GENDEISFGEGTYNPYPDLPVVEFYFNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFL--GA----AVEEED 260 (452) T ss_pred EcCCceEEecceeccCCcccEEEecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEee--cC----CcCchh Confidence 1 0 10011112246666665555555544444444444555555654443 21 223333 Q ss_pred HHHHHHHHHHHhcCcccccccccccCC------ceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHHhcccccccccc Q lcl|NC_012530. 294 LEDFKRHWTATSSGINGAYRIPMITAE------DAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATG 367 (559) Q Consensus 294 ~~~l~~~~~~~~~G~~nag~~~vl~~g------~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~ 367 (559) ...++. +++..+..+ +++|..... .+..+....+...+.|+..-++|.. +...-+..++ T Consensus 261 ~~~~~~------------~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~--~~~~~gn~Sg 325 (452) T protein:vir:36 261 LKNIRS------------NRVINYYADGEGKNVDVKFLEKPD-SDSQTENLLDRLTKLIFQTTMVANI--SDESFGSSSG 325 (452) T ss_pred hhhhhh------------cceEEecCCCCccCCcceeEeecC-CHHHHHHHHHHHHHHHHHHhCcccc--CcccccCCcH Confidence 222211 111122121 233332222 2445667778888899988899853 2221111111 Q ss_pred ccccchhhhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHcCCCCHH Q lcl|NC_012530. 368 NKSNSLNESNN---QNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQTATTVN 444 (559) Q Consensus 368 ~~~~~~~~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~~~T~N 444 (559) . .....++.. ....+..+..+|+.+++.|...++.. -.......+.+.|......|..+.++.+..+ .|+|+.- T Consensus 326 ~-Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~-~~~~~~~~i~i~f~~~~p~d~~~~a~~~~k~-~g~iS~e 402 (452) T protein:vir:36 326 V-SLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNV-SNKDSWKDIEYTFTRNEPKDIKEQAETANIL-MGITSQE 402 (452) T ss_pred H-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-CCccccccceEEeCCCCCcCHHHHHHHHHHH-hccCChH Confidence 0 000111111 11123444555665555554443322 1112234577888888888888888877664 4668877 Q ss_pred HHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccc Q lcl|NC_012530. 445 DYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSN 511 (559) Q Consensus 445 E~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (559) -+.++++.-+ |. ...+..+.+ ++..... .......+.+..+.+.+.++.| T Consensus 403 t~~~~~~~~~----d~------~~E~~ri~~------E~~~~~~-~~~~~~~~~~~~~~~~~~~~~e 452 (452) T protein:vir:36 403 TALSVISVIP----DV------QAEMEKIKK------EEASTAI-FDKDKQPSEKGTDTVVSETNEE 452 (452) T ss_pred HHHHhCCCCC----CH------HHHHHHHHH------HHHHHHH-HHhhccCCCCcccccCccccCC Confidence 7777765421 10 011111111 1100000 0000000001101111111100 No 220 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=98.05 E-value=5.8e-06 Score=49.28 Aligned_cols=413 Identities=10% Similarity=0.013 Sum_probs=162.4 Q ss_pred cccCCcchHHHHHHHH---HHH--HHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChH Q lcl|NC_012530. 11 FYTDDPNAFFKHIDSK---IAN--DTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVV 85 (559) Q Consensus 11 ~~~~~~~~~~~~~~~~---~~~--~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~ 85 (559) +.-+.+.+-+...-.+ ... +.+.+=-.|++....++... ......... .....+.++ ..++. T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~-~~~~~~~~~--~~~~~~~~k----------i~~n~ 67 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGK-AKLNKEGKK--DPLRSADNR----------IPSNF 67 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccch-hcccccccc--cccccCCcc----------cccch Confidence 1111111111111111 000 11111122332211111100 000000000 000000000 01233 Q ss_pred HHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHH Q lcl|NC_012530. 86 LNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYT 165 (559) Q Consensus 86 v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll 165 (559) ...+|+..+.-+. |.+..+...+ .+..+.+..++.+ +|...+..+..+++. T Consensus 68 ~k~Iv~~~~~yl~-----------G~p~~~~~~d--------~~~~~~l~~~~~~----------~~~~~~~~l~~~~~~ 118 (470) T protein:vir:10 68 YQLLVDQEAGYVA-----------SVFPDIDVGK--------DADNKKIIDVLGD----------DRALTLNGLLVDSSN 118 (470) T ss_pred HHHHHHhhhhhee-----------ccceeeecCc--------hHHHHHHHHHHhh----------hHHHHHHHHHHHHhh Confidence 3334444333222 1111221111 1122334444432 123334456778899 Q ss_pred cCCcceEEEECCCCcEEEEEEecCceEEEEecCcc-cccccceEEEEEecCc------eeeeecccceEEEecccC---- Q lcl|NC_012530. 166 YDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHG-HRRTRGKIYRQYIDNK------VRGSFTADEMGMFIRNPR---- 234 (559) Q Consensus 166 ~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g-~~~~~~~~y~~~~~~~------~~~~~~~~evi~~~~n~~---- 234 (559) +|.+|.++.+|.+|++. +..++|..+.++.++.- ......++|+...+.. ....+....+.|+...-. T Consensus 119 ~G~a~~~~y~d~~~~~~-~~~~~p~~~~~v~d~~~~~~~~a~ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~ 197 (470) T protein:vir:10 119 AGRAWLHYWIDEDGNFR-YGIIQPDQITPIYATTLDNKLLGILRSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTV 197 (470) T ss_pred cCeeEEEEEecCCCceE-EEEEcccceEEEEcCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCcEEEEEeecCccee Confidence 99999999999998764 77899999998876542 1222234444332211 112333344433321100 Q ss_pred ---------------------------------CCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEec Q lcl|NC_012530. 235 ---------------------------------SDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVK 281 (559) Q Consensus 235 ---------------------------------~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~ 281 (559) -......+|.|-++.....|+....+..-..+.+...+.|-.+|. T Consensus 198 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~-- 275 (470) T protein:vir:10 198 IEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFSKNKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLT-- 275 (470) T ss_pred ccccccccccccccccccccccccccCCCeeeEEEeecCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeee-- Confidence 000011246666666555555554444444555555555655553 Q ss_pred CccCCccCCHHHHHHHHHHHHHHhcCccccccccccc---CCceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHHhc Q lcl|NC_012530. 282 PSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT---AEDAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAEIG 358 (559) Q Consensus 282 ~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~---~g~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg 358 (559) +... .+. .+ +...+... .+-.++... +++++|.....+ +..+....+...+.|...-++|.. . T Consensus 276 g~~~-~~~-~~----~~~~~~~~-----~~i~~~~~~~~~~~~~~~lt~~~~-~~~~~~~~~~L~~~I~~~s~~p~~--~ 341 (470) T protein:vir:10 276 NYGG-ADL-HQ----FMNDLRKY-----KSIKINNTGNGDNSGVDKLQIDIP-VEARDDALKITRKNIFLFGQGIDP--A 341 (470) T ss_pred cCCc-ccc-ch----hhhhhhhc-----CeEeccCCCCCcCceeEEEeecCC-hHHHHHHHHHHHHHHHHHhCCCCC--C Confidence 2111 111 12 22222211 011111111 123444433333 234566677788888888888842 2 Q ss_pred cccccccccccccch--hhhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHH Q lcl|NC_012530. 359 MQNRGGATGNKSNSL--NESNN---QNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSV 433 (559) Q Consensus 359 ~~~~~~~~~~~~~~~--~~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~ 433 (559) .. +. ++.++... -++.. ....+..+..+|+-.++.|...++. .......+.+.|+.....|..+.++.+ T Consensus 342 ~~--~~-gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~~---~~~d~~~i~i~f~~~~p~d~~e~~~~~ 415 (470) T protein:vir:10 342 NF--ES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNF---SDADKRHISQHWTRTKVEDSLTKAQIV 415 (470) T ss_pred cc--cc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc---cCcccceeeEEeccCCCCCHHHHHHHH Confidence 21 11 11111000 11111 1122344445555555555444432 122345678899999999999998877 Q ss_pred HHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCcccccc Q lcl|NC_012530. 434 QLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNS 512 (559) Q Consensus 434 ~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (559) ... .|.|+.--++++++. ++ |. ...+..+. .+.++.....+. ....++ .+.+.+. T Consensus 416 ~~~-~g~iS~et~l~~~p~--v~--D~------~~E~eri~----~E~~e~~~~~~~-------~~~~~~--~~~dde~ 470 (470) T protein:vir:10 416 STV-ANYSSKEAVAKANPI--VD--DW------QQELKDLA----KDKEENDPYSNQ-------ADELNG--KGVNDEQ 470 (470) T ss_pred HHH-hccCcHHHHHHhCCC--CC--CH------HHHHHHHH----HHHHHHHHhhcc-------ccccCC--CCCCCCC Confidence 664 466788777776643 22 10 00111111 010000000000 000000 0000000 No 221 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=98.03 E-value=6.5e-06 Score=49.01 Aligned_cols=441 Identities=9% Similarity=0.026 Sum_probs=156.8 Q ss_pred Ccchhhhc--cccccCCcc----hHHHHHHHHHHHH------HHhhhhccccccccccccccccccccccccccccCCCC Q lcl|NC_012530. 1 MGIFDRFR--TKFYTDDPN----AFFKHIDSKIAND------TASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPI 68 (559) Q Consensus 1 ~~~~~~~~--~~~~~~~~~----~~~~~~~~~~~~~------~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~ 68 (559) .+ |.+++ .-++-+++. +.+..+-.....+ .+..=-.|++.... . .. ...+ .. T Consensus 3 ~~-~~~~~~~~~~~~~~~~~l~~~~i~~li~~~~~~~~~r~~~l~~YY~g~~~~i~----~--------~~--~~~~-~~ 66 (506) T protein:vir:94 3 YD-LTEHKQANLIYQESLENLTPNKIMKFITHHFNYQRPRLEMLDDYYQGYNLKIL----D--------KQ--SRRH-ED 66 (506) T ss_pred cc-hhhhhcceeecccchhcCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc----c--------cc--cccc-cc Confidence 22 22222 112222211 1111111110000 00100112111000 0 00 0000 00 Q ss_pred CcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCC Q lcl|NC_012530. 69 AFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPI 148 (559) Q Consensus 69 ~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~ 148 (559) .+.+. + ...+....+|+..+.-+. |.+..+...+. ...+.+..|+..- T Consensus 67 ~~~~~----k--i~~n~~~~Iv~~~~~~l~-----------G~p~~~~~~d~--------~~~~~l~~~~~~N------- 114 (506) T protein:vir:94 67 GKADH----R--ATHSFAKYIADFQTSYSV-----------GNPINVKLPDD--------GSNSGFDTFNKAN------- 114 (506) T ss_pred cCCcc----e--eecchHHHHHHHhhhhhc-----------ccCceeecCcc--------hHHHHHHHHHhcc------- Confidence 00000 0 123444455555544332 22222222111 1123345554431 Q ss_pred hhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCccc-ccccceEEEEEec--Cce-------e Q lcl|NC_012530. 149 RDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGH-RRTRGKIYRQYID--NKV-------R 218 (559) Q Consensus 149 ~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~-~~~~~~~y~~~~~--~~~-------~ 218 (559) .+......+..+++.+|.+|..+.++.+|++. +..++|..+.++.++... .....++|+.... +.. . T Consensus 115 --~~~~~~~~~~~~~~~~G~a~~~v~~ded~~~~-i~~~~p~~~~~v~dd~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 191 (506) T protein:vir:94 115 --DVDAENYDLFLDMSRYGRAYEYVYRGEDNEEH-LAKLDPLDTFVIYSTDVDPKPIMAVRYHQIELVDDNQVSTINYVP 191 (506) T ss_pred --CHhHHHHHHHHHHHhcCeEEEEEEecCCCeeE-EEEEcccceEEEecCCCCCceEEEEEEEeeeeccCCceeEEEEEE Confidence 23345566788889999999999999888764 777899999888765331 1111222222110 000 0 Q ss_pred eeecccceEEEe------------cccCC-----CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEec Q lcl|NC_012530. 219 GSFTADEMGMFI------------RNPRS-----DILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVK 281 (559) Q Consensus 219 ~~~~~~evi~~~------------~n~~~-----~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~ 281 (559) ..+....+.++. .|+.. .......|.|.++.....++....+..-..+.....+.|-.+|+-. T Consensus 192 ~~yt~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~ 271 (506) T protein:vir:94 192 ETWTADTYTLYNPTPIMGKMQVDTTKPITTFPVVEFKNSNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGD 271 (506) T ss_pred EEEeCceEEEeccccCccceeccccccCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcC Confidence 011111111110 01000 0001112445444444433333222222222222112222222100 Q ss_pred -----------------CccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeecccc-chhHHHHHHHHHH Q lcl|NC_012530. 282 -----------------PSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQA-EDMQFQSWLNYLI 343 (559) Q Consensus 282 -----------------~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~-~D~qf~e~~~~~~ 343 (559) .............+.++.......-.....+. +-..+++.+.+-++.+ .+..+....+... T Consensus 272 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~ 350 (506) T protein:vir:94 272 IDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKEMKDANMLLLKSGMT-VNGTQTSVDAKYINKTYDVVGSEAYKKRVA 350 (506) T ss_pred ccccccchhccccccccccccccccccchhHHHhhhhhcCeeeeccccc-ccCccccccceeeeecCCHHHHHHHHHHHH Confidence 00000001111111111111111100110010 0111112233333322 2344667778888 Q ss_pred HHHHHHhCCCHHHhccccccccccccccchhh--h---hHHHHHHHHHHHHhhHHHHHHHHHHHhhc-cccccCccceee Q lcl|NC_012530. 344 NIICALVAMDPAEIGMQNRGGATGNKSNSLNE--S---NNQNKIDASKSKGLMPLLDMIAKNLTNGI-IRQILGDNYMLE 417 (559) Q Consensus 344 ~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~--a---n~~~~~~~~~~~~l~P~~~~ie~~ln~~L-~~~~~~~~~~~~ 417 (559) +.|...-++|..-.+ +..++.++...-. . +-....+.++...|+..+..|...++..= ........+.+. T Consensus 351 ~~I~~~s~~p~~~~~----~~~~n~Sg~Aik~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~ 426 (506) T protein:vir:94 351 GDIHKFSHTPDLTDE----NFASNSSGVAMQYKVLGTVELASTKRRMFERGLYARYQIISDIENSIHGDWTFDPQELTFT 426 (506) T ss_pred HHHHHHhCccccccc----cccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceEE Confidence 999999999963221 1111111110000 0 11122334556666666666655544210 011223457788 Q ss_pred ecchhhhhHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCC Q lcl|NC_012530. 418 FVGGDTRSQQDKLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQN 497 (559) Q Consensus 418 f~~l~~~d~~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (559) |+.....|..+.++.+..+ .|.|+...++++++. ++ |. ...+..+ ..+........... ... T Consensus 427 f~~~~p~d~~e~a~~~~kl-~g~iS~et~~~~lp~--v~--d~------~~E~~ri------~~E~~~~~~~~~~~-~~~ 488 (506) T protein:vir:94 427 FRDNLPADNISQIKALVQA-GATLPQKYLYQQLPG--VT--NP------QDIVDMM------KEQSANGDYSFDQN-GVI 488 (506) T ss_pred eCCCCCcCHHHHHHHHHHH-hccCChHHHHHhCCC--CC--CH------HHHHHHH------HHHHHHHhhcchhh-cCC Confidence 9888889999999887765 467899888887644 21 10 0111111 11111111100000 000 Q ss_pred CCCCCCCCCccccccchhcccccccccccccccccccccc Q lcl|NC_012530. 498 PSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVG 537 (559) Q Consensus 498 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 537 (559) .++.+++ + ..+...++ ++ T Consensus 489 ~~~~~~~-~--~~~~~~~e-------------------~~ 506 (506) T protein:vir:94 489 SNDGQTN-T--TATQTDEE-------------------VR 506 (506) T ss_pred CcccCcc-c--cccccccC-------------------CC Confidence 0000000 0 00000000 11 No 222 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=97.97 E-value=8.8e-06 Score=48.29 Aligned_cols=430 Identities=11% Similarity=0.052 Sum_probs=154.7 Q ss_pred Ccchhhhc----cccccCCcchHHHHHHHHHH--HHHHhhhhccccccccccccccccccccccccccccCCCCCcccHH Q lcl|NC_012530. 1 MGIFDRFR----TKFYTDDPNAFFKHIDSKIA--NDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRIT 74 (559) Q Consensus 1 ~~~~~~~~----~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~ 74 (559) |--=|=|. .++..+.+.+.+.+...... .+.+..=-.|++.....+... .....+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~-----------------~~~~~~~- 62 (489) T protein:vir:99 1 MLQEDFEAIDYESKLWIDQLKNYISRFKAEQLERLKELKRYYLGDNNIKYRPAKT-----------------DKYAADN- 62 (489) T ss_pred CCccceeeeCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccc-----------------cccCCcc- Confidence 10000010 00111111122221111111 111122223333222221100 0000000 Q ss_pred HHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHH Q lcl|NC_012530. 75 DVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTS 154 (559) Q Consensus 75 ~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~ 154 (559) + ...+....+|+..+.-+. |.+..+...+. .....+..++.. ..+.. T Consensus 63 ---k--i~~n~~~~iv~~~~~~l~-----------g~~~~~~~~d~--------~~~~~l~~~~~~---------n~~~~ 109 (489) T protein:vir:99 63 ---R--IASDFAKYITVFEQGYML-----------GVPVEYKNENK--------DLQAAIDLMSVR---------NNEDY 109 (489) T ss_pred ---e--eecchHHHHHHHHhhhhc-----------cCCceeecCCh--------hHHHHHHHHHhh---------cChhH Confidence 0 113444455555543332 12222222111 111223333332 12334 Q ss_pred HHHHHHHHHHHcCCcceEEEE----CCCCcEEEEEEecCceEEEEecCccc-ccccceEEEEEecCc-----eeeeeccc Q lcl|NC_012530. 155 FLRKLVRDTYTYDQVNYENTY----DSNGRLSHTRMVDPTTIYFANDEHGH-RRTRGKIYRQYIDNK-----VRGSFTAD 224 (559) Q Consensus 155 f~~~~v~d~ll~Gna~~~i~r----d~~G~~~~L~~l~p~~V~~~~~~~g~-~~~~~~~y~~~~~~~-----~~~~~~~~ 224 (559) +...+..+++++|.+|..+.. |..|+ +.+..++|..+.++.++... .....++|+....+. ....+.++ T Consensus 110 ~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~-~~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~ 188 (489) T protein:vir:99 110 HNVKIKTDLSIYGRAYELLTVEKIDDKKTE-VKLYQLPAEQTFVIYDDTYQRNSLMAVHFYDIDYGSGKRKQIIKAYTSD 188 (489) T ss_pred HHHHHHHHHhhCCeEEEEEeeccCcCCCcc-eEEEEEcccceEEEEcCCCCCceEEEEEEEEEecCCCceEEEEEEEeCC Confidence 556778889999999987764 34444 45888999999888765432 122223333221111 11223333 Q ss_pred ceEEEec---------------ccCCC-----ccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCcc Q lcl|NC_012530. 225 EMGMFIR---------------NPRSD-----ILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSP 284 (559) Q Consensus 225 evi~~~~---------------n~~~~-----~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~ 284 (559) .+.++.. |+... ......|.|.++.....++....+..-..+.....+.|--++ .+.. T Consensus 189 ~i~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i--~g~~ 266 (489) T protein:vir:99 189 TIYTYEDYNLETKGMRLKDYEGHFFKGVPVNEYANNEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVI--AGNA 266 (489) T ss_pred cEEEEEecCCCcccceecccccccCCceeEEEeecCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhh--ccCC Confidence 3333221 00000 001123555555444444333333222222222233333222 1111 Q ss_pred CCccCCHHHHHHHHHHHHHHhcC------cccccccccccCC------ceeeeecccc-chhHHHHHHHHHHHHHHHHhC Q lcl|NC_012530. 285 SVTNTSMRALEDFKRHWTATSSG------INGAYRIPMITAE------DAKFVSMTQA-EDMQFQSWLNYLINIICALVA 351 (559) Q Consensus 285 ~~~~~~~e~~~~l~~~~~~~~~G------~~nag~~~vl~~g------~~~~~~ls~~-~D~qf~e~~~~~~~~Ia~~fg 351 (559) ... ... ..+...+.....+ ....+++..+..+ +.+...++.. .+..+....+...+.|...-+ T Consensus 267 ~~~-~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~ 342 (489) T protein:vir:99 267 YTG-ADE---NDYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTF 342 (489) T ss_pred ccc-ccc---hhhhhhcccccccccccccccccceeeeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhC Confidence 111 011 1111111111000 0111222222111 1122223322 222344566778888988888 Q ss_pred CCHHHh-ccccccccccccccchhh---hhH---HHHHHHHHHHHhhHHHHHHHHHHHhhcc---ccccCccceeeecch Q lcl|NC_012530. 352 MDPAEI-GMQNRGGATGNKSNSLNE---SNN---QNKIDASKSKGLMPLLDMIAKNLTNGII---RQILGDNYMLEFVGG 421 (559) Q Consensus 352 VPp~~l-g~~~~~~~~~~~~~~~~~---an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~---~~~~~~~~~~~f~~l 421 (559) +|..-. ++. ++.++.... +.. ....+..+..+|+-++..|...++..=. .......+.+.|+.. T Consensus 343 ~p~~~~~~~~------~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~ 416 (489) T protein:vir:99 343 TPDTQDMKFS------GVQSGESMKYKLMASDNYREKQERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPN 416 (489) T ss_pred Cccccccccc------ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCCC Confidence 885321 111 111111110 111 1222344555666666655554432111 111234578888888 Q ss_pred hhhhHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCC Q lcl|NC_012530. 422 DTRSQQDKLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGT 501 (559) Q Consensus 422 ~~~d~~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (559) ...|..+.++.+..+ .|+|+.-.+.++++. +..-|. ...+..+ +.+........+. ...++.++ T Consensus 417 ~p~d~~~~~~~~~kl-~giis~et~~~~l~~--v~~~d~------~~E~~ri----~~E~~~~~~~~~~--~~~~~~~~- 480 (489) T protein:vir:99 417 LPQNDNEIVTAAQNL-YGIVSDQTIFEILNT--VTGVDA------EAELKRL----KEEADKKQSLPEP--RLVGDASG- 480 (489) T ss_pred CCcCHHHHHHHHHHH-hccCCHHHHHHhcCC--CCchhH------HHHHHHH----HHHHHHHhccccc--cccCCCCC- Confidence 888888888877665 366888777776532 211110 0001111 0010000000111 11111100 Q ss_pred CCCCCcccccc Q lcl|NC_012530. 502 PPTLPPSSSNS 512 (559) Q Consensus 502 ~~~~~~~~~~~ 512 (559) +.++.+.++ T Consensus 481 --~~~~~~~~p 489 (489) T protein:vir:99 481 --QEEPTAEKP 489 (489) T ss_pred --CcCCCCCCC Confidence 001111111 No 223 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=97.93 E-value=2.1e-06 Score=51.76 Aligned_cols=192 Identities=14% Similarity=0.175 Sum_probs=91.9 Q ss_pred EEEecCccCCccCCH-HHHHHHHHHHHHHhcCcccccccccccCCceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHH Q lcl|NC_012530. 277 ILLVKPSPSVTNTSM-RALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPA 355 (559) Q Consensus 277 il~~~~~~~~~~~~~-e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~ 355 (559) |+++++-...-.... +.++++. +-..++|.. +. .+|..++-+|..++.+- .-+-+........||++-|||.. T Consensus 1 V~k~~~l~~~~~~~~~~~~~r~~--~~~~~~~~~--~~-~~ld~~~e~~e~~~~~l-sGl~d~l~~~~~~iaa~s~iP~t 74 (201) T protein:vir:10 1 MWKAKGLADLCDDSDGAARLRLA--QVDNNSGVG--QA-IGIDADSEEYNVLNSDI-GGIDTFLSQKFDRIVALSGIHEI 74 (201) T ss_pred CccchHHHHHhcCChHHHHHHHH--HHHHhhhhh--hh-heeecCCcceeeeecCc-CChHHHHHHHHHHHHhHhcCchh Confidence 444332100000111 1222222 123344432 22 33434434566554320 12345667777899999999988 Q ss_pred HhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHH- Q lcl|NC_012530. 356 EIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQ- 434 (559) Q Consensus 356 ~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~- 434 (559) .|-=...++.++++- ....|.-......-..-|+|.+.++-..+ .. ...+.|+|+.+...+.++++++.+ T Consensus 75 ~LfG~sp~Glnatge--~d~~nyyd~i~~~Qe~~l~p~le~l~~~~----~~---~~~~~~~f~pL~~~s~kekAei~~~ 145 (201) T protein:vir:10 75 ILKGKNVGGVSASQN--TALETFYGYVDRKRKAELLPLLEFLLPFI----VT---EQEWSVEFNPLSQVSDKDKSEILEK 145 (201) T ss_pred hhcCCCCccccccch--hHHHHHHHHHHHHHHHHHHHHHHHHHHhh----cC---CCCceEeeCCCCCCCHHHHHHHHHH Confidence 776555555543222 12233444444444566778777654322 11 246999999999888888877653 Q ss_pred ------HHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCc Q lcl|NC_012530. 435 ------LELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPP 507 (559) Q Consensus 435 ------~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (559) .++. |.++++|+|+.+--.+..++ .+. +.+.. .... ..+..|+..| T Consensus 146 ~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~----~~~-----~~~~~-----------~~~~-------~e~~dp~~~~ 198 (201) T protein:vir:10 146 NVNSVAALIAAGIIDADEARDTLRAISTEVK----IGE-----GSIQT-----------EVVI-------NESEDPLDVS 198 (201) T ss_pred HHHHHHHHHHcCCCCHHHHHHHHHhcCCcCC----CCC-----CCCCc-----------cccc-------cccCCCCCCC Confidence 3444 56899999998865443321 000 00000 0000 0000111111 Q ss_pred ccc Q lcl|NC_012530. 508 SSS 510 (559) Q Consensus 508 ~~~ 510 (559) .+. T Consensus 199 ~~~ 201 (201) T protein:vir:10 199 ANN 201 (201) T ss_pred CCC Confidence 111 No 224 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=97.83 E-value=1.6e-05 Score=46.90 Aligned_cols=421 Identities=10% Similarity=0.011 Sum_probs=163.6 Q ss_pred CcchhhhccccccCC-----------cchHHHHHHHHHHH-----HHHhhhhcccccccccccccccccccccccccccc Q lcl|NC_012530. 1 MGIFDRFRTKFYTDD-----------PNAFFKHIDSKIAN-----DTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPK 64 (559) Q Consensus 1 ~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~-----~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~ 64 (559) |.=++|-+.+-..++ .++.|..+-..... +.+..=-.|++....++... ...... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~--------~~~~~~- 71 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPFKR--------DVNGDY- 71 (478) T ss_pred CccccccCCchhhhHHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccchhh--------hccccc- Confidence 444444332211111 11111111111100 00111112322211111100 000000 Q ss_pred CCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCC Q lcl|NC_012530. 65 PSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVD 144 (559) Q Consensus 65 p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~ 144 (559) . ....+. + ...+....+|+..+.-+-. .+..+...+ .+..+.+..++.+ T Consensus 72 ~--~~~~~~----k--i~~n~~k~ivd~~~~yl~g-----------~p~~~~~~~--------~~~~~~l~~~~~n---- 120 (478) T protein:vir:10 72 D--ETKPDW----R--MYTNYHQNLVDQKVAYAVA-----------NPVTFGVDN--------DKALKQIQHTLNH---- 120 (478) T ss_pred c--cccccc----e--eccchHHHHHHHHhhhhcc-----------cCceeecCC--------hHHHHHHHHHHhc---- Confidence 0 000000 0 1134444555555443321 122221111 1122334444421 Q ss_pred CCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCc-ccccccceEEEEEecCceeeeecc Q lcl|NC_012530. 145 YSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEH-GHRRTRGKIYRQYIDNKVRGSFTA 223 (559) Q Consensus 145 ~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~-g~~~~~~~~y~~~~~~~~~~~~~~ 223 (559) .|......+..+.+.+|.+|..+..|.+|++ .+..++|..+.++.+.. ........+|+...+......+.. T Consensus 121 ------~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~~~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~ 193 (478) T protein:vir:10 121 ------KWDDKLVDILTAASNKGIEWVQPYVDEEGEF-KTFRVPAEQAVPIWTNKERDELQAFIRVYELDGAERVEYWTK 193 (478) T ss_pred ------cHHHHHHHHHHHHhhCCeEEEEEEecCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEeeeCceEEEEEeC Confidence 2344556677889999999999888988876 47788999988876532 222222333333322222333444 Q ss_pred cceEEEecc-------------------------------cCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_012530. 224 DEMGMFIRN-------------------------------PRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGG 272 (559) Q Consensus 224 ~evi~~~~n-------------------------------~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~ 272 (559) +.+.++... |.-.......|.|.++.....++....+..-..+.+...+ T Consensus 194 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~ 273 (478) T protein:vir:10 194 DDVTFYELKEGQLIPDFYRSEDHIQPHYYQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESV 273 (478) T ss_pred CcEEEEEecCCeeeccccccccccccceecccccccCCcceEEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 444333210 0000001224667666655555554444444444444444 Q ss_pred CCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc---cCCceeeeeccccchhHHHHHHHHHHHHHHHH Q lcl|NC_012530. 273 TTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI---TAEDAKFVSMTQAEDMQFQSWLNYLINIICAL 349 (559) Q Consensus 273 ~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl---~~g~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~ 349 (559) .|-.+++ +... .+. .+ +...+.. .++..+ .+++++|..... .+..+.+..+...+.|... T Consensus 274 ~~~~~~~--g~~~-~~~-~~----~~~~~~~--------~~~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~ 336 (478) T protein:vir:10 274 ELIYILK--GYEG-EDM-KD----FMHNLKY--------YKAISVAGESGSGVDTIKVEV-PIDSVKEYTKMLRDYIIEF 336 (478) T ss_pred Ccceeee--cCCc-ccc-cc----hhhhhhh--------CceeEecCCCCCcceEEeecC-CHHHHHHHHHHHHHHHHHH Confidence 5544432 2110 111 11 1111111 112222 223455554332 3445677788888899998 Q ss_pred hCCCHHHhccccccccccccccchh---hhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhh Q lcl|NC_012530. 350 VAMDPAEIGMQNRGGATGNKSNSLN---ESNN---QNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDT 423 (559) Q Consensus 350 fgVPp~~lg~~~~~~~~~~~~~~~~---~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~ 423 (559) -++|..-.+ +. +++.++... +... .......+..+|+-++..|...+. .......+.+.|+.... T Consensus 337 s~~p~~~~~----~~-~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~----~~~d~~~i~i~f~~~~p 407 (478) T protein:vir:10 337 GQGVDFQQD----KF-GNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYR----LDVRVQDIEITFNFNVM 407 (478) T ss_pred hCCcCcCcc----cc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC----CCcccccceEEeCCCCC Confidence 888853211 11 111111000 0001 111123333344444333332221 12223467888888888 Q ss_pred hhHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCC Q lcl|NC_012530. 424 RSQQDKLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPP 503 (559) Q Consensus 424 ~d~~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (559) .+..+.++.+..+ .|+++.--+.++++. ++ |. ...+..+.. +........+....+ ...+. T Consensus 408 ~~~~e~~~~~~~~-~g~iS~et~i~~~~~--v~--d~------~~E~~ri~~------E~~~~~~~~~~~~~~--~~d~~ 468 (478) T protein:vir:10 408 VNELENSQIAMNS-TGLLSKETILGNHSW--VQ--DP------VAEMERIEQ------ENIELNQQLPDIEEG--LNDEQ 468 (478) T ss_pred CCHHHHHHHHHHH-hCCCChHHHHHhCCC--CC--CH------HHHHHHHHH------HHHHHHHhccccCCC--Ccccc Confidence 8888888776554 466777667666643 11 10 011111111 111111100000000 00000 Q ss_pred CCCccccccc Q lcl|NC_012530. 504 TLPPSSSNSF 513 (559) Q Consensus 504 ~~~~~~~~~~ 513 (559) ...+.+.+++ T Consensus 469 ~~~~~d~~~e 478 (478) T protein:vir:10 469 QRQSEDNQSE 478 (478) T ss_pred cccCcCCCCC Confidence 0000010000 No 225 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=97.75 E-value=2.2e-05 Score=46.10 Aligned_cols=446 Identities=11% Similarity=0.124 Sum_probs=178.2 Q ss_pred Ccch-hhhccccccCCcchHHHHHHHHHHHHHHhhhhcccccccccccccccc-------cccccccccc-ccCCC-CCc Q lcl|NC_012530. 1 MGIF-DRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLM-------FSTLEDTSIV-PKPSP-IAF 70 (559) Q Consensus 1 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~-------~~~~~~~~~~-~~p~~-~~~ 70 (559) |++. .+|...++..+ + ....+....+-.+-..|....+. ..+..++++. ..-.. ... T Consensus 1 m~~~~l~lf~f~~k~~---------e----~~~~~~~~~~~~s~~~p~~~dGa~~I~~~~~~~~~~~~~~~~~~~~~~~~ 67 (521) T protein:vir:10 1 MNPIFLKLLQPWMKDD---------E----KRVQSDLSDRIDSFAVPDTADGAIEVDKQIDTTAPKTAIVQSVLGYAPKI 67 (521) T ss_pred CCcchhHHhhhhhhhh---------h----hHHhhhhccCccccccccCCCCceeeccCCCccccccchhhhhhcccccc Confidence 6542 11211111111 0 01111110011111111111000 0000011110 00000 011 Q ss_pred ccHHHHH---HHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCC Q lcl|NC_012530. 71 GRITDVL---RQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSP 147 (559) Q Consensus 71 ~~~~~~~---~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~ 147 (559) .+..++. +..+.+|.|..+|.-|.+.+. ..+.+.....|.+.+... ....+.++..--..+.+.. +... T Consensus 68 ~n~~eLI~~YR~ma~~pEvd~Av~eIvneai------v~d~~~~pV~i~Ld~~~~-s~~iK~kI~eeF~~Il~ll-~F~~ 139 (521) T protein:vir:10 68 QNTKDLINQYRSLSKYHEVDNAIDEIINDAI------VQEDNRDTVYLDLDKTDW-NESVKEMVREEFRTILKLL-KFER 139 (521) T ss_pred chHHHHHHHHHHHhhccchhhHHHhhhcceE------EecCCCceEEEEecCccc-chHHHHHHHHHHHHHHHHh-ccch Confidence 1333443 445678999999998888753 344455555666644432 3333333333222333321 1111 Q ss_pred ChhhHHHHHHHHHHHHHHcCCcceEEEECC---CCcEEEEEEecCceEEEEec-----Cccc-ccccceEEEEEe----- Q lcl|NC_012530. 148 IRDDFTSFLRKLVRDTYTYDQVNYENTYDS---NGRLSHTRMVDPTTIYFAND-----EHGH-RRTRGKIYRQYI----- 213 (559) Q Consensus 148 ~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~---~G~~~~L~~l~p~~V~~~~~-----~~g~-~~~~~~~y~~~~----- 213 (559) --..+++.+++.|..|..++-|. ..-+.+|..|||.+|+.+.- ..|. ......-|+.+. T Consensus 140 -------~~~~~fR~WYVDgRi~fHkiid~~~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~ 212 (521) T protein:vir:10 140 -------EGKRHFRRWYVDSRIYFHKMIDPARPKDGIKELRLLDPRNVEYYRVNLKSNENGNDVYKGVKEFFTYGATEDN 212 (521) T ss_pred -------hhhHHHhhheeeeeEEEEEEeeCCCccccceeeeeeCCcceeeeeeecCCCCCcchhhccceeeeeeccCCCc Confidence 12234556678899999987763 23499999999999865432 1111 111111122221 Q ss_pred ----cCce--eeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCc Q lcl|NC_012530. 214 ----DNKV--RGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVT 287 (559) Q Consensus 214 ----~~~~--~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~ 287 (559) .+.. ...++. +.|++.+.-+-+ .++++.+|-|..|...+....-.+...-=|==.-|.-+-|.-++-+..|. T Consensus 213 ~~~~~g~~~~~vkI~~-daI~y~hSGL~d-~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk 290 (521) T protein:vir:10 213 RYNISGNSNNLVQIPI-DAIVYSHSGKVD-IDGKTIVGYLHNVIKPANQLKMLEDAMVIYRITRAPERRVFYIDVGTMPN 290 (521) T ss_pred eecCCCCCCcceeech-hheeeeccccee-CCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCc Confidence 1111 112444 444443321222 23567788899998888777777666554433334444555555443332 Q ss_pred cCCHHHHHHHHHHHHHHh------cCcccccc-ccccc--------CC-ceeeeeccccchhHHHHHHHHHHHHHHHHhC Q lcl|NC_012530. 288 NTSMRALEDFKRHWTATS------SGINGAYR-IPMIT--------AE-DAKFVSMTQAEDMQFQSWLNYLINIICALVA 351 (559) Q Consensus 288 ~~~~e~~~~l~~~~~~~~------~G~~nag~-~~vl~--------~g-~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fg 351 (559) .-.++-+..+-..+++.. +...+..+ ..+++ +| |.++..|.-...+--++-..|..+.+.++++ T Consensus 291 ~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEI~TLpggqnlgem~DV~YF~kkLy~aLn 370 (521) T protein:vir:10 291 KKATQHLNNVMQGLKNRVVYDSSTGKVKNSSNNLAMTEDYWLMRRDGKATTEVSTLPGAQSMGEMDDVRWFNRKLYESMK 370 (521) T ss_pred hhHHHHHHHHHHhcCceEEEeccCceeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhC Confidence 211111122222221111 01111111 11221 11 2334433322334456677788889999999 Q ss_pred CCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhc----c-----ccccC----ccceeee Q lcl|NC_012530. 352 MDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGI----I-----RQILG----DNYMLEF 418 (559) Q Consensus 352 VPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L----~-----~~~~~----~~~~~~f 418 (559) ||.+.|+... ++++-+.++..++ ++. -....|.-+..++...|...| + ++.++ ..+.|+| T Consensus 371 VP~sRl~~e~-~~f~~Gr~~EItR---DEi---kF~KFI~rLR~rFs~~f~~~L~~qLilKgiit~eew~~i~~~I~~~f 443 (521) T protein:vir:10 371 IPLSRLPQEG-AGVTFGAGNDITR---DEL---QFTKYIRGLQQQFEPIFLNPLRTNLMLKGKMSVSEWEEQAENIKVVF 443 (521) T ss_pred CCccccCCCC-CceecccccchhH---HHH---HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEe Confidence 9999987542 2233222222222 111 112334445555544444333 2 22222 3467777 Q ss_pred cchhhhhH-------HHHHHHHHHH-----HcCCCCHHHHHH-HhCCCCCCCCCEeeccceecccccccccccccccccc Q lcl|NC_012530. 419 VGGDTRSQ-------QDKLKSVQLE-----LQTATTVNDYRE-KQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQ 485 (559) Q Consensus 419 ~~l~~~d~-------~~~~~~~~~~-----~~~~~T~NE~R~-~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~ 485 (559) .....-.+ ..|+.++..+ +..+++.+=||+ .|.+.-.+ +. .+..+...+... T Consensus 444 ~~Dn~f~ElKe~eil~~R~~~l~~~dp~~yvGky~s~dyi~k~ILr~tDee-------------ik--~~~k~I~~E~~~ 508 (521) T protein:vir:10 444 SKDSYYEEIKDVEILERRVNLVQTLASAEVTGKYLSHEYVMKNILRMSDED-------------IK--TEREKIDGELKD 508 (521) T ss_pred eecchHHHHHHHHHHHHHHHHHHhhcCccccccccchHHHHHHHhcCCHhH-------------HH--HHHHHHHHhhhC Confidence 65433333 3344443332 111345554543 33432110 00 000011000000 Q ss_pred cccccccccCCCCCCCCCCCCccccccc Q lcl|NC_012530. 486 TRLTQLESALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 486 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (559) . ...++++ +.+.+ T Consensus 509 ~-------~~~~p~~--------e~~df 521 (521) T protein:vir:10 509 S-------VYKNPED--------PMEEF 521 (521) T ss_pred C-------CCCCCcc--------hhhcC Confidence 0 0000111 00111 No 226 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=97.74 E-value=2.3e-05 Score=46.00 Aligned_cols=432 Identities=12% Similarity=0.036 Sum_probs=175.0 Q ss_pred Ccchhhhc---cccccCCcc-hHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHH Q lcl|NC_012530. 1 MGIFDRFR---TKFYTDDPN-AFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDV 76 (559) Q Consensus 1 ~~~~~~~~---~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~ 76 (559) |||+++.- .-...=+++ ..+..+.+..-. ..+-...+.... +-.-+|... . +++. . . T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~-----~~~~~w~~~-~-~~~~-----~-~ 61 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNGSEPELIPKYLPL------VPDNQKEWSKDS-----YLTSLWAQG-Y-VPTV-----H-D 61 (518) T ss_pred CcchhhHHHHHHHhhcCCCCccchhccHHHhhh------cccchhhhhhhh-----hhhhhcccC-C-CCcc-----c-c Confidence 99988762 111111111 122222221000 001111110000 000011111 0 1111 1 1 Q ss_pred HHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHH Q lcl|NC_012530. 77 LRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFL 156 (559) Q Consensus 77 ~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~ 156 (559) ++ ...+.-.++++.+|+-|..=+ ..+.| ...+..+++ ...+.+.++|.. ..|+.-+ T Consensus 62 ~~--~~~~l~~~i~~~~A~ll~~e~---------~~i~v--~~~~~~d~e--~~~~~l~~il~~---------n~f~~~~ 117 (518) T protein:vir:78 62 KL--MNSGTGNEIVVVAAEYISGKP---------LSIDV--TGVNGSKDE--NLTKQLKEALRI---------DNFDSKS 117 (518) T ss_pred cc--ccCChHHHHHHHHHHhhcCCC---------ceEEe--cCccccCcH--HHHHHHHHHHHh---------ccHHHHH Confidence 11 233445556666666654211 11222 121111111 122234444432 2345556 Q ss_pred HHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCccccc-----------ccceEEE--------------- Q lcl|NC_012530. 157 RKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRR-----------TRGKIYR--------------- 210 (559) Q Consensus 157 ~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~-----------~~~~~y~--------------- 210 (559) ...+.+.+..|.+++-+..+ +|++ .+..++|..+.+... +|... .+...|. T Consensus 118 ~~~~e~a~a~G~~~~k~~~d-~~~~-~i~~v~ad~~~P~~~-~g~~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~ 194 (518) T protein:vir:78 118 VKIVELAGGSGVSAVKINIL-NGRP-SISVHSSSQFWIDFK-NNEPFRFNFFEEIPTSNKADIYYLVESREIKQWDKEGK 194 (518) T ss_pred HHHHHHhhccCceEEEEEEE-CCee-EEEEEcCCeeEEEee-cCcEEEEEEEEEeecCCcceeEEEEEeeccccccceee Confidence 66788888999998877776 4664 577788888877542 23110 0111111 Q ss_pred ----------EEecC-c-eee------------eecc--------------cceEEEecccCC--CccCCcccccHHHHH Q lcl|NC_012530. 211 ----------QYIDN-K-VRG------------SFTA--------------DEMGMFIRNPRS--DILSGGYGLSELEMG 250 (559) Q Consensus 211 ----------~~~~~-~-~~~------------~~~~--------------~evi~~~~n~~~--~~~~~~~G~Spl~~~ 250 (559) .+... . .+. .... .-.+.+..|+.. ...+.++|+|.+.-+ T Consensus 195 ~~~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~ 274 (518) T protein:vir:78 195 KLSGGFVTYSVIKIDGDKTTPISAERLPEQITSYLHTNDIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQC 274 (518) T ss_pred cccceeEEEEEeeecCcccccccccccccccccccccccCccceeeccCCccceEEeeccccccccccCCCcCcchHhhh Confidence 11000 0 000 0000 001222233321 223457899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCceEEEe---cCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCce----e Q lcl|NC_012530. 251 LREFISHENTELFNDRFFTHGGTTKGILLV---KPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDA----K 323 (559) Q Consensus 251 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~---~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~----~ 323 (559) ...|......-....+-|+.| .+..++.- +.....+...+ .-.|... .+.|... .....++. . T Consensus 275 ~~~id~lD~~~s~~~~e~~~g-~~~i~v~~~~l~~~~~~~~~~~--~~~fd~~-~~~y~~i------~~~~~~~~~~~~~ 344 (518) T protein:vir:78 275 TNYLFAVDYFFTVYMREGEKT-KTKIAASERMFRKKVNKSTDKE--EWSMNVD-EDYFMQF------KGTLDAGAKLNDM 344 (518) T ss_pred hHHHHHHHHHHHHHHHHHHhC-CceeeechhHhccCCCCCCCcc--ccccCCC-CceEEEe------cCcCCCCCccccc Confidence 988888777766666667764 44433310 00000000000 0000000 0001100 00011111 1 Q ss_pred eeeccc-cchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhh--HHHHHHHHHHHHhhHHHHHHHHH Q lcl|NC_012530. 324 FVSMTQ-AEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESN--NQNKIDASKSKGLMPLLDMIAKN 400 (559) Q Consensus 324 ~~~ls~-~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an--~~~~~~~~~~~~l~P~~~~ie~~ 400 (559) ++.++. =.+.++.+..+...+.|....|++|..+|..+.. .++.+..+.+... ........+..+|.-++..|... T Consensus 345 i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~~-~TATei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l 423 (518) T protein:vir:78 345 IQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGNRE-VKATEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYL 423 (518) T ss_pred eeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 333331 1456889999999999999999999999864322 2221111111000 11122334444555444444433 Q ss_pred HHhhccc-----cccCccceeeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCCCEeeccceeccccccc Q lcl|NC_012530. 401 LTNGIIR-----QILGDNYMLEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQE 474 (559) Q Consensus 401 ln~~L~~-----~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~ 474 (559) +...... ......+.|+|+.....|..+.++.+..++. |.|++.++-+++... ...-+. ..-+..+ T Consensus 424 ~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~~~~~~v~aGimS~e~~i~~~~~~-~~deea------~~e~~ri- 495 (518) T protein:vir:78 424 LTGGTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSSTLNNMNSALAMSVEEKVKLIHPK-WEDEEI------QAEVKRI- 495 (518) T ss_pred HHhhcCccccccCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHHhCCC-CCHHHH------HHHHHHH- Confidence 3221111 0112347788988899999999998887775 568998855544211 110000 0000000 Q ss_pred ccccccccccccccccccccCCCCCCCCCCCCccccccchhcccccccccccccc Q lcl|NC_012530. 475 QIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSG 529 (559) Q Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 529 (559) ..+... . .+.++++. .-.++++ | T Consensus 496 -----~~E~~~--~--------~~~~p~~~---~g~~~~~--------------g 518 (518) T protein:vir:78 496 -----YLENAI--G--------EVPDPEAI---GGMETKG--------------G 518 (518) T ss_pred -----HHHhcc--c--------CCCCCccc---cCCCCCC--------------C Confidence 000000 0 00000000 0000000 0 No 227 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=97.62 E-value=3.6e-05 Score=44.94 Aligned_cols=452 Identities=12% Similarity=0.121 Sum_probs=174.1 Q ss_pred Ccc--hhhhccccccCCcchHHHHHHHHHHHHHHhhhhcccccccccccccccccccc-ccccc------cccCCCCCcc Q lcl|NC_012530. 1 MGI--FDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTL-EDTSI------VPKPSPIAFG 71 (559) Q Consensus 1 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~-~~~~~------~~~p~~~~~~ 71 (559) ||. |+=| +.++.++= -.+.+..+..+.. .....-..+..-.......+. ...++ ..-+...+.. T Consensus 1 m~f~~~~lf-~f~~~~de-~~~~~~~~~~~~S-----~~~p~~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~~~~~~ 73 (523) T protein:vir:68 1 MKFNILSLF-APWAKMDE-RDYKDQEKENLES-----ITSPKLDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPGLKSTR 73 (523) T ss_pred CCCchhhhh-hhhhhhhh-hhhhhhhhccCCC-----ccccCCCCcceeeeccccccccccchhhhhhhhccccccchHH Confidence 665 4433 22221111 0111111111100 000000111110000000000 00000 0111112222 Q ss_pred cHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhh Q lcl|NC_012530. 72 RITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDD 151 (559) Q Consensus 72 ~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~ 151 (559) .+...-+..+.+|.|..+|.-|.+.+. ..+.+.....|.+.+. +.....+.++..--..+.+... ... T Consensus 74 eLI~~YR~ma~~pEvd~Av~eIVneai------v~d~~~~pV~i~Ld~~-~~s~~iK~kI~eeF~~Il~ll~-F~~---- 141 (523) T protein:vir:68 74 ELIDTYRNLMTNYEVDNAVSEIVSDAI------VYEDDTEVVSINLDNT-KFSPNIKSMMLDEFNEVLNHLS-FQR---- 141 (523) T ss_pred HHHHHHHHHhhccchhhHHHHhhccee------eecCCCceEEEEeccc-ccchHHHHHHHHHHHHHHHHhc-cch---- Confidence 233333455778999999998888753 3444455555655443 2344444444333333333211 111 Q ss_pred HHHHHHHHHHHHHHcCCcceEEEECCC---CcEEEEEEecCceEEEEe-----cCcccccccc-eEEEEEecCc------ Q lcl|NC_012530. 152 FTSFLRKLVRDTYTYDQVNYENTYDSN---GRLSHTRMVDPTTIYFAN-----DEHGHRRTRG-KIYRQYIDNK------ 216 (559) Q Consensus 152 ~~~f~~~~v~d~ll~Gna~~~i~rd~~---G~~~~L~~l~p~~V~~~~-----~~~g~~~~~~-~~y~~~~~~~------ 216 (559) --..+++.+++.|..|..++-|.. .-+.+|..|||.+|+.++ .+.|.....+ .-|+.+..+. T Consensus 142 ---~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~g~~vi~~~~e~f~Y~~~~~~~~~~ 218 (523) T protein:vir:68 142 ---KGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREVITTTEAGVKIVKGYKEYFIYDTSHESYACD 218 (523) T ss_pred ---hhhHHHHhheeeeEEEEEEEeeCCCccccceeeeeeCCcceeEEEeecCCCCcchhhhhhhhhheeecccccccccc Confidence 122335566788999999887632 348999999999986532 2222211111 1111111110 Q ss_pred -------eeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccC Q lcl|NC_012530. 217 -------VRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNT 289 (559) Q Consensus 217 -------~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~ 289 (559) ....++ .+.|++.+.-.-+. .+..=+|-|..|...+....-.+...-=|==.-|.-+-|.-++-+..|..- T Consensus 219 g~~~~~~~~ikI~-~dAI~y~hSGL~d~-~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~K 296 (523) T protein:vir:68 219 GRIYEAGTKIKIP-KAAIVYAHSGLVDC-CGKNIIGYLHRAIKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPSRK 296 (523) T ss_pred ccccCCCcceecc-hhheeeeeccceeC-CCCceeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchh Confidence 111222 34444433111111 111224557777777766666666554443333334455555544333221 Q ss_pred CHHHHHHHHHHHHHHh-----cC-cccccc-ccccc--------CC-ceeeeeccccchhHHHHHHHHHHHHHHHHhCCC Q lcl|NC_012530. 290 SMRALEDFKRHWTATS-----SG-INGAYR-IPMIT--------AE-DAKFVSMTQAEDMQFQSWLNYLINIICALVAMD 353 (559) Q Consensus 290 ~~e~~~~l~~~~~~~~-----~G-~~nag~-~~vl~--------~g-~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVP 353 (559) .++-+..+-..+++.. .| ..+..+ ..+++ +| +.++..|.-...+--++-..|..+.+.++++|| T Consensus 297 AeqYl~~im~k~kNKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP 376 (523) T protein:vir:68 297 AAEHMQHVMNTMKNRIAYDATTGKIKNQQHIMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDVRWFRNALYMALRIP 376 (523) T ss_pred HHHHHHHHHHhhcceeEEeccCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCc Confidence 1111122222221111 01 011111 11221 11 233433322233444666778888999999999 Q ss_pred HHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhc----c-----ccccC----ccceeeecc Q lcl|NC_012530. 354 PAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGI----I-----RQILG----DNYMLEFVG 420 (559) Q Consensus 354 p~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L----~-----~~~~~----~~~~~~f~~ 420 (559) .+.|.-. .++++-+.++..++ ++. -....|.-+..++...|...| + ++.++ ..+.|+|.. T Consensus 377 ~sRl~~~-~~~f~~Gr~~EItR---DEi---kF~KFI~rLR~rFs~lf~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~ 449 (523) T protein:vir:68 377 ITRIPSD-QGGIQFDAGTSITR---DEL---SFGKFIRELQHKFEEIFLDPLKTNLILKGIITEDEWNDEINNIKIKFHR 449 (523) T ss_pred ceeecCC-CcceecccccchhH---HHH---HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeee Confidence 9999532 23333222222222 111 112334445555544444333 2 22222 346777765 Q ss_pred hhhhhHH-------HHHHHHHHHHc--C-CCCHHHHHH-HhCCCCCCCCCEeeccceecccccccccccccccccccccc Q lcl|NC_012530. 421 GDTRSQQ-------DKLKSVQLELQ--T-ATTVNDYRE-KQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLT 489 (559) Q Consensus 421 l~~~d~~-------~~~~~~~~~~~--~-~~T~NE~R~-~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 489 (559) ...-.+. .|+.++..+-- | .++.+=||+ .|.+.-.+ +. .+..+.+.+..... T Consensus 450 Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDee-------------i~--~~~kqI~~E~k~~~-- 512 (523) T protein:vir:68 450 DSYFSELKDAEILERRINMLQMAEPFIGKYISHRTAMKDILQMSDEE-------------IE--QEAKQIEEESKEAR-- 512 (523) T ss_pred cchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHH-------------HH--HHHHHHHHHhhcCC-- Confidence 4433333 33333332211 1 134444443 33332110 00 00001110000000 Q ss_pred cccccCCCCCCCCCCCCccccccc Q lcl|NC_012530. 490 QLESALQNPSGTPPTLPPSSSNSF 513 (559) Q Consensus 490 ~~~~~~~~~~~~~~~~~~~~~~~~ 513 (559) ..++++. .+.+ T Consensus 513 -----~~~p~~e--------~~~f 523 (523) T protein:vir:68 513 -----FQDPDQE--------QEDF 523 (523) T ss_pred -----CCCCchh--------hhcC Confidence 0001110 0111 No 228 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=97.42 E-value=7.1e-05 Score=43.32 Aligned_cols=479 Identities=13% Similarity=0.091 Sum_probs=177.5 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhcccccccccccccccccccc-ccc-cccccCCCCCcccHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTL-EDT-SIVPKPSPIAFGRITDVLR 78 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~-~~~-~~~~~p~~~~~~~~~~~~~ 78 (559) |-=|=.| .+ +++ .+...+ +-..+-..-..|...+..+ +.. -+...+...+...+...-+ T Consensus 1 m~~lfgf---~~-----------~~~--~~~~~~---~~s~~~p~~ddg~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR 61 (558) T protein:vir:10 1 MAKLFGF---SI-----------EET--QKKSTS---IISPVPKNNEDGVDNFISSGFYGQYVDIEGAYRSEYDLIRRYR 61 (558) T ss_pred Ccchhcc---hh-----------hhh--hhhccC---CccccCCCccccccceeccceeeeeecccchhhhHHHHHHHHH Confidence 3333223 11 000 000000 0000000000011011000 000 0111111222223333334 Q ss_pred HHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHH Q lcl|NC_012530. 79 QYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRK 158 (559) Q Consensus 79 ~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~ 158 (559) ..+.+|.|..+|.-|.+.+. ..+.+....+|.+.+... ....++++..--..+.+.. + |..--.. T Consensus 62 ~ma~~pEvd~Av~eIVneai------v~d~~~~pV~i~Ld~~~~-s~~iK~kI~eEF~~Il~ll-~-------F~~~~~e 126 (558) T protein:vir:10 62 EMALHPEADGAIEDVVNEAI------VSDLYDSPVEVELSNLNA-SNTLKKKIREEFRYIKEMM-D-------FDKKSHE 126 (558) T ss_pred HHhhccchhhHHHHhhccee------EecCCCceEEEEecccCc-chHHHHHHHHHHHHHHHHh-c-------cchhhhH Confidence 55778999999998888753 344455555566544432 3333444433333333321 1 1111223 Q ss_pred HHHHHHHcCCcceEEEECCC---CcEEEEEEecCceEEEEecC----------------cccccccc-eEEEEEecCce- Q lcl|NC_012530. 159 LVRDTYTYDQVNYENTYDSN---GRLSHTRMVDPTTIYFANDE----------------HGHRRTRG-KIYRQYIDNKV- 217 (559) Q Consensus 159 ~v~d~ll~Gna~~~i~rd~~---G~~~~L~~l~p~~V~~~~~~----------------~g~~~~~~-~~y~~~~~~~~- 217 (559) +++.+++.|..|+.++-|.. .-+.+|..|||.+|+.++.- .+...... ..|+.+..... T Consensus 127 ~fR~WYVDgRiyfHKiid~k~pk~GI~ELr~lDPr~i~~Vr~i~~~~~~~~~~~~~~~~~~~~~~~~~~eyy~Y~~~~~~ 206 (558) T protein:vir:10 127 IFRNWYVDGRVFYLKVIDTKNPQEGIQDLRYIDPLKIKFIRQEKRKPGNQDPAIRVRSEQDVVPNPEFEEFYIYTPKVQH 206 (558) T ss_pred HHhhheeeeEEEEEEEEeCCCccccceeeeeeCcccceeeeeeccccccccceeeeecccceeeccceeEeeeecCCccc Confidence 45566789999999887632 34899999999998765432 11111111 11221211110 Q ss_pred -----------eeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCC Q lcl|NC_012530. 218 -----------RGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSV 286 (559) Q Consensus 218 -----------~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~ 286 (559) ...--+.+.|++.+--+-+. ++.+=+|-|..|...+....-.+...-=|==.-|.-+-|.-++-+..+ T Consensus 207 ~~~~~~~~~~~~~vkI~~dAI~y~hSGL~d~-~~~~i~syLhkAIKp~NQLkmlEDAlVIYRitRAPERRvFYIDVGnLP 285 (558) T protein:vir:10 207 PTGMVGQMGGKNSIKIAKDSITMCTSGLVDR-NKNRVLSYLHKAIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLP 285 (558) T ss_pred ccccceeecCCCceeechhheeeecccceec-CCCeeeecchHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCC Confidence 01111334444433211111 122224557777777766666666554433233333445544443333 Q ss_pred ccCCHHHHHHHHHHHHHHh-----cC-ccc-cccccccc--------C-CceeeeeccccchhHHHHHHHHHHHHHHHHh Q lcl|NC_012530. 287 TNTSMRALEDFKRHWTATS-----SG-ING-AYRIPMIT--------A-EDAKFVSMTQAEDMQFQSWLNYLINIICALV 350 (559) Q Consensus 287 ~~~~~e~~~~l~~~~~~~~-----~G-~~n-ag~~~vl~--------~-g~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~f 350 (559) ..-.++-+..+-..+++.+ .| ..+ ..-..+++ + -+.++..|.-...+-=++-..|..+.+.+++ T Consensus 286 k~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnLgem~DV~YF~kKLy~aL 365 (558) T protein:vir:10 286 KVKAEQYLKEVMSRYRNKLVYDANTGEVRDDRKFMSMMEDFWLPRREGGRGTEITTLPGGQNLGELSDVDYFQKKLYRAL 365 (558) T ss_pred chhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhhhcccccCCCCccceeeccccCCcchHHHHHHHHHHHHHHh Confidence 2211111122222222111 01 000 00111221 1 1233333322233434556778888999999 Q ss_pred CCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhc----c-----ccccC----ccceee Q lcl|NC_012530. 351 AMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGI----I-----RQILG----DNYMLE 417 (559) Q Consensus 351 gVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L----~-----~~~~~----~~~~~~ 417 (559) +||.+.|+-. ++++...++..++. +. -....|.-+..++...|...| + ++.++ ..+.|+ T Consensus 366 nVP~SRl~~e--~~f~~Gr~~EItRD---Ei---KF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~ 437 (558) T protein:vir:10 366 GVPESRIAAE--GGFNLGRSSEILRD---EL---KFAKFVGRLRKRFAAMFNDMLKTQLVLKNIVTPEDWKTMEDHIQYD 437 (558) T ss_pred CCCccccCCC--CcccccccchhhHH---HH---HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEE Confidence 9999999743 33333222222221 11 112234445555544444333 2 22222 346777 Q ss_pred ecchhhhh-------HHHHHHHHHHHHc--C-CCCHHHHHH-HhCCCCCCCCCEeeccceecccccccccc-----cccc Q lcl|NC_012530. 418 FVGGDTRS-------QQDKLKSVQLELQ--T-ATTVNDYRE-KQGLPKIAGGDIILSAVYIQRLGQQEQIK-----QNEF 481 (559) Q Consensus 418 f~~l~~~d-------~~~~~~~~~~~~~--~-~~T~NE~R~-~~gl~pi~gGD~~~~~~~~~~l~~~~~~~-----~~~~ 481 (559) |.....-. ...|+.++..+-- | .++.+=||+ .|.+.-.+=- ....++.+.. +.+. T Consensus 438 f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeeI~---------~~~kqI~~E~k~~~~~~p~ 508 (558) T protein:vir:10 438 FLYDNQFAELKESELMEGRLGMLATIEPYIGKYYSTEYVRKRVLRQTDMEIE---------EIDTQIEDEIQKGIIPDPS 508 (558) T ss_pred eeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHH---------HHHHHHHHHHhCCCCCCcc Confidence 76443322 2334444333211 1 235555543 3344211100 0000000000 0000 Q ss_pred cccccccccccccCCCCCCCCCCCCccccccchhcccccccccccccccccccccccccc Q lcl|NC_012530. 482 QRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQ 541 (559) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 541 (559) +..+.. ..+.++++++..+....++.+.+.....+.+.. +....++|... T Consensus 509 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~ 558 (558) T protein:vir:10 509 QIDPIT-GEPLPQEGDPAMEGMGEQPVDPDLEAQAQAVDA---------QYSKDTKKAEL 558 (558) T ss_pred ccChhh-ccccCccCCchhccCCCCCcccccccchhhhhh---------hhhhhhhhhcC Confidence 000000 001111112222222222222221111111111 00111111100 No 229 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=97.37 E-value=8.2e-05 Score=42.98 Aligned_cols=456 Identities=11% Similarity=0.112 Sum_probs=175.7 Q ss_pred HHHHHHHHHHhhh-h--cccccccccccccccccccc-ccccc-cccCCCCCcccHHHHH---HHHhhChHHHHHHHHHH Q lcl|NC_012530. 23 IDSKIANDTASKA-L--NGVDRAYTEPVDGNLMFSTL-EDTSI-VPKPSPIAFGRITDVL---RQYSMNVVLNAIINTRA 94 (559) Q Consensus 23 ~~~~~~~~~~~~~-~--~gr~~a~~~~~~~~~~~~~~-~~~~~-~~~p~~~~~~~~~~~~---~~~~~~~~v~acv~~ia 94 (559) ++++.+-=++++. + .+-..+...-..+...+..+ +.+.+ ...+ ...+..++. +..+.+|.|..+|.-|. T Consensus 1 ~~~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~g~~~~~e~---~~~~~~eLI~~YR~ma~~pEvd~Av~eIV 77 (537) T protein:vir:10 1 MAQQLFGFSLQRAKKVPKGPSFVQKDSLDGSQPIVGGGYFGYSVDFDG---TIRNDHELITRYREMVLNPECDSAVDDVV 77 (537) T ss_pred CccccccceeecccccccCCcccCCCcccccceeeccccccccccccc---ccchHHHHHHHHHHHhhccchhhHHHHhh Confidence 2222221111111 0 11111111111111111111 11111 1111 122334444 44567899999999888 Q ss_pred HHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEE Q lcl|NC_012530. 95 NQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENT 174 (559) Q Consensus 95 ~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~ 174 (559) +.+. ..+.+.....|.+.+-. .....+.++..--..+.+.. +... --..+++.+++.|..|+.++ T Consensus 78 neai------v~d~~~~pV~i~Ld~~~-~s~~iK~kI~eEF~~Il~ll-~F~~-------~~~e~fR~WYVDgRi~fhKi 142 (537) T protein:vir:10 78 NETI------CGNFDDVPISIDLHNLK-QSEKIKKLIRSEFDEILRLL-DFDN-------RAYEIFRRWYVDGRLFFHKV 142 (537) T ss_pred ccee------EecCCCceEEEEecccc-cchHHHHHHHHHHHHHHHHh-ccch-------hhhHHHhhheeeeEEEEEEE Confidence 8753 33444444555554432 23333333333222333321 1111 12234556678899999988 Q ss_pred ECCC---CcEEEEEEecCceEEEEec-----Cccccc--------ccceEEEEEec------CceeeeecccceEEEecc Q lcl|NC_012530. 175 YDSN---GRLSHTRMVDPTTIYFAND-----EHGHRR--------TRGKIYRQYID------NKVRGSFTADEMGMFIRN 232 (559) Q Consensus 175 rd~~---G~~~~L~~l~p~~V~~~~~-----~~g~~~--------~~~~~y~~~~~------~~~~~~~~~~evi~~~~n 232 (559) -|.. .-+.+|..|||.+|+.++- .++... ....-|+.+.. ......++ .+.|++.+. T Consensus 143 id~k~pk~GI~ELr~lDPr~i~~vR~i~~~~~~~~~~~~~~~~v~~~~~eyf~ynp~g~~~~~~~~vkI~-~dAI~y~hS 221 (537) T protein:vir:10 143 IDPKKPRQGLVELRYVDPRKIRKVTEYEAKRPEALRTQDLNQQLTQQSASYFLYNPKGLKNSTNQGMKIA-PDSIAYCHS 221 (537) T ss_pred EeCCCccccceeeeeeCCccceeeEeecccCCccceEEecceeeeecccceeeeccccccccCCCceecc-Hhheeeecc Confidence 7632 3489999999999865443 111110 00011111111 11111233 344554432 Q ss_pred cCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHh-----cC Q lcl|NC_012530. 233 PRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATS-----SG 307 (559) Q Consensus 233 ~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~-----~G 307 (559) -+-+. .+++.+|-|..|...+......+...-=|==.-|.-+-|.-++-+..+..-.++-+..+-..+++.+ .| T Consensus 222 Gl~d~-n~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TG 300 (537) T protein:vir:10 222 GIQDL-NKNMVLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTG 300 (537) T ss_pred cceeC-CCCeeeeeehhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCc Confidence 22222 2456778888888888777777666554433334444555555443332211111122222222111 01 Q ss_pred -ccc-cccccccc--------CC-ceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhh Q lcl|NC_012530. 308 -ING-AYRIPMIT--------AE-DAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNES 376 (559) Q Consensus 308 -~~n-ag~~~vl~--------~g-~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~a 376 (559) ..+ ..-..+++ ++ +.++..|.-...+--++-..|..+.+.++++||.+.|+-. ++++-..++..++ T Consensus 301 ev~ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnlgem~DV~YF~kKLy~aLnVP~SRl~~e--~~f~~Gr~~EItR- 377 (537) T protein:vir:10 301 EIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETE--TTFNIGRAAEITR- 377 (537) T ss_pred eecccchhhhhhhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCC--CcccccccchhhH- Confidence 000 01111221 11 2333333222334456677788899999999999999643 3333322222221 Q ss_pred hHHHHHHHHHHHHhhHHHHHHHHHHHhhc----c-----ccccC----ccceeeecchhhhh-------HHHHHHHHHHH Q lcl|NC_012530. 377 NNQNKIDASKSKGLMPLLDMIAKNLTNGI----I-----RQILG----DNYMLEFVGGDTRS-------QQDKLKSVQLE 436 (559) Q Consensus 377 n~~~~~~~~~~~~l~P~~~~ie~~ln~~L----~-----~~~~~----~~~~~~f~~l~~~d-------~~~~~~~~~~~ 436 (559) ++. -....|.-+..++...|...| + ++.++ ..+.|+|.....-. ...|+.++..+ T Consensus 378 --DEi---KF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~ 452 (537) T protein:vir:10 378 --DEV---KFQKFIARLRKRFSELFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQM 452 (537) T ss_pred --HHH---HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHh Confidence 111 112234445555544444333 2 22222 34667775443222 23344433332 Q ss_pred H--cC-CCCHHHHHH-HhCCCCCCCCCEeeccceeccccccccccccccccccccccccccc--------CC---CCCCC Q lcl|NC_012530. 437 L--QT-ATTVNDYRE-KQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESA--------LQ---NPSGT 501 (559) Q Consensus 437 ~--~~-~~T~NE~R~-~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~--------~~---~~~~~ 501 (559) - -| .++.+=||+ .|.+.-.+ +.. ...+...+........+... .+ .+... T Consensus 453 dpyvGky~s~dyi~k~ILr~tDee-------------I~~--~~k~I~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~ 517 (537) T protein:vir:10 453 DPYVGKYFSANYIRTKVLKQTESE-------------IKE--IDKEIKQEIADGVIMDPQAMQAMEMGIGDEEPVPEGGE 517 (537) T ss_pred hhhhhcccchHHHHHHHhccCHHH-------------HHH--HHHHHHHHhhCCCCCCcccccccccCCCCcccCCCCCC Confidence 1 01 134444433 33332110 000 00000000000000000000 00 00000 Q ss_pred CCCCCccccccchhccccccccccccccccccccccccccccccc Q lcl|NC_012530. 502 PPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQLKNKK 546 (559) Q Consensus 502 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~k~~~ 546 (559) +|+.++...++... ++. |+- T Consensus 518 ~~~~~~~~~~~~~~---------------------~~~----~~~ 537 (537) T protein:vir:10 518 EPQTDPNSAVSPAD---------------------QKR----GEL 537 (537) T ss_pred CcccCCccCCCCCC---------------------ccC----CCC Confidence 11111111110000 000 111 No 230 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=97.29 E-value=0.0001 Score=42.40 Aligned_cols=415 Identities=10% Similarity=0.013 Sum_probs=159.8 Q ss_pred Ccc--hhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHH Q lcl|NC_012530. 1 MGI--FDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLR 78 (559) Q Consensus 1 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~ 78 (559) |.| +.+. +.+-++.+-.++.+ -..+..=-.|++.....+............ ...+......+. + T Consensus 1 ~~~e~~~~~----i~~~~~~~~~~~~~---~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~---~~~~~~~~~~~~----k 66 (471) T protein:vir:10 1 MEIEVIKKI----ISSQMVKHGKFVSQ---AAEAEKYYRNENDIKRKRKPADKKGAENEA---KAEDNAFRNADN----R 66 (471) T ss_pred CCHHHHHHH----HHHHHHHHHHHHHH---HHHHHHHhccccccccccchhhhhcccccc---cccccccccccc----e Confidence 211 1111 11111111111100 111122222333222111110000000000 000000000000 0 Q ss_pred HHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHH Q lcl|NC_012530. 79 QYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRK 158 (559) Q Consensus 79 ~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~ 158 (559) ...+....+|+..+.-+.+-| ..+... ..+..+.+..|+.+ .|...... T Consensus 67 --i~~n~~~~Ivd~~~~yl~G~p-----------~~~~~~--------~~~~~~~l~~~~~n----------~~~~~~~~ 115 (471) T protein:vir:10 67 --ISHNWHQLLLDQKKAYALTYP-----------PTFDVD--------DKKVNDMIVDVLGD----------DYERISKQ 115 (471) T ss_pred --eccchhHHHHHhhhhhhcccC-----------ceeccC--------ChHHHHHHHHHHhc----------CHHHHHHH Confidence 113344445554443332111 111111 11122334444321 12344556 Q ss_pred HHHHHHHcCCcceEEEECC-CCcEEEEEEecCceEEEEecCcccc-cccceEEEEEecCc------eeeeecccceEEEe Q lcl|NC_012530. 159 LVRDTYTYDQVNYENTYDS-NGRLSHTRMVDPTTIYFANDEHGHR-RTRGKIYRQYIDNK------VRGSFTADEMGMFI 230 (559) Q Consensus 159 ~v~d~ll~Gna~~~i~rd~-~G~~~~L~~l~p~~V~~~~~~~g~~-~~~~~~y~~~~~~~------~~~~~~~~evi~~~ 230 (559) +..+++.+|.+|..+.++. +|++ .+..++|..+.++.++.... ....++|+...... ....+....+.|+. T Consensus 116 ~~~~~~~~G~~~~~v~~d~~~g~~-~~~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~ 194 (471) T protein:vir:10 116 LCVNAGNAGIAWLHVWKDASDNSF-RYACVDSKEVIPIYSKSLDKKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYR 194 (471) T ss_pred HHHHHhhCCeEEEEEEeeCCCCee-EEEEEcccceEEEEcCCCCCceEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEE Confidence 6788999999999999985 5664 57888999998887754321 22233444332111 11223344444432 Q ss_pred ccc--------------------------------CCC-----ccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_012530. 231 RNP--------------------------------RSD-----ILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGT 273 (559) Q Consensus 231 ~n~--------------------------------~~~-----~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~ 273 (559) ..- ... ......|.|-++.....++....+..-..+.+...+. T Consensus 195 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~ 274 (471) T protein:vir:10 195 HEKEKPLEELETFQAISLIDTMNGDRSSDNSFKHDFGLVPFIPFKNNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQE 274 (471) T ss_pred ecCCcccccccccccccccccccccccccccccCCCCceeEEEeccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhC Confidence 100 000 0011235566655555554444444444444454455 Q ss_pred CceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc------CCceeeeeccccchhHHHHHHHHHHHHHH Q lcl|NC_012530. 274 TKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT------AEDAKFVSMTQAEDMQFQSWLNYLINIIC 347 (559) Q Consensus 274 p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~------~g~~~~~~ls~~~D~qf~e~~~~~~~~Ia 347 (559) |-.++ .+.. ....++. ...+.. +++..+. +++++|.....+ +..+....+...+.|. T Consensus 275 ~~lv~--~g~~--~~~~~~~----~~~~~~--------~~~i~~~~~~~~~~~~~~~l~~~~~-~~~~~~~~~~l~~~I~ 337 (471) T protein:vir:10 275 VIFVL--TNYG--GQDKQEF----LEDLKR--------YKMIKMDNDGMGDQSGVTTIAIDIP-TEARNLILERTKKQIF 337 (471) T ss_pred ceeee--ecCC--ccccchh----HHHhhc--------CCeEEecCCCCccCccceEEeecCC-hHHHHHHHHHHHHHHH Confidence 54443 3211 1111221 111111 1112221 123444443333 3345667777888888 Q ss_pred HHhCCCHHHhccccccccccccccchhhhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhh Q lcl|NC_012530. 348 ALVAMDPAEIGMQNRGGATGNKSNSLNESNN---QNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTR 424 (559) Q Consensus 348 ~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~ 424 (559) ..-++|..-. ..-+..++ .....-++.. ....+..+..+|+-+++.|...+ .......+.+.|...... T Consensus 338 ~~s~tp~~~~--~~~gn~Sg-~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~-----~~~d~~~i~i~f~~~~p~ 409 (471) T protein:vir:10 338 ISGQGVNPET--DKLGNSSG-VALKFLYSLLELKAGNMETQFRSGYATLVKMILKHL-----GLSDKLKIKQTWTRNSIN 409 (471) T ss_pred HHhCCcCCCc--ccccCccH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----ccCCCceeEEEeCCCCCC Confidence 8888885311 11111100 0000111111 11122333444444444443332 222345678889988899 Q ss_pred hHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCC Q lcl|NC_012530. 425 SQQDKLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPT 504 (559) Q Consensus 425 d~~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (559) |..+.++.+... .|+|+.--+.++++. ++ |. ...+..+. .+.... ........+...+++.+ T Consensus 410 n~~e~~~~~~kl-~g~iS~et~~~~~p~--v~--D~------~~E~eri~------~E~~~~-~~~~~~~~~~~~~~e~~ 471 (471) T protein:vir:10 410 NDTEMAQVVSTL-ATITSRENVAKSNPI--VE--DW------QDELRLQK------AEQEGR-SEKLYDMEEVEHESEVE 471 (471) T ss_pred CHHHHHHHHHHH-hccCchHHHHHhCCC--CC--CH------HHHHHHHH------HHHHHH-HhcccccCCCCCccccC Confidence 999988877665 466788777776643 22 10 01111111 011000 00000001100111110 No 231 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=97.06 E-value=0.00019 Score=41.01 Aligned_cols=411 Identities=10% Similarity=0.027 Sum_probs=157.3 Q ss_pred Cc-chhhhc-------cccccCCcc---hHHHHHHH---HHHH--HHHhhhhcccccccccccccccccccccccccccc Q lcl|NC_012530. 1 MG-IFDRFR-------TKFYTDDPN---AFFKHIDS---KIAN--DTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPK 64 (559) Q Consensus 1 ~~-~~~~~~-------~~~~~~~~~---~~~~~~~~---~~~~--~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~ 64 (559) |. |+.--. +.++.++.. +.|..+-. .... +.+..=-.|++.....+... .. .. .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~-----~~---~~-~~ 71 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIKPQYETQEEMILRLITKHKENVEDITVGERYYNHQPDVLFNAPKR-----NV---KG-EI 71 (468) T ss_pred CccccCCcCceeehheeecccccccCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccc-----cc---cc-cc Confidence 21 100000 111111111 11111100 0111 11111122332221111000 00 00 00 Q ss_pred CCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCC Q lcl|NC_012530. 65 PSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVD 144 (559) Q Consensus 65 p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~ 144 (559) +..... .+ ...+....+|+..+.-+. |.+..+... ..+..+.+..++.+ T Consensus 72 ~~~~~~------~k--i~~n~~~~Iv~~~~~~l~-----------g~p~~~~~~--------d~~~~~~l~~~~~n---- 120 (468) T protein:vir:96 72 DPFKPD------WR--MYTNYHQNLVDQKVAYAV-----------ANPVTYGTE--------DEKSLKTIQEVLNH---- 120 (468) T ss_pred cccccc------cc--cccchHHHHHHHHHhhhc-----------cCCceeccC--------ChHHHHHHHHHHhc---- Confidence 000000 00 112344445544443332 111112111 11223344555432 Q ss_pred CCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCc-ccccccceEEEEEecCceeeeecc Q lcl|NC_012530. 145 YSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEH-GHRRTRGKIYRQYIDNKVRGSFTA 223 (559) Q Consensus 145 ~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~-g~~~~~~~~y~~~~~~~~~~~~~~ 223 (559) ++......+..+++.+|.+|..+.+|.+|++ .+..++|..+.++.+.. .......++|+..........+.. T Consensus 121 ------~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~v~~~~~~~~~~~~ir~~~~~~~~~~~~~~~ 193 (468) T protein:vir:96 121 ------KWDDKLVDILTAASNKGVEWIQPYVDEQGEF-KTFRVPAEQAIPIWTNKERDELKAFIRLYELDGGERVEYWTA 193 (468) T ss_pred ------CHHHHHHHHHHHHhhcCeEEEEEEEcCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEeC Confidence 2234455677889999999999888888875 47788999988776532 111112233333222222222333 Q ss_pred cceEEEec--------------------------c-----cCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_012530. 224 DEMGMFIR--------------------------N-----PRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGG 272 (559) Q Consensus 224 ~evi~~~~--------------------------n-----~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~ 272 (559) ..+.++.. | |.-.......|.|-++.....++....+..-..+.+...+ T Consensus 194 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~ 273 (468) T protein:vir:96 194 NDVTFYELKDGQLIPDYYQGEEHVQAHYYVGNKSMSWNRVPFIPFKNNPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEAT 273 (468) T ss_pred CeEEEEEEcCCceeecccccccccccceeeccccccCCcccEEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 33332221 0 0000011234666666555555554444444445555556 Q ss_pred CCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc-C--CceeeeeccccchhHHHHHHHHHHHHHHHH Q lcl|NC_012530. 273 TTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT-A--EDAKFVSMTQAEDMQFQSWLNYLINIICAL 349 (559) Q Consensus 273 ~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~-~--g~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~ 349 (559) .|-.+++ +.. +. ..+.+...++ .+++..+. . ++++|..... ....+....+...+.|... T Consensus 274 ~p~lv~~--g~~----~~--~~~~~~~~~~--------~~~~i~~~~d~~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~ 336 (468) T protein:vir:96 274 ELIYVLK--GYE----GE--DLEEFMYNLK--------YYKAINVDGDGSGGVDTIQIDV-PVQSAKEYLDMLRDYVIEF 336 (468) T ss_pred Cceeeee--cCC----cc--ccchhhhhhh--------cCceEEecCCCCCcceEEeecC-ChHHHHHHHHHHHHHHHHH Confidence 6654443 211 11 1111111111 12223332 2 2345544333 2344566677788888888 Q ss_pred hCCCHHHhccccccccccccccchh---hhhH---HHHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhh Q lcl|NC_012530. 350 VAMDPAEIGMQNRGGATGNKSNSLN---ESNN---QNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDT 423 (559) Q Consensus 350 fgVPp~~lg~~~~~~~~~~~~~~~~---~an~---~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~ 423 (559) -++|..- .. +. +++.++... ++.. ....+..+..+|+-+++.|...+. .......+.+.|+.... T Consensus 337 s~~p~~~--~~--~~-~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g----~~~d~~~i~i~f~~~~p 407 (468) T protein:vir:96 337 GQGVDFQ--QD--KF-GNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYK----LSIKVQDVEITFNFNVM 407 (468) T ss_pred hCccccc--cc--cc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC----CCcccceeeEEecCCCC Confidence 8888532 11 11 111111000 0001 112223334444444444333221 11223456778887777 Q ss_pred hhHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCC Q lcl|NC_012530. 424 RSQQDKLKSVQLELQTATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPP 503 (559) Q Consensus 424 ~d~~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (559) .|..+.++.+.. .|+|+.-.+.++++. ++ |. ...+..+.+ ++.... ......++....+| T Consensus 408 ~d~~e~a~~~~~--~g~iS~et~i~~l~~--v~--D~------~~E~~ri~~------E~~~~~--~~~~~~~~~~~~~~ 467 (468) T protein:vir:96 408 VNELEQSQIGVN--SQYLSKETVVTNHPW--VD--DP------VAEMERIDQ------EELALP--SIEEGLNGKENNEP 467 (468) T ss_pred cCHHHHHHHHHh--cCCCchHHHHHhCCC--CC--CH------HHHHHHHHH------HHHHHH--HHhhccCCCCCCCC Confidence 887777765543 366888777776643 11 10 011111111 111000 00011111112122 Q ss_pred C Q lcl|NC_012530. 504 T 504 (559) Q Consensus 504 ~ 504 (559) + T Consensus 468 ~ 468 (468) T protein:vir:96 468 T 468 (468) T ss_pred C Confidence 1 No 232 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=96.90 E-value=0.00027 Score=40.16 Aligned_cols=476 Identities=12% Similarity=0.093 Sum_probs=172.3 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhcccccccccccccccccc---ccccccccccCCCCCcccHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFS---TLEDTSIVPKPSPIAFGRITDVL 77 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~---~~~~~~~~~~p~~~~~~~~~~~~ 77 (559) |--|=-| .+.+... ..+.. -. |.....+.. .++.+.+...-......+..++. T Consensus 1 m~~lfgf---~i~~~~~------------------~~~~S--~v-pp~~~~~~~~i~~g~~g~~v~~~g~~~~~n~~eLI 56 (564) T protein:vir:10 1 MSQLFGF---LINEKEG------------------QKGQS--PV-PPNDEASVSTVAGGYFGTYVDTSGGQNSRNEYELI 56 (564) T ss_pred Ccchhcc---eeeeecc------------------CCCCC--cc-cCCcCCChhhhhccccceeeecccccchhhHHHHH Confidence 3222222 2211100 00000 00 100011111 11111110000001112233443 Q ss_pred ---HHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHH Q lcl|NC_012530. 78 ---RQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTS 154 (559) Q Consensus 78 ---~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~ 154 (559) +..+.+|.|..+|.-|.+.+ +..+.+....+|.+.+. +.....++++..--..+.+.. +... T Consensus 57 ~~YR~ma~~pEVd~Av~eIVnea------Iv~d~~~~pV~vdL~~~-~~s~siK~kI~eEF~~Il~ll-~F~~------- 121 (564) T protein:vir:10 57 RRYRDMSLHPEVDSAIDEIVNEF------VVNDGDDKPVEVDLQNL-EIGSGVKKKIRDEFNRILRMM-NFNV------- 121 (564) T ss_pred HHHHHHhhccchhhHHHHhhcce------eEecCCCceEEEEeccc-CcchHHHHHHHHHHHHHHHHh-ccch------- Confidence 44567899999999888874 23445555566666333 344454444444333333331 1111 Q ss_pred HHHHHHHHHHHcCCcceEEEECC-C--CcEEEEEEecCceEEEEec------Cccccccc----------ceEEEEEec- Q lcl|NC_012530. 155 FLRKLVRDTYTYDQVNYENTYDS-N--GRLSHTRMVDPTTIYFAND------EHGHRRTR----------GKIYRQYID- 214 (559) Q Consensus 155 f~~~~v~d~ll~Gna~~~i~rd~-~--G~~~~L~~l~p~~V~~~~~------~~g~~~~~----------~~~y~~~~~- 214 (559) --..+++.+++.|..|..++-|. + .-+.+|.+|||..|+.++. ..+....+ ...|+.+.. T Consensus 122 ~~~e~fR~WYVDgRi~fHkiid~~~pk~GI~eLr~lDPr~i~~vr~i~~~~~~~~~~v~k~~~~~~~y~~~~Eyy~Ynp~ 201 (564) T protein:vir:10 122 NAHEIIRNWYVDGRSHYHKVIDLDNPKKGILELRYIDSLKIRKVRQKLKDVDPNRKEIEKGTALQYDYGDFIEYYIYNPK 201 (564) T ss_pred hhhHHHhhhhhcceEEEEEEeeCCChhhhhhhhhhhcccceeeeeeeccccccccceeeeeeeeeccccccccceeeccc Confidence 12234556678899999887662 2 2399999999998775541 11111001 112322221 Q ss_pred ---Cc-------------eeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEE Q lcl|NC_012530. 215 ---NK-------------VRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGIL 278 (559) Q Consensus 215 ---~~-------------~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil 278 (559) +. ....++.+.|.|...... +. +++.=+|-|..|...+....-.+....=|==.-|.-+-|. T Consensus 202 ~~~g~~~~~~~~~~~~~~~~ikI~~daI~y~hSGL~-d~-~~~~i~gyLhkAIKp~NQLkmlEDAlVIYRitRAPeRRvF 279 (564) T protein:vir:10 202 GFAGNIPMVTGSMDWSNQEGIKIASDAIAQSTSGLM-DL-NKKMTLSFLHKAIKSLNQLRMIEDSLVIYRLSRAPERRIF 279 (564) T ss_pred cccCcccccccccccccccceeechhhcceecccce-eC-CCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEE Confidence 00 012344445554432111 11 1222245577777777666666655544332333334455 Q ss_pred EecCccCCccCCHHHHHHHHHHHHHHh-----cC-ccc-cccccccc--------C-CceeeeeccccchhHHHHHHHHH Q lcl|NC_012530. 279 LVKPSPSVTNTSMRALEDFKRHWTATS-----SG-ING-AYRIPMIT--------A-EDAKFVSMTQAEDMQFQSWLNYL 342 (559) Q Consensus 279 ~~~~~~~~~~~~~e~~~~l~~~~~~~~-----~G-~~n-ag~~~vl~--------~-g~~~~~~ls~~~D~qf~e~~~~~ 342 (559) -++-+..+..-.++-+..+-..+++.+ .| ..+ ..-..+++ + -+.++..|.-...+-=++-..|. T Consensus 280 YIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGevrddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~DV~YF 359 (564) T protein:vir:10 280 YIDVGNLPKVKAEQYLRDVMSRYRNKLVYDGQTGEIRDDKKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELKDVEYF 359 (564) T ss_pred EEecCCCCchhHHHHHHHHHHhcCceEEEeccCceecccchhhhhHhhhcccccCCCcccceeeccccCCcchHHHHHHH Confidence 444433332211111122222221111 01 000 00111221 1 12334333222334345567788 Q ss_pred HHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhc----c-----ccccC-- Q lcl|NC_012530. 343 INIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGI----I-----RQILG-- 411 (559) Q Consensus 343 ~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L----~-----~~~~~-- 411 (559) .+.+.++++||.+.|...+. +++-+.++.+++ ++. -....|.-+..++...|...| + ++.++ T Consensus 360 ~kKLY~aLnVP~SRl~~e~~-~f~~Gr~~EItR---DEi---KF~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~ 432 (564) T protein:vir:10 360 KKKLYNSLNLPPSRLTDDNK-AFNLGKSTEILR---DEL---KFTKFIGRLRKRFAQLFHDILKTQLILKGIITPEDWDD 432 (564) T ss_pred HHHHHHHhCCCcccccCCCc-eeecccccchhH---HHH---HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHH Confidence 88999999999999976422 233222222222 111 112234445555544444333 2 22222 Q ss_pred --ccceeeecchhhh-------hHHHHHHHHHHHH--cC-CCCHHHHHH-HhCCCCCC--CCCEeeccceeccccccccc Q lcl|NC_012530. 412 --DNYMLEFVGGDTR-------SQQDKLKSVQLEL--QT-ATTVNDYRE-KQGLPKIA--GGDIILSAVYIQRLGQQEQI 476 (559) Q Consensus 412 --~~~~~~f~~l~~~-------d~~~~~~~~~~~~--~~-~~T~NE~R~-~~gl~pi~--gGD~~~~~~~~~~l~~~~~~ 476 (559) ..+.|+|.....- -...|+.++..+- -| .++.+=||+ .|.+.-.+ .-| .++.+. T Consensus 433 i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~-----------kqI~~E 501 (564) T protein:vir:10 433 MEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFVGKYFSTEYIRRKILMQTENEFKEID-----------KQMKSD 501 (564) T ss_pred HhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHH-----------HHHHHH Confidence 3466676543322 2333444443321 11 234444433 33332110 000 000000 Q ss_pred cccccccccccccc--ccccCCCCCCCC--CCCCccccccchhccccccccccccccccccccccc Q lcl|NC_012530. 477 KQNEFQRQQTRLTQ--LESALQNPSGTP--PTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGK 538 (559) Q Consensus 477 ~~~~~~~~~~~~~~--~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 538 (559) ....--..+...+. ..+..+.+-.|+ ...++...+.+.+ +..+..+.+.+..+++.+.| T Consensus 502 ~k~~~~~~P~e~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~---~~~~a~~~~~~~~~~~~~~~ 564 (564) T protein:vir:10 502 IESGLAIDPIQVNMLDDMEKQNQAFAPELQAAQDDLAAEREIK---KLNSAPKPPPSQQSKSQSNK 564 (564) T ss_pred hhcCCCCCchhhhcCCCccCCCCcCCcchhhhccccccccChh---hhccCCCCCCCCCCcCcCCC Confidence 00000000000000 000000000000 0000000000000 00000000111111111111 No 233 >protein:vir:101418 Length: 569 # NCBI annotation: Prt # Family: family:all:9458 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006480;genbank:gi:46401636;genbank:GeneID:2777482 Probab=96.84 E-value=0.0003 Score=39.87 Aligned_cols=460 Identities=11% Similarity=0.059 Sum_probs=179.8 Q ss_pred HHHHHHHHHHHhhhhcccccccccc---ccccccc--------------------c---------------ccccccccc Q lcl|NC_012530. 22 HIDSKIANDTASKALNGVDRAYTEP---VDGNLMF--------------------S---------------TLEDTSIVP 63 (559) Q Consensus 22 ~~~~~~~~~~~~~~~~gr~~a~~~~---~~~~~~~--------------------~---------------~~~~~~~~~ 63 (559) +..++.......++..|-.+.-.++ .+....+ . .+++.-+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~p~~~~~~~~~~~~~~t~~~D~~~~g~~~~~~~ 80 (569) T protein:vir:10 1 MADNKITLSSVRKALAGVFKDNGERDNILLSALAVHGGSGYLFSRAGAPVQLSGFLGGKPGDSGMAGDGLVDGSRFIFDE 80 (569) T ss_pred CCcchhHHHHHHHHHhhhhhcCCccchhhhhhheeecCcceEEeecCcchhhhhhhccCccccchhhhhHHHHHHHHhhh Confidence 3333333333344433322111110 0000000 0 000000011 Q ss_pred cCCCCCcccHHHHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccc-cC-hhHHHHHHHHHHHHHhc Q lcl|NC_012530. 64 KPSPIAFGRITDVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDK-PT-KEQQKKIDYAERYIERM 141 (559) Q Consensus 64 ~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~-~~-~~~~~~~~~~~~~L~~~ 141 (559) .-.+++|..+.-.+...+..|++.++.++....--.. ....|.-+-|.+...-. .. +..++..+++.+=|... T Consensus 81 ~~~pr~R~qiY~~~eeM~~~p~Ia~AlniHVtaALgg-----de~TGd~vfI~p~~~~~~a~~daakai~~el~~dl~~~ 155 (569) T protein:vir:10 81 VQLPEDRLQRYPLLEEMAVYSTIATALNIHITHALSF-----DKKTGQTFSIVPVHNGNDSDYDAAQALCGELMNDIGRT 155 (569) T ss_pred ccCchhHHHHHHHHHHHhcCchhhhhhhhhhheeecc-----cccccceEEEEeecCCCCCcchHHHHHHHHHHHHHHHH Confidence 1112444444444445566788888887776543211 12235555665542211 11 11112223333211111 Q ss_pred CCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEE---EEecCceEEEEecCcccc---------------- Q lcl|NC_012530. 142 GVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHT---RMVDPTTIYFANDEHGHR---------------- 202 (559) Q Consensus 142 ~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L---~~l~p~~V~~~~~~~g~~---------------- 202 (559) +-..+..+..+...+|.+|+.|.-+..--++.| ++..|+-|++..-..-.. T Consensus 156 ----------iNr~~~~lA~~~~aFGdsYaRiY~~~~~GV~dl~~s~yt~PsfIqpFE~g~~tvGF~~~~~~~~~~ti~~ 225 (569) T protein:vir:10 156 ----------INKEVAGWAFIMSVFGVAYVRPYAKEGIGITSFECSYYTLPSFIKEFEVSGNLAGFSGDYLKDASGKMVF 225 (569) T ss_pred ----------HHHHhhHHHHHHHhhhhhheeeeccCCceeEEEEecccccccccchhhhcCceEEeecccCCccccceee Confidence 111344566677899999998876543334444 445566665532111000 Q ss_pred -------cccceEEEEEe-----cCce-eeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012530. 203 -------RTRGKIYRQYI-----DNKV-RGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFT 269 (559) Q Consensus 203 -------~~~~~~y~~~~-----~~~~-~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ 269 (559) ..+-+++.... +.+. ...+..++.=| +|. ....||-|-|+.+.+.......+.....+.=- T Consensus 226 l~p~qm~rmKmPrm~~i~q~~~v~~g~~~~~L~~d~~~~---~Pi---~psn~GgSFL~~ae~pf~~l~~Al~sL~~qri 299 (569) T protein:vir:10 226 ADPWAIIPMKIPYWRPKSNLMPVHTGHKAYSLLDNPEER---TPI---ETQNYGTSLLEYAYEPYMNLRSAIRSLKATRF 299 (569) T ss_pred echhhhhhhcccceeeccccchhhhhhhheeeccccccc---ccc---cchhhhhHHHHHHHhHHHHHHHHHHhccchhh Confidence 01111221100 0000 01111111111 121 12347889888887665544433222211111 Q ss_pred hcCCCceEEEecCccCCccCCHHH-----------HHHHHHHHHHHhcCcccc-----cccccccCCc--eeeeec-ccc Q lcl|NC_012530. 270 HGGTTKGILLVKPSPSVTNTSMRA-----------LEDFKRHWTATSSGINGA-----YRIPMITAED--AKFVSM-TQA 330 (559) Q Consensus 270 ng~~p~gil~~~~~~~~~~~~~e~-----------~~~l~~~~~~~~~G~~na-----g~~~vl~~g~--~~~~~l-s~~ 330 (559) +..+-.-+|.+.. ..+++.+ +++-++.++.+..|.+.- |-+|+..++. +.+ .+ +.+ T Consensus 300 ~dSv~~~~Itlnm----~gM~p~qr~~y~r~lt~~LKr~~d~ie~a~~gg~~~~~~~~H~LPv~gekq~~~tv-Dt~~~~ 374 (569) T protein:vir:10 300 NASKIDRIIGLAM----NSLDPVKAADYSRTITQTLKRAADLMERRARGANNMPTVTNTLLPIMGDGKGQMTI-DTQTIQ 374 (569) T ss_pred HHHHHhHHhhccc----cCCCHHHHhHHHHHHHHHHHHHHHHHHHHhccCccccccceeeeeeecCccccccc-cccccc Confidence 1112223343322 1234443 344566667777665442 3345554432 111 12 235 Q ss_pred chhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHH-HHHhhccc-- Q lcl|NC_012530. 331 EDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAK-NLTNGIIR-- 407 (559) Q Consensus 331 ~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~-~ln~~L~~-- 407 (559) .+.-=+|..-+..+..|.++|+.+.|||+.+--+..-..++-.--+-....+..++++.+.-++.++-+ .+..|.-. T Consensus 375 A~~~gIEdvM~~~R~LagaLGlD~SMlGwAD~LsGGLGeGG~frtSaQaa~RS~~iRqa~~e~in~iidiH~~fKYgevf 454 (569) T protein:vir:10 375 ADINGIEDILTYMRQLAAALGLDYTLLGWADQMSGGLGEGGFLRTAIQAAMRASWIQQGVEEFIQRAIDIHLAFKYGKVY 454 (569) T ss_pred cCcccHHHHHHHHHHHHhhhccchhHhhHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCccc Confidence 666678889999999999999999999987765443333433333333444557778777777766433 33333211 Q ss_pred cccCccceeeecchhhhhH-------HHHHHHHHHHHc--------CCCCHHHHHHHhCCCCCCCCCEeeccceeccccc Q lcl|NC_012530. 408 QILGDNYMLEFVGGDTRSQ-------QDKLKSVQLELQ--------TATTVNDYREKQGLPKIAGGDIILSAVYIQRLGQ 472 (559) Q Consensus 408 ~~~~~~~~~~f~~l~~~d~-------~~~~~~~~~~~~--------~~~T~NE~R~~~gl~pi~gGD~~~~~~~~~~l~~ 472 (559) +.....|.++|.+....-+ ..|+.....+++ +.+-.||--..+=+..+=|.|+=+...-...+ T Consensus 455 ~~~drP~~V~F~s~~tAl~~E~~~n~~~raN~a~i~~Q~la~l~e~n~Lg~de~~m~y~l~d~~~~De~~~e~l~ae~-- 532 (569) T protein:vir:10 455 PEGDRPYKIEFHSVNTALQQEHNDNRDSQANYATIVTQILDAVSNNSVLANSDAFKRYLFSDVLEIDEKISEALVNEL-- 532 (569) T ss_pred CCCCcceEEEeccchHHHHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccHHHHHHHHHHHhhcchhHHHHHHhhc-- Confidence 2335679999986543222 223333222221 11222322111101111111110000000000 Q ss_pred ccccccccccccccccccccccCCCCCCCCCCCCccccccchhcccccccccccccc Q lcl|NC_012530. 473 QEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSG 529 (559) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 529 (559) ....+ +++......-.. |+ ++-.. ..+.+ +|+-+.+. T Consensus 533 -~akp~-DEe~~~~~~~~~-----------~~-~~~~~--~~~~~----~~~~~~~~ 569 (569) T protein:vir:10 533 -KAKSE-DDDHLMDSIIKT-----------PP-QELAQ--ILESV----FKEGNDND 569 (569) T ss_pred -CCCcc-hhHHHHHHHhcC-----------Ch-HHHHH--HHHHH----hhccCCCC Confidence 00000 000000000000 00 00000 00000 00000000 No 234 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=96.73 E-value=0.00038 Score=39.33 Aligned_cols=466 Identities=12% Similarity=0.116 Sum_probs=174.4 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhcccccccccccccccccccc-ccccc-cccCCCCCcccHHHHH- Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTL-EDTSI-VPKPSPIAFGRITDVL- 77 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~-~~~~~-~~~p~~~~~~~~~~~~- 77 (559) |--|=-| .|.+- .++ .++. .+..+.+ ..+......+ +.+.+ ..-+. ..+..++. T Consensus 1 m~~lfg~---~i~~~--------~~~---~~~~--s~~~~~~----~dg~~~i~~~~~~~~~~~~e~~---~~~~~eLI~ 57 (533) T protein:vir:10 1 MSQLFGF---SLERA--------KKA---PKGP--SFVQKDN----LDGSQPVSGGGYYGYTVDFDGQ---VRNEYQLIS 57 (533) T ss_pred Ccccccc---ccccc--------ccc---ccCC--CCCCCCc----ccccceeecccccceeeecccc---cchHHHHHH Confidence 2222111 11110 000 0000 0000111 1111111111 11111 11111 12233444 Q ss_pred --HHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHH Q lcl|NC_012530. 78 --RQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSF 155 (559) Q Consensus 78 --~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f 155 (559) +..+.+|.|..+|.-|.+.+. ..+.+.....|.+.+.. .....++++..--..+.+... ... - T Consensus 58 ~YR~ma~~pEvd~Av~eIVneai------v~d~~~~pV~i~Ld~~~-~s~~iK~kI~eEF~~Il~ll~-F~~-------~ 122 (533) T protein:vir:10 58 RYREMVLQPECDSAVDDIVNETI------CGNFDDVPVSVELSNLK-VSDKIKKLIREEFGEILRLLD-FEN-------R 122 (533) T ss_pred HHHHHhhccchhhHHHHhhccee------eecCCCceEEEEecccc-cchHHHHHHHHHHHHHHHHhc-cch-------h Confidence 445678999999998888753 34444445555554432 333444444333333333211 111 1 Q ss_pred HHHHHHHHHHcCCcceEEEECC---CCcEEEEEEecCceEEEEec-----Ccccc--------cccceEEEEEecC---- Q lcl|NC_012530. 156 LRKLVRDTYTYDQVNYENTYDS---NGRLSHTRMVDPTTIYFAND-----EHGHR--------RTRGKIYRQYIDN---- 215 (559) Q Consensus 156 ~~~~v~d~ll~Gna~~~i~rd~---~G~~~~L~~l~p~~V~~~~~-----~~g~~--------~~~~~~y~~~~~~---- 215 (559) -..+++.+++.|..|..++-|. ..-+.+|..|||.+|+.++- .++.. .....-|+.+... T Consensus 123 ~~e~fR~WYVDgRi~fHkiid~~~pk~GI~ELr~lDPr~i~~vr~i~~~~~~~~~~~~~~~~v~~~~~eyf~Ynp~g~~~ 202 (533) T protein:vir:10 123 SYEIFRRWYVDGRLFYHKVIDPDNPQGGLIELRYIDPRKIRKINETEQKRPEQLRGLPLNQQLSPKSAEYFLYDPKGLKN 202 (533) T ss_pred hhHHHhhhhhcceEEEEEEecCCCccccceeeeeccccceeeeeeeeccCCCccceeecchhhhccceeeeeeccccccc Confidence 2233555678899999887763 33599999999999987432 12211 1111112222111 Q ss_pred --ceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHH Q lcl|NC_012530. 216 --KVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRA 293 (559) Q Consensus 216 --~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~ 293 (559) .....++. +.|++.+.-.-+. ++++=+|-|..|...+......+...-=|==.-|.-+-|.-++-+..+..-.++- T Consensus 203 ~~~~~vkI~~-dAI~y~hSGl~d~-~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqY 280 (533) T protein:vir:10 203 STTQGLKIAP-DSICYVHSGIMDL-NKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQY 280 (533) T ss_pred cCCCceecch-hheeeeeccceeC-CCCceeccchHhHHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHH Confidence 11112333 4444443211121 2222246678887777776666665544433333344455454433332211111 Q ss_pred HHHHHHHHHHHh-----cC-ccc-cccccccc--------CC-ceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHHh Q lcl|NC_012530. 294 LEDFKRHWTATS-----SG-ING-AYRIPMIT--------AE-DAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAEI 357 (559) Q Consensus 294 ~~~l~~~~~~~~-----~G-~~n-ag~~~vl~--------~g-~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~l 357 (559) +..+-..+++.+ .| ..+ ..-..+++ +| +.++..|.-...+--++-..|..+.+.++++||.+.| T Consensus 281 lr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~DV~YF~kKLY~aLnVP~SRl 360 (533) T protein:vir:10 281 LREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEITTLPGGQNLGELEDVKYFQKKLYKSLNVPGSRL 360 (533) T ss_pred HHHHHHhccceEEEeccCceecccchhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccc Confidence 122222222111 01 000 00111221 11 2333333222334456677788899999999999999 Q ss_pred ccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhc----c-----ccccC----ccceeeecchhhh Q lcl|NC_012530. 358 GMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGI----I-----RQILG----DNYMLEFVGGDTR 424 (559) Q Consensus 358 g~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L----~-----~~~~~----~~~~~~f~~l~~~ 424 (559) +-. ++++-..++..++ ++. -....|.-+..++...|...| + ++.++ ..+.|+|.....- T Consensus 361 ~~e--~~f~~Gr~~EItR---DEi---KF~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f 432 (533) T protein:vir:10 361 ETE--TTFNVGRAAEITR---DEV---KFQKFVARLRKRFSELFTDLLKTQLVLKGVISIEEWDQMKEHIQYDYIADNYF 432 (533) T ss_pred CCC--CcccccccchhhH---HHH---HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchH Confidence 643 3333322222221 111 112234445555544444333 2 22222 3467777644332 Q ss_pred h-------HHHHHHHHHHHH--cC-CCCHHHHHH-HhCCCCCCCCCEeeccceecccccccccccccccccccccccccc Q lcl|NC_012530. 425 S-------QQDKLKSVQLEL--QT-ATTVNDYRE-KQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLES 493 (559) Q Consensus 425 d-------~~~~~~~~~~~~--~~-~~T~NE~R~-~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 493 (559) . ...|+.++..+- -| .++.+=+|+ .|.+.-.+ +. ....+.+.+........+.. T Consensus 433 ~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDee-------------i~--~~~kqI~~E~k~~~~~~p~~ 497 (533) T protein:vir:10 433 AELKEIEIRNERMNQVATMDPFVGKYFSVEYMRRQVLKQTDVE-------------MK--EIDKQIESEMESGIIADPAA 497 (533) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHH-------------HH--HHHHHHHHHHhCCCCCCCcc Confidence 2 333444443331 12 235555543 44442211 00 00000000000000000000 Q ss_pred cCCCCCCCCCCCCccccccchhccccccccccccccccccccccccccccccc Q lcl|NC_012530. 494 ALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQLKNKK 546 (559) Q Consensus 494 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~k~~~ 546 (559) +.++.....+++ ..+....+.-|.+ +.-+-..|.|- T Consensus 498 ------~~~~~~~~~~~~-----~~~~~~~~~~~~~------~~~~~~~~~~~ 533 (533) T protein:vir:10 498 ------EMDPAMAAGDPD-----AGGAPAEEVAPEG------PDPSDERKAEF 533 (533) T ss_pred ------hhhHHhcCCCCC-----cCCcccccCCCCC------CCcchhhccCC Confidence 000000000000 0000000000000 00000000000 No 235 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=96.13 E-value=0.00097 Score=37.11 Aligned_cols=433 Identities=13% Similarity=0.087 Sum_probs=177.2 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHH--HHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIA--NDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLR 78 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~ 78 (559) |-. |++|.-+....++.+|...+- +...+.- -.|.-|.....+. .. ..... . T Consensus 1 ~~~----~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~-----~~y~lP~~~~~~~----~~---------~~~~~----~ 54 (522) T protein:vir:94 1 MAE----REGFAAEGAKAVYDRLKNGRQPYETRAQNC-----AAVTIPSLFPKES----DN---------SSTEY----T 54 (522) T ss_pred Ccc----cchhhHHHHHHHHHHHHHHhhHHHHHHHHH-----HHHhcccccCCCC----Cc---------ccccc----c Confidence 655 566666666666666654421 1111100 1133332211100 00 00000 0 Q ss_pred HHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeeccccc--c------cChhHHHHHHHHHHHHHhcCCCCCCChh Q lcl|NC_012530. 79 QYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGD--K------PTKEQQKKIDYAERYIERMGVDYSPIRD 150 (559) Q Consensus 79 ~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~--~------~~~~~~~~~~~~~~~L~~~~p~~~~~~~ 150 (559) ...+ ++--.|++.+|..+..... ..+-.|++...+.- . .....++....+++.+.... .+. T Consensus 55 ~~~d-st~~~a~~~Las~l~~~lt-----P~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~-----~~s 123 (522) T protein:vir:94 55 TPWQ-AVGARCLNNLAAKLMLALF-----PQSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYM-----ETN 123 (522) T ss_pred cccc-ccHHHHHHHHHHHHHhhcC-----CCCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHH-----Hhc Confidence 1122 3333577777777653211 12334444433211 0 01112222233333333211 124 Q ss_pred hHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCccccccc------------------------- Q lcl|NC_012530. 151 DFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTR------------------------- 205 (559) Q Consensus 151 ~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~------------------------- 205 (559) +|+.-+..++.|+.++||++.++..+..|.+..+..++-.++.+..|..|.+-.- T Consensus 124 nf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~~~~~pl~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~ 203 (522) T protein:vir:94 124 SFRVPLFEALKQLIVSGNCLLYIPEPEQGTYSPMRMYRLVSYVVQRDAFGNILQIVTIDKVAFSALPEDVKSQLNADDYE 203 (522) T ss_pred CcHHHHHHHHHHHHhhCcEeEeeeccCCCceeeEEEEEcceEEEeeCCCcCeEEEeeeeeccHHhcchHHHHHHhcccCC Confidence 5666677778899999999999888877776555555556666777776644100 Q ss_pred ----ceEE------------EEEecCceeee------ecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_012530. 206 ----GKIY------------RQYIDNKVRGS------FTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELF 263 (559) Q Consensus 206 ----~~~y------------~~~~~~~~~~~------~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~ 263 (559) ...| ++..++..... |..-=.+..+++.. ....||.||.+-+...+.......+. T Consensus 204 p~~~v~v~~~v~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~P~~~~Rw~~~---~ge~YGrgp~~~~l~D~k~L~~l~~~ 280 (522) T protein:vir:94 204 PDTELEVYTHIYRQDDEYLRYEEVEGIEVTGTDGSYPLTACPYIPVRMVRL---DGEDYGRSYCEEYLGDLNSLETITEA 280 (522) T ss_pred ccceEEEEEEEEeeCCceeEEeeccCceecccCCCCccccCCceeeeeeec---CCCccccchHHHHHHHHHHHHHHHHH Confidence 0000 00001111000 00001222333222 23469999999999999998888888 Q ss_pred HHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc--CCceeeeeccccchhHH-HHHHH Q lcl|NC_012530. 264 NDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT--AEDAKFVSMTQAEDMQF-QSWLN 340 (559) Q Consensus 264 ~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~--~g~~~~~~ls~~~D~qf-~e~~~ 340 (559) ....-.-...|..++.-++. .... ....|..+ .++. .+++...++..+.|.+. .+..+ T Consensus 281 ~l~~~~~~~~p~~~v~~~g~-----~~~~----------~~~~~~~g----~~v~g~~~~v~~~~~~~~~~~~~~~~~i~ 341 (522) T protein:vir:94 281 ITKMAKVASKVVGLVNPNGI-----TQPR----------RLNKAATG----EFVAGRVEDINFLQLTKGQDFTIAKSVAD 341 (522) T ss_pred HHHHHHHHhCCceeeccccc-----ccch----------heeccCCc----eeecCCcccceeeecccccchhHHHHHHH Confidence 88887777788755532211 1111 11122221 2332 23456666666667664 45677 Q ss_pred HHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHH-------------hhccc Q lcl|NC_012530. 341 YLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLT-------------NGIIR 407 (559) Q Consensus 341 ~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln-------------~~L~~ 407 (559) .....|.++|-+. .++..+...-+. .+ +. .+..-....|.|++.+++++|- ..+|+ T Consensus 342 ~~~~rI~~af~~~--~~~~~~~~r~TA------tE--V~-~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP 410 (522) T protein:vir:94 342 AIEQRLGWAFLLN--SAVQRNAERVTA------EE--IR-YVAGELEATLGGVYSVQSQELQLPIVRVLMNQLQSAGMIP 410 (522) T ss_pred HHHHHHHHHHhhh--hhccCCCccccH------HH--HH-HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCC Confidence 7778899999665 233222221111 11 11 1112223345555555555443 23344 Q ss_pred cccCccceeeecchhhhh-----HHHHHHHHHHHHc---CC----CCH----HHHHHHhCCCCCCCCCEeeccceecccc Q lcl|NC_012530. 408 QILGDNYMLEFVGGDTRS-----QQDKLKSVQLELQ---TA----TTV----NDYREKQGLPKIAGGDIILSAVYIQRLG 471 (559) Q Consensus 408 ~~~~~~~~~~f~~l~~~d-----~~~~~~~~~~~~~---~~----~T~----NE~R~~~gl~pi~gGD~~~~~~~~~~l~ 471 (559) +.....+.+++...+..- ......++..... .. +.. +++.+.+|.+|.. .+..+- ... T Consensus 411 ~~p~~~v~v~~~s~La~~qr~~~~~~l~~~~~~ia~l~P~~~~~~id~d~~~~~~a~~~Gv~~~~---ivr~~e---e~~ 484 (522) T protein:vir:94 411 DLPKEAVEPTVSTGLEALGRGQDLEKLTQAVNMMTGLQPLSQDPDINLPTLKLRLLNALGIDTAG---LLLTQD---EKI 484 (522) T ss_pred CCCcccEEeeEecHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHcCCChhh---ccCCHH---HHH Confidence 333344677776443221 1111112221110 00 111 2233455554321 110000 000 Q ss_pred cccccccccccccccccccccccCCCCCCCCCCCCccccccchhcccccccccccccccccccc Q lcl|NC_012530. 472 QQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQG 535 (559) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 535 (559) ++.+..+...+.... .... ..+....-....+++.-+. T Consensus 485 ~~~~q~~~~~~~~~~-~~~~-------------------------~~~~~a~~~~~~~~~~~~~ 522 (522) T protein:vir:94 485 QRMAEQSSQQAVVQG-ASAA-------------------------GANMGAAVGQGAGEDMAQA 522 (522) T ss_pred HHHHHHHHHHHHHHH-HHHH-------------------------HHHhhhhhhcccchhhhcC Confidence 000000000000000 0000 0000000000000000000 No 236 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=96.03 E-value=0.0011 Score=36.80 Aligned_cols=449 Identities=13% Similarity=0.059 Sum_probs=174.2 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |-= --+.++..+++..++.+|...+-.-...-... -.|.-|.+...+. .. . .... ... T Consensus 1 m~~--~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~---~~~~lP~~~~~~~----~~---~------~~~~----~~~ 58 (535) T protein:vir:33 1 MAD--SKRTGLGEDGAKATYDRLTNDRRAYETRAENC---AQYTIPSLFPKES----DN---E------STDY----TTP 58 (535) T ss_pred CCh--hhhhccChhHHHHHHHHHHHHhhHHHHHHHHH---HHHhcccccCCCC----Cc---c------cccc----ccc Confidence 321 12455666666666666665432211100000 0133332211100 00 0 0000 011 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeeccccc--ccC------hhHHHHHHHHHHHHHhcCCCCCCChhhH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGD--KPT------KEQQKKIDYAERYIERMGVDYSPIRDDF 152 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~--~~~------~~~~~~~~~~~~~L~~~~p~~~~~~~~~ 152 (559) .+ ++--.|++.+|..+.....+ ..-.|++...+.. +.. .+..+-...+++.+.... .+.+| T Consensus 59 ~d-st~~~a~~~Laa~l~~~ltP-----~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~-----~~snf 127 (535) T protein:vir:33 59 WQ-AVGARGLNNLASKLMLALFP-----MQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYI-----ESNSY 127 (535) T ss_pred cc-ccHHHHHHHHHHHHHHhhcC-----CCcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHH-----HhcCc Confidence 22 33345777777776543221 1224555443321 111 111112222333332211 12456 Q ss_pred HHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccc---------------------------- Q lcl|NC_012530. 153 TSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRT---------------------------- 204 (559) Q Consensus 153 ~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~---------------------------- 204 (559) +.-+..++.|++++||+..++..+. |..+.+..++-.++.+..|..|.+-. T Consensus 128 ~~~~~~~~~~L~~~G~a~l~~~~~~-~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~ 206 (535) T protein:vir:33 128 RVTLFECLKQLIVAGNALLYLPEPE-GSYNPMKLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKSGGEKK 206 (535) T ss_pred HHHHHHHHHHHHhhCceeEEeecCC-CCceeeEEEEcCeeEEeeCCCCCeeEEEeeEeecHHHHHHHhhhhhcccccccc Confidence 6666777889999999999988764 33334444444566666776664310 Q ss_pred --cce-EEE--------------EEecCceeee------ecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHH Q lcl|NC_012530. 205 --RGK-IYR--------------QYIDNKVRGS------FTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTE 261 (559) Q Consensus 205 --~~~-~y~--------------~~~~~~~~~~------~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~ 261 (559) ... .|. +..++..... |..-=.+..+++.. ....||.||.+-+...+....... T Consensus 207 ~~~~~~v~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~i~~Rw~~~---~ge~YGrgp~~~~l~D~k~L~~l~ 283 (535) T protein:vir:33 207 MDEMVDVYTHVYLDEESGDYLKYEEVEDVEIDGSDATYPTDAMPYIPVRMVRI---DGESYGRSYCEEYLGDLRSLENLQ 283 (535) T ss_pred cccCCeEEEEEEeeCCCCcEEEEEEEeCccccccccccccccCCceeeeeeec---CCCccccchHHHHHHHHHHHHHHH Confidence 000 010 0011111000 00000122222222 234699999999999998888888 Q ss_pred HHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc--cCCceeeeeccccchhHH-HHH Q lcl|NC_012530. 262 LFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI--TAEDAKFVSMTQAEDMQF-QSW 338 (559) Q Consensus 262 ~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl--~~g~~~~~~ls~~~D~qf-~e~ 338 (559) +.....-.-...|..++.-++. .... +...|..+ .++ ..+++...++....|.+. .+. T Consensus 284 ~~~l~~~~~~~~p~~lv~~~g~-----~~~~----------~~~~~~~g----~~v~g~~~~v~~~~~~~~~~~~~~~~~ 344 (535) T protein:vir:33 284 EAIVKMSMISAKVIGLVNPAGI-----TQPR----------RLTKAQTG----DFVPGRREDIDFLQLEKQADFTVAKAV 344 (535) T ss_pred HHHHHHHHHHhcCceeeccccc-----cchh----------hcccCCce----eeecCCcccceeeecccccchhHHHHH Confidence 8887777777777766532211 1111 11222222 222 234566666666567664 455 Q ss_pred HHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHH-------HHHHHHHHhhHHHHHHHHHH-Hhhcccccc Q lcl|NC_012530. 339 LNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNK-------IDASKSKGLMPLLDMIAKNL-TNGIIRQIL 410 (559) Q Consensus 339 ~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~-------~~~~~~~~l~P~~~~ie~~l-n~~L~~~~~ 410 (559) .+.....|.++|-+. .+...+...-+..+ .....++. ...+-...|.|++.+.-..+ ...+|++.. T Consensus 345 i~~~~~~I~~af~~~--~~~~~~~~r~TAtE----V~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p 418 (535) T protein:vir:33 345 SDQIEARLSYAFMLN--SAVQRTGERVTAEE----IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELP 418 (535) T ss_pred HHHHHHHHHHHHhhh--hcccCCCccccHHH----HHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCC Confidence 677788898998544 22222222111111 11111111 11233344445444433333 223444433 Q ss_pred CccceeeecchhhhhHH-----HHHHHHHHHHc------C-CCCH----HHHHHHhCCCCCCCCCEeeccceeccccccc Q lcl|NC_012530. 411 GDNYMLEFVGGDTRSQQ-----DKLKSVQLELQ------T-ATTV----NDYREKQGLPKIAGGDIILSAVYIQRLGQQE 474 (559) Q Consensus 411 ~~~~~~~f~~l~~~d~~-----~~~~~~~~~~~------~-~~T~----NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~ 474 (559) ...+.++|...+..-.+ ....++..... . .+.. +++.+.+|.|+.. .+..+-..+.+. T Consensus 419 ~~~v~~~yis~La~aqr~~~~~~l~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~Gvp~~~---i~~~~ee~~~~~--- 492 (535) T protein:vir:33 419 KEAVEPTISTGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSG---ILLTDEQKQALM--- 492 (535) T ss_pred ccceeEEEecHHHHHHHHHHHHHHHHHHHHHHhhChhhhhccCCHHHHHHHHHHHcCCCHhH---hcCCHHHHHHHH--- Confidence 44577777654322111 11112211110 0 1122 2233445554420 000000000000 Q ss_pred ccccccccccccccccccccCCCCCCCCCCCCccccccchhcccccccccccccccccccccccccc Q lcl|NC_012530. 475 QIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQ 541 (559) Q Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 541 (559) +.. +......... ...+..........+...+ +..+. +|-|.. T Consensus 493 ~q~----~~~~~~~~~~-~~~g~~~~~~~~~~~~~~~-------~~~~~------------~g~~~~ 535 (535) T protein:vir:33 493 MQD----AAQTGVENAA-AAGGAGVGALATSSPEAMQ-------GAAAK------------AGLNAT 535 (535) T ss_pred HHH----HHHHHHHHHH-HhhhhhhcchhhcCChhHH-------HHHHh------------ccCCCC Confidence 000 0000000000 0000000000100111111 00000 000000 No 237 >protein:vir:4073 Length: 279 # NCBI annotation: minor structural protein # Family: family:all:11744 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043552;genbank:gi:9628686;genbank:GeneID:1261159 Probab=95.27 E-value=0.00036 Score=39.49 Aligned_cols=274 Identities=17% Similarity=0.135 Sum_probs=107.1 Q ss_pred hhhhHhhhhcCCcceeee-cccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECC Q lcl|NC_012530. 99 EYAHRASTDDNGMGYQVR-LKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDS 177 (559) Q Consensus 99 ~~~~~~~~~~~g~~~~v~-~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~ 177 (559) |-...+.+....+.|..- ..||. ++-...+.--+-.++.+ .++.. ..- +.|++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~--~~~~~-----~~~----~~~~~------------- 54 (279) T protein:vir:40 1 MSLFNLSRRAEDVSFSTFTVQDPT--TDLLLGKLLGLVSYFDN--VDYSE-----ASK----LEDLF------------- 54 (279) T ss_pred CcccccchhhcccceeeeeecCcc--hhHHHHHHHHHHHHhhc--ccchh-----hhh----hhhhh------------- Confidence 111111111111112111 11111 11111111111111111 11000 000 11111 Q ss_pred CCcEEEEEEecCceEEEEecCcccccccceEEEEEec----CceeeeecccceEEEecccCCCccCCcccccHHHHHHHH Q lcl|NC_012530. 178 NGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQYID----NKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLRE 253 (559) Q Consensus 178 ~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~----~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~ 253 (559) .|.|....|. +-+.|.. .+|.|.+. +..+.....+|| -++++-..-.....||.-+= ....- T Consensus 55 ------~~~~~~~~~~--~~~~~~~----~~~~~~~~~d~fn~~vr~~~~~~v-tVP~~Dv~IieNPlv~v~~e-e~~kM 120 (279) T protein:vir:40 55 ------YWALQGKEVY--RVWYGGF----KYYAQRVNADQFNIVVREPNRREV-TIRTNDYEMLLNPFYGANPQ-RFGVM 120 (279) T ss_pred ------hhhhccceee--hhhhhhH----HHHHhhcCcchhhhheecCCccee-Eeecchhhhhhcchheeccc-hhhHH Confidence 1222222111 1111111 01111111 111111111121 11211111111122444331 22222 Q ss_pred HHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccccchh Q lcl|NC_012530. 254 FISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQAEDM 333 (559) Q Consensus 254 i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~~D~ 333 (559) ++++ .+.-..++ .+.+..+++|+++... ..++..++.+..++++..++++-+++..+ +++.+++.|...-.- T Consensus 121 ~~la--~nai~~KL-D~~~qIk~fIKTd~d~----glee~kekaR~rIk~mlalAk~~nGityi-d~~ddItQL~kDYSt 192 (279) T protein:vir:40 121 FGMA--SNGIGRRL-DSQAQIKIYWKTKVSS----GLKEVWDRIRERLTQQQQLAREFNGVSVI-GSDDDIKQIQPDYSG 192 (279) T ss_pred HHHH--Hhhhhhhh-cccceeeeEEecCcch----hHHHHHHHHHHHHHHHHHHHHhcCCeeee-cCCceeEeecccccc Confidence 2222 22223343 6667778888876432 23556666666666666666654555666 445788888633223 Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccccccCcc Q lcl|NC_012530. 334 QFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDN 413 (559) Q Consensus 334 qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~ 413 (559) ...+-.++.+...+..||||-.+|- .+-.+++..+|+..++.|++++.|-.|.. .+. T Consensus 193 slk~die~lkS~l~Sq~GinekIL~----------------GsAtE~q~iAyy~rtVePILkQyek~liY---~~E---- 249 (279) T protein:vir:40 193 SLQNDANLAIEIALSEYGMPRELLY----------------GQSNEVTIIAFAIQKVLPLLKQHDKNIIF---NQE---- 249 (279) T ss_pred ccHHHHHHHHHHHHhhcCCchhhcc----------------ccCchhhhhhHHHhhHHHHHHHhcccccc---hhh---- Confidence 3455667788889999999998872 23457788899999999999997764432 111 Q ss_pred ceeeecchhhhhHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCCC Q lcl|NC_012530. 414 YMLEFVGGDTRSQQDKLKSVQLELQTATTVNDYREKQGLPKIAGGD 459 (559) Q Consensus 414 ~~~~f~~l~~~d~~~~~~~~~~~~~~~~T~NE~R~~~gl~pi~gGD 459 (559) +...| ..+..+||+ +|---...+-+|+. .| T Consensus 250 ~fv~y--------------~ttta~gg~-~~s~~~~~~~~~~~-~~ 279 (279) T protein:vir:40 250 NFVAY--------------ISTTAKGGA-IESKSSKRDSEPVG-ND 279 (279) T ss_pred hhhhh--------------heecccCcc-cccccccccCCCCC-CC Confidence 11110 001111221 00000111234442 23 No 238 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=95.24 E-value=0.0025 Score=34.88 Aligned_cols=449 Identities=12% Similarity=0.054 Sum_probs=172.7 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |-=-. +.++..+++..++.+|...+-.-...-... -.|.-|.+...+. ... .... ... T Consensus 1 m~~~~--~~~~~~~~~k~r~~~l~~~R~~~e~~w~e~---~~~~lP~~~~~~~-------~~~------~~~~----~~~ 58 (535) T protein:vir:15 1 MADSK--RTGLGEDGAKATYDRLTNDRRAYETRAENC---AQYTIPSLFPKES-------DNE------STDY----TTP 58 (535) T ss_pred CCccc--hhccchHHHHHHHHHHHHHhhHHHHHHHHH---HHHhcccccCCCC-------Ccc------cccc----ccc Confidence 32111 355566666666666666432211100000 0133332211100 000 0000 011 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeeccccc--ccC------hhHHHHHHHHHHHHHhcCCCCCCChhhH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGD--KPT------KEQQKKIDYAERYIERMGVDYSPIRDDF 152 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~--~~~------~~~~~~~~~~~~~L~~~~p~~~~~~~~~ 152 (559) .+ ++--.|++.+|..+.....+ ..-.|++...+.. +.. .+..+-...+++.+.... .+.+| T Consensus 59 ~d-st~~~a~~~Laa~l~~~ltP-----~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-----~~snf 127 (535) T protein:vir:15 59 WQ-AVGARGLNNLASKLMLALFP-----MQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYI-----ESNSY 127 (535) T ss_pred cc-ccHHHHHHHHHHHHHHhhcC-----CCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHH-----HhcCc Confidence 22 33345777777776532211 1224555443321 111 111122222333332211 12456 Q ss_pred HHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccc---------------------------- Q lcl|NC_012530. 153 TSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRT---------------------------- 204 (559) Q Consensus 153 ~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~---------------------------- 204 (559) +.-+..++.|++++||+..++..+.. ..+.+..++-.++.+..|..|.+-. T Consensus 128 ~~~~~~~~~~L~~~G~a~l~~~~~~~-~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~ 206 (535) T protein:vir:15 128 RVTLFECLKQLIVAGNALLYLPEPEG-SYNPMKLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKAGGEKK 206 (535) T ss_pred HHHHHHHHHHHHhhCceeEEeecCCC-CceeeEEEEcCeeEEeeCCCCCeeEEEEeEeecHHHHHHHHhHhhhccccccC Confidence 66677778899999999998877643 3333333344566666666663310 Q ss_pred --cc-eEEE--------------EEecCceee----e--ecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHH Q lcl|NC_012530. 205 --RG-KIYR--------------QYIDNKVRG----S--FTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTE 261 (559) Q Consensus 205 --~~-~~y~--------------~~~~~~~~~----~--~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~ 261 (559) .. ..|. +..++.... . |..-=.+..+++.. ....||.||.+-+...+....... T Consensus 207 ~~~~v~v~~~v~~~~~~~~~~~~~e~~g~~~~~~~~~~~~~~~P~i~~Rw~~~---~ge~YGrgp~~~~l~D~k~L~~l~ 283 (535) T protein:vir:15 207 MDEMVDVYTHVYLDEESGDYLKYEEVEDVEIDGSDATYPTDAMPYIPVRMVRI---DGESYGRSYCEEYLGDLRSLENLQ 283 (535) T ss_pred CCCceeEEEEEEEecCCCcEEEEEEeeCccccccccccccccCCceeeeeeec---CCCccccchHHHHHHHHHHHHHHH Confidence 00 0111 001111100 0 00000122222222 234699999999999998888888 Q ss_pred HHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc--cCCceeeeeccccchhHH-HHH Q lcl|NC_012530. 262 LFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI--TAEDAKFVSMTQAEDMQF-QSW 338 (559) Q Consensus 262 ~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl--~~g~~~~~~ls~~~D~qf-~e~ 338 (559) +.....-.-...|..++.-++. .... +...|..+ .++ ..+++...++....|.+. .+. T Consensus 284 ~~~l~~~~~~~~p~~lv~~~g~-----~~~~----------~l~~~~~g----~~v~g~~~~v~~~~~~~~~~~~~~~~~ 344 (535) T protein:vir:15 284 EAIVKMSMISAKVIGLVNPAGI-----TQPR----------RLTKAQTG----DFVPGRREDIDFLQLEKQADFTVAKAV 344 (535) T ss_pred HHHHHHHHHHhcCceeeccccc-----ccch----------hcccCCce----eeecCCcccceeeecccccchhHHHHH Confidence 8887777777777766532211 1111 11122222 122 234566666666567664 455 Q ss_pred HHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHH-------HHHHHHHHhhHHHHHHHHHH-Hhhcccccc Q lcl|NC_012530. 339 LNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNK-------IDASKSKGLMPLLDMIAKNL-TNGIIRQIL 410 (559) Q Consensus 339 ~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~-------~~~~~~~~l~P~~~~ie~~l-n~~L~~~~~ 410 (559) .+.....|.++|-+. .+...+...-+..+ .....++. ...+-...|.|++.+.-..+ ...+|++.. T Consensus 345 i~~~~~~I~~af~~~--~~~~~~~~r~TAtE----V~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p 418 (535) T protein:vir:15 345 SDQIEARLSYAFMLN--SAVQRTGERVTAEE----IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELP 418 (535) T ss_pred HHHHHHHHHHHHhhh--hcccCCCccccHHH----HHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCC Confidence 677788898998544 22222222111111 11111111 11233344445444433333 223444433 Q ss_pred CccceeeecchhhhhHH-----HHHHHHHHHHc------C-CCCH----HHHHHHhCCCCCCCCCEeeccceeccccccc Q lcl|NC_012530. 411 GDNYMLEFVGGDTRSQQ-----DKLKSVQLELQ------T-ATTV----NDYREKQGLPKIAGGDIILSAVYIQRLGQQE 474 (559) Q Consensus 411 ~~~~~~~f~~l~~~d~~-----~~~~~~~~~~~------~-~~T~----NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~ 474 (559) ...+.++|...+..-.+ ....++..... . .+.. +++.+.+|.||.. .+..+--.+.+ . T Consensus 419 ~~~v~~~yis~La~aqr~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gvp~~~---i~~~~eev~~~---~ 492 (535) T protein:vir:15 419 KEAVEPTISTGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSG---ILLTDEQKQAL---M 492 (535) T ss_pred ccceeEEEecHHHHHHHHHHHHHHHHHHHHHHhcChhhhhccCCHHHHHHHHHHHcCCChhh---hcCCHHHHHHH---H Confidence 44577777654322111 11112211110 0 1122 2333455655421 00000000000 0 Q ss_pred ccccccccccccccccccccCCCCCCCCCCCCccccccchhcccccccccccccccccccccccccc Q lcl|NC_012530. 475 QIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQ 541 (559) Q Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 541 (559) +..+ ... .........+..-...+...|+... + .-+.+|.+.. T Consensus 493 ~q~~----~~~-~~~~~a~~~g~~~~~~~~~~p~~~~-------~------------~~~~~g~~~~ 535 (535) T protein:vir:15 493 MQDA----AQT-GIENAAATGGAGVGALATSSPEAMQ-------G------------AAAQAGLDAT 535 (535) T ss_pred HHHH----HHH-HHHHHHHHHHhhccchhccChHHHH-------H------------HHhccCCCCC Confidence 0000 000 0000000000000000111111100 0 0111111111 No 239 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=94.12 E-value=0.0053 Score=33.05 Aligned_cols=383 Identities=9% Similarity=-0.033 Sum_probs=142.2 Q ss_pred ccccccccccccccccccccccCCCCCcc-cHHHHHHHHh-hChHHHHHHHHH---------H--HHHHhhhhHhhhhcC Q lcl|NC_012530. 43 YTEPVDGNLMFSTLEDTSIVPKPSPIAFG-RITDVLRQYS-MNVVLNAIINTR---------A--NQVTEYAHRASTDDN 109 (559) Q Consensus 43 ~~~~~~~~~~~~~~~~~~~~~~p~~~~~~-~~~~~~~~~~-~~~~v~acv~~i---------a--~~ia~~~~~~~~~~~ 109 (559) .+...+.. + ......+. .+..+.+-+. .+.+.++-.... + .-+..++..+..... T Consensus 1 l~~~~i~~----------~--i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~ 68 (451) T protein:vir:10 1 MELEKIRA----------I--ISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKA 68 (451) T ss_pred CCHHHHHH----------H--HHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhh Confidence 00000000 0 00000000 0111111111 111111100000 0 000111111111111 Q ss_pred C----cceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCC------- Q lcl|NC_012530. 110 G----MGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSN------- 178 (559) Q Consensus 110 g----~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~------- 178 (559) + .+..+...+ .....+.+..|+.+ .+......+..+.+.+|.||..+.++.+ T Consensus 69 ~yl~G~p~~~~~~~-------~~~~~~~~~~~~~n----------~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~ 131 (451) T protein:vir:10 69 SYMFTYPVLFDIDN-------NKELNEKVTDVLGN----------EFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVT 131 (451) T ss_pred hheecccceeecCC-------cHHHHHHHHHHhcc----------CHHHHHHHHHHHHhhcCeEEEEEeecCCccccccc Confidence 1 111111000 01111122222211 2334556677888999999999888764 Q ss_pred -CcEEEEEEecCceEEEEecCcc-cccccceEEEEEecCc----------eeeeecccceEEEec--------------- Q lcl|NC_012530. 179 -GRLSHTRMVDPTTIYFANDEHG-HRRTRGKIYRQYIDNK----------VRGSFTADEMGMFIR--------------- 231 (559) Q Consensus 179 -G~~~~L~~l~p~~V~~~~~~~g-~~~~~~~~y~~~~~~~----------~~~~~~~~evi~~~~--------------- 231 (559) | -..+..++|..+.++.++.. ......++|+...... ....++.+.+.++.. T Consensus 132 ~~-~~~~~~i~p~~~~~vydd~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~ 210 (451) T protein:vir:10 132 NQ-TFKYGVVNTEEIIPIYRNGIERELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITV 210 (451) T ss_pred cc-ceeEEEEcccceEEEEcCCCCCceEEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCccccccccccc Confidence 3 23477789988887765432 1222233333222110 111233333333221 Q ss_pred -c-----cCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHh Q lcl|NC_012530. 232 -N-----PRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATS 305 (559) Q Consensus 232 -n-----~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~ 305 (559) | |.-.......|.|-++.....++....+..-..+.+...+.|-.++ .+. .+....+....++ . T Consensus 211 ~~~~g~vPvv~~~nn~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~--~g~--~~~~~~~~~~~~~----~-- 280 (451) T protein:vir:10 211 QHRFNSVPFVEFSNNIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYIL--ENF--GGEDTSEFLKELK----R-- 280 (451) T ss_pred cCCCCeeeEEEeccCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeee--ecC--CcccchhhHHHHh----h-- Confidence 0 0000001123555555544444444433333344444444454333 321 1122223222221 1 Q ss_pred cCccccccccccc------CCceeeeeccccchhHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccc--hhhhh Q lcl|NC_012530. 306 SGINGAYRIPMIT------AEDAKFVSMTQAEDMQFQSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNS--LNESN 377 (559) Q Consensus 306 ~G~~nag~~~vl~------~g~~~~~~ls~~~D~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~--~~~an 377 (559) .++.++. +++++|..-..+ +..+....+...+.|...-++|. +.. .+. ++.++.. .-+.. T Consensus 281 ------~~~i~~~~~~~~~~~~~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~~p~--~~~--~~~-gn~Sg~Alk~~~~~ 348 (451) T protein:vir:10 281 ------YKTIKTETDSEGDSGGLKTMQIEIP-TEARKIILEILKKQIYESGQGLQ--QDT--ENF-GNASGVALKFFYRK 348 (451) T ss_pred ------CCeEEecCcCCccCCcceEEeecCC-HHHHHHHHHHHHHHHHHHhCccc--ccc--ccc-ccccHHHHHHHHHH Confidence 1111221 234555433322 33456678888889999889984 221 111 1111100 01111 Q ss_pred HH---HHHHHHHHHHhhHHHHHHHHHHHhhccccccCccceeeecchhhhhHHHHHHHHHHHHcCCCCHHHHHHHhCCCC Q lcl|NC_012530. 378 NQ---NKIDASKSKGLMPLLDMIAKNLTNGIIRQILGDNYMLEFVGGDTRSQQDKLKSVQLELQTATTVNDYREKQGLPK 454 (559) Q Consensus 378 ~~---~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~~f~~l~~~d~~~~~~~~~~~~~~~~T~NE~R~~~gl~p 454 (559) .. ...+..+..+|+-+++.|...++ ......+.+.|+.....+..+.++.+..+. |+++..-+.++++.-. T Consensus 349 l~~k~~~k~~~f~~~l~~~~~li~~~~~-----~~d~~~i~i~f~~~~p~n~~e~~~~~~kl~-g~iS~et~~~~~p~v~ 422 (451) T protein:vir:10 349 LELKSGLLETEFRTSFDKLIKAILYFLG-----VTDYKKIQQTYTRNMMSNDLEDADIATKSV-GIIPTKIILRHHPWVD 422 (451) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCCccceeEEecCCCCCCHHHHHHHHHHHh-ccCchHHHHHhCCCCC Confidence 11 11122333344444433333222 223456788899888899999988887663 6678777777664411 Q ss_pred CCCCCEeeccceecccccccccccccccccccccccccccCCCCCC Q lcl|NC_012530. 455 IAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSG 500 (559) Q Consensus 455 i~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 500 (559) -+ + ..+..+....+ .. ........++..+ T Consensus 423 d~--~--------~e~~~~~ee~~----~~---~~~~~~~~~~~~~ 451 (451) T protein:vir:10 423 DV--E--------EAEKLYLEEKK----IQ---ASKVSDDYNNFTE 451 (451) T ss_pred CH--H--------HHHHHHHHHHH----HH---HHHHHhhcCCCCC Confidence 00 0 00000000000 00 0000000000000 No 240 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=91.21 E-value=0.017 Score=30.29 Aligned_cols=420 Identities=13% Similarity=0.061 Sum_probs=164.3 Q ss_pred ccccccccccccccccccccccccCCCCCcccH--HHHHHHHhhChH-HHHHHHHHHHHH---HhhhhHhhhhcCCccee Q lcl|NC_012530. 41 RAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRI--TDVLRQYSMNVV-LNAIINTRANQV---TEYAHRASTDDNGMGYQ 114 (559) Q Consensus 41 ~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~--~~~~~~~~~~~~-v~acv~~ia~~i---a~~~~~~~~~~~g~~~~ 114 (559) ++|..+..++.....+...++....+......+ ..++..++.|.+ =++.| ++.+.- ..+........=+...+ T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~-lrg~~~~~~r~~~~ps~~~~~~~~~~ 79 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVI-LRGGDEGDQRPIYVPNGEKLIEAKMR 79 (527) T ss_pred CCccccccCCCcCcCCccccCcccCCHHHHHHHHHHHHHHHHhcCchhheeee-cCCccccccceeeehhhHHhhCCcce Confidence 666555554432212222222112221111111 122223333321 11111 111110 00000000000011112 Q ss_pred eeccccc-ccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECC---CCcEEEEEEecCc Q lcl|NC_012530. 115 VRLKNGD-KPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDS---NGRLSHTRMVDPT 190 (559) Q Consensus 115 v~~~d~~-~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~---~G~~~~L~~l~p~ 190 (559) +...... ..+...++-...+..|..+- ++.......-++.++.|.+.+.+++|. .|.=+.+..+||. T Consensus 80 ~~~~g~~~~~~~~~e~v~~~lr~~~~~e---------~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP~ 150 (527) T protein:vir:10 80 FLGQGLKWEFSKKDAKVDDAIRVLFDRE---------NWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDPS 150 (527) T ss_pred eeccCccccccchhHHHHHHHHHHHHHh---------hhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCcc Confidence 2111111 11222222222233344332 223344556677888999999999984 3345678999998 Q ss_pred eEEEEecCcccccccceEEE----------------------EEe--cCcee------ee-------------------- Q lcl|NC_012530. 191 TIYFANDEHGHRRTRGKIYR----------------------QYI--DNKVR------GS-------------------- 220 (559) Q Consensus 191 ~V~~~~~~~g~~~~~~~~y~----------------------~~~--~~~~~------~~-------------------- 220 (559) ++.++.+.++.....+...+ +.. .+..+ .+ T Consensus 151 ~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e~p~~~~ 230 (527) T protein:vir:10 151 TYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPESPLEPD 230 (527) T ss_pred eeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccccccccchh Confidence 88887776553322211101 000 00000 00 Q ss_pred ----ecccc-------------eEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCc Q lcl|NC_012530. 221 ----FTADE-------------MGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPS 283 (559) Q Consensus 221 ----~~~~e-------------vi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ 283 (559) ....+ |+|++-.| .....+|.|-|+-+...+.....+..-......-++.|-.+++- . T Consensus 231 ~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p---~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg--~ 305 (527) T protein:vir:10 231 DIKKLSTLTEEEPLPEQITTLPVFHFRGHP---IMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDS--A 305 (527) T ss_pred hhhhhcCceeeecccCCCCccceEeecCCC---ccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeecc--c Confidence 00011 23442222 22345798888766555544433333333333336666555421 1 Q ss_pred cCCccCCHHHHHHHHHHHHHHhcCcccc---cccccc-cCCceeeeeccccchhH-HHHHHHHHHHHHHHHhCCCHHHhc Q lcl|NC_012530. 284 PSVTNTSMRALEDFKRHWTATSSGINGA---YRIPMI-TAEDAKFVSMTQAEDMQ-FQSWLNYLINIICALVAMDPAEIG 358 (559) Q Consensus 284 ~~~~~~~~e~~~~l~~~~~~~~~G~~na---g~~~vl-~~g~~~~~~ls~~~D~q-f~e~~~~~~~~Ia~~fgVPp~~lg 358 (559) .+-+. .|..+- +--.|+ .+++-++..++...+.+ |........+.|+..=++|.+-+| T Consensus 306 -~~vd~----------------~G~~~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G 368 (527) T protein:vir:10 306 -PPRDS----------------RGNMVPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIAVG 368 (527) T ss_pred -ccccc----------------cCCcCccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeec Confidence 11110 111110 000122 23345677666544543 666788888899999999999999 Q ss_pred cccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHH-------HHh--------------hc-ccccc-Cccce Q lcl|NC_012530. 359 MQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKN-------LTN--------------GI-IRQIL-GDNYM 415 (559) Q Consensus 359 ~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~-------ln~--------------~L-~~~~~-~~~~~ 415 (559) -.+.+..-+ . .-+.-.|.|++.+.+.. +.+ .+ +.... ...+. T Consensus 369 ~vD~s~~~S-------G--------~ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ 433 (527) T protein:vir:10 369 VVDAAVAES-------G--------IALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVT 433 (527) T ss_pred cccCCcCcH-------H--------HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceE Confidence 655432110 0 11222344554432211 100 00 01111 12457 Q ss_pred eeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCC-CCCCCEeeccceecccccccccccccccccccccccccc Q lcl|NC_012530. 416 LEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPK-IAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLES 493 (559) Q Consensus 416 ~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~p-i~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 493 (559) +.|-..++.|.++..+-...++. |+++.-=+-++|+--+ ++...+ ....+........-....+...-.. T Consensus 434 ivf~p~lP~D~~avie~v~tL~~aGiiS~etAv~~L~~~~g~eD~E~--------E~~~I~~era~~a~a~a~a~~~~~a 505 (527) T protein:vir:10 434 ITFRDPKPVNNEKRFAQLLELWEAGLIPAKKLTEELSKIMGFELTEE--------DFRQATEDKKTQGIAQAEAADPFGA 505 (527) T ss_pred EEecccCCCCHHHHHHHHHHHHHcCchhHHHHHHHHHhccCCCchHH--------HHHHHHHHHHHHhHHhhhhcCchhh Confidence 78888899999999988877775 4568777766662100 221111 0000000000000000000000000 Q ss_pred cCCCCCCCCCCCCccccccchhccccccccccccccccccccc Q lcl|NC_012530. 494 ALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGV 536 (559) Q Consensus 494 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 536 (559) ..+. ...-+++++++ . +|...- T Consensus 506 ~~~~-~~g~~~~~~d~------------------~--~~~~~~ 527 (527) T protein:vir:10 506 QMAA-EQGIPDEEDDQ------------------A--LNGQPL 527 (527) T ss_pred hhcc-ccCCCCCCccc------------------c--cCCCCC Confidence 0000 00000000000 0 000000 No 241 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=91.20 E-value=0.017 Score=30.29 Aligned_cols=420 Identities=12% Similarity=0.061 Sum_probs=164.3 Q ss_pred ccccccccccccccccccccccccCCCCCcccH--HHHHHHHhhChH-HHHHHHHHHHHH---HhhhhHhhhhcCCccee Q lcl|NC_012530. 41 RAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRI--TDVLRQYSMNVV-LNAIINTRANQV---TEYAHRASTDDNGMGYQ 114 (559) Q Consensus 41 ~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~--~~~~~~~~~~~~-v~acv~~ia~~i---a~~~~~~~~~~~g~~~~ 114 (559) ++|..+..++.....+...++....+......+ ..++..++.|.+ =++.| ++.+.- ..+........=+...+ T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~-lrg~~~~~~r~~~~ps~~~~~~~~~~ 79 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVI-LRGGDEGDQRPIYVPNGEKLIEAKMR 79 (527) T ss_pred CCccccccCCCcCcCCccccCcccCCHHHHHHHHHHHHHHHHhcCchhheeee-cCCccccccceeeehhhHHhhCCcce Confidence 666555554432212222222122221111111 122223333321 11111 111110 00000000000011112 Q ss_pred eeccccc-ccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECC---CCcEEEEEEecCc Q lcl|NC_012530. 115 VRLKNGD-KPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDS---NGRLSHTRMVDPT 190 (559) Q Consensus 115 v~~~d~~-~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~---~G~~~~L~~l~p~ 190 (559) +...... ..+...++-...+..|..+- ++.......-++.++.|.+.+.+++|. .|.=+.+..+||. T Consensus 80 ~~~~g~~~~~~~~~e~v~~~lr~~~~~e---------~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP~ 150 (527) T protein:vir:10 80 FLGQGLKWEFSKKDAKVDDAIKVLFDRE---------NWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDPS 150 (527) T ss_pred eeccCccccccchhHHHHHHHHHHHHHh---------hhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCcc Confidence 2111111 11222222222233344432 223344556677888999999999984 3345678999998 Q ss_pred eEEEEecCcccccccceEEE----------------------EEe--cCcee------ee-------------------- Q lcl|NC_012530. 191 TIYFANDEHGHRRTRGKIYR----------------------QYI--DNKVR------GS-------------------- 220 (559) Q Consensus 191 ~V~~~~~~~g~~~~~~~~y~----------------------~~~--~~~~~------~~-------------------- 220 (559) ++.++.+.++.....+...+ +.. .+..+ .+ T Consensus 151 ~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e~p~~~~ 230 (527) T protein:vir:10 151 TYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPESPLEPD 230 (527) T ss_pred eeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccccccccchh Confidence 88887776553322211101 000 00000 00 Q ss_pred ----ecccc-------------eEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCc Q lcl|NC_012530. 221 ----FTADE-------------MGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPS 283 (559) Q Consensus 221 ----~~~~e-------------vi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ 283 (559) ....+ |+|++-.| .....+|.|-|+-+...+.....+..-......-++.|-.+++- . T Consensus 231 ~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p---~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg--~ 305 (527) T protein:vir:10 231 DIKKLSTLTEEEPLPEQITTLPVFHFRGHP---IMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDS--A 305 (527) T ss_pred hhhhhcCceeeecccCCCCccceEeecCCC---ccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeecc--c Confidence 00011 23442222 22345798888766555544433333333333336666555421 1 Q ss_pred cCCccCCHHHHHHHHHHHHHHhcCcccc---cccccc-cCCceeeeeccccchhH-HHHHHHHHHHHHHHHhCCCHHHhc Q lcl|NC_012530. 284 PSVTNTSMRALEDFKRHWTATSSGINGA---YRIPMI-TAEDAKFVSMTQAEDMQ-FQSWLNYLINIICALVAMDPAEIG 358 (559) Q Consensus 284 ~~~~~~~~e~~~~l~~~~~~~~~G~~na---g~~~vl-~~g~~~~~~ls~~~D~q-f~e~~~~~~~~Ia~~fgVPp~~lg 358 (559) .+-+. .|..+- +--.|+ .+++-++..++...+.+ |........+.|+..=++|.+-+| T Consensus 306 -~~vd~----------------~G~~~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G 368 (527) T protein:vir:10 306 -PPRDS----------------RGNMVPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIAVG 368 (527) T ss_pred -ccccc----------------cCCcCccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeec Confidence 11110 111110 000122 23345677666544543 666788888899999999999999 Q ss_pred cccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHH-------HHh--------------hc-ccccc-Cccce Q lcl|NC_012530. 359 MQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKN-------LTN--------------GI-IRQIL-GDNYM 415 (559) Q Consensus 359 ~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~-------ln~--------------~L-~~~~~-~~~~~ 415 (559) -.+.+..-+ . .-+.-.|.|++.+.+.. +.+ .+ +.... ...+. T Consensus 369 ~vD~s~~~S-------G--------~ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ 433 (527) T protein:vir:10 369 VVDAAVAES-------G--------IALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVT 433 (527) T ss_pred cccCCcCcH-------H--------HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceE Confidence 655432110 0 11222344554432211 100 00 01111 12457 Q ss_pred eeecchhhhhHHHHHHHHHHHHc-CCCCHHHHHHHhCCCC-CCCCCEeeccceecccccccccccccccccccccccccc Q lcl|NC_012530. 416 LEFVGGDTRSQQDKLKSVQLELQ-TATTVNDYREKQGLPK-IAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLES 493 (559) Q Consensus 416 ~~f~~l~~~d~~~~~~~~~~~~~-~~~T~NE~R~~~gl~p-i~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 493 (559) +.|-..++.|.++..+-...++. |+++.-=+-++|+--+ ++..++ ....+........-....+...-.. T Consensus 434 ivf~p~lP~D~~avie~v~tL~~aGi~S~~tAv~~L~~~~g~eD~E~--------E~~~I~~era~~a~a~a~A~~~~~a 505 (527) T protein:vir:10 434 ITFRDPKPVNSEKRFNQLLQLWEAGLIPAKKLTEELSKIMGFELTEE--------DFKQATEDKKTQGIAQAEAADPFGA 505 (527) T ss_pred EEecccCCCCHHHHHHHHHHHHHcCchhHHHHHHHHHhccCCCChHH--------HHHHHHHHHHHHhHHhhhhcCchhh Confidence 78888899999999988877775 4568777766662100 221111 0000000000000000000000000 Q ss_pred cCCCCCCCCCCCCccccccchhccccccccccccccccccccc Q lcl|NC_012530. 494 ALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGV 536 (559) Q Consensus 494 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 536 (559) ..+. ...-+++++++ . +|...- T Consensus 506 ~~~~-~~g~~~~~~d~------------------~--~~~~~~ 527 (527) T protein:vir:10 506 QMAA-EQGIPDEEDDQ------------------A--LNGQPL 527 (527) T ss_pred hhcc-ccCCCCCCccc------------------c--cCCCCC Confidence 0000 00000000000 0 000000 No 242 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=90.89 E-value=0.019 Score=30.08 Aligned_cols=430 Identities=12% Similarity=0.052 Sum_probs=156.5 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHH---HHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIAN---DTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVL 77 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~ 77 (559) |++-.|| +.++.-+.++.. ++.. |.-|....... .... ..... T Consensus 1 m~~~~r~----------~~L~~~R~~~e~~w~e~~~---------~tlP~~~~~~~--~~~~---------~~~~~---- 46 (522) T protein:vir:10 1 MKARERY----------NQLTTARQMFLDKAVECSE---------LTLPYLIDDDI--SSRP---------NHKSL---- 46 (522) T ss_pred CchHHHH----------HHHHHHhhHHHHHHHHHHH---------HhhhcccCCCC--CCCc---------ccccc---- Confidence 9988888 333333333322 3333 33332221100 0000 00000 Q ss_pred HHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeeccccc---ccChhH----HHHHHHHHHHHHhcCCCCCCChh Q lcl|NC_012530. 78 RQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGD---KPTKEQ----QKKIDYAERYIERMGVDYSPIRD 150 (559) Q Consensus 78 ~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~---~~~~~~----~~~~~~~~~~L~~~~p~~~~~~~ 150 (559) ....+ ++--.|++.+|..+..... ..+.-.|++...+.. ..+++. .+....+++.+... ..+. T Consensus 47 ~~~~d-stg~~a~~~LAa~l~~~lt----pp~~~WF~l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~-----l~~s 116 (522) T protein:vir:10 47 TVPWQ-SVGAKCCVTLAAKLMLAVL----PPQTSFFKLQVRDDKLGEELDPQIRSELDLSFSKMERMIMDY-----IAAS 116 (522) T ss_pred ccccc-chHHHHHHHHHHHHHHhhc----CCCCccccccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHH-----HHhc Confidence 01122 3334577777776643211 112234455443321 112211 11122233333221 1124 Q ss_pred hHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCccccccc------------------------- Q lcl|NC_012530. 151 DFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTR------------------------- 205 (559) Q Consensus 151 ~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~------------------------- 205 (559) +|+.-+..++.|+.++||+..++..+. ...||| .++.+..|..|.+-.- T Consensus 117 nf~~~~~~~~~~L~~~G~a~ly~~~~~----~~~~pl--~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~ 190 (522) T protein:vir:10 117 NDRVAVHQALKHLIVGGNALIFMGKDG----LKTFPL--TRYVINRDGDGNVLEIVTKELISRKVLDIELPEPKPNTGID 190 (522) T ss_pred CcHHHHHHHHHHHHhHCceeEEEcCCC----ceEEEc--ceEEEeeCCCCCeeEEEeeeeccHHHHHHhcchhccchhhh Confidence 566667778889999999998765432 234444 3444445555432100 Q ss_pred -------c-eEE--------------EEEecCceee----eecccc--eEEEecccCCCccCCcccccHHHHHHHHHHHH Q lcl|NC_012530. 206 -------G-KIY--------------RQYIDNKVRG----SFTADE--MGMFIRNPRSDILSGGYGLSELEMGLREFISH 257 (559) Q Consensus 206 -------~-~~y--------------~~~~~~~~~~----~~~~~e--vi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~ 257 (559) . ..| ++..++.... ....++ .+..+++.. ....||.||.+-+.-.+... T Consensus 191 ~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~~~~~~s~~g~~~~P~~~~Rw~~~---~ge~YGrgp~~~~l~D~k~L 267 (522) T protein:vir:10 191 ESSTTNDDVTIYTYVKLDKSSGRWVWHQEAFDKIIPDSRSTAPKNASPWLPLRFNTV---DGEDYGRGRVEEFLGDLKSL 267 (522) T ss_pred cccCCCCceEEEEEEEeeccCCceEEEEccCCccccccccccccccCCceeeeeeec---CCCccccchHHHHHHHHHHH Confidence 0 000 0000110000 000000 111122211 23469999999999999888 Q ss_pred HHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc--CCceeeeeccccchhHH Q lcl|NC_012530. 258 ENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT--AEDAKFVSMTQAEDMQF 335 (559) Q Consensus 258 ~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~--~g~~~~~~ls~~~D~qf 335 (559) ....+.....-.-...|..++.-++... . .....|..+ .++. .+++...++....|.+. T Consensus 268 ~~l~~~~~~~~~~a~~p~~lv~~~~~~~-----~----------~~l~~~~~~----~~v~g~~~~v~~~~~~~~~d~~~ 328 (522) T protein:vir:10 268 DGLSQSLIEGAAAASKVVFLVSPSSTTK-----P----------ATIAKAGNG----AIVQGRPEDVAVIQVGKTADFST 328 (522) T ss_pred HHHHHHHHHHHHHhcCCceeeccccccc-----c----------ccccCCCCc----ceecCCCccceeecccccccchH Confidence 8888877777777777775553221111 1 011122222 2332 23455555555667764 Q ss_pred -HHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHH-------HHHHHHHhhHHHHHHHHHHHh-hcc Q lcl|NC_012530. 336 -QSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKI-------DASKSKGLMPLLDMIAKNLTN-GII 406 (559) Q Consensus 336 -~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~-------~~~~~~~l~P~~~~ie~~ln~-~L~ 406 (559) .+..+..+..|..+|- ++...+...-+. +-.....++.. ..+....|.|++.+.-..+.+ .+| T Consensus 329 ~~~~i~~~~~ri~~aFl----~~~~~d~~rvTA----tEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~l 400 (522) T protein:vir:10 329 AANMATAIEKRLLEAFL----VMNVRNAERVTA----EEVRLTQLELEQQLGGIFSLLVIEFLIPYLNRTLLVLQRSNQI 400 (522) T ss_pred HHHHHHHHHHHHHHHHh----hccCCCCCCCCH----HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 4556777778888873 222212111111 11111111111 123344445555444333332 234 Q ss_pred ccccC---ccceeeecc-hhhhhHHHHH-HHHHHH--HcC------CCCH----HHHHHHhCCCCCCCCCEeeccceecc Q lcl|NC_012530. 407 RQILG---DNYMLEFVG-GDTRSQQDKL-KSVQLE--LQT------ATTV----NDYREKQGLPKIAGGDIILSAVYIQR 469 (559) Q Consensus 407 ~~~~~---~~~~~~f~~-l~~~d~~~~~-~~~~~~--~~~------~~T~----NE~R~~~gl~pi~gGD~~~~~~~~~~ 469 (559) ++... ....+++.. +.+....++. .++... +.+ -+.. +++-+.+|.|+.. .+.. T Consensus 401 P~~p~~~~~~~~v~~is~Laraq~~~~l~~~~~~i~~~~~p~~~~~~id~d~~~~~~a~~~Gvp~~~---ivrt------ 471 (522) T protein:vir:10 401 PKLPKDIVRPTIVAGVNALGRGQDRESLTAFVGTIAQTLGPEALMQYLNPLEAIKRLAAAQGIDVLN---LVKT------ 471 (522) T ss_pred CCCCccccccccccchhHHHHHHHHHHHHHHHHHHHHhhCchhhhhcCCHHHHHHHHHHHhCCChhh---hcCC------ Confidence 33211 122333332 2232211211 222211 000 1122 2223344544210 0000 Q ss_pred ccccccccccccccc-ccccccccccCCCCCCCCCCCCccccccchhcccccccccccccccccccccccc Q lcl|NC_012530. 470 LGQQEQIKQNEFQRQ-QTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKD 539 (559) Q Consensus 470 l~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 539 (559) ...+.+..+...+.. +...-....+. ...+..++.. ++++-+-.+.++++ T Consensus 472 ~eev~~~~q~~q~~~~~~~~~~~a~~~----~~~~~~~~~~----------------~~~~~~~~~~~~~~ 522 (522) T protein:vir:10 472 EQQLAEEQQAAQQQAAQQSLVDQAGQM----TGSPLMDPTK----------------NPQLMDEEQPPMEE 522 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH----hcccccCccc----------------cHHHHHHhCCCCCC Confidence 000000000000000 00000000000 0000000000 00000011111111 No 243 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=88.22 E-value=0.034 Score=28.66 Aligned_cols=448 Identities=13% Similarity=0.085 Sum_probs=160.4 Q ss_pred hhhhccccccCCcchHHHHHHHHHH--HHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHh Q lcl|NC_012530. 4 FDRFRTKFYTDDPNAFFKHIDSKIA--NDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYS 81 (559) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~ 81 (559) .+.=|+.+.-+++..++.+|...+- +...+. =-.|.-|.....+-.. ... . +.... T Consensus 1 m~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e-----~~~~~lP~~~~~~~~~----------~~~---~----~~~~~ 58 (536) T protein:vir:21 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQN-----CAQYTIPSLFPKDSDN----------AST---D----YQTPW 58 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHH-----HHHHhcccccCCCCCc----------ccc---c----ccccc Confidence 2333445555555555555544321 111100 0012333221111000 000 0 01112 Q ss_pred hChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeeccccc--ccC--h----hHHHHHHHHHHHHHhcCCCCCCChhhHH Q lcl|NC_012530. 82 MNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGD--KPT--K----EQQKKIDYAERYIERMGVDYSPIRDDFT 153 (559) Q Consensus 82 ~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~--~~~--~----~~~~~~~~~~~~L~~~~p~~~~~~~~~~ 153 (559) + ++--.|++.+|..+.....+ .+-.|++...+.. +.. + +.++-...+++.+.... .+.+|+ T Consensus 59 d-st~~~a~~~Laa~l~~~ltP-----~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-----~~snf~ 127 (536) T protein:vir:21 59 Q-AVGARGLNNLASKLMLALFP-----MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYI-----ESNSYR 127 (536) T ss_pred c-ccHHHHHHHHHHHHHHhhcC-----CCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHH-----HhcCcH Confidence 2 33345777777666432111 1234555444332 111 1 11111222233332211 123556 Q ss_pred HHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccc----------------------------- Q lcl|NC_012530. 154 SFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRT----------------------------- 204 (559) Q Consensus 154 ~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~----------------------------- 204 (559) .-+...+.|+.++||+..++..+..+.+..+..++..++.+..|..|.+-. T Consensus 128 ~~~~~~~~~L~~~G~a~ly~~e~~~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~ 207 (536) T protein:vir:21 128 VTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKA 207 (536) T ss_pred HHHHHHHHHHHhHCcEeEEEeeCCCCceeeEEEEEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhccccccccc Confidence 666677889999999999987765544444433334566666666664310 Q ss_pred -cce-EE--EEEe-cCceeeee---------c--------ccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHH Q lcl|NC_012530. 205 -RGK-IY--RQYI-DNKVRGSF---------T--------ADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTEL 262 (559) Q Consensus 205 -~~~-~y--~~~~-~~~~~~~~---------~--------~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~ 262 (559) ... .| ++.. ++.....+ . .-=.+..+++.. ....||.||.+-+...+.......+ T Consensus 208 ~~~v~v~~~v~~~~~~~~~~~~~e~~g~~v~~~~g~~~f~~~P~i~~Rw~~~---~ge~YGrgp~~~~l~D~k~L~~l~~ 284 (536) T protein:vir:21 208 DETIDVYTHIYLDEDSGEYLRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRL---DGESYGRSYIEEYLGDLRSLENLQE 284 (536) T ss_pred ccceeEEEEEEEecCCCcEEEEeccCCeeeccccCccccccCCeeeeeeeec---CCCccccchHHHHHHHHHHHHHHHH Confidence 000 01 0000 01010000 0 001123333322 2346999999998888888777766 Q ss_pred HHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc--CCceeeeeccccchhHH-HHHH Q lcl|NC_012530. 263 FNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT--AEDAKFVSMTQAEDMQF-QSWL 339 (559) Q Consensus 263 ~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~--~g~~~~~~ls~~~D~qf-~e~~ 339 (559) .....-.-...|..++.- .+ -..+. ....+..+ .++. .+++...++....|.+. .+.. T Consensus 285 ~~l~~~~~a~~~~~lv~p-~g----~~~~~----------~~~~~~~g----~~v~g~~~~v~~~~~~~~~~~~~~~~~i 345 (536) T protein:vir:21 285 AIVKMSMISSKVIGLVNP-AG----ITQPR----------RLTKAQTG----DFVTGRPEDISFLQLEKQADFTVAKAVS 345 (536) T ss_pred HHHHHHHHHhcCCcccCc-cc----ccchh----------hhccCCCc----ceecCCcccceeeeccccccchHHHHHH Confidence 666654444555444421 11 11111 11111111 1222 23455666666667663 4567 Q ss_pred HHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHH-------HHHHHHHHhhHHHHHHHHHH-HhhccccccC Q lcl|NC_012530. 340 NYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNK-------IDASKSKGLMPLLDMIAKNL-TNGIIRQILG 411 (559) Q Consensus 340 ~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~-------~~~~~~~~l~P~~~~ie~~l-n~~L~~~~~~ 411 (559) +.....|.++|-+.. +...+...-+..+ .....++. ...+-...|.|++.+.-..+ ...+|++... T Consensus 346 ~~~~~rI~~af~~~~--l~~~~~~r~TAtE----V~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~ 419 (536) T protein:vir:21 346 DAIEARLSFAFMLNS--AVQRTGERVTAEE----IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPK 419 (536) T ss_pred HHHHHHHHHHHhhhh--cccCCCCCccHHH----HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCh Confidence 778888989995542 2222221111111 11111111 11233344445554433333 2334443322 Q ss_pred ccceeeecchh----hhhHHHH-HHHHHHHHc------C-CCCHHH----HHHHhCCCCCCCCCEeeccceecccccccc Q lcl|NC_012530. 412 DNYMLEFVGGD----TRSQQDK-LKSVQLELQ------T-ATTVND----YREKQGLPKIAGGDIILSAVYIQRLGQQEQ 475 (559) Q Consensus 412 ~~~~~~f~~l~----~~d~~~~-~~~~~~~~~------~-~~T~NE----~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~ 475 (559) ..+..++...+ +....++ ..++..... . -+..++ +-+.+|.+|.. .+..+--++.+ .+ T Consensus 420 ~~v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~Pe~ld~~id~d~~~~~~a~~~Gv~p~~---~irt~eev~~~---r~ 493 (536) T protein:vir:21 420 EAVEPTISTGLEAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSG---ILLTEEQKQQK---MA 493 (536) T ss_pred hhccceEEecHHHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCChhh---hcCCHHHHHHH---HH Confidence 33455554322 2111111 112111110 0 012222 22334543321 00000000000 00 Q ss_pred cccccccccccccccccccCCCCCCCCCCCCccccccchhccccccccccccccccccccccccccccccchhhhhhccC Q lcl|NC_012530. 476 IKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQLKNKKNTNSYKQGG 555 (559) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~k~~~~~~~~~~~~ 555 (559) ..+...+..... ...+ .++.+.-. -++........+-|. |-| T Consensus 494 q~~~~~~~~~~a-----~~~~---------------------~~~~~~~~-~~~~~~~~~~~~~g~-----------~~~ 535 (536) T protein:vir:21 494 QQSMQMGMDNGA-----AALA---------------------QGMAAQAT-ASPEAMAAAADSVGL-----------QPG 535 (536) T ss_pred HHHHHHHHHHHH-----HHHH---------------------HHHHHHHh-cChhhHHhhhhcccc-----------CCC Confidence 000000000000 0000 00000000 000000000000000 000 Q ss_pred C Q lcl|NC_012530. 556 S 556 (559) Q Consensus 556 ~ 556 (559) - T Consensus 536 ~ 536 (536) T protein:vir:21 536 I 536 (536) T ss_pred C Confidence 0 No 244 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=87.72 E-value=0.037 Score=28.44 Aligned_cols=428 Identities=14% Similarity=0.076 Sum_probs=155.2 Q ss_pred Ccc--hhhhccccccCCcchHHHHHHHHHHH---HHHhhhhccccccccccccccccccccccccccccCCCCCcccHHH Q lcl|NC_012530. 1 MGI--FDRFRTKFYTDDPNAFFKHIDSKIAN---DTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITD 75 (559) Q Consensus 1 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~ 75 (559) |+= -.|| +.++.-+.++.. ++. .|.-|.....+.... ... T Consensus 1 mk~~a~~r~----------~~l~~~R~~~e~~w~e~~---------~y~lP~~~~~~~~~~-------------~~~--- 45 (542) T protein:vir:78 1 MKGLAQARY----------SAMRADREDFLDMARRCA---------ALTLPYLLTEDGHAS-------------GGR--- 45 (542) T ss_pred ChhHHHHHH----------HHHHHHhhHHHHHHHHHH---------HHhccccCCCCCCcc-------------ccc--- Confidence 331 1222 111111111111 111 133332211110000 000 Q ss_pred HHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeeccccc--c---cChh----HHHHHHHHHHHHHhcCCCCC Q lcl|NC_012530. 76 VLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGD--K---PTKE----QQKKIDYAERYIERMGVDYS 146 (559) Q Consensus 76 ~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~--~---~~~~----~~~~~~~~~~~L~~~~p~~~ 146 (559) +....++. --.|++.+|..+..... ..+.-.|++...+.. + .+++ .+.....+++.+.... T Consensus 46 -~~~~~dst-g~~a~~~Laa~l~~~lt----pp~~~WF~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l---- 115 (542) T protein:vir:78 46 -LQQPYQSL-GSKGVNALSSKLMLSLF----PIQTSFFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQI---- 115 (542) T ss_pred -ccccccch-HHHHHHHHHHHHHHhhc----CCCCccccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHH---- Confidence 01112223 33577777776643211 112234444443321 1 1111 1111222333332211 Q ss_pred CChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccc---------------------- Q lcl|NC_012530. 147 PIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRT---------------------- 204 (559) Q Consensus 147 ~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~---------------------- 204 (559) .+.+|+.-+..++.|+.++||+.+++..+ +...|||. .+.+..|..|.+-. T Consensus 116 -~~snf~~~~~~~~~~L~~~G~a~l~~~~~----~~~~~pl~--~y~v~~d~~G~vd~v~r~~~~t~~ql~~~fg~~~l~ 188 (542) T protein:vir:78 116 -AESSDRVQLTAAMKHLIVTGNVLVFAGKK----TLKVYPLD--RYVIERDGDGNVIEIITRELVDRSLLPAEFQKQSLL 188 (542) T ss_pred -HhcCcHHHHHHHHHHHHhhCeEEEEecCC----CceEEecc--eeEEeeCCCCCeEEEeeeeecCHHHHHHhhccccCc Confidence 12355666677788999999998876443 23445553 34444444443210 Q ss_pred -------------------------------------cceEEEEEecCceee------eecccceEEEecccCCCccCCc Q lcl|NC_012530. 205 -------------------------------------RGKIYRQYIDNKVRG------SFTADEMGMFIRNPRSDILSGG 241 (559) Q Consensus 205 -------------------------------------~~~~y~~~~~~~~~~------~~~~~evi~~~~n~~~~~~~~~ 241 (559) ....|++-.++..+. .|..-=.+..+++.. .... T Consensus 189 ~~~~~~~~~~~~~~~~v~~~v~pr~~~~~~~~~~~~~~~~s~~~e~~g~~v~~~~~e~g~~~~P~i~~Rw~~~---~ge~ 265 (542) T protein:vir:78 189 EGKDSNAVGEDGPKFGVAQGKGGRNDAEVFTCCKLVDGQHRWHQECDGKEIKGSRSSSPLKHSPWLPLRFNVV---DGES 265 (542) T ss_pred hHHHhhccccCCCeEEEEEEeecccCCccccccccCCCeEEEEEEeccccccccccccccccCCceeeeeeec---CCCc Confidence 001112222222110 011111222333322 2346 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc--C Q lcl|NC_012530. 242 YGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT--A 319 (559) Q Consensus 242 ~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~--~ 319 (559) ||.||.+-+.-.+.......+.....-.-...|..++.-++. .... ....|..+ .|+. . T Consensus 266 YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~pp~lv~~~g~-----~~~~----------~~~~~~~g----~iv~g~~ 326 (542) T protein:vir:78 266 YGRGRVEEFFGDLSSLDALTRSLIEGSAAAAKVVFMVSPSAT-----TKPQ----------SLARAGTG----AIIQGRA 326 (542) T ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccc-----cchh----------hcccCCCc----eeecCCc Confidence 999999999999988888888887777777777755422211 1111 11122222 1222 2 Q ss_pred CceeeeeccccchhHH-HHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHH Q lcl|NC_012530. 320 EDAKFVSMTQAEDMQF-QSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIA 398 (559) Q Consensus 320 g~~~~~~ls~~~D~qf-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie 398 (559) +++...++..+.|.+. .+..+.....|.++|-+ ....+... -+..+ +. .+..-....|.|.+.+++ T Consensus 327 ~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~aFl~----~~~~d~~r------vTAtE--V~-~r~~E~~~~LG~v~~rl~ 393 (542) T protein:vir:78 327 EDVSVVQANKGADFRTVQEMIRDLSQRISDAFLI----LNVRQSER------TTATE--VR-EVQMELDRQLSGIYGSLT 393 (542) T ss_pred cceeeeecccccchhHHHHHHHHHHHHHHHHhcc----cccCCccc------ccHHH--HH-HHHHHHHHHhhHHHHHHH Confidence 3455555555667664 45667777888888842 12111111 11111 11 112233345666666666 Q ss_pred HHHHhh-------------ccccccCccceeeecchhhhhH-----HHHHHHHHHHHc--C------CCCH----HHHHH Q lcl|NC_012530. 399 KNLTNG-------------IIRQILGDNYMLEFVGGDTRSQ-----QDKLKSVQLELQ--T------ATTV----NDYRE 448 (559) Q Consensus 399 ~~ln~~-------------L~~~~~~~~~~~~f~~l~~~d~-----~~~~~~~~~~~~--~------~~T~----NE~R~ 448 (559) ++|-.- +|++....-+++++...+..-. .....++..... + -+.. +.+.+ T Consensus 394 ~E~L~Pli~R~~~il~r~g~lP~~p~~lv~~~~~s~La~~~r~~~~~~l~~~~~~i~~~~~p~~l~~~id~d~~~~~~a~ 473 (542) T protein:vir:78 394 VELLTPYLNRKLHLMQRSKQLPSLPKGLVMPTVVAGLGGVGRGEDRAALIEFMQTVGQAMGPEALQQFIDPTEFLKRLAA 473 (542) T ss_pred HHHHHHHHHHHHHHHHhcCCCCCCchhceeeeeechHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHH Confidence 555432 3333323346677664432111 111112111101 0 0122 22334 Q ss_pred HhCCCCCCCCCEeeccceeccccccccccccccccccccc-ccccccCCCCCCCCCCCCccccccchhcc-cccccccc Q lcl|NC_012530. 449 KQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRL-TQLESALQNPSGTPPTLPPSSSNSFQQNQ-EGYTGKDA 525 (559) Q Consensus 449 ~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 525 (559) .+|.|+.. .+..+--+.... +..+ .+..+.+. .+.......+.... ...-...+. +++. ...+|.+- T Consensus 474 ~~Gvp~~~---i~~s~e~~~~~~---~q~q--~~~~~~al~~~a~~~a~~~~~~~-~~~~~~a~~-~~~~~~~~~~~~~ 542 (542) T protein:vir:78 474 ASGIDTLN---LVKSPETMANEA---QQAQ--QQQMTASLMGQAGQLAKSPIGEK-MMQQINAPG-QEAPAGPQTGEDL 542 (542) T ss_pred HcCCCHhh---ccCCHHHHHHHH---HHHH--HHHHHHHHHHhhhhccccccccc-hhhhcCCCC-cCCCCCCcccccC Confidence 45655310 000000000000 0000 00000000 00000000000000 000000000 0000 00011100 No 245 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=82.82 E-value=0.074 Score=26.80 Aligned_cols=447 Identities=12% Similarity=0.062 Sum_probs=161.3 Q ss_pred hhhhccccccCCcchHHHHHHHHH--HHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHHh Q lcl|NC_012530. 4 FDRFRTKFYTDDPNAFFKHIDSKI--ANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYS 81 (559) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~ 81 (559) .+.=|+.+.-+++..++.+|...+ .+...+.- -.|.-|.....+-.. ... . +.... T Consensus 1 m~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~-----~~~~lP~~~~~~~~~----------~~~---~----~~~~~ 58 (536) T protein:vir:10 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNC-----AQYTIPSLFPKDSDN----------AST---D----YQTPW 58 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHH-----HHHhcccccCCCCCc----------ccc---c----ccccc Confidence 233344555555555555554432 11111000 012333221111000 000 0 01112 Q ss_pred hChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeeccccc--ccC--h----hHHHHHHHHHHHHHhcCCCCCCChhhHH Q lcl|NC_012530. 82 MNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGD--KPT--K----EQQKKIDYAERYIERMGVDYSPIRDDFT 153 (559) Q Consensus 82 ~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~--~~~--~----~~~~~~~~~~~~L~~~~p~~~~~~~~~~ 153 (559) + ++--.|++.+|..+.....+ .+-.|++...+.. +.. + +.++-...+++.+.... .+.+|+ T Consensus 59 d-st~~~a~~~Laa~l~~~ltP-----~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-----~~snf~ 127 (536) T protein:vir:10 59 Q-AVGARGLNNLASKLMLALFP-----MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYI-----ESNSYR 127 (536) T ss_pred c-ccHHHHHHHHHHHHHhhhcC-----CCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHH-----HhcCcH Confidence 2 33345777777666432111 1224555444332 111 1 11111222233332211 123556 Q ss_pred HHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccc----------------------------- Q lcl|NC_012530. 154 SFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRT----------------------------- 204 (559) Q Consensus 154 ~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~----------------------------- 204 (559) .-+...+.|+.++||+..++..+..+.+..+..++..++.+..|..|.+-. T Consensus 128 ~~~~~~~~~L~~~G~a~ly~~e~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~ 207 (536) T protein:vir:10 128 VTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKA 207 (536) T ss_pred HHHHHHHHHHHhHCcEeEEEeeCCCCceeeEEEEEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccCc Confidence 666677889999999999987765544444433334566666666664310 Q ss_pred -cce-EE--------------EEEecCceeee------ecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHH Q lcl|NC_012530. 205 -RGK-IY--------------RQYIDNKVRGS------FTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTEL 262 (559) Q Consensus 205 -~~~-~y--------------~~~~~~~~~~~------~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~ 262 (559) ... .| ++..++..+.. |..-=.+..+++.. ....||.||.+-+...+.......+ T Consensus 208 ~~~v~v~~~V~~~~~~~~~~~~~e~~g~~v~~~~g~~~f~~~P~i~~Rw~~~---~ge~YGrgp~~~~l~D~k~L~~l~~ 284 (536) T protein:vir:10 208 DETIDVYTHIYLDEASGEYLRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRL---DGESYGRSYIEEYLGDLRSLENLQE 284 (536) T ss_pred ccceEEEEEEEEecCCCcEEEEEeecCccccccccccccccCCceeeeeeec---CCCccccchHHHHHHHHHHHHHHHH Confidence 000 00 00111111100 00001123333322 2346999999998888888777766 Q ss_pred HHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCccccccccccc--CCceeeeeccccchhHH-HHHH Q lcl|NC_012530. 263 FNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMIT--AEDAKFVSMTQAEDMQF-QSWL 339 (559) Q Consensus 263 ~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~--~g~~~~~~ls~~~D~qf-~e~~ 339 (559) .....-.-...|..++.-. + -..+. ....+..+ .++. .+++...++....|.+. .+.. T Consensus 285 ~~l~~~~~a~~~~~lv~p~-g----~~~~~----------~~~~~~~g----~~v~g~~~~v~~~~~~~~~~~~~~~~~i 345 (536) T protein:vir:10 285 AIVKMSMISSKVIGLVNPA-G----ITQPR----------RLTKAQTG----DFVTGRPEDISFLQLEKQADFTVAKAVS 345 (536) T ss_pred HHHHHHHHHhcCCcccCcc-c----ccchh----------hhccCCCc----ceecCCcccceeeeccccccchHHHHHH Confidence 6666544445554444211 1 11111 11111111 1222 23455666666667663 4567 Q ss_pred HHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHH-------HHHHHHHHhhHHHHHHHHHH-HhhccccccC Q lcl|NC_012530. 340 NYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNK-------IDASKSKGLMPLLDMIAKNL-TNGIIRQILG 411 (559) Q Consensus 340 ~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~-------~~~~~~~~l~P~~~~ie~~l-n~~L~~~~~~ 411 (559) +.....|.++|-+.. +...+...-+..+ .....++. ...+-...|.|++.+.-..+ ...+|++... T Consensus 346 ~~~~~rI~~af~~~~--l~~~~~~r~TAtE----V~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~ 419 (536) T protein:vir:10 346 DAIEARLSFAFMLNS--AVQRTGERVTAEE----IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPK 419 (536) T ss_pred HHHHHHHHHHHhhhh--cccCCCCCccHHH----HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCh Confidence 778888989995542 2222221111111 11111111 11233344445554433333 2234443322 Q ss_pred ccceeeecchh----hhhHHHH-HHHHHHHHc------C-CCCHHH----HHHHhCCCCCCCCCEeeccceecccccc-c Q lcl|NC_012530. 412 DNYMLEFVGGD----TRSQQDK-LKSVQLELQ------T-ATTVND----YREKQGLPKIAGGDIILSAVYIQRLGQQ-E 474 (559) Q Consensus 412 ~~~~~~f~~l~----~~d~~~~-~~~~~~~~~------~-~~T~NE----~R~~~gl~pi~gGD~~~~~~~~~~l~~~-~ 474 (559) ..+..++...+ +....++ ..++..+.. . -+..++ +-+.+|.+|.. .+..+--++.+.+. . T Consensus 420 ~~v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gv~p~~---~irt~eev~~~r~q~~ 496 (536) T protein:vir:10 420 EAVEPTISTGLEAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSG---ILLTEEQKQQKMAQQS 496 (536) T ss_pred hhccceEEecHHHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCCchh---hcCCHHHHHHHHHHHH Confidence 33455554322 2111111 112211110 0 122222 22344553321 11000000000000 0 Q ss_pred ccccccccccccccccccccCCCCCCCCCCCCccccccchhccccc Q lcl|NC_012530. 475 QIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGY 520 (559) Q Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (559) +..+...+.......+......+ + ......-.....+.++ T Consensus 497 ~~~~~~~~a~~~~~~~~~~~~~~-----~-~~~~~~~~~~g~~~~~ 536 (536) T protein:vir:10 497 MQMGMDNGAAALAQGMAAQATAS-----P-EAMAAAADSVGLQPGI 536 (536) T ss_pred HHHHHHHHHHHHHHHHHHHHhcC-----c-hhHHhhhhccccCCCC Confidence 00000000000000000000000 0 0000000000011111 No 246 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=82.65 E-value=0.075 Score=26.75 Aligned_cols=433 Identities=11% Similarity=0.089 Sum_probs=164.3 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHH---HHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIAN---DTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVL 77 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~ 77 (559) =.|..|| +.++.-+.++.. ++.. |.-|..+.. +......+... +. .. T Consensus 4 ~~l~~r~----------~~l~~~R~~~e~~w~e~~~---------~~lP~~~~~-~~~~~~~~~~~-~~-----~~---- 53 (547) T protein:vir:10 4 SKIVKRL----------DFLKTDRKNVEQIWDCIRK---------YIMPMRSDF-FSDLRSEGSIN-WN-----QN---- 53 (547) T ss_pred HHHHHHH----------HHHHHHhhHHHHHHHHHHH---------Hhccccccc-ccCCCCCcccc-cc-----cc---- Confidence 1223333 112222222211 2222 222222111 00000000000 00 00 Q ss_pred HHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccc-cChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHH Q lcl|NC_012530. 78 RQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDK-PTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFL 156 (559) Q Consensus 78 ~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~-~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~ 156 (559) .... .++-..|++.+|..+..... ..+...|++...|..- .....++-...+++.+.... .+.+|+.-+ T Consensus 54 ~~i~-dst~~~a~~~Las~L~~~lt----Pp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l-----~~snf~~~~ 123 (547) T protein:vir:10 54 REVF-DSTAGDGLETLSSSLHGSLT----SPATKWFELAFRDKELNSDDECRKWLENATHDVYSAL-----QDSNFNLEA 123 (547) T ss_pred cccc-cchHHHHHHHHHHHHHHhhc----CCCCcccccccCCccccchHHHHHHHHHHHHHHHHHH-----HhcCcHHHH Confidence 0112 23444577777766643211 1122344444433321 11222233333444433221 123455556 Q ss_pred HHHHHHHHHcCCcceEEEECC-CCcEEEEEEecCceEEEEecCccccccc------------------------------ Q lcl|NC_012530. 157 RKLVRDTYTYDQVNYENTYDS-NGRLSHTRMVDPTTIYFANDEHGHRRTR------------------------------ 205 (559) Q Consensus 157 ~~~v~d~ll~Gna~~~i~rd~-~G~~~~L~~l~p~~V~~~~~~~g~~~~~------------------------------ 205 (559) ..++.|+.++||+..++..|. .+..+.+..++..++.+..+..|.+-.- T Consensus 124 ~~~~~~L~~~G~a~l~~~~d~~~~~~~r~~~~pl~~~~v~~d~~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~ 203 (547) T protein:vir:10 124 NETYIDLCGYGNAIMVEEEDEDEEGSVVFQSSPIQDSYFEEDSRGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKE 203 (547) T ss_pred HHHHHHHHhHCcEeEEeccCCCCCCceeEEEeecceEEEeeCCCcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhc Confidence 667889999999999998764 2334556667777777777777753100 Q ss_pred c--e---EE--EEEe---c-Cce--------------e--eeeccc---c-----------eEEEecccCCCccCCcccc Q lcl|NC_012530. 206 G--K---IY--RQYI---D-NKV--------------R--GSFTAD---E-----------MGMFIRNPRSDILSGGYGL 244 (559) Q Consensus 206 ~--~---~y--~~~~---~-~~~--------------~--~~~~~~---e-----------vi~~~~n~~~~~~~~~~G~ 244 (559) . . .+ +..+ . ... . ..+..+ . .+..+++.. ....||. T Consensus 204 ~~~~~~~~~~v~~~v~~~~~~~~~~~~~~~~~~~~~p~~s~~~e~~~~~~~l~esg~~e~P~~~~Rw~~~---~ge~YGr 280 (547) T protein:vir:10 204 ASNQAALKQEVVMCVFTRYDKKQNRNAGTVLAPTERPFGKKWILKEGAVQLGEEGGYYEMPAYAIRWRKS---AGSQWGF 280 (547) T ss_pred CCCcccceEEEEEEEeeccCCCCCccccceeeccccceeEEEEEecCceeeeecCCcccCCeeeeeeeec---CCccccc Confidence 0 0 00 0000 0 000 0 000000 0 122222222 2346999 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceee Q lcl|NC_012530. 245 SELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKF 324 (559) Q Consensus 245 Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~ 324 (559) ||.+.+...+.......+.......-...|..++.-++ ...+ + +..- |. .++.++.-.. T Consensus 281 gp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g-----~~~~-----~-----~~~p-----gg-~~~~~~~~~v 339 (547) T protein:vir:10 281 GPSHLALPDVLTANRYVELVLRSSEKVIDPAIMVTERG-----LISD-----I-----DLGA-----SG-LTVVRDMESM 339 (547) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeccccc-----cccc-----c-----eecC-----Ce-eeecCCcccc Confidence 99999999888888887777766666667765542111 1110 1 1111 12 2222333355 Q ss_pred eeccccchhHH-HHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHH-------HHHHHHHhhHHHHH Q lcl|NC_012530. 325 VSMTQAEDMQF-QSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKI-------DASKSKGLMPLLDM 396 (559) Q Consensus 325 ~~ls~~~D~qf-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~-------~~~~~~~l~P~~~~ 396 (559) +|+....|.+. .+..+.....|-++|-+....+- +...-|..+ .....++.. ..+....|.|++.+ T Consensus 340 ~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~--~~~~~TAtE----V~~r~~E~~~~LG~v~~rl~~E~l~Pli~r 413 (547) T protein:vir:10 340 KPFESRARFDVSSIQLTDLRSAVRRIYYVDQLQMK--DSPAMTATE----VQVRYELMQRLLGPTLGRLENDFLSPMIQR 413 (547) T ss_pred eeeecccchHHHHHHHHHHHHHHHHHhhhhhhhcC--CCccccHHH----HHHHHHHHHHHhhHHHHHHHHHHHHHHHHH Confidence 66665566654 45677778889999987665432 222111111 111112211 12333445555544 Q ss_pred HHHHHH-hhccccc-------cCccceeeecchh-hhhHHHHHH----HHHHH--HcC----C---CCHHH----HHHHh Q lcl|NC_012530. 397 IAKNLT-NGIIRQI-------LGDNYMLEFVGGD-TRSQQDKLK----SVQLE--LQT----A---TTVND----YREKQ 450 (559) Q Consensus 397 ie~~ln-~~L~~~~-------~~~~~~~~f~~l~-~~d~~~~~~----~~~~~--~~~----~---~T~NE----~R~~~ 450 (559) .=..+. ..+|++. .+..+.+++...+ +........ ++... +.. + +..++ +-+.+ T Consensus 414 ~~~il~r~g~lP~~p~~l~~~~~~~~~v~~is~Laraq~~~~~~~i~~~~~~v~~laq~~P~vld~id~d~~~~~~a~~~ 493 (547) T protein:vir:10 414 TFNIRFRAGKLGELPSKLLESGKAAMDIVYTGPLSRAQKIDQAASIERWAGSTAQLAEINPEVLDIPDWDEMVRMLGSLL 493 (547) T ss_pred HHHHHHhcCCCCCCchhhhccCcceEEEEeccHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhhcCCHHHHHHHHHHHh Confidence 322232 2334332 1223455554333 332222221 11111 001 1 22222 33455 Q ss_pred CCCCCCCCCEeeccceecccccccccccccccc-cccccccccccCCCCCCCCCCCCccccccchhcccccccccccccc Q lcl|NC_012530. 451 GLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQR-QQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSG 529 (559) Q Consensus 451 gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 529 (559) |.|+ +.+..+--++.+-+ ..+...+. .+.+.. ...+ ...+ T Consensus 494 Gvp~----~~irs~eev~~~r~---qr~~~~q~~~qaa~~---~~~g----------------~~m~------------- 534 (547) T protein:vir:10 494 GAPQ----TLMRPKAKVTSIRK---NRSQTQQKAEQAAIA---EAEG----------------NAME------------- 534 (547) T ss_pred CCCh----hccCCHHHHHHHHH---HHHHHHHHHHHHHHH---HHHH----------------HHHH------------- Confidence 5543 11111110000000 00000000 000000 0000 0000 Q ss_pred cccccccccccccccc Q lcl|NC_012530. 530 KDNQQGVGKDGQLKNK 545 (559) Q Consensus 530 ~~~~~~~~~~~~~k~~ 545 (559) ..+.+..+-+.+| T Consensus 535 ---~~~~~~a~~~~~~ 547 (547) T protein:vir:10 535 ---AQGKGQAALKENQ 547 (547) T ss_pred ---hhcCcccchhccC Confidence 1111111111111 No 247 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=81.52 E-value=0.085 Score=26.46 Aligned_cols=443 Identities=12% Similarity=-0.015 Sum_probs=162.2 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+= | ..-+-+++..++.+|...+-.-...-.. =-.|.-|..++... .+.......-.. .... T Consensus 1 m~~-d---~~~~~~~l~~r~~~l~~~R~~~e~~w~e---~~~~~lP~~~~~~~----------~~~~~~~~~~~~-~~~~ 62 (549) T protein:vir:10 1 MTN-D---DAKILQALNADHGRMKEKRQSYEAVWND---VIDYLMPRLDKFGQ----------LPRPDSEKGRER-SQKM 62 (549) T ss_pred CCc-c---hHHHHHHHHHHHHHHHHHhhhHHHHHHH---HHHHhccccccccc----------cCCCCCCccccc-cccc Confidence 210 0 0000011112222222221110000000 00123332221110 000000000000 0111 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccc-cChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDK-PTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKL 159 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~-~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~ 159 (559) .+ ++--.|++.+|..+..... ....-.|++...+..- .....++-...+++.+.... +-.+.+|+.-+..+ T Consensus 63 ~d-stg~~a~~~LAs~l~~~lt----pp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~---~~~~snf~~~~~~~ 134 (549) T protein:vir:10 63 FD-STAPLALRNFVAAMDSMIT----PATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAAR---YRWQGGFVTQMGAT 134 (549) T ss_pred cc-chHHHHHHHHHHHHHhhcc----CCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHH---hhhhcChHHHHHHH Confidence 22 3334577777766643211 1122234444433211 01111222333333332210 11123555566667 Q ss_pred HHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccccceEEEE---------------------------- Q lcl|NC_012530. 160 VRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTRGKIYRQ---------------------------- 211 (559) Q Consensus 160 v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~---------------------------- 211 (559) +.|+.++||+.+++..+. ++.+.+..++..++.+..|..|.+-. .|.. T Consensus 135 ~~~L~~~Gta~l~~~~~~-~~~~~f~~~pl~~~~v~~d~~G~vd~---i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~ 210 (549) T protein:vir:10 135 YQSIGLFGPGALMIEHDV-GKGIVYRNVPMQRLWFAENNSGLIDK---THVQWELTLRQAAQRFGRENLSPSMQSTLEKD 210 (549) T ss_pred HHHHHhhcceeeEEeecC-CCeeEEEEEEcCeEEEeeCCCCCeEE---EEEEeecCHHHHHHhcCcccCCHHHHHHhhcC Confidence 889999999999988764 34556666666777777777775311 1110 Q ss_pred ---------EecC---c--------------eeeeecccce-----------EEEecccCCCccCCcccccHHHHHHHHH Q lcl|NC_012530. 212 ---------YIDN---K--------------VRGSFTADEM-----------GMFIRNPRSDILSGGYGLSELEMGLREF 254 (559) Q Consensus 212 ---------~~~~---~--------------~~~~~~~~ev-----------i~~~~n~~~~~~~~~~G~Spl~~~~~~i 254 (559) .+.. . .......+.+ +..+++. .....||.||.+.+...+ T Consensus 211 ~~~~~~v~~~V~pr~~~~~~~~~~~~~pf~sv~~e~~~~~il~esg~~e~P~~~~Rw~~---~~ge~YGrgp~~~~l~D~ 287 (549) T protein:vir:10 211 PEKSAIFYHAVEPRADRDPRKLDGRNMQFASYWLDEGRDRIVQNSGFRTFPFAIGRFYV---GTDDVYGGSPAYDAMPDV 287 (549) T ss_pred CCceEEEEEEeecCCCCCccccccccCceEEEEEEecCCEeeccCCcccCCcceeeeee---cCCCccccchHHHHHHHH Confidence 0000 0 0000000111 1111111 122369999999999988 Q ss_pred HHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc--cCCceeeeeccccch Q lcl|NC_012530. 255 ISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI--TAEDAKFVSMTQAED 332 (559) Q Consensus 255 ~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl--~~g~~~~~~ls~~~D 332 (559) .......+.....-.-...|..++.-.+. ..+. +...|. .+++. ..+...++|+....| T Consensus 288 k~L~~l~~~~l~~~~~~~~p~~~v~~~g~-----~~~~----------~l~pgg----~~~~~~~~~~~~~~~pl~~~~~ 348 (549) T protein:vir:10 288 RMANDMAKTNIRGAQKLVDPPLLANEDGV-----LDGF----------DLRSGA----LNWGGLNDKGEEMVKPLLTGKQ 348 (549) T ss_pred HHHHHHHHHHHHHHHHHhcCceeeccccc-----cccc----------eeccCC----ccccccCCCCccceeeeccccc Confidence 88888888777777777777666522111 1110 112222 22332 234456777766666 Q ss_pred hHHH-HHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHH--------- Q lcl|NC_012530. 333 MQFQ-SWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLT--------- 402 (559) Q Consensus 333 ~qf~-e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln--------- 402 (559) .+.. +..+.....|.++|-+....+- .+...-+..+ .....+ -....|.|...+++++|- T Consensus 349 ~~~~~~~i~~~~~rI~~af~~d~~~~~-~~~~~~TAtE----V~~r~~-----E~~~~LGpv~~rl~~E~l~Pli~R~~~ 418 (549) T protein:vir:10 349 AQIGIEFAQDTRQTINQWFYVTLFQIL-VDSGDMTATE----VLQRAQ-----EKGVLLAPTLGRTQSELLGPMIAREVD 418 (549) T ss_pred hhHHHHHHHHHHHHHHHHHhhhhhhhh-cCCCCccHHH----HHHHHH-----HHHHHhhHHHHHHHHHHHHHHHHHHHH Confidence 6643 4467778889999987764221 1111111111 111111 122344555555544432 Q ss_pred ----hhcccccc------Cccceeeecchh-hhhHHHHHH----HHHHH--HcC-------CCC----HHHHHHHhCCCC Q lcl|NC_012530. 403 ----NGIIRQIL------GDNYMLEFVGGD-TRSQQDKLK----SVQLE--LQT-------ATT----VNDYREKQGLPK 454 (559) Q Consensus 403 ----~~L~~~~~------~~~~~~~f~~l~-~~d~~~~~~----~~~~~--~~~-------~~T----~NE~R~~~gl~p 454 (559) ..+|++.. +..+.+++...+ +......+. ++... +.. -+. .+++-+.+|.|+ T Consensus 419 il~r~g~lP~~p~~l~~~~~~~~i~yis~La~aq~~~~~~~i~~~~~~~~~laq~~Pe~ld~id~d~~~~~~a~~~Gvp~ 498 (549) T protein:vir:10 419 ILAEAGQLPDMPQELIDAGADVDVEYDSPLNKAMRAGEGAAILQWLQQLGIVSQFDPAAAKVPNGARIARLLADYGGVPV 498 (549) T ss_pred HHHhcCCCCCCChhhhcCCceeEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhcCCHHHHHHHHHHhcCCCc Confidence 22344321 223556665433 322222221 11110 000 022 223334556654 Q ss_pred CCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhccccccccccccccccccc Q lcl|NC_012530. 455 IAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQ 534 (559) Q Consensus 455 i~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 534 (559) ..+..+--++.+-+..+..+-..+..... ..... ..++- ++.++ T Consensus 499 ----~~irs~eev~~~r~~~~~qqq~~~~~~~a-~~a~~--------------------~a~~~-----------~~~~t 542 (549) T protein:vir:10 499 ----EAMSTDEELQAQQAAEAQAAQMQQMLAAA-PVAAG--------------------AIKDL-----------SDAQT 542 (549) T ss_pred ----cccCCHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH--------------------HHHhh-----------hhhcC Confidence 11111110010000000000000000000 00000 00000 00000 Q ss_pred ccccccc Q lcl|NC_012530. 535 GVGKDGQ 541 (559) Q Consensus 535 ~~~~~~~ 541 (559) ..+.+-- T Consensus 543 a~~~~~~ 549 (549) T protein:vir:10 543 AAQTARV 549 (549) T ss_pred CCcccCC Confidence 0000000 No 248 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=72.75 E-value=0.18 Score=24.68 Aligned_cols=449 Identities=12% Similarity=0.053 Sum_probs=164.0 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |-=- -+.++.-+.+..++.+|...+-.-...-... -.|.-|.+...+ .. ..+. .. ... T Consensus 1 ~~~~--~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~---~~y~lP~~~~~~----~~---------~~~~---~~-~~~ 58 (543) T protein:vir:88 1 MAET--KREGLAEEGAKAVYERLKNDRVPYETRAENC---AKVTIPSLFPKD----SD---------NSST---DY-TTP 58 (543) T ss_pred Cccc--ccCcchHHHHHHHHHHHHHHHhHHHHHHHHH---HHHhccccCCCC----CC---------cccc---cc-ccc Confidence 1100 0233333333334444443221111000000 012333211100 00 0000 00 011 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeeccccc--cc--Ch----hHHHHHHHHHHHHHhcCCCCCCChhhH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGD--KP--TK----EQQKKIDYAERYIERMGVDYSPIRDDF 152 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~--~~--~~----~~~~~~~~~~~~L~~~~p~~~~~~~~~ 152 (559) .+ ++--.|++.+|..+.....+ ..-.|++...+.. +. ++ ..+.-...+++.+.... .+.+| T Consensus 59 ~d-st~~~a~~~Laa~l~~~ltP-----~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~-----~~snf 127 (543) T protein:vir:88 59 WQ-AVGARGLNNLSAKVMLALFP-----LQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYM-----EANSY 127 (543) T ss_pred cc-chHHHHHHHHHHHHHHhhcC-----CCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHH-----HhcCc Confidence 22 33345777777776543221 1224555444322 10 11 11111222222222211 12456 Q ss_pred HHHHHHHHHHHHHcCCcceEEEECCCC----cEEEEEEecCceEEEEecCcccccc------------------------ Q lcl|NC_012530. 153 TSFLRKLVRDTYTYDQVNYENTYDSNG----RLSHTRMVDPTTIYFANDEHGHRRT------------------------ 204 (559) Q Consensus 153 ~~f~~~~v~d~ll~Gna~~~i~rd~~G----~~~~L~~l~p~~V~~~~~~~g~~~~------------------------ 204 (559) +.-+..++.|+.++||+..++..+... .+...||| ....+..|..|.+.. T Consensus 128 ~~~~~~~~~~L~~~G~a~ly~~~~~~~~~~~~~~~~~pl--~~y~v~~d~~G~v~~i~r~~~~~~~~l~~~~~~~v~~~~ 205 (543) T protein:vir:88 128 RVTLFELIRQLALAGTALIYLPPPDASSNSYNPMKLYTL--HNHVVQRDAFGNVLQIVTLDKVAYAALPEDVRNSLSGGQ 205 (543) T ss_pred HHHHHHHHHHHHhhCceeeeeccCccccceecceEEeEc--ceEEEeeCCCCCeeeeeeeeeccHHHHhHHhhHHHHHHh Confidence 666667788999999999988765421 12444555 334444555553210 Q ss_pred -----c-ceEEEEE--ecC-ceee--------e-------ecccc--eEEEecccCCCccCCcccccHHHHHHHHHHHHH Q lcl|NC_012530. 205 -----R-GKIYRQY--IDN-KVRG--------S-------FTADE--MGMFIRNPRSDILSGGYGLSELEMGLREFISHE 258 (559) Q Consensus 205 -----~-~~~y~~~--~~~-~~~~--------~-------~~~~e--vi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~ 258 (559) . ...|..+ ..+ .... . +..++ .+..+++.. ....||.||.+-+...+.... T Consensus 206 ~~~p~~~~~v~~~V~pr~~~~~~~~~~~~~~~~v~~~~~~~~~~e~P~i~~Rw~~~---~ge~YGrgp~~~~l~D~k~L~ 282 (543) T protein:vir:88 206 EYKPEQELEVYTHIYIDDESGDFLSYQEIEGVEVDGSDGQYPQDALPWIAVRWTKR---DGEHYGRSHVEEYLGDLNSLE 282 (543) T ss_pred hcCCccceEEEEEEEeecCCCcccccccccCeeeecCCCccccccCCceeeeeeec---CCCccccchHHHHHHHHHHHH Confidence 0 0011100 001 0000 0 00001 122222222 234699999999999998888 Q ss_pred HHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc--cCCceeeeeccccchhHH- Q lcl|NC_012530. 259 NTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI--TAEDAKFVSMTQAEDMQF- 335 (559) Q Consensus 259 ~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl--~~g~~~~~~ls~~~D~qf- 335 (559) ...+.....-.-...|..++.-++. ... .....|..+ .++ ..+++...++....|.+. T Consensus 283 ~l~~~~l~~~~~~~~pp~~v~~~g~-----~~~----------~~~~~~~~g----~~v~g~~~~v~~~~~~~~~~~~~~ 343 (543) T protein:vir:88 283 SLNEAMIKFAMISSKVVGLVNPNGI-----TQV----------RRLVKAQTG----DFVAGRKADIEFLQLEKTADFTVA 343 (543) T ss_pred HHHHHHHHHHHHHhcCceeeccccc-----cch----------hhcccCCCc----eeecCCCCcceeeecccccchhHH Confidence 8888787777777777766532211 111 111222222 222 234566666666567664 Q ss_pred HHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHh------------ Q lcl|NC_012530. 336 QSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTN------------ 403 (559) Q Consensus 336 ~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~------------ 403 (559) .+..+.....|.++|-+.. +...+...-+. .+ +. .+..-....|.|.+.+++++|-. T Consensus 344 ~~~i~~~~~rI~~af~~~~--~~~~~~~r~TA------tE--V~-~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r 412 (543) T protein:vir:88 344 KSVADAIEARLSYVFMLNS--AVQRSGERVTA------EE--IR-YVASELEDTLGGVYSILSQELQLPIVRVLLNQLQA 412 (543) T ss_pred HHHHHHHHHHHHHHHhhhh--hccCCCCcccH------HH--HH-HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHh Confidence 4567778888988996652 22222221111 11 11 11122233455555555544432 Q ss_pred -hccccccCccceeeecc----hhhhhHHHHH-HHHHHH--Hc--CC---CCHHH----HHHHhCCCCCCCCCEeeccce Q lcl|NC_012530. 404 -GIIRQILGDNYMLEFVG----GDTRSQQDKL-KSVQLE--LQ--TA---TTVND----YREKQGLPKIAGGDIILSAVY 466 (559) Q Consensus 404 -~L~~~~~~~~~~~~f~~----l~~~d~~~~~-~~~~~~--~~--~~---~T~NE----~R~~~gl~pi~gGD~~~~~~~ 466 (559) .+|++.....+.+++.. +.+....... .+.... +. .. +..++ +.+.+|.+|-. .+..+-- T Consensus 413 ~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~v~~~~~p~vld~id~d~~~~~~a~~~Gv~~~~---i~r~~~e 489 (543) T protein:vir:88 413 TQQIPNLPQEAVEPTVTTGAEALGRGQDLDKLTQFLNAVATVSQLNGDPDLNVNNIKLRLANAIGIDTAG---LLLTEAE 489 (543) T ss_pred cCCCCCCchhceeeeEEecHHHHHHHHHHHHHHHHHHHHHhccchhhhccCCHHHHHHHHHHHhCCChhh---hcCCHHH Confidence 33433333345566542 2222111111 111111 10 01 22233 33445664421 1110000 Q ss_pred ecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhcccccccccccccccccccc Q lcl|NC_012530. 467 IQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQG 535 (559) Q Consensus 467 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 535 (559) ++.+.+ . .+.++..........+.. ..+.. .+.+..+ ++.......+|-...+. T Consensus 490 ~~~~~~---q----~~~q~~~~~~~~~~~~~~-----~~~~~--~~~~~~~-~~~~~~~~~~~p~~~~~ 543 (543) T protein:vir:88 490 KAQAQS---Q----EMLKQGGLNAAAGIGSGV-----AAQAT--ASPEAME-SAMDTAGVQPGPIATQV 543 (543) T ss_pred HHHHHH---H----HHHHHHHHHHHHHHhhch-----hhhhc--cChHHHH-HHhhhcCCCCCCCCCCC Confidence 011100 0 000000000000000000 00000 0001101 11100001111111111 No 249 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=68.38 E-value=0.24 Score=24.00 Aligned_cols=439 Identities=13% Similarity=0.063 Sum_probs=154.8 Q ss_pred Cc--chhhhccccccCCcchHHHHHHHHHHH---HHHhhhhccccccccccccccccccccccccccccCCCCCcccHHH Q lcl|NC_012530. 1 MG--IFDRFRTKFYTDDPNAFFKHIDSKIAN---DTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITD 75 (559) Q Consensus 1 ~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~ 75 (559) |+ +..|| +.++.-+.++.. ++.. |.-|.....+-.. +.. .. T Consensus 1 m~~~~~~r~----------~~l~~~R~~~e~~w~e~~~---------y~lP~~~~~~~~~----------~~~---~~-- 46 (555) T protein:vir:17 1 MKHSAQAKY----------MMLRADREDYLDSGRQSAR---------LTLPYILTDEGHV----------QGG---YL-- 46 (555) T ss_pred ChhHHHHHH----------HHHHHHhhHHHHHHHHHHH---------HhcccccCCCCCc----------ccc---cc-- Confidence 44 33443 111221222211 2222 3333221111000 000 00 Q ss_pred HHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeeccccc--c--cChhHHHHH----HHHHHHHHhcCCCCCC Q lcl|NC_012530. 76 VLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGD--K--PTKEQQKKI----DYAERYIERMGVDYSP 147 (559) Q Consensus 76 ~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~--~--~~~~~~~~~----~~~~~~L~~~~p~~~~ 147 (559) ....+ ++--.|++.+|..+..... ..+.-.|++...+.. + .+....... ..+++.+.... T Consensus 47 --~~~~d-st~~~a~~~Laa~l~~~lt----pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l----- 114 (555) T protein:vir:17 47 --PTPWQ-SVGSKGVNVLASKLMLSLF----PVNTSFFKLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDI----- 114 (555) T ss_pred --ccccc-ccHHHHHHHHHHHHHHhhc----CCCCcccccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHH----- Confidence 01122 2233577777776643211 112334555544432 1 111211111 22333333211 Q ss_pred ChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCccccccc---------------------- Q lcl|NC_012530. 148 IRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTR---------------------- 205 (559) Q Consensus 148 ~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~---------------------- 205 (559) .+.+|+.-+..++.|+.++||+..++-.+ +..+|||. .+.+..|..|.+-.- T Consensus 115 ~~snf~~~~~~~~~~L~~~G~a~ly~~~~----~~~~~pl~--~y~v~~d~~G~vd~v~rk~~~t~~ql~~~fg~~~l~~ 188 (555) T protein:vir:17 115 AESSDRVHLEMAMKHLIVTGNALLYQGKK----NLKLYPLD--RFVVSRDGEGNVMEIVTEEQIDRSLLPEEFQKVGGLE 188 (555) T ss_pred HhcCcHHHHHHHHHHHHhHCeEEEEecCC----ceeEEEcC--eEEEeeCCCcCeeEEEeeeeecHHHHHHHhhhccccc Confidence 12355666667788999999998776433 24556663 344455555532100 Q ss_pred ------------------------------------------ceEEEEEecCceee----eeccc--ceEEEecccCCCc Q lcl|NC_012530. 206 ------------------------------------------GKIYRQYIDNKVRG----SFTAD--EMGMFIRNPRSDI 237 (559) Q Consensus 206 ------------------------------------------~~~y~~~~~~~~~~----~~~~~--evi~~~~n~~~~~ 237 (559) ...|++-.++.... ..... =.+..+++..+ T Consensus 189 ~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~v~t~~~~~~~~~~~~~e~~~~~v~~~l~e~g~~e~P~i~~Rw~~~~-- 266 (555) T protein:vir:17 189 GAPDSNAVGEDGPKMGVTAPGGRDKGKSNDALVYTYVCRKDGQVKWHQECDGKVIPGSNSSAPYTHNPWIPLRFNIVD-- 266 (555) T ss_pred hhhhhhhccccchhhhhhhhcccccCCCcceeEeecccccCCeeEEEEecCceeccccccccCcccCCeeeeeeeecC-- Confidence 00011111111100 00000 11333333322 Q ss_pred cCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccc Q lcl|NC_012530. 238 LSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMI 317 (559) Q Consensus 238 ~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl 317 (559) ...||.||.+-+.-.+.......+.....-.-...|..++.-++...+. +...|. ...|+ T Consensus 267 -ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~lv~~~g~~~~~---------------~l~~~~----~g~v~ 326 (555) T protein:vir:17 267 -GEAYGRGRVEEFMGDLKSLEALSQAMVEGSAASAKVVFMVSPSATTKPQ---------------NLALAA----NGAII 326 (555) T ss_pred -CCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccCcc---------------eeecCC----Cceee Confidence 3469999999999999888888887777777777777665332211111 111121 12344 Q ss_pred cC--CceeeeeccccchhHH-HHHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHH-------HHH Q lcl|NC_012530. 318 TA--EDAKFVSMTQAEDMQF-QSWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDA-------SKS 387 (559) Q Consensus 318 ~~--g~~~~~~ls~~~D~qf-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~-------~~~ 387 (559) .+ +++...++..+.|.+. .+..+..+..|-++|-+ ++..+...-|. +-...-.++.... +.. T Consensus 327 ~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm~----~~~~d~~r~TA----tEV~~r~~E~~~~LGpv~~rl~~ 398 (555) T protein:vir:17 327 QGRPDDVSVVQANKAADFRTVLEMIQKLEQRISDAFLM----LQVRQSERTTA----TEVQATVQELNEQIGGIYSNLTT 398 (555) T ss_pred cCCcccceeeeccccchhhHHHHHHHHHHHHHHHHHhh----cCCCCcccchH----HHHHHHHHHHHHHHhHHHHHHHH Confidence 32 3344444445567663 34455566778788743 23222211111 1111111221111 223 Q ss_pred HHhhHHHHHHHHHHH-hhccccccCccceeee----cchhhhhHHHHH-HHHHHHHcC--------CCCH----HHHHHH Q lcl|NC_012530. 388 KGLMPLLDMIAKNLT-NGIIRQILGDNYMLEF----VGGDTRSQQDKL-KSVQLELQT--------ATTV----NDYREK 449 (559) Q Consensus 388 ~~l~P~~~~ie~~ln-~~L~~~~~~~~~~~~f----~~l~~~d~~~~~-~~~~~~~~~--------~~T~----NE~R~~ 449 (559) ..|.|++.+.-..+. ..+|++.....+..++ ..+.+....++. .++....+- .+.. +++-+. T Consensus 399 E~L~Pli~R~~~il~r~g~lP~~p~~~v~~~i~~~l~~l~r~~~~~~l~~~~~~laq~~~~p~~~d~id~d~~~~~~a~~ 478 (555) T protein:vir:17 399 ELLQPYLARKLHLLQKQRKLPQLPKDLVQPTVVAGLWGVGRGQDKQQLMEFITTLAQTMGPEIAMKYINPTEFIKRLAAA 478 (555) T ss_pred HHHHHHHHHHHHHHHhCCCCCCCCHhhhccceeehHHHHHHHHHHHHHHHHHHHHHhhcCchhHhhcCCHHHHHHHHHHH Confidence 444444443333332 2234432222223332 223333222222 222221111 1222 333445 Q ss_pred hCCCCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhcccccccccccccc Q lcl|NC_012530. 450 QGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSG 529 (559) Q Consensus 450 ~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 529 (559) +|.+|.. .+..+--+..+.+..+ ..+..+....+.....+.+......... ..+..+..+.+ T Consensus 479 ~Gv~p~~---ivrs~eev~~~rq~~~----~~~~q~~~~~qa~~~~~~~~~~~~~~~~-~~~~~~a~~~~---------- 540 (555) T protein:vir:17 479 QGIDTLQ---LINSPETMKQLGDQQK----QDMVQASLINQAGQLAKTPMAEQAMQLI-QQQQEGAQDAG---------- 540 (555) T ss_pred cCCChhh---hcCCHHHHHHHHHHHH----HHHHHHHHHHHHHHHHhhhhhhhHHhcc-ccchhhhhHHH---------- Confidence 5665531 1111111111100000 0000000000000000000000000000 00000000000 Q ss_pred ccccccccccccccccchhhhhhccCCCCC Q lcl|NC_012530. 530 KDNQQGVGKDGQLKNKKNTNSYKQGGSSKK 559 (559) Q Consensus 530 ~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 559 (559) .+++.|-+.- T Consensus 541 --------------------~a~~~~~~~~ 550 (555) T protein:vir:17 541 --------------------AAESETSSAE 550 (555) T ss_pred --------------------HHHhhcCCcc Confidence 0011111100 No 250 >protein:vir:97376 Length: 320 # NCBI annotation: putative portal protein # Family: family:all:11744 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762589;genbank:gi:115304290;genbank:GeneID:5130579 Probab=53.32 E-value=0.53 Score=22.09 Aligned_cols=301 Identities=14% Similarity=0.086 Sum_probs=119.7 Q ss_pred hhhccccccccccccccccccccccccccccCCCCCcccHHH-HHHH---HhhCh----HHHHHHHHHHHHHHhhhhHhh Q lcl|NC_012530. 34 KALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITD-VLRQ---YSMNV----VLNAIINTRANQVTEYAHRAS 105 (559) Q Consensus 34 ~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~-~~~~---~~~~~----~v~acv~~ia~~ia~~~~~~~ 105 (559) -.++.-+|.++ .++ .+.+ +.+. -..+| .-+.+-+-.|++||.+.-.+. T Consensus 1 ~~~~~~~~~~~-----------------------~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 56 (320) T protein:vir:97 1 MGIFNFKKRET-----------------------LTP-ELKESIIRQVTIEDESPFTGTTDFNVRNEVAESIATYLGAYK 56 (320) T ss_pred CCccccccccc-----------------------cCh-hHHhhhhheeeeccCCCcccccccchhhHHHHHHHHHhhhhc Confidence 11111111110 000 0000 0000 00011 001122344566665432211 Q ss_pred hhcCCcceeeecccccccChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEE Q lcl|NC_012530. 106 TDDNGMGYQVRLKNGDKPTKEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTR 185 (559) Q Consensus 106 ~~~~g~~~~v~~~d~~~~~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~ 185 (559) .+ .. .+. |+. ++ ..|++.+|.|.|..--.|++.-.. .|.++ T Consensus 57 ~~-~~----------------------~~~--~~~------~~----~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~--- 97 (320) T protein:vir:97 57 TS-AK----------------------RLS--LLT------NN----PSFLRRLVKHALHNKTTYVYKSPT-YGWLI--- 97 (320) T ss_pred cc-cc----------------------eee--eee------CC----HHHHHHHHHHhhcccceEEeeCCc-cceee--- Confidence 11 00 000 111 11 258999999998877777665432 34222 Q ss_pred EecCceEEEEecCcc-cccccceEEEEEecCceeeeecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_012530. 186 MVDPTTIYFANDEHG-HRRTRGKIYRQYIDNKVRGSFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFN 264 (559) Q Consensus 186 ~l~p~~V~~~~~~~g-~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~ 264 (559) -|+-+|.-.+..-. ..++....-++++ ..|..-+|| ....||..|- .+..-++++. ... T Consensus 98 -~~~~~~~~~~~~~~~~~~D~FN~~V~mt-----vpfyD~~IL----------dnpl~gv~tq-e~gkM~g~a~---~~v 157 (320) T protein:vir:97 98 -TDSMTIEGLRARLTFTLPDPFNSAVTMT-----VPFYDVGII----------DSPLVEVDTE-EANKMLEAAY---SAV 157 (320) T ss_pred -ecceeeeeeeeeEEEecCcccceeEEEE-----eeeechhhh----------hhhhcccChH-HhhHHHHHHh---hhh Confidence 13323221111000 0011111122211 111111111 1234566664 2222233322 223 Q ss_pred HHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHH--HHhcCcccccccccccCCceeeeeccccchhHHHHHHHHH Q lcl|NC_012530. 265 DRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWT--ATSSGINGAYRIPMITAEDAKFVSMTQAEDMQFQSWLNYL 342 (559) Q Consensus 265 ~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~--~~~~G~~nag~~~vl~~g~~~~~~ls~~~D~qf~e~~~~~ 342 (559) -+-+.|-+.....++.+-+....+..++...+++++.+ +.++|. +++ +.+-+++.+...---....-.... T Consensus 158 ~kkL~~~~~IKafi~Tdid~GLee~kD~~~~kIk~mq~~A~~~nG~-----T~i--~~~dDI~Qi~pDYS~sn~~D~~l~ 230 (320) T protein:vir:97 158 MKKLHNTGAIKAFISSDIDVGLEKMKEESDSKIKAMLATAELLSGY-----TYI--QRGDDVTQMMPDYTTSNVTDFAAM 230 (320) T ss_pred hhhccccceeEEEEecccchhHHHHHHHHHHHHHHHHHHHHHhcCc-----ccc--cCCcceeeecccccccchhHHHHH Confidence 34456666777788776554445566666667666544 234553 244 223445554210000011122344 Q ss_pred HHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHH---HHHHHhhccccccCccceeeec Q lcl|NC_012530. 343 INIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMI---AKNLTNGIIRQILGDNYMLEFV 419 (559) Q Consensus 343 ~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~i---e~~ln~~L~~~~~~~~~~~~f~ 419 (559) +...+.-|+||-.+|- + +..+....+|+.+.+.|++.++ |-.|..++ ++.+.+.|- T Consensus 231 ~t~alS~y~m~~~IL~----G------------sAte~~~Iaf~~~~V~PLL~Q~~~~Ek~Lvy~m-----~~E~FVs~m 289 (320) T protein:vir:97 231 RTFAASQLSVSDKILD----G------------SATDGEKVAVMFRFVEPILEQFREYEPSLIYAM-----RDEFFVSFM 289 (320) T ss_pred HHHHHhhcCCchhhcc----c------------cCCcceeeehhhHhHHHHHHHhhhcCcceeeee-----ccceeeeee Confidence 5566778999988772 1 1223445678888999999997 54444433 222333332 Q ss_pred --chhhhhHHHHHHHHHHHHcCC---CCHHHHHHHhCCCCCCCCCEeec Q lcl|NC_012530. 420 --GGDTRSQQDKLKSVQLELQTA---TTVNDYREKQGLPKIAGGDIILS 463 (559) Q Consensus 420 --~l~~~d~~~~~~~~~~~~~~~---~T~NE~R~~~gl~pi~gGD~~~~ 463 (559) +....+ ..+.|| ..||| -.|||+--+ T Consensus 290 tTGG~l~S---------~~~~~~~~~~~~~~---------~~~~~~~~~ 320 (320) T protein:vir:97 290 TTGGMLNS---------NRVDGWGKEKAPNE---------SKGGDVGDV 320 (320) T ss_pred ecCceeec---------ccccccccccCCcc---------ccCCcccCC Confidence 111100 112233 12333 135554211 No 251 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=39.85 E-value=1 Score=20.59 Aligned_cols=448 Identities=12% Similarity=0.049 Sum_probs=156.4 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHH--HHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIAN--DTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLR 78 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~ 78 (559) |-. +..+.+|.-+....++.+|...+-. ...+. =-.|.-|.....+... ...... T Consensus 1 ~~~-~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e-----~~~y~lP~~~~~~~~~-------------~~~~~~---- 57 (535) T protein:vir:94 1 MAS-SQKREGFAENGAKAVYDALKNDRNSYETRAEN-----CAKYTIPSLFPKDSDN-------------ASTDYT---- 57 (535) T ss_pred CCc-hhhhhhHHHHHHHHHHHHHHHHhhHHHHHHHH-----HHHHhccccCCCCCCc-------------cccccC---- Confidence 322 2223333333333334443332211 00000 0112333221111000 000000 Q ss_pred HHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeeccccc--ccChhHHHHHHHHHHHHHhcCC--CCCCChhhHHH Q lcl|NC_012530. 79 QYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGD--KPTKEQQKKIDYAERYIERMGV--DYSPIRDDFTS 154 (559) Q Consensus 79 ~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~--~~~~~~~~~~~~~~~~L~~~~p--~~~~~~~~~~~ 154 (559) ... .++--.|++.+|..+.....+ .+-.|++...+.. +.... ......+..||....- .....+.+|+. T Consensus 58 ~~~-dst~~~a~~~Laa~l~~~ltP-----~~~WF~l~~~d~~~~~~~~~-~~~~~~v~~~L~~ve~~~~~~~~~snf~~ 130 (535) T protein:vir:94 58 TPW-QAVGARGLNNLASKLMLALFP-----MQTWMKLTISEFEAKQLVAQ-PAELAKVEEGLSMVERILMNYIESNSYRV 130 (535) T ss_pred Ccc-cccHHHHHHHHHHHHHhhhcC-----CCCccccccChhhhhccccc-hhHHHHHHHHHHHHHHHHHHHHHhcCcHH Confidence 112 233345777777666432211 1234555444322 11110 0111123333322100 00011245666 Q ss_pred HHHHHHHHHHHcCCcceEEEECC-CCcEEEEEEecCceEEEEecCcccccc----------------------------- Q lcl|NC_012530. 155 FLRKLVRDTYTYDQVNYENTYDS-NGRLSHTRMVDPTTIYFANDEHGHRRT----------------------------- 204 (559) Q Consensus 155 f~~~~v~d~ll~Gna~~~i~rd~-~G~~~~L~~l~p~~V~~~~~~~g~~~~----------------------------- 204 (559) -+...+.|+.++||+..++..+. .+.....||| .++.+..|..|.+-. T Consensus 131 ~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl--~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~~~~ 208 (535) T protein:vir:94 131 TLFETLKQLVVAGNALLYIPEPEGTYNPMKLYRL--SSYVVQRDAFGTVLQIVTLDKTAYAALPEDVRNSMDSSQEHKGD 208 (535) T ss_pred HHHHHHHHHHhhCcEeEeeccCcCcccceEEEEc--CeEEEeeCCCCCeEEEEeeeeccHHHhhHHHHHHHHhccccCCC Confidence 66677889999999999987653 2223344554 556666666664310 Q ss_pred cce-EEE--------------EEecCceee------eecccceEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_012530. 205 RGK-IYR--------------QYIDNKVRG------SFTADEMGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELF 263 (559) Q Consensus 205 ~~~-~y~--------------~~~~~~~~~------~~~~~evi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~ 263 (559) ... .|. +..++.... .|..-=.+..+++.. ....||.||.+-+...+.......+. T Consensus 209 ~~v~v~~~v~~~~~~~~~~~~~e~~g~~~~~~~~~~g~~~~P~~~~Rw~~~---~ge~YGrgp~~~~l~D~k~L~~l~~~ 285 (535) T protein:vir:94 209 EMIDVYTHIYLDEESGEYLKYEEIDGVEVEGTDASYPVDACPYIPVRMVRI---DGESYGRSYCEEYLGDLRSLENLQEA 285 (535) T ss_pred ceeEEEEEEEeeCCCCcEEEEEEecCeeeccccccCccccCCceeeeeeec---CCCccccchHHHHHHHHHHHHHHHHH Confidence 000 000 001111100 000001122222222 23469999999988888887777766 Q ss_pred HHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccccchhHH-HHHHHHH Q lcl|NC_012530. 264 NDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQAEDMQF-QSWLNYL 342 (559) Q Consensus 264 ~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~~~D~qf-~e~~~~~ 342 (559) ....-.-...|..++.-++ -..+. ....+..+. -++- ..+++...++....|.+. .+..+.. T Consensus 286 ~l~~~~~a~~~~~lv~p~g-----~~~~~----------~~~~~~~g~-~v~g-~~~~v~~~~~~~~~~~~~~~~~i~~~ 348 (535) T protein:vir:94 286 IVKMSMISAKVIGLVNPAG-----ITQVR----------RLTKAQTGD-FVSG-RPEDISFLQLEKAADFSVARAVSEQI 348 (535) T ss_pred HHHHHHHhccCCccccccc-----ccchh----------hcccCCCce-eecC-CcccceeeecccccchhHHHHHHHHH Confidence 6655555555554442111 11111 111111111 1111 123455666665567654 4556777 Q ss_pred HHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHH-------------hhccccc Q lcl|NC_012530. 343 INIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLT-------------NGIIRQI 409 (559) Q Consensus 343 ~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln-------------~~L~~~~ 409 (559) +..|.++|-+. ++...+...-+..+ ... +..-....|.|.+.+++++|- ..+|++. T Consensus 349 ~~rI~~af~~~--~~~~~d~~rvTAtE----V~~-----r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~il~r~g~lP~~ 417 (535) T protein:vir:94 349 EGRLSYAFMLN--SAVQRTGERVTAEE----IRY-----VASELEDTLGGVYSILSQELQLPMVRVLLKQLQATNQIPEL 417 (535) T ss_pred HHHHHHHHhHh--hhccCCCCCccHHH----HHH-----HHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhCCCCCCC Confidence 88888888432 12111211111111 111 112223345555555555443 2334332 Q ss_pred cCccceeeecchh----hh-hHHHHHHHHHHHHc------C-CCCH----HHHHHHhCCCCCCCCCEeeccceecccccc Q lcl|NC_012530. 410 LGDNYMLEFVGGD----TR-SQQDKLKSVQLELQ------T-ATTV----NDYREKQGLPKIAGGDIILSAVYIQRLGQQ 473 (559) Q Consensus 410 ~~~~~~~~f~~l~----~~-d~~~~~~~~~~~~~------~-~~T~----NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~ 473 (559) ...-+..++...+ +. +......++..... . .+.. +.+-+.+|.|+.. .+..+--++.+ T Consensus 418 p~~~v~~~~vs~la~l~r~~~~~~l~~~~~~laq~~P~~ld~~id~d~~~~~~a~~~Gvp~~~---i~rs~eev~~~--- 491 (535) T protein:vir:94 418 PKEAVEPTISTGMEALGRGQDLDKLERCIAAWSALAPMQGDPDINIATIKLRIANAIGIDTSG---ILKTPEEKQQE--- 491 (535) T ss_pred ChhhccceEeehHHHHHHHHHHHHHHHHHHHHHhhChHHhhhcCCHHHHHHHHHHHhCCChhh---hcCCHHHHHHH--- Confidence 2233455554322 11 11111112211100 0 1112 2223344544210 11000000000 Q ss_pred cccccccccccccccccccccCCCCCCCCCCCCccccccchhccccccccccccc Q lcl|NC_012530. 474 EQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPS 528 (559) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 528 (559) .+..+...+ .. ......+..........+.... ....+-+++ |+ T Consensus 492 ~~q~~~~~~-~~----~~~~~~g~~~~~~~~~~~~~~~-~~~~~~g~~-----~~ 535 (535) T protein:vir:94 492 MAEAAQGTA-MQ----NAAASAGAGAGTMATASPENMK-AAAAQAGMA-----PN 535 (535) T ss_pred HHHHHHHHH-HH----HHHHHHHHhhhcccccChHHHH-HHHHHhccC-----CC Confidence 000000000 00 0000000000000000000000 000111111 10 No 252 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=36.05 E-value=1.2 Score=20.16 Aligned_cols=414 Identities=11% Similarity=0.050 Sum_probs=146.8 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |+= -+..++.++.++..+...+.= -.|.-|.....+ +.. ....+ .+. T Consensus 1 mk~-----------~~~~~~~~lkR~~~e~~w~e~-----a~~tlP~~~~~~---~~~----------~~~~~----~~~ 47 (510) T protein:vir:63 1 MKT-----------TAAMLWEKLRDGSVEQRAIEF-----AKTTLPYLMVDP---MSG----------SRGVV----EHD 47 (510) T ss_pred Chh-----------HHHHHHHHHhccchHHHHHHH-----HHhhccccCCCC---CCc----------ccccc----CCC Confidence 110 011222233222222111100 012333221110 000 00000 011 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeeccccc--ccCh------hHHHHHHHHHHHHHhcCCCCCCChhhH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGD--KPTK------EQQKKIDYAERYIERMGVDYSPIRDDF 152 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~--~~~~------~~~~~~~~~~~~L~~~~p~~~~~~~~~ 152 (559) . .++--.|++.+|..+..... ..+.-.|++...+.. +.+. +.++-...+++.+... ..+.+| T Consensus 48 ~-dstg~~a~~~LAa~l~~~lt----pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~-----l~~snf 117 (510) T protein:vir:63 48 F-QSAGALLVNNLAAKLARSLF----PTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQR-----LFQNAS 117 (510) T ss_pred c-cchHHHHHHHHHHHHHhhhc----CCCCcccccCCChHHhhcccccchhHHHHHHHHHHHHHHHHHH-----HHhcCc Confidence 2 23334677777777653211 112234444443321 1111 1111122222222211 112355 Q ss_pred HHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccc---------------------------- Q lcl|NC_012530. 153 TSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRT---------------------------- 204 (559) Q Consensus 153 ~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~---------------------------- 204 (559) +.-+...+.|+..+||+.+++. .+|.....|||. +..+..|..|.+-. T Consensus 118 ~~~~~~~~~~Li~~G~a~l~~~--~~~~~~~~~pl~--~y~v~~d~~G~vd~i~rr~~~t~~~l~e~~~~~~~~~~~~~~ 193 (510) T protein:vir:63 118 LAVLTQVIKLLIVTGNALLYRD--SDAATVVAWSLR--SYAVRRDATGRWMDIVLKQRYKSKDLDEEYKQDLMRAGRNLS 193 (510) T ss_pred HHHHHHHHHHHHhhCeEEEEEc--CCCcEEEEEEcc--eeEEeeCCCcCeeEEEeeeeccHHHHhHHhhhhhhccccccC Confidence 6666677888999999877754 455555566664 45555555553210 Q ss_pred --cce-EEE----------------EEecCceeeee---cccc--eEEEecccCCCccCCcccccHHHHHHHHHHHHHHH Q lcl|NC_012530. 205 --RGK-IYR----------------QYIDNKVRGSF---TADE--MGMFIRNPRSDILSGGYGLSELEMGLREFISHENT 260 (559) Q Consensus 205 --~~~-~y~----------------~~~~~~~~~~~---~~~e--vi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~ 260 (559) ... .|. +-.++...... ...+ .+..+++.. ....||.||.+.+...+...... T Consensus 194 ~~~~v~v~~~V~~~~~~~~~~~sv~~e~dg~~~~~~~~~~~~e~P~~~~Rw~~~---~ge~YGrgp~~~~l~D~k~L~~l 270 (510) T protein:vir:63 194 GSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGKEGRWPIHLCPYIVPTWNLA---PGEHYGRGHVEDYIGDFAKLSLL 270 (510) T ss_pred CCcceEEEEEEEeecCCCceEEEEEEEecCceeccccccccccCceeeeeeeec---CCCccccchHHHHHHHHHHHHHH Confidence 000 000 00111110000 0011 112222221 23469999999999999888888 Q ss_pred HHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccC--CceeeeeccccchhHH-HH Q lcl|NC_012530. 261 ELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITA--EDAKFVSMTQAEDMQF-QS 337 (559) Q Consensus 261 ~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~--g~~~~~~ls~~~D~qf-~e 337 (559) .+.....-.-...|..++.-++ -..+. ....|..+ .++.+ +++...++....|.+. .+ T Consensus 271 ~~~~l~~a~~a~~~~~lv~p~g-----~~~~~----------~~~~~~~g----~~v~g~~~~v~~~~~~~~~d~~~~~~ 331 (510) T protein:vir:63 271 SEKLGLYELESLEVLNLVDEAK-----GAVVD----------DYQDAEMG----DYVPGGAEAVRAYERGDYNKMAAIQQ 331 (510) T ss_pred HHHHHHHHHHhccCCcccCccc-----ccchh----------hhccCCCc----eeecCCcccceeeecCcccchHHHHH Confidence 7777766666566654442111 11111 11112111 23322 2344444444556654 45 Q ss_pred HHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHH--HHHHHHHHhhHHHHHHHHHHHhhcc--------- Q lcl|NC_012530. 338 WLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNK--IDASKSKGLMPLLDMIAKNLTNGII--------- 406 (559) Q Consensus 338 ~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~--~~~~~~~~l~P~~~~ie~~ln~~L~--------- 406 (559) ..+.....|.++| ++......+ +.- +++|. +..-....|.|.+.++.++|-.-|+ T Consensus 332 ~i~~~~~rI~~af-----~~~l~~~~~----~rv-----TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r 397 (510) T protein:vir:63 332 SLQAVVVRLNQAF-----MYGANQRDA----ERV-----TAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD 397 (510) T ss_pred HHHHHHHHHHHHH-----HhhcccCCC----CCc-----CHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh Confidence 6677778888888 232111111 000 11111 1112233455555555444433332 Q ss_pred ----ccc--cCccceeeecch-hhhhHHHHHHHHHHHHc--C-------CCCH----HHHHHHhCCCCCCCCCEeeccce Q lcl|NC_012530. 407 ----RQI--LGDNYMLEFVGG-DTRSQQDKLKSVQLELQ--T-------ATTV----NDYREKQGLPKIAGGDIILSAVY 466 (559) Q Consensus 407 ----~~~--~~~~~~~~f~~l-~~~d~~~~~~~~~~~~~--~-------~~T~----NE~R~~~gl~pi~gGD~~~~~~~ 466 (559) +.. ......+++... -+.............+. + -+.. +++.+.+|.+|.. .+..+-. T Consensus 398 ~gl~p~p~~~~~~~~v~~is~Laraq~~~~l~~~~q~l~~~~~~aq~~~~id~d~~~~~~a~~~Gv~p~~---ivrs~ee 474 (510) T protein:vir:63 398 ALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQ---FYKSADE 474 (510) T ss_pred ccCCCCCchhcccceecchhHHHHHHHHHHHHHHHHHHHHhcCchhhhccCCHHHHHHHHHHHhCCChhH---hcCCHHH Confidence 111 111222333322 22222222211111111 1 1222 3344455664421 1100000 Q ss_pred eccccccccccccccccc--ccccccccccCCCCCCCCCCCCccccccchhccccccccccccccc Q lcl|NC_012530. 467 IQRLGQQEQIKQNEFQRQ--QTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGK 530 (559) Q Consensus 467 ~~~l~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 530 (559) ++.+.. +..+...+.. .........+. ...+ .|- T Consensus 475 v~a~~~--~~~qq~~~~~~~~~~~~~~a~~~----~~~~------------------------~g~ 510 (510) T protein:vir:63 475 LQAEAE--QQRQQAAQAQAAQETLLEGASDM----TNAL------------------------AGV 510 (510) T ss_pred HHHHHH--HHHHHHHHHHHHHHHHHHHHHhh----cccc------------------------cCC Confidence 010000 0000000000 00000000000 0000 110 No 253 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=35.39 E-value=1.2 Score=20.08 Aligned_cols=427 Identities=12% Similarity=0.023 Sum_probs=154.6 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |.==-.-++.+--..+..++.+|...+-.-...-... -.|.-|.+...+ + ..... +.. T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~---a~~~lP~~~~~~-------~--------~~~~~----~~~ 58 (516) T protein:vir:96 1 MKQSIDLEYGGKRSKIPKLWEKFSNKRSSFLDRAKHY---SKLTLPYLMNDK-------G--------DNETS----QNG 58 (516) T ss_pred CcchhhhhhhhhHHHHHHHHHHHHHHhhHHHHHHHHH---HHhhcccccCCC-------C--------Ccccc----CCc Confidence 2111111223333334444444444321100000000 012223211000 0 00000 011 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeeccccc--ccC------hhHHHHHHHHHHHHHhcCCCCCCChhhH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGD--KPT------KEQQKKIDYAERYIERMGVDYSPIRDDF 152 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~--~~~------~~~~~~~~~~~~~L~~~~p~~~~~~~~~ 152 (559) .+ ++--.|++.+|..+..... ....-.|++...+.. ... .+..+-...+++.+... ..+.+| T Consensus 59 ~d-stg~~a~~~LAa~l~~~lt----pp~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~-----l~~snf 128 (516) T protein:vir:96 59 WQ-GVGAQATNHLANKLAQVLF----PAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKE-----LEQRQF 128 (516) T ss_pred cc-chHHHHHHHHHHHHHhhhc----CCCCcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHH-----HHhcCc Confidence 22 3334577777777643211 112234444433221 000 11112222233333221 112356 Q ss_pred HHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccc---------------------------- Q lcl|NC_012530. 153 TSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRT---------------------------- 204 (559) Q Consensus 153 ~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~---------------------------- 204 (559) +.-+..++.|+.++||+.+++ +..+. ...||| .++.+..|..|.+-. T Consensus 129 ~~~~~~~~~~L~~~G~a~l~~--d~~~~-~~~~pl--~~y~v~~d~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~ 203 (516) T protein:vir:96 129 RPAVVEAFKHLIVAGSCMLYK--PSKGA-ISAIPM--HHYVVNRDTNGDLLDIILLQEKALRTFDPATRAVVEVGLKGKK 203 (516) T ss_pred HHHHHHHHHHHHhHCeEeEEe--cCCCC-EEEEEc--CeEEEeeCCCCCeeeehhhhHhhHHHHHHhhhhhhhhhhhhhh Confidence 666667788899999998776 34443 456666 445556666654310 Q ss_pred ----cce--------------EEEEEecCceeee---ecccce--EEEecccCCCccCCcccccHHHHHHHHHHHHHHHH Q lcl|NC_012530. 205 ----RGK--------------IYRQYIDNKVRGS---FTADEM--GMFIRNPRSDILSGGYGLSELEMGLREFISHENTE 261 (559) Q Consensus 205 ----~~~--------------~y~~~~~~~~~~~---~~~~ev--i~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~ 261 (559) ... .|++..++..... +...+. +..+++.. ....||.||.+-+.-.+....... T Consensus 204 ~~~~~~v~v~~~v~~~~~~~~~~~~~~d~~~~~~es~~~~~e~P~~~~Rw~~~---~ge~YGrgp~~~~L~D~k~L~~l~ 280 (516) T protein:vir:96 204 CKEDDSVKLYTHAKYLGDGFWELKQSADDIPVGKVSKIKSEKLPFIPLTWKRS---YGEDWGRPLAEDYSGDLFVIQFLS 280 (516) T ss_pred cCCCCceEEEEeeeeeCCceeEEEEEeCceeeccccccccccCCeeeeeeeec---CCCCcccchHHHhhHHHHHHHHHH Confidence 000 0011111111100 000111 22222222 234699999999998888888777 Q ss_pred HHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccC--CceeeeeccccchhHH-HHH Q lcl|NC_012530. 262 LFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITA--EDAKFVSMTQAEDMQF-QSW 338 (559) Q Consensus 262 ~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~--g~~~~~~ls~~~D~qf-~e~ 338 (559) +.....-.-...|..++.-++ -.... ....|..+ .++++ +.+...++....|.+. .+. T Consensus 281 ~~~l~~~~~a~~~~~lv~p~g-----~~~~~----------~l~~~~~g----~i~~g~~~~v~~~q~~~~~d~~~~~~~ 341 (516) T protein:vir:96 281 EAVARGAALMADIKYLIRPGA-----QTDVD----------HFVNSGTG----EVVTGVEEDIHIVQLGKYADLTPISAV 341 (516) T ss_pred HHHHHHHHHhcCCccccCccc-----ccchh----------hhccCCCc----eeecCCcccceeeecCcccchhHHHHH Confidence 766666555555554432111 11111 11222211 23332 2344444444556663 456 Q ss_pred HHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhccc--------ccc Q lcl|NC_012530. 339 LNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGIIR--------QIL 410 (559) Q Consensus 339 ~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~~--------~~~ 410 (559) .+.....|.++|-+..... .+...- +..+ +. .+..-....|.|.+.++.++|-.-|+. +.. T Consensus 342 i~~~~~rI~~af~~~~l~~--r~~~rv------TAtE--V~-~r~~E~~~~LGpv~~rl~~Ell~Pli~r~l~~~~p~lp 410 (516) T protein:vir:96 342 LEVYTRRIGVVFMMETMTR--RDAERV------TAVE--IQ-RDALEIEQNMGGVYSLFATTMQSPVAMWGLLEAGESFT 410 (516) T ss_pred HHHHHHHHHHHHhhhhhcc--CCCccc------cHHH--HH-HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhcCCCCc Confidence 6777788888886543221 111111 1111 11 122233456788888877776544432 111 Q ss_pred Cccceeeecc----hhhhhHHHHH-HHHHHHHcCC----------CC----HHHHHHHhCCCCCCCCCEeeccceecccc Q lcl|NC_012530. 411 GDNYMLEFVG----GDTRSQQDKL-KSVQLELQTA----------TT----VNDYREKQGLPKIAGGDIILSAVYIQRLG 471 (559) Q Consensus 411 ~~~~~~~f~~----l~~~d~~~~~-~~~~~~~~~~----------~T----~NE~R~~~gl~pi~gGD~~~~~~~~~~l~ 471 (559) ...+..++.. +.+....+.. .+.. .+... +. .+++.+.+|.|+- .+..+--+..+. T Consensus 411 ~~~v~~~~vs~l~~l~r~~~~~~i~~~~~-~i~~~~~~~p~v~d~id~d~~~~~~a~~~Gvp~~----~irs~eev~~~~ 485 (516) T protein:vir:96 411 SDLVDPVIITGIEALGRMAELDKLANFAQ-YMSLPLQWPEPVLAAVKWPDYMDWVRGQISAELP----FLKSAEEMAQEQ 485 (516) T ss_pred cccccceeechHHHHHHHHHHHHHHHHHH-HHHHHhcCChhHHhcCCHHHHHHHHHHHhCCCcc----ccCCHHHHHHHH Confidence 1123333321 1122111111 1111 11111 11 2233344454431 111000000000 Q ss_pred cccccccccccccccccccccccCCCCCCCCCCCCccccccchhcccccccccccccccccccccccccccc Q lcl|NC_012530. 472 QQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQLK 543 (559) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~k 543 (559) + .. +..+... +.... -|.+....+++.+++. T Consensus 486 ~------~~-~~~q~~~-~~a~~---------------------------------~~~~~~~~~~~~~~~~ 516 (516) T protein:vir:96 486 E------AQ-MQAQQAQ-MLEEG---------------------------------VAKAVPGVIQQELKEA 516 (516) T ss_pred H------HH-HHHHHHH-HHHHH---------------------------------hhhhhhHHhhcccccC Confidence 0 00 0000000 00000 0000000000000000 No 254 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=33.53 E-value=1.3 Score=19.87 Aligned_cols=458 Identities=12% Similarity=0.070 Sum_probs=163.8 Q ss_pred CcchhhhccccccCCcchHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKIANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQY 80 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 80 (559) |.-=+ -+.+..++.+|...+-.-...-.. =-.|.-|..+...- .. ..+. .... ... T Consensus 1 m~~~~-------~~~l~~r~~~l~~~R~~~e~~w~e---~~~~~lP~~~~~~~--~~------~~~~--~~~~----~~~ 56 (559) T protein:vir:95 1 MAETT-------KERLNKQFAQLESERQSFEPHWRE---LSDYINPRGSRFLT--SE------VNRN--DRRN----TRI 56 (559) T ss_pred CChhh-------HHHHHHHHHHHHHHhhHHHHHHHH---HHHHhccccCCcCC--CC------CCcc--cccc----ccc Confidence 32100 001112222222221110000000 00122332221100 00 0000 0000 011 Q ss_pred hhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeecccccccC-hhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHH Q lcl|NC_012530. 81 SMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKPT-KEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKL 159 (559) Q Consensus 81 ~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~~-~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~ 159 (559) . .++-..|++.+|..+..... ..+...|++...+..... ...++-...+++.+.... .+.+|+.-+..+ T Consensus 57 ~-dst~~~a~~~Las~l~~~lt----pp~~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l-----~~snf~~~~~~~ 126 (559) T protein:vir:95 57 I-DSTGTMAARTLASGMMSGIT----SPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMF-----NKSNLYQSLPQL 126 (559) T ss_pred c-cchHHHHHHHHHHHHHHhhc----CCCCcccccccCCccccchHHHHHHHHHHHHHHHHHH-----HhcCcHHHHHHH Confidence 1 23444577777776643211 122334555444432111 122222333333332211 123455556667 Q ss_pred HHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCccccccc------------------------------ce-- Q lcl|NC_012530. 160 VRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTR------------------------------GK-- 207 (559) Q Consensus 160 v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~------------------------------~~-- 207 (559) +.|+.++||+.+++..+.. +.+.+.+++..++.+..|..|.+-.- .. T Consensus 127 ~~~L~~~Gta~l~~~~d~~-~~~r~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~ 205 (559) T protein:vir:95 127 YGSLGTYSTGAMAVLDDDE-DIIRTMPFPIGSYYLANSPRGSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYE 205 (559) T ss_pred HHHHHhhCceeeEeecCCC-ceeEEEEeecCeEEEeeCCCCCeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCC Confidence 8899999999999877653 45677788888888888877743110 00 Q ss_pred EEEEE---ec---Ccee----------e-e-ec--ccc--e-----------EEEecccCCCccCCccccc-HHHHHHHH Q lcl|NC_012530. 208 IYRQY---ID---NKVR----------G-S-FT--ADE--M-----------GMFIRNPRSDILSGGYGLS-ELEMGLRE 253 (559) Q Consensus 208 ~y~~~---~~---~~~~----------~-~-~~--~~e--v-----------i~~~~n~~~~~~~~~~G~S-pl~~~~~~ 253 (559) .++.+ +. .... . . +. .+. + +..+++. .....||.| |...+... T Consensus 206 ~~v~v~~~V~pr~~~~~~~~~~~~~pf~s~~~e~~~~~~~~l~esg~~e~P~~~~Rw~~---~~ge~YGrg~P~~~al~d 282 (559) T protein:vir:95 206 KWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEV---NGEDVYGSSCPGMLALGP 282 (559) T ss_pred CeEEEEEEEeccccccccccccccceEEEEEEEecCCCceeeecCCcccCCccceeeee---cCCccccccchHHHhhHH Confidence 00000 00 0000 0 0 00 000 0 1111111 123469999 89988888 Q ss_pred HHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCc-eeeeeccc-cc Q lcl|NC_012530. 254 FISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAED-AKFVSMTQ-AE 331 (559) Q Consensus 254 i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~-~~~~~ls~-~~ 331 (559) +.......+.......-...|..++. .. +.... .+..-|..+ .+-..++ -.++|+-. .. T Consensus 283 ~k~L~~l~~~~l~~~~~~~~pp~~v~--~~-----~~~~~--------~~l~pgg~~----~~~~~~~~~~i~p~~~~~~ 343 (559) T protein:vir:95 283 VKALQLLQKRKSQLIDKATNPPMVAP--TS-----LKNQR--------ASLLPGDIT----YIDQITGQDGFRPAYLVNP 343 (559) T ss_pred HHHHHHHHHHHHHHHHHHhcCceecc--cc-----ccccc--------eeeecccee----eeCCCCCcccceeeccccc Confidence 87777777777766677777755542 11 11000 011122111 1111111 22444421 23 Q ss_pred hhHHH-HHHHHHHHHHHHHhCCCHHH-hccccccccccccccchhhhhHHHH-------HHHHHHHHhhHHHHHHHHHHH Q lcl|NC_012530. 332 DMQFQ-SWLNYLINIICALVAMDPAE-IGMQNRGGATGNKSNSLNESNNQNK-------IDASKSKGLMPLLDMIAKNLT 402 (559) Q Consensus 332 D~qf~-e~~~~~~~~Ia~~fgVPp~~-lg~~~~~~~~~~~~~~~~~an~~~~-------~~~~~~~~l~P~~~~ie~~ln 402 (559) +.+++ +..+.....|-++|-+.+.+ ++..+...-+..+ .....++. ...+....|.|++.+.=..+. T Consensus 344 ~~~~~~~~i~~~~~rI~~af~~d~~~~l~~r~~~rvTAtE----V~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~ 419 (559) T protein:vir:95 344 STADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEA----VIEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMV 419 (559) T ss_pred chHHHHHHHHHHHHHHHHHhhhhhHHHhhcCCCCCCCHHH----HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 45543 33566788999999887643 2322222111111 11111221 223344455555555433333 Q ss_pred h-hccccc----cCccceeeecchh-hhhHHHHHH-------HHHHHHc---C---CCCHHH----HHHHhCCCCCCCCC Q lcl|NC_012530. 403 N-GIIRQI----LGDNYMLEFVGGD-TRSQQDKLK-------SVQLELQ---T---ATTVND----YREKQGLPKIAGGD 459 (559) Q Consensus 403 ~-~L~~~~----~~~~~~~~f~~l~-~~d~~~~~~-------~~~~~~~---~---~~T~NE----~R~~~gl~pi~gGD 459 (559) + .+|++. .+..+.+++...+ +.......+ .+..... . -+..++ +-+.+|.|+ + T Consensus 420 r~g~lP~~p~~l~~~~i~v~~is~La~aqk~~~~~~i~~~~~~~~~laq~~Pevld~id~d~~~~~~a~~~Gvp~----~ 495 (559) T protein:vir:95 420 RKNMLPPPPDVMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQVKPEALDKLNVDQAIDAFADMSGVSP----T 495 (559) T ss_pred hcCCCCCCcccccCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhhhcCCHHHHHHHHHHHhCCch----h Confidence 3 234432 1234666765443 222111111 1111100 0 123333 344566653 2 Q ss_pred EeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhcccccccccccccccccccccccc Q lcl|NC_012530. 460 IILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKD 539 (559) Q Consensus 460 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 539 (559) .+..+--+..+.+..+..+-..+........ ........+..... + +.-.++.+.- .+.. T Consensus 496 ~irs~~ev~~~rqqr~~~qq~~q~~~~~~~a-a~~~~~~~~~~~~~-~-------~~l~~~~~~~-----------~~~~ 555 (559) T protein:vir:95 496 VIVPQEQVEQARQQRAQQQQQQQMMAMGMAA-AQGVKTLSEAKTSD-P-------SVLSAMANAV-----------SGQG 555 (559) T ss_pred hcCCHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHhhhccccccCCC-h-------hHHHHHHHhh-----------cCcc Confidence 2221111111111000000000000000000 00000000000000 0 0001111000 0111 Q ss_pred cccccc Q lcl|NC_012530. 540 GQLKNK 545 (559) Q Consensus 540 ~~~k~~ 545 (559) ++ .| T Consensus 556 ~~--~~ 559 (559) T protein:vir:95 556 GQ--SQ 559 (559) T ss_pred cc--CC Confidence 11 00 No 255 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=31.78 E-value=1.5 Score=19.66 Aligned_cols=448 Identities=11% Similarity=0.069 Sum_probs=161.8 Q ss_pred CcchhhhccccccCCcchHHHHHHHH---HHHHHHhhhhccccccccccccccccccccccccccccCCCCCcccHHHHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSK---IANDTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVL 77 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~ 77 (559) |.-=+ ...+..++.+|... +...--+-+ .|.-|..++.. ... .+... .-. T Consensus 1 m~~~~-------~~~l~~r~~~l~~~R~~~e~~w~e~~------~~~lP~~~~~~--~~~----------~~~~~--~~~ 53 (556) T protein:vir:73 1 MAETE-------KERLLKQLAQLKNERTSFESHWLDLS------DFINPRGSRFL--TSD----------VNRDD--RRN 53 (556) T ss_pred CChhh-------HHHHHHHHHHHHHHhhHHHHHHHHHH------HHhccccCCcC--CCC----------CCcch--hhc Confidence 21000 00011111222211 111100000 12222221110 000 00000 000 Q ss_pred HHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeeccccccc-ChhHHHHHHHHHHHHHhcCCCCCCChhhHHHHH Q lcl|NC_012530. 78 RQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGDKP-TKEQQKKIDYAERYIERMGVDYSPIRDDFTSFL 156 (559) Q Consensus 78 ~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~~~-~~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~ 156 (559) .... .++-..|++.+|..+..... ..+...|++...++.-. ....++....+++.+.... .+.+|+.-+ T Consensus 54 ~~~~-dst~~~a~~~Las~l~~~lt----pp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~~l-----~~snf~~~~ 123 (556) T protein:vir:73 54 TKIV-DPTGSMAQRILSSGMMSGIT----SPARPWFKLATPDPDMMDYGPVKIWLEVVQRRMNEVF-----NKSNLYQSL 123 (556) T ss_pred Cccc-cchHHHHHHHHHHHHHHhhc----CCCCcccccccCcccccchHHHHHHHHHHHHHHHHHH-----HhcCcHHHH Confidence 0112 33444577777776643211 12233455554443211 1222223333444433221 123556666 Q ss_pred HHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCccccccc------------------------------c Q lcl|NC_012530. 157 RKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRTR------------------------------G 206 (559) Q Consensus 157 ~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~------------------------------~ 206 (559) ..++.|+.++||+.+++..+.. ..+.+.+++..++.+..|..|.+-.- + T Consensus 124 ~~~~~~L~~~G~a~l~~~~~~~-~~~r~~~~~l~~~~~~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~ 202 (556) T protein:vir:73 124 PVMYASLGTFGTGAMAVMEDDQ-DVIRTMPFPIGSYYLANSPRGSVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENG 202 (556) T ss_pred HHHHHHHHhhCceeeeeeecCC-ceEEEEEeecceeEEeeCCCCCeEEEEEEEeccHHHHHHHcCcccCCHHHHHHHhcC Confidence 7778899999999999887754 45667777787877777766643110 0 Q ss_pred e--EEEEE---e---cCcee------------eeec---ccc------------eEEEecccCCCccCCccccc-HHHHH Q lcl|NC_012530. 207 K--IYRQY---I---DNKVR------------GSFT---ADE------------MGMFIRNPRSDILSGGYGLS-ELEMG 250 (559) Q Consensus 207 ~--~y~~~---~---~~~~~------------~~~~---~~e------------vi~~~~n~~~~~~~~~~G~S-pl~~~ 250 (559) . .++.+ + ..... ..+. ..+ .+..+++. .....||.| |.+-+ T Consensus 203 ~~~~~~~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~~~~~vl~esg~~e~P~~~~Rw~~---~~ge~YGrg~P~~~~ 279 (556) T protein:vir:73 203 TYETWVEVNHCITPNVNRDSGKMDSKNKPYRSVYFESGGDSDKLLRESGFDEFPILAPRWEV---NGEDVYASSCPGMLA 279 (556) T ss_pred CccceEEEEEEEeccccccccccCcccceEEEEEEEecCCCceecccCCcccCCceeeeeee---cCCcccccCccHHHh Confidence 0 00000 0 00000 0000 000 11222222 223469999 89988 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCCceeeeeccc- Q lcl|NC_012530. 251 LREFISHENTELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAEDAKFVSMTQ- 329 (559) Q Consensus 251 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g~~~~~~ls~- 329 (559) ...+.......+.......-...|..++. .. +... .+ +...|.-+.+. + ..+.-.++|+-. T Consensus 280 lgD~k~L~~l~~~~l~~~~~~~~pp~~v~--~~-----~~~~---~~-----~~~pgg~~~~~--~-~~~~~~i~p~~~~ 341 (556) T protein:vir:73 280 LGQVKALQVEQKRKAQLIDKATNPPMVAP--TS-----LKNQ---RV-----SLLPGDVTYLD--V-ISGQDGFKPAYLV 341 (556) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceecc--cc-----cccc---ce-----eeccCcccccc--C-CCCccceeeeccc Confidence 88887777777776666666667765542 11 1110 00 12222211111 1 122123455432 Q ss_pred cchhH-HHHHHHHHHHHHHHHhCCCHH-HhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHH----- Q lcl|NC_012530. 330 AEDMQ-FQSWLNYLINIICALVAMDPA-EIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLT----- 402 (559) Q Consensus 330 ~~D~q-f~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln----- 402 (559) ..|.+ ..+..+.....|-++|-++.. +++..+...-|..+ .....++ ....|.|.+.+++++|- T Consensus 342 ~~d~~~~~~~i~~~~~rI~~af~~d~~~~l~~~~~~r~TAtE----v~~r~~E-----~~~~LG~v~~rl~~E~l~Pli~ 412 (556) T protein:vir:73 342 NPNTADLLADIQDTRQTINSAYFVDLFMMLQNINTRSMPVEA----VIEMKEE-----KLLMLGPVLERLNDEALNPLID 412 (556) T ss_pred cccHHHHHHHHHHHHHHHHHHhhcchhhhhccCCCCCccHHH----HHHHHHH-----HHHHhhHHHHHHHHHHHHHHHH Confidence 23444 345567788999999988753 23433322111111 1111111 22334555555444432 Q ss_pred -------h-hccccc----cCccceeeecchhhhhH-HHHHHH----HHHH--HcC-------CCCHHH----HHHHhCC Q lcl|NC_012530. 403 -------N-GIIRQI----LGDNYMLEFVGGDTRSQ-QDKLKS----VQLE--LQT-------ATTVND----YREKQGL 452 (559) Q Consensus 403 -------~-~L~~~~----~~~~~~~~f~~l~~~d~-~~~~~~----~~~~--~~~-------~~T~NE----~R~~~gl 452 (559) + .+|++. .+..+.+++...+.... ...... +... +.. -+..++ +-+.+|. T Consensus 413 r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aqk~~~~~~i~~~~~~~~~laq~~Pe~~d~id~d~~~~~~a~~~Gv 492 (556) T protein:vir:73 413 RVFSIMARKNMLPEPPDVLQGMPLRIEYISVMAQAQKSIGLTSLSQTVGFIGQLAQFKPEALDKLDVDQAIDAFSEMSGV 492 (556) T ss_pred HHHHHHHhcCCCCCCchhhcCceeEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCC Confidence 2 233332 13346666655432211 111111 1100 111 122333 3445566 Q ss_pred CCCCCCCEeeccceecccccccccccccccccccccccccccCCCCCCCCCCCCccccccchhccccccccccccccccc Q lcl|NC_012530. 453 PKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDN 532 (559) Q Consensus 453 ~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 532 (559) |+ ..+..+-.++.+.+.....+-..+.... ............+. .+.++.. .+ .+.. ..|. T Consensus 493 p~----~~irs~eev~~~rq~r~~~qq~~~~~~~-~~~a~~~~~~~~~~-~~~~~~~-----l~--~~~~----~~g~-- 553 (556) T protein:vir:73 493 SP----TVIVPQEQVQGIREERAKQAQAAQAMAM-GQAAAQGAKTLSET-QTSDPSA-----LT--AIAN----AAGA-- 553 (556) T ss_pred Ch----hhcCCHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhhc-cCCCHHH-----HH--HHHH----hhcC-- Confidence 53 1222111111110000000000000000 00000000000000 0000000 00 0000 0110 Q ss_pred cccccc Q lcl|NC_012530. 533 QQGVGK 538 (559) Q Consensus 533 ~~~~~~ 538 (559) +.| T Consensus 554 ---~~~ 556 (556) T protein:vir:73 554 ---PQQ 556 (556) T ss_pred ---CCC Confidence 111 No 256 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=28.14 E-value=1.8 Score=19.22 Aligned_cols=425 Identities=13% Similarity=0.031 Sum_probs=154.3 Q ss_pred CcchhhhccccccCCcchHHHHHHHHH--HH----HHHhhhhccccccccccccccccccccccccccccCCCCCcccHH Q lcl|NC_012530. 1 MGIFDRFRTKFYTDDPNAFFKHIDSKI--AN----DTASKALNGVDRAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRIT 74 (559) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~----~~~~~~~~gr~~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~ 74 (559) |.=- -.|.++--..+..++.+|...+ .+ ++.. |.-|.+.. . ..... .. T Consensus 1 ~~~~-~~~~~~~~~~l~~r~~~Lk~~R~~~e~~w~e~~~---------~tlP~~~~----~----------~~~~~-~~- 54 (515) T protein:vir:70 1 MQDT-ILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAK---------LTLPYLMN----N----------KGDNE-TS- 54 (515) T ss_pred Ccch-hhhhcCCHHHHHHHHHHHHHhhhHHHHHHHHHHH---------HhcccccC----C----------CCCcc-cc- Confidence 1100 0011222222334444442221 11 1211 23332110 0 00000 00 Q ss_pred HHHHHHhhChHHHHHHHHHHHHHHhhhhHhhhhcCCcceeeeccccc--ccChhHHHHHHHHHHHHHhcCCC--CCCChh Q lcl|NC_012530. 75 DVLRQYSMNVVLNAIINTRANQVTEYAHRASTDDNGMGYQVRLKNGD--KPTKEQQKKIDYAERYIERMGVD--YSPIRD 150 (559) Q Consensus 75 ~~~~~~~~~~~v~acv~~ia~~ia~~~~~~~~~~~g~~~~v~~~d~~--~~~~~~~~~~~~~~~~L~~~~p~--~~~~~~ 150 (559) +... .++--.|++.+|..+..... ..+.-.|++...+.. ..+.. ......+..||....-. ....+. T Consensus 55 ---~~~~-dstg~~a~~~LAa~l~~~lt----pp~~~WF~l~~~d~~~~~l~~~-~~~~~~v~~~l~~ve~~~~~~l~~s 125 (515) T protein:vir:70 55 ---QNGW-QGVGAQATNHLANKLAQVLF----PAQRSFFRVDLTAKGEKVLDDR-GLKKTQLATIFARVETTAMKALEQR 125 (515) T ss_pred ---cccc-cchHHHHHHHHHHHHHHhhc----CCCCcccccccChhhhhccccc-hhHHHHHHHHHHHHHHHHHHHHHhc Confidence 0112 23344577777777653211 112234454433321 11110 01111222222221100 001123 Q ss_pred hHHHHHHHHHHHHHHcCCcceEEEECCCCcEEEEEEecCceEEEEecCcccccc-------------------------- Q lcl|NC_012530. 151 DFTSFLRKLVRDTYTYDQVNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRT-------------------------- 204 (559) Q Consensus 151 ~~~~f~~~~v~d~ll~Gna~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~-------------------------- 204 (559) +|+.-+...+.|+.++||+..++- ..+. ...||| .++.+..|..|.+-. T Consensus 126 nf~~~~~~~~~~L~~~G~a~l~~d--~~~~-~~~~pl--~~y~v~~d~~G~v~~i~rr~~~t~~~l~~~f~~~~~~~~~~ 200 (515) T protein:vir:70 126 QFRPAIVEVFKHLIVAGNCLLYKP--SKGA-MSAVPM--HHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKG 200 (515) T ss_pred CchHHHHHHHHHHHhHCeEEEEEe--CCCC-eEEEEc--CeEEEeeCCCcCeeEEEeeeeccHHHHHHhhhhhhhhhhhh Confidence 566666777888999999988763 3332 456666 345555555553310 Q ss_pred ------cceE--------------EEEEecCceeee---ecccc--eEEEecccCCCccCCcccccHHHHHHHHHHHHHH Q lcl|NC_012530. 205 ------RGKI--------------YRQYIDNKVRGS---FTADE--MGMFIRNPRSDILSGGYGLSELEMGLREFISHEN 259 (559) Q Consensus 205 ------~~~~--------------y~~~~~~~~~~~---~~~~e--vi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~ 259 (559) .... |++-.++..... ++..+ .+..+++.. ....||.||.+-+.-.+..... T Consensus 201 ~~~~~~~~v~i~~~v~~~~~~~~~~~~e~d~~~~~~es~y~~~e~P~~~~Rw~~~---~ge~YGrgp~~~~l~D~k~L~~ 277 (515) T protein:vir:70 201 KKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKESRIKSEKLPFIPLTWKRS---YGEDWGRPLAEDYSGDLFVIQF 277 (515) T ss_pred hhcCCCCceEEEEEEEecCCCceEEEEecCceeeccccccccccCCceeeeeeec---CCCCcccchHHHhhHHHHHHHH Confidence 0000 011111110000 00001 111222221 2246999999999999988888 Q ss_pred HHHHHHHHHHhcCCCceEEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccC--CceeeeeccccchhHH-H Q lcl|NC_012530. 260 TELFNDRFFTHGGTTKGILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITA--EDAKFVSMTQAEDMQF-Q 336 (559) Q Consensus 260 ~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~--g~~~~~~ls~~~D~qf-~ 336 (559) ..+.....-.-...|..++.-++ -.... ....|..+ .++.+ +.+...++....|.+. . T Consensus 278 l~~~~l~~~~~a~~p~~lv~~~g-----~~~~~----------~l~~~~~g----~iv~g~~~~v~~~~~~~~~d~~~~~ 338 (515) T protein:vir:70 278 LSEAMARGAALMADIKYLIRPGS-----QTDVD----------HFVNSGTG----EVITGVAEDIHIVQLGKYADLTPIS 338 (515) T ss_pred HHHHHHHHHHHhcCCCeeeCccc-----ccchh----------hccccCCc----eeecCCcccceeeecCcccchhHHH Confidence 88877777666677665552211 11111 11122211 23322 3344444444567664 4 Q ss_pred HHHHHHHHHHHHHhCCCHHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhc--------ccc Q lcl|NC_012530. 337 SWLNYLINIICALVAMDPAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGI--------IRQ 408 (559) Q Consensus 337 e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L--------~~~ 408 (559) +..+.....|.++|-+........++-|. .+ +. .+..-....|.|.+.++.++|-.-| +++ T Consensus 339 ~~i~~~~~rI~~af~~~~l~~rd~~rvTA--------tE--V~-~r~~E~~~~LGpv~srL~~Ell~Pli~r~~~~~~p~ 407 (515) T protein:vir:70 339 AVLEVYTRRIGVIFMMETMTRRDAERVTA--------VE--IQ-RDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDS 407 (515) T ss_pred HHHHHHHHHHHHHHhhhhhhccCCccccH--------HH--HH-HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHhhCCC Confidence 55677788898999776533332222111 11 11 1222334467888888777775555 322 Q ss_pred ccCccceeeec----chhhhhHHHHHHHHHHHHcC----------CCCH----HHHHHHhCCCCCCCCCEeeccceeccc Q lcl|NC_012530. 409 ILGDNYMLEFV----GGDTRSQQDKLKSVQLELQT----------ATTV----NDYREKQGLPKIAGGDIILSAVYIQRL 470 (559) Q Consensus 409 ~~~~~~~~~f~----~l~~~d~~~~~~~~~~~~~~----------~~T~----NE~R~~~gl~pi~gGD~~~~~~~~~~l 470 (559) .-..-+...+. .+.+....+........+.. .+.. +++....|.|+- ++.+ - T Consensus 408 ~P~~~v~~~~vs~l~~L~r~q~~~~i~~~~q~i~~~~~~~p~~~~~id~d~~~~~~a~~~g~p~~-----~~rs-----~ 477 (515) T protein:vir:70 408 FTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELP-----FLKS-----E 477 (515) T ss_pred CChhhcccceehhHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhhCCHHHHHHHHHHHhCCCcc-----ccCC-----H Confidence 11111222221 12222111111111111111 1121 222222232210 0000 0 Q ss_pred ccccccccccccccccccccccccCCCCCCCCCCCCccccccchhccccccccccccccccccccccccccccc Q lcl|NC_012530. 471 GQQEQIKQNEFQRQQTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGYTGKDAKPSGKDNQQGVGKDGQLKN 544 (559) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~k~ 544 (559) ..+.+ .-.+..+. ........ ..|+.....+++...+ + T Consensus 478 eev~~---~r~q~~~~---~~~~~~~~-----------------------------~~~~a~~~~~~~~~~~-~ 515 (515) T protein:vir:70 478 EEMQQ---EMAQQAQA---QQEAMLNE-----------------------------GVAKAVPGVIQQEMKE-G 515 (515) T ss_pred HHHHH---HHHHHHHH---HHHHHHHH-----------------------------hhhhhcccchhhhhcc-C Confidence 00000 00000000 00000000 0000000000000000 0 No 257 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=21.73 E-value=2.6 Score=18.35 Aligned_cols=415 Identities=9% Similarity=0.018 Sum_probs=146.1 Q ss_pred HHHHHHHHHHHHHhhhhccc--c-ccccccccccccccccccccccccCCCCCcccHHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_012530. 20 FKHIDSKIANDTASKALNGV--D-RAYTEPVDGNLMFSTLEDTSIVPKPSPIAFGRITDVLRQYSMNVVLNAIINTRANQ 96 (559) Q Consensus 20 ~~~~~~~~~~~~~~~~~~gr--~-~a~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~v~acv~~ia~~ 96 (559) .+...+++.++-....-..+ + -.|+-|.....+ +. ..+..+ .+..+ ++--.|++.+|.. T Consensus 1 mk~~~~~~~~~lkr~~~e~~w~e~a~~tlP~~~~~~---~~----------~~~~~~----~~~~d-stg~~a~~~LAa~ 62 (510) T protein:vir:78 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDP---MS----------GSRGVV----EHDFQ-SAGALLVNNLAAK 62 (510) T ss_pred ChhHHHHHHHHHhccchHHHHHHHHHhhccccccCC---CC----------cccccc----cCccc-chHHHHHHHHHHH Confidence 23333333322211000000 0 012333221110 00 000000 01122 3334577878777 Q ss_pred HHhhhhHhhhhcCCcceeeeccccc--ccC------hhHHHHHHHHHHHHHhcCCCCCCChhhHHHHHHHHHHHHHHcCC Q lcl|NC_012530. 97 VTEYAHRASTDDNGMGYQVRLKNGD--KPT------KEQQKKIDYAERYIERMGVDYSPIRDDFTSFLRKLVRDTYTYDQ 168 (559) Q Consensus 97 ia~~~~~~~~~~~g~~~~v~~~d~~--~~~------~~~~~~~~~~~~~L~~~~p~~~~~~~~~~~f~~~~v~d~ll~Gn 168 (559) +..... ..+.-.|++...+.. +.. .+.++-...+++.+... ..+.+|+.-+..++.|+.++|| T Consensus 63 l~~~lt----pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~-----l~~snf~~~~~~~~~~L~~~G~ 133 (510) T protein:vir:78 63 LARSLF----PTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQR-----LFQNASLAVLTQVIKLLIVTGN 133 (510) T ss_pred HHHhhc----CCCCcccccCCChHHhhhcccCcchHHHHHHHHHHHHHHHHHH-----HHhcCcHHHHHHHHHHHHhhCe Confidence 653211 112224444433321 110 01111122222222211 1123556666677888999999 Q ss_pred cceEEEECCCCcEEEEEEecCceEEEEecCcccccc------------------------------cce----------- Q lcl|NC_012530. 169 VNYENTYDSNGRLSHTRMVDPTTIYFANDEHGHRRT------------------------------RGK----------- 207 (559) Q Consensus 169 a~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~------------------------------~~~----------- 207 (559) +.+++.. .+.....||| .+..+..|..|.+-. ... T Consensus 134 a~l~~~~--~~~~~~~~pl--~~y~v~~d~~G~vd~i~rr~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~ 209 (510) T protein:vir:78 134 ALLYRNS--DEATVVAWSL--RSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKG 209 (510) T ss_pred EEEEEeC--CCCeEEEEEc--ceeEEeeCCCcCeeEEEeeeeccHHHHHHHhhHHhhhhhhccCCCceEEEEEEEEeecC Confidence 9876653 4445666776 345555565554310 000 Q ss_pred ----EE--EEEecCceeee---ecccc--eEEEecccCCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCce Q lcl|NC_012530. 208 ----IY--RQYIDNKVRGS---FTADE--MGMFIRNPRSDILSGGYGLSELEMGLREFISHENTELFNDRFFTHGGTTKG 276 (559) Q Consensus 208 ----~y--~~~~~~~~~~~---~~~~e--vi~~~~n~~~~~~~~~~G~Spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~g 276 (559) .| ++-.++..... +..++ .+..+++.. ....||.||.+-+...+.......+.....-.-...|.. T Consensus 210 ~~~~~~sv~~e~dg~~i~~~~~~~~~e~P~~~~Rw~~~---~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~ 286 (510) T protein:vir:78 210 TAMDYAEMYHEIDGVRVGETGRWPIHLCPYIVPTWNLA---PGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLN 286 (510) T ss_pred CCCcEEEEEEEecCeeeccccccccccCCeeeeeeeec---CCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCc Confidence 00 00011111100 00011 122222222 234699999999999888888877776666555555554 Q ss_pred EEEecCccCCccCCHHHHHHHHHHHHHHhcCcccccccccccCC--ceeeeeccccchhHH-HHHHHHHHHHHHHHhCCC Q lcl|NC_012530. 277 ILLVKPSPSVTNTSMRALEDFKRHWTATSSGINGAYRIPMITAE--DAKFVSMTQAEDMQF-QSWLNYLINIICALVAMD 353 (559) Q Consensus 277 il~~~~~~~~~~~~~e~~~~l~~~~~~~~~G~~nag~~~vl~~g--~~~~~~ls~~~D~qf-~e~~~~~~~~Ia~~fgVP 353 (559) ++.-+ + -..+. ....|..+ .++++. ++...++....|.+. .+..+.....|.++| T Consensus 287 lv~p~-g----~~~~~----------~l~~~~~g----~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF--- 344 (510) T protein:vir:78 287 LVDEA-K----GAVVD----------DYQDAEMG----DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAF--- 344 (510) T ss_pred ccCCc-c----ccchh----------hhccCCCc----eeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHH--- Confidence 43211 1 11111 11111111 233332 233333334456654 455677777888887 Q ss_pred HHHhccccccccccccccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHhhcc-------------ccc--cCccceeee Q lcl|NC_012530. 354 PAEIGMQNRGGATGNKSNSLNESNNQNKIDASKSKGLMPLLDMIAKNLTNGII-------------RQI--LGDNYMLEF 418 (559) Q Consensus 354 p~~lg~~~~~~~~~~~~~~~~~an~~~~~~~~~~~~l~P~~~~ie~~ln~~L~-------------~~~--~~~~~~~~f 418 (559) ++......+ +.-+..+-. + +..-....|.|.+.++.++|-.-|+ +.. ......+++ T Consensus 345 --~~~l~~~~~----~rvTAtEV~--~-r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~gl~p~p~~~~~~~~v~~ 415 (510) T protein:vir:78 345 --MYGANQRDA----ERVTAEEVR--I-TAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETG 415 (510) T ss_pred --hhccccCCC----CCcCHHHHH--H-HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccceeeec Confidence 232111111 000111111 1 1112233455555554444433222 111 112233344 Q ss_pred c-chhhhhHHHHHHHHHHHHcCC---------CCH----HHHHHHhCCCCCCCCCEeeccceeccccccccccccccccc Q lcl|NC_012530. 419 V-GGDTRSQQDKLKSVQLELQTA---------TTV----NDYREKQGLPKIAGGDIILSAVYIQRLGQQEQIKQNEFQRQ 484 (559) Q Consensus 419 ~-~l~~~d~~~~~~~~~~~~~~~---------~T~----NE~R~~~gl~pi~gGD~~~~~~~~~~l~~~~~~~~~~~~~~ 484 (559) . .+.+.............+... +.. +++.+.+|.+|.. .+..+-.++.+....+......+.. T Consensus 416 is~Laraq~~~~l~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gv~p~~---ivrs~eev~a~~~~~~~q~~~~~~~ 492 (510) T protein:vir:78 416 LPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQ---FYKSADELQAEAEEQRRQAAQAQAA 492 (510) T ss_pred ccHHHHHHHHHHHHHHHHHHHHhcChhhhhhcCCHHHHHHHHHHHhCCChhh---hcCCHHHHHHHHHHHHHHHHHHHHH Confidence 3 233332222222211111111 222 2334455654421 1100000000000000000000000 Q ss_pred ccccccccccCCCCCCCCCCCCccccccchhccccc Q lcl|NC_012530. 485 QTRLTQLESALQNPSGTPPTLPPSSSNSFQQNQEGY 520 (559) Q Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (559) +.+.. ...+. -.+.. -++ T Consensus 493 ~~a~~---~~~~~-~~~~~--------------~g~ 510 (510) T protein:vir:78 493 QETLL---EGASD-MTNAL--------------AGV 510 (510) T ss_pred HHHHH---Hhhhh-hcccC--------------CCC Confidence 00000 00000 00000 001 Done!